Track D: Development & Engineering

Progress: 0/62 tasks complete (0%)

This track covers all implementation work for the CODITECT AI Agent Security Layer, derived directly from SDD-CODITECT-SEC-001 and TDD v1.0.0. Tasks follow the five-phase development plan defined in SDD Section 12.1.

Phase structure:

D.1–D.3: Phase 1 — Core Enforcement (6 weeks)
D.4: Phase 2 — Output Scanning and Redaction (3 weeks)
D.5: Phase 3 — Human Confirmation Flow (2 weeks)
D.6: Phase 4 — Dashboard and Alerting (4 weeks)
D.7: Phase 5 — Tenant Configuration and Operations (3 weeks)

Status Summary

Section	Total	Status
D.1 SecurityGateHook	11	Pending
D.2 PatternEngine	13	Pending
D.3 RiskAnalyzer, ActionRouter & AuditLogger	11	Pending
D.4 Output Scanner (PostToolUse)	8	Pending
D.5 Human Confirmation Flow	5	Pending
D.6 Monitor Dashboard	9	Pending
D.7 Tenant Configuration & Operations	5	Pending

D.1 SecurityGateHook

SDD Reference: Section 3.1 | Phase: 1 | Estimated Effort: Medium (~500 LOC Python)

The entry point for all security inspection. Receives hook events from CODITECT's hook dispatch system, assembles inspection payloads, coordinates the scan pipeline, and returns enforcement decisions. Registers as three CODITECT hooks: PreAgentStart, PreToolUse, PostToolUse.

D.2 PatternEngine

SDD Reference: Sections 3.2, 6.2, Appendix A | Phase: 1 | Estimated Effort: High (~800 LOC Python + ~2,000 lines YAML)

Evaluates tool call payloads against the unified pattern library. The rule authoring effort (YAML files) is the largest single effort in the entire project.

D.3 RiskAnalyzer, ActionRouter & AuditLogger

SDD Reference: Sections 3.3, 3.4, 3.6, 6.1 | Phase: 1 | Estimated Effort: Low-Medium (~600 LOC Python)

The scoring, decision, and persistence components. These are designed to be straightforward implementations of well-specified algorithms.

D.4 Output Scanner (PostToolUse)

SDD Reference: Sections 2.2, 4.2, 3.1 (on_tool_result) | Phase: 2 | Estimated Effort: Medium (~400 LOC Python)

PostToolUse scanning for secrets and PII in tool outputs before they are passed back to the agent context window. Builds on the PatternEngine and ActionRouter from Phase 1.

D.4.1 Implement OutputScanner class — wraps SecurityGateHook.on_tool_result() with output-phase scan logic; serializes tool_output dict to scannable raw text
D.4.2 Configure secret detection rules to apply to tool_output phase — verify apply_to: ["tool_input", "tool_output"] is set on all 13 SD-* rules and all PII-* rules
D.4.3 Implement output redaction in ActionRouter — for REDACT decisions on output scan, replace matched secrets with [REDACTED:<rule_id>] tokens; return sanitized output as ToolResultDecision.redacted_output
D.4.4 Implement PatternEngine.scan(phase="output") path — output scans apply only categories with apply_to containing "tool_output"; destructive-commands.yaml rules do not apply to output phase
D.4.5 Implement PostToolUse hook registration — PostToolUse hook with priority: 10, timeout_ms: 300; lower priority than PreToolUse because output scan is non-blocking for most cases
D.4.6 Implement correlation between PreToolUse and PostToolUse events — AuditEvent for output scan includes pre_event_id foreign key linking to the corresponding PreToolUse audit event
D.4.7 Implement TOOL_REDACTED audit event — emitted when output scanner redacts secrets; redacted_fields list contains field names only (never values); synchronous write to org.db
D.4.8 Validate redaction does not mutate agent context — confirm redacted output returned to CODITECT dispatch layer replaces original output entirely; no partial redaction that leaves discoverable fragments

D.5 Human Confirmation Flow

SDD Reference: Section 3.4 (CONFIRM action), 4.1 (control flow) | Phase: 3 | Estimated Effort: Medium (~400 LOC Python + UI integration)

The CONFIRM action pauses MEDIUM-severity tool calls pending human approval via the CODITECT UI. Timeout escalates to BLOCK.

D.5.1 Implement CONFIRM suspension mechanism in CODITECT dispatch layer — register pending confirmation request; return CONFIRM decision to dispatch layer; dispatch suspends tool call execution
D.5.2 Implement CONFIRM response handler — listen for human approval or rejection via CODITECT UI confirmation dialog; resume or block tool call based on response
D.5.3 Implement CONFIRM timeout escalation — if no human response within confirm_timeout_seconds (default: 30), emit TOOL_CONFIRM_TIMEOUT audit event and escalate decision to BLOCK
D.5.4 Implement CONFIRM rate limiter — maximum 3 CONFIRM requests per session per minute; excess CONFIRM decisions automatically escalate to BLOCK to prevent false-positive DoS
D.5.5 Integrate CONFIRM dialog with CODITECT UI — surface confirmation dialog showing tool name, risk score, primary threat category, and reasoning string; provide Approve / Block buttons with 30-second countdown timer display

D.6 Monitor Dashboard

SDD Reference: Section 3.5, 5.2, 9.4 | Phase: 4 | Estimated Effort: High (~1,200 LOC Python + TypeScript)

Real-time visibility into security events across active agent sessions. FastAPI backend with WebSocket streaming; React TypeScript frontend with four dashboard widgets.

D.6.1 Implement DashboardServer (FastAPI) — six REST routes: GET /api/v1/security/sessions, GET /api/v1/security/events, POST /api/v1/security/gateway/{tenant_id}/kill, GET /api/v1/security/export, GET /api/v1/security/stats, GET /api/v1/security/alerts; use existing CODITECT JWT authentication
D.6.2 Implement EventStreamBus — in-memory pub/sub for real-time event distribution from AuditLogger to WebSocket clients; overflow drops dashboard events without affecting enforcement; batch polling fallback at 30-second intervals when bus degrades
D.6.3 Implement WebSocket endpoint (/ws/v1/security/stream) — token auth via query parameter; broadcasts SecurityEvent messages to all connected clients within 200ms of security event; handles 100 concurrent connections without degradation
D.6.4 Implement kill switch endpoint (POST /api/v1/security/gateway/{tenant_id}/kill) — admin + MFA-gated (X-MFA-Token header required); terminates all active agent sessions for tenant within 5 seconds; writes KILL_SWITCH_ACTIVATED event to kill_switch_events table (5-year retention); emits KILL_SWITCH_ACTIVATED to MonitorDashboard
D.6.5 Implement AlertDispatcher — webhook delivery to Discord, Slack, PagerDuty targets; POST with retry up to 3 times with exponential backoff (base: 2s); per SDD alert payload schema including dashboard_url
D.6.6 Implement export endpoint (GET /api/v1/security/export) — NDJSON, CSV, JSON formats; date range filtering; generates up to 30 days of audit data; streams response to avoid memory pressure on large exports
D.6.7 Implement React dashboard frontend — four widgets: Live Feed (WebSocket real-time events), Session Map (active sessions with risk level), Pattern Hit Rates (last 1 hour by rule_id), System Health (PatternEngine/RiskAnalyzer/AuditLogger status + scan p99 vs 500ms limit)
D.6.8 Implement useSecurityStream React hook — manages WebSocket lifecycle, auto-reconnects with exponential backoff on disconnect, buffers events during reconnection, exposes typed SecurityEvent stream
D.6.9 Implement nightly_pattern_effectiveness_job.py — SQL aggregation of match_count, block_count, false_positive_count per rule_id per tenant_id per day; populates pattern_effectiveness table; runs via systemd timer or cron

D.7 Tenant Configuration & Operations

SDD Reference: Sections 7.1, 10.1, 14.1-14.4 | Phase: 5 | Estimated Effort: Medium (~400 LOC Python + UI)

Multi-tenant rule layering, admin configuration UI, and operational hardening for production readiness.

D.7.1 Implement three-layer rule resolution at scan time — Layer 0 (platform base, always included, process-level cache), Layer 1 (platform recommended, tenant-overridable, 60s TTL per tenant_id), Layer 2 (tenant custom rules from tenant_security_configs.rule_overrides, 30s TTL per tenant_id); Layer 0 wins over all; Layer 1 wins over Layer 2 for CRITICAL severity
D.7.2 Implement tenant config management API — GET/PUT /api/v1/security/tenant/{tenant_id}/config; stores TenantSecurityConfig in tenant_security_configs table; validates that CRITICAL action overrides are rejected; logs all config changes as TENANT_OVERRIDE events
D.7.3 Implement admin UI for tenant security configuration — view and edit action_overrides, allowlisted_tools, rule_overrides, alert_webhooks, confirm_timeout; requires admin role; displays current config version and last-updated-by
D.7.4 Run load test — 50 concurrent scan requests; validate p99 scan latency under 500ms; validate kill switch terminates all sessions within 5 seconds under 100 concurrent sessions; validate 100 WebSocket connections without degradation
D.7.5 Write operational runbook — three sections: SecurityGateCircuitOpen response (check PatternEngine/RiskAnalyzer, inspect logs, verify rule YAML syntax, circuit auto-recovery); SecurityGateAuditDbDown response (check disk space, WAL lock, notify tenants, do not disable audit); SecurityGateHighBlockRate response (review recent blocks, differentiate false positives from genuine threats, open PR to adjust rules if false positive)

D.8 GitHub Actions CI Security Gate

Task	Done	Total	Status
D.8 GitHub Actions CI Security Gate	0	6	Pending

SDD Reference: ADR-001 (native architecture), ADR-002 (YAML patterns) | Phase: 1.5 | Estimated Effort: Small (~300 LOC Python + YAML)

GitHub Actions workflow that scans push/PR diffs against the YAML pattern library to prevent secrets, destructive commands, and prompt injection from entering the repository. Reuses PatternEngine and aggregate_score from the core security modules.

D.8.1 Implement PatternEngine.load_patterns() — YAML loading from config/security-patterns/ directory; compile regex patterns; return count loaded
D.8.2 Create scripts/ci-security-scan.py — standalone scanner that loads patterns, scans git diff, outputs GitHub Actions annotations, returns exit code 1 on critical/high findings
D.8.3 Create .github/workflows/security-scan.yml — GitHub Actions workflow triggered on push and pull_request; runs ci-security-scan.py against changed files
D.8.4 Write unit tests for PatternEngine.load_patterns() and CI scanner
D.8.5 Create scripts/ci-security-scan-README.md — usage documentation for the scanner and GitHub Action
D.8.6 Validate end-to-end: push test commit with known secret pattern, verify action blocks

Status Summary​

D.1 SecurityGateHook​

D.2 PatternEngine​

D.3 RiskAnalyzer, ActionRouter & AuditLogger​

D.4 Output Scanner (PostToolUse)​

D.5 Human Confirmation Flow​

D.6 Monitor Dashboard​

D.7 Tenant Configuration & Operations​

D.8 GitHub Actions CI Security Gate​