Skill Health Tracker - Reference Documentation

Version: 1.0.0 Status: Active Created: 2026-01-11 Owner: CODITECT Framework Team

Overview

The Skill Health Framework provides continuous monitoring of component health scores, trend analysis, regression detection, and optimization recommendations. It tracks skill performance over time to ensure consistent quality across the CODITECT framework.

Components

1. Skill Health Tracker (`scripts/skill-health-tracker.py`)

Purpose: Core health tracking with rolling averages and trend analysis.

Features:

Historical score storage per skill per session
Rolling averages (7-day, 30-day)
Trend detection (improving/stable/degrading)
Baseline comparison
Dashboard visualization

Usage:

# Show dashboard
python3 scripts/skill-health-tracker.py --dashboard

# View specific skill
python3 scripts/skill-health-tracker.py --skill <name>

# Record current scores
python3 scripts/skill-health-tracker.py --record

# Create baseline
python3 scripts/skill-health-tracker.py --baseline

# Compare to baseline
python3 scripts/skill-health-tracker.py --compare

# Export data
python3 scripts/skill-health-tracker.py --export --json

2. Regression Detector (`hooks/skill-regression-detector.py`)

Purpose: Detect skill health regressions after retrospectives.

Features:

Compares current scores to baseline
Flags skills with >10% score drop
Generates regression reports
Auto-creates improvement tasks (optional)

Usage:

# Check for regressions
python3 hooks/skill-regression-detector.py --check

# Generate improvement tasks
python3 hooks/skill-regression-detector.py --auto-task

# Custom threshold
python3 hooks/skill-regression-detector.py --threshold 15

# Save report
python3 hooks/skill-regression-detector.py --save-report

3. /skill-health Command (`commands/skill-health.md`)

Purpose: User-facing command for health dashboard.

Usage:

/skill-health                      # Dashboard view
/skill-health --skill <name>       # Specific skill
/skill-health --compare            # Baseline comparison
/skill-health --record             # Record current scores

4. /which Health Integration (`commands/which.md`)

Purpose: Show health scores in agent recommendations.

Features:

Health score displayed in output: [XX%] ↑/↓/→
Health warning for <50% agents
Health-aware routing (health weighted 40% in ranking)

Data Storage

skill-health.json

Location: context-storage/skill-health.json

Schema:

{
  "version": "1.0.0",
  "last_updated": "2026-01-11T12:00:00Z",
  "skills": {
    "skill-name": {
      "scores": [
        {
          "date": "2026-01-11T12:00:00Z",
          "score": 85,
          "invocations": 42
        }
      ],
      "rolling_7d": 82.5,
      "rolling_30d": 78.3,
      "trend": "improving",
      "total_invocations": 156,
      "first_tracked": "2026-01-01T00:00:00Z",
      "last_updated": "2026-01-11T12:00:00Z"
    }
  }
}

Baseline Snapshots

Location: context-storage/baselines/skill-baseline-YYYY-MM-DD.json

Schema:

{
  "version": "1.0.0",
  "created": "2026-01-11T12:00:00Z",
  "name": "2026-01-11",
  "skills": {
    "skill-name": {
      "score": 85,
      "invocations": 42,
      "date": "2026-01-11T12:00:00Z",
      "trend": "stable"
    }
  }
}

Regression Reports

Location: context-storage/regressions/regression-report-YYYYMMDD_HHMMSS.json

Schema:

{
  "status": "complete",
  "baseline_name": "2026-01-01",
  "threshold": 10,
  "checked_at": "2026-01-11T12:00:00Z",
  "regressions": [
    {
      "skill": "problem-skill",
      "baseline_score": 85,
      "current_score": 70,
      "delta": -15,
      "severity": "warning"
    }
  ],
  "warnings": [],
  "improvements": [],
  "summary": {
    "total_checked": 150,
    "regression_count": 3,
    "warning_count": 5,
    "improvement_count": 12
  }
}

Health Scoring

Score Calculation

Health scores are calculated from skill invocation history:

health_score = (
    success_rate * 0.5 +      # 50% weight: successful invocations / total
    trend_score * 0.3 +       # 30% weight: recent score trajectory
    (1 - error_penalty) * 0.2 # 20% weight: deduction for errors
)

Status Levels

Score	Status	Icon	Action
90-100	Excellent	🟢	Maintain
70-89	Good	🟡	Monitor
50-69	Needs Work	🟠	Improve
0-49	Critical	🔴	Urgent Fix

Trend Detection

Trend	Criteria	Icon
Improving	Score rising >5%	↑
Stable	Score within ±5%	→
Degrading	Score falling >5%	↓

Workflow Integration

Session Retrospective Flow

Session End
    │
    ▼
┌─────────────────────┐
│ Run Retrospective   │
│ (session-           │
│  retrospective.py)  │
└─────────────────────┘
    │
    ▼
┌─────────────────────┐
│ Record Health       │
│ (skill-health-      │
│  tracker.py)        │
└─────────────────────┘
    │
    ▼
┌─────────────────────┐
│ Check Regressions   │
│ (skill-regression-  │
│  detector.py)       │
└─────────────────────┘
    │
    ▼
┌─────────────────────┐
│ Alert if            │
│ Regressions Found   │
└─────────────────────┘

Recommended Schedule

Action	Frequency	Command
View dashboard	Daily	`/skill-health`
Record scores	After retrospective	`--record`
Check regressions	Weekly	`--compare`
Create baseline	Before major changes	`--baseline`
Full export	Monthly	`--export`

Hook Configuration

Add to ~/.claude/settings.json:

{
  "hooks": {
    "PostToolUse:retrospective": [
      {
        "command": "python3 ~/.coditect/hooks/skill-regression-detector.py --check --save-report",
        "timeout": 30000
      }
    ]
  }
}

Component	Type	Purpose
`skill-pattern-analyzer.py`	Script	Analyze patterns, generate recommendations
`session-retrospective.py`	Hook	End-of-session analysis
`skill-learnings.json`	Data	Source of skill invocation history
`/optimize-skills`	Command	Alias for /skill-health
`/which`	Command	Agent recommendations with health

Troubleshooting

No Health Data Available

# Record initial scores
python3 scripts/skill-health-tracker.py --record

# Verify data file exists
ls -la context-storage/skill-health.json

No Baseline Found

# Create baseline
python3 scripts/skill-health-tracker.py --baseline

# Verify baseline exists
ls -la context-storage/baselines/

Scores Not Updating

# Check skill-learnings.json has data
cat context-storage/skill-learnings.json | python3 -m json.tool | head -50

# Run retrospective to populate learnings
python3 hooks/session-retrospective.py --analyze

ADR References

F.5.6: Skill Health Framework Enhancement (PILOT Plan)
F.5.5: Component Success Rate Improvement (completed prerequisite)
ADR-061: Skill Improvement Loop (related)

Last Updated: 2026-01-11 Maintained By: CODITECT Framework Team

Overview​

Components​

1. Skill Health Tracker (scripts/skill-health-tracker.py)​

2. Regression Detector (hooks/skill-regression-detector.py)​

3. /skill-health Command (commands/skill-health.md)​

4. /which Health Integration (commands/which.md)​

Data Storage​

skill-health.json​

Baseline Snapshots​

Regression Reports​

Health Scoring​

Score Calculation​

Status Levels​

Trend Detection​

Workflow Integration​

Session Retrospective Flow​

Recommended Schedule​

Hook Configuration​

Related Components​

Troubleshooting​

No Health Data Available​

No Baseline Found​

Scores Not Updating​

ADR References​

Overview

Components

1. Skill Health Tracker (`scripts/skill-health-tracker.py`)

2. Regression Detector (`hooks/skill-regression-detector.py`)

3. /skill-health Command (`commands/skill-health.md`)

4. /which Health Integration (`commands/which.md`)

Data Storage

skill-health.json

Baseline Snapshots

Regression Reports

Health Scoring

Score Calculation

Status Levels

Trend Detection

Workflow Integration

Session Retrospective Flow

Recommended Schedule

Hook Configuration

Related Components

Troubleshooting

No Health Data Available

No Baseline Found

Scores Not Updating

ADR References