Skip to main content

Skill Health Tracker - Reference Documentation

Version: 1.0.0 Status: Active Created: 2026-01-11 Owner: CODITECT Framework Team


Overview

The Skill Health Framework provides continuous monitoring of component health scores, trend analysis, regression detection, and optimization recommendations. It tracks skill performance over time to ensure consistent quality across the CODITECT framework.

Components

1. Skill Health Tracker (H.P.004-SCRIPTS/skill-health-tracker.py)

Purpose: Core health tracking with rolling averages and trend analysis.

Features:

  • Historical score storage per skill per session
  • Rolling averages (7-day, 30-day)
  • Trend detection (improving/stable/degrading)
  • Baseline comparison
  • Dashboard visualization

Usage:

# Show dashboard
python3 H.P.004-SCRIPTS/skill-health-tracker.py --dashboard

# View specific skill
python3 H.P.004-SCRIPTS/skill-health-tracker.py --skill <name>

# Record current scores
python3 H.P.004-SCRIPTS/skill-health-tracker.py --record

# Create baseline
python3 H.P.004-SCRIPTS/skill-health-tracker.py --baseline

# Compare to baseline
python3 H.P.004-SCRIPTS/skill-health-tracker.py --compare

# Export data
python3 H.P.004-SCRIPTS/skill-health-tracker.py --export --json

2. Regression Detector (H.P.005-HOOKS/skill-regression-detector.py)

Purpose: Detect skill health regressions after retrospectives.

Features:

  • Compares current scores to baseline
  • Flags H.P.003-SKILLS with >10% score drop
  • Generates regression reports
  • Auto-creates improvement tasks (optional)

Usage:

# Check for regressions
python3 H.P.005-HOOKS/skill-regression-detector.py --check

# Generate improvement tasks
python3 H.P.005-HOOKS/skill-regression-detector.py --auto-task

# Custom threshold
python3 H.P.005-HOOKS/skill-regression-detector.py --threshold 15

# Save report
python3 H.P.005-HOOKS/skill-regression-detector.py --save-report

3. /skill-health Command (H.P.002-COMMANDS/skill-health.md)

Purpose: User-facing command for health dashboard.

Usage:

/skill-health                      # Dashboard view
/skill-health --skill <name> # Specific skill
/skill-health --compare # Baseline comparison
/skill-health --record # Record current scores

4. /which Health Integration (H.P.002-COMMANDS/which.md)

Purpose: Show health scores in agent recommendations.

Features:

  • Health score displayed in output: [XX%] ↑/↓/→
  • Health warning for <50% H.P.001-AGENTS
  • Health-aware routing (health weighted 40% in ranking)

Data Storage

skill-health.json

Location: context-storage/skill-health.json

Schema:

{
"version": "1.0.0",
"last_updated": "2026-01-11T12:00:00Z",
"H.P.003-SKILLS": {
"skill-name": {
"scores": [
{
"date": "2026-01-11T12:00:00Z",
"score": 85,
"invocations": 42
}
],
"rolling_7d": 82.5,
"rolling_30d": 78.3,
"trend": "improving",
"total_invocations": 156,
"first_tracked": "2026-01-01T00:00:00Z",
"last_updated": "2026-01-11T12:00:00Z"
}
}
}

Baseline Snapshots

Location: context-storage/baselines/skill-baseline-YYYY-MM-DD.json

Schema:

{
"version": "1.0.0",
"created": "2026-01-11T12:00:00Z",
"name": "2026-01-11",
"H.P.003-SKILLS": {
"skill-name": {
"score": 85,
"invocations": 42,
"date": "2026-01-11T12:00:00Z",
"trend": "stable"
}
}
}

Regression Reports

Location: context-storage/regressions/regression-report-YYYYMMDD_HHMMSS.json

Schema:

{
"status": "complete",
"baseline_name": "2026-01-01",
"threshold": 10,
"checked_at": "2026-01-11T12:00:00Z",
"regressions": [
{
"skill": "problem-skill",
"baseline_score": 85,
"current_score": 70,
"delta": -15,
"severity": "warning"
}
],
"warnings": [],
"improvements": [],
"summary": {
"total_checked": 150,
"regression_count": 3,
"warning_count": 5,
"improvement_count": 12
}
}

Health Scoring

Score Calculation

Health scores are calculated from skill invocation history:

health_score = (
success_rate * 0.5 + # 50% weight: successful invocations / total
trend_score * 0.3 + # 30% weight: recent score trajectory
(1 - error_penalty) * 0.2 # 20% weight: deduction for errors
)

Status Levels

ScoreStatusIconAction
90-100Excellent🟢Maintain
70-89Good🟡Monitor
50-69Needs Work🟠Improve
0-49Critical🔴Urgent Fix

Trend Detection

TrendCriteriaIcon
ImprovingScore rising >5%
StableScore within ±5%
DegradingScore falling >5%

Workflow Integration

Session Retrospective Flow

Session End


┌─────────────────────┐
│ Run Retrospective │
│ (session- │
│ retrospective.py) │
└─────────────────────┘


┌─────────────────────┐
│ Record Health │
│ (skill-health- │
│ tracker.py) │
└─────────────────────┘


┌─────────────────────┐
│ Check Regressions │
│ (skill-regression- │
│ detector.py) │
└─────────────────────┘


┌─────────────────────┐
│ Alert if │
│ Regressions Found │
└─────────────────────┘
ActionFrequencyCommand
View dashboardDaily/skill-health
Record scoresAfter retrospective--record
Check regressionsWeekly--compare
Create baselineBefore major changes--baseline
Full exportMonthly--export

Hook Configuration

Add to ~/.claude/settings.json:

{
"H.P.005-HOOKS": {
"PostToolUse:retrospective": [
{
"command": "python3 ~/.coditect/H.P.005-HOOKS/skill-regression-detector.py --check --save-report",
"timeout": 30000
}
]
}
}

ComponentTypePurpose
skill-pattern-analyzer.pyScriptAnalyze patterns, generate recommendations
session-retrospective.pyHookEnd-of-session analysis
skill-learnings.jsonDataSource of skill invocation history
/optimize-H.P.003-SKILLSCommandAlias for /skill-health
/whichCommandAgent recommendations with health

Troubleshooting

No Health Data Available

# Record initial scores
python3 H.P.004-SCRIPTS/skill-health-tracker.py --record

# Verify data file exists
ls -la context-storage/skill-health.json

No Baseline Found

# Create baseline
python3 H.P.004-SCRIPTS/skill-health-tracker.py --baseline

# Verify baseline exists
ls -la context-storage/baselines/

Scores Not Updating

# Check skill-learnings.json has data
cat context-storage/skill-learnings.json | python3 -m json.tool | head -50

# Run retrospective to populate learnings
python3 H.P.005-HOOKS/session-retrospective.py --analyze

ADR References

  • F.5.6: Skill Health Framework Enhancement (PILOT Plan)
  • F.5.5: Component Success Rate Improvement (completed prerequisite)
  • ADR-061: Skill Improvement Loop (related)

Last Updated: 2026-01-11 Maintained By: CODITECT Framework Team