Model Evaluation
Comprehensive model evaluation using cross-validation, multiple metrics, confusion matrices, ROC/PR curves, and performance comparison against baselines.
Complexity: Moderate | Duration: 15-30m | Category: Devops
Tags: ml evaluation validation metrics
Workflow Diagram
Steps
Step 1: Cross
Agent: validation
ml-engineer - K-fold CV with stratification
Step 2: Metrics computation
Agent: ml
engineer - Accuracy, precision, recall, F1, AUC-ROC, AUC-PR
Step 3: Confusion matrix
Agent: ml
engineer - Analyze TP, FP, TN, FN distributions
Step 4: ROC/PR curves
Agent: ml
engineer - Plot and analyze curve shapes
Step 5: Baseline comparison
Agent: ml
engineer - Compare against dummy classifier/regressor
Step 6: Error analysis
Agent: testing
specialist - Identify patterns in misclassifications
Step 7: Performance report
Agent: testing
specialist - Generate comprehensive evaluation report
Step 8: Acceptance review
Agent: testing
specialist - Validate against business requirements
Usage
To execute this workflow:
/workflow devops/model-evaluation.workflow
Related Workflows
See other workflows in this category for related automation patterns.