Monitoring Automation Setup

Implement automated monitoring with health checks, metric collection, alerting rules, and incident automation.

Complexity: Moderate | Duration: 15-30 min | Category: DevOps

Tags: automation, monitoring, observability, alerting

Steps

Step 1: Health check implementation

Agent: devops-engineer - Implement /health endpoints
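
A /health endpoint can be sketched with nothing but the Python standard library. This is a minimal illustration, not the workflow's actual implementation: `check_dependencies`, `HealthHandler`, and `start_health_server` are hypothetical names, and the probe results are hard-coded placeholders.

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


def check_dependencies():
    """Hypothetical probes; replace with real database/cache checks."""
    return {"database": True, "cache": True}


class HealthHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path != "/health":
            self.send_error(404)
            return
        checks = check_dependencies()
        healthy = all(checks.values())
        body = json.dumps(
            {"status": "ok" if healthy else "degraded", "checks": checks}
        ).encode("utf-8")
        # 200 when every probe passes, 503 otherwise so load balancers can react.
        self.send_response(200 if healthy else 503)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        # Silence per-request logging; health checks are polled frequently.
        pass


def start_health_server(host="127.0.0.1", port=0):
    """Start the server in a daemon thread; port 0 lets the OS pick a free port."""
    server = ThreadingHTTPServer((host, port), HealthHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

Calling `start_health_server(port=8080)` would expose the endpoint on an assumed port. Returning 503 on a failed probe lets an external checker tell "process is up" apart from "service is usable".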

Step 2: Metric collection

Agent: devops-engineer - Instrument code with Prometheus or StatsD
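
For the StatsD option, instrumentation comes down to sending short text lines over UDP. The sketch below hand-rolls a tiny client to show the wire format (`<name>:<value>|<type>`); `format_statsd` and `StatsdClient` are hypothetical names, and the host and port are assumptions (8125 is the conventional StatsD port). A real project would more likely use an existing client library.

```python
import socket


def format_statsd(name, value, metric_type, sample_rate=1.0):
    """Build one StatsD line: <name>:<value>|<type>[|@<sample_rate>]."""
    line = f"{name}:{value}|{metric_type}"
    if sample_rate < 1.0:
        line += f"|@{sample_rate}"
    return line


class StatsdClient:
    """Fire-and-forget UDP client; host and port are assumptions."""

    def __init__(self, host="127.0.0.1", port=8125):
        self._addr = (host, port)
        self._sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)

    def incr(self, name, count=1):
        self._send(format_statsd(name, count, "c"))  # counter

    def timing(self, name, millis):
        self._send(format_statsd(name, millis, "ms"))  # timer, milliseconds

    def gauge(self, name, value):
        self._send(format_statsd(name, value, "g"))  # gauge

    def _send(self, line):
        try:
            self._sock.sendto(line.encode("ascii"), self._addr)
        except OSError:
            # Metrics must never break the request path.
            pass
```

A handler might call `client.incr("api.requests")` on entry and `client.timing("api.latency", elapsed_ms)` on exit. The metric names here are illustrative only.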

Step 3: Log aggregation

Agent: devops-engineer - Configure ELK, Loki, or CloudWatch Logs
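
Whichever backend is chosen, aggregation tends to be easier when the application emits one JSON object per log line. The following is a rough sketch using the standard `logging` module; `JsonFormatter`, `configure_json_logging`, and the field names are illustrative assumptions, not a schema required by ELK, Loki, or CloudWatch Logs.

```python
import json
import logging


class JsonFormatter(logging.Formatter):
    """Render each log record as a single-line JSON object."""

    def format(self, record):
        entry = {
            "timestamp": self.formatTime(record, "%Y-%m-%dT%H:%M:%S%z"),
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        if record.exc_info:
            # Keep the traceback inside the same JSON line.
            entry["exception"] = self.formatException(record.exc_info)
        return json.dumps(entry)


def configure_json_logging(level=logging.INFO):
    """Replace the root logger's handlers with one JSON stream handler."""
    handler = logging.StreamHandler()
    handler.setFormatter(JsonFormatter())
    root = logging.getLogger()
    root.handlers = [handler]
    root.setLevel(level)
    return root
```

After `configure_json_logging()`, a call such as `logging.getLogger("checkout").error("payment failed for order %s", order_id)` would emit a line that a log shipper can parse without custom patterns.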

Step 4: Alerting rules

Agent: automation-specialist - Define alert conditions (error rate, latency, uptime)
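
The three alert conditions named above can be expressed as simple threshold checks. This sketch is tool-agnostic and purely illustrative: `Snapshot`, `Thresholds`, `evaluate_alerts`, and every threshold value are assumptions to be tuned per service, not defaults from any monitoring product.

```python
from dataclasses import dataclass


@dataclass
class Snapshot:
    total_requests: int
    failed_requests: int
    p95_latency_ms: float
    uptime_ratio: float  # fraction of successful health checks, 0.0 to 1.0


@dataclass
class Thresholds:
    max_error_rate: float = 0.05        # assumed: alert above 5% errors
    max_p95_latency_ms: float = 500.0   # assumed: alert above 500 ms
    min_uptime_ratio: float = 0.999     # assumed: alert below 99.9% uptime


def evaluate_alerts(snapshot, thresholds=None):
    """Return the names of all alert conditions the snapshot violates."""
    thresholds = thresholds or Thresholds()
    alerts = []
    # Guard against division by zero when there was no traffic.
    if snapshot.total_requests:
        error_rate = snapshot.failed_requests / snapshot.total_requests
    else:
        error_rate = 0.0
    if error_rate > thresholds.max_error_rate:
        alerts.append("high_error_rate")
    if snapshot.p95_latency_ms > thresholds.max_p95_latency_ms:
        alerts.append("high_latency")
    if snapshot.uptime_ratio < thresholds.min_uptime_ratio:
        alerts.append("low_uptime")
    return alerts
```

In practice these conditions would usually live in the monitoring system's own rule language; the function only makes the logic explicit.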

Step 5: Alert routing

Agent: automation-specialist - Configure PagerDuty, Opsgenie, or email
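
Routing usually maps an alert's severity to one or more notification channels. The sketch below shows that mapping only; the severity levels, the `ROUTES` table, and the `senders` callables are hypothetical, and no real PagerDuty or Opsgenie API is called.

```python
# Assumed severity-to-channel mapping; adjust to the team's on-call policy.
ROUTES = {
    "critical": ["pagerduty"],
    "warning": ["opsgenie"],
    "info": ["email"],
}
DEFAULT_ROUTE = ["email"]


def route_alert(alert, routes=ROUTES):
    """Return the channel names for an alert based on its severity."""
    return list(routes.get(alert.get("severity"), DEFAULT_ROUTE))


def dispatch(alert, senders, routes=ROUTES):
    """Send the alert through each routed channel that has a sender.

    `senders` maps channel name -> callable(alert); the callables are
    placeholders for real integrations.
    """
    delivered = []
    for channel in route_alert(alert, routes):
        send = senders.get(channel)
        if send is None:
            continue  # channel not configured; skip rather than fail
        send(alert)
        delivered.append(channel)
    return delivered
```

Falling back to email for unknown severities is one possible design choice: a mislabeled alert still reaches someone instead of being dropped.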

Step 6: Dashboard creation

Agent: devops-engineer - Build Grafana/Datadog dashboards

Step 7: Incident automation

Agent: automation-specialist - Configure auto-restart, auto-scaling, and runbook automation
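
Auto-restart is safest when it is bounded and escalates to a human once it stops helping. The following is a minimal sketch of that idea; `remediate`, its parameters, and the injected `is_healthy` and `restart` callables are hypothetical stand-ins for a real health probe and a real restart command.

```python
import time


def remediate(is_healthy, restart, max_restarts=3, wait_seconds=5.0,
              sleep=time.sleep):
    """Restart an unhealthy service a bounded number of times.

    Returns "healthy" if no action was needed, "recovered" if a restart
    fixed it, or "escalate" if every attempt failed.
    """
    if is_healthy():
        return "healthy"
    for attempt in range(1, max_restarts + 1):
        restart()
        # Linear backoff: wait longer after each failed attempt.
        sleep(wait_seconds * attempt)
        if is_healthy():
            return "recovered"
    return "escalate"
```

Passing `sleep` as a parameter keeps the function testable without real delays. An "escalate" result would typically be handed to the alert routing step rather than retried indefinitely.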

Step 8: Testing

Agent: devops-engineer - Test alerts by simulating failures
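
One way to test alerts without breaking a real service is to feed a scripted sequence of probe results into the alert logic and check when it fires. The sketch below assumes a "fire after N consecutive failures" rule; `FailureDetector`, `simulate`, and the threshold of 3 are illustrative choices, not part of the workflow.

```python
class FailureDetector:
    """Fires once `threshold` consecutive probes have failed."""

    def __init__(self, threshold=3):
        self.threshold = threshold
        self.consecutive_failures = 0
        self.firing = False

    def observe(self, healthy):
        """Record one probe result and return whether the alert is firing."""
        if healthy:
            # Any success resets the streak and resolves the alert.
            self.consecutive_failures = 0
            self.firing = False
        else:
            self.consecutive_failures += 1
            if self.consecutive_failures >= self.threshold:
                self.firing = True
        return self.firing


def simulate(detector, outcomes):
    """Replay scripted probe outcomes; return the firing state after each."""
    return [detector.observe(outcome) for outcome in outcomes]
```

Replaying an outage (`[True, False, False, False, True]`) and a single blip (`[False, True, False, False]`) covers both the "must fire" and "must not fire" cases.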

Usage

To execute this workflow:

/workflow devops/monitoring-automation-setup.workflow

See other workflows in this category for related automation patterns.