Scaling Policies
Implement auto-scaling policies with horizontal and vertical scaling, load-based triggers, and cost optimization.
Complexity: Moderate | Duration: 15-30m | Category: Devops
Tags: infrastructure scaling autoscaling optimization
Workflow Diagram
Steps
Step 1: Baseline metrics
Agent: devops
engineer - Establish normal CPU, memory, request load
Step 2: Scaling strategy
Agent: cloud
architect - Choose horizontal (add instances) vs. vertical (bigger instances)
Step 4: Horizontal scaling
Agent: devops
engineer - Configure ASG/GCE MIG with min/max instances
Step 5: Vertical scaling
Agent: cloud
architect - Configure VPA for Kubernetes
Step 6: Cooldown periods
Agent: devops
engineer - Set cooldown to prevent flapping
Step 7: Load testing
Agent: testing
specialist - Test scaling behavior under load
Step 8: Cost optimization
Agent: cloud
architect - Right-size instances, use spot instances
Usage
To execute this workflow:
/workflow devops/scaling-policies.workflow
Related Workflows
See other workflows in this category for related automation patterns.