Skip to main content

Scaling Policies

Implement auto-scaling policies with horizontal and vertical scaling, load-based triggers, and cost optimization.

Complexity: Moderate | Duration: 15-30m | Category: Devops

Tags: infrastructure scaling autoscaling optimization

Workflow Diagram

Steps

Step 1: Baseline metrics

Agent: devops

engineer - Establish normal CPU, memory, request load

Step 2: Scaling strategy

Agent: cloud

architect - Choose horizontal (add instances) vs. vertical (bigger instances)

Step 4: Horizontal scaling

Agent: devops

engineer - Configure ASG/GCE MIG with min/max instances

Step 5: Vertical scaling

Agent: cloud

architect - Configure VPA for Kubernetes

Step 6: Cooldown periods

Agent: devops

engineer - Set cooldown to prevent flapping

Step 7: Load testing

Agent: testing

specialist - Test scaling behavior under load

Step 8: Cost optimization

Agent: cloud

architect - Right-size instances, use spot instances

Usage

To execute this workflow:

/workflow devops/scaling-policies.workflow

See other workflows in this category for related automation patterns.