Scaling Azure App Service | MSDN Documentation

Azure App Service provides robust options for scaling your web applications to meet demand. Scaling allows you to increase or decrease the resources allocated to your app, ensuring it remains available and performant under varying loads.

Understanding Scaling in App Service

There are two primary types of scaling in Azure App Service:

Vertical Scaling (Scale Up): This involves increasing the compute resources (CPU, RAM, Disk Space) of your existing instances. You achieve this by moving to a higher pricing tier of App Service Plan.
Horizontal Scaling (Scale Out): This involves increasing the number of instances running your application. This distributes the load across multiple machines, improving throughput and availability.

Manual Scaling

Manual scaling allows you to set a specific number of instances for your application. This is useful for predictable workloads or when you need precise control over resource allocation.

Steps to Manually Scale Out:

Navigate to your App Service in the Azure portal.
In the left-hand menu, under "Settings", select "Scale out (App Service plan)".
Choose "Manual scale".
Set the "Instance count" to the desired number of instances.
Click "Save".

Automatic Scaling

Automatic scaling, also known as auto-scaling, adjusts the number of instances based on predefined rules and metrics. This is ideal for dynamic workloads where traffic patterns fluctuate.

Key Features of Auto-scaling:

Metrics-based scaling: Rules can be configured based on metrics like CPU percentage, Memory percentage, HTTP queue length, and data in/out.
Schedule-based scaling: You can define schedules to scale up or down at specific times or days of the week (e.g., scale up during business hours).
Minimum and Maximum Instances: Define the boundaries for your auto-scaling to prevent over-provisioning or under-provisioning.

Configuring Auto-scaling Rules:

Navigate to your App Service in the Azure portal.
In the left-hand menu, under "Settings", select "Scale out (App Service plan)".
Choose "Custom auto scale".
Set the "Default" instance count, and define the "Minimum" and "Maximum" instance counts.
Click "+ Add a scale rule" to define your scaling conditions.
Configure the metric, threshold, and the operation (e.g., increase count by 1).
Optionally, add a "Scale out action" and "Scale in action" with cooldown periods to prevent rapid scaling fluctuations.
Click "Save".

Tip: When configuring auto-scaling, carefully monitor your application's performance and adjust your rules accordingly. A common setup is to scale out when CPU utilization exceeds 70% and scale in when it drops below 30%.

Scaling Considerations

Instance Size: Ensure your chosen App Service Plan tier provides sufficient resources for individual instances.
Application Architecture: Design your application to be stateless to effectively leverage horizontal scaling.
Cool-down Periods: Configure appropriate cool-down periods for scaling actions to avoid rapid scaling cycles that can be inefficient or impact performance.
Monitoring: Continuously monitor your application's performance and scaling metrics to optimize your configuration.

Example Auto-scaling Rule (Scale Out):


    - Trigger: CPU Percentage
      Statistic: Average
      Operator: GreaterThan
      Threshold: 70
      ScaleAction:
        Direction: Increase
        Type: Count
        Value: 1
        Cooldown: PT5M

Example Auto-scaling Rule (Scale In):


    - Trigger: CPU Percentage
      Statistic: Average
      Operator: LessThan
      Threshold: 30
      ScaleAction:
        Direction: Decrease
        Type: Count
        Value: 1
        Cooldown: PT10M

By effectively utilizing the scaling capabilities of Azure App Service, you can ensure your applications are always available and performant, no matter the demand.