Home > Azure > App Services > Scaling

Scaling Azure App Services

Azure App Service provides powerful tools to scale your web applications, APIs, and mobile backends. Scaling allows your application to handle varying loads, ensuring performance and availability for your users. There are two primary scaling methods: manual scaling and automatic scaling.

Understanding Scaling Types

Vertical Scaling (Scale Up)

Vertical scaling involves increasing the resources allocated to a single instance of your App Service. This means moving to a higher pricing tier that offers more CPU, memory, and storage. It's like giving your existing server more power.

Pros: Simple to implement, often requires minimal code changes.
Cons: Has a physical limit, can become expensive at the highest tiers, may require application restarts.

Vertical scaling is managed directly in the Azure portal under the "Scale up (App Service plan)" section of your App Service.

Horizontal Scaling (Scale Out)

Horizontal scaling involves increasing the number of instances running your application. Each instance runs on its own compute resources. This is ideal for distributing load and increasing capacity.

Pros: Highly scalable, can handle very large loads, improves fault tolerance.
Cons: Requires your application to be stateless or designed for distributed environments, potential for increased complexity.

Horizontal scaling can be performed manually or automatically.

Manual Scaling

Manual scaling allows you to set a fixed number of instances for your App Service. This is useful if you have predictable traffic patterns or want to quickly adjust capacity.

To manually scale:

Navigate to your App Service in the Azure portal.
In the left-hand menu, select "Scale out (custom zones)" or "Scale out (managed)" depending on your plan.
Under "Scale instance count", set the desired number of instances.
Click "Save".

Automatic Scaling (Autoscaling)

Automatic scaling allows your App Service to scale out or in based on predefined rules and metrics. This ensures optimal performance and cost efficiency by automatically adjusting the number of instances to match demand.

Configuring Autoscaling Rules

Autoscaling rules are configured based on metrics such as CPU percentage, memory usage, HTTP queue length, or custom metrics.

Tip: It's recommended to configure both scale-out and scale-in rules. Scale-out rules handle increasing load, while scale-in rules reduce the number of instances when the load decreases, saving costs.

Common Autoscaling Metrics:

CPU Percentage: Scales based on the average CPU utilization across all instances.
Memory Percentage: Scales based on the average memory utilization.
HTTP Queue Length: Scales when the number of pending HTTP requests exceeds a threshold.
Disk Queue Length: Scales based on I/O operations.

Azure App Service Autoscaling Rules Example

Example of configuring autoscaling rules for CPU percentage.

Autoscaling Settings:

Minimum Number of Instances: The smallest number of instances your app will run.
Maximum Number of Instances: The largest number of instances your app can scale to.
Default Number of Instances: The initial number of instances when autoscaling is enabled.
Scale Actions: Define the metric to monitor, the condition (e.g., greater than, less than), the threshold value, and the number of instances to add or remove.
Cooldown Period: The duration after a scale action during which no other scale actions are triggered. This prevents rapid fluctuations.

Important: Ensure your application is designed to handle multiple instances gracefully. This includes managing session state and avoiding single points of failure. Stateless applications are generally easier to scale horizontally.

App Service Plan Tiers and Scaling

The scalability options available depend on your App Service plan tier. Free and Shared tiers have limited or no scaling capabilities. Dedicated tiers (Basic, Standard, Premium, Isolated) offer full vertical and horizontal scaling features.

Tier	Vertical Scaling	Horizontal Scaling	Autoscaling
Free/Shared	Limited	No	No
Basic	Yes (within tier limits)	Manual (up to a small number)	No
Standard	Yes (within tier limits)	Manual (more instances)	Yes
Premium	Yes (more powerful instances)	Manual (many instances)	Yes
Isolated	Yes (dedicated hardware)	Manual (many instances)	Yes

Best Practices for Scaling

Monitor Performance: Regularly monitor your application's performance metrics to understand its behavior under load.
Set Realistic Limits: Define appropriate minimum and maximum instance counts to balance performance and cost.
Test Your Rules: Thoroughly test your autoscaling rules under simulated load conditions.
Stateless Design: Aim for a stateless application architecture to simplify horizontal scaling. Use external services for session state if needed.
Choose the Right Tier: Select an App Service plan tier that meets your performance and scaling requirements.
Consider Zones: For high availability, consider deploying across multiple availability zones using the Scale out (custom zones) option.

By effectively implementing scaling strategies, you can ensure your Azure App Service applications remain responsive, available, and cost-efficient, regardless of user traffic fluctuations.