Azure Documentation

General Troubleshooting

This section provides guidance on common troubleshooting scenarios and solutions for Azure services.

Common Issues and Solutions

Here are some frequently encountered problems and their recommended resolutions:

  • Service Not Responding: Check Azure Service Health dashboard, review resource logs, and restart the service if applicable.
  • Authentication Failures: Verify credentials, check access policies, and ensure correct permissions are assigned.
  • Performance Degradation: Monitor resource utilization (CPU, memory, network), optimize queries, and consider scaling up or out.
  • Deployment Errors: Review deployment logs for specific error messages, validate ARM templates or Terraform configurations, and check resource quotas.
  • Connectivity Problems: Examine network security groups, firewalls, virtual network configurations, and DNS settings.

Gathering Diagnostic Information

Effective troubleshooting often requires collecting relevant diagnostic data. Key tools and resources include:

  • Azure Monitor: For metrics, logs, and activity logs.
  • Application Insights: For application performance monitoring and diagnostics.
  • Resource Logs: Specific to each Azure service, providing detailed operational information.
  • Azure CLI / PowerShell: For scripting diagnostic commands and retrieving resource information.
  • Network Watcher: For network diagnostics like connection troubleshoot, IP flow verify, and packet capture.
Tip: Enable diagnostic settings for your Azure resources to send logs and metrics to a storage account or Log Analytics workspace for easier analysis.

Step-by-Step Troubleshooting Guides

Follow these guides for specific troubleshooting scenarios:

Scenario 1: Unable to connect to a Virtual Machine

  1. Verify that the VM is running.
  2. Check Network Security Group (NSG) rules to ensure the required ports (e.g., RDP 3389, SSH 22) are open.
  3. Confirm that the VM's operating system firewall is not blocking traffic.
  4. Use Network Watcher's "IP Flow Verify" to check if traffic is being denied by an NSG.
  5. If using a public IP, ensure it's correctly assigned and accessible.
  6. Try connecting via the Azure portal's serial console for OS-level diagnostics.

Scenario 2: Application deployment failed

  1. Go to the deployment resource in the Azure portal and check the "Deployment details" for specific error messages.
  2. Review the Activity Log for the resource group or subscription to identify any related events.
  3. Ensure that all required parameters and resource dependencies are correctly specified in your deployment template (ARM, Bicep, Terraform).
  4. Check Azure service quotas and limits for the subscription.
  5. If deploying to App Service or Functions, examine the application logs for runtime errors.

Common Error Codes

Understanding common error codes can significantly speed up resolution:

  • HTTP 401 Unauthorized: Check authentication credentials and permissions.
  • HTTP 403 Forbidden: Verify authorization and access control.
  • HTTP 404 Not Found: Ensure the requested resource exists and the URL is correct.
  • HTTP 500 Internal Server Error: Indicates a problem on the server side. Review application and server logs.
  • HTTP 503 Service Unavailable: The service is temporarily overloaded or down. Check service health and consider retrying.
Warning: Do not blindly retry operations that result in persistent errors. Investigate the root cause first to avoid exacerbating the problem.

Next Steps

If you are still experiencing issues after consulting this guide, please refer to the Azure Support page to open a support request or consult community forums.