Deploying Your First Azure OpenAI Model
Overview
Azure OpenAI Service allows you to leverage powerful language models like GPT-4, GPT-3.5 Turbo, and DALL-E for a wide range of natural language processing tasks. Deploying a model is the first step to integrating these capabilities into your applications.
Prerequisites
- An Azure Subscription.
- Access to Azure OpenAI Service.
- An Azure OpenAI Service resource created in your subscription.
If you haven't already, please visit the Azure OpenAI Service access request page to apply for access.
Steps to Deploy a Model
- Navigate to your Azure OpenAI Resource: Go to the Azure portal and find your Azure OpenAI Service resource.
- Open the Azure OpenAI Studio: Within your resource, click on the "Go to Azure OpenAI Studio" button.
- Select "Deployments": In the Studio, navigate to the "Deployments" section in the left-hand menu.
- Create a New Deployment: Click the "+ Create new deployment" button.
- Choose a Model: Select the desired model from the dropdown list (e.g., `gpt-35-turbo`, `text-davinci-003`).
- Configure Deployment Settings:
- Deployment name: A unique name for your deployment (e.g., `my-chat-deployment`).
- Content filter: Configure content filtering settings as needed.
- Advanced options: Set capacity (tokens per minute) based on your needs and the model's limits.
- Create: Click the "Create" button to deploy the model.
Example Deployment (CLI)
You can also deploy models using the Azure CLI:
az cognitiveservices account deployment create --name <deployment-name> --resource-group <resource-group-name> --server <your-azure-openai-resource-name> --model-name <model-name> --capacity <tokens-per-minute>
Replace the placeholders like <deployment-name>, <resource-group-name>, <your-azure-openai-resource-name>, <model-name>, and <tokens-per-minute> with your specific values.
Note: Deployment names are unique within a resource and are used in API calls. Model availability may vary by region.
Next Steps
Once your model is deployed, you can start interacting with it:
- Explore the playground in Azure OpenAI Studio.
- Integrate your deployment into applications using the REST API or SDKs.
- Learn about prompt engineering to get the best results.