Azure Synapse Analytics

Unify your data warehousing and Big Data analytics experience.

What is Azure Synapse Analytics?

Azure Synapse Analytics is an enterprise analytics service that accelerates time to insight across data warehouses and Big Data systems. It brings together data integration, enterprise data warehousing, and Big Data analytics in a single service.

Synapse combines SQL and Apache Spark engines in a unified workspace where you can ingest, prepare, manage, and serve data for BI and machine learning.

Key Benefit

Synapse unifies disparate data sources and analytic technologies, simplifying the development and operationalization of complex data solutions.

Key Features

Architecture Overview

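Azure Synapse Analytics comprises several core components, including the Synapse workspace, Synapse Pipelines for data integration, serverless and dedicated SQL pools, and Apache Spark pools for notebooks.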

These components work together to provide a flexible and powerful analytics platform.

Getting Started with Synapse

Follow these steps to begin using Azure Synapse Analytics:

  1. Create a Synapse Workspace: Provision a new workspace in the Azure portal.
  2. Ingest Data: Use Synapse Pipelines or other tools to bring your data into Synapse or a linked data lake.
  3. Explore Data:
    • Use Serverless SQL pools to query data directly from Azure Data Lake Storage Gen2.
    • Use Spark Notebooks to perform complex data transformations and analysis.
    • Load data into Dedicated SQL pools for high-performance warehousing.
  4. Build and Deploy: Develop ETL/ELT pipelines, machine learning models, and BI dashboards.
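
The load option in step 3 can be sketched with the T-SQL COPY statement run in a dedicated SQL pool. This is a minimal sketch under stated assumptions, not a definitive implementation: the table dbo.Trips, its columns, and the storage URL are hypothetical placeholders, the CSV files are assumed to have a header row, and the workspace managed identity is assumed to have read access to the storage account.

```sql
-- Hypothetical target table; column names and types are placeholders.
CREATE TABLE dbo.Trips
(
    TripId     INT            NOT NULL,
    PickupTime DATETIME2      NOT NULL,
    FareAmount DECIMAL(10, 2) NULL
)
WITH
(
    DISTRIBUTION = ROUND_ROBIN,      -- spread rows evenly across distributions
    CLUSTERED COLUMNSTORE INDEX      -- columnar storage suited to analytic scans
);

-- Bulk-load the CSV files from the data lake into the table.
COPY INTO dbo.Trips
FROM 'https://yourdatalake.dfs.core.windows.net/yourcontainer/yourfolder/*.csv'
WITH
(
    FILE_TYPE = 'CSV',
    FIRSTROW = 2,                                  -- assumes row 1 is a header
    CREDENTIAL = (IDENTITY = 'Managed Identity')   -- assumes managed identity access
);
```

Round-robin distribution is used here only because it needs no knowledge of the data; a hash-distributed table may be a better fit for large tables that are joined often.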

Refer to the official Azure Synapse Analytics documentation for detailed guides and tutorials.

Example: Querying data with Serverless SQL

You can query CSV files in your data lake using T-SQL:


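-- Preview the first 100 rows across all CSV files in the folder.
SELECT
    TOP 100 *
FROM
    OPENROWSET(
        BULK 'https://yourdatalake.dfs.core.windows.net/yourcontainer/yourfolder/*.csv',
        FORMAT = 'CSV',
        PARSER_VERSION = '2.0'  -- needed for automatic schema inference on CSV
    ) AS [result];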
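
When you would rather control column names and types than rely on inference, OPENROWSET also accepts a WITH clause that declares the schema. The following is a sketch under assumptions: the columns customer_id and customer_name, their types, and the presence of a header row are hypothetical and should be adapted to your files.

```sql
SELECT
    TOP 100 *
FROM
    OPENROWSET(
        BULK 'https://yourdatalake.dfs.core.windows.net/yourcontainer/yourfolder/*.csv',
        FORMAT = 'CSV',
        PARSER_VERSION = '2.0',
        FIRSTROW = 2                 -- assumes row 1 is a header and skips it
    )
    WITH (
        -- column name, type, and 1-based ordinal position in the file
        customer_id   INT          1,
        customer_name VARCHAR(100) 2
    ) AS [result];
```

Declaring narrow types up front tends to reduce the amount of data the query has to process, since inferred string columns are often wider than necessary.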
            

Common Use Cases