Apache Airflow - Reference
Welcome to the reference documentation for Apache Airflow. This section provides detailed information about Airflow's core components, configuration options, and best practices.
Overview
Apache Airflow is an open-source platform to programmatically author, schedule, and monitor workflows. Airflow's power lies in its flexibility and extensibility, allowing you to define complex data pipelines as directed acyclic graphs (DAGs) in Python.
This reference section is organized to help you quickly find the information you need:
- API Reference: Comprehensive documentation for Airflow's REST API, Python API, and CLI commands.
- Operators and Hooks: Detailed descriptions of built-in and community-provided operators and hooks.
- Providers: Information on how to use and manage Airflow providers.
- Configuration: A complete guide to Airflow's configuration options and their impact.
- Variables and Connections: How to manage Airflow Variables and Connections securely.
- Logging: Understanding and configuring Airflow's logging mechanisms.
API Reference
Dive deep into the technical details of Airflow's programmatic interfaces.
Operators and Hooks
Operators define individual tasks in your DAGs, while hooks provide interfaces to external platforms and databases.
- Built-in Operators and Hooks
- Provider Packages: Discover and use integrations from the Airflow Provider ecosystem.
Providers
Providers are packages that bundle integrations for specific technologies and services. This section guides you through their management and usage.
Configuration
Customize Airflow's behavior by understanding its extensive configuration options.
Airflow is configured primarily through the airflow.cfg file or environment variables. The following settings cover aspects like executors, logging, security, and more.
Variables and Connections
Variables allow you to store and retrieve arbitrary key-value pairs from your DAGs. Connections store credentials and metadata for external systems.
Logging
Effective logging is crucial for monitoring and debugging your Airflow deployments. Learn how to configure and access logs.