Last week, we unpacked the Medallion Architecture (Bronze, Silver, Gold) and how layered design brings clarity to modern data platforms. But layers alone don't keep things running, you need orchestration to move data reliably between them. That's where orchestration tools come in. Let's break down the big three: Airflow, Prefect, and Dagster.
Introduction
Orchestration is how you ensure that jobs run in the right order, with the right dependencies, and with visibility when they fail. In other words: it's the glue between extraction, transformation, and serving.
Modern orchestration has evolved from cron jobs to robust frameworks with scheduling, retries, observability, and lineage.
Apache Airflow
Overview:
- The OG of modern orchestration, created at Airbnb (2014)
- Uses Directed Acyclic Graphs (DAGs) to define dependencies
Strengths:
- Widely adopted, massive community
- Highly extensible with operators/plugins
- Great for complex, dependency-heavy workflows
Tradeoffs:
- Steeper learning curve (Python-heavy)
- Scheduler overhead can get tricky at scale
- UI feels dated compared to newer tools
Best for: Large orgs with mixed batch/ETL workloads and in-house expertise.
Prefect
Overview:
- Built as a simpler, more Pythonic alternative to Airflow
- Uses a "flow" abstraction instead of DAGs
Strengths:
- Easier developer experience
- Flexible: works locally, in the cloud, or hybrid
- Rich observability built-in
Tradeoffs:
- Smaller ecosystem vs. Airflow
- Less battle-tested at extreme scale
Best for: Teams that want orchestration without heavy infra overhead.
Dagster
Overview:
- Newer player focused on data-aware orchestration
- Strong integration with data assets and lineage
Strengths:
- Treats data as first-class: you define assets, not just tasks
- Great observability, type systems, and lineage tracking
- Strong fit with dbt and modern data stacks
Tradeoffs:
- Younger ecosystem, smaller adoption
- Requires mindset shift for engineers used to Airflow/Prefect
Best for: Modern data teams embracing data products, lineage, and quality.
How to Choose
- Airflow: You need enterprise-scale, complex workflows, and your team is comfortable managing infra.
- Prefect: You want simplicity, fast adoption, and cloud-native ease.
- Dagster: You're building modern, asset-centric pipelines with strong lineage needs.
Real-World Example
A fintech startup:
- Uses Dagster to orchestrate dbt models and monitor asset lineage
- Runs Prefect for quick ad-hoc automations
- Still has some legacy Airflow DAGs for batch ETL jobs
Over time, they migrate more workloads into Dagster to align with their data product mindset.
Best Practices
- Start simple: Don't overcomplicate with 3 tools at once
- Document DAGs/flows as part of onboarding
- Add alerts + observability from day one
- Align tool choice with your team's skill set and stack maturity
Conclusion
Data orchestration isn't about picking the best tool, it's about picking the right fit for your stack and team.
Airflow is the veteran, Prefect is the pragmatist, and Dagster is the visionary. All three can keep your Bronze, Silver, Gold layers flowing.