Apache Airflow vs. dbt

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Apache Airflow
Score 8.4 out of 10
N/A
Apache Airflow is an open source tool that can be used to programmatically author, schedule and monitor data pipelines using Python and SQL. Created at Airbnb as an open-source project in 2014, Airflow was brought into the Apache Software Foundation’s Incubator Program 2016 and announced as Top-Level Apache Project in 2019. It is used as a data orchestration solution, with over 140 integrations and community support.N/A
dbt
Score 9.2 out of 10
N/A
dbt is an SQL development environment, developed by Fishtown Analytics, now known as dbt Labs. The vendor states that with dbt, analysts take ownership of the entire analytics engineering workflow, from writing data transformation code to deployment and documentation. dbt Core is distributed under the Apache 2.0 license, and paid Teams and Enterprise editions are available.
$0
per month per seat
Pricing
Apache Airflowdbt
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
Apache Airflowdbt
Free Trial
NoYes
Free/Freemium Version
YesYes
Premium Consulting/Integration Services
NoYes
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details——
More Pricing Information
Community Pulse
Apache Airflowdbt
Considered Both Products
Apache Airflow

No answer on this topic

dbt
Chose dbt
Airflow can accomplish the same work as dbt (data build tool), however, dbt's (data build tool) development workflow and UI can open up data transformation and modeling work to non-data engineering teams. Looker might also be able to define data models via LookML with a …
Top Pros
Top Cons
Features
Apache Airflowdbt
Workload Automation
Comparison of Workload Automation features of Product A and Product B
Apache Airflow
8.2
9 Ratings
0% below category average
dbt
-
Ratings
Multi-platform scheduling8.89 Ratings00 Ratings
Central monitoring8.49 Ratings00 Ratings
Logging8.19 Ratings00 Ratings
Alerts and notifications7.99 Ratings00 Ratings
Analysis and visualization7.99 Ratings00 Ratings
Application integration8.49 Ratings00 Ratings
Data Transformations
Comparison of Data Transformations features of Product A and Product B
Apache Airflow
-
Ratings
dbt
9.7
5 Ratings
17% above category average
Simple transformations00 Ratings9.65 Ratings
Complex transformations00 Ratings9.95 Ratings
Data Modeling
Comparison of Data Modeling features of Product A and Product B
Apache Airflow
-
Ratings
dbt
9.0
5 Ratings
11% above category average
Data model creation00 Ratings9.15 Ratings
Metadata management00 Ratings8.65 Ratings
Business rules and workflow00 Ratings8.05 Ratings
Collaboration00 Ratings9.93 Ratings
Testing and debugging00 Ratings9.65 Ratings
Best Alternatives
Apache Airflowdbt
Small Businesses

No answers on this topic

Skyvia
Skyvia
Score 9.8 out of 10
Medium-sized Companies
ActiveBatch Workload Automation
ActiveBatch Workload Automation
Score 8.4 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
Enterprises
Redwood RunMyJobs
Redwood RunMyJobs
Score 9.3 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
Apache Airflowdbt
Likelihood to Recommend
7.4
(9 ratings)
9.6
(7 ratings)
User Testimonials
Apache Airflowdbt
Likelihood to Recommend
Apache
For a quick job scanning of status and deep-diving into job issues, details, and flows, AirFlow does a good job. No fuss, no muss. The low learning curve as the UI is very straightforward, and navigating it will be familiar after spending some time using it. Our requirements are pretty simple. Job scheduler, workflows, and monitoring. The jobs we run are >100, but still is a lot to review and troubleshoot when jobs don't run. So when managing large jobs, AirFlow dated UI can be a bit of a drawback.
Read full review
dbt Labs
If you can load your data first into your warehouse, dbt is excellent. It does the T(ransformation) part of ELT brilliantly but does not do the E(xtract) or L(oad) part. If you know SQL or your development team knows SQL, it's a framework and extension around that. So, it's easy to learn and easy to hire people with that technical skill (as opposed to specific Informatica, SnapLogic, etc. experience). dbt uses plain text files and integrates with GitHub. You can easily see the changes made between versions. In GUI-based UIs it was always hard to tell what someone had changed. Each "model" is essentially a "SELECT" statement. You never need to do a "CREATE TABLE" or "CREATE VIEW" - it's all done for you, leaving you to work on the business logic. Instead of saying "FROM specific_db.schema.table" you indicate "FROM ref('my_other_model')". It creates an internal dependency diagram you can view in a DAG. When you deploy, the dependencies work like magic in your various environments. They also have great documentation, an active slack community, training, and support. I like the enhancements they have been making and I believe they are headed in a good direction.
Read full review
Pros
Apache
  • In charge of the ETL processes.
  • As there is no incoming or outgoing data, we may handle the scheduling of tasks as code and avoid the requirement for monitoring.
Read full review
dbt Labs
  • user experience makes it easy to work with SQL and version control
  • customer success team and the dbt (data build tool) community help establish best practices
  • thorough and clear documentation
Read full review
Cons
Apache
  • they should bring in some time based scheduling too not only event based
  • they do not store the metadata due to which we are not able to analyze the workflows
  • they only support python as of now for scripted pipeline writing
Read full review
dbt Labs
  • Slow load times of the dbt cloud environment (they're working on it via a new UI though)
  • More out-of-the-box solutions for managing procedures, functions, etc would be nice to have, but honestly, it's pretty easy to figure out how to adapt dbt macros
Read full review
Alternatives Considered
Apache
There are a number of reasons to choose Apache Airflow over other similar platforms- Integrations—ready-to-use operators allow you to integrate Airflow with cloud platforms (Google, AWS, Azure, etc) Apache Airflow helps with backups and other DevOps tasks, such as submitting a Spark job and storing the resulting data on a Hadoop cluster It has machine learning model training, such as triggering a Sage maker job.
Read full review
dbt Labs
Most ETL pipeline products have a T layer, but dbt just does it better. The transformation is on steroids compared to the others. Also, just allows much more Adhoc solutions for very specific projects. Those ETL tools are probably better on the T part if you don't need too many transforms - also dbt is pretty much free dependent on how you work it, also extremely scalable.
Read full review
Return on Investment
Apache
  • A lot of helpful features out-of-the-box, such as the DAG visualizations and task trees
  • Allowed us to implement complex data pipelines easily and at a relatively low cost
Read full review
dbt Labs
  • Simplified our BI layer for faster load times
  • Increased the quality of data reaching our end users
  • Makes complex transformations manageable
Read full review
ScreenShots