AWS Data Pipeline vs. dbt

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
AWS Data Pipeline
Score 9.5 out of 10
N/A
AWS Data Pipeline is a web service used to process and move data between different AWS compute and storage services, as well as on-premises data sources, at specified intervals. With AWS Data Pipeline, users can regularly access data where it’s stored, transform and process it at scale, and transfer the results to AWS services such as Amazon S3, Amazon RDS, Amazon DynamoDB, and Amazon EMR. AWS Data Pipeline is designed to help create complex data processing workloads that are fault tolerant,…N/A
dbt
Score 9.4 out of 10
N/A
dbt is an SQL development environment, developed by Fishtown Analytics, now known as dbt Labs. The vendor states that with dbt, analysts take ownership of the entire analytics engineering workflow, from writing data transformation code to deployment and documentation. dbt Core is distributed under the Apache 2.0 license, and paid Teams and Enterprise editions are available.
$0
per month per seat
Pricing
AWS Data Pipelinedbt
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
AWS Data Pipelinedbt
Free Trial
NoYes
Free/Freemium Version
NoYes
Premium Consulting/Integration Services
NoYes
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Features
AWS Data Pipelinedbt
Data Transformations
Comparison of Data Transformations features of Product A and Product B
AWS Data Pipeline
-
Ratings
dbt
9.7
5 Ratings
15% above category average
Simple transformations00 Ratings9.55 Ratings
Complex transformations00 Ratings9.95 Ratings
Data Modeling
Comparison of Data Modeling features of Product A and Product B
AWS Data Pipeline
-
Ratings
dbt
9.0
5 Ratings
10% above category average
Data model creation00 Ratings9.15 Ratings
Metadata management00 Ratings8.65 Ratings
Business rules and workflow00 Ratings8.05 Ratings
Collaboration00 Ratings9.83 Ratings
Testing and debugging00 Ratings9.55 Ratings
Best Alternatives
AWS Data Pipelinedbt
Small Businesses
Skyvia
Skyvia
Score 9.6 out of 10
Skyvia
Skyvia
Score 9.6 out of 10
Medium-sized Companies
Astera Centerprise
Astera Centerprise
Score 8.8 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
Enterprises
Astera Centerprise
Astera Centerprise
Score 8.8 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
AWS Data Pipelinedbt
Likelihood to Recommend
10.0
(1 ratings)
9.6
(7 ratings)
User Testimonials
AWS Data Pipelinedbt
Likelihood to Recommend
Amazon AWS
AWS Data Pipeline is a web service that helps you reliably process and move data between different AWS compute and storage services, as well as on-premises data sources, at specified intervals. With AWS Data Pipeline, you can regularly access your data where it’s stored, transform and process it at scale, and efficiently transfer the results to AWS services such as Amazon S3, Amazon RDS, Amazon DynamoDB, and Amazon EMR.
Read full review
dbt Labs
If you can load your data first into your warehouse, dbt is excellent. It does the T(ransformation) part of ELT brilliantly but does not do the E(xtract) or L(oad) part. If you know SQL or your development team knows SQL, it's a framework and extension around that. So, it's easy to learn and easy to hire people with that technical skill (as opposed to specific Informatica, SnapLogic, etc. experience). dbt uses plain text files and integrates with GitHub. You can easily see the changes made between versions. In GUI-based UIs it was always hard to tell what someone had changed. Each "model" is essentially a "SELECT" statement. You never need to do a "CREATE TABLE" or "CREATE VIEW" - it's all done for you, leaving you to work on the business logic. Instead of saying "FROM specific_db.schema.table" you indicate "FROM ref('my_other_model')". It creates an internal dependency diagram you can view in a DAG. When you deploy, the dependencies work like magic in your various environments. They also have great documentation, an active slack community, training, and support. I like the enhancements they have been making and I believe they are headed in a good direction.
Read full review
Pros
Amazon AWS
  • Helps you easily create complex data processing workloads
  • Fault tolerant
  • Highly available
Read full review
dbt Labs
  • user experience makes it easy to work with SQL and version control
  • customer success team and the dbt (data build tool) community help establish best practices
  • thorough and clear documentation
Read full review
Cons
Amazon AWS
  • Pipeline Stuck in Pending Status
  • Pipeline Component Stuck in Waiting for Runner Status
  • EMR Cluster Fails With Error
Read full review
dbt Labs
  • Slow load times of the dbt cloud environment (they're working on it via a new UI though)
  • More out-of-the-box solutions for managing procedures, functions, etc would be nice to have, but honestly, it's pretty easy to figure out how to adapt dbt macros
Read full review
Alternatives Considered
Amazon AWS
AWS data pipelines are easy to use over data factory for data engineers
Read full review
dbt Labs
Most ETL pipeline products have a T layer, but dbt just does it better. The transformation is on steroids compared to the others. Also, just allows much more Adhoc solutions for very specific projects. Those ETL tools are probably better on the T part if you don't need too many transforms - also dbt is pretty much free dependent on how you work it, also extremely scalable.
Read full review
Return on Investment
Amazon AWS
  • Easy to use
  • Data engineers are able to create the data pipelines quickly and effectively
  • Scalable and Fault tolerant
Read full review
dbt Labs
  • Simplified our BI layer for faster load times
  • Increased the quality of data reaching our end users
  • Makes complex transformations manageable
Read full review
ScreenShots