Apache Hadoop vs. dbt

Overview
Product: Hadoop
Rating: 7.2 out of 10
Most Used By: N/A
Product Summary: Hadoop is open-source software from Apache that supports distributed processing and data storage. Hadoop is popular for its scalability, reliability, and functionality available on commodity hardware.
Starting Price: N/A

Product: dbt
Rating: 9.0 out of 10
Most Used By: N/A
Product Summary: dbt is a SQL development environment developed by Fishtown Analytics, now known as dbt Labs. The vendor states that with dbt, analysts take ownership of the entire analytics engineering workflow, from writing data transformation code to deployment and documentation. dbt Core is distributed under the Apache 2.0 license, and paid Teams and Enterprise editions are available.
Starting Price: $0 per month per seat
Pricing
Editions & Modules
Apache Hadoop: No answers on this topic
dbt: No answers on this topic
Pricing Offerings
Free Trial: Hadoop No; dbt Yes
Free/Freemium Version: Hadoop Yes; dbt Yes
Premium Consulting/Integration Services: Hadoop No; dbt Yes
Entry-level Setup Fee: Hadoop No setup fee; dbt No setup fee
Additional Details: Hadoop -; dbt -
Features
Data Transformations
Apache Hadoop: -
dbt: 9.5 (7 ratings), 16% above category average
Simple transformations: Hadoop - (0 ratings); dbt 10.0 (7 ratings)
Complex transformations: Hadoop - (0 ratings); dbt 9.1 (7 ratings)
Data Modeling
Apache Hadoop: -
dbt: 9.0 (7 ratings), 12% above category average
Data model creation: Hadoop - (0 ratings); dbt 9.5 (7 ratings)
Metadata management: Hadoop - (0 ratings); dbt 8.5 (7 ratings)
Business rules and workflow: Hadoop - (0 ratings); dbt 8.9 (7 ratings)
Collaboration: Hadoop - (0 ratings); dbt 10.0 (5 ratings)
Testing and debugging: Hadoop - (0 ratings); dbt 8.1 (7 ratings)
Best Alternatives
Small Businesses
Apache Hadoop: No answers on this topic
dbt: Skyvia (Score 9.8 out of 10)
Medium-sized Companies
Apache Hadoop: Cloudera Manager (Score 9.9 out of 10)
dbt: IBM InfoSphere Information Server (Score 8.0 out of 10)
Enterprises
Apache Hadoop: IBM Analytics Engine (Score 7.8 out of 10)
dbt: IBM InfoSphere Information Server (Score 8.0 out of 10)
User Ratings
Likelihood to Recommend: Hadoop 8.0 (37 ratings); dbt 10.0 (9 ratings)
Likelihood to Renew: Hadoop 9.6 (8 ratings); dbt - (0 ratings)
Usability: Hadoop 8.0 (6 ratings); dbt 9.5 (2 ratings)
Performance: Hadoop 8.0 (1 rating); dbt - (0 ratings)
Support Rating: Hadoop 7.5 (3 ratings); dbt - (0 ratings)
Online Training: Hadoop 6.1 (2 ratings); dbt - (0 ratings)
User Testimonials
Likelihood to Recommend
Apache
Overall, Apache Hadoop is well suited to large, unstructured data flows, such as aggregated web traffic or even advertising data. I think Apache Hadoop is great when you literally have petabytes of data that need to be stored and processed on an ongoing basis. I would also recommend supplementing it with a faster, interactive database for better querying. Lastly, it's very cost-effective, so it is worth giving it a shot before coming to any conclusion.
dbt Labs
The prerequisite is that you have a supported database/data warehouse and have already found a way to ingest your raw data. Then dbt is very well suited to manage your transformation logic if the people using it are familiar with SQL. If you want to benefit from bringing engineering practices to data, dbt is a great fit. It can bring CI/CD practices, version control, automated testing, documentation generation, etc. It is not so well suited if the people managing the transformation logic do not like to code (in SQL) but prefer graphical user interfaces.
Pros
Apache
  • Handles large amounts of unstructured data well, for business level purposes
  • Is a good catchall because of this design, i.e. what does not fit into our vertical tables fits here.
  • Decent for large ETL pipelines and logging free-for-alls because of this, also.
dbt Labs
  • dbt supports version control through Git, which allows teams to collaborate on and track the data transformation logic.
  • dbt allows us to build data models, which help to break complex transformation logic into smaller, simpler pieces.
  • dbt is completely based on SQL, which allows data analysts and data engineers to build the transformation logic.
  • dbt can be easily integrated with Snowflake.
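The idea of breaking complex transformation logic into smaller models can be illustrated with a minimal sketch. This is not dbt itself: it uses Python's built-in sqlite3 module and plain SQL views to imitate layered models, and the table and view names (raw_orders, stg_orders, customer_revenue) are hypothetical.

```python
import sqlite3

# Illustrative only: plain SQL views in SQLite stand in for layered models.
# All table, view, and column names here are hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE raw_orders (id INTEGER, customer TEXT, amount_cents INTEGER, status TEXT);
    INSERT INTO raw_orders VALUES
        (1, 'a', 1000, 'paid'),
        (2, 'a', 2500, 'refunded'),
        (3, 'b', 4000, 'paid');

    -- Step 1: a small "staging" view that only cleans and converts units.
    CREATE VIEW stg_orders AS
        SELECT id, customer, amount_cents / 100.0 AS amount, status
        FROM raw_orders;

    -- Step 2: a downstream view that builds on the staging view.
    CREATE VIEW customer_revenue AS
        SELECT customer, SUM(amount) AS revenue
        FROM stg_orders
        WHERE status = 'paid'
        GROUP BY customer;
""")

# Each (customer, revenue) row becomes one dictionary entry.
revenue = dict(conn.execute(
    "SELECT customer, revenue FROM customer_revenue ORDER BY customer"
))
print(revenue)
```

Each view holds one small, reviewable step, and the final query reads only from the last layer, which is the general shape the reviewer describes.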
Cons
Apache
  • Less of an organizational support system: bugs need to be fixed, and outside help takes a long time to push updates
  • Not for small data sets
  • Data security needs to be ramped up
  • The NameNode is not replicated, so recovering from a NameNode failure takes a lot of time
dbt Labs
  • Field-level lineage (currently at table level)
  • Documentation inheritance - if a field is documented the downstream field of the same name could inherit the doc info
  • Adding Python model support (in beta now)
Likelihood to Renew
Apache
Hadoop is organization-independent, can be used for purposes ranging from archiving to reporting, and can make use of economical commodity hardware. There are also significant savings in licensing costs, since most of the Hadoop ecosystem is open source and free.
dbt Labs
No answers on this topic
Usability
Apache
The enterprise-licensed version of Hadoop is quite fine-tuned and easy to use, which makes it a good choice for Hadoop administrators. It scales well, and its integration with Kerberos is a good option for authentication and authorization. Installation could be improved, and so could logging, which would make debugging easier. Parallel processing of data is achieved easily.
dbt Labs
It requires proficiency with SQL coding and with Git practices, but with these prerequisites, it is easy to use. Especially with dbt Cloud, you get a nice interface that makes administrative tasks like scheduling jobs quite easy. I also like the built-in SQL editor with syntax highlighting and auto-completion.
Support Rating
Apache
It's a great value for what you pay, and most database administrators (DBAs) can walk in and use it without substantial training. I tend to dabble on the analyst side, so querying the data I need feels like it can take forever, especially on higher-traffic days like Monday.
dbt Labs
No answers on this topic
Online Training
Apache
Hadoop is a complex topic and best suited to classroom training. Online training is a waste of time and money.
dbt Labs
No answers on this topic
Alternatives Considered
Apache
We have not used any product other than Hadoop, and I don't think our company will switch to another product, as Hadoop is providing excellent results. Our company is growing rapidly, and Hadoop helps us keep up our performance and meet customer expectations. We also use HDFS, which provides very high bandwidth to support MapReduce workloads.
dbt Labs
I actually don't know what the alternative to dbt is. I'm sure one must exist other than more 'roll your own' options like Apache Airflow, say, but in terms of super easy managed/cloud data transforms, dbt really does seem to be THE tool to use. It's $50/month per dev, BUT there's a FREE version for 1 dev seat with no read-only access for anyone else, so you can always start with that and then buy yourself a seat later.
Return on Investment
Apache
  • Hadoop has many advantages. First, it has made managing and processing extremely large volumes of data very easy, and it has simplified the lives of many people, including me.
  • Hadoop is quite interesting because of its new and improved features and innovative functions.
dbt Labs
  • Simplified our BI layer for faster load times
  • Increased the quality of data reaching our end users
  • Makes complex transformations manageable