Apache Hadoop vs. Hortonworks Data Platform vs. IBM Db2 Big SQL

Apache Hadoop

Apache Hadoop

270 Reviews and Ratings

Hortonworks Data Platform

Hortonworks Data Platform

37 Reviews and Ratings

IBM Db2 Big SQL

IBM Db2 Big SQL

17 Reviews and Ratings

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Hadoop	Score 7.5 out of 10	N/A	Hadoop is an open source software from Apache, supporting distributed processing and data storage. Hadoop is popular for its scalability, reliability, and functionality available across commoditized hardware.	N/A
Hortonworks Data Platform	Score 5.0 out of 10	N/A	Hortonworks Data Platform (HDP) is an open source framework for distributed storage and processing of large, multi-source data sets. HDP modernizes IT infrastructure and keeps data secure—in the cloud or on-premises—while helping to drive new revenue streams, improve customer experience, and control costs. Hortonworks merged with Cloudera in eary 2019.	N/A
Db2 Big SQL	Score 9.0 out of 10	N/A	IBM offers Db2 Big SQL, an enterprise grade hybrid ANSI-compliant SQL on Hadoop engine, delivering massively parallel processing (MPP) and advanced data query. Big SQL offers a single database connection or query for disparate sources such as HDFS, RDMS, NoSQL databases, object stores and WebHDFS.	N/A

Pricing

Apache Hadoop

Hortonworks Data Platform

IBM Db2 Big SQL

Editions & Modules

No answers on this topic

No answers on this topic

No answers on this topic

Offerings

Pricing Offerings
Hadoop	Hortonworks Data Platform	Db2 Big SQL
Free Trial
No	No	No
Free/Freemium Version
Yes	No	No
Premium Consulting/Integration Services
No	No	No

Entry-level Setup Fee

No setup fee

No setup fee

No setup fee

Additional Details

—

—

—

More Pricing Information

Community Pulse
	Apache Hadoop	Hortonworks Data Platform	IBM Db2 Big SQL
Considered Multiple Products	Hadoop Piyush Routray Senior Software Developer Chose Apache Hadoop Hadoop being open source, is cheaper to use and do POCs for clients. Cloudera, Hortonworks and MapR also compete to contribute to open source Hadoop and keep their product conceptually similar to Hadoop. Incentivized Helpful? Vinay Suneja Senior Consultant Level II Chose Apache Hadoop Amazon Redshift is some what closer to Hadoop. But to analyze Petabytes of data Hadoop as better performance. Incentivized Helpful?	Hortonworks Data Platform Verified User Engineer Chose Hortonworks Data Platform Hortonworks Data Platform is on par with, if not better than, Cloudera or MapR. It provides a big list of components (25-30) that you can pick and use based on your needs. It provides an easy and convenient way to add/remove any of those. It provides a good way of integrating … Incentivized Helpful? Fernando López Bello Big Data & Cognitive Computing Practice Leader Chose Hortonworks Data Platform There are many alternatives, but in order to provide a short list: - Cloudera CDP is the obvious contendant or alternative, being a leader in big data platforms - MapR Incentivized Helpful? Verified User Engineer Chose Hortonworks Data Platform With its great performance and other benefits, we eventually moved from Cloudera to the Hortonworks platform. Incentivized Helpful? Bharadwaj (Brad) Chivukula Sr.Technical Manager/Delivery Manager Chose Hortonworks Data Platform Licensing cost is high when compared to other distribution partners VM setup - It's not as good as what Cloudera provides Incentivized Helpful? Piyush Routray Senior Software Developer Chose Hortonworks Data Platform While Apache Hadoop is completely open sourced, Hortonworks Data Platform offers support as well as keeps pace with the open source versions. Also, the HDP open sources its own products, thus giving back to the community. I find using the Hortonworks Data Platform more … Incentivized Helpful? Wonoh Kim Principal Software Engineer Chose Hortonworks Data Platform Apache, Cloudera, MapR, and IBM. Hortonworks Data Platform is more efficient to use than Apache since you don't need to configure everything by yourself. Again, Cloudera, MapR, and IBM is proprietary software. Incentivized Helpful?	Db2 Big SQL No answer on this topic

Best Alternatives
	Apache Hadoop	Hortonworks Data Platform	IBM Db2 Big SQL
Small Businesses	No answers on this topic	No answers on this topic	No answers on this topic
Medium-sized Companies	Cloudera Manager Score 9.9 out of 10	Cloudera Manager Score 9.9 out of 10	Cloudera Manager Score 9.9 out of 10
Enterprises	IBM Analytics Engine Score 7.2 out of 10	IBM Analytics Engine Score 7.2 out of 10	IBM Analytics Engine Score 7.2 out of 10
All Alternatives	View all alternatives	View all alternatives	View all alternatives

User Ratings
	Apache Hadoop	Hortonworks Data Platform	IBM Db2 Big SQL
Likelihood to Recommend	8.0 (37 ratings)	7.0 (9 ratings)	9.0 (2 ratings)
Likelihood to Renew	9.6 (8 ratings)	- (0 ratings)	- (0 ratings)
Usability	8.0 (6 ratings)	- (0 ratings)	8.0 (1 ratings)
Performance	8.0 (1 ratings)	- (0 ratings)	- (0 ratings)
Support Rating	7.5 (3 ratings)	- (0 ratings)	8.8 (2 ratings)
Online Training	6.1 (2 ratings)	- (0 ratings)	- (0 ratings)
Implementation Rating	- (0 ratings)	9.0 (1 ratings)	- (0 ratings)

User Testimonials
	Apache Hadoop	Hortonworks Data Platform	IBM Db2 Big SQL
Likelihood to Recommend	Apache Altogether, I want to say that Apache Hadoop is well-suited to a larger and unstructured data flow like an aggregation of web traffic or even advertising. I think Apache Hadoop is great when you literally have petabytes of data that need to be stored and processed on an ongoing basis. Also, I would recommend that the software should be supplemented with a faster and interactive database for a better querying service. Lastly, it's very cost-effective so it is good to give it a shot before coming to any conclusion. Incentivized Peter Suter Senior Software Engineer (GUI) Read full review	Cloudera I find HDP easy to use and solves most of the problems for people looking to manage their big data. Evaluating the Hortonworks Data Platform is easy as it is free to download and install in your cluster. Single node cluster available as Sandbox is also easy for POCs. Incentivized Piyush Routray Senior Software Developer Read full review	IBM My recommendation obviously would depend on the application. But I think given the right requirements, IBM DB2 Big SQL is definitely a contender for a database platform. Especially when disparate data and multiple data stores are involved. I like the fact I can use the product to federate my data and make it look like it's all in one place. The engine is high performance and if you desire to use Hadoop, this could be your platform. Incentivized Gene Baker Vice President, Chief Architect, Development Manager and Software Engineer Read full review
Pros	Apache Handles large amounts of unstructured data well, for business level purposes Is a good catchall because of this design, i.e. what does not fit into our vertical tables fits here. Decent for large ETL pipelines and logging free-for-alls because of this, also. Incentivized JH Joe Hughes Senior DevOps Engineer Read full review	Cloudera It does a good job of packaging a lot of big data components into bundles and lets you use the ones you are interested in or need. It supports an extensive list of components which lets us solve many problems. It provides the ability to manage installations and maintenance using Apache Ambari. It helps us in using management packs to install/upgrade components easily. It also helps us add, remove components, add, remove hosts, perform upgrades in a convenient manner. It also provides alerts and notifications and monitors the environment. What they excel in is packaging open source components that are relevant and are useful to solve and complement each other as well as contribute to enhancing those components. They do a great job in the community to keep on top of what would be useful to users, fixing bugs and working with other companies and individuals to make the platform better. Incentivized Verified User Anonymous Read full review	IBM data storage data manipulation data definitions data reliability Incentivized JS John Spies Database Administrator Read full review
Cons	Apache Less organizational support system. Bugs need to be fixed and outside help take a long time to push updates Not for small data sets Data security needs to be ramped up Failure in NameNode has no replication which takes a lot of time to recover Incentivized Bharadwaj (Brad) Chivukula Sr. Engineering Manager/Delivery Manager Read full review	Cloudera Since it doesn't come with propriety tools for big data management, additional integration is need (for query handling, search, etc). It was very straightforward to store clinical data without relations, such as data from sensors of a medical device. But it has limitations when needed to combine the data with other clinical data in structured format (e.g. lab results, diagnosis). Overall look and feel of front-end management tools (e.g. monitoring) are not good. It is not bad but it doesn't look professional. Incentivized Verified User Anonymous Read full review	IBM Cloud readiness. Ease of implementation. Incentivized Gene Baker Vice President, Chief Architect, Development Manager and Software Engineer Read full review
Likelihood to Renew	Apache Hadoop is organization-independent and can be used for various purposes ranging from archiving to reporting and can make use of economic, commodity hardware. There is also a lot of saving in terms of licensing costs - since most of the Hadoop ecosystem is available as open-source and is free Bhushan Lakhe Senior Vice President Read full review	Cloudera No answers on this topic	IBM No answers on this topic
Usability	Apache As Hadoop enterprise licensed version is quite fine tuned and easy to use makes it good choice for Hadoop administrators. It’s scalability and integration with Kerberos is good option for authentication and authorisation. installation can be improved. logging can be improved so that it become easier for debugging purposes. parallel processing of data is achieved easily. Incentivized Verified User Anonymous Read full review	Cloudera No answers on this topic	IBM IBM DB2 is a solid service but hasn't seen much innovation over the past decade. It gets the job done and supports our IT operations across digital so it is fair. Incentivized JS John Spies Database Administrator Read full review
Support Rating	Apache It's a great value for what you pay, and most Data Base Administrators (DBAs) can walk in and use it without substantial training. I tend to dabble on the analyst side, so querying the data I need feels like it can take forever, especially on higher traffic days like Monday. Incentivized Blake Baron Senior Financial Analyst Read full review	Cloudera No answers on this topic	IBM IBM did a good job of supporting us during our evaluation and proof of concept. They were able to provide all necessary guidance, answer questions, help us architect it, etc. We were pleased with the support provided by the vendor. I will caveat and say this support was all before the sale, however, we have a ton of IBM products and they provide the same high level of support for all of them. I didn't see this being any different. I give IBM support two thumbs up! Incentivized Gene Baker Vice President, Chief Architect, Development Manager and Software Engineer Read full review
Online Training	Apache Hadoop is a complex topic and best suited for classrom training. Online training are a waste of time and money. Bhushan Lakhe Senior Vice President Read full review	Cloudera No answers on this topic	IBM No answers on this topic
Implementation Rating	Apache No answers on this topic	Cloudera Try not to change variable names. Incentivized Wonoh Kim Principal Software Engineer Read full review	IBM No answers on this topic
Alternatives Considered	Apache Not used any other product than Hadoop and I don't think our company will switch to any other product, as Hadoop is providing excellent results. Our company is growing rapidly, Hadoop helps to keep up our performance and meet customer expectations. We also use HDFS which provides very high bandwidth to support MapReduce workloads. Incentivized Verified User Anonymous Read full review	Cloudera We chose [Hortonworks Data Platform] because it's free and because [it] was an IBM partner, suggested as big data platform after biginsights platform. You can install in more physical computer without high specs, then you can use it in order to learn how to deploy, configure a complete big data cluster. We installed also in a cloud infrastructure of 5 virtual machine Incentivized Andrea Bardone Project List 2018 - 2012 Read full review	IBM MS SQL Server was ruled out given we didn't feel we could collapse environments. We thought of MS-SQL as more of a one for one replacement for Sybase ASE, i.e., server for server. SAP HANA was evaluated and given a big thumbs up but was rejected because the SQL would have to be rewritten at the time (now they have an accelerator so you don't have to). Also, there was a very low adoption rate within the enterprise. IBM DB2 Big SQL was not selected even though technically it achieved high scores, because we could not find readily available talent and low adoption rate within the enterprise (basically no adoption at the time). We ended up selecting Exadata because of the high adoption rate within the enterprise even though technically HANA and Big SQL were superior in our evaluations. Incentivized Gene Baker Vice President, Chief Architect, Development Manager and Software Engineer Read full review
Return on Investment	Apache There are many advantages of Hadoop as first it has made the management and processing of extremely colossal data very easy and has simplified the lives of so many people including me. Hadoop is quite interesting due to its new and improved features plus innovative functions. Incentivized Chantel Moreno Finance & Accounting Professional Read full review	Cloudera It is difficult to have a negative impact, because the required investment is not that high. The big open community behind Hortonworks and related Apache Project makes it easy to put 'the wheel to meet the road' quite quickly. We have seen management meetings where the attendants were impressed by the results achieved with the datalake built on HDP. Incentivized Fernando López Bello Big Data & Cognitive Computing Practice Leader Read full review	IBM better data visibility solid reliability for mission critical data Incentivized JS John Spies Database Administrator Read full review
ScreenShots