Apache Hadoop vs. Cloudera Enterprise Data Hub

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Hadoop
Score 7.3 out of 10
N/A
Hadoop is an open source software from Apache, supporting distributed processing and data storage. Hadoop is popular for its scalability, reliability, and functionality available across commoditized hardware.N/A
Cloudera Enterprise Data Hub
Score 9.0 out of 10
N/A
The Cloudera Enterprise Data Hub powered by SDX is a multifunction analytics solution that supports a range of operational and analytic use cases for enterprises.N/A
Pricing
Apache HadoopCloudera Enterprise Data Hub
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
HadoopCloudera Enterprise Data Hub
Free Trial
NoNo
Free/Freemium Version
YesYes
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Community Pulse
Apache HadoopCloudera Enterprise Data Hub
Considered Both Products
Hadoop
Chose Apache Hadoop
Hadoop being open source, is cheaper to use and do POCs for clients. Cloudera, Hortonworks and MapR also compete to contribute to open source Hadoop and keep their product conceptually similar to Hadoop.
Cloudera Enterprise Data Hub
Chose Cloudera Enterprise Data Hub
It was the first and best Hadoop distribution when we started years ago. But the situation changed now and if given a choice, may end up choosing something else.
Chose Cloudera Enterprise Data Hub
I have used Amazon Elastic Cloud Compute EC2, Windows Azure. But the difference with these products and Cloudera is Amazon and Azure are more costly. But Cloudera is best because of Data sensitivity and privacy. We have all the shareholder activity data for funds that business …
Chose Cloudera Enterprise Data Hub
I have not evaluated any similar products, and in fact, don't know of any direct competitor. Amazon's Redshift has a similar spirit.
Chose Cloudera Enterprise Data Hub
A deep bench of Hadoop experts, major contributions to the Hadoop open source community and a solid head start getting market recognition, skills and awareness across the teams.
Top Pros
Top Cons
Best Alternatives
Apache HadoopCloudera Enterprise Data Hub
Small Businesses

No answers on this topic

Google BigQuery
Google BigQuery
Score 8.6 out of 10
Medium-sized Companies
Cloudera Manager
Cloudera Manager
Score 9.7 out of 10
Snowflake
Snowflake
Score 9.0 out of 10
Enterprises
IBM Analytics Engine
IBM Analytics Engine
Score 8.8 out of 10
Oracle Exadata
Oracle Exadata
Score 8.2 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
Apache HadoopCloudera Enterprise Data Hub
Likelihood to Recommend
8.9
(36 ratings)
9.0
(12 ratings)
Likelihood to Renew
9.6
(8 ratings)
8.2
(7 ratings)
Usability
8.5
(5 ratings)
-
(0 ratings)
Performance
8.0
(1 ratings)
-
(0 ratings)
Support Rating
7.5
(3 ratings)
-
(0 ratings)
Online Training
6.1
(2 ratings)
-
(0 ratings)
User Testimonials
Apache HadoopCloudera Enterprise Data Hub
Likelihood to Recommend
Apache
Altogether, I want to say that Apache Hadoop is well-suited to a larger and unstructured data flow like an aggregation of web traffic or even advertising. I think Apache Hadoop is great when you literally have petabytes of data that need to be stored and processed on an ongoing basis. Also, I would recommend that the software should be supplemented with a faster and interactive database for a better querying service. Lastly, it's very cost-effective so it is good to give it a shot before coming to any conclusion.
Read full review
Cloudera
Cloudera excels at seamless migrations and upgrades.



Cloudera supports self-healing and data center
replacement of failed cloud instances while maintaining the state.



Cloudera is essential to increase or decrease
capacity through the user interface or API.



Cloudera is great at simplifying big data analytics
by providing the technology and tools needed to gain insights from IoT and
connected devices to help monitor and condition our assets.



Cloudera's cybersecurity platform option offers
stronger anomaly detection, visibility, and prevention, as well as faster
behavioral analysis.



Cloudera is beneficial for enabling and utilizing
the platform's machine learning and ad-hoc queries while securely storing,
retrieving, and analyzing any volume of data at scale.
Read full review
Pros
Apache
  • Handles large amounts of unstructured data well, for business level purposes
  • Is a good catchall because of this design, i.e. what does not fit into our vertical tables fits here.
  • Decent for large ETL pipelines and logging free-for-alls because of this, also.
Read full review
Cloudera
  • Excellent management capabilities via Cloudera Manager.
  • Open source and does not restrict our data to be bound by a proprietary format.
  • Offers excellent support for data governance and auditing.
  • Has all the components that would help us build a data hub.
  • Excellent platform support offered by Cloudera.
Read full review
Cons
Apache
  • Less organizational support system. Bugs need to be fixed and outside help take a long time to push updates
  • Not for small data sets
  • Data security needs to be ramped up
  • Failure in NameNode has no replication which takes a lot of time to recover
Read full review
Cloudera
  • Not fully Open Source, couple of components of the distributions are privately owned, meaning with public contributions are not welcome
  • Improvements to Cloudera manager can only be recommended. its very hard to get it done once recommended as the full control is with them.
  • Should make components more aligned to Open Source rather than making it closed sourced.
  • Custom Features of open source software tools supported only by Cloudera are tricky. Cant commit changes to tools like Hue.
  • Improvements to Cluster Management tool is required, which are already available to its competitors.
Read full review
Likelihood to Renew
Apache
Hadoop is organization-independent and can be used for various purposes ranging from archiving to reporting and can make use of economic, commodity hardware. There is also a lot of saving in terms of licensing costs - since most of the Hadoop ecosystem is available as open-source and is free
Read full review
Cloudera
Likely to renew the use in case the requirements for Cloudera remain valid. The rapid change in customer requirements and solutions that must be validated, integrated or tested changes. As the maturity of the solution increases, the requirements to renew use decrease. From a solution feature perspective by itself would probably grade 10.
Read full review
Usability
Apache
Great! Hadoop has an easy to use interface that mimics most other data warehouses. You can access your data via SQL and have it display in a terminal before exporting it to your business intelligence platform of choice. Of course, for smaller data sets, you can also export it to Microsoft Excel.
Read full review
Cloudera
No answers on this topic
Support Rating
Apache
We went with a third party for support, i.e., consultant. Had we gone with Azure or Cloudera, we would have obtained support directly from the vendor. my rating is more on the third party we selected and doesn't reflect the overall support available for Hadoop. I think we could have done better in our selection process, however, we were trying to use an already approved vendor within our organization. There is plenty of self-help available for Hadoop online.
Read full review
Cloudera
No answers on this topic
Online Training
Apache
Hadoop is a complex topic and best suited for classrom training. Online training are a waste of time and money.
Read full review
Cloudera
No answers on this topic
Alternatives Considered
Apache
Not used any other product than Hadoop and I don't think our company will switch to any other product, as Hadoop is providing excellent results. Our company is growing rapidly, Hadoop helps to keep up our performance and meet customer expectations. We also use HDFS which provides very high bandwidth to support MapReduce workloads.
Read full review
Cloudera
Cloudera is
compatible with Windows operating systems, and Mac allows cloud-based
deployment, it is also very useful to configure data encryption, guarantee
protocols, and security policies. It also provides integrated auditing and
monitoring capabilities, as well as a control comprehensive data repository for
the enterprise, and ensures vendor compatibility through its open-source
architecture.
Read full review
Return on Investment
Apache
  • There are many advantages of Hadoop as first it has made the management and processing of extremely colossal data very easy and has simplified the lives of so many people including me.
  • Hadoop is quite interesting due to its new and improved features plus innovative functions.
Read full review
Cloudera
  • Cloudera products are the most widely. It is more business friendly as data is more secure. The sensitive data that you operate on is local to you and your project rather than processing this data on Cloud.
  • Cloudera is definitely faster as wait time is reduced if on Cloud.
  • A lot range of products are covered. So it is definitely good for businesses and had good returns on investments.
Read full review
ScreenShots