Best commercial Hadoop distribution product
October 02, 2017

Anonymous | TrustRadius Reviewer
Score 9 out of 10

Overall Satisfaction with Hortonworks Data Platform

We use our Hortonworks Hadoop cluster to process web and email logs. It has enabled us to process huge volumes of data much more quickly. It is widely used across the company!
  • HDP is the closest to an open source platform you can get in the Hadoop ecosystem, with a wider choice of tools than any other distribution. The convenience of the Ambari UI and API for building, deploying, and managing the cluster makes it relatively easy to get started.
  • With YARN and Spark you can mix different nodes for storage and compute, with master nodes to manage loads.
  • The Tez engine, and the Hortonworks Sandbox, which can be installed for learning and development purposes.
  • Version upgrades are more challenging than anticipated. Each upgrade has its own quirks and compatibility issues that need to be resolved manually.
  • Real-time analytics tools such as Impala are unavailable.
  • Monitoring isn't that great. The Ambari management interface on HDP is basic and lacks rich features.
  • Enterprise support cost per node is lower than for CDH.
  • Employs committers of top Apache projects (HDFS, YARN, HBase, Hive, Pig, etc.).
  • Version upgrades require manual work.
  • Cloudera
With its great performance and other benefits, we eventually moved from Cloudera to the Hortonworks platform.

Hortonworks provides a framework built from open source projects, which is good for any open source lover. Its great tutorials make it easy to get started.