A brief Cloudera Review
July 19, 2017

A brief Cloudera Review

Anonymous | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Review Source

Software Version

Enterprise

Overall Satisfaction with Cloudera Enterprise

We used Cloudera to implement our big data service platform. Our team manages a central Hadoop platform where various departments were subscribed in a pay as you go model where they used it to solve various BI and analytical problems. On a day to day basis application teams used it for running Mapreduce/Tez/Spark jobs against their data set which is stored in HDFS.
  • One of the oldest distributors of enterprise standard Hadoop.
  • Distribution is based on open source Hadoop even though customizations are done on top of that.
  • Faster updates and bug fixes to the products as they have Apache committers.
  • Central configuration and control of your Hadoop platform (but still needs improvements).
  • Not fully Open Source, couple of components of the distributions are privately owned, meaning with public contributions are not welcome
  • Improvements to Cloudera manager can only be recommended. its very hard to get it done once recommended as the full control is with them.
  • Should make components more aligned to Open Source rather than making it closed sourced.
  • Custom Features of open source software tools supported only by Cloudera are tricky. Cant commit changes to tools like Hue.
  • Improvements to Cluster Management tool is required, which are already available to its competitors.
  • Improvements request would take a while.
  • Fast bug fixes and stability in releases are great.
  • Migration to other platforms would not be that easy if you start using proprietary tools.
It was the first and best Hadoop distribution when we started years ago. But the situation changed now and if given a choice, may end up choosing something else.
I personally started my big data career by learning Cloudera Distribution and obtaining certification years ago. Comparatively, it was the only distribution worth mentioning back then, but competitors may have outgrown them as of now, the rate at which Cloudera advances in terms of quality of the product and services could be at a lower pace than they used to be. Cluster Management Interface used to be of high quality. Cloudera never supported distribution on a Windows platform where competitors do. I am not a big fan of privately owned repos mixed with open source tools. I will try to stay away from those as that may create a dependency later on.