Cloudera Products, one of the best available in Big Data market
May 14, 2014

Cloudera Products, one of the best available in Big Data market

Anonymous | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Review Source

Software Version

Enterprise

Overall Satisfaction with Cloudera Enterprise

I used the Cloudera Enterprise in my previous company wherein I had to used the Cloudera Impala product. I have also used Cloudera Hadoop, Hive, Sqoop, Pig, Mahout etc. Since my project had requirement for trying different tools available in market, we were doing research on which tool is the best, thats where we landed on Cloudera Enterprise products. It is used only in my department. To address the shareholder data in files from different suppliers which needs to be processed via MapReduce framework.
  • Wide range of Products available on Cloudera which runs on MapReduce framework.
  • Easy to install on Virtual Machines. All the products are available on the website and steps for installation are easily available and well documented with examples and case scenarios.
  • All the products with latest versions are available which can be easily installed via yum.
  • Cloudera Impala which the fastest way to query the HDFS without using MapReduce framework. This is freely available on their website while with Amazon EC2 machines and Windows Azure you have to shell some money for using their machine.
  • Some of the previous versions have compatibility issues with latest CentOS Virtual machines. So you have to take care of the Cloudera product's versions as well as CentOS version.
  • Compatibility issues with installation on Mac. Everything is Linux based so user has to have good knowledge of Linux commands.
  • GUI's are missing
  • I ended up spending time on matching the exact Cloudera product version with CentOS version. So had invested a lot of time in installtion/uninstallation of these products until u match/find the exact version. I had issues while installing Impala maybe it was new when i had a chance to work on Cloudera Impala.
  • Cloudera products are the most widely. It is more business friendly as data is more secure. The sensitive data that you operate on is local to you and your project rather than processing this data on Cloud.
  • Cloudera is definitely faster as wait time is reduced if on Cloud.
  • A lot range of products are covered. So it is definitely good for businesses and had good returns on investments.
I have used Amazon Elastic Cloud Compute EC2, Windows Azure. But the difference with these products and Cloudera is Amazon and Azure are more costly. But Cloudera is best because of Data sensitivity and privacy. We have all the shareholder activity data for funds that business uses after applying business rules on that data. Hence sensitivity is the most useful factor which Cloudera products provide us.
Cloudera is expanding with varied range of products and it looks to me as a challenging company and very competitive in the Big Data market. I would recommend to using the latest stable version as new features are added with each version and different bugs are fixed. As Big data is a constantly evolving so trying new Cloudera products helps businesses to stay competitive in the market domain as every big organization is now trying to move to Big Data.
It is well suited when you locally want all the Cloudera products (MapReduce/Hive/Pig/Impala/Sqoop etc) on your machine. The best thing it is free. Though Amazon EC2 and Windows Azure Virtual machines which have these products are installed are more friendly but you have to shell a lot of money on hourly basis.