Hadoop for better economy and efficiency
Updated April 29, 2015

Hadoop for better economy and efficiency

Bhushan Lakhe | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Review Source

Modules Used

  • Hadoop Common
  • Hadoop Distributed File System
  • Hadoop MapReduce
  • Hive, HBase, Sqoop, Flume, Oozie

Overall Satisfaction with Hadoop

Hadoop is used for storing and analyzing log data (logs from warehouse loads or other data processing) as well as storing and retrieving financial data from JD Edwards. It's also planned to be used for archival. Hadoop is used by several departments within our organization. Currently, we are paying a lot of money for hosting historical data and we plan to move that to Hadoop; reducing our storage costs. Also, we got a much better performance out of our Hadoop cluster for processing a large amount of financial data. So, in that senese, Hadoop addressed multiple business problems for us.
  • Hadoop stores and processes unstructured data such as web access logs or logs of data processing very well
  • Hadoop can be effectively used for archiving; providing a very economic, fast, flexible, scalable and reliable way to store data
  • Hadoop can be used to store and process a very large amount of data very fast
  • Security is a piece that's missing from Hadoop - you have to supplement security using Kerberos etc.
  • Hadoop is not easy to learn - there are various modules with little or no documentation
  • Hadoop being open-source, testing, quality control and version control are very difficult
  • We had a large ROI due to improved performance and expedited reporting - our clients were happier and business improved
  • Our storage costs reduced
  • Our infrastructure costs reduced - we used old hardware for our Hadoop cluster
not applicable - I have not evaluated any other products
Hadoop is best suited for warehouse or OLAP processing. It's not suitable for OLTP or small transaction processing

Using Hadoop

50 - Various - IT, business users, vendors
3 - Hadoop Administrator, Java Developer, Hive deveoper
  • Use of HDFS / Hive for storage / analysis of data processing logs
  • Use of HDFS / Hive for storage / analysis of historical financial data
  • Use of HDFS for Archival
  • Archival
  • Reporting
  • ETL
  • Data transfer
  • Staging area
  • Historical reporting
Hadoop is organization-independent and can be used for various purposes ranging from archiving to reporting and can make use of economic, commodity hardware. There is also a lot of saving in terms of licensing costs - since most of the Hadoop ecosystem is available as open-source and is free

Evaluating Hadoop and Competitors

Yes - We replaced 5 Windows based servers by a 10 node CentOS based desktops. Saved a lot on hardware and Windows server licenses
  • Price
  • Product Features
  • Product Usability
Price. We saved a lot of money
I will evaluate the ROI more closely

Hadoop Training

Hadoop is a complex topic and best suited for classrom training. Online training are a waste of time and money.

Hadoop Support

Using Hadoop