Overall Satisfaction with Hadoop
Hadoop is used for storing and analyzing log data (logs from warehouse loads or other data processing) as well as storing and retrieving financial data from JD Edwards. It's also planned to be used for archival. Hadoop is used by several departments within our organization. Currently, we are paying a lot of money for hosting historical data and we plan to move that to Hadoop; reducing our storage costs. Also, we got a much better performance out of our Hadoop cluster for processing a large amount of financial data. So, in that senese, Hadoop addressed multiple business problems for us.
- Hadoop stores and processes unstructured data such as web access logs or logs of data processing very well
- Hadoop can be effectively used for archiving; providing a very economic, fast, flexible, scalable and reliable way to store data
- Hadoop can be used to store and process a very large amount of data very fast
- Security is a piece that's missing from Hadoop - you have to supplement security using Kerberos etc.
- Hadoop is not easy to learn - there are various modules with little or no documentation
- Hadoop being open-source, testing, quality control and version control are very difficult
- We had a large ROI due to improved performance and expedited reporting - our clients were happier and business improved
- Our storage costs reduced
- Our infrastructure costs reduced - we used old hardware for our Hadoop cluster
not applicable - I have not evaluated any other products
Using Hadoop
50 - Various - IT, business users, vendors
3 - Hadoop Administrator, Java Developer, Hive deveoper
- Use of HDFS / Hive for storage / analysis of data processing logs
- Use of HDFS / Hive for storage / analysis of historical financial data
- Use of HDFS for Archival
- Archival
- Reporting
- ETL
- Data transfer
- Staging area
- Historical reporting
Evaluating Hadoop and Competitors
Yes - We replaced 5 Windows based servers by a 10 node CentOS based desktops. Saved a lot on hardware and Windows server licenses
- Price
- Product Features
- Product Usability
Price. We saved a lot of money
I will evaluate the ROI more closely