Hadoop review
Updated March 20, 2015


Pramod Deshmukh | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User

Software Version

2

Modules Used

  • Hadoop Common
  • Hadoop Distributed File System
  • Hadoop MapReduce
  • HBase

Overall Satisfaction with Hadoop

We use Hadoop for our ETL and analytic functions. We stream data, land it on HDFS, and then massage and transform it. We then query this data through the Hive interface. Using Sqoop, we import and export data into and out of the Hadoop ecosystem. We store the data on HDFS in Avro and Parquet file formats.
  • Streaming data and loading to HDFS
  • Load jobs using Oozie and Sqoop for exporting data.
  • Analytic queries using MapReduce, Spark and Hive
  • Speed is one of the improvements we are looking for. We see Spark as an option and we are excited.
  • Fast ETL and real-time streaming data
  • Transformation and loading jobs are orchestrated using Oozie
OLTP is a scenario for which I think Hadoop is less appropriate, but the future will certainly be different.
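
To make the Sqoop export step mentioned above a little more concrete, here is a minimal sketch of how such a command might be assembled from a script. It only builds the command line and does not run anything. The JDBC URL, table name, and HDFS path are hypothetical placeholders and not details of our actual setup, and the helper name `build_sqoop_export` is made up for illustration.

```python
import shlex


def build_sqoop_export(jdbc_url, table, export_dir, num_mappers=4):
    """Return the argv list for a `sqoop export` run (nothing is executed here)."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,            # JDBC connection string of the target database
        "--table", table,                 # destination table in that database
        "--export-dir", export_dir,       # HDFS directory holding the data to export
        "--num-mappers", str(num_mappers),  # number of parallel map tasks
    ]


# All values below are assumed placeholders.
cmd = build_sqoop_export(
    "jdbc:mysql://db.example.com/reports",
    "daily_summary",
    "/data/warehouse/daily_summary",
)
print(shlex.join(cmd))
```

In practice a command like this would more likely be wrapped in an Oozie workflow action than launched by hand, which is how the load jobs in this review are orchestrated.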

Using Hadoop

I am impressed with the improvements and automation compared to what we did before.
