Open source Hadoop: smart choice smart price
January 14, 2025

Open source Hadoop: smart choice smart price

Anonymous | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User

Modules Used

  • Hadoop Common

Overall Satisfaction with Apache Hadoop

We are using the Apache Hadoop to handle the data which is continuously coming from different devices in real time from different geographical location across the globe and then run spark jobs and notebook to ingest the data and process it and then load it other external systems for further processing.

Pros

  • It’s ability to handle magnitude of data is what makes Hadoop a go to open source product
  • It’s open source nature makes if quite configurable
  • Its community support is superb.

Cons

  • It’s set up is quite complex which requires good knowledge of it
  • It’s fine tuning in terms of configuration requires in depth knowledge of the product
  • It’s logging can be improved
  • As it was open source makes it popular choice for handling large chuck of datasets
  • It was free earlier but now it’s licensed but still enterprise is a fine tuned version which makes it easier for new users and administrators to use it
  • Our investment is worth every single penny.
  • Initial cost is more as you might need to hire administrators to setup the cluster and make them in scalable. But once done it’s pretty easy
As Hadoop enterprise licensed version is quite fine tuned and easy to use makes it good choice for Hadoop administrators. It’s scalability and integration with Kerberos is good option for authentication and authorisation.
installation can be improved.

logging can be improved so that it become easier for debugging purposes.
parallel processing of data is achieved easily.
It’s open source nature
it’s community support
its being configurable
its maintenance

Do you think Apache Hadoop delivers good value for the price?

Yes

Are you happy with Apache Hadoop's feature set?

Yes

Did Apache Hadoop live up to sales and marketing promises?

Yes

Did implementation of Apache Hadoop go as expected?

Yes

Would you buy Apache Hadoop again?

Yes

When you have real time data which amounts to massive volumes close to terabytes daily, it’s become quite imperative that we should have a system which can handle it and ingest without losing it. Having Hadoop in place makes our product more robust, its stability comes handy.

The only challenge in running huge clusters is it require huge amount of space and memory for efficient working.

Comments

More Reviews of Hadoop