Hadoop Reviews & Ratings 2024

Name: What is Hadoop?
Uploaded: 2012-07-14T22:23:27.000Z
Duration: 3 min 7 s
Description: What is Hadoop?

Overview

What is Hadoop?

Hadoop is an open source software from Apache, supporting distributed processing and data storage. Hadoop is popular for its scalability, reliability, and functionality available across commoditized hardware.

Recent Reviews

TrustRadius Insights

December 14, 2023

Hadoop has been widely adopted by organizations for various use cases. One of its key use cases is in storing and analyzing log data, …

Hadoop: A Robust Big Data Platform

9 out of 10

April 11, 2022

Incentivized

Hadoop is being used to solve big data modeling problems in our firm. The corporate analytics team uses Hadoop to perform functions like …

Great enterprise tool for handling large data

9 out of 10

August 17, 2021

Incentivized

Apache Hadoop is one of the most effective and efficient software which has been storing and processing an extremely colossal amount of …

Good tool for unstructured data

9 out of 10

July 21, 2021

Incentivized

Apache Hadoop is an open-source software library that is designed for the collection, storage, and analysis of large amounts of data sets. …

Good solution for storing and processing large data

7 out of 10

May 20, 2021

Incentivized

We use Apache Hadoop to store and process large amounts of data (petabytes per day) across thousands of data pipelines. Hadoop works …

Apache Hadoop Can Save on the Headaches

7 out of 10

January 16, 2021

Incentivized

[Apache Hadoop] is being handled as it is (mostly) intended. For large, unstructured data management from our data flows to include …

Hadoop -- Great Value for What You Pay

7 out of 10

September 21, 2020

Incentivized

It's used organization-wide for older data that's not used as frequently. We use Teradata to warehouse our more recent data, but for data …

Fault Tolerance and High Availablility Made Easy with Hadoop

10 out of 10

September 20, 2020

Incentivized

We are using it within my department to process large sets of data that can't be processed in a timely fashion on a single computer or …

Hadoop vs. Alternatives

8 out of 10

June 05, 2019

Incentivized

It is being used at our Fortune 500 clients. It is great for storage, but it is not well understood by the business. The challenge is that …

Hadoop Review

7 out of 10

May 16, 2018

Incentivized

It is massively being used in our organization for data storage, data backup, and machine learning analytics. Managing vast amounts of …

Great Option for Unstructured Data

10 out of 10

March 28, 2018

Incentivized

Used for Massive data collection, storage, and analytics
Used for MapReduce processes, Hive tables, Spark job input, and for backing up data

Hadoop is pretty Badass

9 out of 10

January 04, 2018

Incentivized

Apache Hadoop is a cost effective solution for storing and managing vast amounts of data efficiently. It is dependable and works even when …

Hadoop: Highly available, scalable and cost effective for big data storage and processing.

8 out of 10

December 13, 2017

Incentivized

Currently, there are two directorates using Hadoop for processing a vast amount of data from various data sources in my organization. …

Hadoop for Justifying Business Decisions with Hard Data

10 out of 10

October 24, 2017

Incentivized

Hadoop has been an amazing development in the world of Big Data. Where relational databases fall short with regard to tuning and …

Hadoop review 2346

9 out of 10

September 22, 2017

Incentivized

Hadoop is used to build a data lake where all enterprise data for my entire company can be stored. With data centralization and …

Hadoop for Big Data

10 out of 10

August 24, 2017

Incentivized

[It was used] As a proof of concept to analyze a huge amount of data. We were building a product to analyze huge data and eventually sell …

Read all reviews

Return to navigation

Product Demos

Installation of Apache Hadoop 2.x or Cloudera CDH5 on Ubuntu | Hadoop Practical Demo

YouTube

Big Data Complete Course and Hadoop Demo Step by Step | Big Data Tutorial for Beginners | Scaler

YouTube

Hadoop Tutorial For Beginners | Apache Hadoop Tutorial For Beginners | Hadoop Tutorial | Simplilearn

YouTube

Return to navigation

Product Details

About
Tech Details
FAQs

What is Hadoop?

Hadoop Video

What is Hadoop?

Hadoop Technical Details

Operating Systems	Unspecified
Mobile Application	No

Frequently Asked Questions

Reviewers rate Data Sources highest, with a score of 8.7.

The most common users of Hadoop are from Enterprises (1,001+ employees).

Return to navigation

Comparisons

View all alternatives

Compare with

Reviews and Ratings

(270)

December 15th 2023

Community Insights

TrustRadius Insights are summaries of user sentiment data from TrustRadius reviews and, when necessary, 3rd-party data sources. Have feedback on this content? Let us know!

Business Problems Solved

Hadoop has been widely adopted by organizations for various use cases. One of its key use cases is in storing and analyzing log data, financial data from systems like JD Edwards, and retail catalog and session data for an omnichannel experience. Users have found that Hadoop's distributed processing capabilities allow for efficient and cost-effective storage and analysis of large amounts of data. It has been particularly helpful in reducing storage costs and improving performance when dealing with massive data sets. Furthermore, Hadoop enables the creation of a consistent data store that can be integrated across platforms, making it easier for different departments within organizations to collect, store, and analyze data. Users have also leveraged Hadoop to gain insights into business data, analyze patterns, and solve big data modeling problems. The user-friendly nature of Hadoop has made it accessible to users who are not necessarily experts in big data technologies. Additionally, Hadoop is utilized for ETL processing, data streaming, transformation, and querying data using Hive. Its ability to serve as a large volume ETL platform and crunching engine for analytical and statistical models has attracted users who were previously reliant on MySQL data warehouses. They have observed faster query performance with Hadoop compared to traditional solutions. Another significant use case for Hadoop is secure storage without high costs. Hadoop efficiently stores and processes large amounts of data, addressing the problem of secure storage without breaking the bank. Moreover, Hadoop enables parallel processing on large datasets, making it a popular choice for data storage, backup, and machine learning analytics. Organizations have found that it helps maintain and process huge amounts of data efficiently while providing high availability, scalability, and cost efficiency. Hadoop's versatility extends beyond commercial applications—it is also used in research computing clusters to complete tasks faster using the MapReduce framework. Finally, the Systems and IT department relies on Hadoop to create data pipelines and consult on potential projects involving Hadoop. Overall, the use cases of Hadoop span across industries and departments, providing valuable solutions for data collection, storage, and analysis.

Attribute Ratings

Reviews

(1-25 of 27)

Sort By *

Companies can't remove reviews or game the system. Here's why

April 11, 2022

Hadoop: A Robust Big Data Platform

Kunal Sonalkar

Data Research Analyst

Southwest Florida Water Management District (Higher Education, 5001-10,000 employees)

Score 9 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Apache Spark

Apache Spark can be considered as an alternative because of its similar capabilities around processing and storing big data. The reason we went with Hadoop was the literature available online and integration capability with platforms like R Studio. The popularity of Hadoop has helped us in debugging issues and solving problems at a faster rate.

August 17, 2021

Great enterprise tool for handling large data

Chantel Moreno

Finance & Accounting Professional

Fagron (Pharmaceuticals, 1001-5000 employees)

Score 9 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Different departments of my organization have been getting the benefit from Apache Hadoop as it serves the purpose of saving lives when large amounts of data is unable to be converted and processed in a timely manner from a node or a simple computer. Hadoop also has an easier process of configuration in a clustered environment. Additionally, from my experience, I have noticed that Hadoop provides great scalability and redundancy. Also, it provides enterprise-level support from a variety of vendors. Lastly, I think that a great positive fact of Hadoop is its horizontal scaling.

July 21, 2021

Good tool for unstructured data

Peter Suter

Senior Software Engineer (GUI)

SIX (Financial Services, 1001-5000 employees)

Score 9 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Azure Data Lake Storage

I feel that this is a highly reliable and scalable solution computing technology that is highly capable of processing large data sets across multiple servers and thousands of machines in a well-defined and distributed manner. Apache Hadoop can automatically scale up the number of servers and machines that are needed to process, store, and analyze data sets. It also handles explosions in data with big data technology. Apache Hadoop is good at handling all node failures as well.

May 20, 2021

Good solution for storing and processing large data

Verified User

Analyst in Marketing

Internet Company, 501-1000 employees

Score 7 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Apache Spark and Google BigQuery

Spark is a good alternative to Hadoop that can have faster querying and processing performance and can offer more flexibility in terms of applications that it can support.

Google BigQuery has also been a great alternative and is especially great in terms of ease of use. The capacity to process data and the speed are great without having to do any settings tuning or optimization. It also doesn't require any on-site hosting, making it a great hands off solution.

January 16, 2021

Apache Hadoop Can Save on the Headaches

Joe Hughes

Senior DevOps Engineer

Simpli.fi (Online Media, 201-500 employees)

Score 7 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

MariaDB - Better to be already in the cloud you will use it for. Issues have improved as it has matured over the year.s
CockroachDB - Not nearly as performant (even out of the box) as Apache Hadoop. More configurations required just to make it work. In memory cacheing is an issue.

September 21, 2020

Hadoop -- Great Value for What You Pay

Blake Baron

Senior Financial Analyst

Lowe's Companies, Inc. (Retail, 10,001+ employees)

Score 7 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Teradata Data Warehouse Appliance, Teradata Database, Amazon Web Services and Google Cloud Datastore

Hadoop utilizes a SQL structure, which is great. You pay less for the services, but it's definitely less of an enterprise-level option and more just a good place to store your seldom-used data. Teradata and AWS are a lot faster in returning queries than Hadoop, but you pay more, of course.

September 20, 2020

Fault Tolerance and High Availablility Made Easy with Hadoop

Gene Baker

Vice President, Chief Architect, Development Manager and Software Engineer

WySTAR Global Retirement Solutions, a Wells Fargo Company (Financial Services, 10,001+ employees)

Score 10 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Cloudera Data Platform, Microsoft SQL Server and Azure Data Lake Storage

Hands down, Hadoop is less expensive than the other platforms we considered. Cloudera was easier to set up but the expense ruled it out. MS-SQL didn't have the performance we saw with the Hadoop clusters and was more expensive. We considered MS-SQL mainly for its ability to support SQL queries in hopes we could leverage the existing codebase. Azure was just more expensive but again was easier to setup. In the end, cost won out because even though the competition was easier to set up, it's not like Hadoop was that much harder to setup.

June 05, 2019

Hadoop vs. Alternatives

Verified User

Executive in Professional Services

Information Services Company, 51-200 employees

Score 8 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

IBM Spectrum Scale

When comparing to the sophistication of IBM GPFS (Spectrum Scale) to Hadoop, it is clear that Spectrum Scale is a much better choice. That is maybe something you don't want to hear, but in all of our research, this has been the final decision of the client.

May 16, 2018

Hadoop Review

Kartik Chavan

Peer Educator (Tutor) & Supplemental Instructions (SI) Leader

The University of Texas at Arlington (Higher Education, 1001-5000 employees)

Score 7 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Apache Spark, Apache Spark MLib, Apache Pig and Amazon Redshift

For real-time streaming, use Spark; can provide a stark contrast to the way MR works
Hadoop offers a scalable, cost-effective and highly available solution for big data storage and processing.
Amazon Redshift is somewhat closer to Hadoop. But to analyze Petabytes of data Hadoop as better performance.
Hadoop is being open source, is cheaper to use and do POCs for client

March 28, 2018

Great Option for Unstructured Data

Bharadwaj (Brad) Chivukula

Sr. Engineering Manager/Delivery Manager

Nisum Technologies, Inc. (Retail, 10,001+ employees)

Score 10 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Apache Spark

For real-time streaming, use Spark; can provide a stark contrast to the way MR works
Use Hive for querying purposes

December 13, 2017

Hadoop: Highly available, scalable and cost effective for big data storage and processing.

Johanes Siregar

Big Data Analytics - Data Engineer

Telkomsel (Telecommunications, 1001-5000 employees)

Score 8 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Teradata Database, Amazon Elastic MapReduce and Elastic Grid

Hadoop offers a scalable, cost-effective and highly available solution for big data storage and processing. The use of a non-proprietary physical layer greatly reduces dependency on technology. It also offers elastic dimensioning capability when deployed on virtual machines or even on IAAS cloud. The main challenge, however, is to manage user access and to maintain security.

October 24, 2017

Hadoop for Justifying Business Decisions with Hard Data

Verified User

Engineer in Engineering

Telecommunications Company, 51-200 employees

Score 10 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

I haven't worked with other Big Data aggregation services like Hadoop. As far as I know, Hadoop is the leading choice in this field with good cause. There is a lot of community support, custom modules, paid consultants, free and paid training. All this makes it an ideal choice for facilitating Big Data aggregation.

September 22, 2017

Hadoop review 2346

Gyan Dwibedy

Chief Data & Analytic Officer

Molina Healthcare (Hospital & Health Care, 10,001+ employees)

Score 9 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

No SQL database were evaluated along with MPP platform. Hadoop performs very well compared to the other platforms. Also since lot of investment goes into Hadoop there is a good chance of getting what one needs from the developer community.

August 24, 2017

Hadoop for Big Data

Vinay Suneja

Senior Consultant Level II

Protiviti (Utilities, 201-500 employees)

Score 10 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Amazon Redshift is some what closer to Hadoop. But to analyze Petabytes of data Hadoop as better performance.

June 03, 2016

A newbie's look at Hadoop

Mark Gargiulo

Senior Automation Engineer

NTENT (Computer Software, 51-200 employees)

Score 8 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

As I am new to the hadoop ecosystem I have not used or evaluated any other similar products at this time. This was handed to me from a previous much older installation that was very under utilized. Our new platform will be working the new cluster much harder with jobs that run indefinitely. I'm not sure that any of the other "big data" technologies out there have as many certified components or work with such a diverse collection but as I said I am pretty new to this and so have only tertiary knowledge of competing products.

May 26, 2016

Experience with Hadoop by a novice user.

Muhammad Fazalul Rahman

Research Assistant

Rochester Institute of Technology (Research, 1-10 employees)

Score 7 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

MapReduce and Amazon Elastic MapReduce

Hadoop was a cheaper alternative to Amazon. Since I had to pay for every minute I use with Amazon, I had to make sure multiple times that the code was good enough before I purchased with Amazon. But since Hadoop was available on the cluster, I had the opportunity to code on the way.

February 16, 2016

Apache Hadoop is the best open source product I used.

Piyush Routray

Senior Software Developer

Nvent (Information Technology and Services, 11-50 employees)

Score 9 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Cloudera and Hortonworks Data Platform

Hadoop being open source, is cheaper to use and do POCs for clients. Cloudera, Hortonworks and MapR also compete to contribute to open source Hadoop and keep their product conceptually similar to Hadoop.

February 14, 2016

Hadoop an awesome tool for large scale batch processing.

Tushar Kulkarni

Employee in Research & Development

Student (Computer Hardware, 51-200 employees)

Score 10 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Apache Spark and Apache Flink

Apache Spark has an in memory processing model, making it powerful for lightning fast data processing. Apache Spark also exposes Scala and Python in APIs which is one of the most commonly used programming languages in data analytic and data processing domains.

December 09, 2015

Hadoop - best data optimization for the Enterprise

Verified User

Engineer in Engineering

Computer Software Company, 51-200 employees

Score 9 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Not used any other product than Hadoop and I don't think our company will switch to any other product, as Hadoop is providing excellent results. Our company is growing rapidly, Hadoop helps to keep up our performance and meet customer expectations. We also use HDFS which provides very high bandwidth to support MapReduce workloads.

December 01, 2015

Hadoop - Effective tool for large scale distributed processing.

Mrugen Deshmukh

Senior Software Engineer

San Jose State University (Computer Software, 51-200 employees)

Score 8 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

MongoDB and Neo4j

Hadoop provides storage for large data sets and a powerful processing model to crunch and transform huge amounts of data. It does not assume the underlying hardware or infrastructure and enables the users to build data processing infrastructure from commodity hardware. All the modules in Hadoop are designed with a fundamental assumption that hardware failures are commonplace and thus should be automatically handled in software by the framework, relieving the developers from handling every edge scenario that can occur in a large distributed system.
Hadoop can be deployed in a traditional onsite datacenter as well as in the cloud. The cloud allows organizations to deploy Hadoop without hardware to acquire or a specific setup expertise. Many vendors who currently have an offer for the cloud include Microsoft, Amazon and Google.

December 01, 2015

Hadoop the solution to big data problems

Sudhakar Kamanboina

Software Engineer

VMware (Computer Software, 10,001+ employees)

Score 10 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Splunk

Hadoop has a master slave architecture and comes with more features than Splunk.

December 01, 2015

Fast and Reliable, Use Hadoop!

Gaurav Kasliwal

Software Development Engineer

Cisco (Computer Software, 11-50 employees)

Score 10 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Tableau Public, Azure and Cloudera

Fast and scalable. More reliable as compared to the other products I have used.

December 01, 2015

From the experience of a naive developer!

Verified User

Engineer in Engineering

Internet Company, 10,001+ employees

Score 9 out of 10

Vetted Review

Verified User

Incentivized

Alternatives Considered

Nope

Processing of big data has been the ultimate need for the me choosing Hadoop. Big data is massive and messy, and it’s coming at you uncontrolled. Data are gathered to be analyzed to discover patterns and correlations that could not be initially apparent, but might be useful in making business decisions in an organization. These data are often personal data, which are useful from a marketing viewpoint to understand the desires and demands of potential customers and in analyzing and predicting their buying tendencies.

I think Hadoop processes it very efficiently.

November 11, 2015