What users are saying about
245 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>Score 8.4 out of 100
Based on 245 reviews and ratings
Top Rated
153 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>Score 8.5 out of 100
Based on 153 reviews and ratings
Likelihood to Recommend
Hadoop
Apache Hadoop (and its subsequent add-ons) are well-suited to larger, unstructured data flows, such as aggregation of web traffic or advertising. Geospatial algorithms and their outputs are well-suited for this kind of aggregation as structuring that data is challenging, but leaving it unstructured and performing queries as-needed is a better fit for most business models. With the advent of data science, I would expect Hadoop fits a LOT of their initial outputs quite well.
Senior DevOps Engineer
Simpli.fiOnline Media, 201-500 employees
Amazon Redshift
If the number of connections is expected to be low, but the amounts of data are large or projected to grow it is a good solutions especially if there is previous exposure to PostgreSQL. Speaking of Postgres, Redshift is based on several versions old releases of PostgreSQL so the developers would not be able to take advantage of some of the newer SQL language features. The queries need some fine-tuning still, indexing is not provided, but playing with sorting keys becomes necessary. Lastly, there is no notion of the Primary Key in Redshift so the business must be prepared to explain why duplication occurred (must be vigilant for)
Senior Business Intelligence Consultant
SermoResearch, 201-500 employees
Pros
Hadoop
- HDFS is reliable and solid, and in my experience with it, there are very few problems using it
- Enterprise support from different vendors makes it easier to 'sell' inside an enterprise
- It provides High Scalability and Redundancy
- Horizontal scaling and distributed architecture
Sr. Engineering Manager/Delivery Manager
Nisum Technologies, Inc.Retail, 10,001+ employees
Amazon Redshift
- Redshift is fully managed. Small teams do not have the resources to maintain a cluster. CloudWatch metrics are provided out-of-the-box, and it is easy to configure alarms.
- Redshift's console allows you to easily inspect and manage queries, and manage the performance of the cluster.
- Redshift is ubiquitous; many products (e.g., ETL services) integrate with it out-of-the-box.
- Writing .csvs to S3 and querying them through Redshift Spectrum is convenient.
Data Scientist
Wonder (AskWonder.com)Research, 11-50 employees
Cons
Hadoop
- Hadoop is a batch oriented processing framework, it lacks real time or stream processing.
- Hadoop's HDFS file system is not a POSIX compliant file system and does not work well with small files, especially smaller than the default block size.
- Hadoop cannot be used for running interactive jobs or analytics.
Senior Software Engineer
San Jose State UniversityComputer Software, 51-200 employees
Amazon Redshift
- It could benefit from adding data integrity and programming tools common to other database management systems.
- Amazon Redshift is based on PostgreSQL 8.0.2. That version of PostgreSQL was released in December 2006. While PostgreSQL was much improved since then, the new features were not implemented in Redshift. Many basic features are missing from it.
- Primary keys can be declared but not enforced. Referential integrity (foreign keys) can be declared but not enforced. UNIQUE and CHECK constraints are not supported and cannot be declared.
- IDENTITY can be declared on a column, and Redshift will put unique values into it. However: IDENTITY values in the newly inserted rows won’t be incremental or sequential. To implement a sequential number, you need to write your own custom code.
- There are no stored procedures in Redshift. We are writing SQL script files, and then parsing and running them one statement at a time from a Python program. This also enabled us to implement execution-time error logging.
- In SQL scripts, to check for the row count of affected rows, a complicated join query against some system tables or views has to be executed.
- Data Control Language (DCL) does not exist. No statements like IF, WHILE, DO, RAISERROR, etc.
- On performance of views… Views do not “pass-through” a query parameter which is a potential problem for performance.
- When selecting against a view with the WHERE clause outside of the view, the inner query of the view will be executed first without consideration for the WHERE clause, and only then the WHERE clause will be applied.
- Certain clauses of SQL work many times faster than other clauses. So be careful and test your statements for performance earlier rather than later, especially if working with a large data set.
- There was a situation when DELETE FROM JOIN was unacceptably slow. Replacing JOIN with the USING clause made DELETE instantaneous.
Principal Data Architect
IntuitComputer Software, 5001-10,000 employees
Likelihood to Renew
Hadoop
Hadoop 9.6
Based on 8 answers
Hadoop is organization-independent and can be used for various purposes ranging from archiving to reporting and can make use of economic, commodity hardware. There is also a lot of saving in terms of licensing costs - since most of the Hadoop ecosystem is available as open-source and is free
Senior Vice President
IpsosInformation Technology and Services, 10,001+ employees
Amazon Redshift
No score
No answers yet
No answers on this topic
Usability
Hadoop
Hadoop 8.5
Based on 5 answers
Great! Hadoop has an easy to use interface that mimics most other data warehouses. You can access your data via SQL and have it display in a terminal before exporting it to your business intelligence platform of choice. Of course, for smaller data sets, you can also export it to Microsoft Excel.
Senior Financial Analyst
Lowe's Companies, Inc.Retail, 10,001+ employees
Amazon Redshift
Amazon Redshift 8.5
Based on 8 answers
Just very happy with the product, it fits our needs perfectly. Amazon pioneered the cloud and we have had a positive experience using RedShift. Really cool to be able to see your data housed and to be able to query and perform administrative tasks with ease.
Senior Developer
American Board of Internal MedicineNon-Profit Organization Management, 201-500 employees
Support Rating
Hadoop
Hadoop 6.9
Based on 6 answers
We went with a third party for support, i.e., consultant. Had we gone with Azure or Cloudera, we would have obtained support directly from the vendor. my rating is more on the third party we selected and doesn't reflect the overall support available for Hadoop. I think we could have done better in our selection process, however, we were trying to use an already approved vendor within our organization. There is plenty of self-help available for Hadoop online.
Vice President, Chief Architect, Development Manager and Software Engineer
WySTAR Global Retirement Solutions, a Wells Fargo CompanyFinancial Services, 10,001+ employees
Amazon Redshift
Amazon Redshift 7.5
Based on 11 answers
The support was great and helped us in a timely fashion. We did use a lot of online forums as well, but the official documentation was an ongoing one, and it did take more time for us to look through it. We would have probably chosen a competitor product had it not been for the great support

Verified User
Engineer in Engineering
Financial Services Company, 5001-10,000 employeesOnline Training
Hadoop
Hadoop 6.1
Based on 2 answers
Hadoop is a complex topic and best suited for classrom training. Online training are a waste of time and money.
Senior Vice President
IpsosInformation Technology and Services, 10,001+ employees
Amazon Redshift
No score
No answers yet
No answers on this topic
Alternatives Considered
Hadoop
Not used any other product than Hadoop and I don't think our company will switch to any other product, as Hadoop is providing excellent results. Our company is growing rapidly, Hadoop helps to keep up our performance and meet customer expectations. We also use HDFS which provides very high bandwidth to support MapReduce workloads.

Verified User
Engineer in Engineering
Computer Software Company, 51-200 employeesAmazon Redshift
Than Vertica: Redshift is cheaper and AWS integrated (which was a plus because the whole company was on AWS).
Than BigQuery: Redshift has a standard SQL interface, though recently I heard good things about BigQuery and would try it out again.
Than Hive: Hive is great if you are in the PB+ range, but latencies tend to be much slower than Redshift and it is not suited for ad-hoc applications.
Than BigQuery: Redshift has a standard SQL interface, though recently I heard good things about BigQuery and would try it out again.
Than Hive: Hive is great if you are in the PB+ range, but latencies tend to be much slower than Redshift and it is not suited for ad-hoc applications.

Verified User
Director in Engineering
Computer Software Company, 1001-5000 employeesReturn on Investment
Hadoop
- Hadoop has allowed us to scale out a few of our tier-1, customer facing applications to provide very fast access to reports and analytics.
- Hadoop was easily implemented by our Linux team and onboarded by our Hadoop Admins.
- Hadoop has been a very stable platform and only goes down due to server patching or other maintenance.
Senior Network Administrator
Vizient, Inc.Hospital & Health Care, 1001-5000 employees
Amazon Redshift
- Redshift has had a very positive impact on our business. It has been used to provide analytics on marketing campaigns to boost revenue.
- Redshift is instrumental in our payment collection business processes. It powers everything from who gets called to who gets sent collection emails.
Software Engineer
Stansberry ResearchPublishing, 51-200 employees
Pricing Details
Hadoop
General
Free Trial
—Free/Freemium Version
Yes
Premium Consulting/Integration Services
—Entry-level set up fee?
No
Hadoop Editions & Modules
—
Additional Pricing Details
—Amazon Redshift
General
Free Trial
—Free/Freemium Version
—Premium Consulting/Integration Services
—Entry-level set up fee?
No
Amazon Redshift Editions & Modules
Edition
Current Generation | $0.25 - $13.041 |
---|---|
Previous Generation | $0.25 - $4.081 |
Redshift Spectrum | $5.002 |
Redshift Managed Storage | $0.243 |
- per hour
- per terabyte of data scanned
- per GB per month