What users are saying about
245 Ratings
Score 8.4 out of 10
Top Rated
153 Ratings
Score 8.5 out of 10

Likelihood to Recommend

Hadoop

Apache Hadoop (and its subsequent add-ons) is well-suited to larger, unstructured data flows, such as the aggregation of web traffic or advertising. Geospatial algorithms and their outputs are well-suited to this kind of aggregation: structuring that data is challenging, and leaving it unstructured and querying it as needed is a better fit for most business models. With the advent of data science, I would expect Hadoop to fit a LOT of its initial outputs quite well.
Joe Hughes | TrustRadius Reviewer
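
The idea of leaving traffic data unstructured and aggregating it only when a question comes up can be sketched in a few lines. This is a plain-Python imitation of the map and reduce steps, not the Hadoop API; the log layout, the sample lines, and the function names are all assumptions made for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical raw access-log lines; the four-field layout is an assumption.
RAW_LOG = [
    "2024-01-01T00:00:01 GET /home 200",
    "2024-01-01T00:00:02 GET /ads/banner 200",
    "malformed line",
    "2024-01-01T00:00:03 GET /home 404",
]


def map_hits(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: interpret a raw line at read time and emit (path, 1)."""
    parts = line.split()
    if len(parts) == 4:  # skip lines that do not match the assumed layout
        yield parts[2], 1


def reduce_counts(pairs: Iterable[Tuple[str, int]]) -> Dict[str, int]:
    """Reduce step: sum the counts emitted for each key."""
    totals: Dict[str, int] = defaultdict(int)
    for key, count in pairs:
        totals[key] += count
    return dict(totals)


def hits_per_path(lines: List[str]) -> Dict[str, int]:
    """Chain the two steps: no schema is imposed until the query runs."""
    return reduce_counts(pair for line in lines for pair in map_hits(line))
```

Calling `hits_per_path(RAW_LOG)` counts requests per path and ignores the malformed line. A different question (say, hits per status code) would only need a different map function over the same raw lines.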

Amazon Redshift

If the number of connections is expected to be low but the amount of data is large or projected to grow, it is a good solution, especially if there is previous exposure to PostgreSQL. Speaking of Postgres, Redshift is based on a release of PostgreSQL that is several versions old, so developers cannot take advantage of some of the newer SQL language features. Queries still need some fine-tuning: indexing is not provided, so experimenting with sort keys becomes necessary. Lastly, there is no enforced notion of a primary key in Redshift, so the business must stay vigilant for duplicate rows and be prepared to explain why duplication occurred.
Arthur Zubarev | TrustRadius Reviewer
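
Because key uniqueness is not enforced, a periodic duplicate check is one way to stay vigilant. The sketch below uses Python's built-in `sqlite3` module purely as a stand-in for a warehouse connection; the `orders` table and its columns are hypothetical. The `GROUP BY ... HAVING` query itself is standard SQL and should carry over, but that is an assumption to verify against your own cluster.

```python
import sqlite3

# Standard SQL: list every key value that appears more than once.
DUPLICATE_CHECK = """
SELECT order_id, COUNT(*) AS copies
FROM orders
GROUP BY order_id
HAVING COUNT(*) > 1
ORDER BY order_id
"""


def find_duplicates(conn):
    """Return (key, copies) for each key that occurs more than once."""
    return conn.execute(DUPLICATE_CHECK).fetchall()


def demo():
    """Build a hypothetical table with no enforced key and check it."""
    conn = sqlite3.connect(":memory:")
    # No PRIMARY KEY is declared, so the stand-in database accepts duplicates too.
    conn.execute("CREATE TABLE orders (order_id INTEGER, amount REAL)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        [(1, 9.5), (2, 3.0), (2, 3.0), (3, 7.25), (3, 7.25), (3, 7.25)],
    )
    return find_duplicates(conn)
```

Here `demo()` reports that key 2 appears twice and key 3 three times.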

Pros

Hadoop

  • HDFS is reliable and solid, and in my experience with it, there are very few problems using it
  • Enterprise support from different vendors makes it easier to 'sell' inside an enterprise
  • It provides High Scalability and Redundancy
  • Horizontal scaling and distributed architecture
Bharadwaj (Brad) Chivukula | TrustRadius Reviewer

Amazon Redshift

  • Redshift is fully managed. Small teams do not have the resources to maintain a cluster. CloudWatch metrics are provided out-of-the-box, and it is easy to configure alarms.
  • Redshift's console allows you to easily inspect and manage queries, and manage the performance of the cluster.
  • Redshift is ubiquitous; many products (e.g., ETL services) integrate with it out-of-the-box.
  • Writing .csvs to S3 and querying them through Redshift Spectrum is convenient.
Gavin Hackeling | TrustRadius Reviewer

Cons

Hadoop

  • Hadoop is a batch-oriented processing framework; it lacks real-time or stream processing.
  • HDFS is not a POSIX-compliant file system and does not work well with small files, especially files smaller than the default block size.
  • Hadoop cannot be used for running interactive jobs or analytics.
Mrugen Deshmukh | TrustRadius Reviewer
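
The small-files point can be made concrete with some rough arithmetic. The sketch below assumes the 128 MiB default block size of Hadoop 2.x and later, non-empty files, and the commonly cited rule of thumb of roughly 150 bytes of NameNode memory per file or block object; that last figure is an approximation, not a guarantee, and the function names are hypothetical.

```python
import math

BLOCK_SIZE = 128 * 1024 * 1024  # default dfs.blocksize in Hadoop 2.x and later
BYTES_PER_OBJECT = 150          # rule-of-thumb NameNode cost per object (assumption)


def namenode_objects(file_sizes):
    """Count metadata objects: one per file plus one per block it occupies."""
    return sum(1 + math.ceil(size / BLOCK_SIZE) for size in file_sizes)


def namenode_bytes(file_sizes):
    """Rough NameNode memory estimate for the given non-empty files."""
    return namenode_objects(file_sizes) * BYTES_PER_OBJECT


one_big_file = [1024 ** 3]               # 1 GiB stored as a single file
many_small_files = [1024 ** 2] * 1024    # the same 1 GiB as 1,024 files of 1 MiB
```

Under these assumptions the single file costs 9 metadata objects (1 file plus 8 blocks), while the same data split into 1 MiB files costs 2,048, which is why many small files strain the NameNode long before disk space runs out.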

Amazon Redshift

  • It could benefit from adding data integrity and programming tools common to other database management systems.
  • Amazon Redshift is based on PostgreSQL 8.0.2, a version released in 2005. PostgreSQL has improved considerably since then, but the new features were not implemented in Redshift, and many basic features are missing from it.
  • Primary keys can be declared but not enforced. Referential integrity (foreign keys) can be declared but not enforced. UNIQUE and CHECK constraints are not supported and cannot be declared.
  • IDENTITY can be declared on a column, and Redshift will put unique values into it. However: IDENTITY values in the newly inserted rows won’t be incremental or sequential. To implement a sequential number, you need to write your own custom code.
  • There are no stored procedures in Redshift. We are writing SQL script files, and then parsing and running them one statement at a time from a Python program. This also enabled us to implement execution-time error logging.
  • In SQL scripts, to check for the row count of affected rows, a complicated join query against some system tables or views has to be executed.
  • A control-flow (procedural) language does not exist: there are no statements like IF, WHILE, DO, RAISERROR, etc.
  • On performance of views… Views do not “pass-through” a query parameter which is a potential problem for performance.
  • When selecting against a view with the WHERE clause outside of the view, the inner query of the view will be executed first without consideration for the WHERE clause, and only then the WHERE clause will be applied.
  • Certain clauses of SQL work many times faster than other clauses. So be careful and test your statements for performance earlier rather than later, especially if working with a large data set.
  • There was a situation when DELETE FROM JOIN was unacceptably slow. Replacing JOIN with the USING clause made DELETE instantaneous.
Michael Romm | TrustRadius Reviewer
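
The workaround described above (parsing a SQL script and running it one statement at a time from Python, with error logging) might look roughly like the sketch below. It is not the reviewer's code. It uses the standard DB-API cursor interface with `sqlite3` as a stand-in connection; against Redshift you would pass a connection from a PostgreSQL-compatible driver instead, which is an assumption here. The splitter is deliberately naive: it handles semicolons inside single-quoted strings but not SQL comments.

```python
import logging
import sqlite3

logger = logging.getLogger("sql_runner")


def split_statements(script):
    """Split on semicolons that fall outside single-quoted strings."""
    statements, current, in_string = [], [], False
    for ch in script:
        if ch == "'":
            in_string = not in_string  # a doubled '' toggles twice, so state is kept
        if ch == ";" and not in_string:
            stmt = "".join(current).strip()
            if stmt:
                statements.append(stmt)
            current = []
        else:
            current.append(ch)
    tail = "".join(current).strip()
    if tail:
        statements.append(tail)
    return statements


def run_script(conn, script):
    """Run statements in order; return (succeeded, failed_statement_number).

    On the first failure the error is logged with its statement number,
    the open transaction is rolled back, and execution stops.
    """
    cursor = conn.cursor()
    succeeded = 0
    for number, stmt in enumerate(split_statements(script), start=1):
        try:
            cursor.execute(stmt)
        except Exception:
            logger.exception("statement %d failed: %s", number, stmt)
            conn.rollback()
            return succeeded, number
        # DB-API cursors expose the affected row count after each statement.
        logger.info("statement %d ok, rowcount=%s", number, cursor.rowcount)
        succeeded += 1
    conn.commit()
    return succeeded, None
```

Running each statement separately is what makes it possible to log exactly which statement failed, and `cursor.rowcount` gives a per-statement affected-row count without querying system tables, where the driver supports it.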

Likelihood to Renew

Hadoop

Hadoop 9.6
Based on 8 answers
Hadoop is organization-independent and can be used for various purposes ranging from archiving to reporting, and it can make use of economical commodity hardware. There are also considerable savings in licensing costs, since most of the Hadoop ecosystem is available as open source and is free.
Bhushan Lakhe | TrustRadius Reviewer

Amazon Redshift

No score
No answers on this topic

Usability

Hadoop

Hadoop 8.5
Based on 5 answers
Great! Hadoop has an easy to use interface that mimics most other data warehouses. You can access your data via SQL and have it display in a terminal before exporting it to your business intelligence platform of choice. Of course, for smaller data sets, you can also export it to Microsoft Excel.
Blake Baron | TrustRadius Reviewer

Amazon Redshift

Amazon Redshift 8.5
Based on 8 answers
Just very happy with the product; it fits our needs perfectly. Amazon pioneered the cloud and we have had a positive experience using Redshift. Really cool to be able to see your data housed and to be able to query and perform administrative tasks with ease.
Brendan McKenna | TrustRadius Reviewer

Support Rating

Hadoop

Hadoop 6.9
Based on 6 answers
We went with a third party for support, i.e., a consultant. Had we gone with Azure or Cloudera, we would have obtained support directly from the vendor. My rating is more about the third party we selected and doesn't reflect the overall support available for Hadoop. I think we could have done better in our selection process; however, we were trying to use an already approved vendor within our organization. There is plenty of self-help available for Hadoop online.
Gene Baker | TrustRadius Reviewer

Amazon Redshift

Amazon Redshift 7.5
Based on 11 answers
The support was great and helped us in a timely fashion. We did use a lot of online forums as well, but the official documentation was an ongoing one, and it did take more time for us to look through it. We would have probably chosen a competitor product had it not been for the great support
Anonymous | TrustRadius Reviewer

Online Training

Hadoop

Hadoop 6.1
Based on 2 answers
Hadoop is a complex topic and is best suited to classroom training. Online training is a waste of time and money.
Bhushan Lakhe | TrustRadius Reviewer

Amazon Redshift

No score
No answers on this topic

Alternatives Considered

Hadoop

We have not used any product other than Hadoop, and I don't think our company will switch to any other product, as Hadoop is providing excellent results. Our company is growing rapidly, and Hadoop helps us keep up our performance and meet customer expectations. We also use HDFS, which provides very high bandwidth to support MapReduce workloads.
Anonymous | TrustRadius Reviewer

Amazon Redshift

Compared with Vertica: Redshift is cheaper and AWS-integrated (which was a plus because the whole company was on AWS).
Compared with BigQuery: Redshift has a standard SQL interface, though recently I heard good things about BigQuery and would try it out again.
Compared with Hive: Hive is great if you are in the PB+ range, but latencies tend to be much higher than with Redshift and it is not suited for ad-hoc applications.
Anonymous | TrustRadius Reviewer

Return on Investment

Hadoop

  • Hadoop has allowed us to scale out a few of our tier-1, customer facing applications to provide very fast access to reports and analytics.
  • Hadoop was easily implemented by our Linux team and onboarded by our Hadoop Admins.
  • Hadoop has been a very stable platform and only goes down due to server patching or other maintenance.
Mark McCully | TrustRadius Reviewer

Amazon Redshift

  • Redshift has had a very positive impact on our business. It has been used to provide analytics on marketing campaigns to boost revenue.
  • Redshift is instrumental in our payment collection business processes. It powers everything from who gets called to who gets sent collection emails.
Seth Goldberg | TrustRadius Reviewer

Pricing Details

Hadoop

General

Free Trial
Free/Freemium Version
Yes
Premium Consulting/Integration Services
Entry-level set up fee?
No

Hadoop Editions & Modules

Additional Pricing Details

Amazon Redshift

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Amazon Redshift Editions & Modules

Edition: Price
Current Generation: $0.25 - $13.04 per hour
Previous Generation: $0.25 - $4.08 per hour
Redshift Spectrum: $5.00 per terabyte of data scanned
Redshift Managed Storage: $0.24 per GB per month
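
The listed rates can be combined into a rough monthly estimate. The sketch below takes the Spectrum and managed-storage rates from the listing above as given; the 730-hour month, the function name, and the example workload are assumptions, and whether storage is billed separately depends on the node type, so treat the result as illustrative only.

```python
SPECTRUM_USD_PER_TB_SCANNED = 5.00       # from the listing above
MANAGED_STORAGE_USD_PER_GB_MONTH = 0.24  # from the listing above
HOURS_PER_MONTH = 730                    # assumption: average month length


def monthly_cost(node_rate_per_hour, nodes, stored_gb, scanned_tb):
    """Estimate a month's bill from hourly compute, storage, and Spectrum scans."""
    compute = node_rate_per_hour * nodes * HOURS_PER_MONTH
    storage = stored_gb * MANAGED_STORAGE_USD_PER_GB_MONTH
    spectrum = scanned_tb * SPECTRUM_USD_PER_TB_SCANNED
    return round(compute + storage + spectrum, 2)


# Hypothetical workload: two nodes at the lowest listed hourly rate,
# 100 GB in managed storage, and 3 TB scanned through Spectrum.
example = monthly_cost(0.25, nodes=2, stored_gb=100, scanned_tb=3)
```

For this hypothetical workload the estimate is $365.00 compute + $24.00 storage + $15.00 Spectrum = $404.00 per month.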
Additional Pricing Details