Amazon EMR (Elastic MapReduce) vs. Oracle Autonomous Data Warehouse vs. Amazon Redshift

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Amazon EMR
Score 8.9 out of 10
N/A
Amazon EMR is a cloud-native big data platform for processing vast amounts of data quickly, at scale. Using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi (Incubating), and Presto, coupled with the scalability of Amazon EC2 and scalable storage of Amazon S3, EMR gives analytical teams the engines and elasticity to run Petabyte-scale analysis.N/A
Oracle Autonomous Data Warehouse
Score 8.3 out of 10
N/A
Oracle Autonomous Data Warehouse is optimized for analytic workloads, including data marts, data warehouses, data lakes, and data lakehouses. With Autonomous Data Warehouse, data scientists, business analysts, and nonexperts can discover business insights using data of any size and type. The solution is built for the cloud and optimized using Oracle Exadata.N/A
Amazon Redshift
Score 8.9 out of 10
N/A
Amazon Redshift is a hosted data warehouse solution, from Amazon Web Services.
$0.24
per GB per month
Pricing
Amazon EMR (Elastic MapReduce)Oracle Autonomous Data WarehouseAmazon Redshift
Editions & Modules
No answers on this topic
No answers on this topic
Redshift Managed Storage
$0.24
per GB per month
Current Generation
$0.25 - $13.04
per hour
Previous Generation
$0.25 - $4.08
per hour
Redshift Spectrum
$5.00
per terabyte of data scanned
Offerings
Pricing Offerings
Amazon EMROracle Autonomous Data WarehouseAmazon Redshift
Free Trial
NoNoNo
Free/Freemium Version
NoNoNo
Premium Consulting/Integration Services
NoNoNo
Entry-level Setup FeeNo setup feeNo setup feeNo setup fee
Additional Details
More Pricing Information
Community Pulse
Amazon EMR (Elastic MapReduce)Oracle Autonomous Data WarehouseAmazon Redshift
Considered Multiple Products
Amazon EMR
Chose Amazon EMR (Elastic MapReduce)
Snowflake is a lot easier to get started with than the other options. Snowflake's data lake building capabilities are far more powerful. Although Amazon EMR isn't our first pick, we've had an excellent experience with EC2 and S3. Because of our current API interfaces, it made …
Chose Amazon EMR (Elastic MapReduce)
Having one of these enterprise edition license comes at its own costs. But, the flexibility to have the cluster spin up with the workbenches and code snippets on the same is really beneficial. Especially, if one had to move out of EMR and consider an option which reduces the …
Oracle Autonomous Data Warehouse
Chose Oracle Autonomous Data Warehouse
In my personal opinion, Amazon Redshift is much better than Oracle Data Warehouse in two main ways. First, it's in the Cloud which eliminates the need to purchase and maintain dedicated hardware. Second, the pricing models for Redshift are far more flexible and affordable. …
Chose Oracle Autonomous Data Warehouse
Oracle is a lot cheaper than traditional data warehouse appliance solutions, even if you get an expensive DBA who knows what he/she is doing. It definitely takes a lot more work to ensure it scales as your data size grows. While it won't scale past the terabyte sized data sets, …
Amazon Redshift
Chose Amazon Redshift
We evaluated [Amazon] Redshift vs BigQuery vs Amazon EMR, back in 2014.
Back then BigQuery cost was slightly higher than that of [Amazon] Redshift price structure.
Amazon EMR, needs lots more management (Admin tasks) and EMR is designed to be ephemeral and not designed to be a …
Chose Amazon Redshift
Prezi is using AWS so Amazon Redshift was the obvious choice. It is fast, scalable and easy to use. Supplemented with Spark and Hive I'm completely satisfied using Redhsift. Sometimes I miss commands I used earlier on MS and Oracle SQL and the lack of procedural features is …
Chose Amazon Redshift
  1. Compared to Oracle Data Warehouse, Redshift is a better data warehouse. However, this comes at a cost of advanced functionality and the ability to do OLTP style processing. What you gain is faster querying time and better scalability.
  2. Compared to MySQL, you gain a WHOLE lot. MySQ…
Best Alternatives
Amazon EMR (Elastic MapReduce)Oracle Autonomous Data WarehouseAmazon Redshift
Small Businesses

No answers on this topic

Google BigQuery
Google BigQuery
Score 8.8 out of 10
Google BigQuery
Google BigQuery
Score 8.8 out of 10
Medium-sized Companies
Cloudera Manager
Cloudera Manager
Score 9.9 out of 10
Cloudera Enterprise Data Hub
Cloudera Enterprise Data Hub
Score 9.0 out of 10
Snowflake
Snowflake
Score 8.7 out of 10
Enterprises
IBM Analytics Engine
IBM Analytics Engine
Score 7.2 out of 10
Oracle Exadata
Oracle Exadata
Score 9.8 out of 10
Snowflake
Snowflake
Score 8.7 out of 10
All AlternativesView all alternativesView all alternativesView all alternatives
User Ratings
Amazon EMR (Elastic MapReduce)Oracle Autonomous Data WarehouseAmazon Redshift
Likelihood to Recommend
8.0
(19 ratings)
8.9
(32 ratings)
9.0
(38 ratings)
Likelihood to Renew
-
(0 ratings)
8.0
(1 ratings)
-
(0 ratings)
Usability
7.0
(4 ratings)
-
(0 ratings)
9.0
(10 ratings)
Support Rating
9.0
(3 ratings)
-
(0 ratings)
9.0
(7 ratings)
Implementation Rating
-
(0 ratings)
9.0
(1 ratings)
-
(0 ratings)
Contract Terms and Pricing Model
-
(0 ratings)
-
(0 ratings)
10.0
(1 ratings)
User Testimonials
Amazon EMR (Elastic MapReduce)Oracle Autonomous Data WarehouseAmazon Redshift
Likelihood to Recommend
Amazon AWS
We are running it to perform preparation which takes a few hours on EC2 to be running on a spark-based EMR cluster to total the preparation inside minutes rather than a few hours. Ease of utilization and capacity to select from either Hadoop or spark. Processing time diminishes from 5-8 hours to 25-30 minutes compared with the Ec2 occurrence and more in a few cases.
Read full review
Oracle
II would recommend Oracle Autonomous Data Warehouse to someone looking to fully automate the transferring of data especially in a warehouse scenario though I can see the elasticity of the suite that is offered and can see it is applicable in other scenarios not just warehouses.
Read full review
Amazon AWS
If the number of connections is expected to be low, but the amounts of data are large or projected to grow it is a good solutions especially if there is previous exposure to PostgreSQL. Speaking of Postgres, Redshift is based on several versions old releases of PostgreSQL so the developers would not be able to take advantage of some of the newer SQL language features. The queries need some fine-tuning still, indexing is not provided, but playing with sorting keys becomes necessary. Lastly, there is no notion of the Primary Key in Redshift so the business must be prepared to explain why duplication occurred (must be vigilant for)
Read full review
Pros
Amazon AWS
  • EMR does well in managing the cost as it uses the task node cores to process the data and these instances are cheaper when the data is stored on s3. It is really cost efficient. No need to maintain any libraries to connect to AWS resources.
  • EMR is highly available, secure and easy to launch. No much hassle in launching the cluster (Simple and easy).
  • EMR manages the big data frameworks which the developer need not worry (no need to maintain the memory and framework settings) about the framework settings. It's all setup on launch time. The bootstrapping feature is great.
Read full review
Oracle
  • Very easy and fast to load data into the Oracle Autonomous Data Warehouse
  • Exceptionally fast retrieval of data joining 100 million row table with a billion row table plus the size of the database was reduced by a factor of 10 due to how Oracle store[s] and organise[s] data and indexes.
  • Flexibility with scaling up and down CPU on the fly when needed, and just stop it when not needed so you don't get charged when it is not running.
  • It is always patched and always available and you can add storage dynamically as you need it.
Read full review
Amazon AWS
  • [Amazon] Redshift has Distribution Keys. If you correctly define them on your tables, it improves Query performance. For instance, we can define Mapping/Meta-data tables with Distribution-All Key, so that it gets replicated across all the nodes, for fast joins and fast query results.
  • [Amazon] Redshift has Sort Keys. If you correctly define them on your tables along with above Distribution Keys, it further improves your Query performance. It also has Composite Sort Keys and Interleaved Sort Keys, to support various use cases
  • [Amazon] Redshift is forked out of PostgreSQL DB, and then AWS added "MPP" (Massively Parallel Processing) and "Column Oriented" concepts to it, to make it a powerful data store.
  • [Amazon] Redshift has "Analyze" operation that could be performed on tables, which will update the stats of the table in leader node. This is sort of a ledger about which data is stored in which node and which partition with in a node. Up to date stats improves Query performance.
Read full review
Cons
Amazon AWS
  • It would have been better if packages like HBase and Flume were available with Amazon EMR. This would make the product even more helpful in some cases.
  • Products like Cloudera provide the options to move the whole deployment into a dedicated server and use it at our discretion. This would have been a good option if available with EMR.
  • If EMR gave the option to be used with any choice of cloud provider, it would have helped instead of having to move the data from another cloud service to S3.
Read full review
Oracle
  • It is very expensive product. But not to mention, there's good reasons why it is expensive.
  • The product should support more cloud based services. When we made the decision to buy the product (which was 20 years ago,) there was no such thing to consider, but moving to a cloud based data warehouse may promise more scalability, agility, and cost reduction. The new version of Data Warehouse came out on the way, but it looks a bit behind compared to other competitors.
  • Our healthcare data consists of 30% coded data (such as ICD 10 / SNOMED C,T) but the rests is narrative (such as clinical notes.). Oracle is the best for warehousing standardized data, but not a good choice when considering unstructured data, or a mix of the two.
Read full review
Amazon AWS
  • We've experienced some problems with hanging queries on Redshift Spectrum/external tables. We've had to roll back to and old version of Redshift while we wait for AWS to provide a patch.
  • Redshift's dialect is most similar to that of PostgreSQL 8. It lacks many modern features and data types.
  • Constraints are not enforced. We must rely on other means to verify the integrity of transformed tables.
Read full review
Likelihood to Renew
Amazon AWS
No answers on this topic
Oracle
Because
  • It is really simple to provision and configure.
  • Does not require continous attention from the DBA, autonomous features allows the database to perform most of the regular admin tasks without need for human intervention.
  • Allows to integrate multiple data sources on a central data warehouse, and explode the information stored with different analytic and reporting tools.
Read full review
Amazon AWS
No answers on this topic
Usability
Amazon AWS
Documentation is quite good and the product is regularly updated, so new features regularly come out. The setup is straightforward enough, especially once you have already established the overall platform infrastructure and the aws-cli APIs are easy enough to use. It would be nice to have some out-of-the-box integrations for checking logs and the Spark UI, rather than relying on know-how and digging through multiple levels to find the informations
Read full review
Oracle
No answers on this topic
Amazon AWS
Just very happy with the product, it fits our needs perfectly. Amazon pioneered the cloud and we have had a positive experience using RedShift. Really cool to be able to see your data housed and to be able to query and perform administrative tasks with ease.
Read full review
Support Rating
Amazon AWS
I give the overall support for Amazon EMR this rating because while the support technicians are very knowledgeable and always able to help, it sometimes takes a very long time to get in contact with one of the support technicians. So overall the support is pretty good for Amazon EMR.
Read full review
Oracle
No answers on this topic
Amazon AWS
The support was great and helped us in a timely fashion. We did use a lot of online forums as well, but the official documentation was an ongoing one, and it did take more time for us to look through it. We would have probably chosen a competitor product had it not been for the great support
Read full review
Implementation Rating
Amazon AWS
No answers on this topic
Oracle
Understanding Oracle Cloud Infrastructure is really simple, and Autonomous databases are even more. Using shared or dedicated infrastructure is one of the few things you need to consider at the moment of starting provisioning your Oracle Autonomous Data Warehouse.
Read full review
Amazon AWS
No answers on this topic
Alternatives Considered
Amazon AWS
Snowflake is a lot easier to get started with than the other options. Snowflake's data lake building capabilities are far more powerful. Although Amazon EMR isn't our first pick, we've had an excellent experience with EC2 and S3. Because of our current API interfaces, it made more sense for us to continue with Hadoop rather than explore other options.
Read full review
Oracle
As I mentioned, I have also worked with Amazon Redshift, but it is not as versatile as Oracle Autonomous Data Warehouse and does not provide a large variety of products. Oracle Autonomous Data Warehouse is also more reliable than Amazon Redshift, hence why I have chosen it
Read full review
Amazon AWS
Than Vertica: Redshift is cheaper and AWS integrated (which was a plus because the whole company was on AWS).
Than BigQuery: Redshift has a standard SQL interface, though recently I heard good things about BigQuery and would try it out again.
Than Hive: Hive is great if you are in the PB+ range, but latencies tend to be much slower than Redshift and it is not suited for ad-hoc applications.
Read full review
Contract Terms and Pricing Model
Amazon AWS
No answers on this topic
Oracle
No answers on this topic
Amazon AWS
Redshift is relatively cheaper tool but since the pricing is dynamic, there is always a risk of exceeding the cost. Since most of our team is using it as self serve and there is no continuous tracking by a dedicated team, it really needs time & effort on analyst's side to know how much it is going to cost.
Read full review
Return on Investment
Amazon AWS
  • It was obviously cheaper and convenient to use as most of our data processing and pipelines are on AWS. It was fast and readily available with a click and that saved a ton of time rather than having to figure out the down time of the cluster if its on premises.
  • It saved time on processing chunks of big data which had to be processed in short period with minimal costs. EMR solved this as the cluster setup time and processing was simple, easy, cheap and fast.
  • It had a negative impact as it was very difficult in submitting the test jobs as it lags a UI to submit spark code snippets.
Read full review
Oracle
  • Overall the business objective of all of our clients have been met positively with Oracle Data Warehouse. All of the required analysis the users were able to successfully carry out using the warehouse data.
  • Using a 3-tier architecture with the Oracle Data Warehouse at the back end the mid-tier has been integrated well. This is big plus in providing the necessary tools for end users of the data warehouse to carry out their analysis.
  • All of the various BI products (OBIEE, Cognos, etc.) are able to use and exploit the various analytic built-in functionalities of the Oracle Data Warehouse.
Read full review
Amazon AWS
  • Our company is moving to the AWS infrastructure, and in this context moving the warehouse environments to Redshift sounds logical regardless of the cost.
  • Development organizations have to operate in the Dev/Ops mode where they build and support their apps at the same time.
  • Hard to estimate the overall ROI of moving to Redshift from my position. However, running Redshift seems to be inexpensive compared to all the licensing and hardware costs we had on our RDBMS platform before Redshift.
Read full review
ScreenShots