25 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 8.2 out of 101
12 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 8.6 out of 101

Add comparison

Likelihood to Recommend

Amazon EMR

EMR is suited if the jobs are long running and doesn't really need much monitoring. EMR is really flexible in processing the data on s3 as a developer doesn't need to spend time on debugging the connections to s3 from a big data framework as most of the configuration is taken care of by Amazon. Very cheap when compared to most of the solutions on the market and the ready to go configuration at the launch time reduces the amount of time required for admin tasks. So, considering the cheap cost, processing options on s3 and scalability via adding task nodes, EMR serves a better purpose for startups considering open source and cost efficient options. However, EMR comes with its own disadvantages. There is no proper UI to track real time jobs which is however possible with Enterprise editions like Cloudera, Hortonworks etc. EMR could provide an interface to add workbooks and code snippets in the cluster as it would reduce the time to submit the tasks. EMR also lags the potential to automatically replace unhealthy nodes.
No photo available

Databricks Unified Analytics Platform

  • DB generally fits 95% of what you need to do
  • Primarily the ability to transform data and or do ad-hoc DS work
No photo available

Feature Rating Comparison

Platform Connectivity

Amazon EMR
Databricks Unified Analytics Platform
8.3
Connect to Multiple Data Sources
Amazon EMR
Databricks Unified Analytics Platform
9.0
Extend Existing Data Sources
Amazon EMR
Databricks Unified Analytics Platform
9.0
Automatic Data Format Detection
Amazon EMR
Databricks Unified Analytics Platform
7.0

Data Exploration

Amazon EMR
Databricks Unified Analytics Platform
6.0
Visualization
Amazon EMR
Databricks Unified Analytics Platform
6.0
Interactive Data Analysis
Amazon EMR
Databricks Unified Analytics Platform
6.0

Data Preparation

Amazon EMR
Databricks Unified Analytics Platform
8.0
Interactive Data Cleaning and Enrichment
Amazon EMR
Databricks Unified Analytics Platform
8.0
Data Transformations
Amazon EMR
Databricks Unified Analytics Platform
9.0
Data Encryption
Amazon EMR
Databricks Unified Analytics Platform
7.0
Built-in Processors
Amazon EMR
Databricks Unified Analytics Platform
8.0

Platform Data Modeling

Amazon EMR
Databricks Unified Analytics Platform
8.3
Multiple Model Development Languages and Tools
Amazon EMR
Databricks Unified Analytics Platform
9.0
Automated Machine Learning
Amazon EMR
Databricks Unified Analytics Platform
8.0
Single platform for multiple model development
Amazon EMR
Databricks Unified Analytics Platform
9.0
Self-Service Model Delivery
Amazon EMR
Databricks Unified Analytics Platform
7.0

Model Deployment

Amazon EMR
Databricks Unified Analytics Platform
7.5
Flexible Model Publishing Options
Amazon EMR
Databricks Unified Analytics Platform
7.0
Security, Governance, and Cost Controls
Amazon EMR
Databricks Unified Analytics Platform
8.0

Pros

  • Distributed computing
  • Fault tolerant
  • Uptime
No photo available
  • Extremely Flexible in Data Scenarios
  • Fantastic Performance
  • DB is always updating the system so we can have latest features.
No photo available

Cons

  • Sometimes bootstrapping certain tools comes with debugging costs. The tools provided by some of the enterprise editions are great compared to EMR.
  • Like some of the enterprise editions EMR does not provide on premises options.
  • No UI client for saving the workbooks or code snippets. Everything has to go through submitting process. Not really convenient for tracking the job as well.
No photo available
  • Better Localized Testing
  • When they were primarily OSS Spark; it was easier to test/manage releases versus the newer DB Runtime. Wish there was more configuration in Runtime less pick a version.
  • Graphing Support went non-existent; when it was one of their compelling general engine.
No photo available

Usability

No score
No answers yet
No answers on this topic
Databricks Unified Analytics Platform9.0
Based on 1 answer
This has been very useful in my organization for shared notebooks, integrated data pipeline automation and data sources integrations. Integration with AWS is seamless. Non tech users can easily learn how to use Databricks. You can have your company LDAP connect to it for login based access controls to some extent
No photo available

Alternatives Considered

Perhaps the biggest advantage Amazon Elastic MapReduce has over competing big data management software is the user base. Elastic MapReduce, compliments of its connection with Amazon, has a large user base to whom questions about functionality can be addressed. The software also has a very nice user interface. Additionally, Elastic MapReduce runs fairly quickly and the results are generally easier to manipulate. With this said, Elastic MapReduce is definitely not the easiest nor quickest tool for big data analytics.
Thomas Young profile photo
When we started using it, only the notebook experience was mature. However, DB was very helpful giving us direct support to get onto their platform. Really there was little in the way to compare to them at the time. AWS has services but not the same low-cost angle
No photo available

Return on Investment

  • Amazon Elastic MapReduce has had a positive ROI in the sense that it saved time managing big data projects where analysts were using different big data tools. Essentially, an increase in employee productivity.
  • Elastic MapReduce is not worth it in cases where you're just trying things out. You'll likely lose money unless you're sure that using MapReduce is a good idea.
  • Elastic MapReduce takes some time learning, although not too much. If the employee is less well-versed in big data analytics, the software is a high hill to climb that eats up employee time.
Thomas Young profile photo
  • The ability to spin up a BIG Data platform with little infrastructure overhead allows us to focus on business value not admin
  • DB has the ability to terminate/time out instances which helps manage cost.
  • The ability to quickly access typical hard to build data scenarios easily is a strength.
No photo available

Pricing Details

Amazon EMR

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details

Databricks Unified Analytics Platform

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details