Amazon EMR (Elastic MapReduce) vs. Google Cloud Dataplex

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Amazon EMR
Score 8.6 out of 10
N/A
Amazon EMR is a cloud-native big data platform for processing vast amounts of data quickly, at scale. Using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi (Incubating), and Presto, coupled with the scalability of Amazon EC2 and scalable storage of Amazon S3, EMR gives analytical teams the engines and elasticity to run Petabyte-scale analysis.N/A
Google Cloud Dataplex
Score 0.0 out of 10
N/A
Available on Google Cloud, Dataplex’s intelligent data fabric enables organizations to centrally discover, manage, monitor, and govern their data across data lakes, data warehouses, and data marts with consistent controls, providing access to data and powering analytics. Pricing for Dataplex is pay-as-you-go.
$0.06
per DCU-hour
Pricing
Amazon EMR (Elastic MapReduce)Google Cloud Dataplex
Editions & Modules
No answers on this topic
Dataplex Shuffle Storage
$0.040
per GB-month
Standard Processing Fee
$0.060
per DCU-hour
Premium Processing Fee
$0.089
per DCU-hour
Offerings
Pricing Offerings
Amazon EMRGoogle Cloud Dataplex
Free Trial
NoNo
Free/Freemium Version
NoYes
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Community Pulse
Amazon EMR (Elastic MapReduce)Google Cloud Dataplex
Top Pros

No answers on this topic

Top Cons

No answers on this topic

Best Alternatives
Amazon EMR (Elastic MapReduce)Google Cloud Dataplex
Small Businesses

No answers on this topic

No answers on this topic

Medium-sized Companies
Cloudera Manager
Cloudera Manager
Score 9.9 out of 10

No answers on this topic

Enterprises
IBM Analytics Engine
IBM Analytics Engine
Score 8.8 out of 10
Talend Data Fabric
Talend Data Fabric
Score 9.5 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
Amazon EMR (Elastic MapReduce)Google Cloud Dataplex
Likelihood to Recommend
8.4
(18 ratings)
-
(0 ratings)
Usability
8.3
(3 ratings)
-
(0 ratings)
Support Rating
9.0
(3 ratings)
-
(0 ratings)
User Testimonials
Amazon EMR (Elastic MapReduce)Google Cloud Dataplex
Likelihood to Recommend
Amazon AWS
We are running it to perform preparation which takes a few hours on EC2 to be running on a spark-based EMR cluster to total the preparation inside minutes rather than a few hours. Ease of utilization and capacity to select from either Hadoop or spark. Processing time diminishes from 5-8 hours to 25-30 minutes compared with the Ec2 occurrence and more in a few cases.
Read full review
Google
No answers on this topic
Pros
Amazon AWS
  • EMR does well in managing the cost as it uses the task node cores to process the data and these instances are cheaper when the data is stored on s3. It is really cost efficient. No need to maintain any libraries to connect to AWS resources.
  • EMR is highly available, secure and easy to launch. No much hassle in launching the cluster (Simple and easy).
  • EMR manages the big data frameworks which the developer need not worry (no need to maintain the memory and framework settings) about the framework settings. It's all setup on launch time. The bootstrapping feature is great.
Read full review
Google
No answers on this topic
Cons
Amazon AWS
  • It would have been better if packages like HBase and Flume were available with Amazon EMR. This would make the product even more helpful in some cases.
  • Products like Cloudera provide the options to move the whole deployment into a dedicated server and use it at our discretion. This would have been a good option if available with EMR.
  • If EMR gave the option to be used with any choice of cloud provider, it would have helped instead of having to move the data from another cloud service to S3.
Read full review
Google
No answers on this topic
Usability
Amazon AWS
I give Amazon EMR this rating because while it is great at simplifying running big data frameworks, providing the Amazon EMR highlights, product details, and pricing information, and analyzing vast amounts of data, it can be run slow, freeze and glitch sometimes. So overall Amazon EMR is pretty good to use other than some basic issues.
Read full review
Google
No answers on this topic
Support Rating
Amazon AWS
There's a vast group of trained and certified (by AWS) professionals ready to work for anyone that needs to implement, configure or fix EMR. There's also a great amount of documentation that is accessible to anyone who's trying to learn this. And there's also always the help of AWS itself. They have people ready to help you analyze your needs and then make a recommendation.
Read full review
Google
No answers on this topic
Alternatives Considered
Amazon AWS
Snowflake is a lot easier to get started with than the other options. Snowflake's data lake building capabilities are far more powerful. Although Amazon EMR isn't our first pick, we've had an excellent experience with EC2 and S3. Because of our current API interfaces, it made more sense for us to continue with Hadoop rather than explore other options.
Read full review
Google
No answers on this topic
Return on Investment
Amazon AWS
  • It was obviously cheaper and convenient to use as most of our data processing and pipelines are on AWS. It was fast and readily available with a click and that saved a ton of time rather than having to figure out the down time of the cluster if its on premises.
  • It saved time on processing chunks of big data which had to be processed in short period with minimal costs. EMR solved this as the cluster setup time and processing was simple, easy, cheap and fast.
  • It had a negative impact as it was very difficult in submitting the test jobs as it lags a UI to submit spark code snippets.
Read full review
Google
No answers on this topic
ScreenShots