Cost reduction with Amazon EMR on EKS
January 06, 2025

Cost reduction with Amazon EMR on EKS

Anonymous | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User

Overall Satisfaction with Amazon EMR (Elastic MapReduce)

Amazon EMR (Elastic MapReduce) is heavily used at my organization for most if not all data pipeline computations: we started by using EC2 instances, we then moved to EMR Serverless and we are actually completing the transition to EMR on EKS. In general we use it for long-running analysis (SQLs with a lot of JOINs) and overall for batch processing. From what I've seen, we use it with Spark under the hood.

Pros

  • EMR on EKS is really flexible and cost-saving
  • Flexibility on how to run the jobs (and different implementations to choose from)
  • Support online and it's a regularly updated product

Cons

  • EMR on EKS could be better documented, especially since for the "magic" it does under the hood when using Spark
  • UI can be improved (especially for EMR on EKS)
  • Switching to EMR on EKS most of our EMR on EC2 jobs has produced a reduction of 4% in the overall costs (while maintaining the same level of data freshness)
Documentation is quite good and the product is regularly updated, so new features regularly come out. The setup is straightforward enough, especially once you have already established the overall platform infrastructure and the aws-cli APIs are easy enough to use. It would be nice to have some out-of-the-box integrations for checking logs and the Spark UI, rather than relying on know-how and digging through multiple levels to find the informations

Do you think Amazon EMR (Elastic MapReduce) delivers good value for the price?

Yes

Are you happy with Amazon EMR (Elastic MapReduce)'s feature set?

Yes

Did Amazon EMR (Elastic MapReduce) live up to sales and marketing promises?

I wasn't involved with the selection/purchase process

Did implementation of Amazon EMR (Elastic MapReduce) go as expected?

Yes

Would you buy Amazon EMR (Elastic MapReduce) again?

Yes

Based on my experience, Amazon EMR is well suited for companies with a good level of support on the Platform and Data Platform level, since it needs to be properly set up to avoid incurring in extra costs: it's quite easy to give more and more resources, so a job will eventually run but it's important to avoid extra costs. In general EMR on EC2 has been the most expensive of the EMR subproducts, while EMR on EKS has a good balance of giving enough resources to the jobs to run while maintaining costs low. The other recommendation is to use the latest versions of the EMR images, as otherwise the support from Amazon might not be very helpful.

Comments

More Reviews of Amazon EMR