Most Commonly Comparedto Amazon EMR

Best Amazon EMR Alternatives for Medium-sized Companies

Cloudera Manager

Score 9.7 out of 10

Cloudera Manager is a management application for Apache Hadoop and the enterprise data hub, from Cloudera. Its automated wizards let users quickly deploy a cluster, no matter what the scale or the deployment environment, complete with intelligent, system-based default settings.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Azure Data Lake Storage

Score 8.4 out of 10

Azure Data Lake Storage Gen2 is a highly scalable and cost-effective data lake solution for big data analytics. It combines the power of a high-performance file system with massive scale and economy to help you speed your time to insight. Data Lake Storage Gen2 extends Azure Blob Storage capabilities and is optimized for analytics workloads.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Apache Spark

Score 8.7 out of 10

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Hadoop

Score 7.4 out of 10

Hadoop is an open source software from Apache, supporting distributed processing and data storage. Hadoop is popular for its scalability, reliability, and functionality available across commoditized hardware.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Apache Hive

Score 8.2 out of 10

Apache Hive is database/data warehouse software that supports data querying and analysis of large datasets stored in the Hadoop distributed file system (HDFS) and other compatible systems, and is distributed under an open source license.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Best Amazon EMR Alternatives for Enterprises

IBM Analytics Engine

Score 8.9 out of 10

IBM BigInsights is an analytics and data visualization tool leveraging hadoop.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Apache Spark

Score 8.7 out of 10

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Azure Data Lake Storage

Score 8.4 out of 10

Azure Data Lake Storage Gen2 is a highly scalable and cost-effective data lake solution for big data analytics. It combines the power of a high-performance file system with massive scale and economy to help you speed your time to insight. Data Lake Storage Gen2 extends Azure Blob Storage capabilities and is optimized for analytics workloads.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Hadoop

Score 7.4 out of 10

Hadoop is an open source software from Apache, supporting distributed processing and data storage. Hadoop is popular for its scalability, reliability, and functionality available across commoditized hardware.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Apache Hive

Score 8.2 out of 10

Apache Hive is database/data warehouse software that supports data querying and analysis of large datasets stored in the Hadoop distributed file system (HDFS) and other compatible systems, and is distributed under an open source license.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Apache Pig

Score 8.4 out of 10

Apache Pig is a programming tool for creating MapReduce programs used in Hadoop.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Presto

Score 3.0 out of 10

Presto is an open source SQL query engine designed to run queries on data stored in Hadoop or in traditional databases. Teradata supported development of Presto followed the acquisition of Hadapt and Revelytix.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.

Hortonworks Data Platform

Score 7.0 out of 10

Hortonworks Data Platform (HDP) is an open source framework for distributed storage and processing of large, multi-source data sets. HDP modernizes IT infrastructure and keeps data secure—in the cloud or on-premises—while helping to drive new revenue streams, improve customer experience, and control costs. Hortonworks merged with Cloudera in eary 2019.

Higher Rated Features

There is not enough information to display features

Popular Integrations

There is not enough information to display integrations.