Best Hadoop-Related Software33Hadoop1https://dudodiprj2sv7.cloudfront.net/product-logos/Iz/Fd/2KTEU64L1CK3.PNGApache Hive2https://dudodiprj2sv7.cloudfront.net/product-logos/O7/uK/NSF4U658JPGR.jpegApache Spark3https://dudodiprj2sv7.cloudfront.net/product-logos/0H/3D/90TJJ6JJ6KNK.jpegHortonworks Data Platform4https://dudodiprj2sv7.cloudfront.net/product-logos/sr/Ot/SKJR5FC930CT.pngDatameer5https://dudodiprj2sv7.cloudfront.net/vendor-logos/fk/od/27ARVEKSXRWT-180x180.PNGIBM Analytics Engine6https://dudodiprj2sv7.cloudfront.net/vendor-logos/yf/sf/DNSXTG99HOK3-180x180.JPEGAmazon Elastic MapReduce7https://dudodiprj2sv7.cloudfront.net/vendor-logos/LY/YM/1TDXH4LPI5BH-180x180.JPEGMapR8https://dudodiprj2sv7.cloudfront.net/vendor-logos/Ot/AZ/RO9FBJVXH6GQ-180x180.PNGApache Pig9https://dudodiprj2sv7.cloudfront.net/product-logos/xl/r1/RM6U3778FRLX.gifPresto10https://dudodiprj2sv7.cloudfront.net/vendor-logos/5l/z7/2FHZJK4K1FK3-180x180.JPEGCloudera Manager11https://dudodiprj2sv7.cloudfront.net/vendor-logos/Em/N4/OE63LH0T3KBT-180x180.JPEGSAP Vora12https://dudodiprj2sv7.cloudfront.net/vendor-logos/sW/OA/CZD3RG21S16S-180x180.JPEGApache Drill13https://dudodiprj2sv7.cloudfront.net/product-logos/0L/eF/G51VTS8OD4PS.jpegApache Sqoop14https://dudodiprj2sv7.cloudfront.net/product-logos/bN/zJ/38XBGZCA9KJR.pngAzure HDInsight15https://dudodiprj2sv7.cloudfront.net/vendor-logos/tf/J4/RTX1AO2GSVNS-180x180.JPEGCloudera Data Science Workbench16https://dudodiprj2sv7.cloudfront.net/vendor-logos/Em/N4/OE63LH0T3KBT-180x180.JPEGApache Flume17https://dudodiprj2sv7.cloudfront.net/vendor-logos/LN/4T/QCXUVWPH4BJL.pngOracle Big Data Cloud18https://dudodiprj2sv7.cloudfront.net/vendor-logos/VC/02/T4E108T4IWP2-180x180.PNGNGDATA Lily19https://dudodiprj2sv7.cloudfront.net/vendor-logos/4v/jG/T02FUS1ARGMA-180x180.JPEGPivotal Greenplum20https://dudodiprj2sv7.cloudfront.net/product-logos/NW/0Z/W55XLT2DEPSX.PNGAlluxio (formerly Tachyon)21https://dudodiprj2sv7.cloudfront.net/vendor-logos/nI/9Z/DMA1PDH7ZQDW-180x180.PNGAtScale22https://dudodiprj2sv7.cloudfront.net/vendor-logos/pQ/1d/5IGXLDD6U1VQ-180x180.JPEGJethro23https://dudodiprj2sv7.cloudfront.net/vendor-logos/rR/b5/JGV86J3M4AQO-180x180.JPEGTrillium Quality for Big Data24https://dudodiprj2sv7.cloudfront.net/vendor-logos/t5/RQ/3OV1XOV53NUX-180x180.JPEGZettaset Data Platform25https://dudodiprj2sv7.cloudfront.net/product-logos/rT/jz/7EGSSS12L0XR.png

Hadoop-Related Software

Best Hadoop-Related Software

TrustMaps are two-dimensional charts that compare products based on satisfaction ratings and research frequency by prospective buyers. Products must have 10 or more ratings to appear on this TrustMap.

Hadoop-Related Software Overview

What is Hadoop Software?

Hadoop is a very unusual kind of open-source data store from the Apache Foundation. However, an entire ecosystem of products has evolved around the Hadoop data store, to the point where it has become its own technology category.


The central idea of Hadoop is that data is spread across many commodity, inexpensive servers, although there are several commercial distributions of Hadoop from Cloudera and Hortonworks who wrap services around the technology.


Unlike a traditional database, Hadoop can handle huge volumes of both structured and unstructured data including log files, streaming data, images, audio and video files. All of this data can be put into the Hadoop cluster and accessed, modified and processed in place, eliminating the need to duplicate and structure data in a traditional warehouse.


Once this huge volume of structured and unstructured data has been stored, how do you extract any value from it? Since Hadoop is not a structured database, structured query languages like SQL do not work. But Hadoop has its own data processing and query framework called MapReduce. Developers can use MapReduce to write programs that can retrieve whatever data is needed. However, MapReduce has several constraints affecting performance and a newer product like Apache Spark provides an alternative distributed computing framework, which is significantly more efficient. Similarly, products like Hive and Cloudera Impala provide a SQL-like query language, which is much easier for data analysts to learn and use.

Hadoop-Related Products

Listings (1-25 of 33)

222 Ratings

Hadoop is an open source software from Apache, supporting distributed processing and data storage. Hadoop is popular for its scalability, reliability, and functionality available across commoditized hardware.

66 Ratings

Apache Hive is database/data warehouse software that supports data querying and analysis of large datasets stored in the Hadoop distributed file system (HDFS) and other compatible systems, and is distributed under an open source license.

9 Ratings

Presto is an open source SQL query engine supported by Teradata designed to run queries on data stored in Hadoop or in traditional databases. Teradata's development of Presto followed the acquisition of Hadapt and Revelytix.

2 Ratings

SAP Vora is a computing engine designed to provide better accessibility to Hadoop data from SAP HANA. SAP Vora manages unstructured Hadoop data by building structured data hierarchies and making the data queryable through an SQL interface.

25 Ratings

HDInsight is an implementation of the Apache Hadoop technology stack on the Microsoft Azure cloud platform: It is based on the Hortonworks Hadoop distribution. Microsoft Azure HDInsight includes implementations of Apache Spark, HBase, Storm, Pig, Hive, Sqoop, Oozie, Ambari, etc. It also integra...

We don't have enough ratings and reviews to provide an overall score.

NGDATA Lily CDP is a hadoop-based platform acquired by and supported by Belgian company NGDATA since original developer Outerthought's acquisition by that company.

We don't have enough ratings and reviews to provide an overall score.

Pivotal Greenplum (formerly from EMC) is a massively parallel processing (MPP) data platform, based on the open source Greenplum Database. The data warehouse application is supported by Pivotal Software.

We don't have enough ratings and reviews to provide an overall score.

Jethro, from the company of the same name headquartered in New York, delivers interactive enterprise business intelligence and enterprise data warehouse services on hadoop.

We don't have enough ratings and reviews to provide an overall score.

Trillium Quality for Big Data supports enterprises using a Big Data framework like Hadoop with data quality functions like data integration, data cleansing, standardization and parsing, with prebuilt process flows that can be configured to meet business needs.