Best Hadoop-Related Software33Hydrograph1https://media.trustradius.com/vendor-logos/uo/sZ/4EIO9FBOK63L-180x180.JPEGBitwise Hadoop Adaptor for Mainframe Data2https://media.trustradius.com/vendor-logos/uo/sZ/4EIO9FBOK63L-180x180.JPEGCohesity Imanis Data3Azure Data Lake Storage4https://media.trustradius.com/vendor-logos/tf/J4/RTX1AO2GSVNS-180x180.JPEGHPE BlueData EPIC5https://media.trustradius.com/vendor-logos/qq/uK/ZTECRXT03ME3-180x180.JPEGCloudera Distribution Hadoop (CDH)6https://media.trustradius.com/vendor-logos/vs/zF/28EBZ7FNB71M-180x180.PNGIBM Analytics for Apache Spark7https://media.trustradius.com/product-logos/Uv/Xp/77N37PEPH17Z-180x180.PNGKognitio8https://media.trustradius.com/product-logos/De/RS/QH6439JXHSKK-180x180.JPEGStarburst Presto9https://media.trustradius.com/vendor-logos/nD/xK/1W04172AEDRZ-180x180.JPEGJethro10https://media.trustradius.com/vendor-logos/rR/b5/JGV86J3M4AQO-180x180.JPEGArcadia Data11https://media.trustradius.com/vendor-logos/f2/VO/TNGF1OQEFJQ9-180x180.PNGSyncfusion Big Data Platform12https://media.trustradius.com/product-logos/ge/im/2PCLBSRCUB2D-180x180.JPEGPivotal Greenplum13https://media.trustradius.com/product-logos/00/0u/W55XLT2DEPSX-180x180.PNG

Hadoop-Related Software

Best Hadoop-Related Software

TrustMaps are two-dimensional charts that compare products based on satisfaction ratings and research frequency by prospective buyers. Products must have 10 or more ratings to appear on this TrustMap.

Hadoop-Related Software Overview

What is Hadoop Software?

Hadoop is a very unusual kind of open-source data store from the Apache Foundation. However, an entire ecosystem of products has evolved around the Hadoop data store, to the point where it has become its own technology category.


The central idea of Hadoop is that data is spread across many commodity, inexpensive servers, although there are several commercial distributions of Hadoop from Cloudera and Hortonworks who wrap services around the technology.


Unlike a traditional database, Hadoop can handle huge volumes of both structured and unstructured data including log files, streaming data, images, audio and video files. All of this data can be put into the Hadoop cluster and accessed, modified and processed in place, eliminating the need to duplicate and structure data in a traditional warehouse.


Once this huge volume of structured and unstructured data has been stored, how do you extract any value from it? Since Hadoop is not a structured database, structured query languages like SQL do not work. But Hadoop has its own data processing and query framework called MapReduce. Developers can use MapReduce to write programs that can retrieve whatever data is needed. However, MapReduce has several constraints affecting performance and a newer product like Apache Spark provides an alternative distributed computing framework, which is significantly more efficient. Similarly, products like Hive and Cloudera Impala provide a SQL-like query language, which is much easier for data analysts to learn and use.

Hadoop-Related Products

Listings (26-38 of 38)

We don't have enough ratings and reviews to provide an overall score.

Bitwise offers Hydrograph, a data integration tool with provides ETL functionality on Hadoop and Spark.

We don't have enough ratings and reviews to provide an overall score.

Imanis Data, now from Cohesity (acquired May 2019) is designed to present radically simple backup, recovery, and data management for Hadoop Distributed File System and NoSQL distributed databases including MongoDB, Cassandra, CouchbaseDB, Hbase, and others.

We don't have enough ratings and reviews to provide an overall score.

Azure Data Lake Storage Gen2 is a highly scalable and cost-effective data lake solution for big data analytics. It combines the power of a high-performance file system with massive scale and economy to help you speed your time to insight. Data Lake Storage Gen2 extends Azure Blob Storage capabilitie…

We don't have enough ratings and reviews to provide an overall score.

BlueData is designed to provide a simple and easy way to provide self-service provisioning, policy-based automation, and push-button upgrades for AI and Big Data applications. By using the BlueData EPIC software platform, the user eliminates cluster sprawl across the enterprise and reduces data dupl…

We don't have enough ratings and reviews to provide an overall score.

CDH is Cloudera’s 100% open source platform distribution, including Apache Hadoop and built specifically to meet enterprise demands. CDH delivers everything needed for enterprise use right out of the box. By integrating Hadoop with more than a dozen other critical open source projects, Cloudera has …

We don't have enough ratings and reviews to provide an overall score.

WX2 is the data and analytics focused data warehouse appliance solution from UK company Kognitio.

We don't have enough ratings and reviews to provide an overall score.

Presto is a distributed SQL query engine used for large-scale, interactive analytics, enabling users to run analytic SQL queries across a wide variety of data sources with elastic scaling. Starburst Presto Enterprise Edition (EE) is Starburst Data's paid solution that supplies performance, security,…

We don't have enough ratings and reviews to provide an overall score.

Jethro, from the company of the same name headquartered in New York, delivers interactive enterprise business intelligence and enterprise data warehouse services on hadoop.

The Syncfusion Big Data Platform is a Hadoop distribution designed for Windows. Its users can develop on Windows using familiar tools, and deploy on Windows. The vendor says they have taken the advantages of the Hadoop environment – from easy querying across structured and unstructured data to cost…

We don't have enough ratings and reviews to provide an overall score.

Pivotal Greenplum (formerly from EMC) is a massively parallel processing (MPP) data platform, based on the open source Greenplum Database. The data warehouse application is supported by Pivotal Software.