Hadoop-Related Software - Reviews

Overall Rating

Reviewer's Company Size

Vendor

Product

Last Updated

Industry

Department

Experience

Job Type

Role

Reviews (1-25 of 116)

Anonymous | TrustRadius Reviewer
September 15, 2019

Cloudera review

Score 6 out of 10
Vetted Review
Verified User
Review Source
Anonymous | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User
Review Source
Anonymous | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User
Review Source

If your data is very huge, I recommend converting the underlying technology into Apache Spark. This will save you a lot of time and effort in the near future due to your growing data. The Apache Spark scalability feature also means it handles all the future data related processing.

Anonymous | TrustRadius Reviewer
March 16, 2019

Apache Spark Review

Score 7 out of 10
Vetted Review
Verified User
Review Source

We used Apache Spark within our department as a Solution Architecture team. It helped make big data processing more efficient since the same framework can be used for batch and stream processing.

Anonymous | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User
Review Source

IBM Analytics Engine works well for managing and running multiple clusters, keeping them organized and monitoring your budget better by separating out the computer costs from the storage costs. It’s only a good option if you are already working within IBM Cloud, if you are an Azure or AWS shop, you …

Anonymous | TrustRadius Reviewer
March 06, 2019

Sparking the future

Score 8 out of 10
Vetted Review
Verified User
Review Source

Only one of our departments is using Apache Spark to work on very large datasets. We are thinking of implementing it to other departments as well.

Anonymous | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Review Source

All in all, it is a great product and a convenient way of getting a lot of components for big data installed and configured. It provides components for most things you want to perform in ingesting, streaming and setting up for analytics. It also does a great job with the dashboard tool by integratin…

Thomas Young | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User
Review Source

Amazon Elastic MapReduce is used by my department to produce big data analytics for certain clients. The software address data mining and predictive analytics for data sets that take a long time to process. The software is not used for econometric or other analytical evaluation because the size of t…

Thomas Young | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Review Source

The software appears to run more efficiently than other big data tools, such as Hadoop. Given that, Apache Spark is well-suited for querying and trying to make sense of very, very large data sets. The software offers many advanced machine learning and econometrics tools, although these tools are use…

Kunal Sonalkar | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User
Review Source

Hadoop is very well suited for big data modeling problems in various industries like finance, insurance, healthcare, automobiles, CRM, etc. In every industry where you need data analysis in real time, Hadoop is a perfect fit in terms of storage, analysis, retrieval, and processing. It won't be a ver…

Shiv Shivakumar | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User
Review Source

Apache Spark is very well suited for big data analytics in conjunction with the hadoop file system and also does a good job of providing fast access to data in SQL workloads since it has an in memory data processing engine that can very quickly process data. In addition, it can also be used for stre…

Fernando López Bello | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Reseller
Review Source

It is best used where organizations need to build a data lake from scratch, leveraging its capabilities for ingesting huge volumes from a vast number of different sources -including sensors, logs, text, transactional systems and more.

Dhinesh Kumar Ganeshan,PMP,CSM | TrustRadius Reviewer
October 01, 2018

GDK Vora Review

Score 6 out of 10
Vetted Review
Verified User
Review Source

I believe this product has potential but it is not easy to adopt. SAP has to keep in mind how open-source big data technologies are able to deliver quick results. I know SAP is stabilizing and fighting hard against many open source technologies, but it still has a long way to go there.

Carla Borges | TrustRadius Reviewer
Score 10 out of 10
Vetted Review
Verified User
Review Source

It helps us a lot in the transmission of data, as it is 100 times faster than Hadoop MapReduce in memory and 10 times faster in disk, as we work with Java this application. It allows native links for Java programming languages, ​​and as it is compatible with SQL, is completely adapted to the needs o…

Subhadipto Poddar | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User
Review Source

Apache Pig is being used as a map-reduce platform. It is used to handle transportation problems and use large volume of data. It can handle data streaming from multiple sources and join them. This can be used to extract key findings, aggregate results and finally process output which is used for dif…

Kartik Chavan | TrustRadius Reviewer
August 29, 2018

My Apache Hive Review

Score 8 out of 10
Vetted Review
Verified User
Review Source

Apache Hive is being used in our company mainly for big data analysis. It has greatly helps us with data processing & analysis. Querying in Apache Hive is very simple because it is very similar to SQL.

Anonymous | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Review Source

The IBM Analytics Engine is particularly well-suited for situations in which you are required to analyze data from a myriad of sources. The drill-down capabilities make this a very powerful tool, but the implementation of large-scale projects requires watching the tutorials first, in our opinion. It…