Hadoop-Related Software - Reviews

Show Filters 
Hide Filters 
Filter 667 vetted Hadoop-Related reviews and ratings
Clear all filters
Overall Rating
Reviewer's Company Size
Vendor
Product
Last Updated
Industry
Department
Experience
Job Type
Role

Reviews (1-25 of 111)

No photo available
Score 8 out of 10
Vetted Review
Verified User
Review Source
No photo available
September 15, 2019

Cloudera review

Score 6 out of 10
Vetted Review
Verified User
Review Source
No photo available
Score 9 out of 10
Vetted Review
Verified User
Review Source

If your data is very huge, I recommend converting the underlying technology into Apache Spark. This will save you a lot of time and effort in the near future due to your growing data. The Apache Spark scalability feature also means it handles all the future data related processing.

No photo available
March 16, 2019

Apache Spark Review

Score 7 out of 10
Vetted Review
Verified User
Review Source

We used Apache Spark within our department as a Solution Architecture team. It helped make big data processing more efficient since the same framework can be used for batch and stream processing.

Thomas Young profile photo
Score 8 out of 10
Vetted Review
Verified User
Review Source

Amazon Elastic MapReduce is used by my department to produce big data analytics for certain clients. The software address data mining and predictive analytics for data sets that take a long time to process. The software is not used for econometric or other analytical evaluation because the size o...

Thomas Young profile photo
Score 7 out of 10
Vetted Review
Verified User
Review Source

The software appears to run more efficiently than other big data tools, such as Hadoop. Given that, Apache Spark is well-suited for querying and trying to make sense of very, very large data sets. The software offers many advanced machine learning and econometrics tools, although these tools are ...

Kunal Sonalkar profile photo
Score 8 out of 10
Vetted Review
Verified User
Review Source

Hadoop is very well suited for big data modeling problems in various industries like finance, insurance, healthcare, automobiles, CRM, etc. In every industry where you need data analysis in real time, Hadoop is a perfect fit in terms of storage, analysis, retrieval, and processing. It won't be a ...

Shiv Shivakumar profile photo
Score 9 out of 10
Vetted Review
Verified User
Review Source

Apache Spark is very well suited for big data analytics in conjunction with the hadoop file system and also does a good job of providing fast access to data in SQL workloads since it has an in memory data processing engine that can very quickly process data. In addition, it can also be used for s...

Fernando López Bello profile photo
Score 9 out of 10
Vetted Review
Reseller
Review Source

It is best used where organizations need to build a data lake from scratch, leveraging its capabilities for ingesting huge volumes from a vast number of different sources -including sensors, logs, text, transactional systems and more.

Dhinesh Kumar Ganeshan,PMP,CSM profile photo
October 01, 2018

GDK Vora Review

Score 6 out of 10
Vetted Review
Verified User
Review Source

I believe this product has potential but it is not easy to adopt. SAP has to keep in mind how open-source big data technologies are able to deliver quick results. I know SAP is stabilizing and fighting hard against many open source technologies, but it still has a long way to go there.

Subhadipto Poddar profile photo
Score 8 out of 10
Vetted Review
Verified User
Review Source

Apache Pig is being used as a map-reduce platform. It is used to handle transportation problems and use large volume of data. It can handle data streaming from multiple sources and join them. This can be used to extract key findings, aggregate results and finally process output which is used for ...

No photo available
Score 9 out of 10
Vetted Review
Verified User
Review Source

IBM Analytics Engine works well for managing and running multiple clusters, keeping them organized and monitoring your budget better by separating out the computer costs from the storage costs. It’s only a good option if you are already working within IBM Cloud, if you are an Azure or AWS shop, y...

No photo available
March 06, 2019

Sparking the future

Score 8 out of 10
Vetted Review
Verified User
Review Source

Only one of our departments is using Apache Spark to work on very large datasets. We are thinking of implementing it to other departments as well.

No photo available
Score 7 out of 10
Vetted Review
Verified User
Review Source

All in all, it is a great product and a convenient way of getting a lot of components for big data installed and configured. It provides components for most things you want to perform in ingesting, streaming and setting up for analytics. It also does a great job with the dashboard tool by integra...

No photo available
Score 7 out of 10
Vetted Review
Verified User
Review Source

The IBM Analytics Engine is particularly well-suited for situations in which you are required to analyze data from a myriad of sources. The drill-down capabilities make this a very powerful tool, but the implementation of large-scale projects requires watching the tutorials first, in our opinion....

No photo available
Score 8 out of 10
Vetted Review
Verified User
Review Source

IBM Analytics Engine is being used by my organization to analyze our data across multiple applications. We use it primarily to gather data to be used to analyze application performance and reliability. This data allows us to ensure that we are delivering the best possible application performance ...

No photo available
Score 9 out of 10
Vetted Review
Verified User
Review Source

My company is a non-profit healthcare delivery institute consists of hospitals and clinics. We started a new working group to see possibility of using big data technology with machine learning to improve patient care and healthcare quality improvement. We adopted HortonWorks data platform to stor...

No photo available
October 16, 2018

Oracle experience

Score 10 out of 10
Vetted Review
Verified User
Review Source

We use the cloud storage and the portability for a lot of our documents which usually contain very confidential information. Oracle has been able to provide strong support in helping us maintain Confidentiality. The speed of the system is also very impressive.

Nitin Pasumarthy profile photo
Score 10 out of 10
Vetted Review
Verified User
Review Source

Spark really shines with its bigger API support and its ability to read from and write to multiple data sources. Using Spark one can easily switch between declarative versus imperative versus functional type programming easily based on the situation. Also it doesn't need special data ingestion or...

Kartik Chavan profile photo
Score 7 out of 10
Vetted Review
Verified User
Review Source

As a requirement of a distributed processing system, we are using Apache Pig within our Information Technology department. I use it to an extent of generating reports with advanced statistical methods, both for internal use as well as external purposes. But our Data Science team and Data Engineer...

Carla Borges profile photo
Score 10 out of 10
Vetted Review
Verified User
Review Source

It helps us a lot in the transmission of data, as it is 100 times faster than Hadoop MapReduce in memory and 10 times faster in disk, as we work with Java this application. It allows native links for Java programming languages, ​​and as it is compatible with SQL, is completely adapted to the need...