What users are saying about
103 Ratings
103 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 8.5 out of 101
10 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 7.9 out of 101

Add comparison

Likelihood to Recommend

Apache Spark

Spark is great as a workflow process and extract transform layer process tool. Is really good for machine learning especially for large datasets that can be processed in split file paralallelization. Spark streaming is scalable for close to real-time data workflow process.what it's not good for, is smaller subset of data processing.
Anson Abraham profile photo

Cloudera Manager

It would be suited for customers who feel more comfortable with using a GUI. It is less appropriate for developers or engineers who are comfortable with command line
Ethan Tran profile photo

Pros

  • We used to make our batch processing faster. Spark is faster in batch processing than MapReduce with it in memory computing
  • Spark will run along with other tools in the Hadoop ecosystem including Hive and Pig
  • Spark supports both batch and real-time processing
  • Apache Spark has Machine Learning Algorithms support
No photo available
  • Cloudera Manager has an easy to use web GUI. You can start and stop cluster and services. It will start and stop services in a cluster in the right order. You can monitor the cluster, services, and physical host hardware as well.
  • Cloudera Manager has an easy to use API that allows us to create scripts to automate deployment process.
  • Cloudera Manager has an option to add additional services that you could manage via the web GUI.
Ethan Tran profile photo

Cons

  • Data visualization.
  • Waiting for Web Development for small apps to be started with Spark as backbone middleware and HDFS as data retrieval file system.
  • Transformations and actions available are limited so must modify API to work for more features.
Kamesh Emani profile photo
  • Cloudera Manager needs to be more agile with integrating other applications, such as Accumulo 1.7, to their software.
  • Cloudera Manager can do a better job at explaining why a node fails to add to a cluster using their assistant.
  • Cloudera Manager should show graphs only when there is data, instead of showing just an empty box.
Ethan Tran profile photo

Likelihood to Renew

No score
No answers yet
No answers on this topic
Cloudera Manager8.5
Based on 2 answers
It meets all my customer's needs.
Ethan Tran profile photo

Alternatives Considered

Apache Pig and Apache Hive provide most of the things spark provide but apache spark has more features like actions and transformations which are easy to code. Spark uses optimization technique as we can select driver program and manipulate DAG (Directed Acyclic Graph)Python can be used even for data transformations but it requires lot of coding compared to Spark and it is even so slow.
Kamesh Emani profile photo
I have not used any competitors, such as Hortonworks, because Cloudera Manager just works and meets all my customer's needs. I only have deployed Hadoop using command line, which is not easy to use and manage.
Ethan Tran profile photo

Return on Investment

  • By learning Spark, we can become certified and/or provide proper recommendations or implementations on Spark solutions.
  • With a background in Hadoop distributed processes, it has been easy to understand and diagnose how Spark handles the transfer of data within a cluster. Especially when using YARN as the resource manager and HDFS as the data source.
  • Staying up to date with the latest changes to Spark has become a repetitive task. While most Hadoop distributions only support Spark 1.6 at the moment, Spark 2.0 has introduced some useful features, but those require a re-write of existing applications.
Jordan Moore profile photo
  • Cloudera Manager has a positive impact because our users have the windows mentality. They are more comfortable with using an easy to use GUI.
  • We save so much time and effort by having an automated scripted to deploy our environment. It takes at the most 15 minutes to run a one line script to build zookeepers, hadoop, and accumulo database. Run it, come back 15 minutes later, and its done.
  • Due to ease of deployment, our developers and system administrators are happy that they can quickly build and test their application.
Ethan Tran profile photo

Pricing Details

Apache Spark

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details

Cloudera Manager

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details