What users are saying about

Apache Spark

97 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 8.6 out of 101

IBM Analytics Engine

19 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 7.8 out of 101

Add comparison

Likelihood to Recommend

Apache Spark

Apache Spark has rich APIs for regular data transformations or for ML workloads or for graph workloads, whereas other systems may not such a wide range of support. Choose it when you need to perform data transformations for big data as offline jobs, whereas use MongoDB-like distributed database systems for more realtime queries.
Nitin Pasumarthy profile photo

IBM Analytics Engine

  • Well suited for my big data related project or a static data set analysis especially for uploading huge dataset to the cluster.
  • But had some issues with connecting IoT real-time data and feeding to Power BI. It might be my understanding please take it as a mere comment rather than a suggestion.
Prasanna Nattuthurai profile photo

Pros

  • Rich APIs for data transformation making for very each to transform and prepare data in a distributed environment without worrying about memory issues
  • Faster in execution times compare to Hadoop and PIG Latin
  • Easy SQL interface to the same data set for people who are comfortable to explore data in a declarative manner
  • Interoperability between SQL and Scala / Python style of munging data
Nitin Pasumarthy profile photo
  • Compact clusters. Via the trial versions that I use, I have a great insight of how different organizations create clusters and manage data
  • Fast: the load time at registration is a bit slow. But after creating the first cluster, everything went way faster
  • Realistic: hands on experience
  • Reasonable cost
No photo available

Cons

  • Data visualization.
  • Waiting for Web Development for small apps to be started with Spark as backbone middleware and HDFS as data retrieval file system.
  • Transformations and actions available are limited so must modify API to work for more features.
Kamesh Emani profile photo
  • Lagging at some point. Inconsistency in maintaining and running clusters.
  • Should allow the cluster to last longer before refreshing
  • It’s a bit confusing to navigate through the site. Sometime I thought I hit the right section then it lead me to another one.
  • The log in page is inconsistent. I could find 3 different IBM log in page via Google. And then have to click many things that will lead to where I want.
No photo available

Alternatives Considered

Apache Pig and Apache Hive provide most of the things spark provide but apache spark has more features like actions and transformations which are easy to code. Spark uses optimization technique as we can select driver program and manipulate DAG (Directed Acyclic Graph)Python can be used even for data transformations but it requires lot of coding compared to Spark and it is even so slow.
Kamesh Emani profile photo
  • I have been using Azure for my previous analysis, I had a difficult time in understanding the Analytics engine rather IBM provided step by step tutorial for setup.
  • Also turning off a machine was not an option in Azure for some of the services so I had to pay for the service whether I use it or not
Prasanna Nattuthurai profile photo

Return on Investment

  • By learning Spark, we can become certified and/or provide proper recommendations or implementations on Spark solutions.
  • With a background in Hadoop distributed processes, it has been easy to understand and diagnose how Spark handles the transfer of data within a cluster. Especially when using YARN as the resource manager and HDFS as the data source.
  • Staying up to date with the latest changes to Spark has become a repetitive task. While most Hadoop distributions only support Spark 1.6 at the moment, Spark 2.0 has introduced some useful features, but those require a re-write of existing applications.
Jordan Moore profile photo
  • Increasing learning success. My class and I were able to practice real tools
  • The only downsize is without the school, it would be unaffordable to use the tools
No photo available

Pricing Details

Apache Spark

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details

IBM Analytics Engine

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details