Apache Spark vs. Heroku Platform

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Apache Spark
Score 8.9 out of 10
N/A
N/AN/A
Heroku Platform
Score 8.9 out of 10
N/A
The Heroku Platform, now from Salesforce, is a platform-as-a-service based on a managed container system, with integrated data services and ecosystem for deploying modern apps. It takes an app-centric approach for software delivery, integrated with developer tools and workflows. It’s three main tool are: Heroku Developer Experience (DX), Heroku Operational Experience (OpEx), and Heroku Runtime. Heroku Developer Experience (DX) Developers deploy directly from tools like…
$85
per month
Pricing
Apache SparkHeroku Platform
Editions & Modules
No answers on this topic
Production
$25.00
per month
Advanced
$250.00
per month
Offerings
Pricing Offerings
Apache SparkHeroku Platform
Free Trial
NoNo
Free/Freemium Version
NoNo
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Community Pulse
Apache SparkHeroku Platform
Top Pros
Top Cons
Features
Apache SparkHeroku Platform
Platform-as-a-Service
Comparison of Platform-as-a-Service features of Product A and Product B
Apache Spark
-
Ratings
Heroku Platform
8.1
43 Ratings
1% above category average
Ease of building user interfaces00 Ratings7.626 Ratings
Scalability00 Ratings8.343 Ratings
Platform management overhead00 Ratings7.642 Ratings
Workflow engine capability00 Ratings8.429 Ratings
Platform access control00 Ratings7.142 Ratings
Services-enabled integration00 Ratings8.141 Ratings
Development environment creation00 Ratings8.738 Ratings
Development environment replication00 Ratings8.737 Ratings
Issue monitoring and notification00 Ratings8.241 Ratings
Issue recovery00 Ratings8.438 Ratings
Upgrades and platform fixes00 Ratings8.443 Ratings
Best Alternatives
Apache SparkHeroku Platform
Small Businesses

No answers on this topic

AWS Elastic Beanstalk
AWS Elastic Beanstalk
Score 9.5 out of 10
Medium-sized Companies
Cloudera Manager
Cloudera Manager
Score 9.9 out of 10
AWS Elastic Beanstalk
AWS Elastic Beanstalk
Score 9.5 out of 10
Enterprises
IBM Analytics Engine
IBM Analytics Engine
Score 7.8 out of 10
AWS Elastic Beanstalk
AWS Elastic Beanstalk
Score 9.5 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
Apache SparkHeroku Platform
Likelihood to Recommend
9.4
(24 ratings)
7.0
(47 ratings)
Likelihood to Renew
10.0
(1 ratings)
9.5
(6 ratings)
Usability
8.7
(4 ratings)
9.2
(17 ratings)
Availability
-
(0 ratings)
8.0
(1 ratings)
Performance
-
(0 ratings)
9.0
(1 ratings)
Support Rating
8.7
(4 ratings)
8.7
(19 ratings)
Online Training
-
(0 ratings)
6.0
(1 ratings)
Implementation Rating
-
(0 ratings)
9.0
(3 ratings)
User Testimonials
Apache SparkHeroku Platform
Likelihood to Recommend
Apache
Well suited: To most of the local run of datasets and non-prod systems - scalability is not a problem at all. Including data from multiple types of data sources is an added advantage. MLlib is a decently nice built-in library that can be used for most of the ML tasks. Less appropriate: We had to work on a RecSys where the music dataset that we used was around 300+Gb in size. We faced memory-based issues. Few times we also got memory errors. Also the MLlib library does not have support for advanced analytics and deep-learning frameworks support. Understanding the internals of the working of Apache Spark for beginners is highly not possible.
Read full review
Salesforce
Heroku is very well suited for startups looking to get a server stack up and running quickly. There is little to no overhead when managing your instances. However, you'll need a background in basic DevOps or system management to make sure everything is set up correctly. In addition, it's easy to accidentally go crazy on pricing. Make sure you're only creating the server instances you need to run the base application and set up an auto-scaler plugin to handle peaks.
Read full review
Pros
Apache
  • Rich APIs for data transformation making for very each to transform and prepare data in a distributed environment without worrying about memory issues
  • Faster in execution times compare to Hadoop and PIG Latin
  • Easy SQL interface to the same data set for people who are comfortable to explore data in a declarative manner
  • Interoperability between SQL and Scala / Python style of munging data
Read full review
Salesforce
  • Heroku has a very simple deployment model, making it easy to get your application up-and-running with minimal effort. We can focus on our efforts the unique aspects of our application.
  • The robust add-on marketplace makes it easy to try out new approaches with minimal effort and investment -- and when we settle on a solution, we can easily scale it.
  • Heroku's support is quite good -- their staff is quite technical and willing to get into the weeds to diagnose even complicated problems.
Read full review
Cons
Apache
  • Memory management. Very weak on that.
  • PySpark not as robust as scala with spark.
  • spark master HA is needed. Not as HA as it should be.
  • Locality should not be a necessity, but does help improvement. But would prefer no locality
Read full review
Salesforce
  • Large price jumps between certain resource tiers (2x Dyno for $50 per month versus Performance Dyno for $250). Free Postgres next jumps to $50 per month.
  • Marketing/Branding to non-technical stakeholders. As the years pass, I've had to fight more to convince stakeholders on the value of Heroku over AWS.
  • Improve Buildpack documentation. This is one area where Heroku's documentation is fairly confusing.
Read full review
Likelihood to Renew
Apache
Capacity of computing data in cluster and fast speed.
Read full review
Salesforce
Heroku is easy to use, services a ton of functions for you out of the box, and provides a means to get a software product off the ground and managed quickly and easily. The tools provide allows a small to medium size org to move very quickly. The CLI tools provided make managing an entire technical infrastructure simple.
Read full review
Usability
Apache
If the team looking to use Apache Spark is not used to debug and tweak settings for jobs to ensure maximum optimizations, it can be frustrating. However, the documentation and the support of the community on the internet can help resolve most issues. Moreover, it is highly configurable and it integrates with different tools (eg: it can be used by dbt core), which increase the scenarios where it can be used
Read full review
Salesforce
Easy to use web based console and easy to use command line tools; deployment is done directly from a GIT repository. What more could you ask for? The one thing that keeps me from giving it a 10 is that custom build packs are almost incomprehensible. We used one for a while because we needed cairo graphics processing. Fortunately, I was able to figure out a different way to do what we needed so that we could get off the custom build pack.
Read full review
Reliability and Availability
Apache
No answers on this topic
Salesforce
Heroku availability correlates pretty strongly to AWS US EAST availability. We had a couple of times where there was a Heroku-specific issue but not for the last 7-8 months.
Read full review
Performance
Apache
No answers on this topic
Salesforce
The only issue that I ever have is that about 1 out of 20 deployments (git push) will hang and need to be cancelled and done again.
Read full review
Support Rating
Apache
1. It integrates very well with scala or python. 2. It's very easy to understand SQL interoperability. 3. Apache is way faster than the other competitive technologies. 4. The support from the Apache community is very huge for Spark. 5. Execution times are faster as compared to others. 6. There are a large number of forums available for Apache Spark. 7. The code availability for Apache Spark is simpler and easy to gain access to. 8. Many organizations use Apache Spark, so many solutions are available for existing applications.
Read full review
Salesforce
I've used it for many years without facing any major problem. It's not hard at all to get used to it, it's documentation is outstanding and simple. We are close to 2020 and I don't think most of the existing companies or startups should still face old problems such as wasting time deploying code and calculate computing resources.
Read full review
Implementation Rating
Apache
No answers on this topic
Salesforce
Be ready to pay a bit more than expected in the beginning if you're migrating from a big server. The application is probably not ready for the change and you have to keep improving it with time.
It's also important to consider that you can't save anything to the disc as it will be lost when your application restarts, so you have to think about using something like S3.
Read full review
Alternatives Considered
Apache
Spark in comparison to similar technologies ends up being a one stop shop. You can achieve so much with this one framework instead of having to stitch and weave multiple technologies from the Hadoop stack, all while getting incredibility performance, minimal boilerplate, and getting the ability to write your application in the language of your choosing.
Read full review
Salesforce
Heroku is the more expensive option for hosting compared to some of the cloud platforms we investigated, but it's worth it for us because of the plug-and-play nature of Heroku deployment. We can be up and running in a few minutes and know with precision how much it will cost us each month to run the application, unlike Amazon Web Services where you have to go to great pains to configure it correctly or else you might end up with a shocking monthly bill. Overall, spending the time to configure Amazon Web Services or one of its competitors is likely the more affordable and powerful choice, because you have control over so many specifics of the configuration. But it also requires the burden of continuing to maintain and update your AWS instance, whereas with Heroku they take care of security fixes and platform upgrades. It's a great service and we are happy to pay the extra cost for the value-adds Heroku provides.
Read full review
Return on Investment
Apache
  • Business leaders are able to take data driven decisions
  • Business users are able access to data in near real time now . Before using spark, they had to wait for at least 24 hours for data to be available
  • Business is able come up with new product ideas
Read full review
Salesforce
  • It has been critical in seamlessly operating our platform with runs all of our programs.
  • It has been impressive with its ability to scale quickly which results in the growth of our work.
  • It allows for tracking of different features which allows for quick problem solving which saves us time.
Read full review
ScreenShots