Apache Spark vs. Db2 vs. SAP HANA Cloud

Apache Spark

164 Reviews and Ratings

Db2

273 Reviews and Ratings

SAP HANA Cloud

1027 Reviews and Ratings

Learn More

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Apache Spark	Score 9.0 out of 10	N/A	Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.	N/A
Db2	Score 8.7 out of 10	N/A	DB2 is a family of relational database software solutions offered by IBM. It includes standard Db2 and Db2 Warehouse editions, either deployable on-cloud, or on-premise.	$0
SAP HANA Cloud	Score 8.9 out of 10	N/A	SAP HANA is an application that uses in-memory database technology to process very large amounts of real-time data from relational databases, both SAP and non-SAP, in a very short time. The in-memory computing engine allows HANA to process data stored in RAM as opposed to reading it from a disk which means that the data can be accessed in real time by the applications using HANA. The product is sold both as an appliance and as a cloud-based software solution.	$0.95 per month Capacity Units

Pricing

Apache Spark

Db2

SAP HANA Cloud

Editions & Modules

No answers on this topic

Db2 on Cloud Lite: $0
Db2 on Cloud Standard: $99
per month
Db2 Warehouse on Cloud Flex One: $898
per month
Db2 on Cloud Enterprise: $946
per month
Db2 Warehouse on Cloud Flex for AWS: 2,957
per month
Db2 Warehouse on Cloud Flex: $3,451
per month
Db2 Warehouse on Cloud Flex Performance: 13,651
per month
Db2 Warehouse on Cloud Flex Performance for AWS: 13,651
per month
Db2 Standard Edition: Contact Sales
Db2 Advanced Edition: Contact Sales

No answers on this topic

Offerings

Pricing Offerings
Apache Spark	Db2	SAP HANA Cloud
Free Trial
No	Yes	Yes
Free/Freemium Version
No	Yes	No
Premium Consulting/Integration Services
No	Yes	No

Entry-level Setup Fee

No setup fee

Optional

Additional Details

—

Includes a one year free trial.

More Pricing Information

Pricing Info

Community Pulse
	Apache Spark	Db2	SAP HANA Cloud
Considered Multiple Products	Apache Spark Riyaz Khan Staff Engineer Chose Apache Spark Apache Spark is a fast-processing in-memory computing framework. It is 10 times faster than Apache Hadoop. Earlier we were using Apache Hadoop for processing data on the disk but now we are shifted to Apache Spark because of its in-memory computation capability. Also in SAP … Helpful? Verified User Executive Chose Apache Spark Databricks uses Spark as a foundation, and is also a great platform. It does bring several add-ons, which we did not feel needed by the time we evaluated - and haven't needed since then. One interesting plus in our opinion was the engineering support, which is great depending … Incentivized Helpful? SS Shiv Shivakumar Acquisitions Leader Chose Apache Spark We evaluated SAS alongside with Apache Spark but during the course of proof of concept found that Apache Spark was able to support the hadoop eco-system and hadoop file system much better. It was much faster at that time while having the ability to process data quickly for the … Incentivized Helpful?	Db2 Rohan Kumar Software Engineer Chose Db2 Implementation and administration complexity, user learning curves, cost considerations, migration difficulties, and possible support and documentation issues are some of the drawbacks of SAP HANA Cloud. With IBM Db2 it is also incredibly safe, effective, and user-friendly. … Incentivized Helpful?	SAP HANA Cloud Verified User Manager Chose SAP HANA Cloud DB2 and Oracle are more mature products, however, HANA stacks very well against it in terms of reliability and management. Cost is a huge factor in HANA's favor as well, especially given Oracle's excessive costs. Incentivized Helpful? Verified User Employee Chose SAP HANA Cloud DB2 does an implicit ordering. The hardware base was different, that's why it is hard to compare both of them. Incentivized Helpful? LAura Birt Information Technology Manager Chose SAP HANA Cloud IBM has been a credible name for us as we have implemented some of the IBM tools and are going great but when it comes to IBM DB2 it was our not-so-good experience. We planned to save our time and cost with IBM DB2 Incentivized Helpful? Satarupa Panda Senior Test Engineer Chose SAP HANA Cloud Developer friendly environment and real time data access and processing Helpful? Verified User Vice-President Chose SAP HANA Cloud Interestingly Workday financials is getting paired with Workday HCM.. Do not find it a comforting approach if one has to have tight integration with logistics operations Incentivized Helpful? Sreedhar Sree Sr. Automation QA in Machine Learning Chose SAP HANA Cloud As SAP HANA is an in-memory database, it can process data swiftly and can provide detailed analysis reports compared to other tools. Another advantage is it supports different data types, so if any application is looking for scalability, performance, security, and risk … Incentivized Helpful? Verified User Analyst Chose SAP HANA Cloud SAP HANA as a solution works real good. We chose mainly for real time/streaming analytics and it works well. Incentivized Helpful? SJ Sanjiv Jain GCIO Chose SAP HANA Cloud As users are comfortable using SAP HANA and now all solutions available with SAP HANA add-on modules the integration becomes much easier and cost effective else you need to have persons of different skill sets to maintain and operate the systems. Incentivized Helpful? Ali Kazempour Information Technology Adviser Chose SAP HANA Cloud Much faster speeds and features that go hand in hand with other SAP tools and products Incentivized Helpful? Filippo Orlando Advanced Analytics Manager Chose SAP HANA Cloud The choice of the SAP HANA solution was mainly determined by the choice of the new company ERP, which having been SAP, naturally led to the choice of its DB solution. Incentivized Helpful? Verified User Administrator Chose SAP HANA Cloud Similar to other big DBMS, but better or equal at stability and technical maintenance. Better or equal at documentation. There is room for improvement at SQL path analyzing. Incentivized Helpful? VS Vakas Shaikh Financial Analyst Chose SAP HANA Cloud SAP HANA is so agile. It is more than just a database and of course it's a SAP product so reliability. Incentivized Helpful? Verified User Manager Chose SAP HANA Cloud We are trusting SAP and the roadmap they have provided. It just makes sense. Incentivized Helpful? SS Shiv Shivakumar Acquisitions Leader Chose SAP HANA Cloud We compared Microsoft BI with SAP HANA. The reasons to go with SAP HANA were - 1. ability to ingest data into HANA from a non SAP database 2. in-memory database resulting in faster real time analytics 3. ability to scale up 4. ability to replicate data real time 5. very solid … Incentivized Helpful?

Features

Apache Spark

Db2

SAP HANA Cloud

Relational Databases

Comparison of Relational Databases features of Product A and Product B
	Apache Spark - Ratings	Db2 - Ratings	SAP HANA Cloud 7.6 24 Ratings 4% below category average
ACID compliance	00 Ratings	00 Ratings	8.317 Ratings
Database monitoring	00 Ratings	00 Ratings	7.523 Ratings
Database locking	00 Ratings	00 Ratings	7.819 Ratings
Encryption	00 Ratings	00 Ratings	7.320 Ratings
Disaster recovery	00 Ratings	00 Ratings	7.920 Ratings
Flexible deployment	00 Ratings	00 Ratings	7.322 Ratings
Multiple datatypes	00 Ratings	00 Ratings	7.422 Ratings

Best Alternatives
	Apache Spark	Db2	SAP HANA Cloud
Small Businesses	No answers on this topic	InterSystems IRIS Score 7.9 out of 10	InterSystems IRIS Score 7.9 out of 10
Medium-sized Companies	Cloudera Manager Score 9.9 out of 10	InterSystems IRIS Score 7.9 out of 10	InterSystems IRIS Score 7.9 out of 10
Enterprises	IBM Analytics Engine Score 7.2 out of 10	SAP IQ Score 10.0 out of 10	SAP IQ Score 10.0 out of 10
All Alternatives	View all alternatives	View all alternatives	View all alternatives

User Ratings
	Apache Spark	Db2	SAP HANA Cloud
Likelihood to Recommend	9.0 (24 ratings)	8.9 (113 ratings)	9.6 (308 ratings)
Likelihood to Renew	10.0 (1 ratings)	7.9 (12 ratings)	10.0 (11 ratings)
Usability	8.0 (4 ratings)	9.2 (9 ratings)	9.6 (29 ratings)
Availability	- (0 ratings)	9.1 (64 ratings)	3.6 (1 ratings)
Performance	- (0 ratings)	9.1 (12 ratings)	3.6 (1 ratings)
Support Rating	8.7 (4 ratings)	8.9 (6 ratings)	9.1 (251 ratings)
In-Person Training	- (0 ratings)	8.2 (1 ratings)	- (0 ratings)
Implementation Rating	- (0 ratings)	5.9 (3 ratings)	9.1 (2 ratings)
Configurability	- (0 ratings)	9.1 (2 ratings)	3.6 (1 ratings)
Ease of integration	- (0 ratings)	8.0 (4 ratings)	4.5 (1 ratings)
Product Scalability	- (0 ratings)	8.5 (66 ratings)	4.5 (1 ratings)
Vendor post-sale	- (0 ratings)	8.9 (2 ratings)	4.5 (1 ratings)
Vendor pre-sale	- (0 ratings)	8.9 (2 ratings)	3.6 (1 ratings)

User Testimonials
	Apache Spark	Db2	SAP HANA Cloud
Likelihood to Recommend	Apache Well suited: To most of the local run of datasets and non-prod systems - scalability is not a problem at all. Including data from multiple types of data sources is an added advantage. MLlib is a decently nice built-in library that can be used for most of the ML tasks. Less appropriate: We had to work on a RecSys where the music dataset that we used was around 300+Gb in size. We faced memory-based issues. Few times we also got memory errors. Also the MLlib library does not have support for advanced analytics and deep-learning frameworks support. Understanding the internals of the working of Apache Spark for beginners is highly not possible. Incentivized Ananth Gouri Assistant Professor Read full review	IBM I have primarily used it as the basis for a SIS - but I have migrated more than a few systems from there database systems to DB2 (Filemaker, MySQL, etc.). DB2 does have a better structural approach, as opposed to Filemaker, which allows for more data consistency, but this can also lead to an inflexibility that can sometimes be counterintuitive when attempting to compensate for the flexibility of the work environment as Schools tend to have an all in one approach. Incentivized Verified User Anonymous Read full review	SAP I think if you have a large organization, it's probably the product and the marketplace to go to. We're a large management consulting firm operating in four to seven countries. And generally speaking, I think that's the size and the scope where it scales best. I can't speak to smaller companies, but I can't see smaller companies leveraging the benefits as much as a larger organization can. Incentivized Verified User Anonymous Read full review
Pros	Apache Rich APIs for data transformation making for very each to transform and prepare data in a distributed environment without worrying about memory issues Faster in execution times compare to Hadoop and PIG Latin Easy SQL interface to the same data set for people who are comfortable to explore data in a declarative manner Interoperability between SQL and Scala / Python style of munging data Incentivized Nitin Pasumarthy Software Engineer Read full review	IBM While we query a large set of data, the results are generally available within a minute or so. Always reliable - I have never experienced an application going down. It is easy to write queries and find tables and columns. We can log in smoothly without any headaches. Incentivized GH Ganesh Hegde Senior System Engineer Read full review	SAP Real-time reporting and analytics on data: because of its in-memory architecture, it is perfect for businesses that need to make quick decisions based on current information. Managing workload with complex data: it can handle a vast range of data types, including relational, documental, geospatial, graph, vector, and time series data. Developing and deploying intelligent data applications: it provides various tools for such applications and can be used for machine learning and artificial intelligence to automate tasks, gain insights from data, and make predictions. SP sonam pahwa SAP consultant Read full review
Cons	Apache Memory management. Very weak on that. PySpark not as robust as scala with spark. spark master HA is needed. Not as HA as it should be. Locality should not be a necessity, but does help improvement. But would prefer no locality Incentivized Anson Abraham Data Czar Read full review	IBM Learning curve for DB resources - Improvements to UI or native command line built-ins can help with increasing efficiencies for DB resources Better resource utilization monitoring and recommendations Continue to adopt support for modern frameworks and languages making it easier for organizations to see making Db2 the easy first choice Incentivized Verified User Anonymous Read full review	SAP Requires higher processing power, otherwise it won't fly. How ever computing costs are lower. Incase you are migrating to cloud please do not select the highest config available in that series . Upgrading it later against a reserved instance can cost you dearly with a series change Lack of clarity on licensing is one major challenge Unless S/4 with additional features are enabled mere migration HANA DB is not a rewarding journey. Power is in S/4 Incentivized Verified User Anonymous Read full review
Likelihood to Renew	Apache Capacity of computing data in cluster and fast speed. Steven Li Senior Software Developer (Consultant) Read full review	IBM The DB2 database is a solid option for our school. We have been on this journey now for 3-4 years so we are still adapting to what it can do. We will renew our use of DB2 because we don’t see. Major need to change. Also, changing a main database in a school environment is a major project, so we’ll avoid that if possible. Incentivized Verified User Anonymous Read full review	SAP We would rate our likelihood of renewing at 9/10. SAP HANA Cloud has proven to be a highly reliable and scalable data platform that consistently delivers strong performance. Its seamless integration with our overall SAP landscape, combined with improved analytics and real-time data capabilities, makes it a core part of our long-term technology strategy. Incentivized Verified User Anonymous Read full review
Usability	Apache If the team looking to use Apache Spark is not used to debug and tweak settings for jobs to ensure maximum optimizations, it can be frustrating. However, the documentation and the support of the community on the internet can help resolve most issues. Moreover, it is highly configurable and it integrates with different tools (eg: it can be used by dbt core), which increase the scenarios where it can be used Incentivized Verified User Anonymous Read full review	IBM You have to be well versed in using the technology, not only from a GUI interface but from a command line interface to successfully use this software to its fullest. Incentivized J.R. Bullington Senior Infrastructure Engineer Read full review	SAP It is useful solution which helps you improve SAP applications performance. It offers you faster data processing, robust disaster management, higher availability, scalability, advanced analytical capabilities, etc. It provides you simple, clean, organized user interface designed to facilitate smooth navigation. Its user interface is simple and intuitive which allow you to complete task efficiently. Incentivized AD Arvind Dixit Assistant Manager - IT Read full review
Reliability and Availability	Apache No answers on this topic	IBM I have never had DB2 go down unexpectedly. It just works solidly every day. When I look at the logs, sometimes DB2 has figured out there was a need to build an index. Instead of waiting for me to do it, the database automatically created the index for me. At my current company, we have had zero issues for the past 8 years. We have upgrade the server 3 times and upgraded the OS each time and the only thing we saw was that DB2 got better and faster. It is simply amazing. Incentivized Stewart Locklear Senior Information Technology Project Manager Read full review	SAP so far, we didn't get any outage Incentivized Verified User Anonymous Read full review
Performance	Apache No answers on this topic	IBM The performances are exceptional if you take care to maintain the database. It is a very powerful tool and at the same time very easy to use. In our installation, we expect a DB machine on the mainframe with access to the database through ODBC connectors directly from branch servers, with fabulous end users experience. Incentivized Verified User Anonymous Read full review	SAP so far good Incentivized Verified User Anonymous Read full review
Support Rating	Apache 1. It integrates very well with scala or python. 2. It's very easy to understand SQL interoperability. 3. Apache is way faster than the other competitive technologies. 4. The support from the Apache community is very huge for Spark. 5. Execution times are faster as compared to others. 6. There are a large number of forums available for Apache Spark. 7. The code availability for Apache Spark is simpler and easy to gain access to. 8. Many organizations use Apache Spark, so many solutions are available for existing applications. YM Yogesh Mhasde Technical Manager Read full review	IBM Easily the best product support team. :) Whenever we have questions, they have answered those in a timely manner and we like how they go above and beyond to help. Incentivized Jay Padhya Data Scientist Read full review	SAP However, I am not the right person to answer this as we have another department to handle support and contact the service provider for any support required. Although i will say that they are the quick respondent and knows how to handle querry of the customers and provide quick and better support. Incentivized Verified User Anonymous Read full review
In-Person Training	Apache No answers on this topic	IBM the material was very clear and all subjects have been handled Incentivized GP Guy Przytula DBA Read full review	SAP No answers on this topic
Implementation Rating	Apache No answers on this topic	IBM db2 work well with the application, also the replication tool can keep it up richard shen Senior DBA Read full review	SAP Professional GIS people are some of the most risk-averse there are, and it's difficult to get them to move to HANA in one step. Start with small projects building to 80% use of HANA spatial over time. Incentivized Verified User Anonymous Read full review
Alternatives Considered	Apache Spark in comparison to similar technologies ends up being a one stop shop. You can achieve so much with this one framework instead of having to stitch and weave multiple technologies from the Hadoop stack, all while getting incredibility performance, minimal boilerplate, and getting the ability to write your application in the language of your choosing. Incentivized Verified User Anonymous Read full review	IBM DB2 was more scalable and easily configurable than other products we evaluated and short listed in terms of functionality and pricing. IBM also had a good demo on premise and provided us a sandbox experience to test out and play with the product and DB2 at that time came out better than other similar products. Incentivized Verified User Anonymous Read full review	SAP I have deep knowledge of other disk based DBMSs. They are venerable technology, but the attempts to extend them to current architectures belie the fact they are built on 40 year old technology. There are some good columnar in-memory databases but they lack the completeness of capability present in the HANA platform. Incentivized TT Tom Turchioe AP Ecosystem Technology Lead - Global Read full review
Contract Terms and Pricing Model	Apache No answers on this topic	IBM No answers on this topic	SAP I don't have visibility in licensing Incentivized Verified User Anonymous Read full review
Scalability	Apache No answers on this topic	IBM By using DB2 only to support my IzPCA activities, my knowledge here is somewhat limited. Anyway, from what I was able to understand, DB2 is extremely scallable. Maybe the information below could serve as an example of scalability. Customer have an huge mainframe environment, 13x z15 CECs, around 80 LPARs, and maybe more than 50 Sysplexes (I am not totally sure about this last figure...) Today we have 7 IzPCA databases, each one in a distinct Syplex. Plans are underway to have, at the end, an small LPAR, with only one DB2 sub-system, and with only one database, then transmit the data from a lot of other LPARs, and then process all the data in this only one database. The IzPCA collect process (read the data received, manipulate it, and insert rows in the tables) today is a huge process, demanding many elapsed hours, and lots of CPU. Almost 100% of the tables are PBR type, insert jobs run in parallel, but in 4 of the 7 database, it is a really a huge and long process. Combining the INSERTs loads from the 7 databases in only one will be impossible.......,,,, But, IzPCA recently introduced a new feature, called "Continuous Collector". By using that feature, small amounts of data will be transmited to the central LPAR at every 5 minutes (or even less), processed immediately,in a short period of time, and with small use of CPU, instead of one or two transmissions by day, of very large amounts of data and the corresponding collect jobs occurring only once or twice a day, with long elapsed times, and huge comsumption of CPU I suspect the total CPU seconds consumed will be more or less the same in both cases, but in the new method it will occur in small bursts many times a day!! Incentivized RV Ricardo Viegas Systems Support Read full review	SAP Limitation of training deliverable by organization Incentivized SA Shoaib Ansari DEVELOPER Read full review
Professional Services	Apache No answers on this topic	IBM No answers on this topic	SAP We are still in process for the first applciaiton Incentivized Verified User Anonymous Read full review
Return on Investment	Apache Business leaders are able to take data driven decisions Business users are able access to data in near real time now . Before using spark, they had to wait for at least 24 hours for data to be available Business is able come up with new product ideas Incentivized Surendranatha Reddy Chappidi Senior Data Engineer Read full review	IBM Negative: Difficult and manual deployment Negative: Missing assistants from common monitoring metrics Positive: Stability Positive: Performance Positive: Resiliency and high availability (HADR) Positive: Data Replication (Q-Rep) Positive: Interaction with storage subsystems for backups (TSM, SVC) Positive: Gigantic monitoring features in the form of table functions Incentivized MS Miguel Sanjurjo Lead Expert Managed Platforms & Infrastructure Read full review	SAP ROI has always been high in terms of the functionality that it offers and the security features it comes with. Managing large volumes of data in real-time is not an easy task, but it does it pretty well with faster data processing. Incentivized Nishant Kumar Lead Engineer Read full review
ScreenShots		Db2 Screenshots