Apache Spark vs. Oracle Database

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Apache Spark
Score 8.9 out of 10
N/A
N/AN/A
Oracle Database
Score 7.7 out of 10
N/A
Oracle Database, currently in edition 23c, offers native support for property graph data structures and graph queries. If you're looking for flexibility to build graphs in conjunction with transactional data, JSON, Spatial, and other data types, we got you covered. Developers can now easily build graph applications with SQL using existing SQL development tools and frameworks.N/A
Pricing
Apache SparkOracle Database
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
Apache SparkOracle Database
Free Trial
NoNo
Free/Freemium Version
NoNo
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Best Alternatives
Apache SparkOracle Database
Small Businesses

No answers on this topic

Google Cloud SQL
Google Cloud SQL
Score 8.8 out of 10
Medium-sized Companies
Cloudera Manager
Cloudera Manager
Score 9.9 out of 10
Snowflake
Snowflake
Score 8.8 out of 10
Enterprises
IBM Analytics Engine
IBM Analytics Engine
Score 7.9 out of 10
SAP IQ
SAP IQ
Score 9.0 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
Apache SparkOracle Database
Likelihood to Recommend
10.0
(23 ratings)
9.0
(178 ratings)
Likelihood to Renew
10.0
(1 ratings)
9.0
(6 ratings)
Usability
10.0
(3 ratings)
7.4
(5 ratings)
Support Rating
8.7
(4 ratings)
10.0
(4 ratings)
Implementation Rating
-
(0 ratings)
9.6
(3 ratings)
User Testimonials
Apache SparkOracle Database
Likelihood to Recommend
Apache
Well suited: To most of the local run of datasets and non-prod systems - scalability is not a problem at all. Including data from multiple types of data sources is an added advantage. MLlib is a decently nice built-in library that can be used for most of the ML tasks. Less appropriate: We had to work on a RecSys where the music dataset that we used was around 300+Gb in size. We faced memory-based issues. Few times we also got memory errors. Also the MLlib library does not have support for advanced analytics and deep-learning frameworks support. Understanding the internals of the working of Apache Spark for beginners is highly not possible.
Read full review
Oracle
I believe Oracle Database is still the best RDBMS database which is the database to consider for OLTP applications and for Adhoc requests. They are good in Datawarehousing in certain aspects but not the best. Oracle is also a great database for scaling up with their Clusterware solution which also makes the database highly available with services moving to the live instance without much trouble.
Read full review
Pros
Apache
  • Rich APIs for data transformation making for very each to transform and prepare data in a distributed environment without worrying about memory issues
  • Faster in execution times compare to Hadoop and PIG Latin
  • Easy SQL interface to the same data set for people who are comfortable to explore data in a declarative manner
  • Interoperability between SQL and Scala / Python style of munging data
Read full review
Oracle
  • Best thing about it is that it supports PL/SQL which is helpful in writing complex quarries easily.
  • Its storage capacity , backup and recovery features make it the best database storage tool available.
  • Other thing I like about this software is its interface is so good.
Read full review
Cons
Apache
  • Memory management. Very weak on that.
  • PySpark not as robust as scala with spark.
  • spark master HA is needed. Not as HA as it should be.
  • Locality should not be a necessity, but does help improvement. But would prefer no locality
Read full review
Oracle
  • The memory demand and management makes it impossible to run it in a container.
  • It is hard to perform local unit testing with Oracle even using the personal edition (aggressive all the available memory grab for itself).
  • Lack of built in database migrations (e.g. as Flyway).
  • The need to install the Oracle client in addition to its drivers.
  • The cost of running it, especially in the Cloud.
  • Comes with very spartan community grade client/management tools whereas the commercial offerings tend to demand a premium price.
Read full review
Likelihood to Renew
Apache
Capacity of computing data in cluster and fast speed.
Read full review
Oracle
There is a lot of sunk cost in a product like Oracle 12c. It is doing a great job, it would not provide us much benefit to switch to another product even if it did the same thing due to the work involved in making such a switch. It would not be cost effective.
Read full review
Usability
Apache
The only thing I dislike about spark's usability is the learning curve, there are many actions and transformations, however, its wide-range of uses for ETL processing, facility to integrate and it's multi-language support make this library a powerhouse for your data science solutions. It has especially aided us with its lightning-fast processing times.
Read full review
Oracle
Many of the powerful options can be auto-configured but there are still many things to take into account at the moment of installing and configuring an Oracle Database, compared with SQL Server or other databases. At the same time, that extra complexity allows for detailed configuration and guarantees performance, scalability, availability and security.
Read full review
Support Rating
Apache
1. It integrates very well with scala or python. 2. It's very easy to understand SQL interoperability. 3. Apache is way faster than the other competitive technologies. 4. The support from the Apache community is very huge for Spark. 5. Execution times are faster as compared to others. 6. There are a large number of forums available for Apache Spark. 7. The code availability for Apache Spark is simpler and easy to gain access to. 8. Many organizations use Apache Spark, so many solutions are available for existing applications.
Read full review
Oracle
1. I have very good experience with Oracle Database support team. Oracle support team has pool of talented Oracle Analyst resources in different regions. To name a few regions - EMEA, Asia, USA(EST, MST, PST), Australia. Their support staffs are very supportive, well trained, and customer focused. Whenever I open Oracle Sev1 SR(service request), I always get prompt update on my case timely. 2. Oracle has zoom call and chat session option linked to Oracle SR. Whenever you are in Oracle portal - you can chat with the Oracle Analyst who is working on your case. You can request for Oracle zoom call thru which you can share the your problem server screen in no time. This is very nice as it saves lot of time and energy in case you have to follow up with oracle support for your case. 3.Oracle has excellent knowledge base in which all the customer databases critical problems and their solutions are well documented. It is very easy to follow without consulting to support team at first.
Read full review
Implementation Rating
Apache
No answers on this topic
Oracle
Overall the implementation went very well and after that everything came out as expected - in terms of performance and scalability. People should always install and upgrade a stable version for production with the latest patch set updates, test properly as much as possible, and should have a backup plan if anything unexpected happens
Read full review
Alternatives Considered
Apache
Spark in comparison to similar technologies ends up being a one stop shop. You can achieve so much with this one framework instead of having to stitch and weave multiple technologies from the Hadoop stack, all while getting incredibility performance, minimal boilerplate, and getting the ability to write your application in the language of your choosing.
Read full review
Oracle
Oracle is more of an enterprise-level database than Access and SAP Adaptive Server Enterprise isn't getting developed much (some people wonder how close it is to end of life) but SQL Server is miles ahead of Oracle IMO in terms of user experience and comparable in terms of performance AFAIK. As stated, a vendor forced our hand to use Oracle so we did not have a choice. If you are looking for help with an issue you are having, there are lots of SQL Server articles, etc. on the web and the community of SQL Server developers and DBA's is very strong and supportive. Oracle's help on the web is much more limited and often has an attitude that goes with it of superiority and lacking in compassion, IMO. For instance, check out the Ask Tom Oracle blog - a world of difference. If you choose Oracle, go into it with eyes wide open.
Read full review
Return on Investment
Apache
  • Business leaders are able to take data driven decisions
  • Business users are able access to data in near real time now . Before using spark, they had to wait for at least 24 hours for data to be available
  • Business is able come up with new product ideas
Read full review
Oracle
  • Oracle Database 12c has had a very positive impact on our ability to build strong and robust custom applications in house without the need to come up with our own methods of data storage and management.
  • Oracle Database 12c has the strongest user interface of any database I have worked with and continuously is improving its strength with the addition of support for JSON and XML type objects in the database.
  • Oracle Database 12c is sometimes very heavy and DBA intensive, but the benefits far outweigh the costs, which we need to spend on DBA support for enabling security and access features.
Read full review
ScreenShots