Apache Spark vs. Splunk Enterprise

Apache Spark

159 Reviews and Ratings

Splunk Enterprise

Splunk Enterprise

455 Reviews and Ratings

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Apache Spark	Score 8.7 out of 10	N/A	N/A	N/A
Splunk Enterprise	Score 8.4 out of 10	N/A	Splunk is software for searching, monitoring, and analyzing machine-generated big data, via a web-style interface. It captures, indexes and correlates real-time data in a searchable repository from which it can generate graphs, reports, alerts, dashboards and visualizations.	N/A

Pricing

Apache Spark

Splunk Enterprise

Editions & Modules

No answers on this topic

No answers on this topic

Offerings

Pricing Offerings
Apache Spark	Splunk Enterprise
Free Trial
No	Yes
Free/Freemium Version
No	Yes
Premium Consulting/Integration Services
No	No

Entry-level Setup Fee

No setup fee

No setup fee

Additional Details

—

—

More Pricing Information

Community Pulse
	Apache Spark	Splunk Enterprise
Considered Both Products	Apache Spark No answer on this topic	Splunk Enterprise Verified User Team Lead Chose Splunk Enterprise We have also used ELK (Elastic Logstash Kibana) with some benefits, but Splunk is way better than ELK. We also use AWS CloudWatch for Lambdas that are written in AWS. However CloudWatch is not a replacement for Splunk. Incentivized Helpful?
Top Pros	Pro Machine learning Pro Data sets Pro Easy to use	Pro Real time Pro Easy to use Pro Set up
Top Cons	Minus Data visualization Minus Learning curve Minus Amounts of data	Minus Learning curve Minus Steep learning curve Minus Ability to create

Features

Apache Spark

Splunk Enterprise

Security Information and Event Management (SIEM)

Comparison of Security Information and Event Management (SIEM) features of Product A and Product B
	Apache Spark - Ratings	Splunk Enterprise 7.5 54 Ratings 4% below category average
Centralized event and log data collection	00 Ratings	6.553 Ratings
Correlation	00 Ratings	6.052 Ratings
Event and log normalization/management	00 Ratings	6.053 Ratings
Deployment flexibility	00 Ratings	7.549 Ratings
Integration with Identity and Access Management Tools	00 Ratings	7.549 Ratings
Custom dashboards and workspaces	00 Ratings	8.554 Ratings
Host and network-based intrusion detection	00 Ratings	7.037 Ratings
Data integration/API management	00 Ratings	8.45 Ratings
Behavioral analytics and baselining	00 Ratings	7.84 Ratings
Rules-based and algorithmic detection thresholds	00 Ratings	7.84 Ratings
Response orchestration and automation	00 Ratings	6.94 Ratings
Reporting and compliance management	00 Ratings	7.94 Ratings
Incident indexing/searching	00 Ratings	8.95 Ratings

Best Alternatives
	Apache Spark	Splunk Enterprise
Small Businesses	No answers on this topic	AlienVault USM Score 8.0 out of 10
Medium-sized Companies	Cloudera Manager Score 9.7 out of 10	InsightIDR Score 8.6 out of 10
Enterprises	IBM Analytics Engine Score 8.8 out of 10	InsightIDR Score 8.6 out of 10
All Alternatives	View all alternatives	View all alternatives

User Ratings
	Apache Spark	Splunk Enterprise
Likelihood to Recommend	9.9 (24 ratings)	7.0 (70 ratings)
Likelihood to Renew	10.0 (1 ratings)	10.0 (17 ratings)
Usability	10.0 (3 ratings)	9.0 (3 ratings)
Availability	- (0 ratings)	10.0 (1 ratings)
Support Rating	8.7 (4 ratings)	8.4 (17 ratings)
Online Training	- (0 ratings)	8.0 (1 ratings)
Implementation Rating	- (0 ratings)	9.0 (2 ratings)
Product Scalability	- (0 ratings)	9.1 (1 ratings)

User Testimonials
	Apache Spark	Splunk Enterprise
Likelihood to Recommend	Apache Well suited: To most of the local run of datasets and non-prod systems - scalability is not a problem at all. Including data from multiple types of data sources is an added advantage. MLlib is a decently nice built-in library that can be used for most of the ML tasks. Less appropriate: We had to work on a RecSys where the music dataset that we used was around 300+Gb in size. We faced memory-based issues. Few times we also got memory errors. Also the MLlib library does not have support for advanced analytics and deep-learning frameworks support. Understanding the internals of the working of Apache Spark for beginners is highly not possible. Incentivized Ananth Gouri Assistant Professor Read full review	Splunk Pros: Splunk is very well suited if you have multiple log sources of related data. All of them can be correlated and tasks can be automated based on the requirement. Other than alerts, Splunk can also run a specific script of your choice, based on some defined conditions. Cons: If you have a few logs but a large number of log sources, Splunk can be very expensive. Incentivized Kuntal Das Security Analyst Read full review
Pros	Apache Apache Spark makes processing very large data sets possible. It handles these data sets in a fairly quick manner. Apache Spark does a fairly good job implementing machine learning models for larger data sets. Apache Spark seems to be a rapidly advancing software, with the new features making the software ever more straight-forward to use. Incentivized Thomas Young Owner, previous CEO Read full review	Splunk Real-time + Scheduled alerts - i-e you can set up alerts which are actively monitoring your logs Pretty good response time for search results. With our key/value logging, Splunk makes it blazing fast to query the data. Dashboards provide insights into historical data Love how Splunk indexes all of the data and provides keys to search on Incentivized Pooja Gada Engineering Tech Lead Read full review
Cons	Apache Memory management. Very weak on that. PySpark not as robust as scala with spark. spark master HA is needed. Not as HA as it should be. Locality should not be a necessity, but does help improvement. But would prefer no locality Incentivized Anson Abraham Data Czar Read full review	Splunk At times some queries can run slowly if indices are not on a portion of the query you use. Setup time initially can be difficult if your logs aren't stored in common locations or in a common way to write the log. Ability to ingest logs from different locations without having to change code to put logs in a certain place (pro and con). Searches can be a bit more difficult to look through if your log isn't pulled in a manner that is easy to read through splunk. Incentivized Verified User Anonymous Read full review
Likelihood to Renew	Apache Capacity of computing data in cluster and fast speed. Steven Li Senior Software Developer (Consultant) Read full review	Splunk We are using Splunk extensively in our projects and we have recently upgraded to Splunk version 6.0 which is quite efficient and giving expected results. We keep track of updates and new features Splunk introduces periodically and try to introduce those features in our day to day activities for improvement in our reporting system and other tasks. Verified User Anonymous Read full review
Usability	Apache The only thing I dislike about spark's usability is the learning curve, there are many actions and transformations, however, its wide-range of uses for ETL processing, facility to integrate and it's multi-language support make this library a powerhouse for your data science solutions. It has especially aided us with its lightning-fast processing times. Incentivized Verified User Anonymous Read full review	Splunk You can literally throw in a single word into Splunk and it will pull back all instances of that word across all of your logs for the time span you select (provided you have permission to see that data). We have several users who have taken a few of the free courses from Splunk that are able to pull data out of it everyday with little help at all. Incentivized Kenneth Taitingfong Splunk Architect / Engineer Read full review
Reliability and Availability	Apache No answers on this topic	Splunk When properly setup and configured, Splunk is extremely reliable. Incentivized Verified User Anonymous Read full review
Support Rating	Apache 1. It integrates very well with scala or python. 2. It's very easy to understand SQL interoperability. 3. Apache is way faster than the other competitive technologies. 4. The support from the Apache community is very huge for Spark. 5. Execution times are faster as compared to others. 6. There are a large number of forums available for Apache Spark. 7. The code availability for Apache Spark is simpler and easy to gain access to. 8. Many organizations use Apache Spark, so many solutions are available for existing applications. YM Yogesh Mhasde Technical Manager Read full review	Splunk Splunk maintains a well resourced support system that has been consistent since we purchased the product. They help out in a timely manner and provide expert level information as needed. We typically open cases online and communicate when possible via e-mail and are able to resolve most issues with that method. Incentivized Verified User Anonymous Read full review
Online Training	Apache No answers on this topic	Splunk The online course was simple clear and described the main capabilities of the solution. There is also an initial module that can be done for free so anyone can familiarize themselves with the functionality of this solution. On the other hand, however, there could be more free online courses. Maybe even with a certificate, this would broaden the group of people who are familiar with the platform while increasing familiarity with the solution itself. Verified User Anonymous Read full review
Implementation Rating	Apache No answers on this topic	Splunk Smooth without too many major issues. Incentivized Verified User Anonymous Read full review
Alternatives Considered	Apache All the above systems work quite well on big data transformations whereas Spark really shines with its bigger API support and its ability to read from and write to multiple data sources. Using Spark one can easily switch between declarative versus imperative versus functional type programming easily based on the situation. Also it doesn't need special data ingestion or indexing pre-processing like Presto. Combining it with Jupyter Notebooks (https://github.com/jupyter-incubator/sparkmagic), one can develop the Spark code in an interactive manner in Scala or Python Incentivized Nitin Pasumarthy Software Engineer Read full review	Splunk I wanted to learn a new language that I can quickly master and implement. Splunk is easy, fun to use and best of all, it can be developed in hours not days or weeks. Splunk is fundamentally a programming language that is minimal but yet powerful enough to collect, analyze and visualize data. Incentivized Verified User Anonymous Read full review
Scalability	Apache No answers on this topic	Splunk Splunk can scale in to the petabyte per day range which of course is awesome Rick Yetter Regional Director Read full review
Return on Investment	Apache Faster turn around on feature development, we have seen a noticeable improvement in our agile development since using Spark. Easy adoption, having multiple departments use the same underlying technology even if the use cases are very different allows for more commonality amongst applications which definitely makes the operations team happy. Performance, we have been able to make some applications run over 20x faster since switching to Spark. This has saved us time, headaches, and operating costs. Incentivized Verified User Anonymous Read full review	Splunk Overall very positive. It has provided visibility to what is going on within our network. One drawback is the time it takes to get up to speed with the application, but this is up to the user, and Splunk education is excellent. In my field, IT Security, there are few other friends to have in your back pocket better than Splunk. They are just that good. Incentivized Verified User Anonymous Read full review
ScreenShots