Apache Spark Streaming vs. Elasticsearch vs. Rockset

Apache Spark Streaming

7 Reviews and Ratings

Elasticsearch

218 Reviews and Ratings

Rockset

4 Reviews and Ratings

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Apache Spark Streaming	Score 8.7 out of 10	N/A	Apache Spark Streaming is a scalable fault-tolerant streaming processing system that natively supports both batch and streaming workloads.	N/A
Elasticsearch	Score 8.5 out of 10	N/A	Elasticsearch is an enterprise search tool from Elastic in Mountain View, California.	$16 per month
Rockset	Score 1.3 out of 10	N/A	Rockset is a serverless search and analytics engine that does fast SQL on NoSQL data from Kafka, DynamoDB, S3 and more. According to the vendor, it delivers millisecond-latency SQL over TBs of raw data, without any ETL. Rockset integrates with the user's database, data stream or data lake to continuously ingest new data without requiring a schema, while providing full SQL support for filtering, aggregations and joining streaming data with other data sets. Rockset powers data-driven applications…	N/A

Pricing

Apache Spark Streaming

Elasticsearch

Rockset

Editions & Modules

No answers on this topic

Standard: $16.00
per month
Gold: $19.00
per month
Platinum: $22.00
per month
Enterprise: Contact Sales

No answers on this topic

Offerings

Pricing Offerings
Apache Spark Streaming	Elasticsearch	Rockset
Free Trial
No	No	No
Free/Freemium Version
No	No	No
Premium Consulting/Integration Services
No	No	No

Entry-level Setup Fee

No setup fee

Additional Details

—

More Pricing Information

Community Pulse
	Apache Spark Streaming	Elasticsearch	Rockset

Features

Apache Spark Streaming

Elasticsearch

Rockset

Streaming Analytics

Comparison of Streaming Analytics features of Product A and Product B
	Apache Spark Streaming 8.4 1 Ratings 5% above category average	Elasticsearch - Ratings	Rockset - Ratings
Real-Time Data Analysis	8.01 Ratings	00 Ratings	00 Ratings
Visualization Dashboards	9.01 Ratings	00 Ratings	00 Ratings
Data Ingestion from Multiple Data Sources	9.01 Ratings	00 Ratings	00 Ratings
Low Latency	8.01 Ratings	00 Ratings	00 Ratings
Integrated Development Tools	8.01 Ratings	00 Ratings	00 Ratings
Data wrangling and preparation	8.01 Ratings	00 Ratings	00 Ratings
Linear Scale-Out	8.01 Ratings	00 Ratings	00 Ratings
Machine Learning Automation	9.01 Ratings	00 Ratings	00 Ratings
Data Enrichment	9.01 Ratings	00 Ratings	00 Ratings

Best Alternatives
	Apache Spark Streaming	Elasticsearch	Rockset
Small Businesses	IBM Streams (discontinued) Score 9.0 out of 10	Yext Score 7.9 out of 10	Yext Score 7.9 out of 10
Medium-sized Companies	Confluent Score 9.2 out of 10	Guru Score 9.4 out of 10	Guru Score 9.4 out of 10
Enterprises	Spotfire Streaming Score 5.1 out of 10	Guru Score 9.4 out of 10	Guru Score 9.4 out of 10
All Alternatives	View all alternatives	View all alternatives	View all alternatives

User Ratings
	Apache Spark Streaming	Elasticsearch	Rockset
Likelihood to Recommend	9.0 (1 ratings)	9.0 (48 ratings)	9.0 (1 ratings)
Likelihood to Renew	- (0 ratings)	10.0 (1 ratings)	- (0 ratings)
Usability	- (0 ratings)	10.0 (1 ratings)	- (0 ratings)
Support Rating	- (0 ratings)	7.8 (9 ratings)	- (0 ratings)
Implementation Rating	- (0 ratings)	9.0 (1 ratings)	- (0 ratings)

User Testimonials
	Apache Spark Streaming	Elasticsearch	Rockset
Likelihood to Recommend	Apache Apache Spark Streaming is a tool that we are using for almost a year and is excellent in managing batch processing. It is user-friendly. Using it, we can even process our massive data in fractions of seconds. Its pricing is its other plus point. Only its In-memory processing is its demerit as it occupies a large memory. Incentivized CR Christa Raine Business Manager Read full review	Elastic Elasticsearch is a really scalable solution that can fit a lot of needs, but the bigger and/or those needs become, the more understanding & infrastructure you will need for your instance to be running correctly. Elasticsearch is not problem-free - you can get yourself in a lot of trouble if you are not following good practices and/or if are not managing the cluster correctly. Licensing is a big decision point here as Elasticsearch is a middleware component - be sure to read the licensing agreement of the version you want to try before you commit to it. Same goes for long-term support - be sure to keep yourself in the know for this aspect you may end up stuck with an unpatched version for years. Incentivized Borislav Traykov DevOps Team Leader Read full review	Rockset 1. Usage based billing - helpful in creating query lambdas and workflow around them. 2. Fast data refresh and scheduled running of queries is required. 3. Wherever real-time data is in play in terms of visibility. Incentivized Verified User Anonymous Read full review
Pros	Apache It is amazing in solving complicated transformative logic. It is straightforward to program. It is a very quick tool. It processes large data within a fraction of seconds. Incentivized CR Christa Raine Business Manager Read full review	Elastic As I mentioned before, Elasticsearch's flexible data model is unparalleled. You can nest fields as deeply as you want, have as many fields as you want, but whatever you want in those fields (as long as it stays the same type), and all of it will be searchable and you don't need to even declare a schema beforehand! Elastic, the company behind Elasticsearch, is super strong financially and they have a great team of devs and product managers working on Elasticsearch. When I first started using ES 3 years ago, I was 90% impressed and knew it would be a good fit. 3 years later, I am 200% impressed and blown away by how far it has come and gotten even better. If there are features that are missing or you don't think it's fast enough right now, I bet it'll be suitable next year because the team behind it is so dang fast! Elasticsearch is really, really stable. It takes a lot to bring down a cluster. It's self-balancing algorithms, leader-election system, self-healing properties are state of the art. We've never seen network failures or hard-drive corruption or CPU bugs bring down an ES cluster. Incentivized Verified User Anonymous Read full review	Rockset Fast ingest and real-time indexing Complex SQL queries run really fast Lower cost in terms of deployment. Incentivized Verified User Anonymous Read full review
Cons	Apache There must be more documentation. It is a profoundly complex tool. Its in-memory processing consumes massive memory. Incentivized CR Christa Raine Business Manager Read full review	Elastic Joining data requires duplicate de-normalized documents that make parent child relationships. It is hard and requires a lot of synchronizations Tracking errors in the data in the logs can be hard, and sometimes recurring errors blow up the error logs Schema changes require complete reindexing of an index Incentivized Keith Lubell Chief Technology Officer Read full review	Rockset User interface AI in queries, auto-generated queries Graphing tools Incentivized Verified User Anonymous Read full review
Likelihood to Renew	Apache No answers on this topic	Elastic We're pretty heavily invested in ElasticSearch at this point, and there aren't any obvious negatives that would make us reconsider this decision. Incentivized Aaron Gussman Senior Technologist Read full review	Rockset No answers on this topic
Usability	Apache No answers on this topic	Elastic To get started with Elasticsearch, you don't have to get very involved in configuring what really is an incredibly complex system under the hood. You simply install the package, run the service, and you're immediately able to begin using it. You don't need to learn any sort of query language to add data to Elasticsearch or perform some basic searching. If you're used to any sort of RESTful API, getting started with Elasticsearch is a breeze. If you've never interacted with a RESTful API directly, the journey may be a little more bumpy. Overall, though, it's incredibly simple to use for what it's doing under the covers. Incentivized Verified User Anonymous Read full review	Rockset No answers on this topic
Support Rating	Apache No answers on this topic	Elastic We've only used it as an opensource tooling. We did not purchase any additional support to roll out the elasticsearch software. When rolling out the application on our platform we've used the documentation which was available online. During our test phases we did not experience any bugs or issues so we did not rely on support at all. Incentivized Verified User Anonymous Read full review	Rockset No answers on this topic
Implementation Rating	Apache No answers on this topic	Elastic Do not mix data and master roles. Dedicate at least 3 nodes just for Master Incentivized Verified User Anonymous Read full review	Rockset No answers on this topic
Alternatives Considered	Apache Apache Spark Streaming stands above all the huge data transformative tools because of its speed of processing which was quite slow in Presto as it takes a lot of our time in the data processing. Spark, comfortably provides integration with Jupyter like notebook environment. and Spark's combination with Jupyter and Python results in enhancing the speed . Incentivized CR Christa Raine Business Manager Read full review	Elastic As far as we are concerned, Elasticsearch is the gold standard and we have barely evaluated any alternatives. You could consider it an alternative to a relational or NoSQL database, so in cases where those suffice, you don't need Elasticsearch. But if you want powerful text-based search capabilities across large data sets, Elasticsearch is the way to go. Incentivized Verified User Anonymous Read full review	Rockset No answers on this topic
Return on Investment	Apache Cost and time-effective tool for our business. We can integrate with Jupyter with many conveniences. Its high-speed data processing has proved beneficial for us. Incentivized CR Christa Raine Business Manager Read full review	Elastic We have had great luck with implementing Elasticsearch for our search and analytics use cases. While the operational burden is not minimal, operating a cluster of servers, using a custom query language, writing Elasticsearch-specific bulk insert code, the performance and the relative operational ease of Elasticsearch are unparalleled. We've easily saved hundreds of thousands of dollars implementing Elasticsearch vs. RDBMS vs. other no-SQL solutions for our specific set of problems. Incentivized Anatoly Geyfman Founder & CEO Read full review	Rockset Didn't need to invest in an external billing software. Major customer feature ask was fulfilled - realtime usage overview Incentivized Verified User Anonymous Read full review
ScreenShots