Apache CouchDB vs. Apache Spark

Apache CouchDB

30 Reviews and Ratings

Apache Spark

159 Reviews and Ratings

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
CouchDB	Score 6.2 out of 10	N/A	Apache CouchDB is an HTTP + JSON document database with Map Reduce views and bi-directional replication. The Couch Replication Protocol is implemented in a variety of projects and products that span computing environments from globally distributed server-clusters, over mobile phones to web browsers.	N/A
Apache Spark	Score 8.7 out of 10	N/A	N/A	N/A

Pricing

Apache CouchDB

Apache Spark

Editions & Modules

No answers on this topic

No answers on this topic

Offerings

Pricing Offerings
CouchDB	Apache Spark
Free Trial
No	No
Free/Freemium Version
No	No
Premium Consulting/Integration Services
No	No

Entry-level Setup Fee

No setup fee

No setup fee

Additional Details

—

—

More Pricing Information

Community Pulse
	Apache CouchDB	Apache Spark
Considered Both Products	CouchDB Verified User Manager Chose Apache CouchDB It stacks up well against Mongo DB. Mongo DB definitely has more marketing and developer and customer mindshare because it is so widely known. Incentivized Helpful?	Apache Spark No answer on this topic
Top Pros	Pro Easy to use Pro Able to store Pro Web application	Pro Machine learning Pro Data sets Pro Easy to use
Top Cons	Minus Hard to set Minus Used correctly Minus First time users	Minus Data visualization Minus Learning curve Minus Amounts of data

Features

Apache CouchDB

Apache Spark

NoSQL Databases

Comparison of NoSQL Databases features of Product A and Product B
	Apache CouchDB 7.9 2 Ratings 11% below category average	Apache Spark - Ratings
Performance	8.02 Ratings	00 Ratings
Availability	8.52 Ratings	00 Ratings
Concurrency	8.52 Ratings	00 Ratings
Security	6.02 Ratings	00 Ratings
Scalability	8.02 Ratings	00 Ratings
Data model flexibility	7.02 Ratings	00 Ratings
Deployment model flexibility	9.02 Ratings	00 Ratings

Best Alternatives
	Apache CouchDB	Apache Spark
Small Businesses	IBM Cloudant Score 8.3 out of 10	No answers on this topic
Medium-sized Companies	IBM Cloudant Score 8.3 out of 10	Cloudera Manager Score 9.7 out of 10
Enterprises	IBM Cloudant Score 8.3 out of 10	IBM Analytics Engine Score 8.8 out of 10
All Alternatives	View all alternatives	View all alternatives

User Ratings
	Apache CouchDB	Apache Spark
Likelihood to Recommend	9.0 (10 ratings)	9.9 (24 ratings)
Likelihood to Renew	9.0 (9 ratings)	10.0 (1 ratings)
Usability	8.0 (1 ratings)	10.0 (3 ratings)
Support Rating	- (0 ratings)	8.7 (4 ratings)
Implementation Rating	9.0 (1 ratings)	- (0 ratings)

User Testimonials
	Apache CouchDB	Apache Spark
Likelihood to Recommend	Apache Great for REST API development, if you want a small, fast server that will send and receive JSON structures, CouchDB is hard to beat. Not great for enterprise-level relational database querying (no kidding). While by definition, document-oriented databases are not relational, porting or migrating from relational, and using CouchDB as a backend is probably not a wise move as it's reliable, but It may not always be highly available. Incentivized Verified User Anonymous Read full review	Apache Well suited: To most of the local run of datasets and non-prod systems - scalability is not a problem at all. Including data from multiple types of data sources is an added advantage. MLlib is a decently nice built-in library that can be used for most of the ML tasks. Less appropriate: We had to work on a RecSys where the music dataset that we used was around 300+Gb in size. We faced memory-based issues. Few times we also got memory errors. Also the MLlib library does not have support for advanced analytics and deep-learning frameworks support. Understanding the internals of the working of Apache Spark for beginners is highly not possible. Incentivized Ananth Gouri Assistant Professor Read full review
Pros	Apache It can replicate and sync with web browsers via PouchDB. This lets you keep a synced copy of your database on the client-side, which offers much faster data access than continuous HTTP requests would allow, and enables offline usage. Simple Map/Reduce support. The M/R system lets you process terabytes of documents in parallel, save the results, and only need to reprocess documents that have changed on subsequent updates. While not as powerful as Hadoop, it is an easy to use query system that's hard to screw up. Sharding and Clustering support. As of CouchDB 2.0, it supports clustering and sharding of documents between instances without needing a load balancer to determine where requests should go. Master to Master replication lets you clone, continuously backup, and listen for changes through the replication protocol, even over unreliable WAN links. Incentivized Sean Lang Developer Read full review	Apache Apache Spark makes processing very large data sets possible. It handles these data sets in a fairly quick manner. Apache Spark does a fairly good job implementing machine learning models for larger data sets. Apache Spark seems to be a rapidly advancing software, with the new features making the software ever more straight-forward to use. Incentivized Thomas Young Owner, previous CEO Read full review
Cons	Apache NoSQL DB can become a challenge for seasoned RDBMS users. The map-reduce paradigm can be very demanding for first-time users. JSON format documents with Key-Value pairs are somewhat verbose and consume more storage. Incentivized Johanes Siregar Risk Insight Product Read full review	Apache Memory management. Very weak on that. PySpark not as robust as scala with spark. spark master HA is needed. Not as HA as it should be. Locality should not be a necessity, but does help improvement. But would prefer no locality Incentivized Anson Abraham Data Czar Read full review
Likelihood to Renew	Apache Because our current solution S3 is working great and CouchDB was a nightmare. The worst is that at first, it seemed fine until we filled it with tons of data and then started to create views and actually delete. Incentivized Josh Stapp Sr. Software Engineer Read full review	Apache Capacity of computing data in cluster and fast speed. Steven Li Senior Software Developer (Consultant) Read full review
Usability	Apache Couchdb is very simple to use and the features are also reduced but well implemented. In order to use it the way its designed, the ui is adequate and easy. Of course, there are some other task that can't be performed through the admin ui but the minimalistic design allows you to use external libraries to develop custom scripts Incentivized victor pease solano General Manager Read full review	Apache The only thing I dislike about spark's usability is the learning curve, there are many actions and transformations, however, its wide-range of uses for ETL processing, facility to integrate and it's multi-language support make this library a powerhouse for your data science solutions. It has especially aided us with its lightning-fast processing times. Incentivized Verified User Anonymous Read full review
Support Rating	Apache No answers on this topic	Apache 1. It integrates very well with scala or python. 2. It's very easy to understand SQL interoperability. 3. Apache is way faster than the other competitive technologies. 4. The support from the Apache community is very huge for Spark. 5. Execution times are faster as compared to others. 6. There are a large number of forums available for Apache Spark. 7. The code availability for Apache Spark is simpler and easy to gain access to. 8. Many organizations use Apache Spark, so many solutions are available for existing applications. YM Yogesh Mhasde Technical Manager Read full review
Implementation Rating	Apache it support is minimal also hw requirements. Also for development, we can have databases replicated everywhere and the replication is automagical. once you set up the security and the rules for replication, you are ready to go. The absence of a model let you build your app the way you want it Incentivized victor pease solano General Manager Read full review	Apache No answers on this topic
Alternatives Considered	Apache It has been 5+ years since we chose CouchDB. We looked an MongoDB, Cassandra, and probably some others. At the end of the day, the performance, power potential, and simplicity of CouchDB made it a simple choice for our needs. No one should use just because we did. As I said early, make sure you understand your problems, and find the right solution. Some random reading that might be useful: http://www.julianbrowne.com/article/viewer/brewers-cap-theorem https://www.couchbase.com/nosql-resources/why-nosql\ https://www.infoq.com/articles/cap-twelve-years-later-how-the-rules-have-changed Incentivized Dave Lundberg CTO & Co-founder Read full review	Apache All the above systems work quite well on big data transformations whereas Spark really shines with its bigger API support and its ability to read from and write to multiple data sources. Using Spark one can easily switch between declarative versus imperative versus functional type programming easily based on the situation. Also it doesn't need special data ingestion or indexing pre-processing like Presto. Combining it with Jupyter Notebooks (https://github.com/jupyter-incubator/sparkmagic), one can develop the Spark code in an interactive manner in Scala or Python Incentivized Nitin Pasumarthy Software Engineer Read full review
Return on Investment	Apache It has saved us hours and hours of coding. It is has taught us a new way to look at things. It has taught us patience as the first few weeks with CouchDB were not pleasant. It was not easy to pick up like MongoDB. Incentivized LeVar Berry Chairman & CEO Read full review	Apache Faster turn around on feature development, we have seen a noticeable improvement in our agile development since using Spark. Easy adoption, having multiple departments use the same underlying technology even if the use cases are very different allows for more commonality amongst applications which definitely makes the operations team happy. Performance, we have been able to make some applications run over 20x faster since switching to Spark. This has saved us time, headaches, and operating costs. Incentivized Verified User Anonymous Read full review
ScreenShots