Skip to main content
TrustRadius: an HG Insights Company
Cassandra

Cassandra

Overview

What is Cassandra?

Cassandra is a no-SQL database from Apache.

Read more

Learn from top reviewers

Return to navigation

Pricing

View all pricing
N/A
Unavailable

What is Cassandra?

Cassandra is a no-SQL database from Apache.

Entry-level set up fee?

  • No setup fee

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Would you like us to let the vendor know that you want pricing?

84 people also want pricing

Alternatives Pricing

What is MongoDB?

MongoDB is an open source document-oriented database system. It is part of the NoSQL family of database systems. Instead of storing data in tables as is done in a "classical" relational database, MongoDB stores structured data as JSON-like documents with dynamic schemas (MongoDB calls the format…

What is MarkLogic Server?

MarkLogic Server is a multi-model database that has both NoSQL and trusted enterprise data management capabilities. The vendor states it is the most secure multi-model database, and it’s deployable in any environment. They state it is an ideal database to power a data hub.

Return to navigation

Product Demos

Presto and Cassandra: Doing SQL and Joins on Cassandra Tables

YouTube

CassandraDB Connector Demo | CassandraDB Integration

YouTube

Open Source BI Tools and Cassandra

YouTube

Spark and Cassandra: Doing SQL and Joins on Cassandra Tables

YouTube

Real-time IoT data analytics and visualization with Kaa, Apache Cassandra, and Apache Zeppelin

YouTube
Return to navigation

Features

NoSQL Databases

NoSQL databases are designed to be used across large distrusted systems. They are notably much more scalable and much faster and handling very large data loads than traditional relational databases.

8
Avg 8.8
Return to navigation

Product Details

Cassandra Technical Details

Operating SystemsUnspecified
Mobile ApplicationNo

Frequently Asked Questions

Cassandra is a no-SQL database from Apache.

Reviewers rate Scalability highest, with a score of 9.5.

The most common users of Cassandra are from Enterprises (1,001+ employees).
Return to navigation

Comparisons

View all alternatives
Return to navigation

Reviews From Top Reviewers

(1-5 of 12)

Cassandra - pretty good if you know what you are doing

Rating: 8 out of 10
October 13, 2015
Vetted Review
Verified User
Apache Cassandra
3 years of experience
Cassandra is being used as a time series store for sensor data and is used by several researchers within our department.
It serves as the storage layer in our home grown sensor analytics platforms that utilizes spark for the computation. We use it to store billions of samples of wearable sensor data that is collected in various studies and experiments.
  • High Availability - we utilize the data replication features of Cassandra. This enables us to access our data even when several nodes have gone down
  • Data Locality - our architecture combines Cassandra storage nodes and computation nodes in the same machine. This enables us to utilize data locality and limit expensive network IO to read data.
  • Elasticity - Cassandra is a shared nothing architecture. Nodes can be added very easily and they discover the network topology. As soon as a node has joined the Cassandra ring, the data is redistributed among the existing nodes and streamed to it automatically.
Cons
  • Cassandra runs on the JVM and therefor may require a lot of GC tuning for read/write intensive applications.
  • Requires manual periodic maintenance - for example it is recommended to run a cleanup on a regular basis.
  • There are a lot of knobs and buttons to configure the system. For many cases the default configuration will be sufficient, but if its not - you will need significant ramp up on the inner workings of Cassandra in order to effectively tune it.
Cassandra has excellent high availability and partition tolerance and has a robust architecture.
It is well suited for storing immutable data as deletes are extremely inefficient. As such, it is well suited for data archive and deep storage.
It is less appropriate for OLAP as has limited aggregation and filtering abilities, and no grouping whatsoever.

Cassandra: A highly available and scalable database

Rating: 8 out of 10
March 15, 2019
Vetted Review
Verified User
Apache Cassandra
3 years of experience
We use Cassandra as the NoSQL database for our use cases. We stream a lot of API data into this database and rely on the availability it gives us. It has proven to be consistent, which we use to our advantage. Cassandra can distribute data across multiple machines in an app-transparent manner, thus helping us to expand it on demand.
  • Cassandra is a masterless design, hence massively scalable. It is great for applications and use cases that cannot afford to lose data. There is no single point of failure.
  • You can add more nodes to Cassandra to linearly increase your transactions/requests. Also, it has great support across cloud regions and data centers.
  • Cassandra provides features like tunable consistency, data compression and CQL(Cassandra Query Language) which we use.
Cons
  • The underlying medium of Cassandra is a key-value store. So when you model your data, it is based on how you would want to query it and not how the data is structured. This results in a repetition of data when storing. Hence, there is no referential integrity - there is no concept of JOIN connections in Cassandra.
  • Data aggregation functions like SUM, MIN, MAX, AVG, and others are very costly even if possible. Hence Ad-hoc query or analysis is difficult.
You should be very clear where you want to use Cassandra because there is no referential integrity (JOIN) in Cassandra. You have to model data based on how you want to query it, hence what use cases it can be used for should be considered carefully.

You can use it where you want to store log or user-behavior types of data. You can use it in heavy-write or time-series data storage. It is good in retail applications for fast product catalog inputs and lookups

Cassandra at scale

Rating: 8 out of 10
March 16, 2019
GK
Vetted Review
Verified User
Apache Cassandra
1 year of experience
It’s one of the database platforms we offer to the development community in our organization. We have various selections when it comes to databases including DB2, SQL Server, Oracle, and hadoop for data warehousing. Cassandra becomes the choice when developers want to use a highly available NoSQL db.
  • Availability
  • Fast performance
  • Horizontal scalability
  • Memory first
  • Partition based
Cons
  • Dealing with tombstone
  • Maintenance/upgrade
  • Compaction and repair
We use it for collecting user preferences on our website which can be quickly reused. It's also well suited for document ID lookup systems. It’s not good for high consistency level of information like account balance in your banking system.

One of the Best NoSQL Databases!

Rating: 8 out of 10
June 30, 2019
Cassandra is currently used for our enterprise eCommerce platform. So far our experience is good with Cassandra its an extremely powerful NoSQL Database with high performance—distributed, scalable, and highly available database platform.
  • Continuous data availability is extremely powerful feature of Cassandra.
  • Overall cost effective and low maintenance database platform.
  • High performance and low tolerance no SQL database.
Cons
  • Moving data from and to Cassandra to any relational database platform can be improved.
  • Database event logging can be handled more efficiently.
It's perfect for big data or high volume data to load log files, event files, and streaming or video/image data. It gives really high performance dealing with big data fetches. But when you need to make table joins or you need more of a relational data structure, I do not think Cassandra will fit for that.

Cassandra - a tunable NoSQL datastore

Rating: 9 out of 10
March 07, 2019
Cassandra is a NoSQL database which is used to store a large amount of data quickly. It has a very fast write speed, allowing a large volume of data storage within a small amount of time. It is tunable and can be used to store data. It is more suitable for storing flat data rather than relational data.
  • Write speed. Cassandra is very fast while writing data due to its unique architecture.
  • Tunable consistency - During data replication, consistency can be tuned for a particular data set to be available during an outage.
  • CQL - cassandra query language is a subset of SQL and eases the transition from a more traditional database.
Cons
  • Aggregation functions are not very efficient.
  • Ad-hoc queries do not perform well. Queries which were visualized while designing the databases only perform well.
  • Performance is unpredictable.
Cassandra is well suited to storing a large volume of data within a very small period of time. It is relatively fast and the data consistency can be tuned for datasets for custom availability during an outage. It can be interacted with using CQL-- Cassandra query language-- which is similar to SQL, and thus transition is easier. It however performs less during aggregation and querying.
Return to navigation