Databricks Data Intelligence Platform vs. Amazon Redshift

Databricks Data Intelligence Platform

Databricks Data Intelligence Platform

103 Reviews and Ratings

Amazon Redshift

Amazon Redshift

217 Reviews and Ratings

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Databricks Data Intelligence Platform	Score 8.7 out of 10	N/A	Databricks in San Francisco offers the Databricks Lakehouse Platform (formerly the Unified Analytics Platform), a data science platform and Apache Spark cluster manager. The Databricks Unified Data Service aims to provide a reliable and scalable platform for data pipelines, data lakes, and data platforms. Users can manage full data journey, to ingest, process, store, and expose data throughout an organization. Its Data Science Workspace is a collaborative environment for practitioners to run…	$0.07 Per DBU
Amazon Redshift	Score 8.7 out of 10	N/A	Amazon Redshift is a hosted data warehouse solution, from Amazon Web Services.	$0.24 per GB per month

Pricing

Databricks Data Intelligence Platform

Amazon Redshift

Editions & Modules

Standard: $0.07
Per DBU
Premium: $0.10
Per DBU
Enterprise: $0.13
Per DBU

Redshift Managed Storage: $0.24
per GB per month
Current Generation: $0.25 - $13.04
per hour
Previous Generation: $0.25 - $4.08
per hour
Redshift Spectrum: $5.00
per terabyte of data scanned

Offerings

Pricing Offerings
Databricks Data Intelligence Platform	Amazon Redshift
Free Trial
No	No
Free/Freemium Version
No	No
Premium Consulting/Integration Services
No	No

Entry-level Setup Fee

No setup fee

No setup fee

Additional Details

—

—

More Pricing Information

Community Pulse
	Databricks Data Intelligence Platform	Amazon Redshift
Considered Both Products	Databricks Data Intelligence Platform Verified User Director Chose Databricks Data Intelligence Platform When we started using it, only the notebook experience was mature. However, DB was very helpful giving us direct support to get onto their platform. Really there was little in the way to compare to them at the time. AWS has services but not the same low-cost angle. Incentivized Helpful? Verified User Director Chose Databricks Data Intelligence Platform Easier to set up and get started. Less of a learning curve. Incentivized Helpful?	Amazon Redshift NM Narayan Motamarri Staff Data Engineer Chose Amazon Redshift We evaluated [Amazon] Redshift vs BigQuery vs Amazon EMR, back in 2014. Back then BigQuery cost was slightly higher than that of [Amazon] Redshift price structure. Amazon EMR, needs lots more management (Admin tasks) and EMR is designed to be ephemeral and not designed to be a … Incentivized Helpful?

Best Alternatives
	Databricks Data Intelligence Platform	Amazon Redshift
Small Businesses	No answers on this topic	Google BigQuery Score 8.7 out of 10
Medium-sized Companies	Snowflake Score 8.7 out of 10	Snowflake Score 8.7 out of 10
Enterprises	Snowflake Score 8.7 out of 10	Snowflake Score 8.7 out of 10
All Alternatives	View all alternatives	View all alternatives

User Ratings
	Databricks Data Intelligence Platform	Amazon Redshift
Likelihood to Recommend	10.0 (18 ratings)	9.0 (38 ratings)
Usability	10.0 (4 ratings)	9.0 (10 ratings)
Support Rating	8.7 (2 ratings)	9.0 (7 ratings)
Contract Terms and Pricing Model	8.0 (1 ratings)	10.0 (1 ratings)
Professional Services	10.0 (1 ratings)	- (0 ratings)

User Testimonials
	Databricks Data Intelligence Platform	Amazon Redshift
Likelihood to Recommend	Databricks Medium to Large data throughput shops will benefit the most from Databricks Spark processing. Smaller use cases may find the barrier to entry a bit too high for casual use cases. Some of the overhead to kicking off a Spark compute job can actually lead to your workloads taking longer, but past a certain point the performance returns cannot be beat. Incentivized Austin Franchino Senior Data and Security Engineer Read full review	Amazon AWS If the number of connections is expected to be low, but the amounts of data are large or projected to grow it is a good solutions especially if there is previous exposure to PostgreSQL. Speaking of Postgres, Redshift is based on several versions old releases of PostgreSQL so the developers would not be able to take advantage of some of the newer SQL language features. The queries need some fine-tuning still, indexing is not provided, but playing with sorting keys becomes necessary. Lastly, there is no notion of the Primary Key in Redshift so the business must be prepared to explain why duplication occurred (must be vigilant for) Incentivized Arthur Zubarev Senior Business Intelligence Consultant Read full review
Pros	Databricks Process raw data in One Lake (S3) env to relational tables and views Share notebooks with our business analysts so that they can use the queries and generate value out of the data Try out PySpark and Spark SQL queries on raw data before using them in our Spark jobs Modern day ETL operations made easy using Databricks. Provide access mechanism for different set of customers Incentivized Verified User Anonymous Read full review	Amazon AWS [Amazon] Redshift has Distribution Keys. If you correctly define them on your tables, it improves Query performance. For instance, we can define Mapping/Meta-data tables with Distribution-All Key, so that it gets replicated across all the nodes, for fast joins and fast query results. [Amazon] Redshift has Sort Keys. If you correctly define them on your tables along with above Distribution Keys, it further improves your Query performance. It also has Composite Sort Keys and Interleaved Sort Keys, to support various use cases [Amazon] Redshift is forked out of PostgreSQL DB, and then AWS added "MPP" (Massively Parallel Processing) and "Column Oriented" concepts to it, to make it a powerful data store. [Amazon] Redshift has "Analyze" operation that could be performed on tables, which will update the stats of the table in leader node. This is sort of a ledger about which data is stored in which node and which partition with in a node. Up to date stats improves Query performance. Incentivized NM Narayan Motamarri Staff Data Engineer Read full review
Cons	Databricks Connect my local code in Visual code to my Databricks Lakehouse Platform cluster so I can run the code on the cluster. The old databricks-connect approach has many bugs and is hard to set up. The new Databricks Lakehouse Platform extension on Visual Code, doesn't allow the developers to debug their code line by line (only we can run the code). Maybe have a specific Databricks Lakehouse Platform IDE that can be used by Databricks Lakehouse Platform users to develop locally. Visualization in MLFLOW experiment can be enhanced Incentivized Verified User Anonymous Read full review	Amazon AWS We've experienced some problems with hanging queries on Redshift Spectrum/external tables. We've had to roll back to and old version of Redshift while we wait for AWS to provide a patch. Redshift's dialect is most similar to that of PostgreSQL 8. It lacks many modern features and data types. Constraints are not enforced. We must rely on other means to verify the integrity of transformed tables. Incentivized Gavin Hackeling Data Scientist Read full review
Usability	Databricks Because it is an amazing platform for designing experiments and delivering a deep dive analysis that requires execution of highly complex queries, as well as it allows to share the information and insights across the company with their shared workspaces, while keeping it secured. in terms of graph generation and interaction it could improve their UI and UX Incentivized Verified User Anonymous Read full review	Amazon AWS Just very happy with the product, it fits our needs perfectly. Amazon pioneered the cloud and we have had a positive experience using RedShift. Really cool to be able to see your data housed and to be able to query and perform administrative tasks with ease. Incentivized Brendan McKenna Senior Developer Read full review
Support Rating	Databricks One of the best customer and technology support that I have ever experienced in my career. You pay for what you get and you get the Rolls Royce. It reminds me of the customer support of SAS in the 2000s when the tools were reaching some limits and their engineer wanted to know more about what we were doing, long before "data science" was even a name. Databricks truly embraces the partnership with their customer and help them on any given challenge. Jonatan Bouchard Director Data Science Read full review	Amazon AWS The support was great and helped us in a timely fashion. We did use a lot of online forums as well, but the official documentation was an ongoing one, and it did take more time for us to look through it. We would have probably chosen a competitor product had it not been for the great support Incentivized Verified User Anonymous Read full review
Alternatives Considered	Databricks The most important differentiating factor for Databricks Lakehouse Platform from these other platforms is support for ACID transactions and the time travel feature. Also, native integration with managed MLflow is a plus. EMR, Cloudera, and Hortonworks are not as optimized when it comes to Spark Job Execution. Other platforms need to be self-managed, which is another huge hassle. Incentivized Verified User Anonymous Read full review	Amazon AWS Than Vertica: Redshift is cheaper and AWS integrated (which was a plus because the whole company was on AWS). Than BigQuery: Redshift has a standard SQL interface, though recently I heard good things about BigQuery and would try it out again. Than Hive: Hive is great if you are in the PB+ range, but latencies tend to be much slower than Redshift and it is not suited for ad-hoc applications. Incentivized Verified User Anonymous Read full review
Contract Terms and Pricing Model	Databricks No answers on this topic	Amazon AWS Redshift is relatively cheaper tool but since the pricing is dynamic, there is always a risk of exceeding the cost. Since most of our team is using it as self serve and there is no continuous tracking by a dedicated team, it really needs time & effort on analyst's side to know how much it is going to cost. Incentivized Sameera Srivastava Analytics Lead Read full review
Return on Investment	Databricks The ability to spin up a BIG Data platform with little infrastructure overhead allows us to focus on business value not admin DB has the ability to terminate/time out instances which helps manage cost. The ability to quickly access typical hard to build data scenarios easily is a strength. Incentivized Verified User Anonymous Read full review	Amazon AWS Our company is moving to the AWS infrastructure, and in this context moving the warehouse environments to Redshift sounds logical regardless of the cost. Development organizations have to operate in the Dev/Ops mode where they build and support their apps at the same time. Hard to estimate the overall ROI of moving to Redshift from my position. However, running Redshift seems to be inexpensive compared to all the licensing and hardware costs we had on our RDBMS platform before Redshift. Incentivized Michael Romm Principal Data Architect Read full review
ScreenShots