Databricks Lakehouse Platform vs. IBM DataStage

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Databricks Lakehouse Platform
Score 8.3 out of 10
N/A
Databricks in San Francisco offers the Databricks Lakehouse Platform (formerly the Unified Analytics Platform), a data science platform and Apache Spark cluster manager. The Databricks Unified Data Service aims to provide a reliable and scalable platform for data pipelines, data lakes, and data platforms. Users can manage full data journey, to ingest, process, store, and expose data throughout an organization. Its Data Science Workspace is a collaborative environment for practitioners to run…
$0.07
Per DBU
IBM DataStage
Score 8.6 out of 10
N/A
IBM® DataStage® is a data integration tool that helps users to design, develop and run jobs that move and transform data. At its core, the DataStage tool supports extract, transform and load (ETL) and extract, load and transform (ELT) patterns. A basic version of the software is available for on-premises deployment, and the cloud-based DataStage for IBM Cloud Pak® for Data offers automated integration capabilities in a hybrid or multicloud environment.N/A
Pricing
Databricks Lakehouse PlatformIBM DataStage
Editions & Modules
Standard
$0.07
Per DBU
Premium
$0.10
Per DBU
Enterprise
$0.13
Per DBU
No answers on this topic
Offerings
Pricing Offerings
Databricks Lakehouse PlatformIBM DataStage
Free Trial
NoYes
Free/Freemium Version
NoNo
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Features
Databricks Lakehouse PlatformIBM DataStage
Data Source Connection
Comparison of Data Source Connection features of Product A and Product B
Databricks Lakehouse Platform
-
Ratings
IBM DataStage
9.1
9 Ratings
10% above category average
Connect to traditional data sources00 Ratings9.59 Ratings
Connecto to Big Data and NoSQL00 Ratings8.88 Ratings
Data Transformations
Comparison of Data Transformations features of Product A and Product B
Databricks Lakehouse Platform
-
Ratings
IBM DataStage
9.5
9 Ratings
13% above category average
Simple transformations00 Ratings9.89 Ratings
Complex transformations00 Ratings9.39 Ratings
Data Modeling
Comparison of Data Modeling features of Product A and Product B
Databricks Lakehouse Platform
-
Ratings
IBM DataStage
9.0
9 Ratings
10% above category average
Data model creation00 Ratings9.46 Ratings
Metadata management00 Ratings8.78 Ratings
Business rules and workflow00 Ratings8.18 Ratings
Collaboration00 Ratings9.09 Ratings
Testing and debugging00 Ratings9.59 Ratings
Data Governance
Comparison of Data Governance features of Product A and Product B
Databricks Lakehouse Platform
-
Ratings
IBM DataStage
8.9
8 Ratings
8% above category average
Integration with data quality tools00 Ratings8.88 Ratings
Integration with MDM tools00 Ratings9.08 Ratings
Best Alternatives
Databricks Lakehouse PlatformIBM DataStage
Small Businesses

No answers on this topic

Skyvia
Skyvia
Score 9.6 out of 10
Medium-sized Companies
Snowflake
Snowflake
Score 9.0 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
Enterprises
Snowflake
Snowflake
Score 9.0 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
Databricks Lakehouse PlatformIBM DataStage
Likelihood to Recommend
8.4
(17 ratings)
8.8
(9 ratings)
Usability
9.4
(3 ratings)
9.0
(2 ratings)
Performance
-
(0 ratings)
9.0
(1 ratings)
Support Rating
8.6
(2 ratings)
9.6
(3 ratings)
Contract Terms and Pricing Model
8.0
(1 ratings)
-
(0 ratings)
Professional Services
10.0
(1 ratings)
-
(0 ratings)
User Testimonials
Databricks Lakehouse PlatformIBM DataStage
Likelihood to Recommend
Databricks
If you need a managed big data megastore, which has native integration with highly optimized Apache Spark Engine and native integration with MLflow, go for Databricks Lakehouse Platform. The Databricks Lakehouse Platform is a breeze to use and analytics capabilities are supported out of the box. You will find it a bit difficult to manage code in notebooks but you will get used to it soon.
Read full review
IBM
Excellent Cloud data mapping tool and easy creating multiple project data analytics in real-time and the report distribution are excellent via this IBM product. Easy tool to provide data visualization and the integration is effective and helpful to migrating huge amounts of data across other platforms and different websites insights gathering.
Read full review
Pros
Databricks
  • Process raw data in One Lake (S3) env to relational tables and views
  • Share notebooks with our business analysts so that they can use the queries and generate value out of the data
  • Try out PySpark and Spark SQL queries on raw data before using them in our Spark jobs
  • Modern day ETL operations made easy using Databricks. Provide access mechanism for different set of customers
Read full review
IBM
  • Data movement
  • Seamless integration of scripts and etl jobs
  • Descriptive logging
  • Ability to work with myriad of data assets
  • Direct integration for Governance catalog
Read full review
Cons
Databricks
  • Connect my local code in Visual code to my Databricks Lakehouse Platform cluster so I can run the code on the cluster. The old databricks-connect approach has many bugs and is hard to set up. The new Databricks Lakehouse Platform extension on Visual Code, doesn't allow the developers to debug their code line by line (only we can run the code).
  • Maybe have a specific Databricks Lakehouse Platform IDE that can be used by Databricks Lakehouse Platform users to develop locally.
  • Visualization in MLFLOW experiment can be enhanced
Read full review
IBM
  • Connector Stages to Snowflake on the cloud. We had some issues initially but since then had been corrected.
  • Accessing tool from a browser (zero foot-print). Currently we need to either install locally or connect to a server to do ETL work.
  • Diversify ways of authenticating users.
Read full review
Usability
Databricks
Because it is an amazing platform for designing experiments and delivering a deep dive analysis that requires execution of highly complex queries, as well as it allows to share the information and insights across the company with their shared workspaces, while keeping it secured.

in terms of graph generation and interaction it could improve their UI and UX
Read full review
IBM
Because it is robust, and it is being continuously improved. DS is one of the most used and recognized tools in the market. Large companies have implemented it in the first instance to develop their DW, but finding the advantages it has, they could use it for other types of projects such as migrations, application feeding, etc.
Read full review
Performance
Databricks
No answers on this topic
IBM
It could load thousands of records in seconds. But in the Parallel version, you need to understand how to particionate the data. If you use the algorithms erroneously, or the functionalities that it gives for the parsing of data, the performance can fall drastically, even with few records. It is necessary to have people with experience to be able to determine which algorithm to use and understand why.
Read full review
Support Rating
Databricks
One of the best customer and technology support that I have ever experienced in my career. You pay for what you get and you get the Rolls Royce. It reminds me of the customer support of SAS in the 2000s when the tools were reaching some limits and their engineer wanted to know more about what we were doing, long before "data science" was even a name. Databricks truly embraces the partnership with their customer and help them on any given challenge.
Read full review
IBM
I believe that IBM generally has one of the worst and most complex assistance systems (physical and online) that exists.
Read full review
Alternatives Considered
Databricks
Compared to Synapse & Snowflake, Databricks provides a much better development experience, and deeper configuration capabilities. It works out-of-the-box but still allows you intricate customisation of the environment. I find Databricks very flexible and resilient at the same time while Synapse and Snowflake feel more limited in terms of configuration and connectivity to external tools.
Read full review
IBM
It's obvious since they both are from the same vendors and it makes it easier and can get better rates for licensing. Also, sales rapes are very helpful in case of escalations and critical issues.
Read full review
Return on Investment
Databricks
  • The ability to spin up a BIG Data platform with little infrastructure overhead allows us to focus on business value not admin
  • DB has the ability to terminate/time out instances which helps manage cost.
  • The ability to quickly access typical hard to build data scenarios easily is a strength.
Read full review
IBM
  • Reduce development time by 65% compared with hand coding.
  • Reduces ETL process maintenance times.
  • Better data governance for technical and non-technical people.
  • Improve time to market for initiatives that require data integration.
Read full review
ScreenShots