AWS Glue vs. Azure Databricks

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
AWS Glue
Score 8.7 out of 10
N/A
AWS Glue is a managed extract, transform, and load (ETL) service designed to make it easy for customers to prepare and load data for analytics. With it, users can create and run an ETL job in the AWS Management Console. Users point AWS Glue to data stored on AWS, and AWS Glue discovers data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, data is immediately searchable, queryable, and available for ETL.
$0.44
billed per second, 1 minute minimum
Azure Databricks
Score 8.6 out of 10
N/A
Azure Databricks is a service available on Microsoft's Azure platform and suite of products. It provides the latest versions of Apache Spark so users can integrate with open source libraries, or spin up clusters and build in a fully managed Apache Spark environment with the global scale and availability of Azure. Clusters are set up, configured, and fine-tuned to ensure reliability and performance without the need for monitoring. The solution includes autoscaling and auto-termination to improve…N/A
Pricing
AWS GlueAzure Databricks
Editions & Modules
per DPU-Hour
$0.44
billed per second, 1 minute minimum
No answers on this topic
Offerings
Pricing Offerings
AWS GlueAzure Databricks
Free Trial
NoNo
Free/Freemium Version
NoNo
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Community Pulse
AWS GlueAzure Databricks
Features
AWS GlueAzure Databricks
Platform Connectivity
Comparison of Platform Connectivity features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
7.3
4 Ratings
13% below category average
Connect to Multiple Data Sources00 Ratings6.04 Ratings
Extend Existing Data Sources00 Ratings7.74 Ratings
Automatic Data Format Detection00 Ratings7.34 Ratings
MDM Integration00 Ratings8.03 Ratings
Data Exploration
Comparison of Data Exploration features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
6.8
4 Ratings
22% below category average
Visualization00 Ratings6.04 Ratings
Interactive Data Analysis00 Ratings7.73 Ratings
Data Preparation
Comparison of Data Preparation features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
8.6
4 Ratings
5% above category average
Interactive Data Cleaning and Enrichment00 Ratings8.34 Ratings
Data Transformations00 Ratings9.04 Ratings
Data Encryption00 Ratings9.44 Ratings
Built-in Processors00 Ratings7.94 Ratings
Platform Data Modeling
Comparison of Platform Data Modeling features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
7.9
4 Ratings
6% below category average
Multiple Model Development Languages and Tools00 Ratings6.34 Ratings
Automated Machine Learning00 Ratings8.64 Ratings
Single platform for multiple model development00 Ratings8.44 Ratings
Self-Service Model Delivery00 Ratings8.44 Ratings
Model Deployment
Comparison of Model Deployment features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
8.3
4 Ratings
3% below category average
Flexible Model Publishing Options00 Ratings8.04 Ratings
Security, Governance, and Cost Controls00 Ratings8.64 Ratings
Best Alternatives
AWS GlueAzure Databricks
Small Businesses
IBM SPSS Modeler
IBM SPSS Modeler
Score 9.5 out of 10
Jupyter Notebook
Jupyter Notebook
Score 8.4 out of 10
Medium-sized Companies
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
Posit
Posit
Score 10.0 out of 10
Enterprises
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
Posit
Posit
Score 10.0 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
AWS GlueAzure Databricks
Likelihood to Recommend
8.9
(10 ratings)
7.9
(5 ratings)
Usability
9.3
(3 ratings)
7.6
(3 ratings)
Support Rating
7.0
(1 ratings)
-
(0 ratings)
User Testimonials
AWS GlueAzure Databricks
Likelihood to Recommend
Amazon AWS
One of AWS Glue's most notable features that aid in the creation and transformation of data is its data catalog. Support, scheduling, and the automation of the data schema recognition make it superior to its competitors aside from that. It also integrates perfectly with other AWS tools. The main restriction may be integrated with systems outside of the AWS environment. It functions flawlessly with the current AWS services but not with other goods. Another potential restriction that comes to mind is that glue operates on a spark, which means the engineer needs to be conversant in the language.
Read full review
Microsoft
Centralised notebooks are out directly into production. This can lead to poorly engineered code. It is very good for fast queries and our data team are always able to provide what we ask for. It is a big cost to our business so it is important it runs efficiently and returns on our investment.
Read full review
Pros
Amazon AWS
  • It is extremely fast, easy, and self-intuitive. Though it is a suite of services, it requires pretty less time to get control over it.
  • As it is a managed service, one need not take care of a lot of underlying details. The identification of data schema, code generation, customization, and orchestration of the different job components allows the developers to focus on the core business problem without worrying about infrastructure issues.
  • It is a pay-as-you-go service. So, there is no need to provide any capacity in advance. So, it makes scheduling much easier.
Read full review
Microsoft
  • Data Processing and Transformations based on Spark
  • Delta Lakehouse when clubbed with an external cloud storage
  • Governance using Unity Catalog to unify IAM
  • Delta Live Tables is a product, which although relatively newer, has a great potential with the visuals of a pipeline.
Read full review
Cons
Amazon AWS
  • In-Stream schema registries feature people can not use this more efficiently
  • in Connections feature they can add more connectors as well
  • The crucial problem with AWS Glue is that it only works with AWS.
Read full review
Microsoft
  • Intuitive interface
  • Ease of use
  • Providing FAQ or QRGs
Read full review
Usability
Amazon AWS
While easy to set up and manage monitoring for large datasets, its complexity can be a barrier for new users. Integration with AWS Ecosystem, Managed Monitoring, Dashboards and monitoring tools for AWS Glue are generally easy to set up and maintain, Automated Data Pipelines. Automates data pipeline creation, making it efficient for certain data integration
Read full review
Microsoft
The developers are able to switch between Python and SQL in the Notebook which allows the collaboration of SQL analyst and Data scientist. The integration of Mosaic AI allows users to write complex codes in natural languages. Unity catalog has centralized the security and governance features and simplified the process of maintaining it
Read full review
Support Rating
Amazon AWS
Amazon responds in good time once the ticket has been generated but needs to generate tickets frequent because very few sample codes are available, and it's not cover all the scenarios.
Read full review
Microsoft
No answers on this topic
Alternatives Considered
Amazon AWS
AWS Glue is a fully managed ETL service that automates many ETL tasks, making it easier to set AWS Glue simplifies ETL through a visual interface and automated code generation.
Read full review
Microsoft
I have found Azure Databricks to be much better than Snowflake for handling bigger, diverse data types. Snowflake is much simpler and better for smaller warehousing. The real time processing is much better in Azure Databricks and we have much more language options. Snowflake is more expensive but simpler to use. Both are great for different needs.
Read full review
Return on Investment
Amazon AWS
  • We are using GLUE for our ETL purpose. it’s ease with other our AWS services makes our ROI, 100% ROI.
  • One missing piece was compatibility with other data source for which we found a work around and made our data source as S3 only, so our dependencies on other data source is also reducing
Read full review
Microsoft
  • The support team is amazing, they help you at every stage of the projects, from sales to delivery.
  • On a framework level, it has had an amazing impact and has reduced the clients overall data platform costs by a staggering 65%
  • There has been a 40% Manual work requirement on average for the clients when they move to Azure Databricks Data Platform
Read full review
ScreenShots