AWS Glue vs. Azure Databricks

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
AWS Glue
Score 7.9 out of 10
N/A
AWS Glue is a managed extract, transform, and load (ETL) service designed to make it easy for customers to prepare and load data for analytics. With it, users can create and run an ETL job in the AWS Management Console. Users point AWS Glue to data stored on AWS, and AWS Glue discovers data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, data is immediately searchable, queryable, and available for ETL.
$0.44
billed per second, 1 minute minimum
Azure Databricks
Score 8.5 out of 10
N/A
Azure Databricks is a service available on Microsoft's Azure platform and suite of products. It provides the latest versions of Apache Spark so users can integrate with open source libraries, or spin up clusters and build in a fully managed Apache Spark environment with the global scale and availability of Azure. Clusters are set up, configured, and fine-tuned to ensure reliability and performance without the need for monitoring. The solution includes autoscaling and auto-termination to improve…N/A
Pricing
AWS GlueAzure Databricks
Editions & Modules
per DPU-Hour
$0.44
billed per second, 1 minute minimum
No answers on this topic
Offerings
Pricing Offerings
AWS GlueAzure Databricks
Free Trial
NoNo
Free/Freemium Version
NoNo
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details——
More Pricing Information
Community Pulse
AWS GlueAzure Databricks
Top Pros

No answers on this topic

Top Cons

No answers on this topic

Features
AWS GlueAzure Databricks
Platform Connectivity
Comparison of Platform Connectivity features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
8.5
2 Ratings
1% above category average
Connect to Multiple Data Sources00 Ratings7.62 Ratings
Extend Existing Data Sources00 Ratings9.02 Ratings
Automatic Data Format Detection00 Ratings9.42 Ratings
MDM Integration00 Ratings8.01 Ratings
Data Exploration
Comparison of Data Exploration features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
5.5
2 Ratings
42% below category average
Visualization00 Ratings5.22 Ratings
Interactive Data Analysis00 Ratings5.82 Ratings
Data Preparation
Comparison of Data Preparation features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
8.2
2 Ratings
0% above category average
Interactive Data Cleaning and Enrichment00 Ratings7.02 Ratings
Data Transformations00 Ratings8.62 Ratings
Data Encryption00 Ratings9.42 Ratings
Built-in Processors00 Ratings7.82 Ratings
Platform Data Modeling
Comparison of Platform Data Modeling features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
8.5
2 Ratings
1% above category average
Multiple Model Development Languages and Tools00 Ratings8.82 Ratings
Automated Machine Learning00 Ratings8.62 Ratings
Single platform for multiple model development00 Ratings8.42 Ratings
Self-Service Model Delivery00 Ratings8.42 Ratings
Model Deployment
Comparison of Model Deployment features of Product A and Product B
AWS Glue
-
Ratings
Azure Databricks
8.7
2 Ratings
1% above category average
Flexible Model Publishing Options00 Ratings8.02 Ratings
Security, Governance, and Cost Controls00 Ratings9.42 Ratings
Best Alternatives
AWS GlueAzure Databricks
Small Businesses
IBM SPSS Modeler
IBM SPSS Modeler
Score 7.3 out of 10
Jupyter Notebook
Jupyter Notebook
Score 8.9 out of 10
Medium-sized Companies
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
Posit
Posit
Score 9.8 out of 10
Enterprises
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
Posit
Posit
Score 9.8 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
AWS GlueAzure Databricks
Likelihood to Recommend
7.0
(8 ratings)
9.1
(3 ratings)
Usability
7.0
(1 ratings)
8.0
(1 ratings)
Support Rating
7.0
(1 ratings)
-
(0 ratings)
User Testimonials
AWS GlueAzure Databricks
Likelihood to Recommend
Amazon AWS
One of AWS Glue's most notable features that aid in the creation and transformation of data is its data catalog. Support, scheduling, and the automation of the data schema recognition make it superior to its competitors aside from that. It also integrates perfectly with other AWS tools. The main restriction may be integrated with systems outside of the AWS environment. It functions flawlessly with the current AWS services but not with other goods. Another potential restriction that comes to mind is that glue operates on a spark, which means the engineer needs to be conversant in the language.
Read full review
Microsoft
Suppose you have multiple data sources and you want to bring the data into one place, transform it and make it into a data model. Azure Databricks is a perfectly suited solution for this. Leverage spark JDBC or any external cloud based tool (ADG, AWS Glue) to bring the data into a cloud storage. From there, Azure Databricks can handle everything. The data can be ingested by Azure Databricks into a 3 Layer architecture based on the delta lake tables. The first layer, raw layer, has the raw as is data from source. The enrich layer, acts as the cleaning and filtering layer to clean the data at an individual table level. The gold layer, is the final layer responsible for a data model. This acts as the serving layer for BI For BI needs, if you need simple dashboards, you can leverage Azure Databricks BI to create them with a simple click! For complex dashboards, just like any sql db, you can hook it with a simple JDBC string to any external BI tool.
Read full review
Pros
Amazon AWS
  • It is extremely fast, easy, and self-intuitive. Though it is a suite of services, it requires pretty less time to get control over it.
  • As it is a managed service, one need not take care of a lot of underlying details. The identification of data schema, code generation, customization, and orchestration of the different job components allows the developers to focus on the core business problem without worrying about infrastructure issues.
  • It is a pay-as-you-go service. So, there is no need to provide any capacity in advance. So, it makes scheduling much easier.
Read full review
Microsoft
  • SQL
  • Data management
  • Data access
Read full review
Cons
Amazon AWS
  • In-Stream schema registries feature people can not use this more efficiently
  • in Connections feature they can add more connectors as well
  • The crucial problem with AWS Glue is that it only works with AWS.
Read full review
Microsoft
  • Their pipeline workflow orchestration is pretty primitive. Lacks some common features
  • Workspace UI and navigation requires steep learning curve
  • Personally, I am not fond of their autosave feature. Its dangerous for production level notebooks scripts
Read full review
Usability
Amazon AWS
We give 7 rating because of usefulness in AWS world without worrying about infrastructure and services interaction, it’s pretty out of the box gives us the flexibility to interact with them and use them. we take the data source in s3 from external system and then transform it using other AWS services and putting it back for other external services to consume in S3 form.
Read full review
Microsoft
Based on my extensive use of Azure Databricks for the past 3.5 years, it has evolved into a beautiful amalgamation of all the data domains and needs. From a data analyst, to a data engineer, to a data scientist, it jas got them all! Being language agnostic and focused on easy to use UI based control, it is a dream to use for every Data related personnel across all experience levels!
Read full review
Support Rating
Amazon AWS
Amazon responds in good time once the ticket has been generated but needs to generate tickets frequent because very few sample codes are available, and it's not cover all the scenarios.
Read full review
Microsoft
No answers on this topic
Alternatives Considered
Amazon AWS
AWS Glue is a fully managed ETL service that automates many ETL tasks, making it easier to set AWS Glue simplifies ETL through a visual interface and automated code generation.
Read full review
Microsoft
Against all the tools I have used, Azure Databricks is by far the most superior of them all! Why, you ask? The UI is modern, the features are never ending and they keep adding new features. And to quote Apple, "It just works!" Far ahead of the competition, the delta lakehouse platform also fares better than it counterparts of Iceberg implementation or a loosely bound Delta Lake implementation of Synapse
Read full review
Return on Investment
Amazon AWS
  • It had a positive impact on the way we build our data lake.
  • It is the single source of truth for data structure (schemas/tables/views).
Read full review
Microsoft
  • Helped reduce time for collecting data
  • Reduced cost in maintaining multiple data sources
  • Access for multiple users and management of users/data in a single platform
Read full review
ScreenShots