Skip to main content
TrustRadius
Azure Databricks

Azure Databricks

Overview

What is Azure Databricks?

Azure Databricks is a service available on Microsoft's Azure platform and suite of products. It provides the latest versions of Apache Spark so users can integrate with open source libraries, or spin up clusters and build in a fully managed…

Read more
Recent Reviews
Read all reviews

Awards

Products that are considered exceptional by their customers based on a variety of criteria win TrustRadius awards. Learn more about the types of TrustRadius awards to make the best purchase decision. More about TrustRadius Awards

Reviewer Pros & Cons

View all pros & cons
Return to navigation

Pricing

View all pricing
N/A
Unavailable

What is Azure Databricks?

Azure Databricks is a service available on Microsoft's Azure platform and suite of products. It provides the latest versions of Apache Spark so users can integrate with open source libraries, or spin up clusters and build in a fully managed Apache Spark environment with the global scale and…

Entry-level set up fee?

  • No setup fee

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Would you like us to let the vendor know that you want pricing?

3 people also want pricing

Alternatives Pricing

What is Anaconda?

Anaconda provides access to the foundational open-source Python and R packages used in modern AI, data science, and machine learning. These enterprise-grade solutions enable corporate, research, and academic institutions around the world to harness open-source for competitive advantage and…

What is RapidMiner?

RapidMiner is a data science and data mining platform, from Altair since the late 2022 acquisition. RapidMiner offers full automation for non-coding domain experts, an integrated JupyterLab environment for seasoned data scientists, and a visual drag-and-drop designer. RapidMiner’s project-based…

Return to navigation

Product Details

What is Azure Databricks?

Azure Databricks is a service available on Microsoft's Azure platform and suite of products. It provides the latest versions of Apache Spark so users can integrate with open source libraries, or spin up clusters and build in a fully managed Apache Spark environment with the global scale and availability of Azure. Clusters are set up, configured, and fine-tuned to ensure reliability and performance without the need for monitoring. The solution includes autoscaling and auto-termination to improve total cost of ownership (TCO).

It is presented as an open and unified platform to run all types of analytics workloads, whether as a data scientist, data engineer, or a business analyst. Choice of language can include Python, Scala, R, and SQL. It provides version control of notebooks with GitHub and Azure DevOps.

It provides advanced automated machine learning capabilities using the integrated Azure Machine Learning to identify suitable algorithms and hyperparameters. The solution helps to simplify management, monitoring, and updating of machine learning models deployed from the cloud to the edge. Azure Machine Learning also provides a central registry for experiments, machine learning pipelines, and models.

Azure Databricks Technical Details

Operating SystemsUnspecified
Mobile ApplicationNo
Return to navigation

Comparisons

View all alternatives
Return to navigation

Reviews and Ratings

(23)

Reviews

(1-2 of 2)
Companies can't remove reviews or game the system. Here's why
Pranshu Gupta | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User
Incentivized
We are leveraging Databricks capabilities in various use cases. For instance, to design a tailor made change data capture that keep track of users account details and keep it updated in delta lake. We have also designed numerous ETL processes which is scheduled to provide data to data analytics on strict delivery timelines. Moreover, the workspaces is integrated with other Azure services such as Azure Synapse Analytics, Azure data lake, Azure Data Factory. Some of our Databricks are triggered by Azure Data Factory.
  • Consistently great performance when dealing with huge scale data with the help of spark architecture
  • Magic commands such as spark sql, pyspark, scala . This comes really handy in day to day work
  • Integration with other Azure services is super smooth and robust
  • Their pipeline workflow orchestration is pretty primitive. Lacks some common features
  • Workspace UI and navigation requires steep learning curve
  • Personally, I am not fond of their autosave feature. Its dangerous for production level notebooks scripts
It works great for use cases where you want to have a more customized solution able to handle huge data volumes ( cluster nodes power and spark). Also, if you want to migrate native spark solution to cloud. Or if you want to integrate your existing Azure data services together
Score 9 out of 10
Vetted Review
Verified User
Incentivized
We use Databricks to pull performance metrics for our content hosted on the company website. Having one tool to view and analyze the data has been a game changer for us, saving many hours of collecting the data various sources in the past.
  • SQL
  • Data management
  • Data access
  • Intuitive interface
  • Ease of use
  • Providing FAQ or QRGs
Having access to all databases and tables in one place is what has helped me and my team to function better. The in built functionality/access to SQL and Python is definitely an added bonus! The icing on the cake is the ability to export your data into an Excel spreadsheet for additional analysis. If you have less to no working knowledge of SQL or Python, its better to look at alternatives.
Platform Connectivity (4)
72.5%
7.3
Connect to Multiple Data Sources
100%
10.0
Extend Existing Data Sources
90%
9.0
Automatic Data Format Detection
100%
10.0
MDM Integration
N/A
N/A
Data Exploration (2)
40%
4.0
Visualization
40%
4.0
Interactive Data Analysis
40%
4.0
Data Preparation (4)
85%
8.5
Interactive Data Cleaning and Enrichment
70%
7.0
Data Transformations
80%
8.0
Data Encryption
100%
10.0
Built-in Processors
90%
9.0
Platform Data Modeling (4)
90%
9.0
Multiple Model Development Languages and Tools
100%
10.0
Automated Machine Learning
80%
8.0
Single platform for multiple model development
90%
9.0
Self-Service Model Delivery
90%
9.0
Model Deployment (2)
90%
9.0
Flexible Model Publishing Options
80%
8.0
Security, Governance, and Cost Controls
100%
10.0
  • Helped reduce time for collecting data
  • Reduced cost in maintaining multiple data sources
  • Access for multiple users and management of users/data in a single platform
Return to navigation