Azure Synapse Analytics is described as the former Azure SQL Data Warehouse, evolved, and as a limitless analytics service that brings together enterprise data warehousing and Big Data analytics. It gives users the freedom to query data using either serverless or provisioned resources, at scale. Azure Synapse brings these two worlds together with a unified experience to ingest, prepare, manage, and serve data for immediate BI and machine learning needs.
$4,700
per month 5000 Synapse Commit Units (SCUs)
Dataiku
Score 8.5 out of 10
N/A
The Dataiku platform unifies data work from analytics to Generative AI. It supports enterprise analytics with visual, cloud-based tooling for data preparation, visualization, and workflow automation.
N/A
Pricing
Azure Synapse Analytics
Dataiku
Editions & Modules
Tier 1
$4,700
per month 5,000 Synapse Commit Units (SCUs)
Tier 2
$9,200
per month 10,000 Synapse Commit Units (SCUs)
Tier 3
$21,360
per month 24,000 Synapse Commit Units (SCUs)
Tier 4
$50,400
per month 60,000 Synapse Commit Units (SCUs)
Tier 5
$117,000
per month 150,000 Synapse Commit Units (SCUs)
Tier 6
$259,200
per month 360,000 Synapse Commit Units (SCUs)
Discover
Contact sales team
Business
Contact sales team
Enterprise
Contact sales team
Offerings
Pricing Offerings
Azure Synapse Analytics
Dataiku
Free Trial
No
Yes
Free/Freemium Version
No
Yes
Premium Consulting/Integration Services
No
No
Entry-level Setup Fee
No setup fee
No setup fee
Additional Details
—
—
More Pricing Information
Community Pulse
Azure Synapse Analytics
Dataiku
Features
Azure Synapse Analytics
Dataiku
Platform Connectivity
Comparison of Platform Connectivity features of Product A and Product B
Azure Synapse Analytics
-
Ratings
Dataiku
8.6
5 Ratings
3% above category average
Connect to Multiple Data Sources
00 Ratings
8.05 Ratings
Extend Existing Data Sources
00 Ratings
10.04 Ratings
Automatic Data Format Detection
00 Ratings
10.05 Ratings
MDM Integration
00 Ratings
6.52 Ratings
Data Exploration
Comparison of Data Exploration features of Product A and Product B
Azure Synapse Analytics
-
Ratings
Dataiku
10.0
5 Ratings
17% above category average
Visualization
00 Ratings
10.05 Ratings
Interactive Data Analysis
00 Ratings
10.05 Ratings
Data Preparation
Comparison of Data Preparation features of Product A and Product B
Azure Synapse Analytics
-
Ratings
Dataiku
9.5
5 Ratings
15% above category average
Interactive Data Cleaning and Enrichment
00 Ratings
9.05 Ratings
Data Transformations
00 Ratings
9.05 Ratings
Data Encryption
00 Ratings
10.04 Ratings
Built-in Processors
00 Ratings
10.04 Ratings
Platform Data Modeling
Comparison of Platform Data Modeling features of Product A and Product B
Azure Synapse Analytics
-
Ratings
Dataiku
8.5
5 Ratings
1% above category average
Multiple Model Development Languages and Tools
00 Ratings
8.05 Ratings
Automated Machine Learning
00 Ratings
8.05 Ratings
Single platform for multiple model development
00 Ratings
8.05 Ratings
Self-Service Model Delivery
00 Ratings
10.04 Ratings
Model Deployment
Comparison of Model Deployment features of Product A and Product B
It's well suited for large, fastly growing, and frequently changing data warehouses (e.g., in startups). It's also suited for companies that want a single, relatively easy-to-use, centralized cloud service for all their data needs. Larger, more structured organizations could still benefit from this service by using Synapse Dedicated SQL Pools, knowing that costs will be much higher than other solutions. I think this product is not suited for smaller, simpler workloads (where an Azure SQL Database and a Data Factory could be enough) or very large scenarios, where it may be better to build custom infrastructure.
Dataiku is an awesome tool for data scientists. It really makes our lives easier. It is also really good for non technical users to see and follow along with the process. I do think that people can fall into the trap of using it without any knowledge at all because so much is automated, but I dont think that is the fault of Dataiku.
Quick to return data. Queries in a SQL data warehouse architecture tend to return data much more quickly than a OLTP setup. Especially with columnar indexes.
Ability to manage extremely large SQL tables. Our databases contain billions of records. This would be unwieldy without a proper SQL datawarehouse
Backup and replication. Because we're already using SQL, moving the data to a datawarehouse makes it easier to manage as our users are already familiar with SQL.
With Azure, it's always the same issue, too many moving parts doing similar things with no specialisation. ADF, Fabric Data Factory and Synapse pipeline serve the same purpose. Same goes for Fabric Warehouse and Synapse SQL pools.
Could do better with serverless workloads considering the competition from databricks and its own fabric warehouse
Synapse pipelines is a replica of Azure Data Factory with no tight integration with Synapse and to a surprise, with missing features from ADF. Integration of warehouse can be improved with in environment ETl tools
The integrated windows of frontend and backend in web applications make it cumbersome for the developer.
When dealing with multiple data flows, it becomes really confusing, though they have introduced a feature (Zones) to cater to this issue.
Bundling, exporting, and importing projects sometimes create issues related to code environment. If the code environment is not available, at least the schema of the flow we should be able to import should be.
The data warehouse portion is very much like old style on-prem SQL server, so most SQL skills one has mastered carry over easily. Azure Data Factory has an easy drag and drop system which allows quick building of pipelines with minimal coding. The Spark portion is the only really complex portion, but if there's an in-house python expert, then the Spark portion is also quiet useable.
The user experience is very good. Everything feels intuitive and "flows" (sorry excuse the pun) so nicely, and the customization level is also appropriate to the tool. Even as a newer data scientist, it felt easy to use and the explanations/tutorials were very good. The documentation is also at a good level
Microsoft does its best to support Synapse. More and more articles are being added to the documentation, providing more useful information on best utilizing its features. The examples provided work well for basic knowledge, but more complex examples should be added to further assist in discovering the vast abilities that the system has.
The open source user community is friendly, helpful, and responsive, at times even outdoing commercial software vendors. Documentation is also top notch, and usually resolves issues without the need for human interactions. Great product design, with a focus on user experience, also makes platform use intuitive, thus reducing the need for explicit support.
In comparing Azure Synapse to the Google BigQuery - the biggest highlight that I'd like to bring forward is Azure Synapse SQL leverages a scale-out architecture in order to distribute computational processing of data across multiple nodes whereas Google BigQuery only takes into account computation and storage.
Anaconda is mainly used by professional data scientists who have profound knowledge of Python coding, mainly used for building some new algorithm block or some optimization, then the module will be integrated into the Dataiku pipeline/workflow. While Dataiku can be used by even other kinds of users.
Licensing fees is replaced with Azure subscription fee. No big saving there
More visibility into the Azure usage and cost
It can be used a hot storage and old data can be archived to data lake. Real time data integration is possible via external tables and Microsoft Power BI