Dataiku - a complete Data Analytics and AI/ML solution
June 30, 2020

Anonymous | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User

Overall Satisfaction with Dataiku DSS

Dataiku is used as our integrated data analytics and AI/ML platform. It is a corporate-level standard solution across multiple regions and business domains. Data scientists use the platform to develop data pipelines, train AI/ML models, verify model performance, and eventually deploy models as services for business-critical IT applications (mainly predictive analysis, automation, and integration with RPA).
  • A very intuitive, easy-to-use UI lets many types of users collaborate easily by viewing the same workflow.
  • Many building blocks can be reused immediately, avoiding a lot of non-standard boilerplate implementation.
  • Data pre-analysis and feature engineering assistance increases the productivity and efficiency of data scientists.
  • Many data connectors support a wide range of data stores, including SQL databases, Teradata, and Hadoop Hive.
  • Support from research through to final MaaS solution deployment.
  • The flow visualization still has a lot of room to improve when the flow is complex.
  • The "non-coding" templates/building blocks for deep learning lack many important configurable parameters.
  • There is no unified way to apply design patterns to Python code (for example, when we want to develop our own modules or building blocks).
  • Dataiku provides a consistent platform, covering almost all needs from data analytics through AI/ML.
  • The platform "glues" all departments, business flows, and IT data sources together, making the data easier to exploit.
Anaconda is mainly used by professional data scientists with deep knowledge of Python coding, typically to build new algorithm blocks or optimizations; the resulting modules are then integrated into the Dataiku pipeline/workflow. Dataiku, by contrast, can also be used by other kinds of users.
The support team is very helpful. Even when we discover missing features, once we provide sound reasons and requirements, they add them to their development pipeline for a future release.

Do you think Dataiku delivers good value for the price?

Yes

Are you happy with Dataiku's feature set?

Yes

Did Dataiku live up to sales and marketing promises?

Yes

Did implementation of Dataiku go as expected?

Yes

Would you buy Dataiku again?

Yes

Chameleon, Cloudera DataFlow (formerly Hortonworks DataFlow), Sparx Systems Enterprise Architect
Dataiku is suitable for many steps of data processing pipeline development (from data collection and filtering through cleaning, transformation, and enhancement). It is also a good fit for users without in-depth AI/ML knowledge who want to jump in quickly and try solving a real-world problem.

Dataiku Feature Ratings

Connect to Multiple Data Sources: 10
Extend Existing Data Sources: 9
Automatic Data Format Detection: 9
MDM Integration: 6
Visualization: 8
Interactive Data Analysis: 8
Interactive Data Cleaning and Enrichment: 8
Data Transformations: 8
Data Encryption: 5
Built-in Processors: 7
Multiple Model Development Languages and Tools: 7
Automated Machine Learning: 7
Single platform for multiple model development: 7
Self-Service Model Delivery: 7
Flexible Model Publishing Options: 7
Security, Governance, and Cost Controls: 7