21 Reviews and Ratings
7 Reviews and Ratings
Dataiku DSS is very well suited to handle large datasets and projects which requires a huge team to deliver results. This allows users to collaborate with each other while working on individual tasks. The workflow is easily streamlined and every action is backed up, allowing users to revert to specific tasks whenever required. While Dataiku DSS works seamlessly with all types of projects dealing with structured datasets, I haven't come across projects using Dataiku dealing with images/audio signals. But a workaround would be to store the images as vectors and perform the necessary tasks.Incentivized
Paxata can be highly useful to someone who doesn't like/have any experience with writing codes to treat data before using it as input into BI dashboards. Paxata can accelerate data cleaning in environments where a large amount of unclean data is generated and business decisions on the go are required. It performs really well while dealing with natural language.Incentivized
The intuitiveness of this tool is very good.Click or Code - If you are a coder, you can code. If you are a manager, you can wrangle with data with visualsThe way you can control things, the set of APIs gives a lot of flexibility to a developer.Incentivized
Visualize distributions in large data sets effectively which enable the user to quickly spot outliers and treat them appropriatelyProvides recommendation to merge datasets based on matching column valuesThe cluster and edit feature in my opinion is its most powerful feature and reduces cardinality in column with textIncentivized
End product deployment.Incentivized
Doesn't provide recommendation on how to impute valuesThere is a lag quite oftenWe can say whether a column has errors or quality issues in the first lookIncentivized
As I have described earlier, the intuitiveness of this tool makes it great as well as the variety of users that can use this tool. Also, the plugins available in their repository provide solutions to various data science problems.Incentivized
The support team is very helpful, and even when we discover the missing features, after providing enough rational reasons and requirements, they put into it their development pipeline for the future release.Incentivized
Strictly for Data Science operations, Anaconda can be considered as a subset of Dataiku DSS. While Anaconda supports Python and R programming languages, Dataiku also provides this facility, but also provides GUI to creates models with just a click of a button. This provides the flexibility to users who do not wish to alter the model hyperparameters in greater depths. Writing codes to extract meaningful data is time consuming compared to Dataiku's ability to perform feature engineering and data transformation through click of a button.Incentivized
Paxata is a much better tool when it comes to handling natural language but Talend provides recommendations on how to impute missing values and outliers. Paxata provides recommendations on dataset tie-ups and joins but Talend doesn't provide any such recommendations. In paxata you can visualize distribution of data in a column and filter them by dragging and selecting the section you'd like to retain Incentivized
Given its open source status, only cost is the learning curve, which is minimal compared to time savings for data exploration.Platform also ease tracking of data processing workflow, unlike Excel.Build-in data visualizations covers many use cases with minimal customization; time saver.Incentivized
It saves time to clean dataIt reduces the requirement of too many data engineer/stewards and hence adds positive impact on the return of the businessIncentivized