Skip to main content
TrustRadius
Data Science Workbench

Data Science Workbench

Overview

What is Data Science Workbench?

Cloudera Data Science Workbench enables secure self-service data science for the enterprise. It is a collaborative environment where developers can work with a variety of libraries and frameworks.

Read more
Recent Reviews

Cloudera review

6 out of 10
September 15, 2019
Incentivized
Cloudera is being used on a 6-node Hadoop cluster used for sandbox demonstrations and development. The business problem it was selected to …
Continue reading
Read all reviews
Return to navigation

Pricing

View all pricing
N/A
Unavailable

What is Data Science Workbench?

Cloudera Data Science Workbench enables secure self-service data science for the enterprise. It is a collaborative environment where developers can work with a variety of libraries and frameworks.

Entry-level set up fee?

  • No setup fee

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Would you like us to let the vendor know that you want pricing?

4 people also want pricing

Alternatives Pricing

What is Mathematica?

Wolfram's flagship product Mathematica is a modern technical computing application featuring a flexible symbolic coding language and a wide array of graphing and data visualization capabilities.

What is IBM SPSS Modeler?

IBM SPSS Modeler is a visual data science and machine learning (ML) solution designed to help enterprises accelerate time to value by speeding up operational tasks for data scientists. Organizations can use it for data preparation and discovery, predictive analytics, model management and…

Return to navigation

Product Demos

Demo de Cloudera Data Science Workbench con vuelos comerciales en R

YouTube

CoolTalks 2021 - Machine Learning and Data Visualisation with Cloudera​

YouTube

Cloudera Data Science Workbench 1.4 Accelerates Everyday Workflows for Data Scientists

YouTube
Return to navigation

Product Details

What is Data Science Workbench?

Cloudera Data Science Workbench enables secure self-service data science for the enterprise. It is a collaborative environment where developers can work with a variety of libraries and frameworks.

Data Science Workbench Technical Details

Operating SystemsUnspecified
Mobile ApplicationNo
Return to navigation

Comparisons

View all alternatives
Return to navigation

Reviews and Ratings

(14)

Attribute Ratings

Reviews

(1-3 of 3)
Companies can't remove reviews or game the system. Here's why
Score 9 out of 10
Vetted Review
Verified User
Incentivized
Cloudera Data Science Workbench (CDSW) is mainly being used by data engineers in the IT department for Big Data Analytics pipeline from ingestion until feature extraction phase. It is also being used by data scientists in Analytics department for building machine learning models. On top of that, it is also used by business analyst in Big Data Monetization business units for exploration and reporting. CDSW reduces time to market from exploring, modeling, and deploying to production.
  • Enterprise grade security.
  • Self-service analytics platform.
  • Popular programming support.
  • Lacks features offered by competition.
  • Limited license scheme options.
  • Installation in production can be challenging.
Organizations which already implemented on-premise Hadoop based Cloudera Data Platform (CDH) for their Big Data warehouse architecture will definitely get more value from seamless integration of Cloudera Data Science Workbench (CDSW) with their existing CDH Platform. However, for organizations with hybrid (cloud and on-premise) data platform without prior implementation of CDH, implementing CDSW can be a challenge technically and financially.
Platform Connectivity (4)
75%
7.5
Connect to Multiple Data Sources
70%
7.0
Extend Existing Data Sources
80%
8.0
Automatic Data Format Detection
70%
7.0
MDM Integration
80%
8.0
Data Exploration (2)
75%
7.5
Visualization
70%
7.0
Interactive Data Analysis
80%
8.0
Data Preparation (4)
77.5%
7.8
Interactive Data Cleaning and Enrichment
70%
7.0
Data Transformations
80%
8.0
Data Encryption
80%
8.0
Built-in Processors
80%
8.0
Platform Data Modeling (4)
75%
7.5
Multiple Model Development Languages and Tools
80%
8.0
Automated Machine Learning
70%
7.0
Single platform for multiple model development
70%
7.0
Self-Service Model Delivery
80%
8.0
Model Deployment (2)
80%
8.0
Flexible Model Publishing Options
80%
8.0
Security, Governance, and Cost Controls
80%
8.0
  • Reducing cost by eliminating silos.
  • Faster adoption and time to market.
  • Added cost for enterprise license.
  • Azure Data Science Virtual Machines (DSVM)
Since our organization had already implemented Cloudera Data Platform as our Big Data Warehouse platform, implementing CDSW as the go-to Analytic and Data Science Platform is the most logical and cost-effective decision to make. It integrates seamlessly with our CDH clusters and it also provides enterprise-grade security for on-premise implementation.
Cloudera Data Science Workbench has excellence online resources support such as documentation and examples. On top of that the enterprise license also comes with SLA on opening a ticket to Cloudera Services and support for complaint handling and troubleshooting by email or through a phone call. On top of that it also offers additional paid training services.
September 15, 2019

Cloudera review

Score 6 out of 10
Vetted Review
Verified User
Incentivized
Cloudera is being used on a 6-node Hadoop cluster used for sandbox demonstrations and development. The business problem it was selected to address was the ability to create Machine Learning models in an enterprise environment based on data lake architecture.
  • The ability to use multiple languages.
  • GitHub integration.
  • Scalable.
  • Installation is difficult.
  • Upgrades are difficult.
  • Licensing options are not flexible.
The use cases are specific to my industry, and we’re implemented for experimentation and scoring of predictive models.
Platform Connectivity (4)
70%
7.0
Connect to Multiple Data Sources
60%
6.0
Extend Existing Data Sources
70%
7.0
Automatic Data Format Detection
70%
7.0
MDM Integration
80%
8.0
Data Exploration (2)
90%
9.0
Visualization
90%
9.0
Interactive Data Analysis
90%
9.0
Data Preparation (4)
77.5%
7.8
Interactive Data Cleaning and Enrichment
80%
8.0
Data Transformations
80%
8.0
Data Encryption
80%
8.0
Built-in Processors
70%
7.0
Platform Data Modeling (3)
96.66666666666666%
9.7
Multiple Model Development Languages and Tools
90%
9.0
Single platform for multiple model development
100%
10.0
Self-Service Model Delivery
100%
10.0
Model Deployment (2)
70%
7.0
Flexible Model Publishing Options
100%
10.0
Security, Governance, and Cost Controls
40%
4.0
  • Paid off for demonstration purposes.
It is expensive and difficult to install and maintain.
Bharadwaj (Brad) Chivukula | TrustRadius Reviewer
Score 8 out of 10
Vetted Review
Verified User
Incentivized
  • Used by the Data Science/Engineering Team as a collaboration tool.
  • Combines all the efforts of various departments under a single IDE and provides a holistic view in the retail setting.
  • Use of data to project sales numbers, marketing etc.
  • One single IDE (browser based application) that makes Scala, R, Python integrated under one tool
  • For larger organizations/teams, it lets you be self reliant
  • As it sits on your cluster, it has very easy access of all the data on the HDFS
  • Linking with Github is a very good way to keep the code versions intact
  • Not as great as RStudio; lacks some features when compared with it
  • It is quite simple still (because its very early in its initiative), and companies may want to wait until they see a more developed product
  • If you already have a Cloudera partnership and a cluster, having this is a no brainer.
  • It integrates well with your existing ecosystem and it immediately starts working on projects, accessing full datasets and share analysis and results.
  • With the inclusion of Kubernetes, CPU and memory across worker nodes can be managed effectively.
  • As the tool itself can access all the HDFS, Spark data easily, the wait time between teams has reduced
  • Installation was a breeze, and ramp up time was fairly easy
Both the tools have similar features and have made it pretty easy to install/deploy/use. Depending on your existing platform (Cloudera vs. Azure) you need to pick the Workbench. Another observation is that Cloudera has better support where you can get feedback on your questions pretty fast (unlike MS). As its a new product, I expect MS to be more efficient in handling customers questions.
Return to navigation