Cloudera Data Science Workbench enables secure self-service data science for the enterprise. It is a collaborative environment where developers can work with a variety of libraries and frameworks.
N/A
HPE Data Fabric
Score 9.4 out of 10
N/A
HPE Data Fabric (formerly MapR, acquired by HPE in 2019) is a software-defined datastore and file system that simplifies data management and analytics by unifying data across core, edge, and multicloud sources into a single platform.
N/A
Pricing
Cloudera Data Science Workbench
HPE Data Fabric
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
Data Science Workbench
HPE Data Fabric
Free Trial
No
No
Free/Freemium Version
No
No
Premium Consulting/Integration Services
No
No
Entry-level Setup Fee
No setup fee
No setup fee
Additional Details
—
—
More Pricing Information
Community Pulse
Cloudera Data Science Workbench
HPE Data Fabric
Features
Cloudera Data Science Workbench
HPE Data Fabric
Platform Connectivity
Comparison of Platform Connectivity features of Product A and Product B
Cloudera Data Science Workbench
7.5
2 Ratings
10% below category average
HPE Data Fabric
-
Ratings
Connect to Multiple Data Sources
7.02 Ratings
00 Ratings
Extend Existing Data Sources
8.02 Ratings
00 Ratings
Automatic Data Format Detection
7.02 Ratings
00 Ratings
MDM Integration
8.02 Ratings
00 Ratings
Data Exploration
Comparison of Data Exploration features of Product A and Product B
Cloudera Data Science Workbench
7.6
2 Ratings
10% below category average
HPE Data Fabric
-
Ratings
Visualization
7.12 Ratings
00 Ratings
Interactive Data Analysis
8.02 Ratings
00 Ratings
Data Preparation
Comparison of Data Preparation features of Product A and Product B
Cloudera Data Science Workbench
7.8
2 Ratings
4% below category average
HPE Data Fabric
-
Ratings
Interactive Data Cleaning and Enrichment
7.02 Ratings
00 Ratings
Data Transformations
8.02 Ratings
00 Ratings
Data Encryption
8.02 Ratings
00 Ratings
Built-in Processors
8.02 Ratings
00 Ratings
Platform Data Modeling
Comparison of Platform Data Modeling features of Product A and Product B
Cloudera Data Science Workbench
7.6
2 Ratings
9% below category average
HPE Data Fabric
-
Ratings
Multiple Model Development Languages and Tools
8.02 Ratings
00 Ratings
Automated Machine Learning
7.01 Ratings
00 Ratings
Single platform for multiple model development
7.12 Ratings
00 Ratings
Self-Service Model Delivery
8.12 Ratings
00 Ratings
Model Deployment
Comparison of Model Deployment features of Product A and Product B
Organizations which already implemented on-premise Hadoop based Cloudera Data Platform (CDH) for their Big Data warehouse architecture will definitely get more value from seamless integration of Cloudera Data Science Workbench (CDSW) with their existing CDH Platform. However, for organizations with hybrid (cloud and on-premise) data platform without prior implementation of CDH, implementing CDSW can be a challenge technically and financially.
MapR had very fast I/O throughput. The write speed was several times faster than what we could achieve with the other Hadoop vendors (Cloudera and Hortonworks). This is because MapR does not use HDFS, which is essentially a "meta filesystem". HDFS is built on top of the filesystem provided by the OS. MapR has their filesystem called MapR-FS, which is a true filesystem and accesses the raw disk drives.
The MapR filesystem is very easy to integrate with other Linux filesystems. When working with HDFS from Apache Hadoop, you usually have to use either the HDFS API or various Hadoop/HDFS command line utilities to interact with HDFS. You cannot use command line utilities native to the host operation system, which is usually Linux. At least, it is not easily done without setting up NFS, gateways, etc. With MapR-FS, you can mount the filesystem within Linux and use the standard Unix commands to manipulate files.
The HBase distribution provided by MapR is very similar to the Apache HBase distribution. Cloudera and Hortonworks add GUIs and other various tools on top of their HBase distributions. The MapR HBase distribution is very similar to the Apache distribution, which is nice if you are more accustomed to using Apache HBase.
Cloudera Data Science Workbench has excellence online resources support such as documentation and examples. On top of that the enterprise license also comes with SLA on opening a ticket to Cloudera Services and support for complaint handling and troubleshooting by email or through a phone call. On top of that it also offers additional paid training services.
Both the tools have similar features and have made it pretty easy to install/deploy/use. Depending on your existing platform (Cloudera vs. Azure) you need to pick the Workbench. Another observation is that Cloudera has better support where you can get feedback on your questions pretty fast (unlike MS). As its a new product, I expect MS to be more efficient in handling customers questions.
Increased employee efficiency for sure. Our clients have various levels of expertise in their deployment and user teams, and we never receive complaints about MapR.
MapR is used by one of our financial services clients who uses it for fraud detection and user pattern analysis. They are able to turn around data much faster than they previously had with in-house applications