Cloudera Data Science Workbench enables secure self-service data science for the enterprise. It is a collaborative environment where developers can work with a variety of libraries and frameworks.
N/A
Elasticsearch
Score 8.7 out of 10
N/A
Elasticsearch is an enterprise search tool from Elastic in Mountain View, California.
$16
per month
Pricing
Cloudera Data Science Workbench
Elasticsearch
Editions & Modules
No answers on this topic
Standard
$16.00
per month
Gold
$19.00
per month
Platinum
$22.00
per month
Enterprise
Contact Sales
Offerings
Pricing Offerings
Data Science Workbench
Elasticsearch
Free Trial
No
No
Free/Freemium Version
No
No
Premium Consulting/Integration Services
No
No
Entry-level Setup Fee
No setup fee
No setup fee
Additional Details
—
—
More Pricing Information
Community Pulse
Cloudera Data Science Workbench
Elasticsearch
Features
Cloudera Data Science Workbench
Elasticsearch
Platform Connectivity
Comparison of Platform Connectivity features of Product A and Product B
Cloudera Data Science Workbench
7.5
2 Ratings
10% below category average
Elasticsearch
-
Ratings
Connect to Multiple Data Sources
7.02 Ratings
00 Ratings
Extend Existing Data Sources
8.02 Ratings
00 Ratings
Automatic Data Format Detection
7.02 Ratings
00 Ratings
MDM Integration
8.02 Ratings
00 Ratings
Data Exploration
Comparison of Data Exploration features of Product A and Product B
Cloudera Data Science Workbench
7.6
2 Ratings
10% below category average
Elasticsearch
-
Ratings
Visualization
7.12 Ratings
00 Ratings
Interactive Data Analysis
8.02 Ratings
00 Ratings
Data Preparation
Comparison of Data Preparation features of Product A and Product B
Cloudera Data Science Workbench
7.8
2 Ratings
4% below category average
Elasticsearch
-
Ratings
Interactive Data Cleaning and Enrichment
7.02 Ratings
00 Ratings
Data Transformations
8.02 Ratings
00 Ratings
Data Encryption
8.02 Ratings
00 Ratings
Built-in Processors
8.02 Ratings
00 Ratings
Platform Data Modeling
Comparison of Platform Data Modeling features of Product A and Product B
Cloudera Data Science Workbench
7.6
2 Ratings
9% below category average
Elasticsearch
-
Ratings
Multiple Model Development Languages and Tools
8.02 Ratings
00 Ratings
Automated Machine Learning
7.01 Ratings
00 Ratings
Single platform for multiple model development
7.12 Ratings
00 Ratings
Self-Service Model Delivery
8.12 Ratings
00 Ratings
Model Deployment
Comparison of Model Deployment features of Product A and Product B
Organizations which already implemented on-premise Hadoop based Cloudera Data Platform (CDH) for their Big Data warehouse architecture will definitely get more value from seamless integration of Cloudera Data Science Workbench (CDSW) with their existing CDH Platform. However, for organizations with hybrid (cloud and on-premise) data platform without prior implementation of CDH, implementing CDSW can be a challenge technically and financially.
Elasticsearch is a really scalable solution that can fit a lot of needs, but the bigger and/or those needs become, the more understanding & infrastructure you will need for your instance to be running correctly. Elasticsearch is not problem-free - you can get yourself in a lot of trouble if you are not following good practices and/or if are not managing the cluster correctly. Licensing is a big decision point here as Elasticsearch is a middleware component - be sure to read the licensing agreement of the version you want to try before you commit to it. Same goes for long-term support - be sure to keep yourself in the know for this aspect you may end up stuck with an unpatched version for years.
As I mentioned before, Elasticsearch's flexible data model is unparalleled. You can nest fields as deeply as you want, have as many fields as you want, but whatever you want in those fields (as long as it stays the same type), and all of it will be searchable and you don't need to even declare a schema beforehand!
Elastic, the company behind Elasticsearch, is super strong financially and they have a great team of devs and product managers working on Elasticsearch. When I first started using ES 3 years ago, I was 90% impressed and knew it would be a good fit. 3 years later, I am 200% impressed and blown away by how far it has come and gotten even better. If there are features that are missing or you don't think it's fast enough right now, I bet it'll be suitable next year because the team behind it is so dang fast!
Elasticsearch is really, really stable. It takes a lot to bring down a cluster. It's self-balancing algorithms, leader-election system, self-healing properties are state of the art. We've never seen network failures or hard-drive corruption or CPU bugs bring down an ES cluster.
To get started with Elasticsearch, you don't have to get very involved in configuring what really is an incredibly complex system under the hood. You simply install the package, run the service, and you're immediately able to begin using it. You don't need to learn any sort of query language to add data to Elasticsearch or perform some basic searching. If you're used to any sort of RESTful API, getting started with Elasticsearch is a breeze. If you've never interacted with a RESTful API directly, the journey may be a little more bumpy. Overall, though, it's incredibly simple to use for what it's doing under the covers.
Cloudera Data Science Workbench has excellence online resources support such as documentation and examples. On top of that the enterprise license also comes with SLA on opening a ticket to Cloudera Services and support for complaint handling and troubleshooting by email or through a phone call. On top of that it also offers additional paid training services.
We've only used it as an opensource tooling. We did not purchase any additional support to roll out the elasticsearch software. When rolling out the application on our platform we've used the documentation which was available online. During our test phases we did not experience any bugs or issues so we did not rely on support at all.
Both the tools have similar features and have made it pretty easy to install/deploy/use. Depending on your existing platform (Cloudera vs. Azure) you need to pick the Workbench. Another observation is that Cloudera has better support where you can get feedback on your questions pretty fast (unlike MS). As its a new product, I expect MS to be more efficient in handling customers questions.
As far as we are concerned, Elasticsearch is the gold standard and we have barely evaluated any alternatives. You could consider it an alternative to a relational or NoSQL database, so in cases where those suffice, you don't need Elasticsearch. But if you want powerful text-based search capabilities across large data sets, Elasticsearch is the way to go.
We have had great luck with implementing Elasticsearch for our search and analytics use cases.
While the operational burden is not minimal, operating a cluster of servers, using a custom query language, writing Elasticsearch-specific bulk insert code, the performance and the relative operational ease of Elasticsearch are unparalleled.
We've easily saved hundreds of thousands of dollars implementing Elasticsearch vs. RDBMS vs. other no-SQL solutions for our specific set of problems.