Cloudera Data Platform (CDP), launched September 2019, is designed to combine the best of Hortonworks and Cloudera technologies to deliver an enterprise data cloud. CDP includes the Cloudera Data Warehouse and machine learning services as well as a Data Hub service for building custom business applications.
$0.04
per CCU (hourly rate)
Vertica
Score 9.2 out of 10
N/A
The Vertica Analytics Platform supplies enterprise data warehouses with big data analytics capabilities and modernization. Vertica is owned and supported by Micro Focus.
In the creation of maintenance models Cloudera can excel greatly, because it allows you to create very scalable and applicable processes in different types of technology, either for reporting based purely on data, or to implement its analysis modalities to develop IT projects. In case of requiring detailed review of different phases of IT processes, Cloudera can be useful in security operations that require high expertise in shared data.
Vertica as a data warehouse to deliver analytics in-house and even to your client base on scale is not rivaled anywhere in the market. Frankly, in my experience it is not even close to equaled. Because it is such a powerful data warehouse, some people attempt to use it as a transactional database. It certainly is not one of those. Individual row inserts are slow and do not perform well. Deletes are a whole other story. RDBMS it is definitely not. OLAP it rocks.
Could use some work on better integrating with cloud providers and open source technologies. For AWS you will find an AMI in the marketplace and recently a connector for loading data from S3 directly was created. With last release, integration with Kafka was added that can help.
Managing large workloads (concurrent queries) is a bit challenging.
Having a way to provide an estimate on the duration for currently executing queries / etc. can be helpful. Vertica provides some counters for the query execution engine that are helpful but some may find confusing.
Unloading data over JDBC is very slow. We've had to come up with alternatives based on vsql, etc. Not a very clean, official on how to unload data.
We have utilized Cloudera support quite frequently and are very satisfied with the capability and responsiveness of that team. Often, the new features delivered with the platform give us an opportunity to mature the way we're doing things, and the support team have been valuable in developing those new patterns.
I haven't had any recent opportunity to reach out to Vertica support. From what I remember, I believe whenever I reached out to them the experience was smooth.
IBM's offering of the Cloud Pak for Data has been a moving target and difficult to compare to Cloudera Data Platform. We have implemented our solution on Amazon Web Services, which appears to be supported by IBM at this point, but the migration would be very expensive for us to endeavor.
Vertica performs well when the query has good stats and is tuned well. Options for GUI clients are ugly and outdated. IO optimized: it's a columnar store with no indexing structures to maintain like traditional databases. The indexing is achieved by storing the data sorted on disk, which itself is run transparently as a background process.