Cloudera Data Platform (CDP), launched September 2019, is designed to combine the best of Hortonworks and Cloudera technologies to deliver an enterprise data cloud. CDP includes the Cloudera Data Warehouse and machine learning services as well as a Data Hub service for building custom business applications.
$0.04
per CCU (hourly rate)
Oracle Autonomous Data Warehouse
Score 8.3 out of 10
N/A
Oracle Autonomous Data Warehouse is optimized for analytic workloads, including data marts, data warehouses, data lakes, and data lakehouses. With Autonomous Data Warehouse, data scientists, business analysts, and nonexperts can discover business insights using data of any size and type. The solution is built for the cloud and optimized using Oracle Exadata.
N/A
Saturn Cloud
Score 7.7 out of 10
N/A
Saturn Cloud is an ML platform for individuals and teams, available on multiple clouds: AWS, Azure, GCP, and OCI. It provides access to computing resources with customizable amounts of memory and power, including GPUs and Dask distributed computing clusters, in a wholly hosted environment. Saturn Cloud is presented as flexible and straightforward for new data scientists while giving senior and experienced staff the
capabilities and configurability they need.…
I have seen that Cloudera Data Platform is well suited for large batch processes. It works really well for our indication analyses that are performed by the actuaries. I feel that rapid streaming operations may be a situation where additional technology would be needed to provide for a robust solution.
II would recommend Oracle Autonomous Data Warehouse to someone looking to fully automate the transferring of data especially in a warehouse scenario though I can see the elasticity of the suite that is offered and can see it is applicable in other scenarios not just warehouses.
Saturn Cloud is a powerful data science platform that offers numerous benefits to organizations. It simplifies and streamlines the development, deployment, and scaling of data science and machine learning models. The platform addresses common business problems such as scalability, collaboration, efficiency, and cost-effectiveness. With Saturn Cloud, organizations can easily handle large datasets and complex computations, collaborate effectively among data science teams, automate repetitive tasks, optimize workflows, and utilize flexible and cost-efficient cloud resources. By leveraging Saturn Cloud, organizations can accelerate their data science projects, improve productivity, and achieve better outcomes in areas such as predictive modeling, recommendation systems, fraud detection, and more.
Very easy and fast to load data into the Oracle Autonomous Data Warehouse
Exceptionally fast retrieval of data joining 100 million row table with a billion row table plus the size of the database was reduced by a factor of 10 due to how Oracle store[s] and organise[s] data and indexes.
Flexibility with scaling up and down CPU on the fly when needed, and just stop it when not needed so you don't get charged when it is not running.
It is always patched and always available and you can add storage dynamically as you need it.
It is very expensive product. But not to mention, there's good reasons why it is expensive.
The product should support more cloud based services. When we made the decision to buy the product (which was 20 years ago,) there was no such thing to consider, but moving to a cloud based data warehouse may promise more scalability, agility, and cost reduction. The new version of Data Warehouse came out on the way, but it looks a bit behind compared to other competitors.
Our healthcare data consists of 30% coded data (such as ICD 10 / SNOMED C,T) but the rests is narrative (such as clinical notes.). Oracle is the best for warehousing standardized data, but not a good choice when considering unstructured data, or a mix of the two.
While Saturn Cloud offers a range of pre-built templates and workflows, there is currently limited support for customization. For example, users may not be able to modify the pre-configured environments that come with the templates, or may find it difficult to integrate their own custom libraries and tools. Offering more flexibility in this area could help users tailor the platform to their specific needs and workflows.
While Saturn Cloud offers a variety of pre-built environments for data science and machine learning workloads, some users may prefer to use custom Docker images instead. However, the platform currently has limited support for Docker, which can be a limitation for users who need to work with specific dependencies or custom libraries. Adding more robust support for Docker could help to make the platform more versatile and adaptable to a wider range of use cases.
Does not require continous attention from the DBA, autonomous features allows the database to perform most of the regular admin tasks without need for human intervention.
Allows to integrate multiple data sources on a central data warehouse, and explode the information stored with different analytic and reporting tools.
This is user friendly , better than its counterparts. Anyone familiar working with other cloud solutions for GPU will agree on this. Hence the rating of 10 was given to this. I personally love the fact that I get so much compute time for being a free user which is very efficient in terms of budget
We have utilized Cloudera support quite frequently and are very satisfied with the capability and responsiveness of that team. Often, the new features delivered with the platform give us an opportunity to mature the way we're doing things, and the support team have been valuable in developing those new patterns.
Understanding Oracle Cloud Infrastructure is really simple, and Autonomous databases are even more. Using shared or dedicated infrastructure is one of the few things you need to consider at the moment of starting provisioning your Oracle Autonomous Data Warehouse.
IBM's offering of the Cloud Pak for Data has been a moving target and difficult to compare to Cloudera Data Platform. We have implemented our solution on Amazon Web Services, which appears to be supported by IBM at this point, but the migration would be very expensive for us to endeavor.
As I mentioned, I have also worked with Amazon Redshift, but it is not as versatile as Oracle Autonomous Data Warehouse and does not provide a large variety of products. Oracle Autonomous Data Warehouse is also more reliable than Amazon Redshift, hence why I have chosen it
Saturn Cloud provides an R server, that's super important. Even you can write R on CoLab with different settings, but it is inconvenient and slow. Saturn Cloud can give me a different IDE environment that I'm more used to, even if I'm using Python. Whereas CoLab is more dedicated to Jupyter notebook
Overall the business objective of all of our clients have been met positively with Oracle Data Warehouse. All of the required analysis the users were able to successfully carry out using the warehouse data.
Using a 3-tier architecture with the Oracle Data Warehouse at the back end the mid-tier has been integrated well. This is big plus in providing the necessary tools for end users of the data warehouse to carry out their analysis.
All of the various BI products (OBIEE, Cognos, etc.) are able to use and exploit the various analytic built-in functionalities of the Oracle Data Warehouse.
Although we are still in the implementation phase with Saturn Cloud, we anticipate significant positive impacts on our business objectives.
The platform is expected to enhance our computational capabilities with its easy access to top-tier NVIDIA GPUs, which should accelerate our AI and machine learning projects. We believe this will lead to reduced development times and faster deployment of our generative AI models.
While Saturn Cloud provides excellent computational resources and reliable uptime, I find that their user interface could be improved. The UI can be unintuitive at times, making it a bit challenging to navigate and configure certain settings. Enhancing the user interface to be more streamlined and user-friendly would significantly improve the overall experience. Having pre-configured stacks readily available would also save time and make the platform even more efficient to use.