Google Cloud Dataproc

Overview

What is Google Cloud Dataproc?

Dataproc, on Google Cloud, is a fully managed and scalable service for running Apache Hadoop, Apache Spark, Apache Flink, Presto, and 30+ open source tools and frameworks. Dataproc is used for data lake modernization, ETL, and secure data science, at…

Pricing

N/A
Unavailable

Entry-level setup fee?

  • No setup fee
For the latest information on pricing, visit https://cloud.google.com/dataproc/prici…

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Alternatives Pricing

What is Devart Data Access Components?

Devart's Data Access Components is a component library for direct access to databases from Delphi, C++Builder, and Lazarus, supporting Windows, Mac OS X, iOS, Android, Linux, and FreeBSD on 32-bit and 64-bit platforms. Devart's Data Access Components, available in editions supporting a variety of…

What is MssqlMerge?

MssqlMerge is a diff and merge GUI tool for Microsoft SQL Server databases, used to compare and sync both schema and data changes. The application has a tabbed UI, with several types of tabs, each responsible for particular application features and a particular scope of tasks. The starting point is a Home tab -…

Product Details

What is Google Cloud Dataproc?

Dataproc, on Google Cloud, is a fully managed and scalable service for running Apache Hadoop, Apache Spark, Apache Flink, Presto, and 30+ open source tools and frameworks. Dataproc is used for data lake modernization, ETL, and secure data science, at scale, integrated with Google Cloud.

Key features

Fully managed and automated big data open source software
Serverless deployment, logging, and monitoring let users focus on data and analytics rather than on infrastructure. Dataproc reduces the TCO of Apache Spark management and, through integration with Vertex AI Workbench, enables data scientists and engineers to build and train models faster than with traditional notebooks. The Dataproc Jobs API makes it easy to incorporate big data processing into custom applications, while Dataproc Metastore eliminates the need to run a Hive metastore or catalog service.
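
As a rough illustration of the Jobs API mentioned above, the sketch below builds a PySpark job request and submits it with the google-cloud-dataproc Python client. The cluster name, Cloud Storage path, and arguments are placeholders, and the helper names build_pyspark_job and submit are hypothetical. Treat it as a minimal sketch under those assumptions rather than a recommended integration.

```python
def build_pyspark_job(cluster_name, main_python_file_uri, args=None):
    """Return a job description in the shape the Dataproc Jobs API expects."""
    job = {
        "placement": {"cluster_name": cluster_name},
        "pyspark_job": {"main_python_file_uri": main_python_file_uri},
    }
    if args:
        job["pyspark_job"]["args"] = list(args)
    return job


def submit(project_id, region, job):
    """Submit the job and block until it finishes.

    Requires the google-cloud-dataproc package and valid credentials.
    """
    # Imported lazily so the request builder above works without the package.
    from google.cloud import dataproc_v1

    client = dataproc_v1.JobControllerClient(
        client_options={"api_endpoint": f"{region}-dataproc.googleapis.com:443"}
    )
    operation = client.submit_job_as_operation(
        request={"project_id": project_id, "region": region, "job": job}
    )
    return operation.result()


# Placeholder values for illustration only; none come from the product page.
job = build_pyspark_job(
    "example-cluster", "gs://example-bucket/wordcount.py", ["--lines", "10"]
)
```

Calling submit("example-project", "us-central1", job) would send the request to a real cluster, so it is left out of the sketch itself.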

Containerize Apache Spark jobs with Kubernetes
Dataproc on Kubernetes lets users run Apache Spark jobs on Google Kubernetes Engine (GKE), which provides job portability and isolation.

Enterprise security integrated with Google Cloud
When creating a Dataproc cluster, users can enable Hadoop Secure Mode via Kerberos by adding a Security Configuration. The Google Cloud-specific security features most commonly used with Dataproc include default at-rest encryption, OS Login, VPC Service Controls, and customer-managed encryption keys (CMEK).
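
To make the security configuration above more concrete, here is a minimal sketch of a cluster definition that enables Kerberos and a customer-managed disk encryption key, using the field names of the Dataproc cluster resource as exposed by the Python client. The helper name build_secure_cluster and every project, key, and bucket name are hypothetical placeholders. Whether separate keys are used for the Kerberos password and for disk encryption is an assumption of this sketch.

```python
def build_secure_cluster(project_id, cluster_name, password_uri,
                         password_kms_key, disk_kms_key):
    """Return a cluster definition with Kerberos and CMEK enabled."""
    return {
        "project_id": project_id,
        "cluster_name": cluster_name,
        "config": {
            "security_config": {
                "kerberos_config": {
                    # Turns on Hadoop Secure Mode for the cluster.
                    "enable_kerberos": True,
                    # Cloud Storage URI of the KMS-encrypted root principal password.
                    "root_principal_password_uri": password_uri,
                    # KMS key used to decrypt that password.
                    "kms_key_uri": password_kms_key,
                }
            },
            # Customer-managed key for the cluster's persistent disks.
            "encryption_config": {"gce_pd_kms_key_name": disk_kms_key},
        },
    }


# Placeholder values for illustration only.
key = "projects/example-project/locations/global/keyRings/example-ring/cryptoKeys/example-key"
cluster = build_secure_cluster(
    "example-project",
    "secure-cluster",
    "gs://example-bucket/kerberos-root-password.encrypted",
    key,
    key,
)
```

A definition like this could then be passed as the cluster field of a create request through dataproc_v1.ClusterControllerClient.create_cluster, together with a project ID and region.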

The best of open source with the best of Google Cloud
Dataproc lets users keep the open source tools, algorithms, and programming languages they prefer while making it easy to apply them to cloud-scale datasets. At the same time, Dataproc has out-of-the-box integration with the rest of the Google Cloud analytics, database, and AI ecosystem. Data scientists and engineers can quickly access data and build data applications by connecting Dataproc to BigQuery, Vertex AI, Cloud Spanner, Pub/Sub, or Data Fusion.


Google Cloud Dataproc Technical Details

Operating Systems: Unspecified
Mobile Application: No

Reviews

Sorry, no reviews are available for this product yet
