Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Apache Spark
Score 8.6 out of 10
N/A
N/AN/A
Google BigQuery
Score 8.6 out of 10
N/A
Google's BigQuery is part of the Google Cloud Platform, a database-as-a-service (DBaaS) supporting the querying and rapid analysis of enterprise data.
$6.25
per TiB (after the 1st 1 TiB per month, which is free)
Pricing
Apache SparkGoogle BigQuery
Editions & Modules
No answers on this topic
Standard edition
$0.04 / slot hour
Enterprise edition
$0.06 / slot hour
Enterprise Plus edition
$0.10 / slot hour
Offerings
Pricing Offerings
Apache SparkGoogle BigQuery
Free Trial
NoYes
Free/Freemium Version
NoYes
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Community Pulse
Apache SparkGoogle BigQuery
Considered Both Products
Apache Spark

No answer on this topic

Google BigQuery
Chose Google BigQuery
Other locally hosted solutions are capable of providing the required level of performance, but the administration requirements are significantly more involved than with BigQuery. Additionally, there are capacity and availability concerns with locally hosted platforms that are a …
Chose Google BigQuery
BigQuery by far the best solution in all angles compared to other ones: Especially scalability, ease of use, performance and there is no need to manage any cluster of servers. Also it's ABSOLUTELY pay as you go! No one in market currently provide such service that can compete …
Chose Google BigQuery
Comparing to competitors, Google BigQuery has the lowest cost and most flexible pricing model. Definitely higher ROI.
Top Pros
Top Cons
Features
Apache SparkGoogle BigQuery
Database-as-a-Service
Comparison of Database-as-a-Service features of Product A and Product B
Apache Spark
-
Ratings
Google BigQuery
8.4
53 Ratings
4% below category average
Automatic software patching00 Ratings8.117 Ratings
Database scalability00 Ratings8.853 Ratings
Automated backups00 Ratings8.524 Ratings
Database security provisions00 Ratings8.746 Ratings
Monitoring and metrics00 Ratings8.448 Ratings
Automatic host deployment00 Ratings8.113 Ratings
Best Alternatives
Apache SparkGoogle BigQuery
Small Businesses

No answers on this topic

SingleStore
SingleStore
Score 9.8 out of 10
Medium-sized Companies
Cloudera Manager
Cloudera Manager
Score 9.7 out of 10
SingleStore
SingleStore
Score 9.8 out of 10
Enterprises
IBM Analytics Engine
IBM Analytics Engine
Score 8.8 out of 10
SingleStore
SingleStore
Score 9.8 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
Apache SparkGoogle BigQuery
Likelihood to Recommend
9.9
(24 ratings)
8.6
(52 ratings)
Likelihood to Renew
10.0
(1 ratings)
7.0
(1 ratings)
Usability
10.0
(3 ratings)
9.4
(3 ratings)
Support Rating
8.7
(4 ratings)
10.0
(9 ratings)
Contract Terms and Pricing Model
-
(0 ratings)
10.0
(1 ratings)
Professional Services
-
(0 ratings)
8.2
(2 ratings)
User Testimonials
Apache SparkGoogle BigQuery
Likelihood to Recommend
Apache
Well suited: To most of the local run of datasets and non-prod systems - scalability is not a problem at all. Including data from multiple types of data sources is an added advantage. MLlib is a decently nice built-in library that can be used for most of the ML tasks. Less appropriate: We had to work on a RecSys where the music dataset that we used was around 300+Gb in size. We faced memory-based issues. Few times we also got memory errors. Also the MLlib library does not have support for advanced analytics and deep-learning frameworks support. Understanding the internals of the working of Apache Spark for beginners is highly not possible.
Read full review
Google
Google BigQuery really shines in scenarios requiring real-time analytics on large data streams and predictive analytics with its machine learning integration. Teams have been using it extensively all over. However, it may not be the best fit for organizations dealing with small datasets because of the higher costs. And also, it might not be the best fit for highly complex data transformations, where simpler or more specialized solutions could be more appropriate.
Read full review
Pros
Apache
  • Apache Spark makes processing very large data sets possible. It handles these data sets in a fairly quick manner.
  • Apache Spark does a fairly good job implementing machine learning models for larger data sets.
  • Apache Spark seems to be a rapidly advancing software, with the new features making the software ever more straight-forward to use.
Read full review
Google
  • Its serverless architecture and underlying Dremel technology are incredibly fast even on complex datasets. I can get answers to my questions almost instantly, without waiting hours for traditional data warehouses to churn through the data.
  • Previously, our data was scattered across various databases and spreadsheets and getting a holistic view was pretty difficult. Google BigQuery acts as a central repository and consolidates everything in one place to join data sets and find hidden patterns.
  • Running reports on our old systems used to take forever. Google BigQuery's crazy fast query speed lets us get insights from massive datasets in seconds.
Read full review
Cons
Apache
  • Memory management. Very weak on that.
  • PySpark not as robust as scala with spark.
  • spark master HA is needed. Not as HA as it should be.
  • Locality should not be a necessity, but does help improvement. But would prefer no locality
Read full review
Google
  • It is challenging to predict costs due to BigQuery's pay-per-query pricing model. User-friendly cost estimation tools, along with improved budget alerting features, could help users better manage and predict expenses.
  • The BigQuery interface is less intuitive. A more user-friendly interface, enhanced documentation, and built-in tutorial systems could make BigQuery more accessible to a broader audience.
Read full review
Likelihood to Renew
Apache
Capacity of computing data in cluster and fast speed.
Read full review
Google
We have to use this product as its a 3rd party supplier choice to utilise this product for their data side backend so will not be likely we will move away from this product in the future unless the 3rd party supplier decides to change data vendors.
Read full review
Usability
Apache
The only thing I dislike about spark's usability is the learning curve, there are many actions and transformations, however, its wide-range of uses for ETL processing, facility to integrate and it's multi-language support make this library a powerhouse for your data science solutions. It has especially aided us with its lightning-fast processing times.
Read full review
Google
web UI is easy and convenient. Many RDBMS clients such as aqua data studio, Dbeaver data grid, and others connect. Range of well-documented APIs available. The range of features keeps expanding, increasing similar features to traditional RDBMS such as Oracle and DB2
Read full review
Support Rating
Apache
1. It integrates very well with scala or python. 2. It's very easy to understand SQL interoperability. 3. Apache is way faster than the other competitive technologies. 4. The support from the Apache community is very huge for Spark. 5. Execution times are faster as compared to others. 6. There are a large number of forums available for Apache Spark. 7. The code availability for Apache Spark is simpler and easy to gain access to. 8. Many organizations use Apache Spark, so many solutions are available for existing applications.
Read full review
Google
BigQuery can be difficult to support because it is so solid as a product. Many of the issues you will see are related to your own data sets, however you may see issues importing data and managing jobs. If this occurs, it can be a challenge to get to speak to the correct person who can help you.
Read full review
Alternatives Considered
Apache
All the above systems work quite well on big data transformations whereas Spark really shines with its bigger API support and its ability to read from and write to multiple data sources. Using Spark one can easily switch between declarative versus imperative versus functional type programming easily based on the situation. Also it doesn't need special data ingestion or indexing pre-processing like Presto. Combining it with Jupyter Notebooks (https://github.com/jupyter-incubator/sparkmagic), one can develop the Spark code in an interactive manner in Scala or Python
Read full review
Google
I have used Snowflake and DataGrip for data retrieval as well as Google BigQuery and can say that all these tools compete for head to head. It is very difficult to say which is better than the other but some features provided by Google BigQuery give it an edge over the others. For example, the reliability of Google is unmatchable by others. One thing that I really like is the ability to integrate Data Studio so easily with Google BigQuery.
Read full review
Contract Terms and Pricing Model
Apache
No answers on this topic
Google
None so far. Very satisfied with the transparency on contract terms and pricing model.
Read full review
Professional Services
Apache
No answers on this topic
Google
Google Support has kindly provide individual support and consultants to assist with the integration work. In the circumstance where the consultants are not present to support with the work, Google Support Helpline will always be available to answer to the queries without having to wait for more than 3 days.
Read full review
Return on Investment
Apache
  • Faster turn around on feature development, we have seen a noticeable improvement in our agile development since using Spark.
  • Easy adoption, having multiple departments use the same underlying technology even if the use cases are very different allows for more commonality amongst applications which definitely makes the operations team happy.
  • Performance, we have been able to make some applications run over 20x faster since switching to Spark. This has saved us time, headaches, and operating costs.
Read full review
Google
  • Pricing has been very reasonable for us. The first 10 GB of storage is free each month and costs start at 2 cents per GB per month after that. For example, if you store 1 terabyte (TB) for a month, then the cost would be $20. Streaming data inserts start at 1 cent per 200 megabytes (MBs). The first 1 TB of queries is free, with additional analysis at $5 per TB thereafter. Meta data operations are free.
  • Big Query helps reduce the bar for data analytics, ML and AI. BQ takes care of mundane tasks and streamlines for easy data processing, consumption. The most impressive thing is the ML and AI integration as SQL functions, so the need for moving data around is minimized.
  • The visuals of ML models is very helpful to fine tune training, model building and prediction, etc.
Read full review
ScreenShots

Google BigQuery Screenshots

Screenshot of Migrating data warehouses to BigQuery - Features a streamlined migration path from Netezza, Oracle, Redshift, Teradata, or Snowflake to BigQuery using the fully managed BigQuery Migration Service.Screenshot of bringing any data into BigQuery - Data files can be uploaded from local sources, Google Drive, or Cloud Storage buckets, using BigQuery Data Transfer Service (DTS), Cloud Data Fusion plugins, by replicating data from relational databases with Datastream for BigQuery, or by leveraging Google's data integration partnerships.Screenshot of generative AI use cases with BigQuery and Gemini models - Data pipelines that blend structured data, unstructured data and generative AI models together can be built to create a new class of analytical applications. BigQuery integrates with Gemini 1.0 Pro using Vertex AI. The Gemini 1.0 Pro model is designed for higher input/output scale and better result quality across a wide range of tasks like text summarization and sentiment analysis. It can be accessed using simple SQL statements or BigQuery’s embedded DataFrame API from right inside the BigQuery console.Screenshot of insights derived from images, documents, and audio files, combined with structured data - Unstructured data represents a large portion of untapped enterprise data. However, it can be challenging to interpret, making it difficult to extract meaningful insights from it. Leveraging the power of BigLake, users can derive insights from images, documents, and audio files using a broad range of AI models including Vertex AI’s vision, document processing, and speech-to-text APIs, open-source TensorFlow Hub models, or custom models.Screenshot of event-driven analysis - Built-in streaming capabilities automatically ingest streaming data and make it immediately available to query. This allows users to make business decisions based on the freshest data. Or Dataflow can be used to enable simplified streaming data pipelines.Screenshot of predicting business outcomes AI/ML - Predictive analytics can be used to streamline operations, boost revenue, and mitigate risk. BigQuery ML democratizes the use of ML by empowering data analysts to build and run models using existing business intelligence tools and spreadsheets.