Apache Pig vs. Google BigQuery

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Apache Pig
Score 8.2 out of 10
N/A
Apache Pig is a programming tool for creating MapReduce programs used in Hadoop.N/A
Google BigQuery
Score 8.7 out of 10
N/A
Google's BigQuery is part of the Google Cloud Platform, a database-as-a-service (DBaaS) supporting the querying and rapid analysis of enterprise data.
$4
per 100 slots
Pricing
Apache PigGoogle BigQuery
Editions & Modules
No answers on this topic
Queries (Hourly Flex Slots)
$4
per 100 slots
Queries (On-Demand)
$5
per TB
Queries (Annual Flat Rate)
$1,700
per 100 slots
Queries (Monthly Flat Rate)
$2000
per 100 slots
Offerings
Pricing Offerings
Apache PigGoogle BigQuery
Free Trial
NoNo
Free/Freemium Version
NoNo
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Community Pulse
Apache PigGoogle BigQuery
Considered Both Products
Apache Pig
Chose Apache Pig
It takes me less time to write a Pig script than get a Spark program running for batch ETL workloads. Compared to Spark, Pig has a steeper learning curve because it employs a proprietary programming language. In one script and one fine, it can handle both Map Reduce and Hadoop. …
Google BigQuery
Chose Google BigQuery
Comparing to competitors, Google BigQuery has the lowest cost and most flexible pricing model. Definitely higher ROI.
Top Pros
Top Cons
Features
Apache PigGoogle BigQuery
Database-as-a-Service
Comparison of Database-as-a-Service features of Product A and Product B
Apache Pig
-
Ratings
Google BigQuery
8.8
30 Ratings
1% above category average
Automatic software patching00 Ratings8.717 Ratings
Database scalability00 Ratings9.430 Ratings
Automated backups00 Ratings8.924 Ratings
Database security provisions00 Ratings9.124 Ratings
Monitoring and metrics00 Ratings7.926 Ratings
Automatic host deployment00 Ratings8.813 Ratings
Best Alternatives
Apache PigGoogle BigQuery
Small Businesses

No answers on this topic

IBM Cloudant
IBM Cloudant
Score 9.3 out of 10
Medium-sized Companies
Cloudera Manager
Cloudera Manager
Score 9.7 out of 10
IBM Cloudant
IBM Cloudant
Score 9.3 out of 10
Enterprises
IBM Analytics Engine
IBM Analytics Engine
Score 9.1 out of 10
IBM Cloudant
IBM Cloudant
Score 9.3 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
Apache PigGoogle BigQuery
Likelihood to Recommend
8.0
(9 ratings)
8.9
(31 ratings)
Usability
10.0
(1 ratings)
9.5
(3 ratings)
Support Rating
6.0
(2 ratings)
10.0
(16 ratings)
Contract Terms and Pricing Model
-
(0 ratings)
10.0
(1 ratings)
Professional Services
-
(0 ratings)
8.4
(2 ratings)
User Testimonials
Apache PigGoogle BigQuery
Likelihood to Recommend
Apache
Apache Pig is best suited for ETL-based data processes. It is good in performance in handling and analyzing a large amount of data. it gives faster results than any other similar tool. It is easy to implement and any user with some initial training or some prior SQL knowledge can work on it. Apache Pig is proud to have a large community base globally.
Read full review
Google
One of the most important aspects while working with data warehousing solutions and analytics is the ability to handle large datasets. Google BigQuery is the best in business for that particular aspect. It is ridiculously fast while handling large data sets. Another aspect where it is well suited is the ability to integrate it with data visualization tools like Data Studio. It is fast, easy to use, and very reliable. The only aspect where I feel it is less appropriate where you have to pay more of inefficient scripts and that can hamper the growth of the company a bit.
Read full review
Pros
Apache
  • Its performance, ease of use, and simplicity in learning and deployment.
  • Using this tool, we can quickly analyze large amounts of data.
  • It's adequate for map-reducing large datasets and fully abstracted MapReduce.
Read full review
Google
  • BigQuery is ridiculously fast and has the ability to query absurdly large data sets to return results immediately.
  • BigQuery allows for storage of a massive amount of data for relatively low prices.
  • Easy to learn. BiqQuery uses SQL-like queries and is easy to transfer your existing skills to use.
Read full review
Cons
Apache
  • UDFS Python errors are not interpretable. Developer struggles for a very very long time if he/she gets these errors.
  • Being in early stage, it still has a small community for help in related matters.
  • It needs a lot of improvements yet. Only recently they added datetime module for time series, which is a very basic requirement.
Read full review
Google
  • One issue with Google Cloud Storage is its price. For one to have that premium Google Cloud Storage, for the purpose of massive storage, he/she must have adequate cash. Otherwise, Google Cloud Storage is a safe and perfect online storage platform.
  • The only thing that can come to mind that would be annoying with this software was that sometimes when trying to share files on the Cloud with coworkers, it would just not share at all, or there would be a massive delay in when I shared them and when they received them. Other than that though, everything is perfect with this.
Read full review
Usability
Apache
It is quick, fast and easy to implement Apache Pig which makes is quite popular to be used.
Read full review
Google
web UI is easy and convenient. Many RDBMS clients such as aqua data studio, Dbeaver data grid, and others connect. Range of well-documented APIs available. The range of features keeps expanding, increasing similar features to traditional RDBMS such as Oracle and DB2
Read full review
Support Rating
Apache
The documentation is adequate. I'm not sure how large of an external community there is for support.
Read full review
Google
It’s Google, they’re big and well organized, the documentation is abundant and the scalability is amazing. The UX is good too, considering it’s a professional tool expected to be used by people with a specific technical background. Overall, it makes me feels good and secure that we know where to store the data, how to use that data and that the data is handled with utmost security and performance practices.
Read full review
Alternatives Considered
Apache
Apache Pig might help to start things faster at first and it was one of the best tool years back but it lacks important features that are needed in the data engineering world right now. Pig also has a steeper learning curve since it uses a proprietary language compared to Spark which can be coded with Python, Java.
Read full review
Google
Spinning up, provisioning, maintaining and debugging a Hadoop solution can be non-trivial, painful. I'm talking about both GCE based or HDInsight clusters. It requires expertise (+ employee hire, costs). With BigQuery if someone has a good SQL knowledge (and maybe a little programming), can already start to test and develop. All of the infrastructure and platform services are taken care of. Google BigQuery is a magnitudes simpler to use than Hadoop, but you have to evaluate the costs. BigQuery billing is dependent on your data size and how much data your query touches.
Read full review
Contract Terms and Pricing Model
Apache
No answers on this topic
Google
None so far. Very satisfied with the transparency on contract terms and pricing model.
Read full review
Professional Services
Apache
No answers on this topic
Google
Google Support has kindly provide individual support and consultants to assist with the integration work. In the circumstance where the consultants are not present to support with the work, Google Support Helpline will always be available to answer to the queries without having to wait for more than 3 days.
Read full review
Return on Investment
Apache
  • Higher learning curve than other similar technologies so on-boarding new engineers or change ownership of Apache Pig code tends to be a bit of a headache
  • Once the language is learned and understood it can be relatively straightforward to write simple Pig scripts so development can go relatively quickly with a skilled team
  • As distributed technologies grow and improve, overall Apache Pig feels left in the dust and is more legacy code to support than something to actively develop with.
Read full review
Google
  • Google BigQuery has had enormous impact in terms of ROI to our business, as it has allowed us to ease our dependence on our physical servers, which we pay for monthly from another hosting service. We have been able to run multiple enterprise scale data processing applications with almost no investment
  • Since our business is highly client focused, Google Cloud Platform, and BigQuery specifically, has allowed us to get very granular in how our usage should be attributed to different projects, clients, and teams.
  • Plain and simple, I believe the meager investments that we have made in Google BigQuery have paid themselves back hundreds of times over.
Read full review
ScreenShots