What users are saying about

Apache Drill

3 Ratings

Apache Pig

18 Ratings

Apache Drill

3 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 8.1 out of 101

Apache Pig

18 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow'>trScore algorithm: Learn more.</a>
Score 7.3 out of 101

Add comparison

Likelihood to Recommend

Apache Drill

if you're doing joins from hBASE, hdfs, cassandra and redis, then this works.Using it as a be all end all does not suit it. This is not your straight forward magic software that works for all scenarios. One needs to determine the use case to see if Apache Drill fits the needs. 3/4 of the time, usually it does.
Anson Abraham profile photo

Apache Pig

- Custom load, store, filter functionalities are needed and writing Java map reduce code is not an option due susceptible to bugs.- Chain multiple MR jobs into one pig job.
No photo available

Pros

  • queries multiple data sources with ease.
  • supports sql, so non technical users who know sql, can run query sets
  • 3rd party tools, like tableau, zoom data and looker were able to connect with no issues
Anson Abraham profile photo
  • Apache pig DSL provides a better alternative to Java map reduce code and the instruction set is very easy to learn and master.
  • It has many advanced features built-in such as joins, secondary sort, many optimizations, predicate push-down, etc.
  • When Hive was not very advanced (extremely slow) few years ago, pig has always been the go to solution. Now with Spark and Hive (after significant updates), the need to learn apache pig may be questionable.
No photo available

Cons

  • deployment. Not as easy
  • configuration isn't as straight forward, especially with the documentation
  • Garbage collection could be improved upon
Anson Abraham profile photo
  • Improve Spark support and compatibility
  • Spark and Hive are already being used main-stream, both of them have an instruction set that is easier to learn and master in a matter of days. While apache pig used to be a great alternative to writing java map reduce, Hive after significant updates is now either equal or better than pig.
No photo available

Likelihood to Renew

Apache Drill7.0
Based on 1 answer
if Presto comes up with more support (ie hbase, s3), then its strongly possible that we'll move from apache drill to prestoDB. However, Apache drill needs more configuration ease, especially when it comes to garbage collection tuning. If apache drill could support also sparkSQL and Flume, then it does change drill into being something more valuable than prestoDB
Anson Abraham profile photo
No score
No answers yet
No answers on this topic

Usability

No score
No answers yet
No answers on this topic
Apache Pig10.0
Based on 1 answer
It is quick, fast and easy to implement Apache Pig which makes is quite popular to be used.
Subhadipto Poddar profile photo

Alternatives Considered

compared to presto, has more support than prestodb.Impala has limitations to what drill can supportapache phoenix only supports for hbase. no support for cassandra. Apache drill was chosen, because of the multiple data stores that it supports htat the other 3 do not support. Presto does not support hbase as of yet. Impala does not support query to cassandra
Anson Abraham profile photo
I use both Apache Pig and its alternatives like Apache Spark & Apache Hive. Apache Pig was one of the best options in Big Data's initial stages. But now alternatives have taken over the market, rendering Apache Pig behind in the competition. But it is still a better alternative to Map Reduce. It is also a good option for working with unstructured datasets. Moreover, in certain cases, Apache Pig is much faster than Hive & Spark.
Kartik Chavan profile photo

Return on Investment

  • Configuration has taken some serious time out.
  • Garbage collection tuning. is a constant hassle. time and effort applied to it, vs dedicating resources elsewhere.
  • w/ sql support, reduces the need of devs to generate the resultset for analysts, when they can run queries themselves (if they know sql).
Anson Abraham profile photo
  • Return on Investments are significant considering what it can do with traditional analysis techniques. But, other alternatives like Apache Spark, Hive being more efficient, it is hard to stick to Apache Pig.
  • It can handle large datasets pretty easily compared to SQL. But, again, alternatives are more efficient.
  • While working on unstructured, decentralized dataset, Pig is highly beneficial, as it is not a complete deviation from SQL, but it does not take you in complexity MapReduce as well.
Kartik Chavan profile photo

Pricing Details

Apache Drill

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details

Apache Pig

General
Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No
Additional Pricing Details