What users are saying about
19 Ratings
trScore algorithm: learn more at https://www.trustradius.com/static/about-trustradius-scoring
Score 8.7 out of 10
24 Ratings
Score 8.5 out of 10

Likelihood to Recommend

Apache Pig

It is a great option for database pipelining and works very effectively with unstructured datasets. Because Apache Pig is a procedural language, unlike SQL, it is also easy to learn compared to the alternatives. Still, I would recommend alternatives like Apache Spark because of their wide range of advanced libraries, which saves the effort of writing things from scratch.
Kartik Chavan | TrustRadius Reviewer

Databricks Lakehouse Platform

Databricks has helped my teams write PySpark and Spark SQL jobs and test them before formally integrating them into Spark jobs. Through Databricks we can create Parquet and JSON output files. Data modelers and scientists who are less comfortable with coding can get good insight into the data using notebooks developed by the engineers.
Anonymous | TrustRadius Reviewer

Feature Rating Comparison

Platform Connectivity

Overall: Apache Pig, no score; Databricks Lakehouse Platform, 8.3
Connect to Multiple Data Sources: Apache Pig, no score; Databricks Lakehouse Platform, 9.0
Extend Existing Data Sources: Apache Pig, no score; Databricks Lakehouse Platform, 9.0
Automatic Data Format Detection: Apache Pig, no score; Databricks Lakehouse Platform, 7.0

Data Exploration

Overall: Apache Pig, no score; Databricks Lakehouse Platform, 6.0
Visualization: Apache Pig, no score; Databricks Lakehouse Platform, 6.0
Interactive Data Analysis: Apache Pig, no score; Databricks Lakehouse Platform, 6.0

Data Preparation

Overall: Apache Pig, no score; Databricks Lakehouse Platform, 8.0
Interactive Data Cleaning and Enrichment: Apache Pig, no score; Databricks Lakehouse Platform, 8.0
Data Transformations: Apache Pig, no score; Databricks Lakehouse Platform, 9.0
Data Encryption: Apache Pig, no score; Databricks Lakehouse Platform, 7.0
Built-in Processors: Apache Pig, no score; Databricks Lakehouse Platform, 8.0

Platform Data Modeling

Overall: Apache Pig, no score; Databricks Lakehouse Platform, 8.3
Multiple Model Development Languages and Tools: Apache Pig, no score; Databricks Lakehouse Platform, 9.0
Automated Machine Learning: Apache Pig, no score; Databricks Lakehouse Platform, 8.0
Single platform for multiple model development: Apache Pig, no score; Databricks Lakehouse Platform, 9.0
Self-Service Model Delivery: Apache Pig, no score; Databricks Lakehouse Platform, 7.0

Model Deployment

Overall: Apache Pig, no score; Databricks Lakehouse Platform, 7.5
Flexible Model Publishing Options: Apache Pig, no score; Databricks Lakehouse Platform, 7.0
Security, Governance, and Cost Controls: Apache Pig, no score; Databricks Lakehouse Platform, 8.0

Pros

Apache Pig

  • Iterative Development - you can define aliases/variables that are not executed immediately. They are stored in a DAG, which is evaluated only when an alias is dumped or stored.
  • Fast execution - Works with MapReduce, Tez, or Spark execution frameworks to provide fast run times at large scales.
  • Local and remote interoperability - Scripts can be tested on a small dataset locally with "pig -x local" before being run on the full dataset.
Jordan Moore | TrustRadius Reviewer
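
The deferred evaluation described in the first point above can be illustrated with a small Python analogy. This is only a sketch of the idea, not Pig's API: the `Alias` class, the `load` helper, and the alias names are all hypothetical. Each alias records how it would be computed and which aliases it depends on, and nothing runs until `dump()` is called.

```python
# Toy model of deferred (lazy) evaluation. NOT Pig's API, only an analogy.
class Alias:
    """A named pipeline step; nothing is computed until dump() is called."""

    def __init__(self, name, func, parents=()):
        self.name = name
        self.func = func        # how to compute this step from its parents
        self.parents = parents  # edges of the DAG

    def map(self, name, fn):
        return Alias(name, lambda rows: [fn(r) for r in rows], (self,))

    def filter(self, name, pred):
        return Alias(name, lambda rows: [r for r in rows if pred(r)], (self,))

    def dump(self, log=None):
        """Evaluate this alias by first evaluating the aliases it depends on."""
        inputs = [parent.dump(log) for parent in self.parents]
        if log is not None:
            log.append(self.name)  # record which steps actually ran
        return self.func(*inputs)


def load(name, rows):
    """Hypothetical source step that yields the given rows when evaluated."""
    return Alias(name, lambda: list(rows))


executed = []
raw = load("raw", [1, 2, 3, 4, 5, 6])
evens = raw.filter("evens", lambda x: x % 2 == 0)
squares = evens.map("squares", lambda x: x * x)
unused = raw.map("unused", lambda x: -x)  # defined but never dumped

# Only dumping triggers work, and only for the steps that feed `squares`.
result = squares.dump(executed)
print(result)    # [4, 16, 36]
print(executed)  # ['raw', 'evens', 'squares']
```

The `unused` alias never appears in `executed`, which mirrors the point that defining an alias costs nothing until something downstream is dumped or stored.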

Databricks Lakehouse Platform

  • Extremely Flexible in Data Scenarios
  • Fantastic Performance
  • Databricks is always updating the system, so we have the latest features.
Anonymous | TrustRadius Reviewer

Cons

Apache Pig

  • May not fit every need and a SQL-like abstraction may be more effective for some tasks (look at Spark-SQL, Hive, or even an actual DBMS)
  • All Pig jobs are written in a Domain Specific Language, so there is not much transferable knowledge
  • Writing your own User Defined Functions (UDFs) is a nice feature but can be painful to implement in practice
Anonymous | TrustRadius Reviewer

Databricks Lakehouse Platform

  • The navigation for creating a workspace is a bit confusing at first. It takes a couple of minutes to figure out how to create a folder and upload files, since it does not work like traditional file systems such as box.com.
  • Also, when you create a table, if you forget to copy the link where the table is stored, it is hard to locate it again. Most of the time I have to delete the table and re-create it.
Ann Le | TrustRadius Reviewer

Usability

Apache Pig

Apache Pig 10.0
Based on 1 answer
Apache Pig is quick and easy to implement, which makes it quite popular.
Subhadipto Poddar | TrustRadius Reviewer

Databricks Lakehouse Platform

Databricks Lakehouse Platform 9.0
Based on 1 answer
This has been very useful in my organization for shared notebooks, integrated data pipeline automation, and data source integrations. Integration with AWS is seamless. Non-technical users can easily learn how to use Databricks. You can connect your company LDAP to it for login-based access controls, to some extent.
Anonymous | TrustRadius Reviewer

Support Rating

Apache Pig

Apache Pig 6.0
Based on 2 answers
The documentation is adequate. I'm not sure how large of an external community there is for support.
Jordan Moore | TrustRadius Reviewer

Databricks Lakehouse Platform

No score
No answers yet
No answers on this topic

Alternatives Considered

Apache Pig

  • Used to provide better ways to optimize Hadoop jobs than Hive, but not anymore.
  • Spark DSL is much more advanced, and compute times are significantly lower.
Anonymous | TrustRadius Reviewer

Databricks Lakehouse Platform

Easier to set up and get started. Less of a learning curve.
Anonymous | TrustRadius Reviewer

Return on Investment

Apache Pig

  • Return on investment is significant considering what it can do compared with traditional analysis techniques. But with alternatives like Apache Spark and Hive being more efficient, it is hard to stick with Apache Pig.
  • It handles large datasets quite easily compared to SQL. But, again, the alternatives are more efficient.
  • When working on unstructured, decentralized datasets, Pig is highly beneficial: it is not a complete departure from SQL, yet it spares you the complexity of MapReduce.
Kartik Chavan | TrustRadius Reviewer

Databricks Lakehouse Platform

  • Rapid growth of analytics within our company.
  • Cost model aligns with usage allowing us to make a reasonable initial investment and scale the cost as we realize the value.
  • Platform is easy to learn and Databricks provides excellent support and training.
  • Platform does not require a large DevOps investment.
Anonymous | TrustRadius Reviewer

Pricing Details

Apache Pig

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Apache Pig Editions & Modules

Additional Pricing Details

Databricks Lakehouse Platform

General

Free Trial
Free/Freemium Version
Premium Consulting/Integration Services
Entry-level set up fee?
No

Databricks Lakehouse Platform Editions & Modules

Edition: Price
Standard: $0.07 per DBU
Premium: $0.10 per DBU
Enterprise: $0.13 per DBU
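
As a rough illustration of how per-DBU pricing scales with usage, the sketch below multiplies a DBU count by the edition rate. It reads the listed prices as $0.07, $0.10, and $0.13 per DBU, taking the trailing "1" on each price as the footnote marker for "Per DBU". The function name `estimate_cost` and the 1,000 DBU workload are hypothetical, and any charges beyond the DBU rate are not modeled.

```python
from decimal import Decimal

# Per-DBU rates in USD, as read from the edition list (an interpretation,
# with the trailing "1" on each price treated as a footnote marker).
RATE_PER_DBU = {
    "Standard": Decimal("0.07"),
    "Premium": Decimal("0.10"),
    "Enterprise": Decimal("0.13"),
}


def estimate_cost(edition, dbus):
    """Return the DBU charge in USD for an edition and a DBU count."""
    if dbus < 0:
        raise ValueError("dbus must be non-negative")
    return RATE_PER_DBU[edition] * dbus


# Hypothetical workload: 1,000 DBUs consumed in a billing period.
for edition in RATE_PER_DBU:
    print(f"{edition}: ${estimate_cost(edition, 1000)}")
```

`Decimal` is used instead of floats so that the currency arithmetic stays exact.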
Additional Pricing Details

Rating Summary

Likelihood to Recommend

Apache Pig
8.0
Databricks Lakehouse Platform
8.9

Usability

Apache Pig
10.0
Databricks Lakehouse Platform
9.0

Support Rating

Apache Pig
6.0
Databricks Lakehouse Platform
No score
