A great ETL tool for your big data
May 23, 2022

Anonymous | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User

Overall Satisfaction with Apache Pig

We are working on a large data analytics project that involves big data, large datasets, and databases. We chose Apache Pig because it helps us explore and process large datasets. It supports several execution modes, including a local mode that runs in a single Java Virtual Machine. Apache Pig is fairly easy to learn and use, and its data model is rich, with nested data structures. We have used it mostly when we needed analytical insights from our sample data.
  • It provides great support for large datasets and ad-hoc reporting.
  • It offers almost the full set of operators for actions such as Join, Sort, and Merge.
  • Anybody can use Apache Pig with some initial training, and it feels very familiar to anyone who knows SQL.
  • It can handle almost all structured and unstructured data.
  • Apache Pig is built around data flows, so users can easily see every processing step and the data it produces.
  • One of the most important limitations of Apache Pig is that it does not support OLTP (Online Transaction Processing); it only supports OLAP (Online Analytical Processing).
  • Apache Pig has very high latency compared to MapReduce.
  • Apache Pig is designed for ETL and thus not perfectly suited for real-time analysis.
  • The training materials are hard to learn from and need improvement.
  • Apache Pig helps us in processing our large datasets for data analytics.
  • Apache Pig lets us express MapReduce processing in a single script file.
  • Apache Pig has good training materials for users, although they require some improvements.
  • It helps us provide local and remote interoperability.
  • Apache Pig is best known for its fast execution of data processing (+ROI).
  • Scaled-up, large-scale parallel processing of data.
  • It saves us time in data processing (+ROI).
  • Large community base for quick resolutions (+ROI).
  • Compatibility with other third-party applications and tools (-ROI).

Do you think Apache Pig delivers good value for the price?

Yes

Are you happy with Apache Pig's feature set?

Yes

Did Apache Pig live up to sales and marketing promises?

I wasn't involved with the selection/purchase process

Did implementation of Apache Pig go as expected?

I wasn't involved with the implementation phase

Would you buy Apache Pig again?

Yes

Apache Pig is best suited for ETL-based data processes. It performs well when handling and analyzing large amounts of data. It gives faster results than any other similar tool. It is easy to implement, and any user with some initial training or prior SQL knowledge can work with it. Apache Pig also has a large global community.
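
To give a feel for the data-flow style described in this review, here is a minimal Python sketch that mimics the shape of a typical Pig pipeline: load, filter, group, aggregate, and sort. It is not Pig Latin and does not use any Pig API; the sample records, the function names, and the 10-second threshold are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical sample records: (user, page, seconds_on_page).
visits = [
    ("alice", "home", 12),
    ("bob", "home", 30),
    ("alice", "pricing", 45),
    ("carol", "docs", 5),
    ("bob", "docs", 20),
]


def filter_rows(rows, min_seconds):
    """Roughly analogous to a FILTER step: keep rows at or above a threshold."""
    return [row for row in rows if row[2] >= min_seconds]


def group_by_user(rows):
    """Roughly analogous to a GROUP BY step: collect the rows for each user."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[0]].append(row)
    return dict(groups)


def total_seconds(groups):
    """Roughly analogous to a per-group SUM: one (user, total) pair per group."""
    return [(user, sum(row[2] for row in rows)) for user, rows in groups.items()]


def order_desc(totals):
    """Roughly analogous to an ORDER BY ... DESC step."""
    return sorted(totals, key=lambda pair: pair[1], reverse=True)


def run_pipeline(rows, min_seconds=10):
    """Chain the steps so each stage consumes the previous stage's output."""
    kept = filter_rows(rows, min_seconds)
    grouped = group_by_user(kept)
    totals = total_seconds(grouped)
    return order_desc(totals)


if __name__ == "__main__":
    for user, total in run_pipeline(visits):
        print(user, total)
```

Each stage takes the output of the previous one, which is what makes the intermediate results easy to inspect. In Pig, a comparable script would be a short sequence of named relations rather than Python functions.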