22 Reviews and Ratings
4 Reviews and Ratings
No answers on this topic
Apache Pig is best suited for ETL-based data processes. It is good in performance in handling and analyzing a large amount of data. it gives faster results than any other similar tool. It is easy to implement and any user with some initial training or some prior SQL knowledge can work on it. Apache Pig is proud to have a large community base globally.Incentivized
What you have are different strategies for data encoding, which makes the process quite flexible, it is perfectly done so that a joint and collaborative work can be carried out, this information analyzed in large quantities, is extremely vital for the company, by giving it the correct and timely readingIncentivized
Its performance, ease of use, and simplicity in learning and deployment.Using this tool, we can quickly analyze large amounts of data.It's adequate for map-reducing large datasets and fully abstracted MapReduce.Incentivized
Ultra fast query results.IN Memory Database.Easy integration to reporting services.Incentivized
UDFS Python errors are not interpretable. Developer struggles for a very very long time if he/she gets these errors.Being in early stage, it still has a small community for help in related matters.It needs a lot of improvements yet. Only recently they added datetime module for time series, which is a very basic requirement.Incentivized
Problems Could Be Encountered is particularly pronounced in more complex analyses.Categorical variables are often not precise enoughIncentivized
It is quick, fast and easy to implement Apache Pig which makes is quite popular to be used.Incentivized
The documentation is adequate. I'm not sure how large of an external community there is for support.Incentivized
Apache Pig might help to start things faster at first and it was one of the best tool years back but it lacks important features that are needed in the data engineering world right now. Pig also has a steeper learning curve since it uses a proprietary language compared to Spark which can be coded with Python, Java. Incentivized
We selected Kognitio because of the legacy systems that are still running. Also, we have legacy systems in place that are fit for Kognitio. End-user has good feedback on our side when we started implementing this solution. Current servers are compatible with Kognitio in place.Incentivized
Higher learning curve than other similar technologies so on-boarding new engineers or change ownership of Apache Pig code tends to be a bit of a headacheOnce the language is learned and understood it can be relatively straightforward to write simple Pig scripts so development can go relatively quickly with a skilled teamAs distributed technologies grow and improve, overall Apache Pig feels left in the dust and is more legacy code to support than something to actively develop with.Incentivized
The implementation of the formats to integrate the users we have and the program is also good.I also improve the control of aspects related to the work environmentIncentivized