Data Integration with Pentaho Kettle review
Overall Satisfaction with Pentaho
I used Pentaho Kettle as a team manager of development and later as a CIO. After that I opened a company that consults and implement business intelligence solutions. We are using Pentaho mostly with the data integration module. I think that of all the modules of Pentaho, Kettle is the most complete. It can give a "fair fight" to source solutions that are not open. The problem it addresses of course is to extract data from various sources; transform them; ”play with data”; and then load it to the target. I find the transformation most valuable and rich with functionality. I even made a full scale course about it, you can find it on udemy.
Pros
- Pentaho Kettle gives you a great graphic user interface to plan your transformation and jobs.
- Pentaho Kettle makes it easy to handle errors, logging and performance.
- Pentaho Kettle has dozen of great steps like: lookup and SCD functionality.
Cons
- Several steps have performance issues like the Json input.
- The community edition does not include scheduler and job manager so you need to figure it out yourself, unless of course you buy the Enterprise edition.
- I think that web service should be easier to operate.
We have experience with Informatica and Talend. I think that between Talend and Pentaho it's a close fight, although I prefer, personally, Pentaho Kettle (Larger community, more resources).
I think that you can say informatica is better than both of them but it is way more expensive and the differences are small.
Let's say, I didn't find something I can't do with Pentaho - maybe it took a little bit more creativity or code (java / javascript).
I think that you can say informatica is better than both of them but it is way more expensive and the differences are small.
Let's say, I didn't find something I can't do with Pentaho - maybe it took a little bit more creativity or code (java / javascript).
Comments
Please log in to join the conversation