A powerful ETL tool which is open source
Yinghua Hu | TrustRadius Reviewer
Updated March 01, 2016

Score 9 out of 10

Modules Used

  • kettle

Overall Satisfaction with Pentaho

Pentaho is used as the main ETL tool in the data analytics team. It solves the problem of processing and loading financial and ad-related data.
  • Populate relational databases
  • Transform and clean data
  • Create periodic jobs and generate reports
  • Aggregate data
  • It would be helpful to have modules supporting the Google AdWords, Facebook, and Twilio APIs.
  • It has an "add constant" module but no "multiply constant" module.
  • Unit transform module
Pentaho is more powerful and offers more functionality. It is also Java-based and therefore platform independent.
Pentaho is free and powerful. It is user friendly and has ample documentation.
There is one user, since only I have experience with it. The user type is Data Scientist, and it is used only within the data team.
The range of data sources includes MySQL and Impala, the two main sources we need. It is also very easy to set up.
I am not familiar with this functionality since I am the only user of Pentaho inside my company. But it is easy to distribute a simple report using "Send mail".
I haven't used Pentaho's visualization tool, but I would love to learn more if required.
Pentaho is well suited for ETL processing and database population. It is less appropriate for visualization and analytics. The key questions are the benefit it can bring, its cost, and its robustness.

Upgrading Pentaho

Yes - The new release is fully compatible with the earlier release. There was no unexpected impact.