Using the Pentaho tools to solve ETL challenges
May 07, 2021
Score 10 out of 10
Vetted Review
Verified User
Overall Satisfaction with Pentaho
Before working for Hitachi Vantara, I had used the Pentaho tools mainly for personal projects. At Hitachi Vantara I had the chance to work directly with the teams that supported the Pentaho tools, and I can say objectively that they are among the best options on the market for ETL processes. Data science requires extracting data from different sources, organizing it, and transforming it as each use case requires. Machine learning is built on top of these concepts, and with the Pentaho tools you can accomplish most of it. Because I supported the Pentaho tools while working for Hitachi Vantara, my perspective is somewhat unusual: I saw the tools used to solve internal problems, such as integrations with our release tools and some of our agile tools, so the tools themselves were used to improve their newer versions.
Extracting metadata from thousands of files, organizing that information, filtering it to create release files, and determining how to create meta-information files is just one example of the ETL cycle that can be performed with the Pentaho tools.
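To make the extract, organize, and filter cycle described above more concrete, here is a minimal Python sketch of the same idea outside of Pentaho. It is only an illustration: the function names (`extract`, `transform`, `load`, `build_manifest`), the CSV manifest layout, and the `.jar`/`.zip` filter are all assumptions of mine, not how the Pentaho tools or the internal release process actually worked.

```python
import csv
import hashlib
from pathlib import Path


def extract(root):
    """Extract: yield one metadata record per file found under root."""
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            yield {
                "name": path.relative_to(root).as_posix(),
                "suffix": path.suffix.lower(),
                "size_bytes": path.stat().st_size,
                "sha256": hashlib.sha256(path.read_bytes()).hexdigest(),
            }


def transform(records, keep_suffixes):
    """Transform: keep only the file types of interest, ordered by name."""
    kept = [r for r in records if r["suffix"] in keep_suffixes]
    return sorted(kept, key=lambda r: r["name"])


def load(records, manifest_path):
    """Load: write the filtered records to a CSV manifest; return the row count."""
    fields = ["name", "suffix", "size_bytes", "sha256"]
    with open(manifest_path, "w", newline="") as handle:
        writer = csv.DictWriter(handle, fieldnames=fields)
        writer.writeheader()
        writer.writerows(records)
    return len(records)


def build_manifest(root, manifest_path, keep_suffixes=(".jar", ".zip")):
    """Run the three steps in order (hypothetical release-manifest example)."""
    return load(transform(extract(root), keep_suffixes), manifest_path)
```

Calling `build_manifest("some/build/dir", "manifest.csv")` would scan the directory and write one row per matching file. In Pentaho Data Integration the same flow would be assembled visually as transformation steps rather than written by hand.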
- Open source: the Pentaho tools have a free-to-use version with a lot of support.
- Performance: the Pentaho tools can be set up to process gigabytes of data seamlessly.
- Support from the open source developer community.
- Up-to-date documentation.
- The web versions of the Pentaho tools are limited to the server component.
- Worker node features are being improved, but more documentation and support are always welcome.
- Able to successfully extract data as required.
- Able to run complex data transformations from a UI view.
Snowflake and Salesforce have some components that overlap with the Pentaho tools, and the Pentaho tools integrate with both technologies to add more value for end users. The only weakness I can honestly find in the Pentaho tools right now is the lack of a powerful web interface for data transformations. There is a web component from which you can access existing data transformations created with the Pentaho Data Integration tool, but it only allows you to visualize a transformation and execute it remotely. A complete web interface for building transformations, alongside remote execution, would be excellent, and I expect we might see something like this at some point in the future.
Do you think Pentaho delivers good value for the price?
Yes
Are you happy with Pentaho's feature set?
Yes
Did Pentaho live up to sales and marketing promises?
I wasn't involved with the selection/purchase process
Did implementation of Pentaho go as expected?
Yes
Would you buy Pentaho again?
Yes