Pentaho Reviews

109 Ratings
<a href='https://www.trustradius.com/static/about-trustradius-scoring' target='_blank' rel='nofollow noopener noreferrer'>trScore algorithm: Learn more.</a>
Score 7.0 out of 101

Do you work for this company? Manage this listing

Overall Rating

Reviewer's Company Size

Last Updated

By Topic

Industry

Department

Experience

Job Type

Role

Reviews (1-25 of 26)

No photo available
December 06, 2019

The best BI software

Score 8 out of 10
Vetted Review
Verified User
Review Source
Our company has technical offices in many municipalities in Spain. In them, we develop dashboards through the Pentaho suite. They are mainly developed by programmers from the IT department. Dashboards made with CDE improve the decision making of mayors.
  • Unlicensed.
  • Complete suite with many tools.
  • Integrates with third-party tools.
  • Custom reports.
  • More robust as payment solutions such as microstrategy.
  • More default templates in CDE.
Making custom dashboards and custom reports are very simple with CDE and the rest of the tools in the Pentaho suite. The improvement in decision making since we have implemented these dashboards is huge. The tool is not very difficult for a programmer with JavaScript knowledge, perhaps it lacks the robustness of payment tools such as MicroStrategy or PowerBI, but it is still appropriate for demanding production environments.
Read this authenticated review
No photo available
Score 6 out of 10
Vetted Review
Verified User
Review Source
Pentaho is used as our organizational-wide reporting and BI platform. It is bringing together data from multiple enterprise applications including ERPs, HRM, EAM and PLMs. We are using it to create both prepackaged PDF reports as well as interactive analysis reports where users can compose their on add-how reports.
  • It is relatively stable, robust and reliable.
  • Interactive analysis reports are very useful and relatively easy to use.
  • Relatively simple to integrate with other systems
  • The type of graphs and layouts proposed for diagrams and charts are a little bit limited and partly outdated.
  • For certain reports you can feel that it is not using the latest web technologies.
  • The tools for creating new report template designs could be improved.
Pentaho is well suited for bringing together data from multiple data sources in order to perform analyses and to create reports.

If you want to create very nicely looking interactive graphs using the latest web technologies, there might be other tools that are more suited. We have also seen performance problems with large amounts of data, but it is not certain if this only was the fault of Pentaho but could also be the infrastructure around.
Read this authenticated review
No photo available
Score 6 out of 10
Vetted Review
Verified User
Review Source
We rolled out Pentaho within one of our business units as a way to get at trapped data from an old home-grown CRM system that had been recently updated to .net. Pentaho quickly got us access to the data and the built-in ETL tools were easy and quick to use and learn. It enabled a team of two to provide enough data, reporting and analytical dashboards to support over $1.5B in sales.
  • The built-in ETL tools are easy to learn, and can quickly import and transform any data you have.
  • The excellent visualizations and charts are pleasing to the eye, and looks are important in sales and marketing presentations.
  • The rollout was fast, we installed the software and were building dashboards within minutes.
  • I think the relative obscurity of the tool is a downside, not as many developers, consultants or peers you can tap into.
  • Lack of a solid user community held us back, looking at Power BI and Qlik, they have huge user communities that help each other out. Would have liked that here.
  • Smaller company means smaller sales force, and the lack of a local presence made it hard to only interact online with the account rep. Other companies have someone local who often stops by with pre-sales developers to just pitch in free of charge when they have time.
The tool is fast, visually attractive and intuitive to use, but we also have an implementation of Qlik Sense and Pentaho pales in comparison to the overall capabilities. I recommend the tool, but everyone’s solution need is unique, so it may not be the right fit. Larger tools like Qlik are more versatile and easier to recommend. More expensive, yes, but you usually get what you pay for.
Read this authenticated review
Ilham AB profile photo
Score 8 out of 10
Vetted Review
Verified User
Review Source
Currently, Pentaho was using it for our internal IT Department. We are using Pentaho to address more specific needs to integrate and analyze the company data that resides on all systems. It is necessary to have a secondary system that runs for the same purpose to run small and simpler modeling tools for Data Warehouses and Business Intelligence. It also acts as a comparison system to build simple jobs and transformations.
  • Easy to create simple jobs and transformations
  • Mature data access and data transformation capabilities
  • Suitable for complex analytic use cases
  • Consume data from many sources
  • At first, one needs to learn the basic concept of this software before going for the implementation. Once you are an expert on this then you will rock it.
  • Some of the basic features need to be enhanced such as sharing and collaboration capabilities
  • Needs to support automatically generated analytic visualizations or models
Pentaho will work great to embed analytic content with full features of Machine Learning capabilities. It supports many machine-learning frameworks and can be easily integrated. So, you also can connect with many data sources and have no worries about it. In our case, we are using it to help us create many day-to-day jobs and transformations that our internal users need. But to achieve this, we need to train our team to be able to understand the main concepts of how this software works.
Read Ilham AB's full review
Mihaela Olariu profile photo
Score 9 out of 10
Vetted Review
Verified User
Review Source
We are currently using this application for all the companies. We have managed to develop a lot of reports and dashboards and also bursting reports. Until now we have been able to meet the requirements of reporting in the company in all departments. Pentaho is a good BI solution to also provide the option of creating many operational reports if you don't have a good option in your ERP.
  • SQL query + scripting will help you design different objects, tables, pivots, dashboards, linked t other reports.
  • Organizing in which way you want, synchronizing with AD is very helpful to set security/folder/group of users/single users.
  • By using Eclipse you can set another layer of security by filtering information users's need to access.
  • Sending reports y email also can be set up by security, by groups or list of users.
  • Design seems pretty secure, from time to time can fail when there is too much design.
  • Would like to have a format copy for different objects, tables, cross-tabs, grids, it would make design much more easier. Of course, we use template and CSS design for these objects, from time to time we design something different and it makes it much harder to do this manually for each similar object.
  • Possibility to use multiple DB in the .xaction.
  • Operational reports, big lists of COs, POs, DOs, master information and details. Bursting reports by email and scheduling tasks are interesting and very useful.
  • Complex aggregations are useful to use in cross tabs and calculated functions in the tables. I would recommend a new tool if the purpose is dashboards, but they still can be created in Pentaho as well.
Read Mihaela Olariu's full review
Nikhil Karkare profile photo
Score 9 out of 10
Vetted Review
Verified User
Review Source
Pentaho is the primary source for the Business Intelligence in my company. We have many Pentaho users across the whole organization that are making a good use of Pentaho Analyzer to create the reports. At the report side, it helps the business users to evaluate all the work that has been done by the different teams across the organizations. By seeing the reports they can check, for example, how many issues are open, how many bugs have been fixed in a specific duration, how many builds failed in which phase etc.
  • First thing what I have experienced about Pentaho is that it is user friendly. The best thing about Pentaho is Pentaho Data Integration. I have never used more user friendlier ETL tool like PDI. All the jobs and transformation steps are easy to understand. And I like the sample transformations and jobs that are provided with the package. It is so user friendly that, even if you don't know SQL, it will generate it for you. If you don't want to write scripts, that's fine, you can do it in PDI.
  • I have found that Pentaho can be integrated with any technology or framework. I have easily and successfully integrated it with HDFS, EMR, S3, CouchDB, many different RDBMSs. I would consider it as a strength of Pentaho. Also if you are stuck or you find any error, the type of logging will have an answer for you. I found the logging mechanism very effective.
  • I have mostly used Pentaho Analyzer and Schema Workbench at the BI side. It is user friendly too and we have a very few users who come to the developers to help them understanding the UI of PUC.
  • Most of the companies use star schema in their Data Warehouses but they are not the pure star schemas. There are the bridge tables, group tables but when using Schema Workbench to design a cube, it gets very painful for the developers to accommodate such schema in it. To do this, I have to go to the XML file and add the new elements. I would love to see the feature where Schema Workbench can accommodate the bridge tables as they are the part of star schema too.
  • When it comes to ETL, I have found PDI to be the best tool, but at the report side, it is not as good as the other tools available in the market. Especially the users always complain about the graphs in the Pentaho Analyzer. I think. the UI needs a lot of improvement.
  • PDI is slow reading the JSON files. There is a fast JSON input step available in the marketplace but I think I would be great if Pentaho can make the JSON reading even faster.
  • When I export the repository, I see the files names are encoded with UTF-8 encoding. It would be great, if the spaces and the special characters can be preserved while exporting a BA repository.
Well suited scenarios: When we need to deal with any type of RDBMS, from data input to the data loading, Pentaho is super fast. It has many bulk loaders available too. Dealing with the tables is the specialty of Pentaho.

Less Appropriate scenarios: When you have a star schemas with the bridge tables or snow-flake schemas, you will need a lot of additional work to be done in Pentaho apparently. Also, dealing with the files is not bad, but it should be improved.
** My review is for Pentaho 5.4.0.8 or previous releases.
Read Nikhil Karkare's full review
Gordon Yeh profile photo
Score 8 out of 10
Vetted Review
Verified User
Review Source
Pentaho is used for scheduled ETL between Remedy and other platforms to help centralize and enrich data.
  • Easy to use!
  • Reliable for small jobs
  • Open source
  • Easy to learn
  • Great community online
  • Fast
  • Open sourced
  • Running on our vendor's VM... could be the reason why we are seeing some data limitation issues
  • Future friendly
Open sourced ETL tool to help corporations load data. Very easy to learn but you have to take the time to do it! Lots of reading and exploring by yourself and there are some free jobs and transformations online to help you get started. However, we have heard and seen cases with the jobs just do not load the data entirely; it'll have issues after 50K records.
Read Gordon Yeh's full review
Jordan Squire profile photo
Score 8 out of 10
Vetted Review
Verified User
Review Source
We utilize the Community Edition of Pentaho across our organization for business intelligence. We use Pentaho Data Integration (PDI) to gather data from disparate data sources and perform ETL to populate our data warehouse. We also use PDI to deliver custom reports to various internal and external clients. We use Mondrian as a data engine behind our customer-facing website to process MDX queries from our customer reporting portal. We use Pentaho Report Designer to author pixel-perfect reports for internal and external audiences. We also use the Pentaho User Console for our business analysts to slice-and-dice the data for various departments. In summary, Pentaho is used to allow our internal and external clients to benchmark the performance of our products and services and gain visibility into customer behavior and activity.
  • Pentaho Data Integration (PDI), which is Pentaho's ETL tool, is a powerful visual scripting tool. It is a very mature ETL tool that can process large quantities of data quickly when provided with appropriate hardware.
  • Pentaho Analyzer which is Pentaho's Enterprise browser-based analysis and pivot table tool is powerful and intuitive. When provided with a well defined data warehouse schema it is easy for even non-technical users to quickly generate reports and graphs.
  • Pentaho allows you to connect to virtually any datasource provided there exists a JDBC connector, REST API, or some other API end point.
  • Pentaho has an open-source Community Edition which provides much of the functionality of the Enterprise Edition without any licensing fees.
  • A major problem we have had with Pentaho is their enterprise licensing. As a client who understands very well what is offered within the free Community Edition we felt their mark up on their enterprise features was much too high. They wanted to charge us for migrating from MySQL to Amazon RedShift since RedShift is an "analytical database" while nearly every other database could be connected to for free. Due to licensing concerns we terminated our enterprise license.
  • Pentaho's visualization tools are very capable, but have a very steep learning curve and engineering cost. Due to this we ended switching from Pentaho for the majority of our internal dashboarding and reporting to purchasing Tableau licenses. If Pentaho could improve their visualization and dashboarding capabilities, they could truly be an end-to-end BI solution.
As far as generating and maintaining a data warehouse, Pentaho does an excellent job and I would highly recommend it. For analysis and gaining insight into your data, Pentaho is excellent. For visualization, dashboarding, or executive grade reporting it may not be as beautiful or flashy as your clients require.
Read Jordan Squire's full review
Yinghua Hu profile photo
Score 9 out of 10
Vetted Review
Verified User
Review Source
Pentaho is used as main ETL tool in the data analytics team. It solves the problem of processing and populating financial and ads related data.
  • Populate relational database
  • Transform and clean data
  • Create periodic job and generate report
  • Aggregate data
  • It will be helpful to have modules supporting Google Adwords and Facebook API and Twilio API
  • It has "add constant", but does not have "multiply constant" module.
  • Unit transform module
Pentaho is well suited for ETL processing and database population. It is less appropriate for visualization and analytics. Key question is the benefit it can bring and the cost and robustness.
Read Yinghua Hu's full review
Ivan Miller profile photo
Score 7 out of 10
Vetted Review
Verified User
Review Source
Pentaho's BI stack is being used by our entire company. We use their tooling for ETL, delivering reports, and as a sort of end-point for a custom web app that we've built. The end point uses xaction and Mondrian to deliver data to our front end. We also make use of the community edition of the BI server--this houses several reports that our analyst team has built as well as resources necessary to support our custom app. In the near future we'll also be using the tool's big data plugins to ingest data into our data warehouse from hive/hadoop.
  • ETL, fairly wide support for a number of different data sources, a good API for writing plugins, and great out-of-the-box functionality.
  • Community support and great documentation for using their tooling.
  • Mondrian/OLAP, great engine for processing MDX queries.
  • pentaho's analyzer tool/front-end. This doesn't come close to competing with products like Tableau.
  • Pentaho Report Designer, this looks like something that was built in the early 90's and is extremely clunky to use for new-users
  • Schema Workbench, would be nice to see better support for snowflake type schemas
Pentaho is very well suited for organizations looking for end-to-end BI solutions without wanting to break the bank--particularly because the community edition has most of the functionality necessary to get you started, and it's free. Honestly though, if you have deep pockets there are probably more complete solutions out there in terms of functionality
Read Ivan Miller's full review
Anderson L.A. profile photo
Score 9 out of 10
Vetted Review
Verified User
Review Source
It's been used across the whole organization .

Automated reports and dashboards are some of our core usages .
  • automated reports
  • dashboards
  • data-integration
  • Big Data
  • Poor support
  • few books published in relation to Pentaho
  • Pentaho User Console could be more user friendly
Pentaho is an excellent tool for start ups and where CTOs are willing to invest in staff training

Drawback : Takes a bit longer to master Pentaho in relation to the leading BI solutions.
Read Anderson L.A.'s full review
Sandro Frattura profile photo
Score 8 out of 10
Vetted Review
Verified User
Review Source
We use it for what you would expect. ETL for process and aggregate data, and then building data cubes and reports for viewing that data. We also use the ETL tool to auto-process files (CSV, XML etc) from that we receive from a few of our vendors daily.
  • The Data Integration tool is fantastic. A novice user can get up to speed quickly with it and the GUI is intuitive
  • Support. As an enterprise customer, I am always thrilled with the fast turnaround on support tickets that I open.
  • Job Scheduling. The Job scheduler is easy to use and very reliable
  • Data Analyzer Tool in "User Console". The drag and drop UI makes it easy for me end users to see the data in whatever way suits them. They can even GEO-MAP the data instantly!!
  • Speed on ETL. Without a very complex setup, the ETL Job runs single threaded and can be slow for BIG jobs.
  • The "Report Designer" is decent, but lacks a lot of control that you might get from more mature products (like Business Objects, for instance). Sometimes to want to rotate or reformat a chart label, and it can't be done. In other cases, it can. It is not consistent. Also, 3D charts don't work in certain circumstances. Finally, charts mapped over time, in some cases, don't have the X Axis auto-scale and so all the data points are not readable.
If you are an open-source shop, this is a great choice. Also, if your developer has minimal experience with ETL, Pentaho is a great way to go wince it is easy to use. There is also a ton of help in the forums, in you go with Community Edition (i.e. sans support)
Read Sandro Frattura's full review
Alex Meadows profile photo
Score 9 out of 10
Vetted Review
Verified User
Review Source
Pentaho Data Integration is used to move data into our data warehouse solution, used for various projects.
  • Data Integration. PDI handles pretty much any type of data source and can transform data and conform it to any type of business logic.
  • Object Oriented ETl Coding. Code developed in PDI can be shared across numerous jobs/transformations. This allows for resusable and maintainable code.
  • Flexibility and plugins. If the function that you are looking for is not built into the tool, it's fairly straightforward to either download or develop plugins. There is a huge community of users that build new functionality all the time.
  • Scaleable and cloud ready. PDI is able to cluster and scale out to handle huge data sets.
  • The speed at which some trouble issues get resolved could be improved.
PDI is a great alternative to proprietary tools. If your environment is just starting out using ETL tools, definitely look into using PDI. The number of features plus the ability to write flexible dynamic processes allows for much more maintainable code.
Read Alex Meadows's full review
Stephanie Dozier profile photo
Score 7 out of 10
Vetted Review
Verified User
Review Source
Pentaho is currently being used for several functions within my department. The Data Integration is used as our primary ETL tool. The software does a great job with data cleansing and standardizing from multiple sources. The tool is easy enough to learn using videos out on You Tube or using the Pentaho Kettle Solutions book. We also use the Schema Editor for creating the semantic layer in the User Console reporting tool. Having defined objects available in the User consoles makes it easy for a data analyst to quickly log in and quickly get a report together. We are in the healthcare industry so much of what is being asked is counts of patients, encounters by diagnosis, practice or time period. All these items can quickly be answered very quickly via the user console.
  • A data analyst has quick access to data that would otherwise need to be pulled straight for the database. Additionally, the data dictionary is tied to an object so up keep is easy and users can scroll over the object to exactly what that measure or object represents.
  • Report presentation is clean and professional. Pentaho comes with a pretty extensive set of charting and graphing functionality out of the box that in other tools would have to be developed. Easy file output with sufficient types available to choose from. (.pdf, .xlsx, .csv)
  • The Data Integration tool is without question the best tool in the set. There is no limit to your transformation options. You can transform virtually any text file, excel file or script into a table, spreadsheet or even report. Many times we are given list of patients and file formats vary from practice to practice. With Pentaho DI we have the flexibility to take what has been provided and standardize.
  • Pentaho could use some improvement of the integration of ldap into the User Console. We have unsuccessfully been able to deploy the Pentaho User Console to partners because it does not support a federated ldap model. Even attempts at integrating to our primary ldap has proven fruitless.
  • The Pentaho Suite as a whole tends to be a resource hog. Since we work with such large data sets we often find that large requests hang up or get an "oops error" which is a standard Pentaho error when the data set is too large. It would be nice to have some indication of our limits so that we are not guessing or we can limit the data set up front.
  • Upgrades on the Suite are a bit cumbersome for our Administrators. There is not a standard backup and install package so upgrades often involve our administrator on the phone with Pentaho during the upgrades to walk through a custom install.
Pentaho is well suited for an internal environment who needs access to data quickly and offer the data up in a very professional business document. It is not suited for a reporting portal that customers can access via the internet or web. The limited security prevents us from being able to offer our Pentaho UI as a service to our partners and customers.
Read Stephanie Dozier's full review
Kenneth Panacy profile photo
Score 10 out of 10
Vetted Review
Verified User
Review Source
Pentaho serves several roles in our environment.

We use it for report design and delivery. The designer tool is fairly straightforward and once reports are complete they are immediately accessible to our end users.

Pentaho is used for Business Intelligence which includes totals, trends, and forecasts of sales and subscriptions of our services. Pentaho's web-based analysis tool allows our end-users to drag & drop measures and dimensions and instantly convert raw data into a variety of graphical representations. Before we had Pentaho, each view might have required a long query or static report that was hand-exported to Excel. With Pentaho's analysis tool, our sales, finance, and marketing teams have instant insight and infinitely flexible views of our data.

In the IT department, we use Pentaho as a scheduling engine for data maintenance jobs as its interface allows us to manage multiple shards from a single source.
  • Pentaho works as advertised -- it provides a truly drag-and-drop customizable interface for your data.
  • The Enterprise Suite has everything you need to start working and showing results right away.
  • Pentaho's customer support has been responsive and timely when we've had issues.
  • The Report Designer interface can seem clunky at times when trying to perform additional calculations on previously calculated fields or pass values to subreports.
  • There is inconsistency within the tools in Data Integration -- it is clear the various tools were authored by different engineers at different times. It is not always obvious what goes where.
  • There is inconsistency between all the products within the Pentaho Suite. Each has a different look and feel and none work the same way.
  • The product is not without flaws, but so far none have been insurmountable.
  • Upgrading to the latest version has proven to be EXTREMELY challenging for both us AND Pentaho customer services.
Do you need to integrate data from multiple sources? Pentaho is great for this. Previously, we were SSHing files around and importing with numerous cron scripts.

Are your Finance and Marketing teams constantly asking for one-off queries? Pentaho puts this power in THEIR hands so they can pull and arrange the data they want into whatever view they want.

Do you want to pay too much for a full-featured BI solution? Pentaho is very reasonably priced, and even has a free version!
Read Kenneth Panacy's full review
Deepak Paramanand profile photo
Score 7 out of 10
Vetted Review
Verified User
Review Source
Our product intended to analyse Media and Communications data and provide decision making capabilities to our end users. Whereas we built the workflow management, transactional capabilities, Pentaho provided the BI layer for our product. Analysts were our end users. Everything from data ingestion to decision making was the scope of our product. We read data across disparate data sources/formats and using our business expertise provided decision making capability.
  • Data Integration. Pentaho wins hands down. You can read huge data using a Hadoop process, do your encirchment, load it to a Netezza database afterwards, finally input the data to your WEKA model to predict which Customer will churn or what offer should be made to the customer so that he/she stays put.
  • BI Server. If you want to schedule a data read operation of your clickstream data, to finally burst out recommendations of next best actions to end users, Pentaho's BI Server performs this integration seamlessly.
  • Mondrian. This layer provides Cube based Hierarchical data modeling on the front end and at the back end converts this OLAP structure to a ROLAP, SQL based model. Hence any relational database becomes a ROLAP engine seamlessly.
  • Data Visualization. Provide richer library of data visualization capabilities. Ultimately the dashboards are the end result of all the hard work done at the back office. Yes, Pentaho has a lower TCO compared to other Products but richer data visualization capabilities would make it a winner! Pentaho has overcome this limitation by allowing external charting engines to be integrated with their product suite, but more needs to be done to strengthen core Pentaho Data Visualizaiton capabilities. Alternatively the external charting engine capabilities need to be documented and evangalized.
  • Alternative to Mondrian. In our case we needed to analyse data for a million subscribers over key performance areas like Churns, Activations etc. In these scenarios semi additive measures needed to be calculated and presented in a report across days to years grains of time. In such scenarios the Mondrian based ROLAP capability did not scale up to our expectation. Pentaho needs to address such issues and fast.
  • In the world of R/SAS/SPSS its hard to find use cases where WEKA was used in Production environments to solve a business problem. We would have needed some hand holding to replace our R/SPSS code to WEKA and help us build newer alogorithms on this platform ground up.
Integration is best with Pentaho. If you want to overlay your product capability with Pentaho's BI Server, Data Integration, Reporting Engine, Workflow capabilities then Pentaho is the only answer. Other tools provide each of these very well, but if you have already built your product and dont want to buy Informatica, Cognos/Microstratey and SPSS/SAS then Pentaho is the way to go.
Pentaho is Java based, so no shortage of skilled Java resources who can help you integrate. We did not spend time building our own reporting engine, Data Visualization layer, Security layer, workflow capabilities. We simply used our proven transactional product and used Pentaho to fill our gaps.
Read Deepak Paramanand's full review
Adam O'Connor profile photo
Score 8 out of 10
Vetted Review
Verified User
Review Source
We integrate Pentaho with Odoo for complex, custom reports that allow us to query data from PostgreSQL database across multiple data models in Odoo. I have used it to connect to Odoo via JDBC as well as directly to the web client for an object-based report. I have found the report designer client to be a bit difficult to use and buggy on my WIndows 7 machine but I have seen it working well on a Windows 8 machine. We use version 3.9.1.
  • Connect via JDBC
  • Integrates with Odoo (formerly OpenERP)
  • Grouping, summing seem to work well even though they are not intuitive as far as set up goes.
  • The multiple report output types are great (PDF, CSV, Excel, Word, Text, etc)
  • Connect via the web client of Odoo
  • Easy to build in parameters for filtering in the report
  • Report designer seems to be buggy on Windows 7
  • The interface for the report designer is out dated and difficult to navigate or use.
  • I would like to see a tool tip when hovering over options
Pentaho is great for companies that don't have huge budgets but want a powerful reporting tool that they can get and use for free. It is great for developers that know SQL and can learn to use a non-intuitive interface and are motivated to work through a very outdated interface.
Read Adam O'Connor's full review
Itamar Steinberg profile photo
Score 9 out of 10
Vetted Review
Verified User
Review Source
I used Pentaho Kettle as a team manager of development and later as a CIO. After that I opened a company that consults and implement business intelligence solutions. We are using Pentaho mostly with the data integration module. I think that of all the modules of Pentaho, Kettle is the most complete. It can give a "fair fight" to source solutions that are not open. The problem it addresses of course is to extract data from various sources; transform them; ”play with data”; and then load it to the target. I find the transformation most valuable and rich with functionality. I even made a full scale course about it, you can find it on udemy.
  • Pentaho Kettle gives you a great graphic user interface to plan your transformation and jobs.
  • Pentaho Kettle makes it easy to handle errors, logging and performance.
  • Pentaho Kettle has dozen of great steps like: lookup and SCD functionality.
  • Several steps have performance issues like the Json input.
  • The community edition does not include scheduler and job manager so you need to figure it out yourself, unless of course you buy the Enterprise edition.
  • I think that web service should be easier to operate.
Pros:
I find it suited for 90% of data integration projects , its a very good tool, easy to use, stable and affordable.
Cons:
I think that the big data connections are still not perfect, so if you have a NoSQL DBl / Hadoop / Cassandra, you might consider extracting the data to file from the source using MapReduce. Also, if you need bulk load, sometimes it's better to use it directly on a tool, for example Redshift / InfiniDB (that is no longer with us).
Apart than that I think it will suit you well.
Read Itamar Steinberg's full review
Julia Gusman profile photo
Score 10 out of 10
Vetted Review
Verified User
Review Source
We are a consulting company and implement BI and data warehousing solutions for our clients. I have lead and implemented a number of BI and data warehousing solutions in my current as well as previous role.
  • The data integration tool is very strong. It is easy to use and very accessible to business users
  • The data integration component provides ability to connect to almost any data source as input as well as output - quick and easy
  • The data integration component allows quick preview of the data in a pivot view type format which enables data analysis at an early stage of development
  • The BI server front end does not seem to be very appealing to the end users. In my experience the end users wish for a prettier display and easier navigation
  • Schema workbench is not easy to develop with, I know there were some plans of integrating the tool with the data integration component
  • Data integration server does not seem to be as useful. For example, while it provides some source control, the files are only controlled separately one by one. If the developer changes several files at the same time, they cannot commit them all in one go. Often the changes across different files need to be kept in sync.
I would definitely recommend the ETL tool but not the server. I would consider the amount of data to be processed, the budget the client is looking to spend, available knowledge and expertise. it seems that there are not that many Pentaho skills on the market and companies are reluctant to implement the product as they are concerned about ongoing maintenance. I would also ask about requirements to integrate with other products or run on the cloud. Being open standards Pentaho integrates nicely with in-house systems.
Read Julia Gusman's full review
Nathan Smith profile photo
Score 9 out of 10
Vetted Review
Verified User
Review Source
Pentaho Data Integration was used for a variety of data integration projects, including populating a dimensional data warehouse. Data sources included relational data bases, flat files, and LDAP directories. Pentaho Reporting served reports from a range of data sources to multiple departments with security integrated with Active Directory. The Mondrian OLAP engine delivered pivot tables for slice and dice analysis.
  • The Pentaho Data Integration tool is extremely versatile. I find it easier to use than comparable tools like SSIS.
  • The reporting engine delivers reports in multiple formats, including Excel and PDF.
  • It is easy for users to subscribe to reports for delivery by email.
  • The report parameterization is very flexible, and I find it easier to use and more versatile than parameterization in Crystal.
  • If using the community edition, be prepared to invest some effort in learning the product. Documentation can be vague.
  • The new pivot table viewer is part of the enterprise edition. Fortunately there is a nice open source plug in called Saiku that can replace the old, difficult to use jPivot pivot table interface.
  • I wish there were easier ways to audit user activity, especially to see which items have not been accessed for a long time and could be retired.
  • There are not as many people who know how to use Pentaho in the market as there are people proficient in other BI tools. There is a learning curve and design patterns are not necessarily the same from one BI tool to the next.
Pentaho provides a rich end to end BI solution. It is highly configurable. The data integration module is an especially useful and powerful. You may need a higher level of IT skills and support to work with Pentaho than you do with other BI tools. It is helpful to have SQL querying skills when creating Penthao reports. Make sure you have resources who are familiar with Pentaho or are able to get training.
Read Nathan Smith's full review
No photo available
Score 5 out of 10
Vetted Review
Verified User
Review Source
We made a decision to use Pentaho to OEM into the analytics feature of our software and the developers had a much harder time getting it integrated than we were expecting. The Mondrian code was hard to learn and the number of qualified applicants who already knew how to use it was limited.

  • Being able to use the OLAP Analyzer tool saved our organization a lot of time in not needing to develop an OLAP tool on our own.
  • The ability to customize the tool and integrate all the internationalization we required was pretty straightforward.
  • The Mondrian code was hard to learn.
  • The availability of resources that had experience with Pentaho and Mondrian was limited for us.
  • The dashboards were slow and we found that it ended up being easier to remove the Pentaho pieces of the dashboard.
Overall as an OEM vendor of analytics, I would say it was about average. They did help us get to market sooner than if we developed everything in-house, but I felt the dashboards were pretty ugly and I wouldn't say I was super impressed. I found the Analyzer OLAP tool to be very useful and that part would have been harder to develop in-house, but I would skip the dashboards next time and go straight to developing those in-house, which is what we ended up with in the end.
Read this authenticated review
No photo available
Score 6 out of 10
Vetted Review
Verified User
Review Source
In my organization, we use Pentaho as our go-to ETL tool. It is used by our engineering department in order to create our data warehouse environment, which is used by several customers. It allows us to create our data warehouse, which ultimately enables our end users to utilize analytics for their environments.
  • Great visualizations. Much like SSIS, the component-based model of ETL is easy to see at-a-glance in order to understand what is going on.
  • Easy roll-ups. It is very easy to encapsulate logic into separate jobs (KJBs) and transforms (KTRs), in order to group code into bite-size chunks.
  • Wide selection of tools. There are several dozen ETL steps to choose from, in a variety of different categories. Everything from file management to XML and scripting.
  • Lack of support. Unlike many Microsoft products, there is not a great wealth of support out there for Pentaho on the web. It is much harder to track down solutions.
  • Non-intuitive. There is a learning curve, and there are subtle nuances to Pentaho that even experienced ETL developers will take a while to get used to.
  • Less reputable. Despite all of Pentaho's benefits, it just hasn't gotten to that critical mass in the marketplace. Thus it does not garner as much respect as Microsoft or Oracle, which in turn makes it harder to find online support. It's a vicious cycle.
It is well suited for developers who may not be adept at strict coding, but have experience with GUI development. It is also better for rather simpler ETL, as the GUI interface has natural restrictions that make complex ETL problems challenging to solve. It may also be well suited for companies on a budget, as the free version still has enough features to be reasonably useful.
Read this authenticated review
No photo available
November 04, 2016

Pentaho - Yay or Nay?

Score 6 out of 10
Vetted Review
Verified User
Review Source
I worked in a small-size private consulting firm that used Pentaho for analytics and reporting firm-wide. It catered to clients in a broad range of industries ranging from hospitality to the public sector. The software is easy to learn and has an impressive amount of functionality incorporated. The ETL (Kettle) module is great for generating visualizations and I highly recommend this software to a beginner interested in BI.
  • User friendly interface
  • Easy and quick report generation
  • Great report customizing features
  • Heavy software to install
  • Requires licensing to enable all features
  • Difficult to integrate with other servers
Pentaho is well suited to small - mid size firms that have specific reporting needs. It is less likely to benefit firms that already have data analytics capabilities built into multiple platforms as Pentaho is not easy to integrate with other analytics platforms. There isn't enough documentation available online, however, customer support is easy to reach.
Read this authenticated review
No photo available
Score 7 out of 10
Vetted Review
Verified User
Review Source
We used the Pentaho Data Integration tool to pre-process some data and fill thr production database. It was used in the department where I worked. Being a software developer I made a decision to use PDI instead of writing scripts on Python. The main advantage of PDI was supporting scaling out of the box, and the ability to run on multiple machines. When you need something less than a Hadoop cluster but bigger than self-made quick scripts it fits well.
  • The main strength of Pentaho (PDI) is that it is free. No need for additional license approvals to implement something quickly using the UI tool.
  • It has Pentaho Reporting - the ability to use reporting on top of PDI transformations is a powerful feature. PDI can handle data extraction, and with reporting you can create a simple report on this data. They all just work with each other without additional integration required.
  • And most of Pentaho projects are open-source. That helps a lot when debugging something.
  • Help documentation is an area for improvements. A lot of features and side effects just not documented.
  • PDI uses swt. It has a lot of troubles with this dependency. It is run on Java, but with swt - and you are stuck with a platform that supports swt. ARM architecture does not have swt support out of the box.
  • Online jobs/transformations creation. Jobs/transformations are just xml. So the lack of online editor is just 'it was not implemented yet'.
  • Strange /Pentaho-solutions placement. Application is not deployed on a server in a standard way. Instead - a server should be embedded into a 'Pentaho solution' - very strange architecture.
There are different Pentaho features, and each one has its own specific scenario where it is best suited. Reporting - is for generating boring Excel like reports, with calculations etc. For data visualization reporting is not a [good] tool. PDI is used for creation data transformations, from almost any source of data. But it is very heavy weighted. It has Hadoop and other support. But hand made code runs faster than transformation made in Spoon. And there are some algorithms that can be much easily written manually.
Read this authenticated review
No photo available
Score 8 out of 10
Vetted Review
Verified User
Review Source
At our organization we are building a data ware house for data migration and analytics. After researching several options we went for the Pentaho community edition which is an open source tool. Pentaho played big role in the implementation of ETL jobs and QA testing. The features available in Pentaho covered all of our scenarios, plus it allowed us to plug in external jars to use for any unsupported activity.
  • Pentaho allows for migration from one system to another system without much trouble.
  • Minimum code or almost no code expertise is required. Just get your SQL and Pentaho will do rest.
  • It is difficult to verify millions of rows. But with Pentaho you can do that, this makes the QA task easier.
  • Pentaho does make comparison of data between two systems fairly easier compared to other systems, but it lags in terms of output you get. The output has a merge difference option which will show a difference transaction only but if you have large data set columns it is not easy to find issues.
  • Date formatting and number formatting is again an issue, both the systems should have the same datatype. It was difficult for us to compare as a new system had updated data types and pentaho failed to compare these.

Pentaho is well suited for:


  • Migration of large data.
  • Reading XML file and processing data.
  • QA Testing.
  • Data modeling.
  • Data processing.
It is less appropriate for:

  • DataType validation
  • Reporting
Read this authenticated review

Feature Scorecard Summary

Pixel Perfect reports (14)
6.2
Customizable dashboards (13)
8.1
Report Formatting Templates (14)
5.4
Drill-down analysis (14)
7.4
Formatting capabilities (14)
6.7
Integration with R or other statistical packages (8)
6.0
Report sharing and collaboration (13)
6.5
Publish to Web (13)
6.7
Publish to PDF (15)
8.2
Report Versioning (9)
6.3
Report Delivery Scheduling (13)
5.4
Delivery to Remote Servers (5)
5.0
Pre-built visualization formats (heatmaps, scatter plots etc.) (11)
7.7
Location Analytics / Geographic Visualization (11)
7.4
Predictive Analytics (10)
6.9
Multi-User Support (named login) (15)
7.4
Role-Based Security Model (14)
7.3
Multiple Access Permission Levels (Create, Read, Delete) (14)
6.7
Single Sign-On (SSO) (5)
6.4
Responsive Design for Web Access (7)
7.7
Dedicated iOS Application (3)
6.3
Dedicated Android Application (3)
6.3
Dashboard / Report / Visualization Interactivity on Mobile (8)
7.3
REST API (5)
6.7
Javascript API (4)
6.3
iFrames (5)
5.9
Java API (3)
6.0
Themeable User Interface (UI) (5)
5.1
Customizable Platform (Open Source) (5)
8.4

About Pentaho

Pentaho is a suite of open source business intelligence and analytics products, now offered and supported by Hitachi Data Systems since the June 2015 acquisition.

Pentaho Integrations

Pentaho Technical Details

Operating Systems: Unspecified
Mobile Application:No