A powerful ETL solution which focuses on enterprise scalability, flexibility, and code re-usability
Overall Satisfaction with PowerCenter
Informatica PowerCenter is the enterprise ETL tool used for all data integration across the organization. PowerCenter is utilized for all on premises data integration including the Electronic Medical Record (EMR) system as well as disparate systems including scheduling and time management systems.
Pros
- Enforces enterprise wide ETL development standards.
- Provides code re-usability with shared connections and objects.
- Particularly adept at integrating a wide range of disparate data sources (handles flat files particularly well).
- Well suited for moving large amounts of data.
Cons
- There are too many ways to perform the same or similar functions which in turn makes it challenging to trace what a workflow is doing and at which point (ex. sessions can be designed as static or re-usable and the override can occur at the session or workflow, or both which can be counter productive and confusing when troubleshooting).
- The power in structured design is a double edged sword. Simple tasks for a POC can become cumbersome. Ex. if you want to move some data to test a process, you first have to create your sources by importing them which means an ODBC connection or similar will need to be configured, you in turn have to develop your targets and all of the essential building blocks before being able to begin actual development. While I am on sources and targets, I think of a table definition as just that and find it counter intuitive to have to design a table as both a source and target and manage them as different objects. It would be more intuitive to have a table definition and its source/target properties defined by where you drag and drop it in the mapping.
- There are no checkpoints or data viewer type functions without designing an entire mapping and workflow. If you would like to simply run a job up to a point and check the throughput, an entire mapping needs to be completed and you would workaround this by creating a flat file target.
- PowerCenter has been instrumental in being the center of all data movement within the organization.
- It has also provided a foundation for which re-usability and scalability are the focus.
- Finding talent with experience and expertise in PowerCenter is far more likely due to its presence and market share.
SSIS is a good entry into ETL, for smaller organizations or Microsoft-centric companies. It's strengths lie in its ease-of-use, quick turnaround, and simplicity. Its weaknesses lie in scalability and re-usability (you can achieve re-usability, however segmentation is at the project level with no global scale). Pentaho is a great, low cost solution with deceptive power, however scheduling along with flexibility in other functions such as OS level script execution is a bit weaker. PowerCenter and DataStage are more comparable at an enterprise ETL level, having different management philosophies and approaches. For both, there are features in each that I wish the other either adopted or replaced, however due to its place in the market, ease of finding additional developers, and scalability, PowerCenter was chosen.
Comments
Please log in to join the conversation