Data Integration Tools

Best Data Integration Tools include:

Matillion, Astera Centerprise, Oracle GoldenGate, and Task Factory.

Data Integration Tools TrustMap

TrustMaps are two-dimensional charts that compare products based on trScore and research frequency by prospective buyers. Products must have 10 or more ratings to appear on this TrustMap.

Data Integration Tools Overview

What are Data Integration Tools?

The need for data integration emerges from complex data center environments where multiple different systems are creating large volumes of data. This data must be understood in aggregate, rather than in isolation. Data integration is nothing more than a technique and technology for providing a unified and consistent view of enterprise-wide data.

Data Integration Tools Features & Capabilities

  • Ability to process data from a wide variety of sources such as mainframes, enterprise applications, spreadsheets, proprietary databases, etc.

  • Ability to process unstructured data from social media, email, web pages, etc.

  • Syntactic and semantic checks to make sure data conforms to business rules and policies

  • Deduplication and removal of incorrectly or improperly formatted data

  • Support for metadata

Types of Data Integration

There are several different approaches to achieving this goal which are quite different to each other and essentially solve slightly different problems: The main technologies for data integration are Extract, Transform Load (ETL), Enterprise Application Integration (EAI), and Enterprise Information Integration (EII), or data virtualization as it is more often called today.


Products listed in this category belong to the ETL data integration approach. Unlike the other listed approaches, ETL is designed for data migration and integration of large volumes of data to provide a basis for decision-making.

What is ETL?

ETL is a process whereby large volumes of required data are extracted from various databases and converted into a common format. The data is then cleaned, and loaded into the specialized reporting database called a data warehouse. It is then available for standard reporting purposes.


The data used in ETL can come from any source including flat files, Excel data, application data like CRM or ERP data, or mainframe application data. Perhaps the most difficult part of the process is the “Transform” component. Here, not only must the data be cleansed and any duplicates removed, but the software also has to resolve data consistency issues. It applies rules to consistently convert data to the appropriate form for the data warehouse or repository.


Once the data has been loaded into a data warehouse it is available for querying by business intelligence front-end processes that can pull consolidated data into reports and dashboards.

Shortcomings of Data Warehouses

One shortcoming of the data warehouse approach is that the data is not always current. Data warehouses pull data from databases periodically in batches, not in real time. If the data in the source database has changed, this might not be reflected in the data in the warehouse. Various strategies can be employed to achieve “real-time ETL”, although some of them place a significant load on the database. This can have performance repercussions.


The simplest thing to do is simply increase the frequency of batch updates to near real-time processing. But there are other solutions including continuously feeding the database using real-time data transport technologies, the use of staging tables, or a real-time data cache.

Pricing Information

Enterprise-level data integration tools can be very expensive with some products costing upwards of $10,000 per user per year. On top of that, you may need to pay for professional services to get up and running. SMB solutions are significantly cheaper than this.

Data Integration Products

(1-25 of 98) Sorted by Most Reviews

Informatica MDM
22 ratings
3 reviews
Informatica MDM is an enterprise master data management solution that competes directly with IBM's InfoSphere and Oracle's Siebel UCM product. The product has about 200 licensed users. Informatica MDM is a multidomain solution with flexibility to support any master data domain and relationship—wheth…
Actian DataConnect
5 ratings
2 reviews
Originally developed by Pervasive Software, the Actian DataConnect (formerly Actian Integration Hub) is data integration technology.
elastic.io Integration Platform
2 ratings
2 reviews
The elastic.io Integration Platform is a hybrid integration system that users can deploy on-premise or in the cloud. It is highly scalable, with over 100 prebuilt application connectors for out-of-the-box use and a flexible data structure.
Denodo
8 ratings
2 reviews
Denodo is the eponymous data integration platform from the global company headquartered in Silicon Valley.
Oracle Data Service Integrator
17 ratings
2 reviews
Oracle Data Service Integrator provides companies the ability to develop and manage federated data services for accessing single views of disparate information. Oracle Data Service Integrator is standards based, declarative, and enables re-usability of data services. For more information visit https://www.oracle.com/middleware/technologies/data-service-integrator.html
SAS Data Integration Studio
11 ratings
2 reviews
SAS Data Integration Studio is as the name would suggest a data integration solution, from SAS.
SAS Enterprise Data Integration Server
16 ratings
2 reviews
The SAS Enterprise Data Integration Server is a suite of products; this legacy edition is now replaced by SAS Data Management Standard.
Analyza
2 ratings
1 reviews
Analyza is a business intelligence solution that is provided by PIT Business. The vendor provides support services in Luxembourg, Belgium, France, and other parts of Europe. Analyza allows users to: Build several indicators and dashboardsBuild ad-hoc analysis without IT knowledgeNavigate through …
RepreZen API Studio
1 ratings
1 reviews
RepreZen™ API Studio is an enterprise-class API design platform, built from the ground up to meet the demands of large-scale integration programs. The vendor says that while other tools only address individual APIs, RepreZen optimizes at the organizational scale, aligning interfaces and streamlining…
Infor Cloverleaf Integration Suite
5 ratings
1 reviews
Infor Cloverleaf Integration Suite is a data integration platform and suite of applications from ERP vendor Infor, optimized for healthcare organizations and their systems.
CloverDX
1 ratings
1 reviews
CloverDX is a rapid, end-to-end data integration solution. The vendor states that businesses choose CloverDX for its usability and intuitive controls, along with its lightweight footprint, flexibility, and processing speed. Achieving true, rapid data integration means much more than just raw data pr…
Actian Pervasive Data Integrator
4 ratings
1 reviews
Actian Pervasive Data Integrator is a data integration solution supported by Actian following the merging of that company with Pervasive.
Azure Data Factory
5 ratings
1 reviews
Microsoft's Azure Data Factory is a service built for all data integration needs and skill levels. It is designed to allow the user to easily construct ETL and ELT processes code-free within the intuitive visual environment, or write one's own code. Visually integrate data sources using more than 80…
SAS DataFlux
9 ratings
1 reviews
SAS DataFlux's capabilities handle data profiling, matching, cleansing and monitoring. Capabilities are available as individual products or as a platform. DataFlux competes with Informatica, Trilliium, Ataccama, and SAP Data Quality Management.
SAS Data Management Platform
8 ratings
1 reviews
SAS Data Management Platform is a data integration solution, from SAS.
BusinessObjects Data Integrator
8 ratings
1 reviews
BusinessObjects Data Integrator from SAS is a data integration platform.
Fivetran
7 ratings
1 reviews
Fivetran replicates all your applications, databases, events and files into a high-performance data warehouse, after a five minute setup. The vendor says their standardized cloud pipelines are fully managed and zero-maintenance. The vendor says Fivetran began with a realization: For modern companie…
iWay Enterprise Information Management Suite
The iWay Enterprise Information Management Suite (EIM Suite) is a data integration solution.
Omni-Gen Integration Edition
The iWay Integration Suite from Information Builders presents a data integration solution.
iWay Service Manager
iWay Service Manager (iSM) is an integration server that aims to ensure rapid access to timely, accurate data across all systems, processes and stakeholders – with interoperability between disparate systems and data. According to the vendor, with iSM, all aspects of your existing infrastructure – …
Omni-Gen Master Data Management Edition
The iWay Parallel Service Manager is a data integration solution from Information Builders.
Datasphere
Primary Data offers a data integration, virtualization platform.
ConnectALL
According to the vendor, ConnectALL®, an Orasi company, powers businesses in achieving higher agility and increased velocity. Teams from software development and delivery, IT and business units across large and small enterprises worldwide use ConnectALL’s integration platform to unify people, proces…
Syncsort Connect ETL (formerly DMX)
Syncsort Connect ETL (formerly DMX) is a data integration platform, from Syncsort.