Data Integration Tools

Best Data Integration Tools include:

Matillion, Astera Centerprise, Oracle GoldenGate, and Task Factory.

Data Integration Tools TrustMap

TrustMaps are two-dimensional charts that compare products based on trScore and research frequency by prospective buyers. Products must have 10 or more ratings to appear on this TrustMap.

Data Integration Tools Overview

What are Data Integration Tools?

The need for data integration emerges from complex data center environments where multiple different systems are creating large volumes of data. This data must be understood in aggregate, rather than in isolation. Data integration is nothing more than a technique and technology for providing a unified and consistent view of enterprise-wide data.

Data Integration Tools Features & Capabilities

  • Ability to process data from a wide variety of sources such as mainframes, enterprise applications, spreadsheets, proprietary databases, etc.

  • Ability to process unstructured data from social media, email, web pages, etc.

  • Syntactic and semantic checks to make sure data conforms to business rules and policies

  • Deduplication and removal of incorrectly or improperly formatted data

  • Support for metadata
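
As a rough illustration of the syntactic checks, semantic checks, and deduplication listed above, here is a minimal Python sketch. The field names (customer_id, order_date, amount), the two rules, and the helper names check_record and clean are assumptions made for this example; they are not taken from any product in this category.

```python
from datetime import datetime


def check_record(record):
    """Return a list of rule violations for one record (empty list means clean)."""
    problems = []
    # Syntactic check: order_date must parse as a year-month-day date.
    try:
        datetime.strptime(record.get("order_date"), "%Y-%m-%d")
    except (TypeError, ValueError):
        problems.append("order_date is not a valid YYYY-MM-DD date")
    # Semantic check: hypothetical business rule that amounts are never negative.
    amount = record.get("amount")
    if not isinstance(amount, (int, float)) or amount < 0:
        problems.append("amount must be a non-negative number")
    return problems


def clean(records):
    """Drop records that violate a rule, then drop exact repeats of kept records."""
    seen = set()
    kept = []
    for record in records:
        key = (record.get("customer_id"), record.get("order_date"), record.get("amount"))
        if check_record(record) or key in seen:
            continue
        seen.add(key)
        kept.append(record)
    return kept
```

Real tools typically let users declare such rules through configuration or a visual designer rather than hand-written code; the sketch only shows the kind of logic involved.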

Types of Data Integration

There are several approaches to achieving this goal. They differ considerably from one another and solve slightly different problems. The main technologies for data integration are Extract, Transform, Load (ETL), Enterprise Application Integration (EAI), and Enterprise Information Integration (EII), which today is more often called data virtualization.


Products listed in this category belong to the ETL data integration approach. Unlike the other listed approaches, ETL is designed for data migration and integration of large volumes of data to provide a basis for decision-making.

What is ETL?

ETL is a process whereby large volumes of required data are extracted from various databases and converted into a common format. The data is then cleaned and loaded into a specialized reporting database called a data warehouse, where it is available for standard reporting purposes.


The data used in ETL can come from any source including flat files, Excel data, application data like CRM or ERP data, or mainframe application data. Perhaps the most difficult part of the process is the “Transform” component. Here, not only must the data be cleansed and any duplicates removed, but the software also has to resolve data consistency issues. It applies rules to consistently convert data to the appropriate form for the data warehouse or repository.


Once the data has been loaded into a data warehouse it is available for querying by business intelligence front-end processes that can pull consolidated data into reports and dashboards.
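
The extract, transform, load, and query steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the CSV sample, the sales table, and the function names are hypothetical, and an in-memory SQLite database stands in for a real data warehouse.

```python
import csv
import io
import sqlite3

# Hypothetical CRM export with inconsistent casing, stray spaces, and a duplicate.
CRM_CSV = "customer,country,revenue\nAcme, us ,100\nacme,US,100\nBolt,de,250\n"


def extract(text):
    """Extract: read raw rows from a flat-file source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: apply consistent formatting rules and remove duplicates."""
    cleaned, seen = [], set()
    for row in rows:
        record = (
            row["customer"].strip().title(),
            row["country"].strip().upper(),
            float(row["revenue"]),
        )
        if record not in seen:
            seen.add(record)
            cleaned.append(record)
    return cleaned


def load(records, conn):
    """Load: write the cleaned records into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (customer TEXT, country TEXT, revenue REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    conn.commit()


def revenue_by_country(conn):
    """Query: the kind of consolidated figure a report or dashboard would pull."""
    return conn.execute(
        "SELECT country, SUM(revenue) FROM sales GROUP BY country ORDER BY country"
    ).fetchall()


if __name__ == "__main__":
    warehouse = sqlite3.connect(":memory:")  # stand-in for a data warehouse
    load(transform(extract(CRM_CSV)), warehouse)
    print(revenue_by_country(warehouse))
```

The two "Acme" rows collapse into one only because the transform step normalizes casing and whitespace before comparing them, which is the consistency problem the "Transform" component has to solve.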

Shortcomings of Data Warehouses

One shortcoming of the data warehouse approach is that the data is not always current. Data warehouses pull data from databases periodically in batches, not in real time. If the data in the source database has changed, this might not be reflected in the data in the warehouse. Various strategies can be employed to achieve “real-time ETL”, although some of them place a significant load on the database. This can have performance repercussions.


The simplest approach is to increase the frequency of batch updates until processing is near real time. Other solutions include continuously feeding the database using real-time data transport technologies, using staging tables, or using a real-time data cache.

Pricing Information

Enterprise-level data integration tools can be very expensive, with some products costing upwards of $10,000 per user per year. On top of that, you may need to pay for professional services to get up and running. SMB solutions are significantly cheaper.

Data Integration Products

(1-25 of 98) Sorted by Most Reviews

TIBCO Cloud Integration (including BusinessWorks and Scribe)
444 ratings
215 reviews
Top Rated
TIBCO Cloud™ Integration is an enterprise iPaaS platform. It offers a drag-and-drop and API-led design approach for user-friendliness.
Matillion
105 ratings
62 reviews
Top Rated
Matillion is data transformation for cloud data warehouses. According to the vendor, only Matillion is purpose-built for Amazon Redshift, Snowflake, and Google BigQuery enabling businesses to achieve new levels of simplicity, speed, scale, and savings. Quickly develop custom Transformation jobs by …
Task Factory
55 ratings
37 reviews
Top Rated
According to the vendor, Task Factory offers high-performance components and tasks for SQL Server Integration Services (SSIS) that save you time and money by accelerating ETL processes and eliminating many tedious SSIS programming tasks. With more than 70 components, Task Factory helps you connect t…
SQL Server Integration Services
202 ratings
35 reviews
Microsoft's SQL Server Integration Services (SSIS) is a data integration solution.
Oracle GoldenGate
198 ratings
35 reviews
Top Rated
Oracle GoldenGate is database management software for data integration, and availability support for heterogeneous databases.
Dell Boomi
75 ratings
30 reviews
Dell Boomi is a cloud-based, on-premise, or hybrid integration platform. It offers a low-code/no-code interface with the capacity for API and EDI connections for integrating with external organizations and systems, as well as compliance with data protection regulations.
Astera Centerprise
33 ratings
29 reviews
Top Rated
Centerprise Data Integrator is an integration platform that includes tools for data integration, data transformation, data quality, and data profiling.
Oracle Data Integrator
102 ratings
23 reviews
Oracle Data Integrator is an ELT data integrator designed for interoperability with other Oracle programs. The program focuses on a high-performance capacity to support Big Data use within Oracle.
Informatica PowerCenter
77 ratings
18 reviews
Informatica PowerCenter is a metadata-driven data integration technology designed to form the foundation for data integration initiatives, including analytics and data warehousing, application migration or consolidation, and data governance.
Dataloader.io
35 ratings
12 reviews
Dataloader.io delivers a cloud-based solution to import and export information from Salesforce.
Talend Open Studio
44 ratings
11 reviews
Talend Open Studio is a data integration solution.
TIBCO Data Virtualization
27 ratings
8 reviews
TIBCO Data Virtualization, formerly Cisco Data Virtualization and before that Composite (acquired by Cisco in July 2013), is, as the name might suggest, a data or datacenter virtualization platform.
Talend Data Integration
49 ratings
7 reviews
The Talend Integration Suite, from Talend, is a set of tools for data integration.
IBM InfoSphere Information Server
29 ratings
6 reviews
IBM InfoSphere is an enterprise-grade master data management solution used by over 700 customers. It competes with Oracle's Siebel UCM product and Informatica.
Oracle Warehouse Builder
20 ratings
5 reviews
Oracle Warehouse Builder is a data integration solution, from Oracle.
SAS/Access
14 ratings
5 reviews
SAS/Access is a data integration solution, from SAS.
Skyvia
6 ratings
4 reviews
Skyvia is a cloud platform for no-coding data integration (both ELT and ETL), automating workflows, cloud to cloud backup, data management with SQL, CSV import/export, creating OData services, etc. The vendor says it supports all major cloud apps and databases, and requires no software except a web …
NetWeaver Process Integration
9 ratings
4 reviews
SAP NetWeaver Process Integration is an application integration solution.
SAP Replication Server
8 ratings
4 reviews
SAP's Sybase Replication Server is database development and management software.
IBM InfoSphere DataStage
10 ratings
4 reviews
IBM InfoSphere DataStage is an ETL platform for integrating data across enterprise systems, available on-premise or on cloud.
VertifyData
4 ratings
4 reviews
VertifyData is a cloud-based integration platform with core integration capacities, including a drag-and-drop interface and real-time synchronization. It also offers over 80 prebuilt connectors and templates, plus customizable integrations for scaling businesses.
Xactly Connect
4 ratings
3 reviews
Xactly Connect is a data integration and open API platform that allows organizations to create and automate data integration processes for commissions processing within Xactly Incent.
COZYROC
6 ratings
3 reviews
COZYROC SSIS+ is a suite of 240+ advanced components for developing ETL solutions with Microsoft SQL Server Integration Services. The vendor states that COZYROC is an easy-to-use, code-free library of tasks, components and reusable scripts that can significantly cut development time and improve the …
SAP Data Intelligence (formerly SAP Data Hub)
9 ratings
3 reviews
SAP Data Intelligence is presented by the vendor as a single solution to innovate with data. Including SAP Data Hub, it provides data-driven innovation in the cloud, on premise, and through BYOL deployments. It is described by the vendor as the new evolution of SAP Data Hub, a data orchestration and…
Phocas Business Intelligence
5 ratings
3 reviews
Phocas helps users discover data and provides results in real time. It is designed for non-technical users and delivers a simple yet powerful analytical capability that quickly turns data into a chart, graph or map. It brings up data on local, regional or global sales, inventory, forecasts, prices…