IBM InfoSphere DataStage

Score 8.6 out of 10

Popular Features

Simple transformations (5)


Connect to traditional data sources (5)


Complex transformations (5)


Metadata management (5)


What is IBM InfoSphere DataStage?

IBM InfoSphere DataStage is an ETL platform for integrating data across enterprise systems, available on-premises or in the cloud.

Entry-level setup fee?

  • No setup fee


  • Free Trial
  • Free/Freemium Version
  • Premium Consulting / Integration Services

Alternatives Pricing

What is Fivetran?

Fivetran replicates applications, databases, events, and files into a high-performance data warehouse after a five-minute setup. The vendor says its standardized cloud pipelines are fully managed and zero-maintenance, and that Fivetran began with a realization: For modern companies using…

What is Oracle GoldenGate?

Oracle GoldenGate is database management software for data integration and availability support across heterogeneous databases.

Features Scorecard

Data Source Connection


Data Transformations


Data Modeling


Data Governance


Product Details

IBM InfoSphere DataStage Technical Details

Operating Systems: Unspecified
Mobile Application: No


Frequently Asked Questions

What is IBM InfoSphere DataStage's best feature?

Reviewers rate Connect to traditional data sources and Simple transformations highest, with a score of 9.8.

Who uses IBM InfoSphere DataStage?

The most common users of IBM InfoSphere DataStage are from Enterprises (1,001+ employees) and the Financial Services industry.

Reviews and Ratings




Filippo Orlando | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
In our company, IBM InfoSphere DataStage is the main ETL engine used by the data integration functions. It handles the integration of most of the company's analytical processes. The reliability and capillarity it has shown make it an essential tool for our needs.
  • reliability
  • capillarity
  • complexity
  • adaptability
I believe that for the classic vision of integration, IBM InfoSphere DataStage is a reliable tool. It certainly cannot meet the latest requirements and does not guarantee the speed and versatility of the new data management systems.
I believe that IBM generally has one of the worst and most complex support systems (physical and online) in existence.
Score 9 out of 10
Vetted Review
Verified User
A few departments, including the IT department and shadow IT groups within other departments, use this tool at Office Depot. We collect, load, and transform data from multiple sources and applications, including sales, inventory, and financial information, among others. We are currently upgrading to the latest version of DataStage on the cloud and connecting to other cloud data sources as well as data files and other types of data sources.
  • Connects to multiple types of data sources, including Oracle, Teradata, Snowflake, and SQL Server.
  • Powerful tool to load large volumes of data.
  • Transformation stages allow us to reduce the amount of code needed to create ETL scripts.
  • Allows us to synchronize and refresh data as often as needed.
  • Connector stages to Snowflake on the cloud. We had some issues initially, but they have since been corrected.
  • Accessing the tool from a browser (zero footprint). Currently we need to either install it locally or connect to a server to do ETL work.
  • Diversify ways of authenticating users.
DataStage is well suited for a company of any size that is looking to move, transform, and clean data and to easily create data warehouses that make data ready to be presented for decision making. DataStage integrates easily with companies that use IBM DB2 as their main RDBMS.
One area where it is less well suited is cost. I have noticed that IBM tools tend to be a little more costly than average.
IBM offers different levels of support, but in my experience being an IBM shop helps to get direct support from more knowledgeable IBM technicians. I am not sure what this kind of support costs, but I know there is also general support, and community blogs and websites on the Internet make it easy to troubleshoot issues whenever the need arises.
Score 8 out of 10
Vetted Review
Verified User
It is mainly used by the IT department as ETL software. It has served as an efficient tool for us to extract, transform, and load data into databases or provide output files for users and other systems. With ODBC connections, it can connect to Oracle, SQL, and other databases and open files to read and write.
  • Very reliable in handling data extraction, data transformation and loading
  • Flexibility in connecting to different types of databases, relational or non-relational
  • Great features such as parallel processing, hash handling, etc.
  • You can also take advantage of its FTP functions, and scheduling features if you need to.
  • Technical support is a key area IBM should improve for this product. Sometimes our case is assigned to a support engineer who has no knowledge of the product or services.
  • Provide custom reports for DataStage jobs and performance, such as job history reports, warning messages, or error messages.
  • Make it fully compatible with Oracle so that users can directly use Oracle ODBC drivers instead of the DataDirect driver. The same goes for SQL Server.
It is well suited if your data is quite complex and many data rules are in place. You may have to write really complex data joins to generate output, but DataStage can handle them and provides an intuitive design for support and analysis.
I mainly handle support cases with IBM and the support is really lagging behind.
January 31, 2019


Score 10 out of 10
Vetted Review
Verified User
  • DS is one of the most powerful ETL tools on the market. Its connectors to different databases, plus the packs, make it a solid and complete tool. It has a great number of functions, and working with large amounts of data in DS is not complex (as long as you have the knowledge and know how to handle the partitioning algorithms, etc.).
  • DataStage has improved over time. It has improved its connectors and added functionality, making development and maintenance easier.
  • Connectivity.
  • Handling large numbers of records.
  • Varied partitioning algorithms.
  • Complementary connectivity packages for applications, SAP, etc.
  • You must know and understand the partitioning algorithms, since using them incorrectly increases processing time.
  • Metadata. You need to develop with connectors and take all the metadata from the menu; any data that you enter manually cannot be tracked.
Recommended for:
  • Small and large data volumes
  • Development and processing of complex data (functions / routines)
  • File management
  • Migrating data from one database to another
  • Cases where you need to track all the data loaded, because you can have all the information about the transformation, the derivation, and where it was used
Gonzalo Angeleri | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
DataStage is part of the IBM data integration and governance suite and is primarily used as an ETL tool to integrate information at the corporate level. Its fully visual development environment minimizes the development and maintenance time of integration processes, and therefore their cost.
  • Proven MPP Engine. Excellent performance for integrating data.
  • 100% visual development, operational and monitoring environment.
  • Any source to any target capabilities.
  • Complex transformation capabilities without writing code.
  • Metadata and data quality capabilities.
  • Hybrid deployment (on-prem or cloud).
  • Unified processing engine.
  • Pricing.
Good for:

  • Integration Layer for a Data Mart and Data Warehousing implementation.
  • Batch or near real-time integration processes
  • Big data implementation (integrating non-Hadoop world with Hadoop)

Not so good for:

  • Real time integration