Skip to main content
TrustRadius
IBM DataStage

IBM DataStage
Formerly InfoSphere DataStage

Overview

What is IBM DataStage?

IBM® DataStage® is a data integration tool that helps users to design, develop and run jobs that move and transform data. At its core, the DataStage tool supports extract, transform and load (ETL) and extract, load and transform (ELT) patterns.…

Read more
Recent Reviews

DATASTAGE -ETL

10 out of 10
January 31, 2019
Incentivized
  • DS is one of the most powerful ETL tools on the market. Its connectors to different bases plus the pack make it a solid tool, and …
Continue reading
Read all reviews

Awards

Products that are considered exceptional by their customers based on a variety of criteria win TrustRadius awards. Learn more about the types of TrustRadius awards to make the best purchase decision. More about TrustRadius Awards

Popular Features

View all 11 features
  • Simple transformations (9)
    9.8
    98%
  • Connect to traditional data sources (9)
    9.5
    95%
  • Complex transformations (9)
    9.3
    93%
  • Collaboration (9)
    9.0
    90%
Return to navigation

Pricing

View all pricing
N/A
Unavailable

What is IBM DataStage?

IBM® DataStage® is a data integration tool that helps users to design, develop and run jobs that move and transform data. At its core, the DataStage tool supports extract, transform and load (ETL) and extract, load and transform (ELT) patterns. A basic version of the software is available for on…

Entry-level set up fee?

  • No setup fee

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Would you like us to let the vendor know that you want pricing?

27 people also want pricing

Alternatives Pricing

N/A
Unavailable
What is SolarWinds Task Factory?

According to the vendor, SolarWinds Task Factory saves time managing tedious ELT/ETL tasks with high-performing SQL Server Integration Services (SSIS) components that can be used within the Visual Studio environment to connect to nearly any data source. Task Factory’s components and tasks have been…

What is Skyvia?

Skyvia is a cloud platform for no-coding data integration (both ELT and ETL), automating workflows, cloud to cloud backup, data management with SQL, CSV import/export, creating OData services, etc. The vendor says it supports all major cloud apps and databases, and requires no software except a web…

Return to navigation

Features

Data Source Connection

Ability to connect to multiple data sources

9.1
Avg 8.2

Data Transformations

Data transformations include calculations, search and replace, data normalization and data parsing

9.5
Avg 8.4

Data Modeling

A data model is a diagram or flowchart that illustrates the relationships between data

9
Avg 8.2

Data Governance

Data governance is the practise of implementing policies defining effective use of an organization's data assets

8.9
Avg 8.2
Return to navigation

Product Details

What is IBM DataStage?

IBM® DataStage® is a data integration tool that helps users to design, develop and run jobs that move and transform data. At its core, the DataStage tool supports extract, transform and load (ETL) and extract, load and transform (ELT) patterns. A basic version of the software is available for on-premises deployment, and the cloud-based DataStage for IBM Cloud Pak® for Data offers automated integration capabilities in a hybrid or multicloud environment.


IBM DataStage Technical Details

Deployment TypesOn-premise, Software as a Service (SaaS), Cloud, or Web-Based
Operating Systems,
Mobile ApplicationNo

Frequently Asked Questions

IBM® DataStage® is a data integration tool that helps users to design, develop and run jobs that move and transform data. At its core, the DataStage tool supports extract, transform and load (ETL) and extract, load and transform (ELT) patterns. A basic version of the software is available for on-premises deployment, and the cloud-based DataStage for IBM Cloud Pak® for Data offers automated integration capabilities in a hybrid or multicloud environment.

Reviewers rate Simple transformations highest, with a score of 9.8.

The most common users of IBM DataStage are from Enterprises (1,001+ employees).
Return to navigation

Comparisons

View all alternatives
Return to navigation

Reviews and Ratings

(34)

Attribute Ratings

Reviews

(1-9 of 9)
Companies can't remove reviews or game the system. Here's why
niman goli | TrustRadius Reviewer
Score 10 out of 10
Vetted Review
Verified User
Incentivized
IBM InfoSphere DataStage is used for data analysis for business trending and banking aging report for the customers.It would be helpful for finding when the last transaction was done and when the future transaction would be coming for . IBM InfoSphere DataStagecan help with ease of access , easy of gui management and easy navigation for the user to great help
  • banking environment
  • user data management
  • user account management
  • data stage integration with cloud
  • integration with bmc
  • database management with datastage
it is suited for the scenario where the major or more number users and transactions are involved.it would help with data analysis,of how many transactions, and what was the frequency of the transactions,how much amount was paid and how much was the due , and how many count of the refills done.
Edger Loredo | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User
Incentivized
The effective mapping product and highly effective for project data management and scalability is the best. Functions are simple to start with and all the features are easy to custom. The IBM platform offers the best mapping capability and the data modeling functions are excellent. The reports are excellent and effective.
  • Mapping tools are excellent.
  • Reporting functions.
  • Data collaboration.
  • The deep functions manipulation is tricky.
  • Tools are not easy to manipulate through Cloud services.
  • Ability to manage big data can be more functional.
Excellent Cloud data mapping tool and easy creating multiple project data analytics in real-time and the report distribution are excellent via this IBM product. Easy tool to provide data visualization and the integration is effective and helpful to migrating huge amounts of data across other platforms and different websites insights gathering.
Score 9 out of 10
Vetted Review
Verified User
Incentivized
This is the primary tool that assists with Data movement and transformations from homogeneous and heterogeneous sources. Within the organization, this tool is extensively used to build warehouses and data marts that support downstream analytics. the Seamless nature of the product with the ability to weave shell scripts, python scripts, and data processing jobs into a single executable sequence is very helpful.
  • Data movement
  • Seamless integration of scripts and etl jobs
  • Descriptive logging
  • Ability to work with myriad of data assets
  • Direct integration for Governance catalog
  • Hierarchical stages to parse and build xmls and jsons needs improvement.
  • Web interface of the application also needs improvement.
IBM infosphere works exceptionally well as an ETL tool. It is relatively easy to pick up and the learning curve is not steep for a new user. From an integration perspective, Infosphere DataStage has connectors to all the prominent databases and files. The seamless integration with the Active directory helps manage the security and access privileges. A place to improve - web-based hierarchical stage.
Score 7 out of 10
Vetted Review
Verified User
Incentivized
We use DataStage for ETL development, creating jobs to extract the data from different source systems and transform the data to use with different databases, and load the data into the database tables/views using staging tables. We use DataStage with Cognos to show the glossary information for any particular field and track it all the way from the report to the source application.
  • Data integration
  • ETL jobs
  • Business Glossary
  • Creating snapshot views using jobs
  • Better interface for the job designs
It helps minimize the project delivery cycle since it allows a common set of tools across IIS. MDM integrations are a plus.
Filippo Orlando | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Incentivized
In our company, IBM InfoSphere DataStage is the main ETL engine used by the data integration functions. It guarantees the integration of most of the company's analytical processes. The reliability and capillarity that has shown make it an essential tool for our needs.
  • reliability
  • capillarity
  • complexity
  • adaptability
I believe that for the classic vision of integration, IBM InfoSphere DataStage is a reliable tool. It certainly cannot meet the latest requirements and does not guarantee the speed and versatility of the new data management systems.
Score 9 out of 10
Vetted Review
Verified User
Incentivized
A few departments including IT department and Shadow IT groups within other departments in the company use the this tool at Office Depot. We recollect, load and transform data from multiple sources and applications including sales, inventory, financial information among others. Currently updating to the latest version of DataStage on the cloud and connecting to other cloud data-sources as well as data files and other types of data-sources.
  • Connect to multiple types of data-sources including Oracle, Teradata, Snowflake, SQl Server.
  • Powerful tool to load large volumes of data.
  • Transformation stages allow us to reduce the amount of code needed to create ETL scripts.
  • Allow us to synchronize and refresh data as much as needed.
  • Connector Stages to Snowflake on the cloud. We had some issues initially but since then had been corrected.
  • Accessing tool from a browser (zero foot-print). Currently we need to either install locally or connect to a server to do ETL work.
  • Diversify ways of authenticating users.
DataStage is well suited for any size of company that's looking to move, transform, clean data and easily create data-warehouses that would help to make data ready to be presented for decision making. Data Stage would easily integrate with companies that use IBM DB2 as their main RDBMS.
A scenario where it less suited could be cost. I have noticed IBM tools tend to be a little more costly than average.
Score 8 out of 10
Vetted Review
Verified User
Incentivized
It is mainly used by IT department as ETL software. It served as an efficient tool for us to extract data, transform and load data in the databases or provide the output file for users and other systems. With ODBC connections, it can connect Oracle, SQL, and other databases and open files to read and write.
  • Very reliable in handling data extraction, data transformation and loading
  • Flexibility in connecting to different type of databases, relational or non-relational
  • Great features such as parallel processing, hash handling, etc.
  • You can also take advantage of its FTP functions, and scheduling features if you need to.
  • Technical support is a key area IBM should improve for this product. Sometimes our case is assigned to a support engineer and he has no idea of the product or services.
  • Provide custom reports for datastage jobs and performance such as job history reports, warning messages or error messages.
  • Make it fully compatible with Oracle and users can direct use of Oracle ODBC drivers instead of Data Direct driver. Same for SQL server.
It is well suited if your data is quite complex and many data rules are in place. You have to write real complex data joins to generate output. While DataStage can handle it and provide you an intuitive design for support and analysis.
January 31, 2019

DATASTAGE -ETL

Score 10 out of 10
Vetted Review
Verified User
Incentivized
  • DS is one of the most powerful ETL tools on the market. Its connectors to different bases plus the pack make it a solid tool, and complete. It has a great number of functions, and the work with a big amount of data with DS is not complex (as long as you have the knowledge and know how to handle the partitioning algorithms, etc).
  • DataStage has improved over time. It has improved its connectors, added functionalities, making easier the programming and the maintenance of the development.
  • Connectivity.
  • Handling large numbers of records.
  • Varied partitioning algorithms.
  • Complementary packages of connectivity to applications, SAP, etc.
  • You must understand and know the algorithms, since the wrong use of them generates more time in processing.
  • Metadata. You need to develop with connectors, and taking all the Metadata from the menu, all the data that you complete manually, you can't track it.
Recommend for:
Small and large data volumes
For development and processing of complex data (functions / routines)
File management
Migrating Data from a Database to other
When you need to track all the data loaded, because you can have all the information about the transformation, the derivation, and where it was used
Gonzalo Angeleri | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
ResellerIncentivized
DataStage is part of the IBM data integration and governance suite and is primarily used as an ETL tool to integrate information at the corporate level. It allows to minimize the development and maintenance times of integration processes by having a totally visual development environment and therefore the cost of these processes.
  • Proven MPP Engine. Excellent performance for integrating data.
  • 100% visual development, operational and monitoring environment.
  • Any source to any target capabilities.
  • Complex transformation capabilities without writing code.
  • Metadada and data quality capabilities.
  • Hybrid deployment (on-prem or cloud).
  • Unified processing engine.
  • Pricing.
Good for:

  • Integration Layer for a Data Mart and Data Warehousing implementation.
  • Batch or near real-time integration processes
  • Big data implementation (integrating non-Hadoop world with Hadoop)

Not so Good:

  • Real time integration
Return to navigation