TrustRadius
IBM DataStage
Formerly InfoSphere DataStage

Overview

What is IBM DataStage?

IBM® DataStage® is a data integration tool that helps users to design, develop and run jobs that move and transform data. At its core, the DataStage tool supports extract, transform and load (ETL) and extract, load and transform (ELT) patterns.…

Recent Reviews

DATASTAGE -ETL

10 out of 10
January 31, 2019
Incentivized
  • DS is one of the most powerful ETL tools on the market. Its connectors to different bases plus the pack make it a solid tool, and …

Popular Features

  • Simple transformations (9)
    9.8
    98%
  • Connect to traditional data sources (9)
    9.5
    95%
  • Complex transformations (9)
    9.3
    93%
  • Collaboration (9)
    9.0
    90%

Pricing

N/A
Unavailable

Entry-level set up fee?

  • No setup fee

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Alternatives Pricing

N/A
Unavailable
What is SolarWinds Task Factory?

According to the vendor, SolarWinds Task Factory saves time managing tedious ELT/ETL tasks with high-performing SQL Server Integration Services (SSIS) components that can be used within the Visual Studio environment to connect to nearly any data source. Task Factory’s components and tasks have been…

What is Skyvia?

Skyvia is a cloud platform for no-coding data integration (both ELT and ETL), automating workflows, cloud to cloud backup, data management with SQL, CSV import/export, creating OData services, etc. The vendor says it supports all major cloud apps and databases, and requires no software except a web…


Features

Data Source Connection

Ability to connect to multiple data sources

9.1
Avg 8.2

Data Transformations

Data transformations include calculations, search and replace, data normalization and data parsing

9.5
Avg 8.4

Data Modeling

A data model is a diagram or flowchart that illustrates the relationships between data

9
Avg 8.1

Data Governance

Data governance is the practice of implementing policies that define the effective use of an organization's data assets

8.9
Avg 8.2

Product Details

What is IBM DataStage?

IBM® DataStage® is a data integration tool that helps users to design, develop and run jobs that move and transform data. At its core, the DataStage tool supports extract, transform and load (ETL) and extract, load and transform (ELT) patterns. A basic version of the software is available for on-premises deployment, and the cloud-based DataStage for IBM Cloud Pak® for Data offers automated integration capabilities in a hybrid or multicloud environment.
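
The difference between the ETL and ELT patterns named above can be illustrated with a small, generic sketch. This is not DataStage code and does not use any IBM API; it relies only on Python's standard `sqlite3` module, and the table names, sample rows, and helper functions (`clean`, `run_etl`, `run_elt`) are hypothetical, chosen purely for illustration. In ETL the rows are cleaned before they reach the target database; in ELT the raw rows are loaded first and cleaned with SQL inside the database.

```python
import sqlite3

# Hypothetical source rows: (customer name, amount stored as text).
SOURCE_ROWS = [(" alice ", "10.50"), ("BOB", "3.25"), ("carol", "7.00")]


def clean(name, amount):
    """Transform step used by ETL: tidy the name and parse the amount."""
    return name.strip().title(), float(amount)


def run_etl(conn, rows):
    """ETL: transform in application code, then load the finished rows."""
    conn.execute("CREATE TABLE sales_etl (customer TEXT, amount REAL)")
    transformed = [clean(name, amount) for name, amount in rows]
    conn.executemany("INSERT INTO sales_etl VALUES (?, ?)", transformed)
    return conn.execute(
        "SELECT customer, amount FROM sales_etl ORDER BY customer"
    ).fetchall()


def run_elt(conn, rows):
    """ELT: load the raw rows first, then transform inside the database."""
    conn.execute("CREATE TABLE raw_sales (customer TEXT, amount TEXT)")
    conn.executemany("INSERT INTO raw_sales VALUES (?, ?)", rows)
    # The transformation is expressed in SQL and runs in the target database.
    conn.execute(
        "CREATE TABLE sales_elt AS "
        "SELECT UPPER(SUBSTR(TRIM(customer), 1, 1)) "
        "       || LOWER(SUBSTR(TRIM(customer), 2)) AS customer, "
        "       CAST(amount AS REAL) AS amount "
        "FROM raw_sales"
    )
    return conn.execute(
        "SELECT customer, amount FROM sales_elt ORDER BY customer"
    ).fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    print(run_etl(connection, SOURCE_ROWS))
    print(run_elt(connection, SOURCE_ROWS))
```

Both functions end with the same cleaned table; what differs is where the transformation work happens, which is the main trade-off between the two patterns.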


IBM DataStage Technical Details

Deployment Types: On-premise, Software as a Service (SaaS), Cloud, or Web-Based
Operating Systems: (none listed)
Mobile Application: No

Frequently Asked Questions

Reviewers rate Simple transformations highest, with a score of 9.8.

The most common users of IBM DataStage are from Enterprises (1,001+ employees).

Reviews and Ratings

(34)

Reviews

(1-3 of 3)
Filippo Orlando | TrustRadius Reviewer
Score 7 out of 10
Vetted Review
Verified User
Incentivized
In our company, IBM InfoSphere DataStage is the main ETL engine used by the data integration functions. It handles the integration of most of the company's analytical processes. The reliability and capillarity it has shown make it an essential tool for our needs.
  • reliability
  • capillarity
  • complexity
  • adaptability
I believe that for the classic vision of integration, IBM InfoSphere DataStage is a reliable tool. It certainly cannot meet the latest requirements and does not guarantee the speed and versatility of the new data management systems.
Data Source Connection (2)
55%
5.5
Connect to traditional data sources
80%
8.0
Connect to Big Data and NoSQL
30%
3.0
Data Transformations (2)
75%
7.5
Simple transformations
80%
8.0
Complex transformations
70%
7.0
Data Modeling (5)
58%
5.8
Data model creation
70%
7.0
Metadata management
50%
5.0
Business rules and workflow
70%
7.0
Collaboration
60%
6.0
Testing and debugging
40%
4.0
Data Governance (2)
50%
5.0
Integration with data quality tools
50%
5.0
Integration with MDM tools
50%
5.0
  • complex to integrate and adapt
  • reliable and safe for traditional flows
We chose IBM InfoSphere DataStage because it is the tool that has been used, historically, at the company level. In the near future, nothing prevents us from orienting ourselves to new solutions in view of a restructuring of architecture.
I believe that IBM generally has one of the worst and most complex support systems (physical and online) that exist.
Score 9 out of 10
Vetted Review
Verified User
Incentivized
A few departments, including the IT department and Shadow IT groups within other departments, use this tool at Office Depot. We collect, load, and transform data from multiple sources and applications, including sales, inventory, and financial information, among others. We are currently updating to the latest version of DataStage on the cloud and connecting to other cloud data-sources as well as data files and other types of data-sources.
  • Connect to multiple types of data-sources including Oracle, Teradata, Snowflake, SQL Server.
  • Powerful tool to load large volumes of data.
  • Transformation stages allow us to reduce the amount of code needed to create ETL scripts.
  • Allow us to synchronize and refresh data as much as needed.
  • Connector Stages to Snowflake on the cloud. We had some issues initially, but they have since been corrected.
  • Accessing tool from a browser (zero foot-print). Currently we need to either install locally or connect to a server to do ETL work.
  • Diversify ways of authenticating users.
DataStage is well suited for any size of company that's looking to move, transform, and clean data and to easily create data-warehouses that help make data ready to be presented for decision making. DataStage would easily integrate with companies that use IBM DB2 as their main RDBMS.
A scenario where it is less suited could be one where cost is a concern. I have noticed IBM tools tend to be a little more costly than average.
Data Source Connection (2)
75%
7.5
Connect to traditional data sources
100%
10.0
Connect to Big Data and NoSQL
50%
5.0
Data Transformations (2)
95%
9.5
Simple transformations
100%
10.0
Complex transformations
90%
9.0
Data Modeling (5)
68%
6.8
Data model creation
50%
5.0
Metadata management
60%
6.0
Business rules and workflow
90%
9.0
Collaboration
50%
5.0
Testing and debugging
90%
9.0
Data Governance (2)
70%
7.0
Integration with data quality tools
90%
9.0
Integration with MDM tools
50%
5.0
  • Not directly related to ROI or cost figures. Only comment here is that IBM tools tend to be more costly than average ETL tools, but it depends on if the company is an IBM shop.
  • One positive aspect is that the company has not needed to switch ETL tools for years.
  • Upgrading to newer versions of the tool brings flexibility in the tool and up-to-date features in relation to other applications.
We are currently not using any of the Informatica tools, so I don't have a real way of comparing the tools. But in comparison against Microsoft SSIS (SQL Server Integration Services), I'd say DataStage stacks up favorably. DataStage is a powerful tool for ETL processes that integrates well with multiple data-sources and can handle high volumes of data. DataStage is also scalable and can be a more robust application when compared to SSIS.
Our development teams in the company can easily develop any ETL scripts that are needed to massage and move the data as needed. The company has also maintained this tool for a very long time, making it automatic when it comes to ETL needs. We are currently trying to make DataStage our main and probably only ETL tool.
IBM offers different levels of support, but in my experience being an IBM shop helps to get direct support from more knowledgeable technicians from IBM. I am not sure of the cost of having this kind of support, but I know there's also general support, and community blogs and websites on the Internet make it easy to troubleshoot issues whenever there's a need for that.
Score 8 out of 10
Vetted Review
Verified User
Incentivized
It is mainly used by the IT department as ETL software. It has served as an efficient tool for us to extract, transform, and load data into the databases or provide output files for users and other systems. With ODBC connections, it can connect to Oracle, SQL, and other databases and open files to read and write.
  • Very reliable in handling data extraction, data transformation and loading
  • Flexibility in connecting to different types of databases, relational or non-relational
  • Great features such as parallel processing, hash handling, etc.
  • You can also take advantage of its FTP functions, and scheduling features if you need to.
  • Technical support is a key area IBM should improve for this product. Sometimes our case is assigned to a support engineer and he has no idea of the product or services.
  • Provide custom reports for datastage jobs and performance such as job history reports, warning messages or error messages.
  • Make it fully compatible with Oracle so that users can directly use Oracle ODBC drivers instead of the Data Direct driver. Same for SQL Server.
It is well suited if your data is quite complex and many data rules are in place. Where you would otherwise have to write really complex data joins to generate output, DataStage can handle it and provides an intuitive design for support and analysis.
Data Source Connection (1)
80%
8.0
Connect to traditional data sources
80%
8.0
Data Transformations (2)
95%
9.5
Simple transformations
100%
10.0
Complex transformations
90%
9.0
Data Modeling (4)
72.5%
7.3
Metadata management
80%
8.0
Business rules and workflow
90%
9.0
Collaboration
40%
4.0
Testing and debugging
80%
8.0
Data Governance
N/A
N/A
  • Provides us an excellent ETL application
  • Made our data handling easy
  • Provides the business with high-quality reports
We have very limited experience of using Informix and should not provide any comments. But DataStage works really well for us.
I mainly handle support cases with IBM and the support is really lagging behind.