AWS Glue vs. IBM DataStage

AWS Glue

IBM DataStage

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
AWS Glue	Score 8.6 out of 10	N/A	AWS Glue is a managed extract, transform, and load (ETL) service designed to make it easy for customers to prepare and load data for analytics. With it, users can create and run an ETL job in the AWS Management Console. Users point AWS Glue to data stored on AWS, and AWS Glue discovers data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, data is immediately searchable, queryable, and available for ETL.	$0.44 billed per second, 1 minute minimum
IBM DataStage	Score 7.7 out of 10	N/A	IBM® DataStage® is a data integration tool that helps users to design, develop and run jobs that move and transform data. At its core, the DataStage tool supports extract, transform and load (ETL) and extract, load and transform (ELT) patterns. A basic version of the software is available for on-premises deployment, and the cloud-based DataStage for IBM Cloud Pak® for Data offers automated integration capabilities in a hybrid or multicloud environment.	N/A

Pricing

AWS Glue

IBM DataStage

Editions & Modules

per DPU-Hour: $0.44
billed per second, 1 minute minimum

No answers on this topic

Offerings

Pricing Offerings
AWS Glue	IBM DataStage
Free Trial
No	Yes
Free/Freemium Version
No	No
Premium Consulting/Integration Services
No	No

Entry-level Setup Fee

No setup fee

Additional Details

—

More Pricing Information

Community Pulse
	AWS Glue	IBM DataStage
Considered Both Products	AWS Glue No answer on this topic	IBM DataStage Verified User Manager Chose IBM DataStage IBM DataStage performes bettere than SSIS in every aspect. IBM DataStage performes better than SAP Data Services in terms of variables and job orchestration flexibility. It is as strong as ODI, but less complex to implement. It allows to write SQL queries as dbt and Glue, but I … Incentivized Helpful?

Features

AWS Glue

IBM DataStage

Data Source Connection

Comparison of Data Source Connection features of Product A and Product B
	AWS Glue - Ratings	IBM DataStage 8.1 11 Ratings 2% below category average
Connect to traditional data sources	00 Ratings	8.311 Ratings
Connecto to Big Data and NoSQL	00 Ratings	7.810 Ratings

Data Transformations

Comparison of Data Transformations features of Product A and Product B
	AWS Glue - Ratings	IBM DataStage 7.7 11 Ratings 6% below category average
Simple transformations	00 Ratings	8.011 Ratings
Complex transformations	00 Ratings	7.411 Ratings

Data Modeling

Comparison of Data Modeling features of Product A and Product B
	AWS Glue - Ratings	IBM DataStage 7.0 11 Ratings 12% below category average
Data model creation	00 Ratings	6.78 Ratings
Metadata management	00 Ratings	5.010 Ratings
Business rules and workflow	00 Ratings	7.210 Ratings
Collaboration	00 Ratings	7.211 Ratings
Testing and debugging	00 Ratings	6.611 Ratings

Data Governance

Comparison of Data Governance features of Product A and Product B
	AWS Glue - Ratings	IBM DataStage 5.4 10 Ratings 39% below category average
Integration with data quality tools	00 Ratings	5.410 Ratings
Integration with MDM tools	00 Ratings	5.410 Ratings

Best Alternatives
	AWS Glue	IBM DataStage
Small Businesses	IBM SPSS Modeler Score 9.3 out of 10	Skyvia Score 10.0 out of 10
Medium-sized Companies	IBM InfoSphere Information Server Score 8.0 out of 10	IBM InfoSphere Information Server Score 8.0 out of 10
Enterprises	IBM InfoSphere Information Server Score 8.0 out of 10	IBM InfoSphere Information Server Score 8.0 out of 10
All Alternatives	View all alternatives	View all alternatives

User Ratings
	AWS Glue	IBM DataStage
Likelihood to Recommend	8.8 (10 ratings)	6.8 (10 ratings)
Usability	9.2 (3 ratings)	8.0 (4 ratings)
Performance	- (0 ratings)	9.0 (1 ratings)
Support Rating	7.0 (1 ratings)	9.6 (3 ratings)

User Testimonials
	AWS Glue	IBM DataStage
Likelihood to Recommend	Amazon AWS One of AWS Glue's most notable features that aid in the creation and transformation of data is its data catalog. Support, scheduling, and the automation of the data schema recognition make it superior to its competitors aside from that. It also integrates perfectly with other AWS tools. The main restriction may be integrated with systems outside of the AWS environment. It functions flawlessly with the current AWS services but not with other goods. Another potential restriction that comes to mind is that glue operates on a spark, which means the engineer needs to be conversant in the language. Incentivized Verified User Anonymous Read full review	IBM DataStage is somewhat outdated for an ETL. I guess that's what makes it a bit lagged behind its competitors. It can be used for data processing, sure, but its performance seems to be lagging behind or quite slow given the server it is running from. I won’t depend on this application if it's handling a lot of mission-critical banking and business data. Verified User Anonymous Read full review
Pros	Amazon AWS It is extremely fast, easy, and self-intuitive. Though it is a suite of services, it requires pretty less time to get control over it. As it is a managed service, one need not take care of a lot of underlying details. The identification of data schema, code generation, customization, and orchestration of the different job components allows the developers to focus on the core business problem without worrying about infrastructure issues. It is a pay-as-you-go service. So, there is no need to provide any capacity in advance. So, it makes scheduling much easier. Incentivized Apurv Doshi Practice Head - Labs (Innovation and R&D) Read full review	IBM Connect to multiple types of data-sources including Oracle, Teradata, Snowflake, SQl Server. Powerful tool to load large volumes of data. Transformation stages allow us to reduce the amount of code needed to create ETL scripts. Allow us to synchronize and refresh data as much as needed. Incentivized HG Herber Gonzalez Lead Developer Read full review
Cons	Amazon AWS In-Stream schema registries feature people can not use this more efficiently in Connections feature they can add more connectors as well The crucial problem with AWS Glue is that it only works with AWS. Incentivized Verified User Anonymous Read full review	IBM Technical support is a key area IBM should improve for this product. Sometimes our case is assigned to a support engineer and he has no idea of the product or services. Provide custom reports for datastage jobs and performance such as job history reports, warning messages or error messages. Make it fully compatible with Oracle and users can direct use of Oracle ODBC drivers instead of Data Direct driver. Same for SQL server. Incentivized Verified User Anonymous Read full review
Usability	Amazon AWS While easy to set up and manage monitoring for large datasets, its complexity can be a barrier for new users. Integration with AWS Ecosystem, Managed Monitoring, Dashboards and monitoring tools for AWS Glue are generally easy to set up and maintain, Automated Data Pipelines. Automates data pipeline creation, making it efficient for certain data integration Incentivized SC Sonny Carlos Head of Cloud and Data Business Read full review	IBM Because it is robust, and it is being continuously improved. DS is one of the most used and recognized tools in the market. Large companies have implemented it in the first instance to develop their DW, but finding the advantages it has, they could use it for other types of projects such as migrations, application feeding, etc. Incentivized Verified User Anonymous Read full review
Performance	Amazon AWS No answers on this topic	IBM It could load thousands of records in seconds. But in the Parallel version, you need to understand how to particionate the data. If you use the algorithms erroneously, or the functionalities that it gives for the parsing of data, the performance can fall drastically, even with few records. It is necessary to have people with experience to be able to determine which algorithm to use and understand why. Incentivized Verified User Anonymous Read full review
Support Rating	Amazon AWS Amazon responds in good time once the ticket has been generated but needs to generate tickets frequent because very few sample codes are available, and it's not cover all the scenarios. Incentivized Verified User Anonymous Read full review	IBM IBM offers different levels of support but in my experience being and IBM shop helps to get direct support from more knowledgeable technicians from IBM. Not sure on the cost of having this kind of support, but I know there's also general support and community blogs and websites on the Internet make it easy to troubleshoot issues whenever there's need for that. Incentivized HG Herber Gonzalez Lead Developer Read full review
Alternatives Considered	Amazon AWS AWS Glue is a fully managed ETL service that automates many ETL tasks, making it easier to set AWS Glue simplifies ETL through a visual interface and automated code generation. Ashutosh Mishra Read full review	IBM With effective capabilities and easy to manipulate the features and easy to produce accurate data analytics and the Cloud services Automation, this IBM platform is more reliable and easy to document management. The features on this platform are equipped with excellent big data management and easy to provide accurate data analytics. Incentivized Edger Loredo Project Manager Read full review
Return on Investment	Amazon AWS We are using GLUE for our ETL purpose. it’s ease with other our AWS services makes our ROI, 100% ROI. One missing piece was compatibility with other data source for which we found a work around and made our data source as S3 only, so our dependencies on other data source is also reducing Incentivized Verified User Anonymous Read full review	IBM It’s hard to say at this point, it delivers, but not quite as I expected. It takes a lot of resources to manage and sort this out (manpower, financial). Definitely, I don’t have the exact numbers, but given the data it processes, it is A LOT. So props to the developer of this application. Again, based on my experience, I’d choose other ETL apps if there is one that's more user-friendly. Verified User Anonymous Read full review
ScreenShots