AWS Glue vs. Informatica Cloud Data Quality

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
AWS Glue
Score 8.6 out of 10
N/A
AWS Glue is a managed extract, transform, and load (ETL) service designed to make it easy for customers to prepare and load data for analytics. With it, users can create and run an ETL job in the AWS Management Console. Users point AWS Glue to data stored on AWS, and AWS Glue discovers data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, data is immediately searchable, queryable, and available for ETL.
$0.44
billed per second, 1 minute minimum
Informatica Cloud Data Quality
Score 6.8 out of 10
N/A
The vendor states that Informatica Data Quality empowers companies to take a holistic approach to managing data quality across the entire organization, and that with Informatica Data Quality, users are able to ensure the success of data-driven digital transformation initiatives and projects across users, types, and scale, while also automating mission-critical tasks.N/A
Pricing
AWS GlueInformatica Cloud Data Quality
Editions & Modules
per DPU-Hour
$0.44
billed per second, 1 minute minimum
No answers on this topic
Offerings
Pricing Offerings
AWS GlueInformatica Cloud Data Quality
Free Trial
NoNo
Free/Freemium Version
NoNo
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Community Pulse
AWS GlueInformatica Cloud Data Quality
Features
AWS GlueInformatica Cloud Data Quality
Data Quality
Comparison of Data Quality features of Product A and Product B
AWS Glue
-
Ratings
Informatica Cloud Data Quality
8.2
4 Ratings
3% below category average
Data source connectivity00 Ratings8.94 Ratings
Data profiling00 Ratings8.74 Ratings
Master data management (MDM) integration00 Ratings8.24 Ratings
Data element standardization00 Ratings7.14 Ratings
Match and merge00 Ratings7.94 Ratings
Address verification00 Ratings8.44 Ratings
Best Alternatives
AWS GlueInformatica Cloud Data Quality
Small Businesses
IBM SPSS Modeler
IBM SPSS Modeler
Score 9.2 out of 10
HubSpot Data Hub
HubSpot Data Hub
Score 8.3 out of 10
Medium-sized Companies
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
Enterprises
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
AWS GlueInformatica Cloud Data Quality
Likelihood to Recommend
8.8
(10 ratings)
9.0
(19 ratings)
Likelihood to Renew
-
(0 ratings)
6.6
(14 ratings)
Usability
9.2
(3 ratings)
8.0
(1 ratings)
Availability
-
(0 ratings)
9.0
(2 ratings)
Performance
-
(0 ratings)
9.0
(1 ratings)
Support Rating
7.0
(1 ratings)
-
(0 ratings)
Online Training
-
(0 ratings)
10.0
(1 ratings)
Implementation Rating
-
(0 ratings)
10.0
(1 ratings)
Product Scalability
-
(0 ratings)
9.0
(1 ratings)
User Testimonials
AWS GlueInformatica Cloud Data Quality
Likelihood to Recommend
Amazon AWS
One of AWS Glue's most notable features that aid in the creation and transformation of data is its data catalog. Support, scheduling, and the automation of the data schema recognition make it superior to its competitors aside from that. It also integrates perfectly with other AWS tools. The main restriction may be integrated with systems outside of the AWS environment. It functions flawlessly with the current AWS services but not with other goods. Another potential restriction that comes to mind is that glue operates on a spark, which means the engineer needs to be conversant in the language.
Read full review
Informatica
For effective data collaboration, systematic verification of customer information, and address, among others, Informatica Data Quality is a fruitful application to consider. Besides, Informatica Data Quality controls quality through a cleansing process, giving the company a professional outline of candid data profiling and reputable analytics. Finally, Informatica Data Quality allows the simplistic navigation of content, with a dashboard that supports predictability.
Read full review
Pros
Amazon AWS
  • It is extremely fast, easy, and self-intuitive. Though it is a suite of services, it requires pretty less time to get control over it.
  • As it is a managed service, one need not take care of a lot of underlying details. The identification of data schema, code generation, customization, and orchestration of the different job components allows the developers to focus on the core business problem without worrying about infrastructure issues.
  • It is a pay-as-you-go service. So, there is no need to provide any capacity in advance. So, it makes scheduling much easier.
Read full review
Informatica
  • The matching algorithms in IDQ are very powerful if you understand the different types that they offer (e.g., Hamming Distance, Jaro, Bigram, etc..). We had to play around with it to see which best suit our own needs of identifying and eliminating duplicate customers. Setting up the whole process (e.g., creating the KeyGenerator Transformation, setting up the matching threshold, etc..) can be somewhat time consuming and a challenge if you don't first standardize your data.
  • The integration with PowerCenter is great if you have both. You can either import your mappings directly to PowerCenter or to an XML file. The only downside is that some of the transformations are unique to IDQ, so you are not really able to edit them once in PowerCenter.
  • The standardizer transformation was key in helping us standardize our customer data (e.g., names, addresses, etc..). It was helpful due to having create a reference table containing the standardized value and the associated unstandardized values. What was great was that if you used Informatica Analyst, a business analyst could login and correct any of the values.
Read full review
Cons
Amazon AWS
  • In-Stream schema registries feature people can not use this more efficiently
  • in Connections feature they can add more connectors as well
  • The crucial problem with AWS Glue is that it only works with AWS.
Read full review
Informatica
  • Several partnerships diminishing the value of technologies
  • Unable to get list of objects from Repository (like sources & targets) that don't have any dependency
  • Scheduling: The built-in scheduling tool has many constraints such as handling Unix/VB scripts etc. Most enterprises use third party tools for this.
Read full review
Likelihood to Renew
Amazon AWS
No answers on this topic
Informatica
As pointed out earlier, due all the robust features IDQ has, our use f the product is successful and stable. IDQ is being used in multiple sources (from CRM application and in batch mode). As this is an iterative process, we are looking to improve our system efficiency using IDQ.
Read full review
Usability
Amazon AWS
While easy to set up and manage monitoring for large datasets, its complexity can be a barrier for new users. Integration with AWS Ecosystem, Managed Monitoring, Dashboards and monitoring tools for AWS Glue are generally easy to set up and maintain, Automated Data Pipelines. Automates data pipeline creation, making it efficient for certain data integration
Read full review
Informatica
Easy to use not only for developers but also business users
Read full review
Reliability and Availability
Amazon AWS
No answers on this topic
Informatica
The application works well except an occasional error out while using the system. It usually gets fixed when restarting the Infa server
Read full review
Performance
Amazon AWS
No answers on this topic
Informatica
Performance works just fine. It was able to load 200+ business terms, 150+ DQ automation, etc. very well.
Read full review
Support Rating
Amazon AWS
Amazon responds in good time once the ticket has been generated but needs to generate tickets frequent because very few sample codes are available, and it's not cover all the scenarios.
Read full review
Informatica
No answers on this topic
Alternatives Considered
Amazon AWS
AWS Glue is a fully managed ETL service that automates many ETL tasks, making it easier to set AWS Glue simplifies ETL through a visual interface and automated code generation.
Read full review
Informatica
IDQ is used by a department at my organisation to ensure and enhance the data quality.
The usage was started with address standardization and now it had been brought to altogether a next level of quality check where it fixes duplicates, junk characters, standardize the names, streets, product descriptions.
In the past we had issues mainly with duplicate customers and products and this were affecting the sales projection and estimates.
Read full review
Scalability
Amazon AWS
No answers on this topic
Informatica
Scalability works as expected and it is truly an enterprise system.
Read full review
Return on Investment
Amazon AWS
  • We are using GLUE for our ETL purpose. it’s ease with other our AWS services makes our ROI, 100% ROI.
  • One missing piece was compatibility with other data source for which we found a work around and made our data source as S3 only, so our dependencies on other data source is also reducing
Read full review
Informatica
  • Integration with tools like PowerCenter helped faster delivery of product, and at the same time conversion
  • Reduce overall project cost due to bad data , bad quality, exceptions identified nearing go-live and post production
  • Employee efficiency is increased exponentially due to more automated, customized tool
Read full review
ScreenShots