AWS Glue vs. Informatica Cloud Data Quality

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
AWS Glue
Score 7.6 out of 10
N/A
AWS Glue is a managed extract, transform, and load (ETL) service designed to make it easy for customers to prepare and load data for analytics. With it, users can create and run an ETL job in the AWS Management Console. Users point AWS Glue to data stored on AWS, and AWS Glue discovers data and stores the associated metadata (e.g. table definition and schema) in the AWS Glue Data Catalog. Once cataloged, data is immediately searchable, queryable, and available for ETL.
$0.44
billed per second, 1 minute minimum
Informatica Cloud Data Quality
Score 8.5 out of 10
N/A
The vendor states that Informatica Data Quality empowers companies to take a holistic approach to managing data quality across the entire organization, and that with Informatica Data Quality, users are able to ensure the success of data-driven digital transformation initiatives and projects across users, types, and scale, while also automating mission-critical tasks.N/A
Pricing
AWS GlueInformatica Cloud Data Quality
Editions & Modules
per DPU-Hour
$0.44
billed per second, 1 minute minimum
No answers on this topic
Offerings
Pricing Offerings
AWS GlueInformatica Cloud Data Quality
Free Trial
NoNo
Free/Freemium Version
NoNo
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Features
AWS GlueInformatica Cloud Data Quality
Data Quality
Comparison of Data Quality features of Product A and Product B
AWS Glue
-
Ratings
Informatica Cloud Data Quality
8.9
6 Ratings
2% above category average
Data source connectivity00 Ratings9.36 Ratings
Data profiling00 Ratings9.26 Ratings
Master data management (MDM) integration00 Ratings8.96 Ratings
Data element standardization00 Ratings8.26 Ratings
Match and merge00 Ratings8.76 Ratings
Address verification00 Ratings9.06 Ratings
Best Alternatives
AWS GlueInformatica Cloud Data Quality
Small Businesses
IBM SPSS Modeler
IBM SPSS Modeler
Score 7.8 out of 10
HubSpot Operations Hub
HubSpot Operations Hub
Score 8.0 out of 10
Medium-sized Companies
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
Enterprises
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.1 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
AWS GlueInformatica Cloud Data Quality
Likelihood to Recommend
8.0
(7 ratings)
9.3
(20 ratings)
Likelihood to Renew
-
(0 ratings)
6.6
(14 ratings)
Usability
-
(0 ratings)
8.0
(1 ratings)
Availability
-
(0 ratings)
9.0
(2 ratings)
Performance
-
(0 ratings)
9.0
(1 ratings)
Support Rating
7.0
(1 ratings)
-
(0 ratings)
Online Training
-
(0 ratings)
10.0
(1 ratings)
Implementation Rating
-
(0 ratings)
10.0
(1 ratings)
Product Scalability
-
(0 ratings)
9.0
(1 ratings)
User Testimonials
AWS GlueInformatica Cloud Data Quality
Likelihood to Recommend
Amazon AWS
One of AWS Glue's most notable features that aid in the creation and transformation of data is its data catalog. Support, scheduling, and the automation of the data schema recognition make it superior to its competitors aside from that. It also integrates perfectly with other AWS tools. The main restriction may be integrated with systems outside of the AWS environment. It functions flawlessly with the current AWS services but not with other goods. Another potential restriction that comes to mind is that glue operates on a spark, which means the engineer needs to be conversant in the language.
Read full review
Informatica
Helps to increase productivity, optimize costs, and democratize data across multiple cloud environments with cloud ETL and ELT. Capacity to integrate data sources at scale and with ease. Has cloud data integration capabilities that cover diverse sets of patterns, use cases, and users ensuring we have well-architected and seamless automated data pipelines.
Read full review
Pros
Amazon AWS
  • It is extremely fast, easy, and self-intuitive. Though it is a suite of services, it requires pretty less time to get control over it.
  • As it is a managed service, one need not take care of a lot of underlying details. The identification of data schema, code generation, customization, and orchestration of the different job components allows the developers to focus on the core business problem without worrying about infrastructure issues.
  • It is a pay-as-you-go service. So, there is no need to provide any capacity in advance. So, it makes scheduling much easier.
Read full review
Informatica
  • The matching algorithms in IDQ are very powerful if you understand the different types that they offer (e.g., Hamming Distance, Jaro, Bigram, etc..). We had to play around with it to see which best suit our own needs of identifying and eliminating duplicate customers. Setting up the whole process (e.g., creating the KeyGenerator Transformation, setting up the matching threshold, etc..) can be somewhat time consuming and a challenge if you don't first standardize your data.
  • The integration with PowerCenter is great if you have both. You can either import your mappings directly to PowerCenter or to an XML file. The only downside is that some of the transformations are unique to IDQ, so you are not really able to edit them once in PowerCenter.
  • The standardizer transformation was key in helping us standardize our customer data (e.g., names, addresses, etc..). It was helpful due to having create a reference table containing the standardized value and the associated unstandardized values. What was great was that if you used Informatica Analyst, a business analyst could login and correct any of the values.
Read full review
Cons
Amazon AWS
  • In-Stream schema registries feature people can not use this more efficiently
  • in Connections feature they can add more connectors as well
  • The crucial problem with AWS Glue is that it only works with AWS.
Read full review
Informatica
  • Several partnerships diminishing the value of technologies
  • Unable to get list of objects from Repository (like sources & targets) that don't have any dependency
  • Scheduling: The built-in scheduling tool has many constraints such as handling Unix/VB scripts etc. Most enterprises use third party tools for this.
Read full review
Likelihood to Renew
Amazon AWS
No answers on this topic
Informatica
As pointed out earlier, due all the robust features IDQ has, our use f the product is successful and stable. IDQ is being used in multiple sources (from CRM application and in batch mode). As this is an iterative process, we are looking to improve our system efficiency using IDQ.
Read full review
Usability
Amazon AWS
No answers on this topic
Informatica
Easy to use not only for developers but also business users
Read full review
Reliability and Availability
Amazon AWS
No answers on this topic
Informatica
The application works well except an occasional error out while using the system. It usually gets fixed when restarting the Infa server
Read full review
Performance
Amazon AWS
No answers on this topic
Informatica
Performance works just fine. It was able to load 200+ business terms, 150+ DQ automation, etc. very well.
Read full review
Support Rating
Amazon AWS
Amazon responds in good time once the ticket has been generated but needs to generate tickets frequent because very few sample codes are available, and it's not cover all the scenarios.
Read full review
Informatica
No answers on this topic
Alternatives Considered
Amazon AWS
AWS Glue is a fully managed ETL service that automates many ETL tasks, making it easier to set AWS Glue simplifies ETL through a visual interface and automated code generation.
Read full review
Informatica
Informatica Data Quality has a wide range of cleansing features, that are detailed, professional, and accurate in scaling down the required database. Further, Informatica Data Quality ensures there is proper collaboration, and this fosters businesses to have the freedom of working closely with several programs. Finally, Informatica Data Quality design is authentic and allows personalization.
Read full review
Scalability
Amazon AWS
No answers on this topic
Informatica
Scalability works as expected and it is truly an enterprise system.
Read full review
Return on Investment
Amazon AWS
  • It had a positive impact on the way we build our data lake.
  • It is the single source of truth for data structure (schemas/tables/views).
Read full review
Informatica
  • Integration with tools like PowerCenter helped faster delivery of product, and at the same time conversion
  • Reduce overall project cost due to bad data , bad quality, exceptions identified nearing go-live and post production
  • Employee efficiency is increased exponentially due to more automated, customized tool
Read full review
ScreenShots