AWS Glue is a good data catalog and integration service
September 14, 2022

Anonymous | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User

Overall Satisfaction with AWS Glue

We rely heavily on AWS Glue to catalog our data objects (tables and views). It serves as our Data Catalog, and our data pipelines use it to sync external and internal data sources. We also use it to auto-generate SQL-based ETL from AWS Glue catalog objects.
  • Create schemas, tables and views (data catalog).
  • Sync external and internal data sources.
  • Auto-generate SQL-based data pipelines, based on AWS Glue catalog objects.
  • It is very difficult (almost impossible) to scale.
  • We sometimes get throttled by service limitations.
  • AWS Glue crawlers sometimes mismatch the data in the files.
  • Data Catalog (schemas, tables, views)
  • Crawlers
  • It had a positive impact on the way we build our data lake.
  • It is the single source of truth for data structure (schemas/tables/views).
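As a rough illustration of generating SQL from catalog objects, here is a minimal Python sketch. It is not the pipeline described in this review. The function name build_select and the sample table are hypothetical; the dictionary is only assumed to follow the shape of the "Table" entry returned by boto3's glue.get_table call, and the identifier quoting style is an assumption that may need adjusting for a given query engine.

```python
def build_select(table: dict) -> str:
    """Build a SELECT statement from a Glue-style table definition.

    Regular columns live under StorageDescriptor.Columns, and partition
    columns live under PartitionKeys; both are included in the output.
    """
    storage = table.get("StorageDescriptor", {})
    columns = storage.get("Columns", []) + table.get("PartitionKeys", [])
    if not columns:
        raise ValueError("table definition has no columns")
    # Double-quote each identifier (an assumption about the target engine).
    column_list = ",\n    ".join(f'"{c["Name"]}"' for c in columns)
    return (
        f'SELECT\n    {column_list}\n'
        f'FROM "{table["DatabaseName"]}"."{table["Name"]}"'
    )


# Hypothetical table definition, shaped like the "Table" entry that
# boto3's glue.get_table(DatabaseName=..., Name=...) returns.
example_table = {
    "Name": "orders",
    "DatabaseName": "sales",
    "StorageDescriptor": {
        "Columns": [
            {"Name": "order_id", "Type": "bigint"},
            {"Name": "amount", "Type": "double"},
        ]
    },
    "PartitionKeys": [{"Name": "order_date", "Type": "string"}],
}

print(build_select(example_table))
```

Because the catalog is the single source of truth for structure, a generator like this only needs the table definition and never has to inspect the underlying files.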
AWS Glue is a managed service. It was easier for us to integrate it into our stack since we are already an AWS shop. It saved us the headache of managing a third-party service.
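On the throttling mentioned in the cons above, one common mitigation for any rate-limited managed service is to retry with exponential backoff and jitter. The sketch below is generic and uses only the standard library. ThrottledError and call_with_backoff are hypothetical names, not part of any AWS SDK, and the delay values are arbitrary assumptions.

```python
import random
import time


class ThrottledError(Exception):
    """Hypothetical stand-in for a service throttling error."""


def call_with_backoff(call, max_attempts=5, base_delay=0.5, sleep=time.sleep):
    """Run call(), retrying on ThrottledError with exponential backoff.

    The wait before retry k (0-based) is between base_delay * 2**k and
    twice that value; the random part spreads out concurrent clients.
    """
    for attempt in range(max_attempts):
        try:
            return call()
        except ThrottledError:
            if attempt == max_attempts - 1:
                raise  # give up after the last allowed attempt
            delay = base_delay * (2 ** attempt)
            sleep(delay + random.uniform(0, delay))


# Demo with a fake operation that is throttled twice, then succeeds.
state = {"calls": 0}


def fake_catalog_call():
    state["calls"] += 1
    if state["calls"] < 3:
        raise ThrottledError("rate exceeded")
    return "ok"


# sleep is replaced with a no-op so the demo finishes immediately.
result = call_with_backoff(fake_catalog_call, sleep=lambda seconds: None)
print(result, "after", state["calls"], "calls")
```

With boto3 specifically, similar behavior can usually be configured on the client through botocore's Config(retries=...) setting rather than written by hand.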

Do you think AWS Glue delivers good value for the price?

Yes

Are you happy with AWS Glue's feature set?

Yes

Did AWS Glue live up to sales and marketing promises?

Yes

Did implementation of AWS Glue go as expected?

Yes

Would you buy AWS Glue again?

Yes

Amazon EMR (Elastic MapReduce), Apache Airflow, Amazon S3 (Simple Storage Service), Amazon Athena, Vertica
AWS Glue is a mature product that helps organizations start their journey with data exploration and analysis. It has many great features, such as a data catalog, jobs, and crawlers, which help non-engineers handle data and build a data lake.