AWS Glue is a good data catalog and integration service
September 14, 2022
Score 9 out of 10
Overall Satisfaction with AWS Glue
We rely heavily on AWS Glue to catalog our data objects (tables and views). It serves as our Data Catalog, and our data pipelines use it to sync external and internal data sources. We also use AWS Glue catalog objects to auto-generate SQL-based ETL.
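To illustrate what generating SQL from a catalog object can look like, here is a minimal sketch, not our actual generator. It assumes the table definition is a dict shaped like the `Table` element returned by the Glue `get_table` API (`DatabaseName`, `Name`, `StorageDescriptor.Columns`, `PartitionKeys`). The function names `quote_ident` and `build_select` are hypothetical.

```python
def quote_ident(name):
    """Quote an identifier with double quotes, escaping embedded quotes."""
    return '"' + name.replace('"', '""') + '"'


def build_select(table):
    """Build a SELECT statement listing every column of a catalog table.

    `table` is assumed to follow the shape of a Glue `Table` dict:
    regular columns under StorageDescriptor.Columns, partition
    columns under PartitionKeys.
    """
    columns = table.get("StorageDescriptor", {}).get("Columns", [])
    partitions = table.get("PartitionKeys", [])
    names = [col["Name"] for col in columns + partitions]
    if not names:
        raise ValueError("table definition has no columns")

    column_sql = ",\n    ".join(quote_ident(n) for n in names)
    source = f'{quote_ident(table["DatabaseName"])}.{quote_ident(table["Name"])}'
    return f"SELECT\n    {column_sql}\nFROM {source}"


# Hypothetical table definition, used only for illustration.
example_table = {
    "DatabaseName": "sales",
    "Name": "orders",
    "StorageDescriptor": {
        "Columns": [
            {"Name": "order_id", "Type": "bigint"},
            {"Name": "amount", "Type": "double"},
        ]
    },
    "PartitionKeys": [{"Name": "dt", "Type": "string"}],
}

print(build_select(example_table))
```

With boto3, a dict of this shape would come from `boto3.client("glue").get_table(DatabaseName=..., Name=...)["Table"]`. The quoting style is an assumption and may need adjusting for the SQL engine that runs the query.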
- Create schemas, tables and views (data catalog).
- Sync external and internal data sources.
- Auto-generate SQL-based data pipelines from AWS Glue catalog objects.
- It is very difficult (almost impossible) to scale.
- We sometimes get throttled by service limitations.
- AWS Glue crawlers sometimes produce table definitions that do not match the data in the files.
- Data Catalog (schemas, tables, views)
- Crawlers
- It had a positive impact on the way we build our data lake.
- It is the single source of truth for data structure (schemas/tables/views).
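The throttling noted in the list above is commonly handled by retrying with exponential backoff and jitter. The sketch below is one possible approach, not a description of our setup. It assumes the raised exception carries a botocore-style `response["Error"]["Code"]`, and the set of codes treated as throttling is an assumption to adjust. The names `call_with_backoff` and `error_code` are hypothetical.

```python
import random
import time

# Assumed set of error codes to treat as throttling; adjust as needed.
THROTTLE_CODES = {"ThrottlingException", "TooManyRequestsException"}


def error_code(exc):
    """Return the botocore-style error code carried by an exception, if any."""
    response = getattr(exc, "response", None)
    if isinstance(response, dict):
        return response.get("Error", {}).get("Code")
    return None


def call_with_backoff(fn, *, max_attempts=5, base_delay=0.5,
                      max_delay=8.0, sleep=time.sleep):
    """Call fn(), retrying on throttling errors with exponential backoff.

    The wait before each retry is drawn uniformly from [0, cap], where
    cap doubles on every attempt up to max_delay (full jitter).
    Non-throttling errors, and the final failed attempt, are re-raised.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except Exception as exc:
            if error_code(exc) not in THROTTLE_CODES or attempt == max_attempts:
                raise
            cap = min(max_delay, base_delay * 2 ** (attempt - 1))
            sleep(random.uniform(0, cap))
```

A call such as `call_with_backoff(lambda: glue.get_table(DatabaseName="sales", Name="orders"))` would wrap a single request, where `glue` is a boto3 Glue client and the database and table names are placeholders. boto3 can also retry on its own when the client is created with `botocore.config.Config(retries={"max_attempts": 10, "mode": "adaptive"})`.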
AWS Glue is a managed service. Since we are already an AWS shop, it was easier for us to integrate it into our stack, and it saved us the headache of managing a third-party service.
Do you think AWS Glue delivers good value for the price?
Yes
Are you happy with AWS Glue's feature set?
Yes
Did AWS Glue live up to sales and marketing promises?
Yes
Did implementation of AWS Glue go as expected?
Yes
Would you buy AWS Glue again?
Yes