AWS Glue Partner Review
Use Cases and Deployment Scope
As an AWS Advanced Consulting Partner, we use AWS Glue in many of our Data and Analytics Solutions. We've implemented in the major enterprises in the Philippines that are in the media, telecommunication, logistics and Fintech industries. The company aims to centralize their data lake of operational raw data containing various shipping details by making use of the AWS platform.The architecture must involve an automation of the data extraction from an API. The data lake should also be visualized to provide graphical details using QuickSight, and the generated dashboards are to be embedded into the customer web portal. AWS Services implemented - Lambda, S3, Glue, Athena, Quicksight, EventBridge
Pros
- After data cleansing, the team also implemented the best practices for using AWS platform services as a Data Lake, such as job bookmarking for AWS Glue jobs, proper delimiter for the AWS Glue crawlers, partitioning in AWS S3, and transformation to parquet file for compression and faster querying time in Amazon Athena.
- Data modernization through combining data from multiple sources into a functioning datasets, rebuilding DW, and resctructuring data sources.
- Aims to lessen customer complaints, eliminate manual data extraction requests via SR from different data sources, and Increase accuracy, consistency and speed up reconciliation process.
Cons
- Faster processing, on cases where data is not partitioned efficiency
- Cost optimization and pricing
- Developer experience on new users
Return on Investment
- ROI
- Faster processing of Data
- Integration to Athena and other AWS Data Services
Usability
Alternatives Considered
Amazon Athena, AWS Lake Formation and Snowflake
Other Software Used
Snowflake, MongoDB, Confluent


