AWS Glue Partner Review
Use Cases and Deployment Scope
As an AWS Advanced Consulting Partner, we use AWS Glue in many of our Data and Analytics Solutions. We've implemented in the major enterprises in the Philippines that are in the media, telecommunication, logistics and Fintech industries. The company aims to centralize their data lake of operational raw data containing various shipping details by making use of the AWS platform.The architecture must involve an automation of the data extraction from an API. The data lake should also be visualized to provide graphical details using QuickSight, and the generated dashboards are to be embedded into the customer web portal. AWS Services implemented - Lambda, S3, Glue, Athena, Quicksight, EventBridge
Pros
- After data cleansing, the team also implemented the best practices for using AWS platform services as a Data Lake, such as job bookmarking for AWS Glue jobs, proper delimiter for the AWS Glue crawlers, partitioning in AWS S3, and transformation to parquet file for compression and faster querying time in Amazon Athena.
- Data modernization through combining data from multiple sources into a functioning datasets, rebuilding DW, and resctructuring data sources.
- Aims to lessen customer complaints, eliminate manual data extraction requests via SR from different data sources, and Increase accuracy, consistency and speed up reconciliation process.
Cons
- Faster processing, on cases where data is not partitioned efficiency
- Cost optimization and pricing
- Developer experience on new users
Likelihood to Recommend
Operational Excellence: A customer asked for guidance mostly from data ingestion to transformation. However, we advised the customer to use Amazon CloudWatch to monitor their own AWS Glue jobs since when we fix their glue job errors, we rely more on CloudWatch information to resolve issues.