Matillion is an ETL tool that is... Easy To Learn!
April 29, 2019
Matillion is an ETL tool that is... Easy To Learn!
Score 8 out of 10
Vetted Review
Verified User
Overall Satisfaction with Matillion
We are currently using Matillion for scheduling, orchestrating and executing ETL jobs. This ETL process is critical to the company as the data being ingested is transformed to a clean and usable state, so that the data can be analysed and used for making data-driven business decisions.
We are also able to have a near real-time view of the data since we perform ETL on both stream and batch data.
- Easy to orchestrate an ETL pipeline
- Integration with Redshift and other AWS services provides convenience
- Support team is easy to reach and have relatively quick response times
- They also support the other 2 popular cloud data warehouses, Snowflake and BigQuery
- Some components are available out to the box for easy integration
- Task history is useful for debugging and monitoring current state of ETL pipelines
- A GUI is provided for implementing complicated workflows, that can be easily followed
- Since it's mainly GUI based, it becomes difficult for data engineers to compare changes between old and new versions of an ETL.
- There are hard limitations for the number of concurrent users. This is part of the pricing model which is the reason.
- Integration with GIT is not yet available but we heard that it would be coming in the near future.
- Since we are a small team, we are able to quickly implement data pipelines in Matillion and this provides quick business value (or high ROI).
- The trade off is that our process may neglect some additional code reviews due to the nature of the tool. This increases risk for bugs creeping into the ETLs which may be more expensive to fix later on.
It's pretty quick to have it running since Matillion (for Redshift) provides pre-configured EC2 AMI instance via the marketplace. We were also able to emulate this by running Matillion for Snowflake from scratch. After that, the learning curve is relatively simple.
I have used simple cron jobs, built and maintained a bespoke ETL tool. Other tools are always being evaluated such as Airflow and AWS step functions.