Apache Airflow vs. Collibra Platform

Overview
ProductRatingMost Used ByProduct SummaryStarting Price
Apache Airflow
Score 8.6 out of 10
N/A
Apache Airflow is an open source tool that can be used to programmatically author, schedule and monitor data pipelines using Python and SQL. Created at Airbnb as an open-source project in 2014, Airflow was brought into the Apache Software Foundation’s Incubator Program 2016 and announced as Top-Level Apache Project in 2019. It is used as a data orchestration solution, with over 140 integrations and community support.N/A
Collibra Platform
Score 8.1 out of 10
N/A
The Collibra Platform is a cloud-based data governance platform from the company of the same name in Brussels, enabling users to gain visibility into their data, collaborate intelligently and enable users to easily access trustworthy data, automate processes, manage compliance and, ultimately, make data meaningful.N/A
Pricing
Apache AirflowCollibra Platform
Editions & Modules
No answers on this topic
No answers on this topic
Offerings
Pricing Offerings
Apache AirflowCollibra Platform
Free Trial
NoNo
Free/Freemium Version
YesNo
Premium Consulting/Integration Services
NoNo
Entry-level Setup FeeNo setup feeNo setup fee
Additional Details
More Pricing Information
Features
Apache AirflowCollibra Platform
Workload Automation
Comparison of Workload Automation features of Product A and Product B
Apache Airflow
9.8
10 Ratings
17% above category average
Collibra Platform
-
Ratings
Multi-platform scheduling10.010 Ratings00 Ratings
Central monitoring9.910 Ratings00 Ratings
Logging9.910 Ratings00 Ratings
Alerts and notifications9.910 Ratings00 Ratings
Analysis and visualization9.910 Ratings00 Ratings
Application integration9.010 Ratings00 Ratings
Best Alternatives
Apache AirflowCollibra Platform
Small Businesses

No answers on this topic

Egnyte
Egnyte
Score 8.6 out of 10
Medium-sized Companies
ActiveBatch Workload Automation
ActiveBatch Workload Automation
Score 8.0 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
Enterprises
Control-M
Control-M
Score 9.3 out of 10
IBM InfoSphere Information Server
IBM InfoSphere Information Server
Score 8.0 out of 10
All AlternativesView all alternativesView all alternatives
User Ratings
Apache AirflowCollibra Platform
Likelihood to Recommend
9.0
(10 ratings)
10.0
(1 ratings)
Usability
10.0
(1 ratings)
-
(0 ratings)
User Testimonials
Apache AirflowCollibra Platform
Likelihood to Recommend
Apache
For a quick job scanning of status and deep-diving into job issues, details, and flows, AirFlow does a good job. No fuss, no muss. The low learning curve as the UI is very straightforward, and navigating it will be familiar after spending some time using it. Our requirements are pretty simple. Job scheduler, workflows, and monitoring. The jobs we run are >100, but still is a lot to review and troubleshoot when jobs don't run. So when managing large jobs, AirFlow dated UI can be a bit of a drawback.
Read full review
Collibra
Collibra is well suited where you have multiple reporting environments and multiple source systems. Collibra works well in our environment because we can delegate roles and administration to departments or it can be used in a centralized environment. The ability to customize attributes and assets as well and integrate using the rest api is very important to us.
Read full review
Pros
Apache
  • In charge of the ETL processes.
  • As there is no incoming or outgoing data, we may handle the scheduling of tasks as code and avoid the requirement for monitoring.
Read full review
Collibra
  • Data Lineage and traceability
  • Easy to customize
  • Easy tool to use for business users
  • Workflow
  • Out of the box metrics
Read full review
Cons
Apache
  • they should bring in some time based scheduling too not only event based
  • they do not store the metadata due to which we are not able to analyze the workflows
  • they only support python as of now for scripted pipeline writing
Read full review
Collibra
  • Data Quality and Data Profiling
  • Mobile interface
Read full review
Usability
Apache
Easy to learn Easy to use Robust workflow orchestration framework Good in dependent job management
Read full review
Collibra
No answers on this topic
Alternatives Considered
Apache
There are a number of reasons to choose Apache Airflow over other similar platforms- Integrations—ready-to-use operators allow you to integrate Airflow with cloud platforms (Google, AWS, Azure, etc) Apache Airflow helps with backups and other DevOps tasks, such as submitting a Spark job and storing the resulting data on a Hadoop cluster It has machine learning model training, such as triggering a Sage maker job.
Read full review
Collibra
Collibra offers more features [than alternatives] and is easy for business users to use. We did a proof of concept with Collibra using the Cloud service that allowed us to kick the tires and get a comfort level before we made the investment. At the time of our purchase the market was very immature and is continuing to evolve.
Read full review
Return on Investment
Apache
  • A lot of helpful features out-of-the-box, such as the DAG visualizations and task trees
  • Allowed us to implement complex data pipelines easily and at a relatively low cost
Read full review
Collibra
  • ROI is hard to measure because there are so many soft benefits that come from the tool. We do have a variety of metrics that we track using Collibra to better gauge the maturity of data governance.
Read full review
ScreenShots