The go-to ETL tool for most situations
September 14, 2023

The go-to ETL tool for most situations

Anonymous | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User

Overall Satisfaction with Azure Data Factory

Data Integration: We harness Azure Data Factory's capabilities to move data from various sources – both on-premises databases and cloud storage – into our Azure data storage solutions like Azure SQL Database, Azure Blob Storage, and Azure Data Lake Store. This ensures all our data, regardless of its origin, is consolidated in one place.
Transformations: Azure Data Factory's data flow transformations help us clean, transform, and enrich our data before loading it to the destination. This is crucial for maintaining data quality, especially when dealing with diverse datasets.

Pros

  • Azure Data Factory supports a vast array of source and destination connectors, both from within the Microsoft ecosystem (like Azure Blob Storage, Azure SQL Database, Azure Cosmos DB) and external platforms (like Amazon S3, Google Cloud Storage, SAP, Salesforce, and many more).
  • Azure Data Factory's Mapping Data Flows provides a code-free environment to design data transformations visually. Users can drag and drop elements to create complex ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) processes without needing to write any code.
  • Azure Data Factory provides a unified monitoring dashboard that offers a holistic view of all pipeline activities. I think this makes it easier for users to track the status of various jobs, identify failures, and pinpoint bottlenecks.

Cons

  • Granularity of Errors: Sometimes, Azure Data Factory provides error messages that are too generic or vague for us, making it challenging to pinpoint the exact cause of a pipeline failure. Enhanced error messages with more actionable details would greatly assist us as users in debugging their pipelines.
  • Pipeline Design UI: In my experience, the visual interface for designing pipelines, especially when dealing with complex workflows or numerous activities, can become cluttered. I think a more intuitive and scalable design interface would improve usability. In my opinion, features like zoom, better alignment tools, or grouping capabilities could make managing intricate designs more manageable.
  • Native Support: While Azure Data Factory does support incremental data loads, in my experience, the setup can be somewhat manual and complex. I think native and more straightforward support for Change Data Capture, especially from popular databases, would simplify the process of capturing and processing only the changed data, making regular data updates more efficient
  • Cost Savings: By automating our ETL processes with Azure Data Factory, we've reduced manual data handling by approximately 60%. This translates to savings from reduced man-hours and the overhead of maintaining legacy systems.
  • Timeliness: Our report generation time has reduced by 70% with Azure Data Factory's scheduled pipelines. Faster insights mean quicker decisions for us, enabling our teams to capitalize on time-sensitive opportunities. We can easily share the data visualizations to all stakeholders.
Azure Data Factory fits well into our overall systems architecture where we already utilize largely Azure services and also Microsoft based products in the on-premises environment. I think cost structure is also very competitive with Azure Data Factory. Most services provide a visual interface for designing ETL workflows, but our team found Azure Data Factory's interface more intuitive.

Do you think Azure Data Factory delivers good value for the price?

Yes

Are you happy with Azure Data Factory's feature set?

Yes

Did Azure Data Factory live up to sales and marketing promises?

Yes

Did implementation of Azure Data Factory go as expected?

Yes

Would you buy Azure Data Factory again?

Yes

Well-suited Scenarios for Azure Data Factory (ADF):
When an organization has data sources spread across on-premises databases and cloud storage solutions, I think Azure Data Factory is excellent for integrating these sources.
Azure Data Factory's integration with Azure Databricks allows it to handle large-scale data transformations effectively, leveraging the power of distributed processing.
For regular ETL or ELT processes that need to run at specific intervals (daily, weekly, etc.), I think Azure Data Factory's scheduling capabilities are very handy.

Less Appropriate Scenarios for Azure Data Factory:
Real-time Data Streaming - Azure Data Factory is primarily batch-oriented.
Simple Data Copy Tasks - For straightforward data copy tasks without the need for transformation or complex workflows, in my opinion, using Azure Data Factory might be overkill; simpler tools or scripts could suffice.
Advanced Data Science Workflows: While Azure Data Factory can handle data prep and transformation, in my experience, it's not designed for in-depth data science tasks. I think for advanced analytics, machine learning, or statistical modeling, integration with specialized tools would be necessary.

Azure Data Factory Feature Ratings

Connect to traditional data sources
9
Connecto to Big Data and NoSQL
9
Simple transformations
9
Complex transformations
8
Data model creation
8
Metadata management
7
Business rules and workflow
7
Collaboration
6
Testing and debugging
7
Integration with data quality tools
7
Integration with MDM tools
8

Comments

More Reviews of Azure Data Factory