Microsoft's Azure Data Factory is a service built for all data integration needs and skill levels. It is designed to allow the user to easily construct ETL and ELT processes code-free within the intuitive visual environment, or write one's own code. Visually integrate data sources using more than 80 natively built and maintenance-free connectors at no added cost. Focus on data—the serverless integration service does the rest.
N/A
Posit
Score 10.0 out of 10
N/A
Posit, formerly RStudio, is a modular data science platform, combining open source and commercial products.
N/A
Pricing
| Pricing Offerings | Azure Data Factory | Posit |
|---|---|---|
| Editions & Modules | No answers on this topic | No answers on this topic |
| Free Trial | No | Yes |
| Free/Freemium Version | No | Yes |
| Premium Consulting/Integration Services | No | No |
| Entry-level Setup Fee | No setup fee | Optional |
| Additional Details | - | - |
Features
Data Source Connection

| Feature | Azure Data Factory | Posit |
|---|---|---|
| Overall | 8.5 (10 ratings, 3% above category average) | - |
| Connect to traditional data sources | 9.0 (10 ratings) | 0 ratings |
| Connect to Big Data and NoSQL | 8.0 (10 ratings) | 0 ratings |
Data Transformations

| Feature | Azure Data Factory | Posit |
|---|---|---|
| Overall | 7.8 (10 ratings, 3% below category average) | - |
| Simple transformations | 8.7 (10 ratings) | 0 ratings |
| Complex transformations | 7.0 (10 ratings) | 0 ratings |
Data Modeling

| Feature | Azure Data Factory | Posit |
|---|---|---|
| Overall | 6.3 (10 ratings, 21% below category average) | - |
| Data model creation | 4.5 (7 ratings) | 0 ratings |
| Metadata management | 5.5 (8 ratings) | 0 ratings |
| Business rules and workflow | 6.0 (10 ratings) | 0 ratings |
| Collaboration | 7.0 (9 ratings) | 0 ratings |
| Testing and debugging | 6.3 (10 ratings) | 0 ratings |
Data Governance

| Feature | Azure Data Factory | Posit |
|---|---|---|
| Overall | 5.7 (10 ratings, 33% below category average) | - |
| Integration with data quality tools | 4.3 (10 ratings) | 0 ratings |
| Integration with MDM tools | 7.0 (9 ratings) | 0 ratings |
Platform Connectivity

| Feature | Azure Data Factory | Posit |
|---|---|---|
| Overall | - | 9.3 (27 ratings, 11% above category average) |
| Connect to Multiple Data Sources | 0 ratings | 8.0 (26 ratings) |
| Extend Existing Data Sources | 0 ratings | 9.9 (27 ratings) |
| Automatic Data Format Detection | 0 ratings | 9.9 (26 ratings) |
Data Exploration

| Feature | Azure Data Factory | Posit |
|---|---|---|
| Overall | - | 9.0 (27 ratings, 6% above category average) |
| Visualization | 0 ratings | 8.0 (27 ratings) |
| Interactive Data Analysis | 0 ratings | 10.0 (24 ratings) |
Data Preparation

| Feature | Azure Data Factory | Posit |
|---|---|---|
| Overall | - | 10.0 (26 ratings, 21% above category average) |
| Interactive Data Cleaning and Enrichment | 0 ratings | 10.0 (24 ratings) |
| Data Transformations | 0 ratings | 10.0 (26 ratings) |
Platform Data Modeling

| Feature | Azure Data Factory | Posit |
|---|---|---|
| Overall | - | 10.0 (22 ratings, 18% above category average) |
| Multiple Model Development Languages and Tools | 0 ratings | 10.0 (22 ratings) |
| Single platform for multiple model development | 0 ratings | 10.0 (22 ratings) |
| Self-Service Model Delivery | 0 ratings | 10.0 (19 ratings) |
Model Deployment
The best scenario is ETL processing. The flexibility and connectivity are outstanding. For our environment, SAP data connectivity with Azure Data Factory offers very limited features compared to SAP Datasphere. Due to the limited modelling capacity of the tool, we use Databricks for data modelling and cleaning. The use of multiple tools could have been avoided if ADF had modelling capabilities.
In my humble opinion, if you are working on something related to statistics, RStudio is your go-to tool. But if you are looking for something in machine learning, look to Python. The beauty is that there are now packages that let you write Python/SQL in R. Cross-platform functionality like this puts RStudio way ahead of its competition. The few chinks in RStudio's armor are very small, and raising them amounts to nitpicking for the sake of argument. Other than it being entirely based on a programming language, I couldn't find significant drawbacks to using RStudio. It is one of the best free software packages available on the market at present.
The support is incredibly professional and helpful, and they often go out of their way to help me when something doesn't work.
The one-click publishing from RStudio Connect is absolutely amazing, and I really like the way that it deploys your exact package versions, because otherwise, you can get in a terrible mess.
Python doesn't feel quite as native as R at the moment but I have definitely deployed stuff in R and Python that works beautifully which is really nice indeed.
Granularity of Errors: Sometimes, Azure Data Factory provides error messages that are too generic or vague for us, making it challenging to pinpoint the exact cause of a pipeline failure. Enhanced error messages with more actionable details would greatly assist us as users in debugging our pipelines.
Pipeline Design UI: In my experience, the visual interface for designing pipelines can become cluttered, especially when dealing with complex workflows or numerous activities. I think a more intuitive and scalable design interface would improve usability. In my opinion, features like zoom, better alignment tools, or grouping capabilities could make intricate designs easier to manage.
Native Support: While Azure Data Factory does support incremental data loads, in my experience, the setup can be somewhat manual and complex. I think native and more straightforward support for Change Data Capture, especially from popular databases, would simplify the process of capturing and processing only the changed data, making regular data updates more efficient.
Python integration is newer and can still be rough, especially when using virtual environments.
RStudio Connect pricing feels very department focused, not quite an enterprise perspective.
Some of the RStudio packages don't follow conventional development guidelines (API breaking changes with minor version numbers) which can make supporting larger projects over longer timeframes difficult.
There is no viable alternative right now. The toolset is good and the functionality is increasing with every release. It is backed by regular releases and ongoing development by the RStudio team. There is good engagement with RStudio directly when support is required. Also there's a strong and growing community of developers who provide additional support and sample code.
So far the product has performed as expected. We were noticing some performance issues, but they were largely Synapse related. This has led to a shift from Synapse to Databricks. Overall this has delayed our analytic platform. Once Databricks becomes fully operational, Azure Data Factory will be critical to our environment and future success.
For someone who learns how to use the software and picks up on the "language" of R, it's very easy to use. For beginners, it can be hard and might require a course, as well as the appropriate statistical training to understand what packages to use and when.
RStudio is highly available and cheap to use. It needs to be updated every once in a while, but the updates tend to be quick and they do not hinder my ability to make progress. I have not experienced any RStudio outages, and I have used the application quite a bit for a variety of statistical analyses.
We have not needed to engage with Microsoft much on Azure Data Factory, but they have been responsive and helpful when needed. That said, we have not had a major emergency or outage requiring their intervention. The score of seven reflects that they have done well so far, but have not yet proven their support on a significant issue.
Since R is popular among statisticians, you can find lots of help from the data science/stats communities. If you need help with anything related to RStudio or R, google it or search on StackOverflow, and you will likely find the solution you are looking for.
Azure Data Factory helps us automate and schedule jobs according to customer demands, firing ETL triggers when the need arises. Anyone can define a workflow with the Azure Data Factory UI designer tool and easily test the systems. It also helped us automate the same workflow with programming languages like Python or automation tools like Ansible. Numerous connectivity options, be it a database or a storage account, help us move data to cloud or on-premises systems.
RStudio proved to be the most customizable option. It was also the most feature-rich in enabling our organization to script, run, and make use of open-source R packages in our data analysis workstreams. It also provided some support for Python, which was useful when we had R-heavy code with some Python threaded in. Overall we picked RStudio for the features it provided for our data analysis needs and its ability to interface with our existing resources.
RStudio is very scalable as a product. The issue I have is that it doesn't necessarily fit in nicely with the mainly Microsoft environment that everybody else is using. Having RStudio for us means dedicated servers and recruiting staff who know how to manage the environment. This isn't a fault of the product at all; it's just part of the data science landscape that we all have to put up with. Having said that, RStudio is absolutely great for running on low-spec servers, and there are loads of options to handle concurrency, memory use, etc.
Using it for data science in a very big and old company, the most positive impact, from my point of view, has been the ability to spread a data culture across the group, shortening the path from data to value.
Still, it's hard to quantify the economic benefits. We are struggling with this, and it's a major point of attention, since splitting out the contribution of the individual aspects of a project (and isolating RStudio's slice of the pie) is complicated.
What is certain is that, in the long run, RStudio is boosting productivity and making the process in which it is embedded more efficient (cost reduction).