Azure Data Lake: A Wonderful, Scalable Cloud Storage Solution for All Your Big Data Needs
April 25, 2022

Abhishek Katara | TrustRadius Reviewer
Score 9 out of 10
Vetted Review
Verified User

Overall Satisfaction with Azure Data Lake Storage

We stored terabytes of healthcare data in a cost-optimized way in the cloud using Azure Data Lake Storage Gen2, organized into containers. We used Azure Data Lake Storage containers as a destination in our StreamSets data engineering pipelines. Using Azure Data Factory, the loaded data then became available to multiple downstream applications faster and in an automated way. It also turned out to be a better, more cost-effective, and faster solution than HDFS for several of our business use cases, such as migrating large volumes of data from an RDBMS to the data lake.

Pros

  • Setting up an Azure Data Lake Storage account and container is quite easy
  • Accessible from anywhere and easy to maintain
  • Integration with Azure Data Factory for end-to-end pipelines is pretty easy
  • Can quickly store any form of data (structured, semi-structured, unstructured)

Cons

  • The UI search feature could certainly be improved, e.g. by supporting wildcards when searching for a particular file in a container
  • The UI sometimes hangs or lags while monitoring
  • The new UI will probably address the issues above.
  • Smooth integration with other Azure services, e.g. Azure Databricks, Data Factory, Synapse, etc.
  • Easy to access and manage, with less maintenance required than traditional storage solutions
  • Hadoop file system compatibility
  • Data migration projects from relational sources to Azure Data Lake Storage have given a great ROI, thanks to lower running costs and high availability
  • Pretty easy to work with when managing and accessing data organized in containers.
  • Further features, such as archiving data that is accessed less frequently, can significantly reduce cost

We have used both Hadoop and GCS buckets to store very large healthcare datasets. Compared with the Hadoop Distributed File System, Azure Data Lake Storage comes out well ahead because it integrates easily with widely used data engineering and data science tools such as Azure Databricks and Azure Data Factory. Compared with GCS, I like the UI and search feature of GCS buckets far better than those of Azure Data Lake Storage.
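
Until wildcard search is available in the UI, one possible workaround is to filter a container's path listing on the client side. The snippet below is only a minimal sketch under stated assumptions: the `match_paths` helper and the sample paths are hypothetical, and in practice the names would come from a listing call such as `FileSystemClient.get_paths()` in the `azure-storage-file-datalake` Python package rather than a hard-coded list.

```python
from fnmatch import fnmatchcase
from typing import Iterable, List


def match_paths(paths: Iterable[str], pattern: str) -> List[str]:
    """Return the paths whose full name matches a shell-style wildcard pattern.

    fnmatchcase is case-sensitive, and its '*' also matches '/' characters,
    so one pattern can span several directory levels.
    """
    return [p for p in paths if fnmatchcase(p, pattern)]


# Hypothetical path names standing in for a real container listing.
sample_paths = [
    "raw/claims/2022/04/claims_0001.parquet",
    "raw/claims/2022/04/claims_0002.parquet",
    "raw/patients/2022/04/patients_0001.csv",
]

# Find every Parquet file under raw/claims, at any depth.
print(match_paths(sample_paths, "raw/claims/*.parquet"))
```

Filtering on the client means the whole listing still has to be fetched, so for very large containers it may be worth restricting the listing to a directory prefix first.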

Do you think Azure Data Lake Storage delivers good value for the price?

Yes

Are you happy with Azure Data Lake Storage's feature set?

Yes

Did Azure Data Lake Storage live up to sales and marketing promises?

Yes

Did implementation of Azure Data Lake Storage go as expected?

Yes

Would you buy Azure Data Lake Storage again?

Yes

Azure Data Lake Storage is well suited to applications and use cases where an organization needs to capture and store large amounts of data in any format, primarily for storage and processing. It's an easy and cost-effective cloud solution for your application data. The ability to integrate with other Azure services like Azure Databricks and Azure Data Factory is superb.
