Likelihood to Recommend If you need a managed big data megastore, which has native integration with highly optimized
Apache Spark Engine and native integration with MLflow, go for Databricks Lakehouse Platform. The Databricks Lakehouse Platform is a breeze to use and analytics capabilities are supported out of the box. You will find it a bit difficult to manage code in notebooks but you will get used to it soon.
Read full review If data storage, access, and security [are] of the highest priority to your business then Pure Storage FlashBlade is an excellent tool that must be considered. Analytics or sharing that requires the fastest speeds available will benefit from the NVMe solid-state drives they use which are far superior to spinning rust. It is less ideal for those who do not require such time-critical work.
Read full review Pros Process raw data in One Lake (S3) env to relational tables and views Share notebooks with our business analysts so that they can use the queries and generate value out of the data Try out PySpark and Spark SQL queries on raw data before using them in our Spark jobs Modern day ETL operations made easy using Databricks. Provide access mechanism for different set of customers Read full review Speed. We are seeing large transactions take very little time. Upgrades- In-place upgrades of both hardware and software are extremely easy. Ease of use- I have several engineers working on this and from setup to day to day operations it is extremely easy to maintain. Read full review Cons Connect my local code in Visual code to my Databricks Lakehouse Platform cluster so I can run the code on the cluster. The old databricks-connect approach has many bugs and is hard to set up. The new Databricks Lakehouse Platform extension on Visual Code, doesn't allow the developers to debug their code line by line (only we can run the code). Maybe have a specific Databricks Lakehouse Platform IDE that can be used by Databricks Lakehouse Platform users to develop locally. Visualization in MLFLOW experiment can be enhanced Read full review When reporting out a user has exceeded there quote, it only references the UID. It would certainly be nice it calls out the UID name that is clearly present in the Dashboard. The ability to determine a snapshot total size would be helpful. Proactive reachout to discuss new versions and assist in planning the upgrade would be a key win. Read full review Usability Because it is an amazing platform for designing experiments and delivering a deep dive analysis that requires execution of highly complex queries, as well as it allows to share the information and insights across the company with their shared workspaces, while keeping it secured. in terms of graph generation and interaction it could improve their UI and UX
Read full review Good API, multi-protocol support is great.
Read full review Support Rating One of the best customer and technology support that I have ever experienced in my career. You pay for what you get and you get the Rolls Royce. It reminds me of the customer support of SAS in the 2000s when the tools were reaching some limits and their engineer wanted to know more about what we were doing, long before "data science" was even a name. Databricks truly embraces the partnership with their customer and help them on any given challenge.
Read full review Without exception, the contacts with support have been quick and extremely knowledgeable. I do not fear getting an underqualified engineer to assess or work on my arrays. In addition to this support structure, the sales engineers are top notch as well.
Read full review Alternatives Considered Compared to
Synapse &
Snowflake , Databricks provides a much better development experience, and deeper configuration capabilities. It works out-of-the-box but still allows you intricate customisation of the environment. I find Databricks very flexible and resilient at the same time while
Synapse and
Snowflake feel more limited in terms of configuration and connectivity to external tools.
Read full review The NetApp a800 we tested was 14% faster than Pure FlashBlade with NFS workloads. However, NetApp lacked ease of administration and performing simple tasks such as creating multiple NFS volumes required scripting from the command line. Our flashblade contained 15 baldes and our NetApp was a clustered pair with each half containing 24 nvme devices.
Read full review Return on Investment The ability to spin up a BIG Data platform with little infrastructure overhead allows us to focus on business value not admin DB has the ability to terminate/time out instances which helps manage cost. The ability to quickly access typical hard to build data scenarios easily is a strength. Read full review We were able to consolidate 5 different storage platforms of lesser performance onto a single Flashblade and achieve much, much lower latency and higher throughput. We've been able to reduce the amount of training and configuration required to just Pure Flashblade, instead of 5 different vendors and products. In addition to our core use cases, Flashblade has capabilities that we are pursuing for some new projects, i.e. analytics data store and the object store features. Read full review ScreenShots