One Stop Shop for Data Professionals.
Use Cases and Deployment Scope
Databricks is the primary data platform where we land, standardize, clean, transform, and clean our data sources. We utilize the Workflows feature to automate reoccurring tasks and have built internal applications around the reusable workflows. We use the dashboard feature internally to allow customer success teams and business analysts to keep tabs on the performance and outputs of our products. The workloads are orchestrated in Databricks but executed within our own AWS accounts, allowing us to stay compliant with our stringent security requirements.
Pros
- Thoughtful application of AI assistants during the coding and analysis steps.
- Intuitive UI for users of varying skill sets.
- Frequently updated documentation.
Cons
- Greater support for non spark workloads.
- Ability to host JAR files on serverless endpoints.
Likelihood to Recommend
Medium to Large data throughput shops will benefit the most from Databricks Spark processing. Smaller use cases may find the barrier to entry a bit too high for casual use cases. Some of the overhead to kicking off a Spark compute job can actually lead to your workloads taking longer, but past a certain point the performance returns cannot be beat.
