Databricks in San Francisco offers the Databricks Lakehouse Platform (formerly the Unified Analytics Platform), a data science platform and Apache Spark cluster manager. The Databricks Unified Data Service aims to provide a reliable and scalable platform for data pipelines, data lakes, and data platforms. Users can manage full data journey, to ingest, process, store, and expose data throughout an organization. Its Data Science Workspace is a collaborative environment for practitioners to run…
$0.07
Per DBU
Datameer
Score 8.4 out of 10
N/A
Datameer helps businesses clean up, combine, and organize data to make sense of it and use it for reports and machine learning.
If you need a managed big data megastore, which has native integration with highly optimized Apache Spark Engine and native integration with MLflow, go for Databricks Lakehouse Platform. The Databricks Lakehouse Platform is a breeze to use and analytics capabilities are supported out of the box. You will find it a bit difficult to manage code in notebooks but you will get used to it soon.
Datameer is a great tool if someone is capable of keeping the most recent version of the tool up to date along with the most recent version of the distribution of Hadoop. The tool is easy to support but it must have someone who can run the back end processes
It leverages scalability, flexibility and cost-effectiveness of hadoop to deliver an end-user focused analytic platform for big data without involvement of IT.
It overcomes Hadoop`s complexity by providing GUI interface with pre-built functions across integration, analytics and data visualization .
Excel feature is awesome for business users which is already provided by Datameer.
Using datameer now user can do smart analytic using Decision Trees, Column dependency and recommendation.
Recently HTML5 inclusion is making application to available on a wider range of devices, including the iPad and other mobile devices which does not support Flash.
It can be used in premise or in a cloud computing environment.
Wizard-based data integration designed for IT and business users to schedule and do transformation of large sets of structured, semi-structured and unstructured data without any knowledge of Hadoop ecosystem.
Connect my local code in Visual code to my Databricks Lakehouse Platform cluster so I can run the code on the cluster. The old databricks-connect approach has many bugs and is hard to set up. The new Databricks Lakehouse Platform extension on Visual Code, doesn't allow the developers to debug their code line by line (only we can run the code).
Maybe have a specific Databricks Lakehouse Platform IDE that can be used by Databricks Lakehouse Platform users to develop locally.
Visualization in MLFLOW experiment can be enhanced
Employees with intermediate SQL and Hive knowledge can generate reports faster than using Datameer . It does have visualization tool but I don't think it is anything that cannot be accomplished by importing the data in Excel
Because it is an amazing platform for designing experiments and delivering a deep dive analysis that requires execution of highly complex queries, as well as it allows to share the information and insights across the company with their shared workspaces, while keeping it secured.
in terms of graph generation and interaction it could improve their UI and UX
One of the best customer and technology support that I have ever experienced in my career. You pay for what you get and you get the Rolls Royce. It reminds me of the customer support of SAS in the 2000s when the tools were reaching some limits and their engineer wanted to know more about what we were doing, long before "data science" was even a name. Databricks truly embraces the partnership with their customer and help them on any given challenge.
Compared to Synapse & Snowflake, Databricks provides a much better development experience, and deeper configuration capabilities. It works out-of-the-box but still allows you intricate customisation of the environment. I find Databricks very flexible and resilient at the same time while Synapse and Snowflake feel more limited in terms of configuration and connectivity to external tools.
Pricing, support, and ease of use. We plan to scale up our data over the net few years and Datameer gives us all the things we need in one tool. Handles large transformations quickly and works with all the cloud data warehouses.
Datameer's per-user pricing sealed the deal for us as we plan to transfer much more data over the next few years. We looked at Fivetran but the usage pricing discourages growth. We also looked at Informatica but it was too expensive and didn't work as well with other BI tools like Datameer does.