Skip to main content
TrustRadius
Delta Lake

Delta Lake

Overview

What is Delta Lake?

Delta Lake is an open-source storage framework designed to enable the creation of a Lakehouse architecture with various compute engines, including Spark, PrestoDB, Flink, Trino, and Hive. It is positioned as a robust solution for managing and processing large-scale datasets, targeting companies of all...

Read more
Recent Reviews
TrustRadius

Leaving a review helps other professionals like you evaluate Data Lakehouse Software

Be the first one in your network to review Delta Lake, and make your voice heard!

Return to navigation

Pricing

View all pricing

What is Delta Lake?

Delta Lake is an open source project that enables building a Lakehouse architecture on top of data lakes. Delta Lake provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing on top of existing data lakes, such as S3, ADLS, GCS, and HDFS.

Entry-level set up fee?

  • No setup fee
For the latest information on pricing, visithttps://github.com/delta-io/delta

Offerings

  • Free Trial
  • Free/Freemium Version
  • Premium Consulting/Integration Services

Would you like us to let the vendor know that you want pricing?

8 people also want pricing

Alternatives Pricing

What is Toad Data Point?

Toad Data Point is a cross-platform, self-service, data-integration tool that simplifies data access, preparation and provisioning. It provides data connectivity and desktop data integration, and with the Workbook interface for business users, it provides simple-to-use visual query building and…

What is Amazon Redshift?

Amazon Redshift is a hosted data warehouse solution, from Amazon Web Services.

Return to navigation

Product Demos

Delta Live Tables Demo

YouTube

Announcing Delta Live Tables with Demo | Michael Armbrust | Keynote Data + AI Summit NA 2021

YouTube

Databricks Delta Lake Data Integration Demo (Auto Loader and COPY INTO)

YouTube

What is Delta Lake?

YouTube

Delta Lakehouse Data Profiler and SQL Analytics Demo

YouTube

Azure Databricks - Delta Lake - 5 Minute Demo

YouTube
Return to navigation

Product Details

What is Delta Lake?

Delta Lake is an open-source storage framework designed to enable the creation of a Lakehouse architecture with various compute engines, including Spark, PrestoDB, Flink, Trino, and Hive. It is positioned as a robust solution for managing and processing large-scale datasets, targeting companies of all sizes, from small startups to large enterprises. Data engineers, data scientists, data analysts, software developers, and professionals in the financial services industry are among the key users who can benefit from Delta Lake's capabilities.

Key Features

ACID Transactions: According to the vendor, Delta Lake provides ACID transactions to ensure data integrity and protect data from concurrent modifications. It guarantees serializability, allowing multiple transactions to read and write data without conflicts.

Scalable Metadata: The vendor claims that Delta Lake can handle petabyte-scale tables with billions of partitions and files. It efficiently manages metadata using a scalable metadata management system, enabling fast and efficient operations on large datasets.

Time Travel: Delta Lake enables users to access and revert to earlier versions of data, facilitating auditing, rollbacks, reproducing past results, and tracking changes over time. Users can query the table as it existed at a specific point in time or retrieve the entire history of changes.

Open Source: Delta Lake is an open-source project driven by the Delta Lake Project under the Linux Foundation. It is built on open standards and protocols, allowing for community contributions, open discussions, and transparency in the development process.

Unified Batch/Streaming: According to the vendor, Delta Lake provides unified batch and streaming data processing capabilities, supporting exactly once semantics for data ingestion. Users can seamlessly transition from batch to interactive queries, enabling real-time analytics and interactive data exploration on the same dataset.

Schema Evolution / Enforcement: Delta Lake supports schema evolution, allowing users to evolve the schema of a table over time. It provides mechanisms to add, modify, and delete columns while maintaining backward compatibility. Schema enforcement ensures data integrity by validating data against the specified schema.

Audit History: Delta Lake logs all changes made to the data, providing a comprehensive audit trail. It captures metadata changes, schema changes, and data changes, including inserts, updates, and deletes. The audit history can be used for compliance, data lineage, and debugging purposes.

DML Operations: Delta Lake supports Data Manipulation Language (DML) operations, allowing users to perform SQL, Scala/Java, and Python-based operations on Delta tables. Users can merge, update, and delete datasets using familiar SQL syntax or programmatic APIs.

Data Skipping: According to the vendor, Delta Lake utilizes data skipping to optimize query performance by automatically skipping irrelevant data based on statistics and filters. This feature aims to significantly improve query execution time, especially for large datasets.

Delta Caching: The vendor claims that Delta Lake provides efficient data caching capabilities, allowing users to cache data in-memory or on disk for faster access and improved query performance. It optimizes data caching based on query patterns and usage patterns, providing an enhanced user experience.

Delta Lake Technical Details

Operating SystemsUnspecified
Mobile ApplicationNo
Return to navigation

Comparisons

View all alternatives
Return to navigation

Reviews

Sorry, no reviews are available for this product yet

Return to navigation