Best Data Catalog Software

Data catalog software helps businesses find, validate, and organize their data assets. A well-constructed, actively maintained data catalog shows users what data assets are available, where they are located, what their relationship is to that data, and other metadata. These products often include machine-learning-based asset and metadata discovery tools to help keep the data catalog comprehensive and relevant. In addition to automated discovery tools, products in this category often rely on crowd-sourcing for metadata creation...

We've collected videos, features, and capabilities below. Take me there.

All Products

(1-25 of 38)

1
Alation Data Catalog

Alation offers enterprise data intelligence solutions, including data search & discovery, data governance, data stewardship, analytics, and digital transformation. Alation operates in thethe data catalog market. With its Behavioral Analysis Engine, inbuilt collaboration capabilities,…

2
SAP Data Intelligence

SAP Data Intelligence is presented by the vendor as a single solution to innovate with data. It provides data-driven innovation in the cloud, on premise, and through BYOL deployments. It is described by the vendor as the new evolution of the company's data orchestration and management…

4
CatalogExpress

nexoma's software solution "CatalogExpress" is a versatile SaaS tool for data syndication. It consolidates product data from one or multiple sources and file formats (CSV, XML, JSON, XLSX, etc.) and prepares these datasets for customer-specific target formats. The desired exchange formats (e.g., XLSX, BMEcat, xChange, FAB DIS) including classifications such as ETIM or ECLASS, are then exported with optimized product data and distributed manually or on a schedule to customers, marketplaces, platforms, and other data re…

5
OneTrust Privacy and Data Governance Cloud

The OneTrust Privacy and Data Governance Cloud provides privacy and data governance automation to help organizations better understand their data across the business, meet regulatory requirements, and operationalize risk mitigation to provide transparency and choice to individuals.…

6
Denodo

Denodo is the eponymous data integration platform from the global company headquartered in Silicon Valley.

7
Atlan

Atlan is a modern data collaboration workspace (like Github for engineering or Figma for design). By acting as a virtual hub for data assets r…

8
Dataedo
0 reviews

Dataedo is a metadata management software that allows users to collect, preserve and share tribal knowledge in a single metadata repository – editable by selected experts and accessible for everyone.

9
Secoda
0 reviews

A solution enables users to search, document and manage data, with data catalog, lineage, docs, dictionary, analysis and data requests in one collaborative and searchable platform. It is presented as a Data Enablement Platform built for data teams but designed to support anyone, that…

10
Storage Spotter

StorageSpotter is an SaaS p…

11
eQube®-TM
0 reviews

eQube®-TM, Transformation Modeler, establishes a catalog of 'models' and 'transformation maps' for Data Federation, 'For-Purpose' Apps, Application Integration, and Migration Solutions. Its interface allows developers to visually define, maintain, and update data transformation 'maps'…

12
CastorDoc
0 reviews

CastorDoc is an automated data discovery and catalog tool that provides a single source of truth to reference data and documents all the knowledge related to data within a company. When looking for a table related to customers, users can search as they would in Google and CastorDoc…

13
Google Cloud Dataplex

Available on Google Cloud, Dataplex’s intelligent data fabric enables organizations to centrally discover, manage, monitor, and govern their data across data lakes, data warehouses, and data marts with consistent controls, providing access to data and powering analytics. Pricing…

14
DataHub
0 reviews

DataHub's extensible metadata platform enables data discovery, data observability and federated governance, built for developers to help tame the complexity of data ecosystems. Also presented as a modern data catalog, the solution is built to enable end-to-end data discovery, data…

15
Azure Data Catalog

Microsoft's Azure Data Catalog is an enterprise-wide metadata catalog designed to make data asset discovery straightforward, a fully-managed service that lets analysts. data scientists, and developers to register, enrich, discover, understand, and consume data sources.

16
Zaloni Arena
0 reviews

Zaloni's end-to-end DataOps software, Arena, provides a collaborative metadata catalog that connects multi-cloud and on-premises data silos, highly-controlled data quality, tokenization and governance tools, and extensible, self-service data enrichment and consumption. Zaloni works…

17
Project Nessie
0 reviews

Nessie is to Data Lakes what Git is to source code repositories. Therefore, Nessie uses many terms from both Git and data lakes.

This page explains how Nessie makes working with data in data lakes much easier without requiring much prior knowledge of either Git or data lakes.

Nessie is designed to give users an always-consistent view of their data across all involved data sets (tables). Changes to data, for example from batch jobs, happen independently and are completely isolated. Users will not see any incomplete changes. Once all the changes are done, all the changes can be atomically and consistently applied and become visible to your users.…

18
AnalyticsCreator

AnalyticsCreator automates design, development, deployment, and change processes, to enable teams to manage data for analytical needs.

AnalyticsCreator's prototyping empowers business to showcase results faster and generate code for data management through a connected analytical frontend, such as Power BI. W…

19
Talend Data Catalog

The Talend Data Catalog allows users to create a central, governed catalog of enriched data that is documented automatically and can be shared and collaborated on. It offers organization a single, secure point of control for data with tools for search and discovery, and connectors…

20
ThinkData Works

Data is the backbone of effective decision-making. However, varied sources, inconsistent formats, and evolving compliance landscape make it challenging to manage.

ThinkData Works provides a catalog platform for discovering, managing, and sharing data from both internal and external…

21
Collibra Data Intelligence Cloud

The Collibra Platform is a cloud-based data governance platform from the company of the same name in Brussels, enabling users to gain visibility into their data, collaborate intelligently and enable users to easily access trustworthy data, automate processes, manage compliance and,…

22
Acryl Data
0 reviews

Acryl Data offers a managed oversion of the open source metadata management platform and data catalog, DataHub. The managed solution provides enterprise-grade data discovery and modern data governance within a SOC-2 compliant platform.

23
IOMETE
0 reviews

A data lakehouse deployed on-premise, in the cloud, or in a hybrid environment built on Apache Iceberg and Apache Spark, designed to address the evolving needs of the data landscape. It is presented as an anlternative to traditional Hadoop distributions:

1. Serverless Lakehouse: IOMETE provides a serverless lakehouse that simplifies data management, storage, and processing, reducing operational overhead.…

24
IBM Knowledge Catalog

IBM Knowledge Catalog is a cloud-based enterprise metadata repository allowing users to catalog knowledge and analytics assets, including machine learning models and structured and unstructured data wherever they reside, so that they can be easily accessed and used to fuel data science…

25
data.world

data.world in Austin offers their metadata management solution, an enterprise data catalog, allowing users to create reusable, extensible data and analysis. Capture context and knowledge as teams work, so it’s easy to understand, check, and build on what they produce.

Learn More About Data Catalog Software

What is Data Catalog Software?

Data catalog software helps businesses find, validate, and organize their data assets. A well-constructed, actively maintained data catalog shows users what data assets are available, where they are located, what their relationship is to that data, and other metadata. These products often include machine-learning-based asset and metadata discovery tools to help keep the data catalog comprehensive and relevant.

In addition to automated discovery tools, products in this category often rely on crowd-sourcing for metadata creation and verification. Since this requires high user engagement, these products commonly mimic collaboration tools to engage non-technical users. By enabling all users to comment on, validate, tag, categorize, and discuss the data sources they work with, data catalog software aims to give businesses a comprehensive, up-to-date, easily navigable, and consumable view of their data landscape.

Data catalog software is an evolution of metadata management software. While data catalog products perform metadata management at their core, their focus on crowdsourced data and universal accessibility sets them apart.

Products in this category also commonly include data governance features or support other data governance software. An actively maintained data catalog is also a pillar of a successful data fabric architecture.

Data Catalog Software Features

Most data catalog products have the following features:

  • Metadata discovery
  • Business glossary
  • Data analysis
  • Natural language search
  • Integrated commenting and collaboration
  • Data governance tools
  • Pre-built connections to data sources
  • Tools for creating custom connections
  • Access control

Data Catalog Software Comparison

When evaluating data catalog software, keep the following comparison points in mind:

Ease of Use: Although the user experience is important for any tool, it’s absolutely critical for a successful data catalog implementation. The value you’ll get from a data catalog is strongly related to the degree of employee adoption. A data catalog solution with an inviting, easy-to-navigate interface will encourage more contribution and interaction with the platform. Try out multiple product demos with employees of varying roles and skill levels to see which one is a better fit for your organization.

Training and Culture: A data catalog with an intuitive UI still isn’t worth much if your employees don’t know about the tool and understand its importance. Before and during implementation, you’ll need to train users on the tool and teach them how it can help them and their co-workers. Some vendors may offer training and onboarding programs to help get you started. If you prefer to do training without vendor support, be prepared to devote the necessary time and resources.

Connections: Some data catalogs are designed to work best with specific data systems, like Oracle or Azure. Other vendors are independent, and create their product with varied ecosystems in mind. If you already have a suite of data tools from a specific vendor, check to see if they offer a data catalog solution. No matter what you choose, make sure you pick a product that connects to as many of your existing data sources as possible. Some products let you create custom data connectors, but custom work will take additional time and maintenance.

Start a Data Catalog Software comparison here

Pricing Information

Pricing for data catalog software is not commonly provided. Where pricing is available, it varies widely based on feature set. Some data catalog products are add-ons to existing subscriptions and may start as low as $1 monthly per user. Other products are comprehensive data platforms in their own right and are much more expensive, starting around $4,000 monthly for 10 users, with each additional user starting at $400 per month.

Your best bet is to contact a vendor. They often offer free demos of their products, which can help you decide if their product is a good fit for your needs. They’ll also be able to provide you with a quote customized to your use case.

Related Categories

Frequently Asked Questions

What does data catalog software do?

Data catalog software helps businesses keep track of their data assets. These products automatically discover metadata, facilitate crowdsourced metadata, and assist with data governance.

What are the benefits of using data catalog software?

An actively-maintained data catalog saves businesses time and money by helping users find the data they need more quickly. They make data analysts, data stewards, and other data consumers faster and more efficient.

How much does data catalog software cost?

Pricing for data catalog software varies based on feature set. Simpler add-on products start as low as $1 monthly per user. More comprehensive data catalogs can be as expensive as $4,000 or more monthly. Contact a vendor for a quote. Free trials and demos are common.