Draft:OpenMetadata

From Wikipedia, the free encyclopedia

OpenMetadata is an open-source platform for metadata management, supporting use cases such as data discovery, data governance, and data lineage. Released in 2021, it provides a centralized system for managing metadata across data infrastructure, including databases, data warehouses, and analytics tools.

DevelopersCollate, Inc. and community contributors
Initial releaseAugust 1, 2021; 4 years ago (2021-08-01)
Stable release
1.12.1 / February 24, 2026; 53 days ago (2026-02-24)
Written inJava, TypeScript, Python
Quick facts OpenMetadata, Developers ...
OpenMetadata
DevelopersCollate, Inc. and community contributors
Initial releaseAugust 1, 2021; 4 years ago (2021-08-01)
Stable release
1.12.1 / February 24, 2026; 53 days ago (2026-02-24)
Written inJava, TypeScript, Python
Operating systemCross-platform (web application)
TypeData catalog, Metadata management
LicenseApache License 2.0
Websiteopen-metadata.org
Repositorygithub.com/open-metadata/OpenMetadata
Close

History

OpenMetadata originated from work by its founders on internal data infrastructure at Uber, including a metadata system known as Databook. Rather than releasing that system, they developed OpenMetadata as a general-purpose platform for broader use.[1]

Suresh Srinivas previously co-founded Hortonworks and contributed to Apache Atlas within the Apache Software Foundation, while Sriharsha Chintalapani has contributed to projects including Apache Kafka and Apache Storm.

The project’s source code repository was established in August 2021. It has since been developed as an open-source project with contributions from a distributed community.

In July 2025, Collate announced a $10 million Series A funding round led by Venrock.[2]

In 2025, OpenMetadata received a grant from Bloomberg L.P.’s Free and Open Source Software (FOSS) Contributor Fund.[3]

Overview

OpenMetadata is designed to consolidate multiple metadata-related functions, such as data cataloging, lineage tracking, and governance, within a single platform. It provides application programming interfaces for integrating with external systems and data pipelines.

The platform uses a centralized metadata repository and a schema-based model for representing entities such as datasets, pipelines, and dashboards.[4]

OpenMetadata includes integrations with a range of data systems and tools, including cloud data warehouses, analytics platforms, and pipeline orchestration frameworks such as Apache Airflow and dbt.

References

Related Articles

Wikiwand AI