data-lineage

Here are 165 public repositories matching this topic...

open-metadata / OpenMetadata

The Open Context Layer for Data and AI. OpenMetadata is the open platform for building trusted data context and business semantics for humans, AI assistants, and agents.

Updated Jun 7, 2026
TypeScript

elementary-data / elementary

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

bigquery snowflake data-warehouse dataops data-analysis redshift dbt data-pipelines data-pipeline lineage data-governance data-lineage analytics-engineer dbt-packages data-observability data-reliability dbt-artifacts

Updated Jun 4, 2026
HTML

MarquezProject / marquez

Collect, aggregate, and visualize a data ecosystem's metadata

metadata data-discovery data-dictionary data-governance data-lineage data-ops data-provenance metadata-service marquez data-ecosystem-metadata

Updated Jun 5, 2026
Java

reata / sqllineage

SQL Lineage Analysis Tool powered by Python

metadata sql data-discovery lineage data-governance data-lineage

Updated Jun 4, 2026
Python

opendatadiscovery / odd-platform

First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.

Updated Jun 4, 2026
Java

marmotdata / marmot

The open-source context layer for your AI. Catalog your tables, topics, queues and APIs then expose real metadata to your AI agents.

metadata mcp bigdata data-catalog data-discovery data-exploration lineage dataengineering data-governance data-lineage datacatalog data-observability datadiscovery data-collaboration mcp-server

Updated May 28, 2026
Go

elementary-data / dbt-data-reliability

This dbt package captures metadata, artifacts, and test results so you can detect anomalies, monitor data quality, and build metadata tables. It powers Elementary OSS and feeds the wider context layer used by Elementary Cloud’s full Data & AI Control Plane.

data analytics dbt data-pipelines data-lineage analytics-engineering data-pipeline-monitoring dbt-tests dbt-packages data-observability data-reliability dbt-artifacts

Updated Jun 4, 2026
Python

vmware / versatile-data-kit

One framework to develop, deploy and operate data workflows with Python and SQL.

Updated Jun 1, 2026
Python

data-drift / data-drift

Metrics Observability & Troubleshooting

Updated Feb 29, 2024
HTML

tokern / data-lineage

Generate and Visualize Data Lineage from query history

python jupyter postgresql data-governance data-lineage

Updated Aug 4, 2023
Python

grai-io / grai-core

mysql python open-source data-science data django postgresql snowflake mssql parquet redshift dbt hacktoberfest dataengineering data-lineage fivetran datalineage

Updated Jan 30, 2026
Python

tuva-health / tuva

Main repo including core data model, data marts, data quality tests, and terminology sets.

open-source bigquery sql snowflake data-warehouse healthcare data-analytics redshift terminology dbt data-pipelines data-governance data-lineage healthcare-analysis healthcare-data analytics-engineering dbt-packages

Updated May 29, 2026
JavaScript

laminlabs / lamindb

Open-source data lakehouse for biology. Query, trace & validate with a lineage-native lakehouse that supports bio-formats, registries & ontologies. Context and memory for millions of datasets & transforms, across infrastructure. 🍊YC S22

open-source lims ontologies observability traceability data-versioning eln omics-data-integration data-lineage feature-store ml-ops data-lakehouse comp-bio-ops context-engineering

Updated Jun 7, 2026
Python

rocky-data / rocky

The typed graph between your code and whichever warehouse, table format, or query engine you've chosen — typed compiler, branches, replay, column-level lineage, compile-time contracts, per-model cost. Adapters: Databricks, Snowflake, BigQuery, DuckDB. Single static Rust binary. Apache 2.0.