This document discusses Marquez, an open source metadata management system. It provides an overview of Marquez and how it can be used to track metadata in data pipelines. Specifically: - Marquez collects and stores metadata about data sources, datasets, jobs, and runs to provide data lineage and observability. - It has a modular framework to support data governance, data lineage, and data discovery. Metadata can be collected via REST APIs or language SDKs. - Marquez integrates with Apache Airflow to collect task-level metadata, dependencies between DAGs, and link tasks to code versions. This enables understanding of operational dependencies and troubleshooting. - The Marquez community aims to build an open