The document discusses Confluent Stream Governance, a solution for governing data in motion with metadata. It introduces tools for managing schemas, classifying metadata, tracking lineage, and monitoring data quality. This helps bring order to what would otherwise be a "giant mess" of ungoverned data by enforcing standards and providing visibility into data flows and definitions.
8. Stream Governance
Now
Schema Registry
Schemas management UI (New)
Schema linking (New)
Next
Schema Registry PrivateLink access
Stream Validation - business quality rules
Stream Profiling - quality monitoring
Stream Quality
Metadata definition and
enforcement
Now
Auto technical metadata collection
Schemas classification with tagging
Stream Catalog UI & API
Next
Stream Catalog PrivateLink access
Key-value pairs metadata enrichment
Integration with 3rd party data catalogs
Stream Catalog
Metadata search and
discovery
Now
Automated lineage tracking
Metadata inspection
Live metrics
Next
Point in time lineage visualization
Cluster linking lineage visualization
Stream Lineage API
Stream Lineage
Metadata tracking and
analysis
9. Stream
Schema Registry
Stream Quality – contracts for data in motion
SCHEMA METADATA SCHEMA METADATA
Producer App 3
...
Producer App 1
Producer App 2
Consumer App 3
...
Consumer App 1
Consumer App 2
UNIVERSAL LANGUAGE
10. Stream
Catalog
Stream Catalog – knowledge base for data in motion
Tags
Key value pairs
TECHNICAL METADATA BUSINESS METADATA
INDEX
ENTITY TYPES
Owner
...
Name
Creation date
Integration with 3rd parties
11. Stream Lineage – maps for data in motion
Owner: David
Creation: 08/20/21
Retention: 30 days
Type: JSON
Tags: PII
Stream
Lineage
16. Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.
17. Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.
Ungoverned Growth
Loose data definitions
No proper data
ownership
Siloed data
knowledge
No data visibility
Questionable
data quality
Inefficient data
usage
Slow expansion
and low ROI
18. Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.
Governed Growth With Metadata
Explicit data
definitions and
enforcement
Safe data
evolvability
Data
classifications
and
organization
Democratized
data knowledge
Data flows
mapping
Self-service data
discovery and
access
21. Connecting people & places
The Incredible History of Commercial Air Travel
Data from World Bank
Air transport, passengers carried World
(2019)
4.397
Billion
28. Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.
Slide explaining What is the problem for Kafka?
29. Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.
Animation with Kafka mess without governance
30. Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.
32. Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.
Data
in motion
Metadata
in motion
33. Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.
35. Copyright 2021, Confluent, Inc. All rights reserved. This document may not be reproduced in any manner without the express written permission of Confluent, Inc.