SlideShare a Scribd company logo
1 of 28
Download to read offline
1
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
How to Govern Data Lakes
with Special Guest Evan Terry
Monthly Webinar Series Hosted by DATAVERSITY
Robert S. Seiner – KIK Consulting / TDAN.com
July 18, 2019 – 11:00 a.m. PT / 2:00 p.m. ET
Real-World Data Governance
Unified Data Orchestration
Madan Kumar | Solutions Engineer| Alluxio
madan@alluxio.com
4 big trends driving the need for a new architecture
Separation of
Compute &
Storage
Hybrid – Multi
cloud
environments
Self-service
data across the
enterprise
Rise
of the object
store
Data Ecosystem - Beta Data Ecosystem 1.0
COMPUTE
STORAGE STORAGE
COMPUTE
Data Orchestration Framework
Java File API HDFS Interface S3 Interface REST APIFUSE Interface
HDFS Driver Swift Driver S3 Driver NFS Driver
Alluxio’s Approach to Big Data Federation
 Unified Access - Acts as a “virtual data lake.” Files are accessed in Alluxio’s
global namespace as if they resided in a single system
 Performant - Provides fast local access to important and frequently used data,
without maintaining a permanent copy of all data.
 Modern, flexible architecture - Promotes separation of compute from storage
 Storage Cost Optimization -Transparently reads and writes data directly
from the source system, and so does not need to create a permanent copy of
the data
Data Elasticity
with a unified
namespace
Abstract data silos & storage
systems to independently scale
data with compute
Run Spark, Hive, Presto, ML
workloads on your data
located anywhere
Accelerate big data
workloads with transparent
tiered local data
Data Accessibility
for popular APIs &
API translation
Data Locality
with Intelligent
Multi-tiering
Key Innovations of the Data Orchestration Layer
Use Cases Data Orchestration Enables
Hive
Alluxio
Run big data workloads in hybrid
cloud environments
On premise
Same instance
/ container
Spark
Alluxio
Any Cloud / Multi Cloud
Same data
center / region
PrestoSpark
Alluxio
Accelerate big data frameworks
on the public cloud
Same instance
/ container
Enable big data on object stores
across single or multiple clouds
Standalone
Incredible Open Source Momentum with growing community
900+ contributors &
growing
3760+ Git Stars
Apache 2.0 Licensed
Hundreds of thousands
of downloads
Join the conversation on Slack
alluxio.org/slack
2
2
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Real-World Data Governance – Monthly Webinar Series
– August 15, 2019 – Data Governance versus Information Governance
– Third Thursday each Month @ 2pm EST – Register at TDAN.com, KIKconsulting.com, DATAVERSITY.net
• Non-Invasive Data Governance Book
– ISBN 9781935504856 / Technics Publishing / Amazon.com
• Speaking @ Dataversity Events
– Data Architecture Summit, Chicago – October 14-17
– Data Governance Vision, Washington, DC – December 9-12
• Non-Invasive Data Governance Online Learning Plan
Non-Invasive Metadata Governance Online Learning Plan
– DATAVERSITY Training Center
– https://training.dataversity.net
• The Data Administration Newsletter (TDAN.com)
– Twice Monthly – Data Articles, Columns, Blogs and Features
– Produced by DATAVERSITY – Subscribe for emails
– New Non-Invasive Data Governance Framework now being published
• KIK Consulting & Educational Services
KIKConsulting.com
Home of Non-Invasive Data Governance™
– Home of Non-Invasive Metadata Governance
How to Govern Data Lakes
Introduction
3
3
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
Chief Analytics Officer, Velocity Mortgage Capital
Evan brings over 20 years of consulting experience in IT environments, including leading
software development projects, designing and implementing IT and data strategies, and working
on long term, cross departmental projects in such diverse industries as automotive, retail, state
government, and e-commerce payments.
Evan’s areas of expertise include designing practical analytics solutions, aligning business and IT
strategies, and implementing data management and governance programs.
He co-authored the data modeling book Beginning Relational Data Modeling and has spoken
about data and process quality and systems design. Evan has a BA in Economics from McGill
University and an MBA from Columbia Business School.
How to Govern Data Lakes
Special Guest Evan Terry
4
4
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• In this webinar, Bob and Evan will discuss:
– The relationship between Data Lakes and Data Governance
– Preventing your Data Lake from becoming a Data Swamp
– Governing the Metadata associated with your Data Lake
– Leveraging governed data to provide trustworthy Analytics
– Measuring the value of a governed Data Lake
How to Govern Data Lakes
Abstract
5
5
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• What is Data Governance?
– The execution and enforcement of authority over the
definition, production and usage of data and data-related assets.
Robert S. Seiner
– The management and organization of data.
Evan Terry
– The orchestration of people and process and data.
– The harmonization of people and process and data.
– The formalization of accountability for data.
– The implementation of decision-rights for data.
How to Govern Data Lakes
The Relationship between Data Lakes and Data Governance
6
6
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• What is a Data Lake?
– A data lake is a system or repository of data stored in its natural/ raw format,
usually object blobs or files.
– A data lake is usually a single store of all enterprise data including raw copies
of source system data and transformed data used for tasks such as reporting,
visualization, advanced analytics and machine learning.
SAS Article, 2016
• When does a Data Lake become a Data Swamp?
– A data swamp is a deteriorated and unmanaged data lake that is either
inaccessible to its intended users or is providing little value.
Olavsrud, Thor. CIO 2017
– When the data in the lake is ungoverned.
How to Govern Data Lakes
The Relationship between Data Lakes and Data Governance
7
7
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• A connection between governance (how to manage and organize) and data lakes
for accurate and useful data management
• Catalogs are critical to help you govern data, especially in data lakes
– Find things
– Defining things
– Curate content
• Need to include policy-driven processes that classify and identify the information
in the lake, why it’s in there, what it means, who owns it, and who is using it
• A data lake without data governance will ultimately end up being a collection of
disconnected data pools or information silos—just all in one place.
How to Govern Data Lakes
The Relationship between Data Lakes and Data Governance
8
8
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• What can be done to prevent the swamping of your data lake?
– Implement data governance for the lake.
– Implement metadata management for the lake.
– Implement sound principles of:
• Data Definition
• Data Production
• Data Usage
• What is the appropriate level
of data governance for your
data lake?
How to Govern Data Lakes
Preventing your Data Lake from becoming a Data Swamp
9
9
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• A “data lake” becomes a data swamp without organization
– No organization, no curation of content, little metadata
• Data warehouse principles are relevant:
– Stewardship/Curation
– Design, documentation, maintenance of the lake
– Metadata capture
– Governance
• Technique - Create zones in your data lake:
– Transition data sets from “raw data” to “clean data”
– Apply different curation/governance principles to each zone
How to Govern Data Lakes
Preventing your Data Lake from becoming a Data Swamp
10
10
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Governing metadata associated with:
– Data Definition
– Data Production
– Data Usage
• (Where) Is there metadata associated with your data lake?
• Who is responsible for the metadata associated with your data lake?
• “The metadata will not govern itself!”
How to Govern Data Lakes
Governing the Metadata Associated with your Data Lake
11
11
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Cataloging is key, but is tricky:
– don’t under/over catalog
– don't be too loose/rigid in your governance rules
• “Goldilocks” mentality – everything in moderation
• Tune governance to priorities and context
– One person's data lake is another’s data swamp
– Don't turn data lake into a data warehouse – the clearest data lake
– Cannot be all things to all people – playground, incubator, or operational
data store?
How to Govern Data Lakes
Governing the Metadata Associated with your Data Lake
12
12
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Sample DG purpose statement – Use strategic data with confidence.
• Make certain the water is clean or it may be unhealthy.
• “Boil water alert” – Is data governance the boiling of the water?
• “Freshwater” versus “Saltwater”
determines species that will
live in your lake.
How to Govern Data Lakes
Leveraging Governed Data to Provide Trustworthy Analytics
13
13
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Data catalogs solve the problems of finding, interpreting and using data
• Data lake is a tool and the context is key – differences in required data quality
• “Trustworthy” depends on context and accuracy needs – data lakes are defined
as “less” controlled and structured
How to Govern Data Lakes
Leveraging Governed Data to Provide Trustworthy Analytics
14
14
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Provides much the same value as for a data warehouse – analytics requires:
– Who owns the data and can answer questions about it
– Finding the right data elements that meet your needs
– Cleaning the data to an appropriate level of quality
– Having the right security on the data being used
– Monitoring the data for adherence to standards
• Lightweight governance on adding, naming, organizing protects the shared
resource from the “tragedy of the commons”
How to Govern Data Lakes
Leveraging Governed Data to Provide Trustworthy Analytics
15
15
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Metrics are one of the 6 core components of Data Governance.
Data, people, process, communications, metrics and tools.
• Measuring people’s ____________ the data in the lake.
– confidence in
– understanding of
– usage of
– decisions made using
– knowledge of what data resides in
– … all will depend on the effective management
of metadata associated with your data lake.
How to Govern Data Lakes
Measuring the Value of a Governed Data Lake
16
16
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Considerations for providing metrics
– Benchmark current status
– Select metrics that mean something to someone
– Select metrics associated with the data lake rather than data governance
– Consider that it is not easy to measure Return on Investment on DG
– Go jump in the lake!
How to Govern Data Lakes
Measuring the Value of a Governed Data Lake
17
17
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Unlocking the value depends on the data lake being broadly usable
• What is the value of R&D? What is the value of avoiding a disaster?
• The context of the data lake is key
– What is the purpose of the data lake?
– What is the tool the data lake will help you solve?
– How much value does governance (lightweight or not) provide?
• Value is measured in combination with the final use
– AI/Machine Learning
– Agility/Time to Market
– Variety of end users served/capabilities enabled
How to Govern Data Lakes
Measuring the Value of a Governed Data Lake
18
18
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• In this webinar, Bob and Evan discussed:
– The relationship between Data Lakes and Data Governance
– Preventing your Data Lake from becoming a Data Swamp
– Governing the Metadata associated with your Data Lake
– Leveraging governed data to provide trustworthy Analytics
– Measuring the value of a governed Data Lake
How to Govern Data Lakes
Abstract
19
19
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Questions and Answers
Real-World Data Governance
Contact Information
Join us in the Dataversity Community to continue the conversation.
https://community.dataversity.net/
20
20
Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com
Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting
#RWDG @RSeiner
• Robert S. Seiner
KIK Consulting & Educational Services – KIKconsulting.com
The Data Administration Newsletter – TDAN.com
Post Office Box 112571, Upper St. Clair, Pennsylvania 15241
412.220.9643, 412.220.9644 (Fax)
rseiner@kikconsulting.com
rseiner@tdan.com
@RSeiner @TDAN_com
#RWDG
Real-World Data Governance
Contact Information

More Related Content

What's hot

Data Quality Success Stories
Data Quality Success StoriesData Quality Success Stories
Data Quality Success StoriesDATAVERSITY
 
Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures DATAVERSITY
 
Data Quality Strategies
Data Quality StrategiesData Quality Strategies
Data Quality StrategiesDATAVERSITY
 
Governing Big Data, Smart Data, Data Lakes, and the Internet of Things
Governing Big Data, Smart Data, Data Lakes, and the Internet of ThingsGoverning Big Data, Smart Data, Data Lakes, and the Internet of Things
Governing Big Data, Smart Data, Data Lakes, and the Internet of ThingsDATAVERSITY
 
RWDG Slides: Data Governance and Three Levels of Metadata Management
RWDG Slides: Data Governance and Three Levels of Metadata ManagementRWDG Slides: Data Governance and Three Levels of Metadata Management
RWDG Slides: Data Governance and Three Levels of Metadata ManagementDATAVERSITY
 
DAS Slides: Data Governance and Data Architecture – Alignment and Synergies
DAS Slides: Data Governance and Data Architecture – Alignment and SynergiesDAS Slides: Data Governance and Data Architecture – Alignment and Synergies
DAS Slides: Data Governance and Data Architecture – Alignment and SynergiesDATAVERSITY
 
RWDG Webinar: Mastering and Master Data Governance
RWDG Webinar: Mastering and Master Data GovernanceRWDG Webinar: Mastering and Master Data Governance
RWDG Webinar: Mastering and Master Data GovernanceDATAVERSITY
 
Getting Started with Data Stewardship
Getting Started with Data StewardshipGetting Started with Data Stewardship
Getting Started with Data StewardshipDATAVERSITY
 
DataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance StrategicallyDataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance StrategicallyDATAVERSITY
 
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced AnalyticsADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced AnalyticsDATAVERSITY
 
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...DATAVERSITY
 
RWDG Slides: Non-Invasive Metadata Governance
RWDG Slides: Non-Invasive Metadata GovernanceRWDG Slides: Non-Invasive Metadata Governance
RWDG Slides: Non-Invasive Metadata GovernanceDATAVERSITY
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogDATAVERSITY
 
RWDG Slides: The Future of Data Governance – IoT, AI, IG, and Cloud
RWDG Slides: The Future of Data Governance – IoT, AI, IG, and CloudRWDG Slides: The Future of Data Governance – IoT, AI, IG, and Cloud
RWDG Slides: The Future of Data Governance – IoT, AI, IG, and CloudDATAVERSITY
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management DATAVERSITY
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata StrategiesDATAVERSITY
 
Do-It-Yourself Metadata Framework
Do-It-Yourself Metadata FrameworkDo-It-Yourself Metadata Framework
Do-It-Yourself Metadata FrameworkDATAVERSITY
 
DataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDATAVERSITY
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationHistoric Environment Scotland
 
The Value of Metadata
The Value of MetadataThe Value of Metadata
The Value of MetadataDATAVERSITY
 

What's hot (20)

Data Quality Success Stories
Data Quality Success StoriesData Quality Success Stories
Data Quality Success Stories
 
Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures Data-Ed Webinar: Design & Manage Data Structures
Data-Ed Webinar: Design & Manage Data Structures
 
Data Quality Strategies
Data Quality StrategiesData Quality Strategies
Data Quality Strategies
 
Governing Big Data, Smart Data, Data Lakes, and the Internet of Things
Governing Big Data, Smart Data, Data Lakes, and the Internet of ThingsGoverning Big Data, Smart Data, Data Lakes, and the Internet of Things
Governing Big Data, Smart Data, Data Lakes, and the Internet of Things
 
RWDG Slides: Data Governance and Three Levels of Metadata Management
RWDG Slides: Data Governance and Three Levels of Metadata ManagementRWDG Slides: Data Governance and Three Levels of Metadata Management
RWDG Slides: Data Governance and Three Levels of Metadata Management
 
DAS Slides: Data Governance and Data Architecture – Alignment and Synergies
DAS Slides: Data Governance and Data Architecture – Alignment and SynergiesDAS Slides: Data Governance and Data Architecture – Alignment and Synergies
DAS Slides: Data Governance and Data Architecture – Alignment and Synergies
 
RWDG Webinar: Mastering and Master Data Governance
RWDG Webinar: Mastering and Master Data GovernanceRWDG Webinar: Mastering and Master Data Governance
RWDG Webinar: Mastering and Master Data Governance
 
Getting Started with Data Stewardship
Getting Started with Data StewardshipGetting Started with Data Stewardship
Getting Started with Data Stewardship
 
DataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance StrategicallyDataEd Slides: Approaching Data Governance Strategically
DataEd Slides: Approaching Data Governance Strategically
 
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced AnalyticsADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
ADV Slides: What Happened of Note in 1H 2020 in Enterprise Advanced Analytics
 
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
Seiner dataversity-rwdg2017-05-operating modelofdatagovernanceroles-20170518f...
 
RWDG Slides: Non-Invasive Metadata Governance
RWDG Slides: Non-Invasive Metadata GovernanceRWDG Slides: Non-Invasive Metadata Governance
RWDG Slides: Non-Invasive Metadata Governance
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data Catalog
 
RWDG Slides: The Future of Data Governance – IoT, AI, IG, and Cloud
RWDG Slides: The Future of Data Governance – IoT, AI, IG, and CloudRWDG Slides: The Future of Data Governance – IoT, AI, IG, and Cloud
RWDG Slides: The Future of Data Governance – IoT, AI, IG, and Cloud
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata Strategies
 
Do-It-Yourself Metadata Framework
Do-It-Yourself Metadata FrameworkDo-It-Yourself Metadata Framework
Do-It-Yourself Metadata Framework
 
DataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data GovernanceDataEd Online: Unlock Business Value through Data Governance
DataEd Online: Unlock Business Value through Data Governance
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
The Value of Metadata
The Value of MetadataThe Value of Metadata
The Value of Metadata
 

Similar to RWDG Slides: How to Govern Data Lakes

Data Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working TogetherData Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working TogetherDATAVERSITY
 
RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...
RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...
RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...DATAVERSITY
 
How to Govern Your Master Data
How to Govern Your Master Data How to Govern Your Master Data
How to Govern Your Master Data DATAVERSITY
 
RWDG Slides: Data and Metadata Will Not Govern Themselves
RWDG Slides: Data and Metadata Will Not Govern ThemselvesRWDG Slides: Data and Metadata Will Not Govern Themselves
RWDG Slides: Data and Metadata Will Not Govern ThemselvesDATAVERSITY
 
RWDG Slides: Applying Governance to Business Processes
RWDG Slides: Applying Governance to Business ProcessesRWDG Slides: Applying Governance to Business Processes
RWDG Slides: Applying Governance to Business ProcessesDATAVERSITY
 
RWDG Slides: Master Data Governance in Action
RWDG Slides: Master Data Governance in ActionRWDG Slides: Master Data Governance in Action
RWDG Slides: Master Data Governance in ActionDATAVERSITY
 
The Role of Metadata in a Data Governance Program
The Role of Metadata in a Data Governance ProgramThe Role of Metadata in a Data Governance Program
The Role of Metadata in a Data Governance ProgramDATAVERSITY
 
Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...
Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...
Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...DATAVERSITY
 
RWDG Webinar: Using Data Governance to Improve Data Understanding
RWDG Webinar: Using Data Governance to Improve Data UnderstandingRWDG Webinar: Using Data Governance to Improve Data Understanding
RWDG Webinar: Using Data Governance to Improve Data UnderstandingDATAVERSITY
 
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceGlossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceDATAVERSITY
 
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJXDriving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJXDATAVERSITY
 
RWDG Webinar: Metadata to Support Data Governance
RWDG Webinar: Metadata to Support Data GovernanceRWDG Webinar: Metadata to Support Data Governance
RWDG Webinar: Metadata to Support Data GovernanceDATAVERSITY
 
Data Governance and Data Science to Improve Data Quality
Data Governance and Data Science to Improve Data QualityData Governance and Data Science to Improve Data Quality
Data Governance and Data Science to Improve Data QualityDATAVERSITY
 
RWDG: Data Governance and Three Levels of Metadata 
RWDG: Data Governance and Three Levels of Metadata RWDG: Data Governance and Three Levels of Metadata 
RWDG: Data Governance and Three Levels of Metadata DATAVERSITY
 
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data GovernanceRWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data GovernanceDATAVERSITY
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best PracticesDATAVERSITY
 
RWDG Webinar: Govern Metadata: Vocabulary, Dictionaries and Data
RWDG Webinar: Govern Metadata: Vocabulary, Dictionaries and DataRWDG Webinar: Govern Metadata: Vocabulary, Dictionaries and Data
RWDG Webinar: Govern Metadata: Vocabulary, Dictionaries and DataDATAVERSITY
 
Data Governance vs. Information Governance
Data Governance vs. Information GovernanceData Governance vs. Information Governance
Data Governance vs. Information GovernanceDATAVERSITY
 
Real-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come AllReal-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come AllDATAVERSITY
 
RWDG Slides: Build an Effective Data Governance Framework
RWDG Slides: Build an Effective Data Governance FrameworkRWDG Slides: Build an Effective Data Governance Framework
RWDG Slides: Build an Effective Data Governance FrameworkDATAVERSITY
 

Similar to RWDG Slides: How to Govern Data Lakes (20)

Data Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working TogetherData Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working Together
 
RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...
RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...
RWDG Slides: Metadata Governance for Catalogs, Glossaries, Dictionaries, and ...
 
How to Govern Your Master Data
How to Govern Your Master Data How to Govern Your Master Data
How to Govern Your Master Data
 
RWDG Slides: Data and Metadata Will Not Govern Themselves
RWDG Slides: Data and Metadata Will Not Govern ThemselvesRWDG Slides: Data and Metadata Will Not Govern Themselves
RWDG Slides: Data and Metadata Will Not Govern Themselves
 
RWDG Slides: Applying Governance to Business Processes
RWDG Slides: Applying Governance to Business ProcessesRWDG Slides: Applying Governance to Business Processes
RWDG Slides: Applying Governance to Business Processes
 
RWDG Slides: Master Data Governance in Action
RWDG Slides: Master Data Governance in ActionRWDG Slides: Master Data Governance in Action
RWDG Slides: Master Data Governance in Action
 
The Role of Metadata in a Data Governance Program
The Role of Metadata in a Data Governance ProgramThe Role of Metadata in a Data Governance Program
The Role of Metadata in a Data Governance Program
 
Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...
Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...
Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...
 
RWDG Webinar: Using Data Governance to Improve Data Understanding
RWDG Webinar: Using Data Governance to Improve Data UnderstandingRWDG Webinar: Using Data Governance to Improve Data Understanding
RWDG Webinar: Using Data Governance to Improve Data Understanding
 
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceGlossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data Governance
 
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJXDriving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
Driving Data Intelligence in the Supply Chain Through the Data Catalog at TJX
 
RWDG Webinar: Metadata to Support Data Governance
RWDG Webinar: Metadata to Support Data GovernanceRWDG Webinar: Metadata to Support Data Governance
RWDG Webinar: Metadata to Support Data Governance
 
Data Governance and Data Science to Improve Data Quality
Data Governance and Data Science to Improve Data QualityData Governance and Data Science to Improve Data Quality
Data Governance and Data Science to Improve Data Quality
 
RWDG: Data Governance and Three Levels of Metadata 
RWDG: Data Governance and Three Levels of Metadata RWDG: Data Governance and Three Levels of Metadata 
RWDG: Data Governance and Three Levels of Metadata 
 
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data GovernanceRWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
RWDG Slides: Glossaries, Dictionaries, and Catalogs Result in Data Governance
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 
RWDG Webinar: Govern Metadata: Vocabulary, Dictionaries and Data
RWDG Webinar: Govern Metadata: Vocabulary, Dictionaries and DataRWDG Webinar: Govern Metadata: Vocabulary, Dictionaries and Data
RWDG Webinar: Govern Metadata: Vocabulary, Dictionaries and Data
 
Data Governance vs. Information Governance
Data Governance vs. Information GovernanceData Governance vs. Information Governance
Data Governance vs. Information Governance
 
Real-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come AllReal-World Data Governance: Governing Data – Big and Small, Come One Come All
Real-World Data Governance: Governing Data – Big and Small, Come One Come All
 
RWDG Slides: Build an Effective Data Governance Framework
RWDG Slides: Build an Effective Data Governance FrameworkRWDG Slides: Build an Effective Data Governance Framework
RWDG Slides: Build an Effective Data Governance Framework
 

More from DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

More from DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Recently uploaded

Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 

Recently uploaded (20)

Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 

RWDG Slides: How to Govern Data Lakes

  • 1. 1 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner How to Govern Data Lakes with Special Guest Evan Terry Monthly Webinar Series Hosted by DATAVERSITY Robert S. Seiner – KIK Consulting / TDAN.com July 18, 2019 – 11:00 a.m. PT / 2:00 p.m. ET Real-World Data Governance
  • 2. Unified Data Orchestration Madan Kumar | Solutions Engineer| Alluxio madan@alluxio.com
  • 3. 4 big trends driving the need for a new architecture Separation of Compute & Storage Hybrid – Multi cloud environments Self-service data across the enterprise Rise of the object store
  • 4. Data Ecosystem - Beta Data Ecosystem 1.0 COMPUTE STORAGE STORAGE COMPUTE
  • 5. Data Orchestration Framework Java File API HDFS Interface S3 Interface REST APIFUSE Interface HDFS Driver Swift Driver S3 Driver NFS Driver
  • 6. Alluxio’s Approach to Big Data Federation  Unified Access - Acts as a “virtual data lake.” Files are accessed in Alluxio’s global namespace as if they resided in a single system  Performant - Provides fast local access to important and frequently used data, without maintaining a permanent copy of all data.  Modern, flexible architecture - Promotes separation of compute from storage  Storage Cost Optimization -Transparently reads and writes data directly from the source system, and so does not need to create a permanent copy of the data
  • 7. Data Elasticity with a unified namespace Abstract data silos & storage systems to independently scale data with compute Run Spark, Hive, Presto, ML workloads on your data located anywhere Accelerate big data workloads with transparent tiered local data Data Accessibility for popular APIs & API translation Data Locality with Intelligent Multi-tiering Key Innovations of the Data Orchestration Layer
  • 8. Use Cases Data Orchestration Enables Hive Alluxio Run big data workloads in hybrid cloud environments On premise Same instance / container Spark Alluxio Any Cloud / Multi Cloud Same data center / region PrestoSpark Alluxio Accelerate big data frameworks on the public cloud Same instance / container Enable big data on object stores across single or multiple clouds Standalone
  • 9. Incredible Open Source Momentum with growing community 900+ contributors & growing 3760+ Git Stars Apache 2.0 Licensed Hundreds of thousands of downloads Join the conversation on Slack alluxio.org/slack
  • 10. 2 2 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Real-World Data Governance – Monthly Webinar Series – August 15, 2019 – Data Governance versus Information Governance – Third Thursday each Month @ 2pm EST – Register at TDAN.com, KIKconsulting.com, DATAVERSITY.net • Non-Invasive Data Governance Book – ISBN 9781935504856 / Technics Publishing / Amazon.com • Speaking @ Dataversity Events – Data Architecture Summit, Chicago – October 14-17 – Data Governance Vision, Washington, DC – December 9-12 • Non-Invasive Data Governance Online Learning Plan Non-Invasive Metadata Governance Online Learning Plan – DATAVERSITY Training Center – https://training.dataversity.net • The Data Administration Newsletter (TDAN.com) – Twice Monthly – Data Articles, Columns, Blogs and Features – Produced by DATAVERSITY – Subscribe for emails – New Non-Invasive Data Governance Framework now being published • KIK Consulting & Educational Services KIKConsulting.com Home of Non-Invasive Data Governance™ – Home of Non-Invasive Metadata Governance How to Govern Data Lakes Introduction
  • 11. 3 3 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner Chief Analytics Officer, Velocity Mortgage Capital Evan brings over 20 years of consulting experience in IT environments, including leading software development projects, designing and implementing IT and data strategies, and working on long term, cross departmental projects in such diverse industries as automotive, retail, state government, and e-commerce payments. Evan’s areas of expertise include designing practical analytics solutions, aligning business and IT strategies, and implementing data management and governance programs. He co-authored the data modeling book Beginning Relational Data Modeling and has spoken about data and process quality and systems design. Evan has a BA in Economics from McGill University and an MBA from Columbia Business School. How to Govern Data Lakes Special Guest Evan Terry
  • 12. 4 4 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • In this webinar, Bob and Evan will discuss: – The relationship between Data Lakes and Data Governance – Preventing your Data Lake from becoming a Data Swamp – Governing the Metadata associated with your Data Lake – Leveraging governed data to provide trustworthy Analytics – Measuring the value of a governed Data Lake How to Govern Data Lakes Abstract
  • 13. 5 5 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • What is Data Governance? – The execution and enforcement of authority over the definition, production and usage of data and data-related assets. Robert S. Seiner – The management and organization of data. Evan Terry – The orchestration of people and process and data. – The harmonization of people and process and data. – The formalization of accountability for data. – The implementation of decision-rights for data. How to Govern Data Lakes The Relationship between Data Lakes and Data Governance
  • 14. 6 6 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • What is a Data Lake? – A data lake is a system or repository of data stored in its natural/ raw format, usually object blobs or files. – A data lake is usually a single store of all enterprise data including raw copies of source system data and transformed data used for tasks such as reporting, visualization, advanced analytics and machine learning. SAS Article, 2016 • When does a Data Lake become a Data Swamp? – A data swamp is a deteriorated and unmanaged data lake that is either inaccessible to its intended users or is providing little value. Olavsrud, Thor. CIO 2017 – When the data in the lake is ungoverned. How to Govern Data Lakes The Relationship between Data Lakes and Data Governance
  • 15. 7 7 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • A connection between governance (how to manage and organize) and data lakes for accurate and useful data management • Catalogs are critical to help you govern data, especially in data lakes – Find things – Defining things – Curate content • Need to include policy-driven processes that classify and identify the information in the lake, why it’s in there, what it means, who owns it, and who is using it • A data lake without data governance will ultimately end up being a collection of disconnected data pools or information silos—just all in one place. How to Govern Data Lakes The Relationship between Data Lakes and Data Governance
  • 16. 8 8 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • What can be done to prevent the swamping of your data lake? – Implement data governance for the lake. – Implement metadata management for the lake. – Implement sound principles of: • Data Definition • Data Production • Data Usage • What is the appropriate level of data governance for your data lake? How to Govern Data Lakes Preventing your Data Lake from becoming a Data Swamp
  • 17. 9 9 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • A “data lake” becomes a data swamp without organization – No organization, no curation of content, little metadata • Data warehouse principles are relevant: – Stewardship/Curation – Design, documentation, maintenance of the lake – Metadata capture – Governance • Technique - Create zones in your data lake: – Transition data sets from “raw data” to “clean data” – Apply different curation/governance principles to each zone How to Govern Data Lakes Preventing your Data Lake from becoming a Data Swamp
  • 18. 10 10 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Governing metadata associated with: – Data Definition – Data Production – Data Usage • (Where) Is there metadata associated with your data lake? • Who is responsible for the metadata associated with your data lake? • “The metadata will not govern itself!” How to Govern Data Lakes Governing the Metadata Associated with your Data Lake
  • 19. 11 11 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Cataloging is key, but is tricky: – don’t under/over catalog – don't be too loose/rigid in your governance rules • “Goldilocks” mentality – everything in moderation • Tune governance to priorities and context – One person's data lake is another’s data swamp – Don't turn data lake into a data warehouse – the clearest data lake – Cannot be all things to all people – playground, incubator, or operational data store? How to Govern Data Lakes Governing the Metadata Associated with your Data Lake
  • 20. 12 12 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Sample DG purpose statement – Use strategic data with confidence. • Make certain the water is clean or it may be unhealthy. • “Boil water alert” – Is data governance the boiling of the water? • “Freshwater” versus “Saltwater” determines species that will live in your lake. How to Govern Data Lakes Leveraging Governed Data to Provide Trustworthy Analytics
  • 21. 13 13 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Data catalogs solve the problems of finding, interpreting and using data • Data lake is a tool and the context is key – differences in required data quality • “Trustworthy” depends on context and accuracy needs – data lakes are defined as “less” controlled and structured How to Govern Data Lakes Leveraging Governed Data to Provide Trustworthy Analytics
  • 22. 14 14 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Provides much the same value as for a data warehouse – analytics requires: – Who owns the data and can answer questions about it – Finding the right data elements that meet your needs – Cleaning the data to an appropriate level of quality – Having the right security on the data being used – Monitoring the data for adherence to standards • Lightweight governance on adding, naming, organizing protects the shared resource from the “tragedy of the commons” How to Govern Data Lakes Leveraging Governed Data to Provide Trustworthy Analytics
  • 23. 15 15 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Metrics are one of the 6 core components of Data Governance. Data, people, process, communications, metrics and tools. • Measuring people’s ____________ the data in the lake. – confidence in – understanding of – usage of – decisions made using – knowledge of what data resides in – … all will depend on the effective management of metadata associated with your data lake. How to Govern Data Lakes Measuring the Value of a Governed Data Lake
  • 24. 16 16 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Considerations for providing metrics – Benchmark current status – Select metrics that mean something to someone – Select metrics associated with the data lake rather than data governance – Consider that it is not easy to measure Return on Investment on DG – Go jump in the lake! How to Govern Data Lakes Measuring the Value of a Governed Data Lake
  • 25. 17 17 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Unlocking the value depends on the data lake being broadly usable • What is the value of R&D? What is the value of avoiding a disaster? • The context of the data lake is key – What is the purpose of the data lake? – What is the tool the data lake will help you solve? – How much value does governance (lightweight or not) provide? • Value is measured in combination with the final use – AI/Machine Learning – Agility/Time to Market – Variety of end users served/capabilities enabled How to Govern Data Lakes Measuring the Value of a Governed Data Lake
  • 26. 18 18 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • In this webinar, Bob and Evan discussed: – The relationship between Data Lakes and Data Governance – Preventing your Data Lake from becoming a Data Swamp – Governing the Metadata associated with your Data Lake – Leveraging governed data to provide trustworthy Analytics – Measuring the value of a governed Data Lake How to Govern Data Lakes Abstract
  • 27. 19 19 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Questions and Answers Real-World Data Governance Contact Information Join us in the Dataversity Community to continue the conversation. https://community.dataversity.net/
  • 28. 20 20 Copyright © 2019 Robert S. Seiner – KIK Consulting & EducationalServices / TDAN.com Non-InvasiveData Governance™ is a trademark of Robert S. Seiner & KIK Consulting #RWDG @RSeiner • Robert S. Seiner KIK Consulting & Educational Services – KIKconsulting.com The Data Administration Newsletter – TDAN.com Post Office Box 112571, Upper St. Clair, Pennsylvania 15241 412.220.9643, 412.220.9644 (Fax) rseiner@kikconsulting.com rseiner@tdan.com @RSeiner @TDAN_com #RWDG Real-World Data Governance Contact Information