SlideShare a Scribd company logo
Taming the Digital Transformation Dragon
Big Data Governance Practices
Jay Zaidi
November 14, 2016
ABOUT ME
“Succeeding with data isn’t just a matter of putting Hadoop in
your machine room, or hiring some physicists with crazy math
skills. It requires you to develop a data culture that involves
people throughout the organization.”
- DJ Patil, Chief Data Scientist of the U.S.
● Founded AlyData in 2014. We specialise
in Data Management and Data Science.
Our mission is to transform
organizations, by helping them innovate
and gain a competitive advantage by
unleashing value from their data and
information assets.
● 13 Years at Fannie Mae. Last 5 years
was a direct report to the CDO.
● Have authored 2 books and over 80
articles on data management on
LinkedIn.
● Email - jayzaidi@alydata.com
● LinkedIn Profile -
https://www.linkedin.com/in/javedzai
di
● Twitter - @jayzaidi
#data-driven leadership 2
3 MAJOR THEMES I WISH TO HIGHLIGHT -
1. Most important areas leaders focus on - Value Creation and Risk Management. “Data” is
foundational to that.
2. We are living in the Fourth Industrial Revolution or the “Age of Data”. Data practitioners must
focus on “What Data Does (value) and How to Govern It (risk)” not “What Data Is”. Value is
generated by Data Science by unleashing the power of data via better insights and Risk is
managed via robust Data Governance.
3. Data Governance is at the heart of Risk Management and Data Management (Master Data
Management, Data Quality, Metadata, Analytics, and Data Science). Build it into the Big Data
Lake, Data Science, and Small Data Repositories.
#create-value & #manage-risk3
#data-science 4
#data-governance 5
A FRAMEWORK FOR UNDERSTANDING DATA GOVERNANCE
6
Business
Imperatives
Risk
Management
Regulatory
Compliance
Legal
Data Security &
Privacy
Data
Controls
Data
Optimization
WHYWHAT
Data democratization, data decentralization, and tougher regulatory environment are driving data
governance as a critical risk management component.
“Governance is about changing culture and
implementing processes to ensure proper
oversight of data semantics and quality,
transparency into data related metrics,
accountability for data, and timely resolution
of data-related issues or questions”
#manage-risk 7
8
BRIDGING THE DATA GOVERNANCE DIVIDE - SMALL AND BIG DATA
- Big data training about
unstructured data and containers
- metadata, quality, master data,
logs etc.
- Process training on changes made
to governance, quality, MDM,
metadata changes due to big
data
- Analysis training on issue logs,
interpreting them and working
with stakeholders to address data
issues
- Training on process changes due
to big data
- Training on new Training about
unstructured data and containers
- Training on process changes due
to big data
- Training on new tools used to
tag, profile, review big data
- Implement new tools used to tag,
profile, analyze, review big data
- Implement activity log analyzer,
automate and working with
stakeholders to address data
issues
- Training about unstructured data
and containers
Small Data - Structured data is highly organized information that uploads neatly into a relational database (schema on write), lives in fixed fields,
and is easily detectable via search operations or algorithms. Structured data is relatively simple to enter, store, query, and analyze, but it must
be strictly defined in terms of field name and type (e.g. alpha, numeric, date, currency).
Big Data - Unstructured data resides in files. It is increasingly available in the form of complex data sources, such as web logs, multimedia
content, email, customer service interactions, sales automation, and social media data. The fundamental challenge of unstructured data sources
is that they don’t follow a predefined schema, are difficult for nontechnical business users and data analysts alike to unbox, understand, and
prepare for analytic use. Beyond issues of structure, is the sheer volume of this type of data. Need additional contextual data to understand,
different tools to process/analyze, and more automation due to volume and variety.
#small-big-data
This architecture supports various data access patterns and governance.
ARCHITECTING A DATA LAKE WITH GOVERNANCE IN MIND - THE HOW?
#data-classification
#data-access-controls
#data-lineage
#data-semantics
#data-quality
#issue-log
9
Diagram Courtesy - “Enterprise Big Data Lake” a O’Reilly book by Alex Gorelik
#master-data
#people #process #tech
10
SOME EXAMPLES OF DATA LAKE GOVERNANCE
Client: Fortune 50 Financial Services
Problem: CISO and Security Operation Center was able to
pinpoint sensitive data being accessed via security logs but had
no idea where it resided and who was accountable for it.
Solution: The Data Governance program had already classified
data by sensitivity, identified enterprise critical data, and had a
list of data stewards and custodians accountable for each. This
facilitated the root cause analysis exercise to get to address the
questions that the security team had.
Client: Fortune 100 Healthcare Client
Problem: Reports to CFO and corporate disclosures required
significant effort and impacted time-to-value due to the quality
of data acquired from upstream systems.
Solution: The Enterprise Data Quality group collaborated with
CFO’s group and the three key upstream system staff to define
data quality requirements for key data, agreed on the quality
dimensions, implemented a data certification process and
provided guidance on usage of open source data quality tools to
ensure consistent delivery of high quality data.
Client: Federal Government Civilian Agency
Problem: Internal Audit team wasn’t able to quickly pinpoint
issues with underlying data and how it could impact risk to the
organization.
Solution: The Enterprise Data Quality team had already
implemented a data quality framework that standardized the
data quality dimensions, data profiling process and tools, and a
consistent way to report this information. They collaborated
with the Internal Audit staff to train them on these capabilities,
so that they could independently validate data and process.
Client: Fortune 500 Hospitality Industry
Problem: Sales and Marketing campaigns weren’t delivering
the desired ROI and there was no way to determine the root
cause.
Solution: The Enterprise Data Governance team engaged the
Stewards and Custodians for Customer, Sales, and Rewards
system organizations to discuss the consistency and quality of
data based on their specific quality reports and subject matter
expertise and concluded that the lack of Customer 360 data and
inconsistencies in customer and sales data across three siloed
systems was the root cause. EDG facilitated sessions between
various teams to address these issues.
#data-lake-governance
6-STEP GOVERNANCE PROCESS
1. Assess: Data Management Maturity Assessment focused on Governance, Quality and Master Data
Management.
2. Process: Apply our proprietary Scope, Process Automation, Ownership, Cross-functional
Engagement, and Human Intelligence (SPOCH) framework.
3. Agility: Use Agile Data Governance (DG), Data Quality (DQ) and Master Data Management (MDM)
processes.
4. Standards/Policies: Define and monitor compliance.
5. Alignment: Align Data Governance with IT governance.
6. Automation: Implement tools to automate DG processes, DQ profiling, Tagging, Discovery, etc.
#data-governance 11
APPENDIX
12
1. Data Governance Demystified – Lessons From The Trenches
2. Bridging the Data Governance Chasm
3. You Think You Know Data? Think Again
4. 6 Reasons Why Big Data Investments Aren’t Paying Off For Some Organizations
5. 5 Reasons More Companies Don’t Have Data Quality Programs
6. What’s An Information Supply Chain and Why You Should Care?
7. The Dark Side of Big Data
8. Re-Thinking Information Security and Data Governance
SOME ARTICLES I’VE AUTHORED
(link to over 80 articles)
#thought-leadership 13

More Related Content

Recently uploaded

一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
MAQIB18
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Domenico Conte
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 

Recently uploaded (20)

一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Computer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage sComputer Presentation.pptx ecommerce advantage s
Computer Presentation.pptx ecommerce advantage s
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPsWebinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
Webinar One View, Multiple Systems No-Code Integration of Salesforce and ERPs
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
2024-05-14 - Tableau User Group - TC24 Hot Topics - Tableau Pulse and Einstei...
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
Jpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization SampleJpolillo Amazon PPC - Bid Optimization Sample
Jpolillo Amazon PPC - Bid Optimization Sample
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 

Featured

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Best Governance Practices - World Quality Day 2016

  • 1. Taming the Digital Transformation Dragon Big Data Governance Practices Jay Zaidi November 14, 2016
  • 2. ABOUT ME “Succeeding with data isn’t just a matter of putting Hadoop in your machine room, or hiring some physicists with crazy math skills. It requires you to develop a data culture that involves people throughout the organization.” - DJ Patil, Chief Data Scientist of the U.S. ● Founded AlyData in 2014. We specialise in Data Management and Data Science. Our mission is to transform organizations, by helping them innovate and gain a competitive advantage by unleashing value from their data and information assets. ● 13 Years at Fannie Mae. Last 5 years was a direct report to the CDO. ● Have authored 2 books and over 80 articles on data management on LinkedIn. ● Email - jayzaidi@alydata.com ● LinkedIn Profile - https://www.linkedin.com/in/javedzai di ● Twitter - @jayzaidi #data-driven leadership 2
  • 3. 3 MAJOR THEMES I WISH TO HIGHLIGHT - 1. Most important areas leaders focus on - Value Creation and Risk Management. “Data” is foundational to that. 2. We are living in the Fourth Industrial Revolution or the “Age of Data”. Data practitioners must focus on “What Data Does (value) and How to Govern It (risk)” not “What Data Is”. Value is generated by Data Science by unleashing the power of data via better insights and Risk is managed via robust Data Governance. 3. Data Governance is at the heart of Risk Management and Data Management (Master Data Management, Data Quality, Metadata, Analytics, and Data Science). Build it into the Big Data Lake, Data Science, and Small Data Repositories. #create-value & #manage-risk3
  • 6. A FRAMEWORK FOR UNDERSTANDING DATA GOVERNANCE 6 Business Imperatives Risk Management Regulatory Compliance Legal Data Security & Privacy Data Controls Data Optimization WHYWHAT Data democratization, data decentralization, and tougher regulatory environment are driving data governance as a critical risk management component.
  • 7. “Governance is about changing culture and implementing processes to ensure proper oversight of data semantics and quality, transparency into data related metrics, accountability for data, and timely resolution of data-related issues or questions” #manage-risk 7
  • 8. 8 BRIDGING THE DATA GOVERNANCE DIVIDE - SMALL AND BIG DATA - Big data training about unstructured data and containers - metadata, quality, master data, logs etc. - Process training on changes made to governance, quality, MDM, metadata changes due to big data - Analysis training on issue logs, interpreting them and working with stakeholders to address data issues - Training on process changes due to big data - Training on new Training about unstructured data and containers - Training on process changes due to big data - Training on new tools used to tag, profile, review big data - Implement new tools used to tag, profile, analyze, review big data - Implement activity log analyzer, automate and working with stakeholders to address data issues - Training about unstructured data and containers Small Data - Structured data is highly organized information that uploads neatly into a relational database (schema on write), lives in fixed fields, and is easily detectable via search operations or algorithms. Structured data is relatively simple to enter, store, query, and analyze, but it must be strictly defined in terms of field name and type (e.g. alpha, numeric, date, currency). Big Data - Unstructured data resides in files. It is increasingly available in the form of complex data sources, such as web logs, multimedia content, email, customer service interactions, sales automation, and social media data. The fundamental challenge of unstructured data sources is that they don’t follow a predefined schema, are difficult for nontechnical business users and data analysts alike to unbox, understand, and prepare for analytic use. Beyond issues of structure, is the sheer volume of this type of data. Need additional contextual data to understand, different tools to process/analyze, and more automation due to volume and variety. #small-big-data
  • 9. This architecture supports various data access patterns and governance. ARCHITECTING A DATA LAKE WITH GOVERNANCE IN MIND - THE HOW? #data-classification #data-access-controls #data-lineage #data-semantics #data-quality #issue-log 9 Diagram Courtesy - “Enterprise Big Data Lake” a O’Reilly book by Alex Gorelik #master-data #people #process #tech
  • 10. 10 SOME EXAMPLES OF DATA LAKE GOVERNANCE Client: Fortune 50 Financial Services Problem: CISO and Security Operation Center was able to pinpoint sensitive data being accessed via security logs but had no idea where it resided and who was accountable for it. Solution: The Data Governance program had already classified data by sensitivity, identified enterprise critical data, and had a list of data stewards and custodians accountable for each. This facilitated the root cause analysis exercise to get to address the questions that the security team had. Client: Fortune 100 Healthcare Client Problem: Reports to CFO and corporate disclosures required significant effort and impacted time-to-value due to the quality of data acquired from upstream systems. Solution: The Enterprise Data Quality group collaborated with CFO’s group and the three key upstream system staff to define data quality requirements for key data, agreed on the quality dimensions, implemented a data certification process and provided guidance on usage of open source data quality tools to ensure consistent delivery of high quality data. Client: Federal Government Civilian Agency Problem: Internal Audit team wasn’t able to quickly pinpoint issues with underlying data and how it could impact risk to the organization. Solution: The Enterprise Data Quality team had already implemented a data quality framework that standardized the data quality dimensions, data profiling process and tools, and a consistent way to report this information. They collaborated with the Internal Audit staff to train them on these capabilities, so that they could independently validate data and process. Client: Fortune 500 Hospitality Industry Problem: Sales and Marketing campaigns weren’t delivering the desired ROI and there was no way to determine the root cause. Solution: The Enterprise Data Governance team engaged the Stewards and Custodians for Customer, Sales, and Rewards system organizations to discuss the consistency and quality of data based on their specific quality reports and subject matter expertise and concluded that the lack of Customer 360 data and inconsistencies in customer and sales data across three siloed systems was the root cause. EDG facilitated sessions between various teams to address these issues. #data-lake-governance
  • 11. 6-STEP GOVERNANCE PROCESS 1. Assess: Data Management Maturity Assessment focused on Governance, Quality and Master Data Management. 2. Process: Apply our proprietary Scope, Process Automation, Ownership, Cross-functional Engagement, and Human Intelligence (SPOCH) framework. 3. Agility: Use Agile Data Governance (DG), Data Quality (DQ) and Master Data Management (MDM) processes. 4. Standards/Policies: Define and monitor compliance. 5. Alignment: Align Data Governance with IT governance. 6. Automation: Implement tools to automate DG processes, DQ profiling, Tagging, Discovery, etc. #data-governance 11
  • 13. 1. Data Governance Demystified – Lessons From The Trenches 2. Bridging the Data Governance Chasm 3. You Think You Know Data? Think Again 4. 6 Reasons Why Big Data Investments Aren’t Paying Off For Some Organizations 5. 5 Reasons More Companies Don’t Have Data Quality Programs 6. What’s An Information Supply Chain and Why You Should Care? 7. The Dark Side of Big Data 8. Re-Thinking Information Security and Data Governance SOME ARTICLES I’VE AUTHORED (link to over 80 articles) #thought-leadership 13

Editor's Notes

  1. Information governance is control of information to meet legal, regulatory, risk, and business demands. That means, information is controlled to adhere to legal guidelines, such as the Federal Rules of Civil Procedure. It also means that regulatory requirements for keeping and producing business records are met. It further means that business risk and security considerations and an organization's