SlideShare a Scribd company logo
Operationalizing Data Governance
through Data Quality Control
David Loshin
Knowledge Integrity, Inc.
www.knowledge-integrity.com
1© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
Linking Organization to Practice
p  Mapping business value drivers to
data expectations and objectives,
granting oversight and
accountability, and verifying
performance of compliance with
corporate information policies
n  Processes prescribed for operations,
n  Procedures for day-to-day
observance
n  Oversight for verifying compliance
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
2
Accountability
Data Policies
Processes &
Best Practices
Information
Standards
Business
Policy
Roles and
responsibilities
Program management
Data policies and
standards
Business intelligence
Business terminology
Data quality
Auditability
Business Value and Data Dependence
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
3
Expenses
Risk
Management
Revenue Customer
Experience
Performance
p  Business policies,
corporate mission, and
strategic performance
objectives can be
translated into
dimensions of value
p  These criteria are used
for prioritizing effort in
relation to maximizing
value
p  Data governance helps
establish the
relationship between
value drivers and
information utility
Sources of Information Policy
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
4
Expenses
Risk
Management
Revenue Customer
Experience
Performance
•  Customer lifetime
value analysis
•  Voice of the
customer
•  Satisfaction surveys
•  Asset productivity
analysis
•  Human capital
performance
•  Regulations
•  Operational risk
•  Market factors
•  Customer acquisition
and retention
•  Investment
opportunities
•  Product performance
•  Spend analysis
•  Commodity risk
•  Cost management
Data Management Challenges
Data Use
and ReUse
Data
Requirements
Reinterpreting
Semantics
Measurement
Triage and
Remediation
Impact
Assessment
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
5
The Data Policy Lifecycle: Actualizing Governance
p  Refinement of business requirements for facets of information
utility
p  Specification for data quality measures and level of
acceptability
p  Determination of functional requirements to facilitate
continuous compliance
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
6
Determination
of need
Drafting a
Policy
Policy Review
& Approval
Design &
Development
Marketing Deployment
Information Policies and Data Governance
Integrated
Data
Governance
Manage User
Requirements
Data
Discovery
Shared
Semantics
Embedded
Validation
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
7
•  Business metadata management
•  Data quality discovery and assessment
•  Specifying data quality rules
•  Inspection, monitoring, measurement
•  Managing data lineage
Managing the Quality of Business Metadata
p  Many sources of entity concepts and business
terms may conflict with each other
p  The data governance framework must
facilitate the collection, documentation, and
harmonization of business terms
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
8
Policies
System Docs
Processes
Models
Standards
Applications
Business Rules
Profiling
Etc.
Entity Concepts
Business
Terms
Definition
Contextual
Meaning
… …
Definition
Contextual
Meaning
Definition
Contextual
Meaning
Definition
Contextual
Meaning
Data Discovery
p  Data Discovery enables these
types of questions to be
answered:
n  What data sets are available?
n  What entities are embedded?
n  What data elements are
available?
n  How is the data accessed?
n  What are the quality
constraints?
p  The results can be shared via
a platform for managing
semantic metadata
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
9
DataIntegration
Attribute
First d 4 6 y
Last f 6 2 h
Street d 4 7 n
City a 0 2 o
State
Value Count
A 12000
I 10000
L 7655
X 3208
N 120
M 8
Data Quality Assessment
p  Analysis of data sets, records, data elements, and data values
to
n  Identify potential anomalies
n  Determine business impacts
n  Evaluate dimensions for measurement of data quality
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
10
Analysis
Data Quality: Expectations, Rules, and Monitoring
p  Data quality rules can be used to monitor conformance to data
policies
p  Conformance can be measured, thresholded, and reported at
each handoff location in the processing stream
p  Specific failures can generate events as directed by Data
Quality Service Level Agreements
p  Static auditing: measurement applied to a “static” data set
n  Examples: SQL queries, data profiling tools
p  Inlined monitoring: measurement performed within a process
flow
n  Example: edit checks, dynamic monitors
p  All measurements are compared against acceptability
thresholds
p  Acceptability threshold is related to the degree of impact
1111© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
Data Quality Rules: Measures and Thresholds
p  Provide specific
n  Measures
n  Methods of measurement
n  Units of measures
n  Levels of acceptability
© 2012 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
12
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
1313
Data Quality Control
p  Controls measure
observance of data
expectations based on
information policies and
corresponding data rules
p  Those rules are refined
based on an analysis of the
data dependencies and
defined expectations
p  Controls are placed at
relevant locations within
the process stream
Producer
Process
Consumer
Process
As data is handed off
between process tasks,
controls validate accuracy,
completeness, consistency,
timeliness against defined
expectations
Instituting Inspection Using Data Quality Rules
p  Apply tools and techniques for measuring conformance to data
rules (using data profiling and data monitoring tools):
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
14
Instituting Inspection Using Data Quality Rules
p  Data quality expectations are inspected and any emerging
issues are identified:
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
15
Instituting Inspection Using Data Quality Rules
p  Different events can be triggered by a data failure, such as
notifications to data stewards:
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
16
Instituting Inspection Using Data Quality Rules
p  Or logging the failure in a Data Quality Incident Management
System and score card:
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
17
Instituting Inspection Using Data Quality Rules
p  Effectiveness demonstrated when:
n  Control events occur when data failure events take place,
n  The proper mitigation or remediation actions are performed,
n  The corrective actions to correct the problem and eliminate its root
cause are performed within a reasonable time frame, and
n  A control event for the same issue is never triggered further
downstream
p  Measurements can be aggregated over time into performance
metrics
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
18
Integrating Data Quality Reporting with Governance
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
19
Processing
Stage
Processing
Stage
Processing
Stage
Processing
Stage
Tools & Processes: Operationalizing Data Governance
p  Methods and tools for data discovery: profiling
data, statistical analysis of values, and model
evaluation
p  Metadata management through a central platform
for knowledge capture and communication
p  End-to-end visibility of lineage for structure,
semantics, and use across enterprise
p  Data Quality assessment
p  Integrated data quality control
p  Inspection, monitoring, and reporting
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
20
Questions and Open Discussion
p  www.knowledge-integrity.com
p  If you have questions,
comments, or suggestions,
please contact me
David Loshin
301-754-6350
loshin@knowledge-integrity.com
© 2013 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350
21
www.dataqualitybook.com
www.mdmbook.com

More Related Content

Similar to Loshin operationalizingdatagovernance

Geek Sync | 5 Best Practices for Operationalizing Data Governance
Geek Sync | 5 Best Practices for Operationalizing Data GovernanceGeek Sync | 5 Best Practices for Operationalizing Data Governance
Geek Sync | 5 Best Practices for Operationalizing Data Governance
IDERA Software
 
Analytics: What is it really and how can it help my organization?
Analytics: What is it really and how can it help my organization?Analytics: What is it really and how can it help my organization?
Analytics: What is it really and how can it help my organization?
SAS Canada
 
How do you assess the quality and reliability of data sources in data analysi...
How do you assess the quality and reliability of data sources in data analysi...How do you assess the quality and reliability of data sources in data analysi...
How do you assess the quality and reliability of data sources in data analysi...
Soumodeep Nanee Kundu
 
The Relevance of Data Analytics in External Audit.pdf
The Relevance of Data Analytics in External Audit.pdfThe Relevance of Data Analytics in External Audit.pdf
The Relevance of Data Analytics in External Audit.pdf
Fiyona Nourin
 
Fuel your Data-Driven Ambitions with Data Governance
Fuel your Data-Driven Ambitions with Data GovernanceFuel your Data-Driven Ambitions with Data Governance
Fuel your Data-Driven Ambitions with Data Governance
Pedro Martins
 
PERFORMANCE & STRATEGIC INFORMATION MANAGEMENT
 PERFORMANCE & STRATEGIC INFORMATION MANAGEMENT PERFORMANCE & STRATEGIC INFORMATION MANAGEMENT
PERFORMANCE & STRATEGIC INFORMATION MANAGEMENT
Office des Forages Ruraux. Government of Senegal
 
A Business-first Approach to Building Data Governance Programs
A Business-first Approach to Building Data Governance ProgramsA Business-first Approach to Building Data Governance Programs
A Business-first Approach to Building Data Governance Programs
Precisely
 
Automating and Orchestrating Processes and Decisions Across the Enterprise
Automating and Orchestrating Processes and Decisions Across the EnterpriseAutomating and Orchestrating Processes and Decisions Across the Enterprise
Automating and Orchestrating Processes and Decisions Across the Enterprise
Denis Gagné
 
Workable Enteprise Data Governance
Workable Enteprise Data GovernanceWorkable Enteprise Data Governance
Workable Enteprise Data Governance
Bhavendra Chavan
 
Empowering Your Business with Advanced Data Analytics Services
 Empowering Your Business with Advanced Data Analytics Services Empowering Your Business with Advanced Data Analytics Services
Empowering Your Business with Advanced Data Analytics Services
Corotsystems
 
Introduction-to-Data-Research-Services.pptx
Introduction-to-Data-Research-Services.pptxIntroduction-to-Data-Research-Services.pptx
Introduction-to-Data-Research-Services.pptx
soulilutionitfirmusa
 
Data Quality Rules introduction
Data Quality Rules introductionData Quality Rules introduction
Data Quality Rules introduction
datatovalue
 
The Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision ModelingThe Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision Modeling
Decision Management Solutions
 
Lifecycle Management
Lifecycle ManagementLifecycle Management
Lifecycle Management
Glen Alleman
 
Data Governance: From speed dating to lifelong partnership
Data Governance: From speed dating to lifelong partnershipData Governance: From speed dating to lifelong partnership
Data Governance: From speed dating to lifelong partnership
Precisely
 
5 Signs Your Privacy Management Program is Not Working for You
5 Signs Your Privacy Management Program is Not Working for You5 Signs Your Privacy Management Program is Not Working for You
5 Signs Your Privacy Management Program is Not Working for You
TrustArc
 
WCAR Rutgers Presentation Nov 2013
WCAR Rutgers Presentation Nov 2013WCAR Rutgers Presentation Nov 2013
WCAR Rutgers Presentation Nov 2013
Vikas Dutta, CISA, CIPP/IT, ISO 27001 LA
 
ITSM and Service Catalog Overview
ITSM and Service Catalog OverviewITSM and Service Catalog Overview
ITSM and Service Catalog Overview
Christopher Glennon
 
Too much data and not enough analytics!
Too much data and not enough analytics!Too much data and not enough analytics!
Too much data and not enough analytics!
Emma Kelly
 
dimensions_of_data_quality.pptx
dimensions_of_data_quality.pptxdimensions_of_data_quality.pptx
dimensions_of_data_quality.pptx
hailemariam hailemariam
 

Similar to Loshin operationalizingdatagovernance (20)

Geek Sync | 5 Best Practices for Operationalizing Data Governance
Geek Sync | 5 Best Practices for Operationalizing Data GovernanceGeek Sync | 5 Best Practices for Operationalizing Data Governance
Geek Sync | 5 Best Practices for Operationalizing Data Governance
 
Analytics: What is it really and how can it help my organization?
Analytics: What is it really and how can it help my organization?Analytics: What is it really and how can it help my organization?
Analytics: What is it really and how can it help my organization?
 
How do you assess the quality and reliability of data sources in data analysi...
How do you assess the quality and reliability of data sources in data analysi...How do you assess the quality and reliability of data sources in data analysi...
How do you assess the quality and reliability of data sources in data analysi...
 
The Relevance of Data Analytics in External Audit.pdf
The Relevance of Data Analytics in External Audit.pdfThe Relevance of Data Analytics in External Audit.pdf
The Relevance of Data Analytics in External Audit.pdf
 
Fuel your Data-Driven Ambitions with Data Governance
Fuel your Data-Driven Ambitions with Data GovernanceFuel your Data-Driven Ambitions with Data Governance
Fuel your Data-Driven Ambitions with Data Governance
 
PERFORMANCE & STRATEGIC INFORMATION MANAGEMENT
 PERFORMANCE & STRATEGIC INFORMATION MANAGEMENT PERFORMANCE & STRATEGIC INFORMATION MANAGEMENT
PERFORMANCE & STRATEGIC INFORMATION MANAGEMENT
 
A Business-first Approach to Building Data Governance Programs
A Business-first Approach to Building Data Governance ProgramsA Business-first Approach to Building Data Governance Programs
A Business-first Approach to Building Data Governance Programs
 
Automating and Orchestrating Processes and Decisions Across the Enterprise
Automating and Orchestrating Processes and Decisions Across the EnterpriseAutomating and Orchestrating Processes and Decisions Across the Enterprise
Automating and Orchestrating Processes and Decisions Across the Enterprise
 
Workable Enteprise Data Governance
Workable Enteprise Data GovernanceWorkable Enteprise Data Governance
Workable Enteprise Data Governance
 
Empowering Your Business with Advanced Data Analytics Services
 Empowering Your Business with Advanced Data Analytics Services Empowering Your Business with Advanced Data Analytics Services
Empowering Your Business with Advanced Data Analytics Services
 
Introduction-to-Data-Research-Services.pptx
Introduction-to-Data-Research-Services.pptxIntroduction-to-Data-Research-Services.pptx
Introduction-to-Data-Research-Services.pptx
 
Data Quality Rules introduction
Data Quality Rules introductionData Quality Rules introduction
Data Quality Rules introduction
 
The Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision ModelingThe Value of Predictive Analytics and Decision Modeling
The Value of Predictive Analytics and Decision Modeling
 
Lifecycle Management
Lifecycle ManagementLifecycle Management
Lifecycle Management
 
Data Governance: From speed dating to lifelong partnership
Data Governance: From speed dating to lifelong partnershipData Governance: From speed dating to lifelong partnership
Data Governance: From speed dating to lifelong partnership
 
5 Signs Your Privacy Management Program is Not Working for You
5 Signs Your Privacy Management Program is Not Working for You5 Signs Your Privacy Management Program is Not Working for You
5 Signs Your Privacy Management Program is Not Working for You
 
WCAR Rutgers Presentation Nov 2013
WCAR Rutgers Presentation Nov 2013WCAR Rutgers Presentation Nov 2013
WCAR Rutgers Presentation Nov 2013
 
ITSM and Service Catalog Overview
ITSM and Service Catalog OverviewITSM and Service Catalog Overview
ITSM and Service Catalog Overview
 
Too much data and not enough analytics!
Too much data and not enough analytics!Too much data and not enough analytics!
Too much data and not enough analytics!
 
dimensions_of_data_quality.pptx
dimensions_of_data_quality.pptxdimensions_of_data_quality.pptx
dimensions_of_data_quality.pptx
 

More from Taldor Group

7. emc isilon hdfs enterprise storage for hadoop
7. emc isilon hdfs   enterprise storage for hadoop7. emc isilon hdfs   enterprise storage for hadoop
7. emc isilon hdfs enterprise storage for hadoop
Taldor Group
 
5. big data vs it stki - pini cohen
5. big data vs  it    stki - pini cohen5. big data vs  it    stki - pini cohen
5. big data vs it stki - pini cohen
Taldor Group
 
4. hadoop גיא לבנברג
4. hadoop  גיא לבנברג4. hadoop  גיא לבנברג
4. hadoop גיא לבנברג
Taldor Group
 
3. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 20133. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 2013
Taldor Group
 
A new platform for a new era emc
A new platform for a new era   emcA new platform for a new era   emc
A new platform for a new era emc
Taldor Group
 
Yossi cohen 3 base
Yossi cohen   3 baseYossi cohen   3 base
Yossi cohen 3 base
Taldor Group
 
פיני מנדל תובנות עסקיות מיישומי Hadoop
פיני מנדל   תובנות עסקיות מיישומי Hadoopפיני מנדל   תובנות עסקיות מיישומי Hadoop
פיני מנדל תובנות עסקיות מיישומי Hadoop
Taldor Group
 
נתן פרידחי הקדמה לכנס Hadoop
נתן פרידחי   הקדמה לכנס Hadoopנתן פרידחי   הקדמה לכנס Hadoop
נתן פרידחי הקדמה לכנס Hadoop
Taldor Group
 
הערך העסקי שבאיכות הנתונים קוסטין מרזאה
הערך העסקי שבאיכות הנתונים   קוסטין מרזאההערך העסקי שבאיכות הנתונים   קוסטין מרזאה
הערך העסקי שבאיכות הנתונים קוסטין מרזאהTaldor Group
 
Dcl צביקה מנלה - סיפורי לקוחות
Dcl   צביקה מנלה - סיפורי לקוחותDcl   צביקה מנלה - סיפורי לקוחות
Dcl צביקה מנלה - סיפורי לקוחותTaldor Group
 
Taldor data quality einat shimoni - stki
Taldor data quality   einat shimoni - stkiTaldor data quality   einat shimoni - stki
Taldor data quality einat shimoni - stki
Taldor Group
 
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 32013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
Taldor Group
 

More from Taldor Group (12)

7. emc isilon hdfs enterprise storage for hadoop
7. emc isilon hdfs   enterprise storage for hadoop7. emc isilon hdfs   enterprise storage for hadoop
7. emc isilon hdfs enterprise storage for hadoop
 
5. big data vs it stki - pini cohen
5. big data vs  it    stki - pini cohen5. big data vs  it    stki - pini cohen
5. big data vs it stki - pini cohen
 
4. hadoop גיא לבנברג
4. hadoop  גיא לבנברג4. hadoop  גיא לבנברג
4. hadoop גיא לבנברג
 
3. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 20133. ami big data hadoop on ucs seminar may 2013
3. ami big data hadoop on ucs seminar may 2013
 
A new platform for a new era emc
A new platform for a new era   emcA new platform for a new era   emc
A new platform for a new era emc
 
Yossi cohen 3 base
Yossi cohen   3 baseYossi cohen   3 base
Yossi cohen 3 base
 
פיני מנדל תובנות עסקיות מיישומי Hadoop
פיני מנדל   תובנות עסקיות מיישומי Hadoopפיני מנדל   תובנות עסקיות מיישומי Hadoop
פיני מנדל תובנות עסקיות מיישומי Hadoop
 
נתן פרידחי הקדמה לכנס Hadoop
נתן פרידחי   הקדמה לכנס Hadoopנתן פרידחי   הקדמה לכנס Hadoop
נתן פרידחי הקדמה לכנס Hadoop
 
הערך העסקי שבאיכות הנתונים קוסטין מרזאה
הערך העסקי שבאיכות הנתונים   קוסטין מרזאההערך העסקי שבאיכות הנתונים   קוסטין מרזאה
הערך העסקי שבאיכות הנתונים קוסטין מרזאה
 
Dcl צביקה מנלה - סיפורי לקוחות
Dcl   צביקה מנלה - סיפורי לקוחותDcl   צביקה מנלה - סיפורי לקוחות
Dcl צביקה מנלה - סיפורי לקוחות
 
Taldor data quality einat shimoni - stki
Taldor data quality   einat shimoni - stkiTaldor data quality   einat shimoni - stki
Taldor data quality einat shimoni - stki
 
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 32013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
 

Loshin operationalizingdatagovernance

  • 1. Operationalizing Data Governance through Data Quality Control David Loshin Knowledge Integrity, Inc. www.knowledge-integrity.com 1© 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350
  • 2. Linking Organization to Practice p  Mapping business value drivers to data expectations and objectives, granting oversight and accountability, and verifying performance of compliance with corporate information policies n  Processes prescribed for operations, n  Procedures for day-to-day observance n  Oversight for verifying compliance © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 2 Accountability Data Policies Processes & Best Practices Information Standards Business Policy Roles and responsibilities Program management Data policies and standards Business intelligence Business terminology Data quality Auditability
  • 3. Business Value and Data Dependence © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 3 Expenses Risk Management Revenue Customer Experience Performance p  Business policies, corporate mission, and strategic performance objectives can be translated into dimensions of value p  These criteria are used for prioritizing effort in relation to maximizing value p  Data governance helps establish the relationship between value drivers and information utility
  • 4. Sources of Information Policy © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 4 Expenses Risk Management Revenue Customer Experience Performance •  Customer lifetime value analysis •  Voice of the customer •  Satisfaction surveys •  Asset productivity analysis •  Human capital performance •  Regulations •  Operational risk •  Market factors •  Customer acquisition and retention •  Investment opportunities •  Product performance •  Spend analysis •  Commodity risk •  Cost management
  • 5. Data Management Challenges Data Use and ReUse Data Requirements Reinterpreting Semantics Measurement Triage and Remediation Impact Assessment © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 5
  • 6. The Data Policy Lifecycle: Actualizing Governance p  Refinement of business requirements for facets of information utility p  Specification for data quality measures and level of acceptability p  Determination of functional requirements to facilitate continuous compliance © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 6 Determination of need Drafting a Policy Policy Review & Approval Design & Development Marketing Deployment
  • 7. Information Policies and Data Governance Integrated Data Governance Manage User Requirements Data Discovery Shared Semantics Embedded Validation © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 7 •  Business metadata management •  Data quality discovery and assessment •  Specifying data quality rules •  Inspection, monitoring, measurement •  Managing data lineage
  • 8. Managing the Quality of Business Metadata p  Many sources of entity concepts and business terms may conflict with each other p  The data governance framework must facilitate the collection, documentation, and harmonization of business terms © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 8 Policies System Docs Processes Models Standards Applications Business Rules Profiling Etc. Entity Concepts Business Terms Definition Contextual Meaning … … Definition Contextual Meaning Definition Contextual Meaning Definition Contextual Meaning
  • 9. Data Discovery p  Data Discovery enables these types of questions to be answered: n  What data sets are available? n  What entities are embedded? n  What data elements are available? n  How is the data accessed? n  What are the quality constraints? p  The results can be shared via a platform for managing semantic metadata © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 9 DataIntegration
  • 10. Attribute First d 4 6 y Last f 6 2 h Street d 4 7 n City a 0 2 o State Value Count A 12000 I 10000 L 7655 X 3208 N 120 M 8 Data Quality Assessment p  Analysis of data sets, records, data elements, and data values to n  Identify potential anomalies n  Determine business impacts n  Evaluate dimensions for measurement of data quality © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 10 Analysis
  • 11. Data Quality: Expectations, Rules, and Monitoring p  Data quality rules can be used to monitor conformance to data policies p  Conformance can be measured, thresholded, and reported at each handoff location in the processing stream p  Specific failures can generate events as directed by Data Quality Service Level Agreements p  Static auditing: measurement applied to a “static” data set n  Examples: SQL queries, data profiling tools p  Inlined monitoring: measurement performed within a process flow n  Example: edit checks, dynamic monitors p  All measurements are compared against acceptability thresholds p  Acceptability threshold is related to the degree of impact 1111© 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350
  • 12. Data Quality Rules: Measures and Thresholds p  Provide specific n  Measures n  Methods of measurement n  Units of measures n  Levels of acceptability © 2012 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 12
  • 13. © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 1313 Data Quality Control p  Controls measure observance of data expectations based on information policies and corresponding data rules p  Those rules are refined based on an analysis of the data dependencies and defined expectations p  Controls are placed at relevant locations within the process stream Producer Process Consumer Process As data is handed off between process tasks, controls validate accuracy, completeness, consistency, timeliness against defined expectations
  • 14. Instituting Inspection Using Data Quality Rules p  Apply tools and techniques for measuring conformance to data rules (using data profiling and data monitoring tools): © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 14
  • 15. Instituting Inspection Using Data Quality Rules p  Data quality expectations are inspected and any emerging issues are identified: © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 15
  • 16. Instituting Inspection Using Data Quality Rules p  Different events can be triggered by a data failure, such as notifications to data stewards: © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 16
  • 17. Instituting Inspection Using Data Quality Rules p  Or logging the failure in a Data Quality Incident Management System and score card: © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 17
  • 18. Instituting Inspection Using Data Quality Rules p  Effectiveness demonstrated when: n  Control events occur when data failure events take place, n  The proper mitigation or remediation actions are performed, n  The corrective actions to correct the problem and eliminate its root cause are performed within a reasonable time frame, and n  A control event for the same issue is never triggered further downstream p  Measurements can be aggregated over time into performance metrics © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 18
  • 19. Integrating Data Quality Reporting with Governance © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 19 Processing Stage Processing Stage Processing Stage Processing Stage
  • 20. Tools & Processes: Operationalizing Data Governance p  Methods and tools for data discovery: profiling data, statistical analysis of values, and model evaluation p  Metadata management through a central platform for knowledge capture and communication p  End-to-end visibility of lineage for structure, semantics, and use across enterprise p  Data Quality assessment p  Integrated data quality control p  Inspection, monitoring, and reporting © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 20
  • 21. Questions and Open Discussion p  www.knowledge-integrity.com p  If you have questions, comments, or suggestions, please contact me David Loshin 301-754-6350 loshin@knowledge-integrity.com © 2013 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 21 www.dataqualitybook.com www.mdmbook.com