Quack Chat: Diving into Data Governance

Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
Diving into Data Governance
March 14, 2017
Ron Huizenga
Senior Product Manager, EnterpriseArchitecture & Modeling
@DataAviator

Topics
• Second level
• Third level
− Fourth level
• Fifth level
2
ER/Studio Enterprise Team Edition

Topics
• Second level
• Third level
− Fourth level
• Fifth level
3
Agenda
 Governance Overview
 Definitions
 Master Data
 Data lineage & life cycle
 Master Data Management (MDM)
 Importance of Data Models
 Data quality
 Change Management & Audit
 Business Glossaries
 Data Maturity
Data
Governance
Data
Architecture
Management
Data
Development
Database
Operations
Management
Data Security
Management
Reference &
Master Data
Management
Data
Warehousing
& Business
Intelligence
Management
Document &
Content
Management
Metadata
Management
Data Quality
Management

Topics
• Second level
• Third level
− Fourth level
• Fifth level
4
DMBOK: Definitions
 Data Governance
 The exercise of authority, control and shared decision making (planning, monitoring and
enforcement) over the management of data assets.
 Master Data
 Synonymous with reference data.The data that provides the context for transaction data. It
includes the details (definitions and identifiers) of internal and external objects involved in
business transactions. Includes data about customers, products, employees, vendors, and
controlled domains (code values).
 Master Data Management
 Processes that ensure that reference data is kept up to date and coordinated across an
enterprise. The organization, management and distribution of corporately adjudicated data
with widespread use in the organization.

Topics
• Second level
• Third level
− Fourth level
• Fifth level
5
Master Data Classification Considerations
 Behavior
 Life Cycle
 Complexity
 Value
 Volatility
 Reuse

Topics
• Second level
• Third level
− Fourth level
• Fifth level
6
Master Data - Behavior
 Can be described by the way it interacts with other data
 Master data is almost always involved with transactional data
 Often a noun/verb relationship between the master data item and the transaction
 Master data are the nouns
• Customer
• Product
 Transactional data capture the verbs
• Customer places order
• Product sold on order
 Same type of relationships are shared between facts and dimensions in a data
warehouse
 Master data are the dimensions
 Transactions are the facts

Topics
• Second level
• Third level
− Fourth level
• Fifth level
7
Master Data - Lifecycle
 Describes how a master data element is created, read, updated, deleted (CRUD)
 Many factors come into play
 Business rules
 Business processes
 Applications
 There may be more than 1 way a particular master data element is created
 Need to model:
 Business process
 Data lineage
• Data flow
• Integration
• Include ExtractTransform and Load (ETL) for data warehouse/data marts and staging areas

Topics
• Second level
• Third level
− Fourth level
• Fifth level
8
Business Process & Data CRUD

Topics
• Second level
• Third level
− Fourth level
• Fifth level
9
Master Data – Complexity, Value
 Complexity
 Very simple entities are rarely a challenge to manage
 The less complex an element, the less likely the need to manage change
• Likely not master data elements
• Possibly reference data
• States/Provinces
• Units of measure
• Classification references
 Value
 Value and complexity interact
 The higher value a data element is to an organization the more likely it will be considered
master data

Topics
• Second level
• Third level
− Fourth level
• Fifth level
10
Master Data - Volatility
 Level of change in characteristics describing a master data element
 Frequent change = high volatility
 Infrequent change = low volatility
 Sometimes referred to as stability
 Frequent change = unstable
 Infrequent change = stable

Topics
• Second level
• Third level
− Fourth level
• Fifth level
11
Master Data - Reuse
 Master data elements are often shared across a number of systems
 Can lead to inconsistency and errors
 Multiple systems
 Which is the “version of truth”
 Spreadsheets
 Private data stores
 An error in master data can cause errors in
 All the transactions that use it
 All the applications that use it
 All reports and analytics that use it
 This is one of the primary reasons for “Master Data Management”

Topics
• Second level
• Third level
− Fourth level
• Fifth level
12
Data Lineage
 Source of truth
 Chain of custody
 Transformations
12

Topics
• Second level
• Third level
− Fourth level
• Fifth level
13
Data Lineage

Topics
• Second level
• Third level
− Fourth level
• Fifth level
14
What is Master Data Management?
 The processes, tools and technology required to create and maintain consistent and
accurate lists of master data
 Includes both creating and maintaining master data
 Often requires fundamental changes in business process
 Not just a technological problem
 Some of the most difficult issues are more political than technical
 Organization wide MDM may be difficult
 Many organizations begin with critical, high value elements
 Grow and expand
 MDM is not a project
 Ongoing
 Continuous improvement

Topics
• Second level
• Third level
− Fourth level
• Fifth level
15
MDM Activities
 Identify sources of master data
 Identify the producers and
consumers of the master data
 Collect and analyze metadata about
for your master data
 Appoint data stewards
 Implement a data-governance
program and council
 Develop the master-data model
 Choose a toolset
 Design the infrastructure
 Generate and test the master data
 Modify the producing and consuming
systems
 Be sure to incorporate versioning and
auditing

Topics
• Second level
• Third level
− Fourth level
• Fifth level
16
Importance of data models
 Full Specification
 Logical
 Physical
 Descriptive metadata
 Names
 Definitions (data dictionary)
 Notes
 Implementation characteristics
 Data types
 Keys
 Indexes
 Views
 Business Rules
 Relationships (referential constraints)
 Value Restrictions (constraints)
 Security Classifications + Rules
 Governance Metadata
 Master Data Management classes
 Data Quality classifications
 Retention policies

Topics
• Second level
• Third level
− Fourth level
• Fifth level
17
Data Dictionary – Metadata Extensions

Topics
• Second level
• Third level
− Fourth level
• Fifth level
18
ER/Studio – Metadata Attachment Setup

Topics
• Second level
• Third level
− Fourth level
• Fifth level
19
Universal Mappings
 Ability to link “like” or related objects
 Within same model file
 Across separate model files
 Entity/Table level
 Attribute/Column level

Topics
• Second level
• Third level
− Fourth level
• Fifth level
20
Universal Mappings

Topics
• Second level
• Third level
− Fourth level
• Fifth level
21
Data Quality (DMBOK)
 Data Quality
 The degree to which data is accurate, complete, timely, consistent with all requirements
and business rules, and relevant for a given use.
 Information Quality
 The degree to which information consistently meets the requirements and expectations of
knowledge workers in performing their jobs.
 In the context of a specific use, the degree to which information is meet the requirements
and expectations for that use.

Topics
• Second level
• Third level
− Fourth level
• Fifth level
22
Data Quality
 Accuracy
 Timeliness
 Completeness
 Consistency
 Relevance
 Fitness For Use

Topics
• Second level
• Third level
− Fourth level
• Fifth level
23
Poor Data Quality Implications
 Costs a typical company the equivalent of 15% to 20% of revenue
 Estimated by US Insurance Data Management Association
 Low Quality = Low Efficiency
 It is insidious – most data quality issues are hidden in day to day work
 From time to time, a small amount of bad data leads to a disaster of epic
proportions

Topics
• Second level
• Third level
− Fourth level
• Fifth level
24
Poor data quality isn’t a new problem

Topics
• Second level
• Third level
− Fourth level
• Fifth level
25
Mitigation Best Practice
 Adopt the philosophy of prevention
 Show thought leadership
 Be accountable at the points of data creation
 Measure, control, improve
 Establish data culture

Topics
• Second level
• Third level
− Fourth level
• Fifth level
26
Change Management Tasks & Requirements

Topics
• Second level
• Third level
− Fourth level
• Fifth level
27
Change Management & Audit

Topics
• Second level
• Third level
− Fourth level
• Fifth level
28
Providing Meaning: Business Glossary

Topics
• Second level
• Third level
− Fourth level
• Fifth level
29
Glossary Integration

Topics
• Second level
• Third level
− Fourth level
• Fifth level
30
Data Maturity
Level 0 1 2 3 4 5
Description None Initial Managed Standardized Advanced Optimized
Data Governance None Project Level Program Level Division Level Cross Divisional Enterprise Wide
Master Data Management
no formal master
data clasification
Non-integrated
master data
Integrated, shared
master data
repository
Data Management Services
Master data stewards
established
Data stewardship
council
Data Integration
ad-hoc, point to
point
Reactive, point-to-
point interfaces,
some common tools,
lack of standards
common integration
platform, design
patterns
Middleware utilization:
service bus, canonical
model, business rules,
repository
Data Excellence
Centre (education
and training)
Data Excellence
embedded in
corporate culture
Data Quality
Silos, scattered data,
inconsistencies
accepted
Recognition of
inconsistecies but no
management plan to
address
Data cleansing at
consumption in
order to attempt
data quality
improvement
Data Quality KPI's and
conformance visibility,
some cleansing at source.
Prevention approach
to data quality
Full data quality
management
practice
Behaviour
Unaware /
Denial
Chaotic Reactive Stable Proactive Predictive
Continuous Improvement
Introduction Expanson Transformation
Technology &
Infrastructure
Information &
Strategic Business
Enablement
Primary IS Focus
HIGH LOWRisk
LOW HIGHValue Generation

Topics
• Second level
• Third level
− Fourth level
• Fifth level
31
Summary
 Master Data Management and Data Quality are vital aspects of Data Governance
 Master Data Characteristics
 Behavior
 Lifecycle
 Complexity
 Volatility
 Reuse
 MDM is an ongoing, continuous improvement discipline, not a project
 Data models & metadata constitute the blueprint for data governance
 Change management and auditability is paramount for compliance
 Integrated business glossaries provide definition and context
 Achieving data maturity is a journey

Topics
• Second level
• Third level
− Fourth level
• Fifth level
32
Q&A
Join the discussion: http://community.idera.com/
You can find me at:
ron.huizenga@idera.com
@DataAviator

Quack Chat: Diving into Data Governance

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (17)

Similar to Quack Chat: Diving into Data Governance

Similar to Quack Chat: Diving into Data Governance (20)

More from IDERA Software

More from IDERA Software (20)

Recently uploaded

Recently uploaded (20)

Quack Chat: Diving into Data Governance

Editor's Notes