SlideShare a Scribd company logo
1 of 32
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
Diving into Data Governance
March 14, 2017
Ron Huizenga
Senior Product Manager, EnterpriseArchitecture & Modeling
@DataAviator
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
2
ER/Studio Enterprise Team Edition
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
3
Agenda
 Governance Overview
 Definitions
 Master Data
 Data lineage & life cycle
 Master Data Management (MDM)
 Importance of Data Models
 Data quality
 Change Management & Audit
 Business Glossaries
 Data Maturity
Data
Governance
Data
Architecture
Management
Data
Development
Database
Operations
Management
Data Security
Management
Reference &
Master Data
Management
Data
Warehousing
& Business
Intelligence
Management
Document &
Content
Management
Metadata
Management
Data Quality
Management
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
4
DMBOK: Definitions
 Data Governance
 The exercise of authority, control and shared decision making (planning, monitoring and
enforcement) over the management of data assets.
 Master Data
 Synonymous with reference data.The data that provides the context for transaction data. It
includes the details (definitions and identifiers) of internal and external objects involved in
business transactions. Includes data about customers, products, employees, vendors, and
controlled domains (code values).
 Master Data Management
 Processes that ensure that reference data is kept up to date and coordinated across an
enterprise. The organization, management and distribution of corporately adjudicated data
with widespread use in the organization.
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
5
Master Data Classification Considerations
 Behavior
 Life Cycle
 Complexity
 Value
 Volatility
 Reuse
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
6
Master Data - Behavior
 Can be described by the way it interacts with other data
 Master data is almost always involved with transactional data
 Often a noun/verb relationship between the master data item and the transaction
 Master data are the nouns
• Customer
• Product
 Transactional data capture the verbs
• Customer places order
• Product sold on order
 Same type of relationships are shared between facts and dimensions in a data
warehouse
 Master data are the dimensions
 Transactions are the facts
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
7
Master Data - Lifecycle
 Describes how a master data element is created, read, updated, deleted (CRUD)
 Many factors come into play
 Business rules
 Business processes
 Applications
 There may be more than 1 way a particular master data element is created
 Need to model:
 Business process
 Data lineage
• Data flow
• Integration
• Include ExtractTransform and Load (ETL) for data warehouse/data marts and staging areas
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
8
Business Process & Data CRUD
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
9
Master Data – Complexity, Value
 Complexity
 Very simple entities are rarely a challenge to manage
 The less complex an element, the less likely the need to manage change
• Likely not master data elements
• Possibly reference data
• States/Provinces
• Units of measure
• Classification references
 Value
 Value and complexity interact
 The higher value a data element is to an organization the more likely it will be considered
master data
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
10
Master Data - Volatility
 Level of change in characteristics describing a master data element
 Frequent change = high volatility
 Infrequent change = low volatility
 Sometimes referred to as stability
 Frequent change = unstable
 Infrequent change = stable
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
11
Master Data - Reuse
 Master data elements are often shared across a number of systems
 Can lead to inconsistency and errors
 Multiple systems
 Which is the “version of truth”
 Spreadsheets
 Private data stores
 An error in master data can cause errors in
 All the transactions that use it
 All the applications that use it
 All reports and analytics that use it
 This is one of the primary reasons for “Master Data Management”
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
12
Data Lineage
 Source of truth
 Chain of custody
 Transformations
12
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
13
Data Lineage
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
14
What is Master Data Management?
 The processes, tools and technology required to create and maintain consistent and
accurate lists of master data
 Includes both creating and maintaining master data
 Often requires fundamental changes in business process
 Not just a technological problem
 Some of the most difficult issues are more political than technical
 Organization wide MDM may be difficult
 Many organizations begin with critical, high value elements
 Grow and expand
 MDM is not a project
 Ongoing
 Continuous improvement
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
15
MDM Activities
 Identify sources of master data
 Identify the producers and
consumers of the master data
 Collect and analyze metadata about
for your master data
 Appoint data stewards
 Implement a data-governance
program and council
 Develop the master-data model
 Choose a toolset
 Design the infrastructure
 Generate and test the master data
 Modify the producing and consuming
systems
 Be sure to incorporate versioning and
auditing
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
16
Importance of data models
 Full Specification
 Logical
 Physical
 Descriptive metadata
 Names
 Definitions (data dictionary)
 Notes
 Implementation characteristics
 Data types
 Keys
 Indexes
 Views
 Business Rules
 Relationships (referential constraints)
 Value Restrictions (constraints)
 Security Classifications + Rules
 Governance Metadata
 Master Data Management classes
 Data Quality classifications
 Retention policies
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
17
Data Dictionary – Metadata Extensions
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
18
ER/Studio – Metadata Attachment Setup
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
19
Universal Mappings
 Ability to link “like” or related objects
 Within same model file
 Across separate model files
 Entity/Table level
 Attribute/Column level
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
20
Universal Mappings
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
21
Data Quality (DMBOK)
 Data Quality
 The degree to which data is accurate, complete, timely, consistent with all requirements
and business rules, and relevant for a given use.
 Information Quality
 The degree to which information consistently meets the requirements and expectations of
knowledge workers in performing their jobs.
 In the context of a specific use, the degree to which information is meet the requirements
and expectations for that use.
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
22
Data Quality
 Accuracy
 Timeliness
 Completeness
 Consistency
 Relevance
 Fitness For Use
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
23
Poor Data Quality Implications
 Costs a typical company the equivalent of 15% to 20% of revenue
 Estimated by US Insurance Data Management Association
 Low Quality = Low Efficiency
 It is insidious – most data quality issues are hidden in day to day work
 From time to time, a small amount of bad data leads to a disaster of epic
proportions
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
24
Poor data quality isn’t a new problem
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
25
Mitigation Best Practice
 Adopt the philosophy of prevention
 Show thought leadership
 Be accountable at the points of data creation
 Measure, control, improve
 Establish data culture
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
26
Change Management Tasks & Requirements
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
27
Change Management & Audit
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
28
Providing Meaning: Business Glossary
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
29
Glossary Integration
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
30
Data Maturity
Level 0 1 2 3 4 5
Description None Initial Managed Standardized Advanced Optimized
Data Governance None Project Level Program Level Division Level Cross Divisional Enterprise Wide
Master Data Management
no formal master
data clasification
Non-integrated
master data
Integrated, shared
master data
repository
Data Management Services
Master data stewards
established
Data stewardship
council
Data Integration
ad-hoc, point to
point
Reactive, point-to-
point interfaces,
some common tools,
lack of standards
common integration
platform, design
patterns
Middleware utilization:
service bus, canonical
model, business rules,
repository
Data Excellence
Centre (education
and training)
Data Excellence
embedded in
corporate culture
Data Quality
Silos, scattered data,
inconsistencies
accepted
Recognition of
inconsistecies but no
management plan to
address
Data cleansing at
consumption in
order to attempt
data quality
improvement
Data Quality KPI's and
conformance visibility,
some cleansing at source.
Prevention approach
to data quality
Full data quality
management
practice
Behaviour
Unaware /
Denial
Chaotic Reactive Stable Proactive Predictive
Continuous Improvement
Introduction Expanson Transformation
Technology &
Infrastructure
Information &
Strategic Business
Enablement
Primary IS Focus
HIGH LOWRisk
LOW HIGHValue Generation
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
31
Summary
 Master Data Management and Data Quality are vital aspects of Data Governance
 Master Data Characteristics
 Behavior
 Lifecycle
 Complexity
 Volatility
 Reuse
 MDM is an ongoing, continuous improvement discipline, not a project
 Data models & metadata constitute the blueprint for data governance
 Change management and auditability is paramount for compliance
 Integrated business glossaries provide definition and context
 Achieving data maturity is a journey
Topics
 Click to edit Master text styles
• Second level
• Third level
− Fourth level
• Fifth level
32
Q&A
Join the discussion: http://community.idera.com/
You can find me at:
ron.huizenga@idera.com
@DataAviator

More Related Content

Viewers also liked

CONNEXTWorking Oct 2016 - Job interview
CONNEXTWorking Oct 2016 - Job interviewCONNEXTWorking Oct 2016 - Job interview
CONNEXTWorking Oct 2016 - Job interviewBC Talents
 
Imaging Clinical Decision Support: The Medicare Mandate - Prepared for Nation...
Imaging Clinical Decision Support: The Medicare Mandate - Prepared for Nation...Imaging Clinical Decision Support: The Medicare Mandate - Prepared for Nation...
Imaging Clinical Decision Support: The Medicare Mandate - Prepared for Nation...Mick Brown
 
Hydro80 fr-2016-hd1460620251
Hydro80 fr-2016-hd1460620251Hydro80 fr-2016-hd1460620251
Hydro80 fr-2016-hd1460620251saxhorn2
 
Aula 4 estrutura literária - lucas 18. 18-30
Aula 4   estrutura literária - lucas 18. 18-30Aula 4   estrutura literária - lucas 18. 18-30
Aula 4 estrutura literária - lucas 18. 18-30Ipabr Limesp
 
2017 circular 1ª ronda liga nacional cadete v.1
2017 circular 1ª ronda liga nacional cadete v.12017 circular 1ª ronda liga nacional cadete v.1
2017 circular 1ª ronda liga nacional cadete v.1Luis Gomez
 
Logica proposicional
Logica proposicionalLogica proposicional
Logica proposicionalcepecole
 
Photogrammetry for Games
Photogrammetry for GamesPhotogrammetry for Games
Photogrammetry for Gamestravytravy
 
What have i learnt from target audience feedback
What have i learnt from target audience feedback What have i learnt from target audience feedback
What have i learnt from target audience feedback harryswaine
 
Revista Mujer Patria Roja Marzo 2017
Revista Mujer Patria Roja Marzo 2017Revista Mujer Patria Roja Marzo 2017
Revista Mujer Patria Roja Marzo 2017Tania Roja
 

Viewers also liked (17)

CONNEXTWorking Oct 2016 - Job interview
CONNEXTWorking Oct 2016 - Job interviewCONNEXTWorking Oct 2016 - Job interview
CONNEXTWorking Oct 2016 - Job interview
 
Delito Informático
Delito InformáticoDelito Informático
Delito Informático
 
Imaging Clinical Decision Support: The Medicare Mandate - Prepared for Nation...
Imaging Clinical Decision Support: The Medicare Mandate - Prepared for Nation...Imaging Clinical Decision Support: The Medicare Mandate - Prepared for Nation...
Imaging Clinical Decision Support: The Medicare Mandate - Prepared for Nation...
 
Lesson 03
Lesson 03Lesson 03
Lesson 03
 
Media evaluation q6
Media evaluation  q6Media evaluation  q6
Media evaluation q6
 
El Coltan
El ColtanEl Coltan
El Coltan
 
Hydro80 fr-2016-hd1460620251
Hydro80 fr-2016-hd1460620251Hydro80 fr-2016-hd1460620251
Hydro80 fr-2016-hd1460620251
 
Bc5116 02
Bc5116 02Bc5116 02
Bc5116 02
 
Ebt centec
Ebt   centecEbt   centec
Ebt centec
 
Aula 4 estrutura literária - lucas 18. 18-30
Aula 4   estrutura literária - lucas 18. 18-30Aula 4   estrutura literária - lucas 18. 18-30
Aula 4 estrutura literária - lucas 18. 18-30
 
Ley de titulos_valores
Ley de titulos_valoresLey de titulos_valores
Ley de titulos_valores
 
2017 circular 1ª ronda liga nacional cadete v.1
2017 circular 1ª ronda liga nacional cadete v.12017 circular 1ª ronda liga nacional cadete v.1
2017 circular 1ª ronda liga nacional cadete v.1
 
Ganesh murti nagar.
Ganesh murti nagar.Ganesh murti nagar.
Ganesh murti nagar.
 
Logica proposicional
Logica proposicionalLogica proposicional
Logica proposicional
 
Photogrammetry for Games
Photogrammetry for GamesPhotogrammetry for Games
Photogrammetry for Games
 
What have i learnt from target audience feedback
What have i learnt from target audience feedback What have i learnt from target audience feedback
What have i learnt from target audience feedback
 
Revista Mujer Patria Roja Marzo 2017
Revista Mujer Patria Roja Marzo 2017Revista Mujer Patria Roja Marzo 2017
Revista Mujer Patria Roja Marzo 2017
 

Similar to Quack Chat: Diving into Data Governance

Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...IDERA Software
 
Introduction to Data Management Maturity Models
Introduction to Data Management Maturity ModelsIntroduction to Data Management Maturity Models
Introduction to Data Management Maturity ModelsKingland
 
Geek Sync | Database People and DevOps: The Fundamentals
Geek Sync | Database People and DevOps: The FundamentalsGeek Sync | Database People and DevOps: The Fundamentals
Geek Sync | Database People and DevOps: The FundamentalsIDERA Software
 
Data Governance: From speed dating to lifelong partnership
Data Governance: From speed dating to lifelong partnershipData Governance: From speed dating to lifelong partnership
Data Governance: From speed dating to lifelong partnershipPrecisely
 
Creating enterprise standards 09302010
Creating enterprise standards 09302010Creating enterprise standards 09302010
Creating enterprise standards 09302010ERwin Modeling
 
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...DATAVERSITY
 
Enterprise Data Governance for Financial Institutions
Enterprise Data Governance for Financial InstitutionsEnterprise Data Governance for Financial Institutions
Enterprise Data Governance for Financial InstitutionsSheldon McCarthy
 
IT6701 Information Management - Unit III
IT6701 Information Management - Unit IIIIT6701 Information Management - Unit III
IT6701 Information Management - Unit IIIpkaviya
 
Enterprise Data World Webinars: Master Data Management: Ensuring Value is Del...
Enterprise Data World Webinars: Master Data Management: Ensuring Value is Del...Enterprise Data World Webinars: Master Data Management: Ensuring Value is Del...
Enterprise Data World Webinars: Master Data Management: Ensuring Value is Del...DATAVERSITY
 
The Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data MindThe Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data MindDATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesChristopher Eaker
 
Geek Sync I Does Data Modeling Have Business Value?
Geek Sync I Does Data Modeling Have Business Value?Geek Sync I Does Data Modeling Have Business Value?
Geek Sync I Does Data Modeling Have Business Value?IDERA Software
 
Geek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change ManagementGeek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change ManagementIDERA Software
 
Data Governance Maturity Levels
Data Governance Maturity LevelsData Governance Maturity Levels
Data Governance Maturity LevelsSowmya Kandregula
 
You Need a Data Catalog. Do You Know Why?
 You Need a Data Catalog. Do You Know Why? You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?Precisely
 
Chapter 4 Organizational Aspects of Data Management.ppt
Chapter 4 Organizational Aspects of Data Management.pptChapter 4 Organizational Aspects of Data Management.ppt
Chapter 4 Organizational Aspects of Data Management.pptAnasSamara3
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?Precisely
 

Similar to Quack Chat: Diving into Data Governance (20)

Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
Geek Sync | Data Architecture and Data Governance: A Powerful Data Management...
 
Introduction to Data Management Maturity Models
Introduction to Data Management Maturity ModelsIntroduction to Data Management Maturity Models
Introduction to Data Management Maturity Models
 
Geek Sync | Database People and DevOps: The Fundamentals
Geek Sync | Database People and DevOps: The FundamentalsGeek Sync | Database People and DevOps: The Fundamentals
Geek Sync | Database People and DevOps: The Fundamentals
 
Data Governance: From speed dating to lifelong partnership
Data Governance: From speed dating to lifelong partnershipData Governance: From speed dating to lifelong partnership
Data Governance: From speed dating to lifelong partnership
 
Creating enterprise standards 09302010
Creating enterprise standards 09302010Creating enterprise standards 09302010
Creating enterprise standards 09302010
 
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
Conformed Dimensions of Data Quality – An Organized Approach to Data Quality ...
 
Enterprise Data Governance for Financial Institutions
Enterprise Data Governance for Financial InstitutionsEnterprise Data Governance for Financial Institutions
Enterprise Data Governance for Financial Institutions
 
IT6701 Information Management - Unit III
IT6701 Information Management - Unit IIIIT6701 Information Management - Unit III
IT6701 Information Management - Unit III
 
Enterprise Data World Webinars: Master Data Management: Ensuring Value is Del...
Enterprise Data World Webinars: Master Data Management: Ensuring Value is Del...Enterprise Data World Webinars: Master Data Management: Ensuring Value is Del...
Enterprise Data World Webinars: Master Data Management: Ensuring Value is Del...
 
Chapter 05
Chapter 05Chapter 05
Chapter 05
 
The Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data MindThe Importance of MDM - Eternal Management of the Data Mind
The Importance of MDM - Eternal Management of the Data Mind
 
Intro to Data Management
Intro to Data ManagementIntro to Data Management
Intro to Data Management
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
Geek Sync I Does Data Modeling Have Business Value?
Geek Sync I Does Data Modeling Have Business Value?Geek Sync I Does Data Modeling Have Business Value?
Geek Sync I Does Data Modeling Have Business Value?
 
Geek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change ManagementGeek Sync I The Importance of Data Model Change Management
Geek Sync I The Importance of Data Model Change Management
 
Data Governance Maturity Levels
Data Governance Maturity LevelsData Governance Maturity Levels
Data Governance Maturity Levels
 
You Need a Data Catalog. Do You Know Why?
 You Need a Data Catalog. Do You Know Why? You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 
Chapter 4 Organizational Aspects of Data Management.ppt
Chapter 4 Organizational Aspects of Data Management.pptChapter 4 Organizational Aspects of Data Management.ppt
Chapter 4 Organizational Aspects of Data Management.ppt
 
Chap005
Chap005Chap005
Chap005
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 

More from IDERA Software

The role of the database administrator (DBA) in 2020: Changes, challenges, an...
The role of the database administrator (DBA) in 2020: Changes, challenges, an...The role of the database administrator (DBA) in 2020: Changes, challenges, an...
The role of the database administrator (DBA) in 2020: Changes, challenges, an...IDERA Software
 
Problems and solutions for migrating databases to the cloud
Problems and solutions for migrating databases to the cloudProblems and solutions for migrating databases to the cloud
Problems and solutions for migrating databases to the cloudIDERA Software
 
Public cloud uses and limitations
Public cloud uses and limitationsPublic cloud uses and limitations
Public cloud uses and limitationsIDERA Software
 
Optimize the performance, cost, and value of databases.pptx
Optimize the performance, cost, and value of databases.pptxOptimize the performance, cost, and value of databases.pptx
Optimize the performance, cost, and value of databases.pptxIDERA Software
 
Monitor cloud database with SQL Diagnostic Manager for SQL Server
Monitor cloud database with SQL Diagnostic Manager for SQL ServerMonitor cloud database with SQL Diagnostic Manager for SQL Server
Monitor cloud database with SQL Diagnostic Manager for SQL ServerIDERA Software
 
Database administrators (dbas) face increasing pressure to monitor databases
Database administrators (dbas) face increasing pressure to monitor databasesDatabase administrators (dbas) face increasing pressure to monitor databases
Database administrators (dbas) face increasing pressure to monitor databasesIDERA Software
 
Six tips for cutting sql server licensing costs
Six tips for cutting sql server licensing costsSix tips for cutting sql server licensing costs
Six tips for cutting sql server licensing costsIDERA Software
 
Idera live 2021: The Power of Abstraction by Steve Hoberman
Idera live 2021:  The Power of Abstraction by Steve HobermanIdera live 2021:  The Power of Abstraction by Steve Hoberman
Idera live 2021: The Power of Abstraction by Steve HobermanIDERA Software
 
Idera live 2021: Why Data Lakes are Critical for AI, ML, and IoT By Brian Flug
Idera live 2021:  Why Data Lakes are Critical for AI, ML, and IoT  By Brian FlugIdera live 2021:  Why Data Lakes are Critical for AI, ML, and IoT  By Brian Flug
Idera live 2021: Why Data Lakes are Critical for AI, ML, and IoT By Brian FlugIDERA Software
 
Idera live 2021: Will Data Vault add Value to Your Data Warehouse? 3 Signs th...
Idera live 2021: Will Data Vault add Value to Your Data Warehouse? 3 Signs th...Idera live 2021: Will Data Vault add Value to Your Data Warehouse? 3 Signs th...
Idera live 2021: Will Data Vault add Value to Your Data Warehouse? 3 Signs th...IDERA Software
 
Idera live 2021: Managing Digital Transformation on a Budget by Bert Scalzo
Idera live 2021:  Managing Digital Transformation on a Budget by Bert ScalzoIdera live 2021:  Managing Digital Transformation on a Budget by Bert Scalzo
Idera live 2021: Managing Digital Transformation on a Budget by Bert ScalzoIDERA Software
 
Idera live 2021: Keynote Presentation The Future of Data is The Data Cloud b...
Idera live 2021:  Keynote Presentation The Future of Data is The Data Cloud b...Idera live 2021:  Keynote Presentation The Future of Data is The Data Cloud b...
Idera live 2021: Keynote Presentation The Future of Data is The Data Cloud b...IDERA Software
 
Idera live 2021: Managing Databases in the Cloud - the First Step, a Succes...
Idera live 2021:   Managing Databases in the Cloud - the First Step, a Succes...Idera live 2021:   Managing Databases in the Cloud - the First Step, a Succes...
Idera live 2021: Managing Databases in the Cloud - the First Step, a Succes...IDERA Software
 
Idera live 2021: Database Auditing - on-Premises and in the Cloud by Craig M...
Idera live 2021:  Database Auditing - on-Premises and in the Cloud by Craig M...Idera live 2021:  Database Auditing - on-Premises and in the Cloud by Craig M...
Idera live 2021: Database Auditing - on-Premises and in the Cloud by Craig M...IDERA Software
 
Idera live 2021: Performance Tuning Azure SQL Database by Monica Rathbun
Idera live 2021:  Performance Tuning Azure SQL Database by Monica RathbunIdera live 2021:  Performance Tuning Azure SQL Database by Monica Rathbun
Idera live 2021: Performance Tuning Azure SQL Database by Monica RathbunIDERA Software
 
Geek Sync | How to Be the DBA When You Don't Have a DBA - Eric Cobb | IDERA
Geek Sync | How to Be the DBA When You Don't Have a DBA - Eric Cobb | IDERAGeek Sync | How to Be the DBA When You Don't Have a DBA - Eric Cobb | IDERA
Geek Sync | How to Be the DBA When You Don't Have a DBA - Eric Cobb | IDERAIDERA Software
 
How Users of a Performance Monitoring Tool Can Benefit from an Inventory Mana...
How Users of a Performance Monitoring Tool Can Benefit from an Inventory Mana...How Users of a Performance Monitoring Tool Can Benefit from an Inventory Mana...
How Users of a Performance Monitoring Tool Can Benefit from an Inventory Mana...IDERA Software
 
Benefits of Third Party Tools for MySQL | IDERA
Benefits of Third Party Tools for MySQL | IDERABenefits of Third Party Tools for MySQL | IDERA
Benefits of Third Party Tools for MySQL | IDERAIDERA Software
 
Achieve More with Less Resources | IDERA
Achieve More with Less Resources | IDERAAchieve More with Less Resources | IDERA
Achieve More with Less Resources | IDERAIDERA Software
 
Benefits of SQL Server 2017 and 2019 | IDERA
Benefits of SQL Server 2017 and 2019 | IDERABenefits of SQL Server 2017 and 2019 | IDERA
Benefits of SQL Server 2017 and 2019 | IDERAIDERA Software
 

More from IDERA Software (20)

The role of the database administrator (DBA) in 2020: Changes, challenges, an...
The role of the database administrator (DBA) in 2020: Changes, challenges, an...The role of the database administrator (DBA) in 2020: Changes, challenges, an...
The role of the database administrator (DBA) in 2020: Changes, challenges, an...
 
Problems and solutions for migrating databases to the cloud
Problems and solutions for migrating databases to the cloudProblems and solutions for migrating databases to the cloud
Problems and solutions for migrating databases to the cloud
 
Public cloud uses and limitations
Public cloud uses and limitationsPublic cloud uses and limitations
Public cloud uses and limitations
 
Optimize the performance, cost, and value of databases.pptx
Optimize the performance, cost, and value of databases.pptxOptimize the performance, cost, and value of databases.pptx
Optimize the performance, cost, and value of databases.pptx
 
Monitor cloud database with SQL Diagnostic Manager for SQL Server
Monitor cloud database with SQL Diagnostic Manager for SQL ServerMonitor cloud database with SQL Diagnostic Manager for SQL Server
Monitor cloud database with SQL Diagnostic Manager for SQL Server
 
Database administrators (dbas) face increasing pressure to monitor databases
Database administrators (dbas) face increasing pressure to monitor databasesDatabase administrators (dbas) face increasing pressure to monitor databases
Database administrators (dbas) face increasing pressure to monitor databases
 
Six tips for cutting sql server licensing costs
Six tips for cutting sql server licensing costsSix tips for cutting sql server licensing costs
Six tips for cutting sql server licensing costs
 
Idera live 2021: The Power of Abstraction by Steve Hoberman
Idera live 2021:  The Power of Abstraction by Steve HobermanIdera live 2021:  The Power of Abstraction by Steve Hoberman
Idera live 2021: The Power of Abstraction by Steve Hoberman
 
Idera live 2021: Why Data Lakes are Critical for AI, ML, and IoT By Brian Flug
Idera live 2021:  Why Data Lakes are Critical for AI, ML, and IoT  By Brian FlugIdera live 2021:  Why Data Lakes are Critical for AI, ML, and IoT  By Brian Flug
Idera live 2021: Why Data Lakes are Critical for AI, ML, and IoT By Brian Flug
 
Idera live 2021: Will Data Vault add Value to Your Data Warehouse? 3 Signs th...
Idera live 2021: Will Data Vault add Value to Your Data Warehouse? 3 Signs th...Idera live 2021: Will Data Vault add Value to Your Data Warehouse? 3 Signs th...
Idera live 2021: Will Data Vault add Value to Your Data Warehouse? 3 Signs th...
 
Idera live 2021: Managing Digital Transformation on a Budget by Bert Scalzo
Idera live 2021:  Managing Digital Transformation on a Budget by Bert ScalzoIdera live 2021:  Managing Digital Transformation on a Budget by Bert Scalzo
Idera live 2021: Managing Digital Transformation on a Budget by Bert Scalzo
 
Idera live 2021: Keynote Presentation The Future of Data is The Data Cloud b...
Idera live 2021:  Keynote Presentation The Future of Data is The Data Cloud b...Idera live 2021:  Keynote Presentation The Future of Data is The Data Cloud b...
Idera live 2021: Keynote Presentation The Future of Data is The Data Cloud b...
 
Idera live 2021: Managing Databases in the Cloud - the First Step, a Succes...
Idera live 2021:   Managing Databases in the Cloud - the First Step, a Succes...Idera live 2021:   Managing Databases in the Cloud - the First Step, a Succes...
Idera live 2021: Managing Databases in the Cloud - the First Step, a Succes...
 
Idera live 2021: Database Auditing - on-Premises and in the Cloud by Craig M...
Idera live 2021:  Database Auditing - on-Premises and in the Cloud by Craig M...Idera live 2021:  Database Auditing - on-Premises and in the Cloud by Craig M...
Idera live 2021: Database Auditing - on-Premises and in the Cloud by Craig M...
 
Idera live 2021: Performance Tuning Azure SQL Database by Monica Rathbun
Idera live 2021:  Performance Tuning Azure SQL Database by Monica RathbunIdera live 2021:  Performance Tuning Azure SQL Database by Monica Rathbun
Idera live 2021: Performance Tuning Azure SQL Database by Monica Rathbun
 
Geek Sync | How to Be the DBA When You Don't Have a DBA - Eric Cobb | IDERA
Geek Sync | How to Be the DBA When You Don't Have a DBA - Eric Cobb | IDERAGeek Sync | How to Be the DBA When You Don't Have a DBA - Eric Cobb | IDERA
Geek Sync | How to Be the DBA When You Don't Have a DBA - Eric Cobb | IDERA
 
How Users of a Performance Monitoring Tool Can Benefit from an Inventory Mana...
How Users of a Performance Monitoring Tool Can Benefit from an Inventory Mana...How Users of a Performance Monitoring Tool Can Benefit from an Inventory Mana...
How Users of a Performance Monitoring Tool Can Benefit from an Inventory Mana...
 
Benefits of Third Party Tools for MySQL | IDERA
Benefits of Third Party Tools for MySQL | IDERABenefits of Third Party Tools for MySQL | IDERA
Benefits of Third Party Tools for MySQL | IDERA
 
Achieve More with Less Resources | IDERA
Achieve More with Less Resources | IDERAAchieve More with Less Resources | IDERA
Achieve More with Less Resources | IDERA
 
Benefits of SQL Server 2017 and 2019 | IDERA
Benefits of SQL Server 2017 and 2019 | IDERABenefits of SQL Server 2017 and 2019 | IDERA
Benefits of SQL Server 2017 and 2019 | IDERA
 

Recently uploaded

BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 

Recently uploaded (20)

BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 

Quack Chat: Diving into Data Governance

  • 1. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level Diving into Data Governance March 14, 2017 Ron Huizenga Senior Product Manager, EnterpriseArchitecture & Modeling @DataAviator
  • 2. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 2 ER/Studio Enterprise Team Edition
  • 3. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 3 Agenda  Governance Overview  Definitions  Master Data  Data lineage & life cycle  Master Data Management (MDM)  Importance of Data Models  Data quality  Change Management & Audit  Business Glossaries  Data Maturity Data Governance Data Architecture Management Data Development Database Operations Management Data Security Management Reference & Master Data Management Data Warehousing & Business Intelligence Management Document & Content Management Metadata Management Data Quality Management
  • 4. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 4 DMBOK: Definitions  Data Governance  The exercise of authority, control and shared decision making (planning, monitoring and enforcement) over the management of data assets.  Master Data  Synonymous with reference data.The data that provides the context for transaction data. It includes the details (definitions and identifiers) of internal and external objects involved in business transactions. Includes data about customers, products, employees, vendors, and controlled domains (code values).  Master Data Management  Processes that ensure that reference data is kept up to date and coordinated across an enterprise. The organization, management and distribution of corporately adjudicated data with widespread use in the organization.
  • 5. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 5 Master Data Classification Considerations  Behavior  Life Cycle  Complexity  Value  Volatility  Reuse
  • 6. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 6 Master Data - Behavior  Can be described by the way it interacts with other data  Master data is almost always involved with transactional data  Often a noun/verb relationship between the master data item and the transaction  Master data are the nouns • Customer • Product  Transactional data capture the verbs • Customer places order • Product sold on order  Same type of relationships are shared between facts and dimensions in a data warehouse  Master data are the dimensions  Transactions are the facts
  • 7. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 7 Master Data - Lifecycle  Describes how a master data element is created, read, updated, deleted (CRUD)  Many factors come into play  Business rules  Business processes  Applications  There may be more than 1 way a particular master data element is created  Need to model:  Business process  Data lineage • Data flow • Integration • Include ExtractTransform and Load (ETL) for data warehouse/data marts and staging areas
  • 8. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 8 Business Process & Data CRUD
  • 9. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 9 Master Data – Complexity, Value  Complexity  Very simple entities are rarely a challenge to manage  The less complex an element, the less likely the need to manage change • Likely not master data elements • Possibly reference data • States/Provinces • Units of measure • Classification references  Value  Value and complexity interact  The higher value a data element is to an organization the more likely it will be considered master data
  • 10. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 10 Master Data - Volatility  Level of change in characteristics describing a master data element  Frequent change = high volatility  Infrequent change = low volatility  Sometimes referred to as stability  Frequent change = unstable  Infrequent change = stable
  • 11. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 11 Master Data - Reuse  Master data elements are often shared across a number of systems  Can lead to inconsistency and errors  Multiple systems  Which is the “version of truth”  Spreadsheets  Private data stores  An error in master data can cause errors in  All the transactions that use it  All the applications that use it  All reports and analytics that use it  This is one of the primary reasons for “Master Data Management”
  • 12. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 12 Data Lineage  Source of truth  Chain of custody  Transformations 12
  • 13. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 13 Data Lineage
  • 14. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 14 What is Master Data Management?  The processes, tools and technology required to create and maintain consistent and accurate lists of master data  Includes both creating and maintaining master data  Often requires fundamental changes in business process  Not just a technological problem  Some of the most difficult issues are more political than technical  Organization wide MDM may be difficult  Many organizations begin with critical, high value elements  Grow and expand  MDM is not a project  Ongoing  Continuous improvement
  • 15. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 15 MDM Activities  Identify sources of master data  Identify the producers and consumers of the master data  Collect and analyze metadata about for your master data  Appoint data stewards  Implement a data-governance program and council  Develop the master-data model  Choose a toolset  Design the infrastructure  Generate and test the master data  Modify the producing and consuming systems  Be sure to incorporate versioning and auditing
  • 16. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 16 Importance of data models  Full Specification  Logical  Physical  Descriptive metadata  Names  Definitions (data dictionary)  Notes  Implementation characteristics  Data types  Keys  Indexes  Views  Business Rules  Relationships (referential constraints)  Value Restrictions (constraints)  Security Classifications + Rules  Governance Metadata  Master Data Management classes  Data Quality classifications  Retention policies
  • 17. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 17 Data Dictionary – Metadata Extensions
  • 18. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 18 ER/Studio – Metadata Attachment Setup
  • 19. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 19 Universal Mappings  Ability to link “like” or related objects  Within same model file  Across separate model files  Entity/Table level  Attribute/Column level
  • 20. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 20 Universal Mappings
  • 21. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 21 Data Quality (DMBOK)  Data Quality  The degree to which data is accurate, complete, timely, consistent with all requirements and business rules, and relevant for a given use.  Information Quality  The degree to which information consistently meets the requirements and expectations of knowledge workers in performing their jobs.  In the context of a specific use, the degree to which information is meet the requirements and expectations for that use.
  • 22. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 22 Data Quality  Accuracy  Timeliness  Completeness  Consistency  Relevance  Fitness For Use
  • 23. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 23 Poor Data Quality Implications  Costs a typical company the equivalent of 15% to 20% of revenue  Estimated by US Insurance Data Management Association  Low Quality = Low Efficiency  It is insidious – most data quality issues are hidden in day to day work  From time to time, a small amount of bad data leads to a disaster of epic proportions
  • 24. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 24 Poor data quality isn’t a new problem
  • 25. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 25 Mitigation Best Practice  Adopt the philosophy of prevention  Show thought leadership  Be accountable at the points of data creation  Measure, control, improve  Establish data culture
  • 26. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 26 Change Management Tasks & Requirements
  • 27. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 27 Change Management & Audit
  • 28. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 28 Providing Meaning: Business Glossary
  • 29. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 29 Glossary Integration
  • 30. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 30 Data Maturity Level 0 1 2 3 4 5 Description None Initial Managed Standardized Advanced Optimized Data Governance None Project Level Program Level Division Level Cross Divisional Enterprise Wide Master Data Management no formal master data clasification Non-integrated master data Integrated, shared master data repository Data Management Services Master data stewards established Data stewardship council Data Integration ad-hoc, point to point Reactive, point-to- point interfaces, some common tools, lack of standards common integration platform, design patterns Middleware utilization: service bus, canonical model, business rules, repository Data Excellence Centre (education and training) Data Excellence embedded in corporate culture Data Quality Silos, scattered data, inconsistencies accepted Recognition of inconsistecies but no management plan to address Data cleansing at consumption in order to attempt data quality improvement Data Quality KPI's and conformance visibility, some cleansing at source. Prevention approach to data quality Full data quality management practice Behaviour Unaware / Denial Chaotic Reactive Stable Proactive Predictive Continuous Improvement Introduction Expanson Transformation Technology & Infrastructure Information & Strategic Business Enablement Primary IS Focus HIGH LOWRisk LOW HIGHValue Generation
  • 31. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 31 Summary  Master Data Management and Data Quality are vital aspects of Data Governance  Master Data Characteristics  Behavior  Lifecycle  Complexity  Volatility  Reuse  MDM is an ongoing, continuous improvement discipline, not a project  Data models & metadata constitute the blueprint for data governance  Change management and auditability is paramount for compliance  Integrated business glossaries provide definition and context  Achieving data maturity is a journey
  • 32. Topics  Click to edit Master text styles • Second level • Third level − Fourth level • Fifth level 32 Q&A Join the discussion: http://community.idera.com/ You can find me at: ron.huizenga@idera.com @DataAviator

Editor's Notes

  1. Master data can be described by the way that it interacts with other data. For example, in transaction systems, master data is almost always involved with transactional data. A customer buys a product. A vendor sells a part, and a partner delivers a crate of materials to a location. An employee is hierarchically related to their manager, who reports up through a manager (another employee). A product may be a part of multiple hierarchies describing their placement within a store. This relationship between master data and transactional data may be fundamentally viewed as a noun/verb relationship. Transactional data capture the verbs, such as sale, delivery, purchase, email, and revocation; master data are the nouns. This is the same relationship data-warehouse facts and dimensions share.
  2. Master data can be described by the way that it is created, read, updated, deleted, and searched. This life cycle is called the CRUD cycle and is different for different master-data element types and companies. For example, how a customer is created depends largely upon a company's business rules, industry segment, and data systems. One company may have multiple customer-creation vectors, such as through the Internet, directly through account representatives, or through outlet stores. Another company may only allow customers to be created through direct contact over the phone with its call center. Further, how a customer element is created is certainly different from how a vendor element is created. The following table illustrates the differing CRUD cycles for four common master-data subject areas.
  3. Simple entities, even valuable entities, are rarely a challenge to manage and are rarely considered master-data elements. The less complex an element, the less likely the need to manage change for that element. Typically, such assets are simply collected and tallied. For example, Fort Knox likely would not track information on each individual gold bar stored there, but rather only keep a count of them. The value of each gold bar is substantial, the cardinality high, and the lifespan long; yet, the complexity is low.
  4. While master data is typically less volatile than transactional data, entities with attributes that do not change at all typically do not require a master-data solution. For example, rare coins would seem to meet many of the criteria for a master-data treatment. A rare-coin collector would likely have many rare coins. So, cardinality is high. They are valuable. They are also complex. For example, rare coins have a history and description. There are attributes, such as condition of obverse, reverse, legend, inscription, rim, and field. There are other attributes, such as designer initials, edge design, layers, and portrait. Yet, rare coins do not need to be managed as a master-data item, because they don't change over time—or, at least, they don't change enough. There may need to be more information added, as the history of a particular coin is revealed or if certain attributes must be corrected. But, generally speaking, rare coins would not be managed through a master-data management system, because they are not volatile enough to warrant a solution.
  5. One of the primary drivers of master-data management is reuse. For example, in a simple world, the CRM system would manage everything about a customer and never need to share any information about the customer with other systems. However, in today's complex environments, customer information needs to be shared across multiple applications. That's where the trouble begins. Because—for a number of reasons—access to a master datum is not always available, people start storing master data in various locations, such as spreadsheets and application private stores. There are still reasons, such as data-quality degradation and decay, to manage master data that is not reused across the enterprise. However, if a master-data entity is reused in multiple systems, it's a sure bet that it should be managed with a master-data management system. To summarize, while it is simple to enumerate the various master-data entity types, it is sometimes more challenging to decide which data items in a company should be treated as master data. Often, data that does not normally comply with the definition for master data may need to be managed as such, and data that does comply with the definition may not. Ultimately, when deciding on what entity types should be treated as master data, it is better to categorize them in terms of their behavior and attributes within the context of the business needs than to rely on simple lists of entity types. Because it is used by multiple applications, an error in master data can cause errors in all the applications that use it. For example, an incorrect address in the customer master might mean orders, bills, and marketing literature are all sent to the wrong address. Similarly, an incorrect price on an item master can be a marketing disaster, and an incorrect account number in an Account Master can lead to huge fines or even jail time for the CEO—a career-limiting move for the person who made the mistake!
  6. In ER/Studio, metadata lineage is supported directly in the modeling tool through the “where used” and dependent-objects functionality. It is part of the metadata that can be published in Team Server. Within ER/Studio, data lineage is the ability to document data extraction, transformation and load parameters, which is sometimes referred to as source and target mapping. Data lineage enables you to document the movement of data from point A to point B, and any intermediate steps in between. This movement is sometimes referred to as Extraction, Transformation and Load (ETL). A model produced in ER/Studio can represent any point along the way. Data Architects need the ability to specify the "source" or "target" of data, down to the column/attribute level. Along with the metadata that defines the source and target mapping are rules for how the data is manipulated along the way.
  7. Master Data Management (MDM) as the technology, tools, and processes required to create and maintain consistent and accurate lists of master data. There are a couple things worth noting in this definition. One is that MDM is not just a technological problem. In many cases, fundamental changes to business process will be required to maintain clean master data, and some of the most difficult MDM issues are more political than technical. The second thing to note is that MDM includes both creating and maintaining master data. Investing a lot of time, money, and effort in creating a clean, consistent set of master data is a wasted effort unless the solution includes tools and processes to keep the master data clean and consistent as it is updated and expanded. While MDM is most effective when applied to all the master data in an organization, in many cases the risk and expense of an enterprise-wide effort are difficult to justify. It may be easier to start with a few key sources of Master Data and expand the effort, once success has been demonstrated and lessons have been learned. If you do start small, you should include an analysis of all the master data that you might eventually want to include, so you do not make design decisions or tool choices that will force you to start over when you try to incorporate a new data source. For example, if your initial Customer master implementation only includes the 10,000 customers your direct-sales force deals with, you don't want to make design decisions that will preclude adding your 10,000,000 Web customers later.
  8. Most MDM initiatives : Identify sources of master data. This step is usually a very revealing exercise. Some companies find they have dozens of databases containing customer data that the IT department did not know existed. Identify the producers and consumers of the master data. Which applications produce the master data identified in the first step, and—generally more difficult to determine—which applications use the master data. Depending on the approach you use for maintaining the master data, this step might not be necessary. For example, if all changes are detected and handled at the database level, it probably does not matter where the changes come from. Collect and analyze metadata about for your master data. For all the sources identified in step one, what are the entities and attributes of the data, and what do they mean? This should include attribute name, datatype, allowed values, constraints, default values, dependencies, and who owns the definition and maintenance of the data. The owner is the most important and often the hardest to determine. If you have a repository loaded with all your metadata, this step is an easy one. If you have to start from database tables and source code, this could be a significant effort. Appoint data stewards. These should be the people with the knowledge of the current source data and the ability to determine how to transform the source into the master-data format. In general, stewards should be appointed from the owners of each master-data source, the architects responsible for the MDM systems, and representatives from the business users of the master data. Implement a data-governance program and data-governance council. This group must have the knowledge and authority to make decisions on how the master data is maintained, what it contains, how long it is kept, and how changes are authorized and audited. Hundreds of decisions must be made in the course of a master-data project, and if there is not a well-defined decision-making body and process, the project can fail, because the politics prevent effective decision making. Develop the master-data model. Decide what the master records look like: what attributes are included, what size and datatype they are, what values are allowed, and so forth. This step should also include the mapping between the master-data model and the current data sources. This is normally both the most important and most difficult step in the process. If you try to make everybody happy by including all the source attributes in the master entity, you often end up with master data that is too complex and cumbersome to be useful. For example, if you cannot decide whether weight should be in pounds or kilograms, one approach would be to include both (WeightLb and WeightKg). While this might make people happy, you are wasting megabytes of storage for numbers that can be calculated in microseconds, as well as running the risk of creating inconsistent data (WeightLb = 5 and WeightKg = 5). While this is a pretty trivial example, a bigger issue would be maintaining multiple part numbers for the same part. As in any committee effort, there will be fights and deals resulting in sub-optimal decisions. It's important to work out the decision process, priorities, and final decision maker in advance, to make sure things run smoothly. Choose a toolset. You will need to buy or build tools to create the master lists by cleaning, transforming, and merging the source data. You will also need an infrastructure to use and maintain the master list. These functions are covered in detail later in the paper. You can use a single toolset from a single vendor for all of these functions, or you might want to take a best-of-breed approach. In general, the techniques to clean and merge data are different for different types of data, so there are not a lot of tools that span the whole range of master data. The two main categories of tools are Customer Data Integration (CDI) tools for creating the customer master and Product Information Management (PIM) tools for creating the product master. Some tools will do both, but generally they are better at one or the other. The toolset should also have support for finding and fixing data-quality issues and maintaining versions and hierarchies. Versioning is a critical feature, because understanding the history of a master-data record is vital to maintaining its quality and accuracy over time. For example, if a merge tool combines two records for John Smith in Boston, and you decide there really are two different John Smiths in Boston, you need to know what the records looked like before they were merged, in order to "unmerge" them. Design the infrastructure. Once you have clean, consistent master data, you will need to expose it to your applications and provide processes to manage and maintain it. This step is a big-enough issue, I devote a section to it later in the document. When this infrastructure is implemented, you will have a number of applications that will depend on it being available, so reliability and scalability are important considerations to include in your design. In most cases, you will have to implement significant parts of the infrastructure yourself, because it will be designed to fit into your current infrastructure, platforms, and applications. Generate and test the master data. This step is where you use the tools you have developed or purchased to merge your source data into your master-data list. This is often an iterative process requiring tinkering with rules and settings to get the matching right. This process also requires a lot of manual inspection to ensure that the results are correct and meet the requirements established for the project. No tool will get the matching done correctly 100 percent of the time, so you will have to weigh the consequences of false matches versus missed matches to determine how to configure the matching tools. False matches can lead to customer dissatisfaction, if bills are inaccurate or the wrong person is arrested. Too many missed matches make the master data less useful, because you are not getting the benefits you invested in MDM to get. Modify the producing and consuming systems. Depending on how your MDM implementation is designed, you might have to change the systems that produce, maintain, or consume master data to work with the new source of master data. If the master data is used in a system separate from the source systems—a data warehouse, for example—the source systems might not have to change. If the source systems are going to use the master data, however, there will likely be changes required. Either the source systems will have to access the new master data or the master data will have to be synchronized with the source systems, so that the source systems have a copy of the cleaned-up master data to use. If it's not possible to change one or more of the source systems, either that source system might not be able to use the master data or the master data will have to be integrated with the source system's database through external processes, such as triggers and SQL commands. The source systems generating new records should be changed to look up existing master record sets before creating new records or updating existing master records. This ensures that the quality of data being generated upstream is good, so that the MDM can function more efficiently and the application itself manages data quality. MDM should be leveraged not only as a system of record, but also as an application that promotes cleaner and more efficient handling of data across all applications in the enterprise. As part of MDM strategy, all three pillars of data management need to be looked into: data origination, data management, and data consumption. It is not possible to have a robust enterprise-level MDM strategy if any one of these aspects is ignored. Implement the maintenance processes. As we stated earlier, any MDM implementation must incorporate tools, processes, and people to maintain the quality of the data. All data must have a data steward who is responsible for ensuring the quality of the master data. The data steward is normally a business person who has knowledge of the data, can recognize incorrect data, and has the knowledge and authority to correct the issues. The MDM infrastructure should include tools that help the data steward recognize issues and simplify corrections. A good data-stewardship tool should point out questionable matches that were made—customers with different names and customer numbers that live at the same address, for example. The steward might also want to review items that were added as new, because the match criteria were close but below the threshold. It is important for the data steward to see the history of changes made to the data by the MDM systems, to isolate the source of errors and undo incorrect changes. Maintenance also includes the processes to pull changes and additions into the MDM system, and to distribute the cleansed data to the required places. Versioning and Auditing No matter how you manage your master data, it's important to be able to understand how the data got to the current state. For example, if a customer record was consolidated from two different merged records, you might need to know what the original records looked like, in case a data steward determines that the records were merged by mistake and really should be two different customers. The version management should include a simple interface for displaying versions and reverting all or part of a change to a previous version. The normal branching of versions and grouping of changes that source-control systems use can also be very useful for maintaining different derivation changes and reverting groups of changes to a previous branch. Data stewardship and compliance requirements will often include a way to determine who made each change and when it was made. To support these requirements, an MDM system should include a facility for auditing changes to the master data. In addition to keeping an audit log, the MDM system should include a simple way to find the particular change you are looking for. An MDM system can audit thousands of changes a day, so search and reporting facilities for the audit log are important.
  9. This image depicts the dictionary tab in ER/Studio, showing the attachments and Data Security Information tags. Beside it in the diagram view, a table is depicted, illustrating that they can also be depicted on model diagrams
  10. In ER/Studio, an attachment type is easily defined in the data dictionary and the scope of object types that it applies to can be specified. This is very powerful, as the attachment only needs to be defined once and can be applied to many different object types if necessary. Editing of the values is extremely easy as depicted by the text list below and a default value clearly declared. This will be clarified by a simple example: Let’s assume that we wish to create a metadata extension to specify the business value of specific entities and attributes. It will be used to prioritize data quality initiatives to ensure that the most important data is reviewed first. The list of allowed values is Critical, High, Medium and Low. In ER/Studio, the attachment type with list of values only needs to be created once. It can be associated with object types of entity and attribute. The values are selected and bound to entities and/or attributes as required.   Conversely, in ERwin separate user-defined properties must be set up: one for entities and another for attributes. They can be named the same, and the same list of allowed values must be separately set up for each, using different setup tabs. Because there is no correlation between the two, they must be evaluated separately for analytical purposes.
  11. Universal Mappings are the ability to link “like” or related objects within the same model file or across separate model files. A typical use case is linking the representations of the same real life business object that exist in different models. For example, let’s assume we are dealing with the concept of employees. Employee data may exist in many different databases across the organization. Once we have reverse engineered those databases, universal mappings would be used to link the tables (or corresponding entities) together. This provides traceability in “where used” functionality to find all instances of the object.
  12. Accuracy - The recorded value conforms to the real world fact or value. The most important characteristic of data quality Timeliness - The recorded value is not out of date. Data must be available in time to influence the decision (time frame varies based upon nature of decision) Completeness - The degree to which values are present in a data collection. Focuses on whether all values for all variables are recorded and retained Consistency - Representation of the data values is the same in all cases. Implies that there is no redundancy in the data store and that referential integrity is enforced. Relevance - Refers to the applicability of data in a particular issue by a particular user. Relevant data can be used directly in solving a business problem Fitness For Use - Data must be presented in a format that serves the user’s purpose and stated in terms familiar to that user. It is relative: what is useful to engineers may not be useful to administrators
  13. What is your organzation’s level of data maturity. Here are a few significant categories to look at. Scores may be different for every row in the chart Cleansing at point of consumption is expensive! It’s analogous to old school manufacturing where the focus was on inspection of final products to look for defects, rather than building quality into the process. A commitment to continuous improvement is essential to achieving higher levels of data maturity. Expand on the behavior row Explain the bottom 3 highlighted rows in order.