Designing Master Data Services for
Application Integration
David Loshin
Knowledge Integrity, Inc.
www.knowledge-integrity.com

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

1
Consolidation: The Typical Approach to MDM


Motivating factors have largely focused on consolidation of data
into a single repository rather than how shared master data is
to be used
Customer
Service

Finance

Sales

Marketing

Financial
Data

Customer
Data

Human
Resources

H/R
Data

Sales
Data

Legal
Data

Marketing
Data

Legal

Fulfillment
Data

Centralized
Repository

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

Fulfillment

?
2
Why Consolidation Should Not Drive MDM Integration

Loss of
Knowledge

Loss of
Meaning

Organizational
Misalignment

No Migration
Plan

No Process
Governance

Missing User
Input

“YADS”

Cloud Issues

Master Data
Consolidation

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

Shadow IT

3
Migration Strategy for Sharing Master Data


As the master registry/hub are brought online, there must be plans
for migrating functional applications to the MDM environment to
benefit

Finance

Sales

Marketing

Service

H/R

Transactions

Compliance

H/R
Data

Transaction
Data

Compliance
Data

Master Data

Financial
Data

Sales
Data

Marketing
Data

Service
Data

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

4
Consider Typical Master Data Use Cases




Presumes the existence of a master data repository, index, and
identity resolution capability
Support for operational and analytical activities, such as:







Satisfy data management and governance policies, such as:






Managed assignment of unique identifiers
Search and retrieve unified views of entity information
Managing relationships
Managing identifier cross-reference
Batch matching services
Duplicate analysis and elimination
Data standardization/cleansing

Enable Queries, Reporting, Analysis





Direct queries
JOINs against other resources
Feeding analytics applications
© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

5
Master Data Services Stack

Search/Retri
eve Entity

Assign
EID

Enrichment
Reconcile
Identities

Governance
Merge

Standardize/C
leanse

Bulk/Batch
Bulk CrossReference

Identity Services
Split

Bulk
Match/Resol
ution

Relationship
Management

Bulk
Update

Relationship Management

Generate EIDs

Find

ID X-Ref

Break

Relate

Data Management
Search

Retrieve

Update

Create

Deactivate

Matching/Identity Resolution
Data Access
Entity Indexes
Entity Repositories

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

Entity Relationships

6
Master Entity Repositories
Entity Indexes
Entity Repositories



Instantiated repositories






Entity Relationships

Profiles
Relationships
Index

Consider typical data interactions supporting enterprise use
of master data:





Ingestion
Profiling
Probabilistic matching
Internal master data services for management of repository
© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

7
Data Access

Data Access



Enable access to shared master data transparently to service
layers





Direct access to physical data repository
Virtualized/federated access to conceptual master repository

Query access



SELECTs
JOINs

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

8
Core Services
Governance
Merge

Identity Services
Split

Relationship Management

Generate EIDs

Find

ID X-Ref

Relate

Break

Data Management
Search

Retrieve

Update

Create

Deactivate

Matching/Identity Resolution



Matching/Identity Resolution




Data management








Algorithms for probabilistic matching
Identity search (probabilistic search, key-based search, similarity scoring)
Register/create a new master entity record
Access master entity record
Update master entity record
Deactivate a master entity record

Identity Services



Guarantee the generation of unique identifiers
Manage cross-reference of internal and external identifiers
© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

9
Core Services
Governance

Identity Services

Merge

Split

Relationship Management

Generate EIDs

Find

ID X-Ref

Relate

Break

Data Management
Search

Retrieve

Update

Create

Deactivate

Matching/Identity Resolution



Relationship management






Find all relationships associated with an entity
Relate two or more entities and associate nature of relationship
Break/deactivate a relationship

Governance



Merge two records when they can be determined to represent the same entity
Split a record into multiple records when it can be determined that there is a
false positive

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

10
Application Services

Search/Retri
eve Entity






Bulk
Match/Resol
ution

Invokes core search, retrieve, or create
Provide unique entity identifiers
Data standardization
Alignment of otherwise-assigned identifiers for cross-referencing
Cleansing

Batch Services






Standardize/C
leanse

Bulk/Batch
Bulk CrossReference

Enrichment




Enrichment
Reconcile
Identities

Assign EID




Relationship
Management

Search/Retrieve Entity




Assign
EID

Deliver result set of master records matching a provided set of candidate identities
Apply batch updates
Batch identity resolution

Relationship Management





Find all relationships associated with an entity
Relate two or more entities and associate nature of relationship (e.g. household)
Break/deactivate a relationship
© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

11

Bulk
Update
Delivering a Unified View Based on Consumer Rules


Data
Source 1

Master
Data
Index

Collection &
Assembly
Data
Source 2

…
Data
Source N

Virtualized
presentation of
federated data



The master index maps the
canonical representation of a
specific entity (beneficiary,
providers, etc.) to the locations
in the original sources (or
replicas of those source)
holding data instances that
have been linked via the
identity resolution processes
The accessibility of the different
sources and the ability to apply
those business rules at the data
user’s discretion will provide a
consolidated view that best
meets the data users’
requirements at the time the
data is reused

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

12
Master Data Facilitated via Federation/Virtualization
1
2
Master Data Services
3

Standardized Canonical Relational Views

8

4
Transform and Map

Master Index

Federated/Virtualized
5
Data Access

Parse and Standardize,
Normalize, Cleanse

7

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

13
Summary









Review the business process to understand uses of shared
master data
Identify integration points within business processes
Characterize usage scenarios
Determine required functionality
Assess existing systemic support for required functionality
Design application integration service APIs
Design and implement application integration services

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

14
Questions and Open Discussion


www.knowledge-integrity.com



If you have questions,
comments, or suggestions,
please contact me
David Loshin
301-754-6350
loshin@knowledge-integrity.com

www.dataqualitybook.com

www.mdmbook.com

© 2014 Knowledge Integrity, Inc.
www.knowledge-integrity.com
(301)754-6350

15

EDW Webinar: Designing Master Data Services for Application Integration

  • 1.
    Designing Master DataServices for Application Integration David Loshin Knowledge Integrity, Inc. www.knowledge-integrity.com © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 1
  • 2.
    Consolidation: The TypicalApproach to MDM  Motivating factors have largely focused on consolidation of data into a single repository rather than how shared master data is to be used Customer Service Finance Sales Marketing Financial Data Customer Data Human Resources H/R Data Sales Data Legal Data Marketing Data Legal Fulfillment Data Centralized Repository © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 Fulfillment ? 2
  • 3.
    Why Consolidation ShouldNot Drive MDM Integration Loss of Knowledge Loss of Meaning Organizational Misalignment No Migration Plan No Process Governance Missing User Input “YADS” Cloud Issues Master Data Consolidation © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 Shadow IT 3
  • 4.
    Migration Strategy forSharing Master Data  As the master registry/hub are brought online, there must be plans for migrating functional applications to the MDM environment to benefit Finance Sales Marketing Service H/R Transactions Compliance H/R Data Transaction Data Compliance Data Master Data Financial Data Sales Data Marketing Data Service Data © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 4
  • 5.
    Consider Typical MasterData Use Cases   Presumes the existence of a master data repository, index, and identity resolution capability Support for operational and analytical activities, such as:      Satisfy data management and governance policies, such as:     Managed assignment of unique identifiers Search and retrieve unified views of entity information Managing relationships Managing identifier cross-reference Batch matching services Duplicate analysis and elimination Data standardization/cleansing Enable Queries, Reporting, Analysis    Direct queries JOINs against other resources Feeding analytics applications © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 5
  • 6.
    Master Data ServicesStack Search/Retri eve Entity Assign EID Enrichment Reconcile Identities Governance Merge Standardize/C leanse Bulk/Batch Bulk CrossReference Identity Services Split Bulk Match/Resol ution Relationship Management Bulk Update Relationship Management Generate EIDs Find ID X-Ref Break Relate Data Management Search Retrieve Update Create Deactivate Matching/Identity Resolution Data Access Entity Indexes Entity Repositories © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 Entity Relationships 6
  • 7.
    Master Entity Repositories EntityIndexes Entity Repositories  Instantiated repositories     Entity Relationships Profiles Relationships Index Consider typical data interactions supporting enterprise use of master data:     Ingestion Profiling Probabilistic matching Internal master data services for management of repository © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 7
  • 8.
    Data Access Data Access  Enableaccess to shared master data transparently to service layers    Direct access to physical data repository Virtualized/federated access to conceptual master repository Query access   SELECTs JOINs © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 8
  • 9.
    Core Services Governance Merge Identity Services Split RelationshipManagement Generate EIDs Find ID X-Ref Relate Break Data Management Search Retrieve Update Create Deactivate Matching/Identity Resolution  Matching/Identity Resolution   Data management       Algorithms for probabilistic matching Identity search (probabilistic search, key-based search, similarity scoring) Register/create a new master entity record Access master entity record Update master entity record Deactivate a master entity record Identity Services   Guarantee the generation of unique identifiers Manage cross-reference of internal and external identifiers © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 9
  • 10.
    Core Services Governance Identity Services Merge Split RelationshipManagement Generate EIDs Find ID X-Ref Relate Break Data Management Search Retrieve Update Create Deactivate Matching/Identity Resolution  Relationship management     Find all relationships associated with an entity Relate two or more entities and associate nature of relationship Break/deactivate a relationship Governance   Merge two records when they can be determined to represent the same entity Split a record into multiple records when it can be determined that there is a false positive © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 10
  • 11.
    Application Services Search/Retri eve Entity    Bulk Match/Resol ution Invokescore search, retrieve, or create Provide unique entity identifiers Data standardization Alignment of otherwise-assigned identifiers for cross-referencing Cleansing Batch Services     Standardize/C leanse Bulk/Batch Bulk CrossReference Enrichment   Enrichment Reconcile Identities Assign EID   Relationship Management Search/Retrieve Entity   Assign EID Deliver result set of master records matching a provided set of candidate identities Apply batch updates Batch identity resolution Relationship Management    Find all relationships associated with an entity Relate two or more entities and associate nature of relationship (e.g. household) Break/deactivate a relationship © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 11 Bulk Update
  • 12.
    Delivering a UnifiedView Based on Consumer Rules  Data Source 1 Master Data Index Collection & Assembly Data Source 2 … Data Source N Virtualized presentation of federated data  The master index maps the canonical representation of a specific entity (beneficiary, providers, etc.) to the locations in the original sources (or replicas of those source) holding data instances that have been linked via the identity resolution processes The accessibility of the different sources and the ability to apply those business rules at the data user’s discretion will provide a consolidated view that best meets the data users’ requirements at the time the data is reused © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 12
  • 13.
    Master Data Facilitatedvia Federation/Virtualization 1 2 Master Data Services 3 Standardized Canonical Relational Views 8 4 Transform and Map Master Index Federated/Virtualized 5 Data Access Parse and Standardize, Normalize, Cleanse 7 © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 13
  • 14.
    Summary        Review the businessprocess to understand uses of shared master data Identify integration points within business processes Characterize usage scenarios Determine required functionality Assess existing systemic support for required functionality Design application integration service APIs Design and implement application integration services © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 14
  • 15.
    Questions and OpenDiscussion  www.knowledge-integrity.com  If you have questions, comments, or suggestions, please contact me David Loshin 301-754-6350 loshin@knowledge-integrity.com www.dataqualitybook.com www.mdmbook.com © 2014 Knowledge Integrity, Inc. www.knowledge-integrity.com (301)754-6350 15