SlideShare a Scribd company logo
1 of 57
Closed Loop in Enterprise
Information Management
Oliver Engels & Tillmann Eitelberg
Who we are
 Oliver:
 CEO of oh22data AG, German MS Gold Partner
 SQL MVP, Microsoft vTSP

 Tillmann:
 CTO of oh22information services GmbH

 Both:





PASS Germany Board Members
Regional Mentors for Germany
SQL Information Services Advisory Board Members
Data Quality Maniacs
Our Sponsors:
What Are Your Professional Development
Goals?
I want to take
the path from
DBA to Data
Analytics Guru

I want to
upgrade
my skills

I want to
give my
career a
competitive
edge

I want to expand
my network in the
business analytics
industry

Sound familiar? Get a head start and join us today at:

www.passbaconference.com
#passbac

Enjoy $150 off registration: use code CHM2D
Upcoming SQL Server events:
XXXIII Encontro da Comunidade SQLPort
Data Evento: 23 Abril 2013 - 18:30
Local do Evento: Auditório Microsoft, Parque das Nações, Lisboa

18:30 - Abertura e recepção.
19:10 - "Analyzing Twitter Data" - Niko Neugebauer (SQL Server MVP, Community Evangelist –
PASS)
20:15 - Coffee break
20:30 - "First Approach to SQL Server Analysis Services" - João Fialho (Consultor BI Independente)
21:30 - Sorteio de prémios

XXXIV Encontro da Comunidade SQLPort
Data Evento: 7 Maio 2013 - 19:00
Local do Evento: Porto
18:30 - Abertura e recepção.
19:00 - «Apresentação para Developers» - para definir
20:15 - Coffee break
20:30 - «Apresentação para definir» - para definir
21:30 - Sorteio de prémios
Volunteers:
 They spend their FREE time to give you this
event. (2 months per person)
 Because they are crazy.
 Because they want YOU
to learn from the BEST IN THE WORLD.
 If you see a guy with “STAFF” on their back –
buy them a beer, they deserve it.
Paulo Matos:
Paulo Borges:
João Fialho:
Bruno Basto:
Niko Neugebauer:
Take aways
 EIM. SQL IS. Data Curation…. what? Give
some explanations
 Understanding of the building bricks of EIM in
the Microsoft BI Stack: SSIS, DQS, MDS
 Closed loop: Bring’em all together
What’s possible
 If time allow: First impressions on Selfservice
ETL: Data Explorer Preview
Agenda





Definitions
Data Quality Services
Master Data Quality Services
Closed loop: SSIS, DQS, MDS Team up!
The situation
Definition:
 EIM: Enterprise Information Management
 Wiki:
 Enterprise information management combines
business intelligence (BI) and enterprise content
management (ECM)
 Where BI and ECM respectively manage structured
and unstructured information, EIM does not make this
"technical" distinction.
 It approaches the management of information from
the perspective of enterprise information strategy,
based on the needs of information workers.
Definition: SQL Information Services
 SQL Information Service charter:
Enrich enterprise data with the world’s data
Empower developers to build new services and
applications
Connecting with the
world’s data to turn data
into action

Vibrant marketplace ecosystem for the world’s
data

SQL Information Services
IT Pro

Knowledge Worker

Surface all
information as a
service to the
organization,
while maintaining
the right level of
control

Enable any user to find
reliable, trusted
information needed
to do their job

discover
secure
create

govern
clean
curate

publish

operationalize

recommend

transform

analyze

Developer
Immediate access to
the data and services
they need to build new
services and applications

Data Analyst
Democratize the broad
adoption of advanced
analytics to empower
businesses
SQL Information Services Portfolio
 Building the tools for Enterprise Information
Management
Integration
Services

BizTalk

Master
Data Services

Data
Quality Services

Data Explorer

Big Data

Azure
Data Market

Stream
Insight

Other
IS Tools
Data curation
 Data curation components for EIM
Data Quality Services

Master Data Services
Manage

Cleanse

SSIS/BizTalk
Integrate
Discover and
Access Data
and Services

PoC: Role definiton
Mash, Improve
Quality, Enrich
and Analyze

Share and
Collaborate

Information
Worker
Simplified, trusted
consumption of data

Data Steward
Data Management

ITProfessional
Service
Management

Provision, Deploy,
Maintain SLA

Publish
Add data sources to source
catalog

Investigate

Identify Data usage

Artifacts and data relations issues

Monitor usage

Govern
Assess, configure and oversee

Respond to
incidents

Manage Assets
Usage and Policy

Improve Quality of
Data and Metadata
Cleanse, Enrich, Curate

Build the plumbing,
Connect the assets
to the service
Enterprise Information Management

DQS: DATA QUALITY SERVICE

22
Data curation
DQS: Data Quality Services
 Main driver for data quality: Costs!

Data quality cost
Costs because of
bad data quality

Cost of optimizing
data quality

Direct

Prevention

Indirect

Discovery

Cleansing
DQS: Data Quality Services
 Microsoft's DQM approach:
Data Quality Services (DQS)
is a Knowledge-Driven data quality solution
enabling data stewards to easily improve the
quality of their data
 Easy = Information Worker Driven
 Knowledge driven =
 Capturing knowledge of good and bad
data in knowledge base
25
DQS: Data Quality Services
 Domain concept
 Domain (e.g. Street) has
 Domain values
(List of correct and incorrect values)
 Reference data
(External data references, e.g. D&B)
 Rules
(Proofing if data is valid or invalid)
 Termbased Relations
(Change abreviations)

26
Data Quality Services

DEMO

27
DQS: Data Quality Services
 Domain values
 List of values
 By Excel Import
 By knowledge
discovery
 By hand
 Correction values
 Invalid values
DQS: Data Quality Services
 Domain rules
 Regular Expressions
 Logical expressions
 Matching values

 Termbased relations

29
DQS: Data Quality Services
 Reference data (RDS)
 External cloud or on premise data streams
with enrichment functions
DQS: Data Quality Services
 Reference data (RDS)
 DQS delivers the address, RDS Service
delivers the correction or the geocode
 DQS delivers the name and address RDS
service delivers the new address if moved
 All kind of services available
 Exchange rates, Translations, Geocoding,
Gender definition
DQS: Data Quality Services
 Knowledge base (KB)
Knowledge Discovery

Domain 1
Reference
Data

Domain 6

Domain 2

Domain
Values

Domain 7

Domain 8

Rules

Termbased
Relations

Domain 9

Domain 3
Domain 4
Domain 5

Matching Policy

Composite
domain 1

CP Domain 2

CP Domain 3
DQS: Data Quality Services
 Matching
 Second functionality in DQS. Detection of
redundant data. After the cleaning values are
standardized and good for comparison processes
 No simple comparison! Comparison will be
handled through complex fuzzy algorithms based
on matching policies the data steward will test
and setup

33
DQS: Data Quality Services
 Matching policies:
DQS: Data Quality Services
Uncleaned
data

Standardized, structure
and enrich

Discover
redundancy

Classified
data

Monitoring
Azure

Discovery

Reference
Data

Domain
Values

Uncleaned
data

Matching

Cleansing

Rules

Knowledge
Base (KB)

Termbased
Relations

Cleaned
data

Policy

Classification

Profiling & Notifications
Enterprise Information Management

MDS:
MASTER DATA SERVICES
37
Master Data Management
CRM

ERP

WWW

Customer

Customer

Customer

Product

Product

Product

DWH
MDS: Master Data Services
 Problem in EIM
 Heterogenic system environment with several line
of business application [LOB] who produce and
consume data from identical business entities
 Core identities
 Customer
 Product
 Chart of accounts etc.

 Operational and Analytical Problem:

39
MDS: Master Data Services
CRM

ERP

WWW

MDM [operational]
Customer

Customer

Customer

Product

Product

Product

MDM [analytical]
DWH
MDS: Master Data Services
 Operational MDM
 LOB‘s write and read from MDM to achieve a
single point of trouth
 MDM enforcing the single point of truth [SPOT]
through rules, security, versioning
 LOB systems provide and consume the SPOT of
an entity and the related attributes
 Open interfaces for data exchange
 All by an LOB indipendend UI

41
MDS: Master Data Services
 Analytical MDM
 Instead of loading the data from different LOBs to
the DWH landing area and standardize it in the
stage the MDM solution is the gatekeeper
 The gatekeeper function of MDM will be achieved
through rules, standardized hierarchies,
versioning, approvals workflows, dimension
modeling (SCD etc.)
 All by an LOB indipendend UI

42
MDS: Master Data Services
CRM

ERP
MDM [operational]
Customer
Product
MDM [analytical]
DWH

WWW
MDS: Master Data Services
 Basic Model
Master Data Services

DEMO

45
MDS: Master Data Services
 Administration Screen
MDS: Master Data Services
 Excel Add in for Information Worker

47
MDS: Master Data Services
 Central hierachy management

48
MDS: Master Data Services
 Collection
MDS: Master Data Services
 Business Rules:
 Allows Data Owners to validate data without
writing T-SQL
 Compiled into Stored Procedures
 Uses IF..THEN Structures
 Can use AND & OR Logical Operators, to create
Complex Rules up to 7 levels
 Rules using OR Logical Operator can be broken
down into simpler rules
 Applied to Attribute Members for it’s validation

50
MDS: Master Data Services
 Business rules accommodate various
requirements
 Connecting data sources and set overrides
 Multi-level processes
 Workflow and approval – internal (Master Data
Services) and external (Service Broker > SharePoint)
 Multiple or compound business rules provide for more
complex requirements
 Logical operators (AND / OR)
 Control priority of activation
 Enable/disable rules
51
MDS: Master Data Services
Rolebased user access
for master data stewards

 Stream
Excel Add In

Silverlight UI

MDS App

LOB [1-n]

DWH
LOB

SSIS
BizTalk

MDS DB
SQL
Views
Stage
Table

Subscription
Views
Enterprise Information Management

CLOSED LOOP: TEAM UP
EIM: Closed Loop
 Combine MDS and DQS Functionalities
 Use Integration Services to build a closed
loop workflow:
 DQS Knowledge base for cleaning
 MDS Model for standardization and audit
 SSIS for data import, control flow and export
EIM Closed Loop
 Demo case:
 Sample available as download from MS for
everybody to play with (
 Today using new SSDT 2012

)

http://www.microsoft.com/en-us/download/details.aspx?id=35462
EIM Closed
 Business case:
 Supplier Data List from External
 Need to be checked if new suppliers are available
 New data need to be proofed against data quality
standards set up by the Data Steward
 Correct/Corrected data need to be validated
against Master Data Management to apply
business rules and add new data to the master
EIM Closed Loop
 Version 1 (Simple version)
Correct /Corrected

Source

Cleaning
with DQS KB

Fuzzy
Grouping

Filter
Duplicates

Incorrect

Audit

MDS
Stage
EIM Closed Loop
 Version 2 (Advanced version)
Cleaning
with DQS KB

Source

Split

Union
for
MDS

Review by
MDS Data
Steward

Union
for
DQS

Correct

Review by
DQS Data
Steward

New

Lookup Up
MDS via ID
Corrected
No

Match

Lookup
corrected
MDS

Yes

Union
Data stream

Yes

>= Confidence

No
Match

Split
< Confidence

Stage
Obrigado!

More Related Content

What's hot

Credit Suisse: Multi-Domain Enterprise Reference Data
Credit Suisse: Multi-Domain Enterprise Reference DataCredit Suisse: Multi-Domain Enterprise Reference Data
Credit Suisse: Multi-Domain Enterprise Reference DataOrchestra Networks
 
MDM Architecture - SAP
MDM Architecture - SAPMDM Architecture - SAP
MDM Architecture - SAPCapgemini
 
Customer-Centric Data Management for Better Customer Experiences
Customer-Centric Data Management for Better Customer ExperiencesCustomer-Centric Data Management for Better Customer Experiences
Customer-Centric Data Management for Better Customer ExperiencesInformatica
 
DRM Webinar Series, PART 2: Concerned You're Not Getting the Most Out of Orac...
DRM Webinar Series, PART 2: Concerned You're Not Getting the Most Out of Orac...DRM Webinar Series, PART 2: Concerned You're Not Getting the Most Out of Orac...
DRM Webinar Series, PART 2: Concerned You're Not Getting the Most Out of Orac...US-Analytics
 
DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?
DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?
DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?US-Analytics
 
DRM Webinar Series, PART 1: Barriers Preventing You From Getting Started?
DRM Webinar Series, PART 1: Barriers Preventing You From Getting Started?DRM Webinar Series, PART 1: Barriers Preventing You From Getting Started?
DRM Webinar Series, PART 1: Barriers Preventing You From Getting Started?US-Analytics
 
Talent Base Case: Funster - Product MDM case
Talent Base Case: Funster - Product MDM caseTalent Base Case: Funster - Product MDM case
Talent Base Case: Funster - Product MDM caseLoihde Advisory
 
DRM Webinar Series, PART 4: Best Practices, Unlocked
DRM Webinar Series, PART 4: Best Practices, UnlockedDRM Webinar Series, PART 4: Best Practices, Unlocked
DRM Webinar Series, PART 4: Best Practices, UnlockedUS-Analytics
 
Understanding Reference Data with Aaron Zornes
Understanding Reference Data with Aaron ZornesUnderstanding Reference Data with Aaron Zornes
Understanding Reference Data with Aaron ZornesOrchestra Networks
 
Data Governance for EPM Systems with Oracle DRM
Data Governance for EPM Systems with Oracle DRMData Governance for EPM Systems with Oracle DRM
Data Governance for EPM Systems with Oracle DRMUS-Analytics
 
Business objects data services in an sap landscape
Business objects data services in an sap landscapeBusiness objects data services in an sap landscape
Business objects data services in an sap landscapePradeep Ketoli
 
MDM Institute: Why is Reference data mission critical now?
MDM Institute: Why is Reference data mission critical now?MDM Institute: Why is Reference data mission critical now?
MDM Institute: Why is Reference data mission critical now?Orchestra Networks
 
Best Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best PracticesBest Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best PracticesDenodo
 
Bi an ia with sap sybase power designer
Bi an ia with sap sybase power designerBi an ia with sap sybase power designer
Bi an ia with sap sybase power designerJane Kitabayashi
 
MicroLink Corporate Overview
MicroLink Corporate OverviewMicroLink Corporate Overview
MicroLink Corporate OverviewMicroLink, LLC
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityDatabase Architechs
 
Data Governance for the Cloud with Oracle DRM
Data Governance for the Cloud with Oracle DRMData Governance for the Cloud with Oracle DRM
Data Governance for the Cloud with Oracle DRMUS-Analytics
 
Webinar: How MongoDB is Used to Manage Reference Data - May 2014
Webinar: How MongoDB is Used to Manage Reference Data - May 2014Webinar: How MongoDB is Used to Manage Reference Data - May 2014
Webinar: How MongoDB is Used to Manage Reference Data - May 2014MongoDB
 
Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...
Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...
Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...Denodo
 

What's hot (20)

Credit Suisse: Multi-Domain Enterprise Reference Data
Credit Suisse: Multi-Domain Enterprise Reference DataCredit Suisse: Multi-Domain Enterprise Reference Data
Credit Suisse: Multi-Domain Enterprise Reference Data
 
MDM Architecture - SAP
MDM Architecture - SAPMDM Architecture - SAP
MDM Architecture - SAP
 
Customer-Centric Data Management for Better Customer Experiences
Customer-Centric Data Management for Better Customer ExperiencesCustomer-Centric Data Management for Better Customer Experiences
Customer-Centric Data Management for Better Customer Experiences
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
DRM Webinar Series, PART 2: Concerned You're Not Getting the Most Out of Orac...
DRM Webinar Series, PART 2: Concerned You're Not Getting the Most Out of Orac...DRM Webinar Series, PART 2: Concerned You're Not Getting the Most Out of Orac...
DRM Webinar Series, PART 2: Concerned You're Not Getting the Most Out of Orac...
 
DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?
DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?
DRM Webinar Series, PART 3: Will DRM Integrate With Our Applications?
 
DRM Webinar Series, PART 1: Barriers Preventing You From Getting Started?
DRM Webinar Series, PART 1: Barriers Preventing You From Getting Started?DRM Webinar Series, PART 1: Barriers Preventing You From Getting Started?
DRM Webinar Series, PART 1: Barriers Preventing You From Getting Started?
 
Talent Base Case: Funster - Product MDM case
Talent Base Case: Funster - Product MDM caseTalent Base Case: Funster - Product MDM case
Talent Base Case: Funster - Product MDM case
 
DRM Webinar Series, PART 4: Best Practices, Unlocked
DRM Webinar Series, PART 4: Best Practices, UnlockedDRM Webinar Series, PART 4: Best Practices, Unlocked
DRM Webinar Series, PART 4: Best Practices, Unlocked
 
Understanding Reference Data with Aaron Zornes
Understanding Reference Data with Aaron ZornesUnderstanding Reference Data with Aaron Zornes
Understanding Reference Data with Aaron Zornes
 
Data Governance for EPM Systems with Oracle DRM
Data Governance for EPM Systems with Oracle DRMData Governance for EPM Systems with Oracle DRM
Data Governance for EPM Systems with Oracle DRM
 
Business objects data services in an sap landscape
Business objects data services in an sap landscapeBusiness objects data services in an sap landscape
Business objects data services in an sap landscape
 
MDM Institute: Why is Reference data mission critical now?
MDM Institute: Why is Reference data mission critical now?MDM Institute: Why is Reference data mission critical now?
MDM Institute: Why is Reference data mission critical now?
 
Best Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best PracticesBest Practices: Data Virtualization Perspectives and Best Practices
Best Practices: Data Virtualization Perspectives and Best Practices
 
Bi an ia with sap sybase power designer
Bi an ia with sap sybase power designerBi an ia with sap sybase power designer
Bi an ia with sap sybase power designer
 
MicroLink Corporate Overview
MicroLink Corporate OverviewMicroLink Corporate Overview
MicroLink Corporate Overview
 
Informatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data QualityInformatica World 2006 - MDM Data Quality
Informatica World 2006 - MDM Data Quality
 
Data Governance for the Cloud with Oracle DRM
Data Governance for the Cloud with Oracle DRMData Governance for the Cloud with Oracle DRM
Data Governance for the Cloud with Oracle DRM
 
Webinar: How MongoDB is Used to Manage Reference Data - May 2014
Webinar: How MongoDB is Used to Manage Reference Data - May 2014Webinar: How MongoDB is Used to Manage Reference Data - May 2014
Webinar: How MongoDB is Used to Manage Reference Data - May 2014
 
Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...
Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...
Rethink Your Data Governance - POPI Act Compliance Made Easy with Data Virtua...
 

Similar to SQLSaturday #188 - Enterprise Information Management

Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Denodo
 
Introduction To SQL Server 2014
Introduction To SQL Server 2014Introduction To SQL Server 2014
Introduction To SQL Server 2014Vishal Pawar
 
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...Denodo
 
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...Denodo
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligenceAhsan Kabir
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesDenodo
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Denodo
 
Best Practices: Data Admin & Data Management
Best Practices: Data Admin & Data ManagementBest Practices: Data Admin & Data Management
Best Practices: Data Admin & Data ManagementEmpowered Holdings, LLC
 
Cloud Modernization and Data as a Service Option
Cloud Modernization and Data as a Service OptionCloud Modernization and Data as a Service Option
Cloud Modernization and Data as a Service OptionDenodo
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalHarvinder Atwal
 
Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)Denodo
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkCaserta
 
Denodo Platform 7.0: What's New?
Denodo Platform 7.0: What's New?Denodo Platform 7.0: What's New?
Denodo Platform 7.0: What's New?Denodo
 
Empowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsEmpowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsPrecisely
 
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Denodo
 
CDI-MDMSummit.290213824
CDI-MDMSummit.290213824CDI-MDMSummit.290213824
CDI-MDMSummit.290213824ypai
 
Microsoft SQL Server 2008 R2 and BizTalk Server Presentation
Microsoft SQL Server 2008 R2 and BizTalk Server PresentationMicrosoft SQL Server 2008 R2 and BizTalk Server Presentation
Microsoft SQL Server 2008 R2 and BizTalk Server PresentationMicrosoft Private Cloud
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationKASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationDenodo
 

Similar to SQLSaturday #188 - Enterprise Information Management (20)

DQS & MDS in SQL Server 2016
DQS & MDS in SQL Server 2016DQS & MDS in SQL Server 2016
DQS & MDS in SQL Server 2016
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
Introduction To SQL Server 2014
Introduction To SQL Server 2014Introduction To SQL Server 2014
Introduction To SQL Server 2014
 
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
 
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligence
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & Bénéfices
 
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
Accelerate Self-Service Analytics with Virtualization and Visualisation (Thai)
 
Best Practices: Data Admin & Data Management
Best Practices: Data Admin & Data ManagementBest Practices: Data Admin & Data Management
Best Practices: Data Admin & Data Management
 
Cloud Modernization and Data as a Service Option
Cloud Modernization and Data as a Service OptionCloud Modernization and Data as a Service Option
Cloud Modernization and Data as a Service Option
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
 
Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)Data Virtualization for Data Architects (Australia)
Data Virtualization for Data Architects (Australia)
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
Denodo Platform 7.0: What's New?
Denodo Platform 7.0: What's New?Denodo Platform 7.0: What's New?
Denodo Platform 7.0: What's New?
 
Empowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog RequirementsEmpowering Business & IT Teams:  Modern Data Catalog Requirements
Empowering Business & IT Teams:  Modern Data Catalog Requirements
 
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
 
CDI-MDMSummit.290213824
CDI-MDMSummit.290213824CDI-MDMSummit.290213824
CDI-MDMSummit.290213824
 
Microsoft SQL Server 2008 R2 and BizTalk Server Presentation
Microsoft SQL Server 2008 R2 and BizTalk Server PresentationMicrosoft SQL Server 2008 R2 and BizTalk Server Presentation
Microsoft SQL Server 2008 R2 and BizTalk Server Presentation
 
Ds04 data quality
Ds04   data qualityDs04   data quality
Ds04 data quality
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationKASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
 

More from Tillmann Eitelberg

Data lake analytics for the admin
Data lake analytics for the adminData lake analytics for the admin
Data lake analytics for the adminTillmann Eitelberg
 
Embrace and extend first-class activity and 3rd party ecosystem for ssis in adf
Embrace and extend first-class activity and 3rd party ecosystem for ssis in adfEmbrace and extend first-class activity and 3rd party ecosystem for ssis in adf
Embrace and extend first-class activity and 3rd party ecosystem for ssis in adfTillmann Eitelberg
 
Webanalytics with Microsoft BI
Webanalytics with Microsoft BIWebanalytics with Microsoft BI
Webanalytics with Microsoft BITillmann Eitelberg
 
Power BI - The self service BI Lifecycle in the cloud
Power BI - The self service BI Lifecycle in the cloudPower BI - The self service BI Lifecycle in the cloud
Power BI - The self service BI Lifecycle in the cloudTillmann Eitelberg
 
SQL Server Konferenz 2014 - SSIS & HDInsight
SQL Server Konferenz 2014 - SSIS & HDInsightSQL Server Konferenz 2014 - SSIS & HDInsight
SQL Server Konferenz 2014 - SSIS & HDInsightTillmann Eitelberg
 

More from Tillmann Eitelberg (8)

Data lake analytics for the admin
Data lake analytics for the adminData lake analytics for the admin
Data lake analytics for the admin
 
Embrace and extend first-class activity and 3rd party ecosystem for ssis in adf
Embrace and extend first-class activity and 3rd party ecosystem for ssis in adfEmbrace and extend first-class activity and 3rd party ecosystem for ssis in adf
Embrace and extend first-class activity and 3rd party ecosystem for ssis in adf
 
Industry 4.0 in a box
Industry 4.0 in a boxIndustry 4.0 in a box
Industry 4.0 in a box
 
Bioinformatics on Azure
Bioinformatics on AzureBioinformatics on Azure
Bioinformatics on Azure
 
Webanalytics with Microsoft BI
Webanalytics with Microsoft BIWebanalytics with Microsoft BI
Webanalytics with Microsoft BI
 
Power BI - The self service BI Lifecycle in the cloud
Power BI - The self service BI Lifecycle in the cloudPower BI - The self service BI Lifecycle in the cloud
Power BI - The self service BI Lifecycle in the cloud
 
SQL Server Konferenz 2014 - SSIS & HDInsight
SQL Server Konferenz 2014 - SSIS & HDInsightSQL Server Konferenz 2014 - SSIS & HDInsight
SQL Server Konferenz 2014 - SSIS & HDInsight
 
Advanced DQS Integration
Advanced DQS IntegrationAdvanced DQS Integration
Advanced DQS Integration
 

Recently uploaded

Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 

Recently uploaded (20)

Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 

SQLSaturday #188 - Enterprise Information Management

  • 1. Closed Loop in Enterprise Information Management Oliver Engels & Tillmann Eitelberg
  • 2. Who we are  Oliver:  CEO of oh22data AG, German MS Gold Partner  SQL MVP, Microsoft vTSP  Tillmann:  CTO of oh22information services GmbH  Both:     PASS Germany Board Members Regional Mentors for Germany SQL Information Services Advisory Board Members Data Quality Maniacs
  • 4. What Are Your Professional Development Goals? I want to take the path from DBA to Data Analytics Guru I want to upgrade my skills I want to give my career a competitive edge I want to expand my network in the business analytics industry Sound familiar? Get a head start and join us today at: www.passbaconference.com #passbac Enjoy $150 off registration: use code CHM2D
  • 5. Upcoming SQL Server events: XXXIII Encontro da Comunidade SQLPort Data Evento: 23 Abril 2013 - 18:30 Local do Evento: Auditório Microsoft, Parque das Nações, Lisboa 18:30 - Abertura e recepção. 19:10 - "Analyzing Twitter Data" - Niko Neugebauer (SQL Server MVP, Community Evangelist – PASS) 20:15 - Coffee break 20:30 - "First Approach to SQL Server Analysis Services" - João Fialho (Consultor BI Independente) 21:30 - Sorteio de prémios XXXIV Encontro da Comunidade SQLPort Data Evento: 7 Maio 2013 - 19:00 Local do Evento: Porto 18:30 - Abertura e recepção. 19:00 - «Apresentação para Developers» - para definir 20:15 - Coffee break 20:30 - «Apresentação para definir» - para definir 21:30 - Sorteio de prémios
  • 6. Volunteers:  They spend their FREE time to give you this event. (2 months per person)  Because they are crazy.  Because they want YOU to learn from the BEST IN THE WORLD.  If you see a guy with “STAFF” on their back – buy them a beer, they deserve it.
  • 12. Take aways  EIM. SQL IS. Data Curation…. what? Give some explanations  Understanding of the building bricks of EIM in the Microsoft BI Stack: SSIS, DQS, MDS  Closed loop: Bring’em all together What’s possible  If time allow: First impressions on Selfservice ETL: Data Explorer Preview
  • 13. Agenda     Definitions Data Quality Services Master Data Quality Services Closed loop: SSIS, DQS, MDS Team up!
  • 15. Definition:  EIM: Enterprise Information Management  Wiki:  Enterprise information management combines business intelligence (BI) and enterprise content management (ECM)  Where BI and ECM respectively manage structured and unstructured information, EIM does not make this "technical" distinction.  It approaches the management of information from the perspective of enterprise information strategy, based on the needs of information workers.
  • 16. Definition: SQL Information Services  SQL Information Service charter: Enrich enterprise data with the world’s data Empower developers to build new services and applications Connecting with the world’s data to turn data into action Vibrant marketplace ecosystem for the world’s data SQL Information Services
  • 17. IT Pro Knowledge Worker Surface all information as a service to the organization, while maintaining the right level of control Enable any user to find reliable, trusted information needed to do their job discover secure create govern clean curate publish operationalize recommend transform analyze Developer Immediate access to the data and services they need to build new services and applications Data Analyst Democratize the broad adoption of advanced analytics to empower businesses
  • 18. SQL Information Services Portfolio  Building the tools for Enterprise Information Management Integration Services BizTalk Master Data Services Data Quality Services Data Explorer Big Data Azure Data Market Stream Insight Other IS Tools
  • 19. Data curation  Data curation components for EIM Data Quality Services Master Data Services Manage Cleanse SSIS/BizTalk Integrate
  • 20. Discover and Access Data and Services PoC: Role definiton Mash, Improve Quality, Enrich and Analyze Share and Collaborate Information Worker Simplified, trusted consumption of data Data Steward Data Management ITProfessional Service Management Provision, Deploy, Maintain SLA Publish Add data sources to source catalog Investigate Identify Data usage Artifacts and data relations issues Monitor usage Govern Assess, configure and oversee Respond to incidents Manage Assets Usage and Policy Improve Quality of Data and Metadata Cleanse, Enrich, Curate Build the plumbing, Connect the assets to the service
  • 21. Enterprise Information Management DQS: DATA QUALITY SERVICE 22
  • 23. DQS: Data Quality Services  Main driver for data quality: Costs! Data quality cost Costs because of bad data quality Cost of optimizing data quality Direct Prevention Indirect Discovery Cleansing
  • 24. DQS: Data Quality Services  Microsoft's DQM approach: Data Quality Services (DQS) is a Knowledge-Driven data quality solution enabling data stewards to easily improve the quality of their data  Easy = Information Worker Driven  Knowledge driven =  Capturing knowledge of good and bad data in knowledge base 25
  • 25. DQS: Data Quality Services  Domain concept  Domain (e.g. Street) has  Domain values (List of correct and incorrect values)  Reference data (External data references, e.g. D&B)  Rules (Proofing if data is valid or invalid)  Termbased Relations (Change abreviations) 26
  • 27. DQS: Data Quality Services  Domain values  List of values  By Excel Import  By knowledge discovery  By hand  Correction values  Invalid values
  • 28. DQS: Data Quality Services  Domain rules  Regular Expressions  Logical expressions  Matching values  Termbased relations 29
  • 29. DQS: Data Quality Services  Reference data (RDS)  External cloud or on premise data streams with enrichment functions
  • 30. DQS: Data Quality Services  Reference data (RDS)  DQS delivers the address, RDS Service delivers the correction or the geocode  DQS delivers the name and address RDS service delivers the new address if moved  All kind of services available  Exchange rates, Translations, Geocoding, Gender definition
  • 31. DQS: Data Quality Services  Knowledge base (KB) Knowledge Discovery Domain 1 Reference Data Domain 6 Domain 2 Domain Values Domain 7 Domain 8 Rules Termbased Relations Domain 9 Domain 3 Domain 4 Domain 5 Matching Policy Composite domain 1 CP Domain 2 CP Domain 3
  • 32. DQS: Data Quality Services  Matching  Second functionality in DQS. Detection of redundant data. After the cleaning values are standardized and good for comparison processes  No simple comparison! Comparison will be handled through complex fuzzy algorithms based on matching policies the data steward will test and setup 33
  • 33. DQS: Data Quality Services  Matching policies:
  • 34. DQS: Data Quality Services Uncleaned data Standardized, structure and enrich Discover redundancy Classified data Monitoring Azure Discovery Reference Data Domain Values Uncleaned data Matching Cleansing Rules Knowledge Base (KB) Termbased Relations Cleaned data Policy Classification Profiling & Notifications
  • 37. MDS: Master Data Services  Problem in EIM  Heterogenic system environment with several line of business application [LOB] who produce and consume data from identical business entities  Core identities  Customer  Product  Chart of accounts etc.  Operational and Analytical Problem: 39
  • 38. MDS: Master Data Services CRM ERP WWW MDM [operational] Customer Customer Customer Product Product Product MDM [analytical] DWH
  • 39. MDS: Master Data Services  Operational MDM  LOB‘s write and read from MDM to achieve a single point of trouth  MDM enforcing the single point of truth [SPOT] through rules, security, versioning  LOB systems provide and consume the SPOT of an entity and the related attributes  Open interfaces for data exchange  All by an LOB indipendend UI 41
  • 40. MDS: Master Data Services  Analytical MDM  Instead of loading the data from different LOBs to the DWH landing area and standardize it in the stage the MDM solution is the gatekeeper  The gatekeeper function of MDM will be achieved through rules, standardized hierarchies, versioning, approvals workflows, dimension modeling (SCD etc.)  All by an LOB indipendend UI 42
  • 41. MDS: Master Data Services CRM ERP MDM [operational] Customer Product MDM [analytical] DWH WWW
  • 42. MDS: Master Data Services  Basic Model
  • 44. MDS: Master Data Services  Administration Screen
  • 45. MDS: Master Data Services  Excel Add in for Information Worker 47
  • 46. MDS: Master Data Services  Central hierachy management 48
  • 47. MDS: Master Data Services  Collection
  • 48. MDS: Master Data Services  Business Rules:  Allows Data Owners to validate data without writing T-SQL  Compiled into Stored Procedures  Uses IF..THEN Structures  Can use AND & OR Logical Operators, to create Complex Rules up to 7 levels  Rules using OR Logical Operator can be broken down into simpler rules  Applied to Attribute Members for it’s validation 50
  • 49. MDS: Master Data Services  Business rules accommodate various requirements  Connecting data sources and set overrides  Multi-level processes  Workflow and approval – internal (Master Data Services) and external (Service Broker > SharePoint)  Multiple or compound business rules provide for more complex requirements  Logical operators (AND / OR)  Control priority of activation  Enable/disable rules 51
  • 50. MDS: Master Data Services Rolebased user access for master data stewards  Stream Excel Add In Silverlight UI MDS App LOB [1-n] DWH LOB SSIS BizTalk MDS DB SQL Views Stage Table Subscription Views
  • 52. EIM: Closed Loop  Combine MDS and DQS Functionalities  Use Integration Services to build a closed loop workflow:  DQS Knowledge base for cleaning  MDS Model for standardization and audit  SSIS for data import, control flow and export
  • 53. EIM Closed Loop  Demo case:  Sample available as download from MS for everybody to play with (  Today using new SSDT 2012 ) http://www.microsoft.com/en-us/download/details.aspx?id=35462
  • 54. EIM Closed  Business case:  Supplier Data List from External  Need to be checked if new suppliers are available  New data need to be proofed against data quality standards set up by the Data Steward  Correct/Corrected data need to be validated against Master Data Management to apply business rules and add new data to the master
  • 55. EIM Closed Loop  Version 1 (Simple version) Correct /Corrected Source Cleaning with DQS KB Fuzzy Grouping Filter Duplicates Incorrect Audit MDS Stage
  • 56. EIM Closed Loop  Version 2 (Advanced version) Cleaning with DQS KB Source Split Union for MDS Review by MDS Data Steward Union for DQS Correct Review by DQS Data Steward New Lookup Up MDS via ID Corrected No Match Lookup corrected MDS Yes Union Data stream Yes >= Confidence No Match Split < Confidence Stage