SlideShare a Scribd company logo
Mark Gschwind
Enterprise Information
Management in SQL Server 2012
Originally Titled “Master Data and Data Quality Management in SQL Server 2012”
Mark Gschwind
 Independent Consultant
 Business Intelligence practitioner, manager since 1995
 Over 50 Business BI projects
 Data Warehousing/Cubing/Reporting/Data Mining/EIM
 MCP, certified in Oracle Essbase, Melissa Data MVP
 Working with clients on EIM since 2008
mark@gschwindconsulting.com
find me on
www.linkedin.com/in/markgschwind
Blog Site:
www.marksbiblog.com
Agenda
 Enterprise Information Management (EIM)
 What is it and why do we need it?
 Microsoft EIM, 3 technologies working together
 DQS
• Capabilities
• Demo
 SSIS
 MDS
• Capabilities
• Demo
 EIM=DQS+MDS+SSIS
 Wrap up
 Questions
Why Do We Need EIM?
Impediments to EIM Success
DATA QUALITY
SERVICES
MASTER DATA
SERVICES
INTEGRATION
SERVICES
What is Data Quality?
DQS: What is Data Quality?
 Data Quality represents the degree to which the
data is suitable for business usages
 Data Quality is built through People + Processes +
Technology
 Bad Data  Bad Business
“Poor data quality can cost companies 15%
to 25% (or more) of their operating budget”
- Larry English (International Data Quality Expert)
Common Data Quality Issues
Data
Quality
Issue Sample Data Problem
Standard Are data elements consistently
defined and understood?
Gender code = M, F, U in one system and
Gender code = 0, 1, 2 in another system
Complete Is all necessary data present? 20% of customers‟ last name is blank,
50% of zip-codes are 99999
Accurate Does the data accurately
represent reality or a verifiable
source?
A Supplier is listed as „Active‟ but went out of
business six years ago
Valid Do data values fall within
acceptable ranges?
Salary values should be between
60,000-120,000
Unique Data appears several times Both John Ryan and Jack Ryan appear in
the system – are they the same person?
Common Issues DQS Addresses
Name Gender Street House # Zip code City State D.O.B
John Doe Male 60th street 45 New York New York 08/12/64
Jane Doe Male Jonathan ln 36 10023 Poughkeepsy NY 21-dec-1954
Name Gender Street House # Zip
code
City State D.O.B
John Doe Male E 60th St 45W 10022 New York NY 08/12/64
Jane Doe Female Jonathan
Lane
36 10023 Poughkeepsie NY 12/21/54
Name Address Postal Code City State
John Smith 545 S Valley View Drive # 136 34563 Anytown New York
Margaret & John smith 545 Valley View ave unit 136 34563-2341 Anytown New York
Maggie Smith 545 S Valley View Dr Anytown New York
John Smith 545 Valley Drive St. 34253 NY NY
Name Address Zip Code City State Cluster
John Smith 545 S Valley View Drive # 136 34563 Anytown New York 1
Margaret & John smith 545 Valley View ave unit 136 34563-2341 Anytown New York 1
Maggie Smith 545 S Valley View Dr Anytown New York 1
John Smith 545 Valley Drive St. 34253 NY NY 2
Before
Before
After
After
Completeness Accuracy Conformity Consistency Uniqueness
DQS Use Cases
• One-Time cleanups
o Merge/Migrate multiple divisional CRMs into one
• Continuous Process with Steward Intervention
o Vendor master with continuous trickle of data
o Customer master with incomplete data
• Continuous Process with Minimal Intervention
o Database marketing mailing list
DQS Process
Build
Use
Knowledge
Management
Knowledge
Base
Demo
Integrate DQS using SSIS
(continuous low-intervention use case)
MDS: What is Master Data?

 Continuous quality management
 Ease of use for business users (not just IT)
 Effective sharing (producing and consuming)
 Centralized maintenance, by different departments
 Changes that keep pace with the business
 Master Data contains different attributes for
different departments
(marketing, finance, operations, business
groups…)
 The challenge: To make a trusted single source
of business data used across multiple
systems, applications, and processes
MDS Use Cases
Regulatory
Enable security
management and auditing
of data used for
regulatory reporting
Data Warehouse /
Data Marts Mgmt
Operational Data
Management
Enable business users to
manage the dimensions
and hierarchies of DW /
Data Marts
Central data records
mgmt and consumption
sourced by other
operational systems
A company has adopted 6 “best
of breed” systems from
different vendors. They need
to be able to propagate the
correct customer information to
each system in a consistent
way.
MDS provides a platform for
central schema, integration
points and validation for
SI/ISV/Internal IT to develop a
custom solution
The IT department has built a
data warehouse and reporting
platform, but business users
complain about the
correctness of the dimensions
and lack of agility in making
updates.
MDS empowers the
business users to manage
dimensions themselves
while IT can govern the
changes
There are 3 G/L systems
whose G/L accounts need to
be consolidated and rolled up
to create financial statements
for regulatory reporting to
several countries
MDS enables an approval
process for changes with
role-based security and
transactional auditing of all
changes
Where is Master Data (in a DW)?
Versioning
Validation
Authoring business rules
to ensure data
correctness
Modeling
Entities, Attributes,
Hierarchies
Enabling Integration & Sharing
MDS Capabilities
Role-based Security and
Transaction Annotation
Master Data
Stewardship
External
(CRM, ..)
Excel DWH
Loading batched
data through
Staging Tables
Consuming data
through Views
Registering to
changes through
APIs
Excel Add-In Web UI
Workflow /
Notifications
Data Matching
(DQS Integrated)
MDS Architecture
MDS Database
Entity Based
Staging Tables
Subscription
Views
IIS Service
MDS Service
Excel Add-InWEB-UI
External
System
CRM/ERP
Workflow /
Notifications
DWH
Excel Cleansing and
Matching
(DQS)
Silverlight
SSIS
SSIS
SSIS
BI
OLAP
External System
WCF
PW
Pivot
BizTalk / Others
Demo
Business Rules
 Business Rules are expressions and actions that
can govern the conduct of business processes*
 Enable data governance by:
-- Enforcing data standards
-- Alerting users to data quality issues
-- Creating simple workflows
 Have limitations, but can be extended
*EIM = DQS+MDS+SSIS+People+Process
Security
 Functional area permissions
 Model/Entity level permissions provide column-
level security
 Hierarchy permissions allow row-level security
 Use AD groups, not individual users
 Only use Hierarchy permissions if row-level
security is required
DATA QUALITY
SERVICES
MASTER DATA
SERVICES
INTEGRATION
SERVICES
Key Takeaways
 SQL Server has tools to address EIM, the biggest
impediment to BI success
 EIM is People + Processes enabled by Technology

More Related Content

What's hot

Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
Hai Nguyen
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
Deenbandhu Prasad
 
Master Data Services - used for than just data
Master Data Services - used for than just dataMaster Data Services - used for than just data
Master Data Services - used for than just data
Kenneth Michael Nielsen
 
MDS & SQL 2012
MDS & SQL 2012MDS & SQL 2012
MDS & SQL 2012
Chad Dotzenrod
 
Data Quality Services in SQL Server 2012
Data Quality Services in SQL Server 2012Data Quality Services in SQL Server 2012
Data Quality Services in SQL Server 2012
Stéphane Fréchette
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
MoniqueO Opris
 
EIM Tutorial
EIM TutorialEIM Tutorial
EIM Tutorial
Neil Hambly
 
A Crash Course in SQL Server Administration for Reluctant Database Administra...
A Crash Course in SQL Server Administration for Reluctant Database Administra...A Crash Course in SQL Server Administration for Reluctant Database Administra...
A Crash Course in SQL Server Administration for Reluctant Database Administra...
Chad Petrovay
 
Introduction to Microsoft’s Master Data Services (MDS)
Introduction to Microsoft’s Master Data Services (MDS)Introduction to Microsoft’s Master Data Services (MDS)
Introduction to Microsoft’s Master Data Services (MDS)
James Serra
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
Cloudbells.com
 
Data quality services
Data quality servicesData quality services
Data quality servicesSteve Xu
 
IT6701-Information Management Unit 3
IT6701-Information Management Unit 3IT6701-Information Management Unit 3
IT6701-Information Management Unit 3
SIMONTHOMAS S
 
Master Data Management methodology
Master Data Management methodologyMaster Data Management methodology
Master Data Management methodology
Database Architechs
 
Master Your Data. Master Your Business
Master Your Data. Master Your BusinessMaster Your Data. Master Your Business
Master Your Data. Master Your Business
DLT Solutions
 
MDM Architecture - SAP
MDM Architecture - SAPMDM Architecture - SAP
MDM Architecture - SAP
Capgemini
 
Présentation IBM InfoSphere MDM 11.3
Présentation IBM InfoSphere MDM 11.3Présentation IBM InfoSphere MDM 11.3
Présentation IBM InfoSphere MDM 11.3
IBMInfoSphereUGFR
 
Credit Suisse: Multi-Domain Enterprise Reference Data
Credit Suisse: Multi-Domain Enterprise Reference DataCredit Suisse: Multi-Domain Enterprise Reference Data
Credit Suisse: Multi-Domain Enterprise Reference Data
Orchestra Networks
 
Master data management
Master data managementMaster data management
Master data management
Zahra Mansoori
 
Tips & tricks to drive effective Master Data Management & ERP harmonization
Tips & tricks to drive effective Master Data Management & ERP harmonizationTips & tricks to drive effective Master Data Management & ERP harmonization
Tips & tricks to drive effective Master Data Management & ERP harmonization
Verdantis
 

What's hot (20)

Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
Master Data Services - used for than just data
Master Data Services - used for than just dataMaster Data Services - used for than just data
Master Data Services - used for than just data
 
MDS & SQL 2012
MDS & SQL 2012MDS & SQL 2012
MDS & SQL 2012
 
Data Quality Services in SQL Server 2012
Data Quality Services in SQL Server 2012Data Quality Services in SQL Server 2012
Data Quality Services in SQL Server 2012
 
Master Data Management
Master Data ManagementMaster Data Management
Master Data Management
 
EIM Tutorial
EIM TutorialEIM Tutorial
EIM Tutorial
 
A Crash Course in SQL Server Administration for Reluctant Database Administra...
A Crash Course in SQL Server Administration for Reluctant Database Administra...A Crash Course in SQL Server Administration for Reluctant Database Administra...
A Crash Course in SQL Server Administration for Reluctant Database Administra...
 
Introduction to Microsoft’s Master Data Services (MDS)
Introduction to Microsoft’s Master Data Services (MDS)Introduction to Microsoft’s Master Data Services (MDS)
Introduction to Microsoft’s Master Data Services (MDS)
 
Introduction to Data Management
Introduction to Data ManagementIntroduction to Data Management
Introduction to Data Management
 
Data quality services
Data quality servicesData quality services
Data quality services
 
IT6701-Information Management Unit 3
IT6701-Information Management Unit 3IT6701-Information Management Unit 3
IT6701-Information Management Unit 3
 
Master Data Management methodology
Master Data Management methodologyMaster Data Management methodology
Master Data Management methodology
 
Master Your Data. Master Your Business
Master Your Data. Master Your BusinessMaster Your Data. Master Your Business
Master Your Data. Master Your Business
 
MDM Architecture - SAP
MDM Architecture - SAPMDM Architecture - SAP
MDM Architecture - SAP
 
Présentation IBM InfoSphere MDM 11.3
Présentation IBM InfoSphere MDM 11.3Présentation IBM InfoSphere MDM 11.3
Présentation IBM InfoSphere MDM 11.3
 
Credit Suisse: Multi-Domain Enterprise Reference Data
Credit Suisse: Multi-Domain Enterprise Reference DataCredit Suisse: Multi-Domain Enterprise Reference Data
Credit Suisse: Multi-Domain Enterprise Reference Data
 
Master data management
Master data managementMaster data management
Master data management
 
Tips & tricks to drive effective Master Data Management & ERP harmonization
Tips & tricks to drive effective Master Data Management & ERP harmonizationTips & tricks to drive effective Master Data Management & ERP harmonization
Tips & tricks to drive effective Master Data Management & ERP harmonization
 
Data Flux
Data FluxData Flux
Data Flux
 

Similar to Enterprise Information Management (EIM) in SQL Server 2012

14178090.ppt
14178090.ppt14178090.ppt
14178090.ppt
Sarat Saikia
 
SQLSaturday #188 - Enterprise Information Management
SQLSaturday #188  - Enterprise Information ManagementSQLSaturday #188  - Enterprise Information Management
SQLSaturday #188 - Enterprise Information Management
Tillmann Eitelberg
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligence
Ahsan Kabir
 
TekMindz Master Data Management Capabilities
TekMindz Master Data Management CapabilitiesTekMindz Master Data Management Capabilities
TekMindz Master Data Management CapabilitiesAkshay Pandita
 
Why data governance is the new buzz?
Why data governance is the new buzz?Why data governance is the new buzz?
Why data governance is the new buzz?
Aachen Data & AI Meetup
 
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit UnionsWebinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Denodo
 
10 Worst Practices in Master Data Management
10 Worst Practices in Master Data Management10 Worst Practices in Master Data Management
10 Worst Practices in Master Data Management
ibi
 
Successful Stewardship NZ
Successful Stewardship NZSuccessful Stewardship NZ
Successful Stewardship NZ
Certus Solutions
 
Salesforce Architect Group, Frederick, United States April 2023 - Architect’s...
Salesforce Architect Group, Frederick, United States April 2023 - Architect’s...Salesforce Architect Group, Frederick, United States April 2023 - Architect’s...
Salesforce Architect Group, Frederick, United States April 2023 - Architect’s...
NadinaLisbon1
 
مدیریت کیفیت داده
مدیریت کیفیت دادهمدیریت کیفیت داده
مدیریت کیفیت داده
Hosseinieh Ershad Public Library
 
IT6701 Information Management - Unit III
IT6701 Information Management - Unit IIIIT6701 Information Management - Unit III
IT6701 Information Management - Unit III
pkaviya
 
Data Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AIData Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AI
Denodo
 
Tag dg 101 march 2017
Tag dg 101 march 2017Tag dg 101 march 2017
Tag dg 101 march 2017
Mary Levins, PMP
 
Enterprise-Level Preparation for Master Data Management.pdf
Enterprise-Level Preparation for Master Data Management.pdfEnterprise-Level Preparation for Master Data Management.pdf
Enterprise-Level Preparation for Master Data Management.pdf
AmeliaWong21
 
Then & Now: Strategic Considerations for Data Quality
Then & Now: Strategic Considerations for Data QualityThen & Now: Strategic Considerations for Data Quality
Then & Now: Strategic Considerations for Data Quality
Precisely
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationKASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
Denodo
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality Right
DATAVERSITY
 
MDM SUMMIT Asia-Pacific 2009 Conference Keynote Aaron Zornes (Sydney April ...
MDM SUMMIT Asia-Pacific 2009 Conference Keynote   Aaron Zornes (Sydney April ...MDM SUMMIT Asia-Pacific 2009 Conference Keynote   Aaron Zornes (Sydney April ...
MDM SUMMIT Asia-Pacific 2009 Conference Keynote Aaron Zornes (Sydney April ...
Aaron Zornes
 
Présentation Forrester - Forum MDM Micropole 2014
Présentation Forrester - Forum MDM Micropole 2014Présentation Forrester - Forum MDM Micropole 2014
Présentation Forrester - Forum MDM Micropole 2014
Micropole Group
 

Similar to Enterprise Information Management (EIM) in SQL Server 2012 (20)

14178090.ppt
14178090.ppt14178090.ppt
14178090.ppt
 
Bad customer data?
Bad customer data?Bad customer data?
Bad customer data?
 
SQLSaturday #188 - Enterprise Information Management
SQLSaturday #188  - Enterprise Information ManagementSQLSaturday #188  - Enterprise Information Management
SQLSaturday #188 - Enterprise Information Management
 
Overview of business intelligence
Overview of business intelligenceOverview of business intelligence
Overview of business intelligence
 
TekMindz Master Data Management Capabilities
TekMindz Master Data Management CapabilitiesTekMindz Master Data Management Capabilities
TekMindz Master Data Management Capabilities
 
Why data governance is the new buzz?
Why data governance is the new buzz?Why data governance is the new buzz?
Why data governance is the new buzz?
 
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit UnionsWebinar #2 - Transforming Challenges into Opportunities for Credit Unions
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
 
10 Worst Practices in Master Data Management
10 Worst Practices in Master Data Management10 Worst Practices in Master Data Management
10 Worst Practices in Master Data Management
 
Successful Stewardship NZ
Successful Stewardship NZSuccessful Stewardship NZ
Successful Stewardship NZ
 
Salesforce Architect Group, Frederick, United States April 2023 - Architect’s...
Salesforce Architect Group, Frederick, United States April 2023 - Architect’s...Salesforce Architect Group, Frederick, United States April 2023 - Architect’s...
Salesforce Architect Group, Frederick, United States April 2023 - Architect’s...
 
مدیریت کیفیت داده
مدیریت کیفیت دادهمدیریت کیفیت داده
مدیریت کیفیت داده
 
IT6701 Information Management - Unit III
IT6701 Information Management - Unit IIIIT6701 Information Management - Unit III
IT6701 Information Management - Unit III
 
Data Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AIData Science Operationalization: The Journey of Enterprise AI
Data Science Operationalization: The Journey of Enterprise AI
 
Tag dg 101 march 2017
Tag dg 101 march 2017Tag dg 101 march 2017
Tag dg 101 march 2017
 
Enterprise-Level Preparation for Master Data Management.pdf
Enterprise-Level Preparation for Master Data Management.pdfEnterprise-Level Preparation for Master Data Management.pdf
Enterprise-Level Preparation for Master Data Management.pdf
 
Then & Now: Strategic Considerations for Data Quality
Then & Now: Strategic Considerations for Data QualityThen & Now: Strategic Considerations for Data Quality
Then & Now: Strategic Considerations for Data Quality
 
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data VirtualizationKASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
 
Getting Data Quality Right
Getting Data Quality RightGetting Data Quality Right
Getting Data Quality Right
 
MDM SUMMIT Asia-Pacific 2009 Conference Keynote Aaron Zornes (Sydney April ...
MDM SUMMIT Asia-Pacific 2009 Conference Keynote   Aaron Zornes (Sydney April ...MDM SUMMIT Asia-Pacific 2009 Conference Keynote   Aaron Zornes (Sydney April ...
MDM SUMMIT Asia-Pacific 2009 Conference Keynote Aaron Zornes (Sydney April ...
 
Présentation Forrester - Forum MDM Micropole 2014
Présentation Forrester - Forum MDM Micropole 2014Présentation Forrester - Forum MDM Micropole 2014
Présentation Forrester - Forum MDM Micropole 2014
 

Recently uploaded

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 

Recently uploaded (20)

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 

Enterprise Information Management (EIM) in SQL Server 2012

  • 1. Mark Gschwind Enterprise Information Management in SQL Server 2012 Originally Titled “Master Data and Data Quality Management in SQL Server 2012”
  • 2. Mark Gschwind  Independent Consultant  Business Intelligence practitioner, manager since 1995  Over 50 Business BI projects  Data Warehousing/Cubing/Reporting/Data Mining/EIM  MCP, certified in Oracle Essbase, Melissa Data MVP  Working with clients on EIM since 2008 mark@gschwindconsulting.com find me on www.linkedin.com/in/markgschwind Blog Site: www.marksbiblog.com
  • 3. Agenda  Enterprise Information Management (EIM)  What is it and why do we need it?  Microsoft EIM, 3 technologies working together  DQS • Capabilities • Demo  SSIS  MDS • Capabilities • Demo  EIM=DQS+MDS+SSIS  Wrap up  Questions
  • 4. Why Do We Need EIM?
  • 7. What is Data Quality?
  • 8. DQS: What is Data Quality?  Data Quality represents the degree to which the data is suitable for business usages  Data Quality is built through People + Processes + Technology  Bad Data  Bad Business “Poor data quality can cost companies 15% to 25% (or more) of their operating budget” - Larry English (International Data Quality Expert)
  • 9. Common Data Quality Issues Data Quality Issue Sample Data Problem Standard Are data elements consistently defined and understood? Gender code = M, F, U in one system and Gender code = 0, 1, 2 in another system Complete Is all necessary data present? 20% of customers‟ last name is blank, 50% of zip-codes are 99999 Accurate Does the data accurately represent reality or a verifiable source? A Supplier is listed as „Active‟ but went out of business six years ago Valid Do data values fall within acceptable ranges? Salary values should be between 60,000-120,000 Unique Data appears several times Both John Ryan and Jack Ryan appear in the system – are they the same person?
  • 10. Common Issues DQS Addresses Name Gender Street House # Zip code City State D.O.B John Doe Male 60th street 45 New York New York 08/12/64 Jane Doe Male Jonathan ln 36 10023 Poughkeepsy NY 21-dec-1954 Name Gender Street House # Zip code City State D.O.B John Doe Male E 60th St 45W 10022 New York NY 08/12/64 Jane Doe Female Jonathan Lane 36 10023 Poughkeepsie NY 12/21/54 Name Address Postal Code City State John Smith 545 S Valley View Drive # 136 34563 Anytown New York Margaret & John smith 545 Valley View ave unit 136 34563-2341 Anytown New York Maggie Smith 545 S Valley View Dr Anytown New York John Smith 545 Valley Drive St. 34253 NY NY Name Address Zip Code City State Cluster John Smith 545 S Valley View Drive # 136 34563 Anytown New York 1 Margaret & John smith 545 Valley View ave unit 136 34563-2341 Anytown New York 1 Maggie Smith 545 S Valley View Dr Anytown New York 1 John Smith 545 Valley Drive St. 34253 NY NY 2 Before Before After After Completeness Accuracy Conformity Consistency Uniqueness
  • 11. DQS Use Cases • One-Time cleanups o Merge/Migrate multiple divisional CRMs into one • Continuous Process with Steward Intervention o Vendor master with continuous trickle of data o Customer master with incomplete data • Continuous Process with Minimal Intervention o Database marketing mailing list
  • 13. Demo
  • 14. Integrate DQS using SSIS (continuous low-intervention use case)
  • 15. MDS: What is Master Data?   Continuous quality management  Ease of use for business users (not just IT)  Effective sharing (producing and consuming)  Centralized maintenance, by different departments  Changes that keep pace with the business  Master Data contains different attributes for different departments (marketing, finance, operations, business groups…)  The challenge: To make a trusted single source of business data used across multiple systems, applications, and processes
  • 16. MDS Use Cases Regulatory Enable security management and auditing of data used for regulatory reporting Data Warehouse / Data Marts Mgmt Operational Data Management Enable business users to manage the dimensions and hierarchies of DW / Data Marts Central data records mgmt and consumption sourced by other operational systems A company has adopted 6 “best of breed” systems from different vendors. They need to be able to propagate the correct customer information to each system in a consistent way. MDS provides a platform for central schema, integration points and validation for SI/ISV/Internal IT to develop a custom solution The IT department has built a data warehouse and reporting platform, but business users complain about the correctness of the dimensions and lack of agility in making updates. MDS empowers the business users to manage dimensions themselves while IT can govern the changes There are 3 G/L systems whose G/L accounts need to be consolidated and rolled up to create financial statements for regulatory reporting to several countries MDS enables an approval process for changes with role-based security and transactional auditing of all changes
  • 17. Where is Master Data (in a DW)?
  • 18. Versioning Validation Authoring business rules to ensure data correctness Modeling Entities, Attributes, Hierarchies Enabling Integration & Sharing MDS Capabilities Role-based Security and Transaction Annotation Master Data Stewardship External (CRM, ..) Excel DWH Loading batched data through Staging Tables Consuming data through Views Registering to changes through APIs Excel Add-In Web UI Workflow / Notifications Data Matching (DQS Integrated)
  • 19. MDS Architecture MDS Database Entity Based Staging Tables Subscription Views IIS Service MDS Service Excel Add-InWEB-UI External System CRM/ERP Workflow / Notifications DWH Excel Cleansing and Matching (DQS) Silverlight SSIS SSIS SSIS BI OLAP External System WCF PW Pivot BizTalk / Others
  • 20. Demo
  • 21. Business Rules  Business Rules are expressions and actions that can govern the conduct of business processes*  Enable data governance by: -- Enforcing data standards -- Alerting users to data quality issues -- Creating simple workflows  Have limitations, but can be extended *EIM = DQS+MDS+SSIS+People+Process
  • 22. Security  Functional area permissions  Model/Entity level permissions provide column- level security  Hierarchy permissions allow row-level security  Use AD groups, not individual users  Only use Hierarchy permissions if row-level security is required
  • 24. Key Takeaways  SQL Server has tools to address EIM, the biggest impediment to BI success  EIM is People + Processes enabled by Technology

Editor's Notes

  1. Working w these EIM technologies for 5 years, 7 implementations
  2. How many people are using MDS or DQS ? How many people are using something else for MDM ?Need to start w a little background…
  3. http://reports.informationweek.com/cart/index/downloadasset/id/8574“2013 Analytics & Information Management Trends” (in 2012 was “2012 BI and Information Management Trends”)Was top barrier in 2011 as well
  4. Today I will show you 3 tools that address these top 3 impediments to success
  5. Microsoft has 3 tools that work together to address these challengesThese technologies + People+ Processes is the MSFT strategy to Product accurate, trustworthy dataMDS appeared in 2008R2 (acquired Stratature), DQS in 2012 (acquired Zoomix). Integration of these products is a work-in-progress.
  6. Data Quality is kind of like doing the dishes; a lot of work you don’t get much credit for
  7. Larry English claims that “Poor data quality can cost companies 15% to 25% (or more) of their operating budget”Good discussion on the cost of bad data is here:http://dataqualitybook.com/?p=300
  8. <skip>
  9. Now, how to address DQ use cases
  10. DQS is a Knowledge-Driven data quality solution,ie you must know some things about your data in order to cleanse it.Ie, you must know rules to identify valid values, lists of valid values, etc.Create a process to continually improve the KBReference data from the azure marketplace
  11. Transition: from a “Continuous Process with Steward Intervention” use case to “Continuous Process with Minimal Intervention”Map values to a kb + domains in DQS, can do a conditional split on bad values etc
  12. Transition: we’ve gone through 2 legs of EIM (DQS and SSIS), not the 3rd leg, MDS…most of us know what master data is, but stating some things about it will help frame our discussion about it.Because of its importance, it can be in the center of many business processes and hence must be effectively shared for both producing and consumingWhat MDS does is enable these different groups bring their objects together and they can be cared for centrallyOnce an organization has this, it can be used in a number of scenarios
  13. Explaining by saying where it ends up
  14. Let’s talk about MDS’s capabilities for addressing these use casesIn the center we have our data steward who uses the MDS web UI and Excel addin to continuously maintain data qualityModeling an enterprise’s master data objects is a capability brought to the data stewardship process, as well as…DQS – some integration, won’t be showing tonightData Quality Services is acquired from Zoomix in 2008MDS is acquired from Stratature in 2007
  15. Now let’s talk about the underlying technologies supporting these capabilitiesA requirement for any MDM system these days is it has to be SOAP-enabled, to interact with ERPs like SAP and Oracle.The Windows Communication Foundation (or WCF), is an application programming interface (API) in the .NET Framework for building connected, service-oriented applications.The Excel addin communicates through WCF, the Web UI uses Silverlight 5 (new in 2012 and enhances the performance)BizTalk allows organizations to more easily connect disparate systems with over 25 multi-platform adapters and a robust messaging infrastructure.External systems can interact w MDS either through the WCF to the MDS service, or more directly with SQL tablesMention the database can be sql 2008 or sql 2012
  16. DEMOS TO DO:TileSample
  17. Slide Goal: Review what was saidThese technologies + People+ Processes is the MSFT strategy to Product accurate, trustworthy data