SlideShare a Scribd company logo
1 of 16
DWH-Ahsan AbdullahDWH-Ahsan Abdullah
11
Data WarehousingData Warehousing
Lecture-21Lecture-21
Introduction to Data Quality Management (DQM)Introduction to Data Quality Management (DQM)
Virtual University of PakistanVirtual University of Pakistan
Ahsan Abdullah
Assoc. Prof. & Head
Center for Agro-Informatics Research
www.nu.edu.pk/cairindex.asp
National University of Computers & Emerging Sciences, Islamabad
Email: ahsan101@yahoo.com
DWH-Ahsan Abdullah
2
Introduction to Data QualityIntroduction to Data Quality
Management (DQM)Management (DQM)
DWH-Ahsan Abdullah
3
What is Quality? InformallyWhat is Quality? Informally
Some things are better than others i.e. they are ofSome things are better than others i.e. they are of
higher quality. How much “better” is better?higher quality. How much “better” is better?
Is the right item the best item to purchase? HowIs the right item the best item to purchase? How
about after the purchase?about after the purchase?
What is quality of service? The bank exampleWhat is quality of service? The bank example
DWH-Ahsan Abdullah
4
What is Quality? FormallyWhat is Quality? Formally
“Quality is conformance to requirements”
P. Crosby, “Quality is Free” 1979
“Degree of excellence”
Webster’s Third New International Dictionary
DWH-Ahsan Abdullah
5
What is Quality? Examples from Auto IndustryWhat is Quality? Examples from Auto Industry
Quality means meeting customer’s needs,
not necessarily exceeding them.
Quality means improving things customers
care about, because that makes their lives
easier and more comfortable.
Why example from auto-industry?
DWH-Ahsan Abdullah
6
What is Data Quality?What is Data Quality?
Muhammad Khan
Height = 5’8”
Weight = 160 lbs
Gender = Male
Age = 35 yrs
Emp_ID = 440
All data is an abstraction of something real
What is Data?
Note Change
the picture
DWH-Ahsan Abdullah
7
What is Data Quality?What is Data Quality?
Intrinsic Data Quality
Electronic reproduction of reality.
Realistic Data Quality
Degree of utility or value of data to business.
DWH-Ahsan Abdullah
8
Data Quality & OrganizationsData Quality & Organizations
Intelligent Learning Organization:
High-quality data is an open, shared resource with value-
adding processes.
The dysfunctional learning
organization:
Low-quality data is a proprietary resource with cost-adding
processes.
{Comment: Put picture of person in water holding round tube with data written on it}
DWH-Ahsan Abdullah
9
Law #1 - “Data that is not used cannot be correct!”
Law #2 - “Data quality is a function of its use, not its
collection!”
Law #3 - “Data will be no better than its most stringent use!”
Law #4 - “Data quality problems increase with the age of the
system!”
Law #5 – “The less likely something is to occur, the more
traumatic it will be when it happens!”
Orr’s Laws of Data QualityOrr’s Laws of Data Quality
DWH-Ahsan Abdullah
10
Total Quality Control (TQM)Total Quality Control (TQM)
Philosophy of involving all forPhilosophy of involving all for systematicsystematic andand
continuouscontinuous improvement.improvement.
It is customer oriented. Why?It is customer oriented. Why?
TQM incorporates the concept of product quality,TQM incorporates the concept of product quality,
process control, quality assurance, and qualityprocess control, quality assurance, and quality
improvement.improvement.
Quality assurance isQuality assurance is NOTNOT Quality improvementQuality improvement
DWH-Ahsan Abdullah
11
Co$t of fixing data qualityCo$t of fixing data quality
Lowest Quality Highest quality
Costofachievingquality
 Defect minimization is economical.
 Defect elimination is very very expensive.
Exponential rise
in cost
DWH-Ahsan Abdullah
12
Co$t of Data Quality DefectsCo$t of Data Quality Defects
 Controllable CostsControllable Costs
 Recurring costs for analyzing, correcting, and preventingRecurring costs for analyzing, correcting, and preventing
data errorsdata errors
 Resultant CostsResultant Costs
 Internal and external failure costs of business opportunitiesInternal and external failure costs of business opportunities
missed.missed.
 Equipment & Training CostsEquipment & Training Costs
DWH-Ahsan Abdullah
13
Where data quality is critical?Where data quality is critical?
Almost everywhere, some examples:Almost everywhere, some examples:
Marketing communications.Marketing communications.
Customer matching.Customer matching.
Retail house-holding.Retail house-holding.
Combining MIS systems after acquisition.Combining MIS systems after acquisition.
DWH-Ahsan Abdullah
14
Characteristics or Dimensions of Data QualityCharacteristics or Dimensions of Data Quality
Data Quality
Characteristic
Definition
Accuracy Qualitatively assessing lack of error, high accuracy
corresponding to small error.
Completeness The degree to which values are present in the attributes that
require them.
DWH-Ahsan Abdullah
15
Completeness Vs AccuracyCompleteness Vs Accuracy
95% accurate and 100% complete
OR
100% accurate and 95% complete
Which is better?
Depends on data quality (i) tolerances,Depends on data quality (i) tolerances,
the (ii) corresponding application and the (iii) cost ofthe (ii) corresponding application and the (iii) cost of
achieving that data quality vs. the (iv) business value.achieving that data quality vs. the (iv) business value.
DWH-Ahsan Abdullah
16
Characteristics or Dimensions of Data QualityCharacteristics or Dimensions of Data Quality
Data Quality
Characteristic
Definition
Consistency A measure of the degree to which a set of data satisfies a set of
constraints.
Timeliness A measure of how current or up to date the data is.
Uniqueness The state of being only one of its kind or being without an equal
or parallel.
Interpretability The extent to which data is in appropriate languages, symbols,
and units, and the definitions are clear.
Accessibility The extent to which data is available, or easily and quickly
retrievable
Objectivity The extent to which data is unbiased, unprejudiced, and
impartial

More Related Content

What's hot

Data quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeData quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeStefan Kühn
 
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...Edureka!
 
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Edureka!
 
Data science tutorial
Data science tutorialData science tutorial
Data science tutorialAakashdata
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Simplilearn
 
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Edureka!
 
Data mining financial services
Data mining financial servicesData mining financial services
Data mining financial servicesHprentice
 
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Edureka!
 
Have Data—Need Analysts. Lessons Learned From The Woodworking Industry
Have Data—Need Analysts. Lessons Learned From The Woodworking IndustryHave Data—Need Analysts. Lessons Learned From The Woodworking Industry
Have Data—Need Analysts. Lessons Learned From The Woodworking IndustryHealth Catalyst
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analyticsSSaudia
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4
 
The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Archit...
The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Archit...The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Archit...
The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Archit...The Hive
 
Aa proj assited-living_iot
Aa proj assited-living_iotAa proj assited-living_iot
Aa proj assited-living_iotIshanDhoble1
 

What's hot (20)

Data quality - The True Big Data Challenge
Data quality - The True Big Data ChallengeData quality - The True Big Data Challenge
Data quality - The True Big Data Challenge
 
Data analytics
Data analyticsData analytics
Data analytics
 
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
 
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
 
Data science tutorial
Data science tutorialData science tutorial
Data science tutorial
 
Machine Learning in Healthcare: A Case Study
Machine Learning in Healthcare: A Case StudyMachine Learning in Healthcare: A Case Study
Machine Learning in Healthcare: A Case Study
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...
 
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
Data Analyst vs Data Engineer vs Data Scientist | Data Analytics Masters Prog...
 
Data Analytics
Data AnalyticsData Analytics
Data Analytics
 
Data mining financial services
Data mining financial servicesData mining financial services
Data mining financial services
 
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
Who is a Data Scientist? | How to become a Data Scientist? | Data Science Cou...
 
Have Data—Need Analysts. Lessons Learned From The Woodworking Industry
Have Data—Need Analysts. Lessons Learned From The Woodworking IndustryHave Data—Need Analysts. Lessons Learned From The Woodworking Industry
Have Data—Need Analysts. Lessons Learned From The Woodworking Industry
 
Predictive analytics
Predictive analytics Predictive analytics
Predictive analytics
 
Life Science Analytics
Life Science AnalyticsLife Science Analytics
Life Science Analytics
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the table
 
The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Archit...
The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Archit...The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Archit...
The Hive Data Virtualization Introduction - Sanjay Krishnamurti, Chief Archit...
 
Aa proj assited-living_iot
Aa proj assited-living_iotAa proj assited-living_iot
Aa proj assited-living_iot
 
Machine Learning and Multi Drug Resistant(MDR) Infections case study
Machine Learning and Multi Drug Resistant(MDR) Infections case studyMachine Learning and Multi Drug Resistant(MDR) Infections case study
Machine Learning and Multi Drug Resistant(MDR) Infections case study
 
Data Analytics Life Cycle
Data Analytics Life CycleData Analytics Life Cycle
Data Analytics Life Cycle
 

Viewers also liked

Viewers also liked (20)

Lecture 40
Lecture 40Lecture 40
Lecture 40
 
Lecture 17
Lecture 17Lecture 17
Lecture 17
 
Lecture 27
Lecture 27Lecture 27
Lecture 27
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Lecture 16
Lecture 16Lecture 16
Lecture 16
 
Lecture 31
Lecture 31Lecture 31
Lecture 31
 
Lecture 32
Lecture 32Lecture 32
Lecture 32
 
Lecture 20
Lecture 20Lecture 20
Lecture 20
 
Lecture 26
Lecture 26Lecture 26
Lecture 26
 
Lecture 30
Lecture 30Lecture 30
Lecture 30
 
Lecture 38
Lecture 38Lecture 38
Lecture 38
 
Lecture 18
Lecture 18Lecture 18
Lecture 18
 
Lecture 29
Lecture 29Lecture 29
Lecture 29
 
Lecture 5
Lecture 5Lecture 5
Lecture 5
 
Lecture 35
Lecture 35Lecture 35
Lecture 35
 
Lecture 33
Lecture 33Lecture 33
Lecture 33
 
Lecture 34
Lecture 34Lecture 34
Lecture 34
 
Lecture 37
Lecture 37Lecture 37
Lecture 37
 
Lecture 7
Lecture 7Lecture 7
Lecture 7
 

Similar to Lecture 21

Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBigDataExpo
 
Reframing the Value Proposition and Proposed Value of Information Quality
Reframing the Value Proposition and Proposed Value of Information QualityReframing the Value Proposition and Proposed Value of Information Quality
Reframing the Value Proposition and Proposed Value of Information QualityIAIDQ Community
 
10 Steps for Taking Control of Your Organization's Digital Debris
10 Steps for Taking Control of Your Organization's Digital Debris 10 Steps for Taking Control of Your Organization's Digital Debris
10 Steps for Taking Control of Your Organization's Digital Debris Perficient, Inc.
 
John Mancini's Predictions for Information Management in 2015
John Mancini's Predictions for Information Management in 2015John Mancini's Predictions for Information Management in 2015
John Mancini's Predictions for Information Management in 2015AIIM International
 
Data quality management Basic
Data quality management BasicData quality management Basic
Data quality management BasicKhaled Mosharraf
 
Developing A Universal Approach to Cleansing Customer and Product Data
Developing A Universal Approach to Cleansing Customer and Product DataDeveloping A Universal Approach to Cleansing Customer and Product Data
Developing A Universal Approach to Cleansing Customer and Product DataFindWhitePapers
 
PROJECT softwares (28 May 14)
PROJECT softwares (28 May 14)PROJECT softwares (28 May 14)
PROJECT softwares (28 May 14)Preeti Sirohi
 
A Hitchhiker's Guide to Data Quality_20150331
A Hitchhiker's Guide to Data Quality_20150331A Hitchhiker's Guide to Data Quality_20150331
A Hitchhiker's Guide to Data Quality_20150331Tatiana Stebakova
 
From Compliance to Customer 360: Winning with Data Quality & Data Governance
From Compliance to Customer 360: Winning with Data Quality & Data GovernanceFrom Compliance to Customer 360: Winning with Data Quality & Data Governance
From Compliance to Customer 360: Winning with Data Quality & Data GovernancePrecisely
 
CDO - Chief Data Officer Momentum and Trends
CDO - Chief Data Officer Momentum and TrendsCDO - Chief Data Officer Momentum and Trends
CDO - Chief Data Officer Momentum and TrendsJeffrey T. Pollock
 
Qlik wp 2021_q3_data_governance_in_the_modern_data_analytics_pipeline
Qlik wp 2021_q3_data_governance_in_the_modern_data_analytics_pipelineQlik wp 2021_q3_data_governance_in_the_modern_data_analytics_pipeline
Qlik wp 2021_q3_data_governance_in_the_modern_data_analytics_pipelineSrikanth Sharma Boddupalli
 
AI-Led-Cognitive-Data-Quality.pdf
AI-Led-Cognitive-Data-Quality.pdfAI-Led-Cognitive-Data-Quality.pdf
AI-Led-Cognitive-Data-Quality.pdfarifulislam946965
 
Cloud and business agility
Cloud and business agilityCloud and business agility
Cloud and business agilityMike ORourke
 
E outsource asia 2010
E outsource asia 2010E outsource asia 2010
E outsource asia 2010Azlan Zainal
 
Surviving the Change Agents - How Business Survive the Next Evolution
Surviving the Change Agents - How Business Survive the Next EvolutionSurviving the Change Agents - How Business Survive the Next Evolution
Surviving the Change Agents - How Business Survive the Next EvolutionKyle Lacy
 
The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallTrillium Software
 
Damo Consulting INFORMS 2015 cloud based analytics for healthcare
Damo Consulting INFORMS 2015 cloud based analytics for healthcare Damo Consulting INFORMS 2015 cloud based analytics for healthcare
Damo Consulting INFORMS 2015 cloud based analytics for healthcare Damo Consulting Inc.
 
Adrian Gonzalez, Adelante SCM – “A Supply Chain Operating Network (SCON): Com...
Adrian Gonzalez, Adelante SCM – “A Supply Chain Operating Network (SCON): Com...Adrian Gonzalez, Adelante SCM – “A Supply Chain Operating Network (SCON): Com...
Adrian Gonzalez, Adelante SCM – “A Supply Chain Operating Network (SCON): Com...Elemica
 
The Chief Data Officer's Agenda: What a CDO Needs to Know about Data Quality
The Chief Data Officer's Agenda: What a CDO Needs to Know about Data QualityThe Chief Data Officer's Agenda: What a CDO Needs to Know about Data Quality
The Chief Data Officer's Agenda: What a CDO Needs to Know about Data QualityDATAVERSITY
 

Similar to Lecture 21 (20)

Big Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data QualityBig Data Expo 2015 - Trillium software Big Data and the Data Quality
Big Data Expo 2015 - Trillium software Big Data and the Data Quality
 
Reframing the Value Proposition and Proposed Value of Information Quality
Reframing the Value Proposition and Proposed Value of Information QualityReframing the Value Proposition and Proposed Value of Information Quality
Reframing the Value Proposition and Proposed Value of Information Quality
 
10 Steps for Taking Control of Your Organization's Digital Debris
10 Steps for Taking Control of Your Organization's Digital Debris 10 Steps for Taking Control of Your Organization's Digital Debris
10 Steps for Taking Control of Your Organization's Digital Debris
 
John Mancini's Predictions for Information Management in 2015
John Mancini's Predictions for Information Management in 2015John Mancini's Predictions for Information Management in 2015
John Mancini's Predictions for Information Management in 2015
 
Data quality management Basic
Data quality management BasicData quality management Basic
Data quality management Basic
 
Developing A Universal Approach to Cleansing Customer and Product Data
Developing A Universal Approach to Cleansing Customer and Product DataDeveloping A Universal Approach to Cleansing Customer and Product Data
Developing A Universal Approach to Cleansing Customer and Product Data
 
PROJECT softwares (28 May 14)
PROJECT softwares (28 May 14)PROJECT softwares (28 May 14)
PROJECT softwares (28 May 14)
 
Quality in information_security
Quality in information_securityQuality in information_security
Quality in information_security
 
A Hitchhiker's Guide to Data Quality_20150331
A Hitchhiker's Guide to Data Quality_20150331A Hitchhiker's Guide to Data Quality_20150331
A Hitchhiker's Guide to Data Quality_20150331
 
From Compliance to Customer 360: Winning with Data Quality & Data Governance
From Compliance to Customer 360: Winning with Data Quality & Data GovernanceFrom Compliance to Customer 360: Winning with Data Quality & Data Governance
From Compliance to Customer 360: Winning with Data Quality & Data Governance
 
CDO - Chief Data Officer Momentum and Trends
CDO - Chief Data Officer Momentum and TrendsCDO - Chief Data Officer Momentum and Trends
CDO - Chief Data Officer Momentum and Trends
 
Qlik wp 2021_q3_data_governance_in_the_modern_data_analytics_pipeline
Qlik wp 2021_q3_data_governance_in_the_modern_data_analytics_pipelineQlik wp 2021_q3_data_governance_in_the_modern_data_analytics_pipeline
Qlik wp 2021_q3_data_governance_in_the_modern_data_analytics_pipeline
 
AI-Led-Cognitive-Data-Quality.pdf
AI-Led-Cognitive-Data-Quality.pdfAI-Led-Cognitive-Data-Quality.pdf
AI-Led-Cognitive-Data-Quality.pdf
 
Cloud and business agility
Cloud and business agilityCloud and business agility
Cloud and business agility
 
E outsource asia 2010
E outsource asia 2010E outsource asia 2010
E outsource asia 2010
 
Surviving the Change Agents - How Business Survive the Next Evolution
Surviving the Change Agents - How Business Survive the Next EvolutionSurviving the Change Agents - How Business Survive the Next Evolution
Surviving the Change Agents - How Business Survive the Next Evolution
 
The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They Fall
 
Damo Consulting INFORMS 2015 cloud based analytics for healthcare
Damo Consulting INFORMS 2015 cloud based analytics for healthcare Damo Consulting INFORMS 2015 cloud based analytics for healthcare
Damo Consulting INFORMS 2015 cloud based analytics for healthcare
 
Adrian Gonzalez, Adelante SCM – “A Supply Chain Operating Network (SCON): Com...
Adrian Gonzalez, Adelante SCM – “A Supply Chain Operating Network (SCON): Com...Adrian Gonzalez, Adelante SCM – “A Supply Chain Operating Network (SCON): Com...
Adrian Gonzalez, Adelante SCM – “A Supply Chain Operating Network (SCON): Com...
 
The Chief Data Officer's Agenda: What a CDO Needs to Know about Data Quality
The Chief Data Officer's Agenda: What a CDO Needs to Know about Data QualityThe Chief Data Officer's Agenda: What a CDO Needs to Know about Data Quality
The Chief Data Officer's Agenda: What a CDO Needs to Know about Data Quality
 

More from Shani729

Python tutorialfeb152012
Python tutorialfeb152012Python tutorialfeb152012
Python tutorialfeb152012Shani729
 
Python tutorial
Python tutorialPython tutorial
Python tutorialShani729
 
Interaction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interactionInteraction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interactionShani729
 
Fm lecturer 13(final)
Fm lecturer 13(final)Fm lecturer 13(final)
Fm lecturer 13(final)Shani729
 
Lecture slides week14-15
Lecture slides week14-15Lecture slides week14-15
Lecture slides week14-15Shani729
 
Frequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodFrequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodShani729
 
Dwh lecture slides-week15
Dwh lecture slides-week15Dwh lecture slides-week15
Dwh lecture slides-week15Shani729
 
Dwh lecture slides-week10
Dwh lecture slides-week10Dwh lecture slides-week10
Dwh lecture slides-week10Shani729
 
Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8Shani729
 
Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Shani729
 
Dwh lecture slides-week3&4
Dwh lecture slides-week3&4Dwh lecture slides-week3&4
Dwh lecture slides-week3&4Shani729
 
Dwh lecture slides-week2
Dwh lecture slides-week2Dwh lecture slides-week2
Dwh lecture slides-week2Shani729
 
Dwh lecture slides-week1
Dwh lecture slides-week1Dwh lecture slides-week1
Dwh lecture slides-week1Shani729
 
Dwh lecture slides-week 13
Dwh lecture slides-week 13Dwh lecture slides-week 13
Dwh lecture slides-week 13Shani729
 
Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13Shani729
 
Data warehousing and mining furc
Data warehousing and mining furcData warehousing and mining furc
Data warehousing and mining furcShani729
 
Lecture 39
Lecture 39Lecture 39
Lecture 39Shani729
 
Lecture 36
Lecture 36Lecture 36
Lecture 36Shani729
 
Lecture 28
Lecture 28Lecture 28
Lecture 28Shani729
 

More from Shani729 (19)

Python tutorialfeb152012
Python tutorialfeb152012Python tutorialfeb152012
Python tutorialfeb152012
 
Python tutorial
Python tutorialPython tutorial
Python tutorial
 
Interaction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interactionInteraction design _beyond_human_computer_interaction
Interaction design _beyond_human_computer_interaction
 
Fm lecturer 13(final)
Fm lecturer 13(final)Fm lecturer 13(final)
Fm lecturer 13(final)
 
Lecture slides week14-15
Lecture slides week14-15Lecture slides week14-15
Lecture slides week14-15
 
Frequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth methodFrequent itemset mining using pattern growth method
Frequent itemset mining using pattern growth method
 
Dwh lecture slides-week15
Dwh lecture slides-week15Dwh lecture slides-week15
Dwh lecture slides-week15
 
Dwh lecture slides-week10
Dwh lecture slides-week10Dwh lecture slides-week10
Dwh lecture slides-week10
 
Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8Dwh lecture slidesweek7&8
Dwh lecture slidesweek7&8
 
Dwh lecture slides-week5&6
Dwh lecture slides-week5&6Dwh lecture slides-week5&6
Dwh lecture slides-week5&6
 
Dwh lecture slides-week3&4
Dwh lecture slides-week3&4Dwh lecture slides-week3&4
Dwh lecture slides-week3&4
 
Dwh lecture slides-week2
Dwh lecture slides-week2Dwh lecture slides-week2
Dwh lecture slides-week2
 
Dwh lecture slides-week1
Dwh lecture slides-week1Dwh lecture slides-week1
Dwh lecture slides-week1
 
Dwh lecture slides-week 13
Dwh lecture slides-week 13Dwh lecture slides-week 13
Dwh lecture slides-week 13
 
Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13Dwh lecture slides-week 12&13
Dwh lecture slides-week 12&13
 
Data warehousing and mining furc
Data warehousing and mining furcData warehousing and mining furc
Data warehousing and mining furc
 
Lecture 39
Lecture 39Lecture 39
Lecture 39
 
Lecture 36
Lecture 36Lecture 36
Lecture 36
 
Lecture 28
Lecture 28Lecture 28
Lecture 28
 

Recently uploaded

Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGSIVASHANKAR N
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 

Recently uploaded (20)

Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTINGMANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
MANUFACTURING PROCESS-II UNIT-1 THEORY OF METAL CUTTING
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur EscortsRussian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
Russian Call Girls in Nagpur Grishma Call 7001035870 Meet With Nagpur Escorts
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 

Lecture 21

  • 1. DWH-Ahsan AbdullahDWH-Ahsan Abdullah 11 Data WarehousingData Warehousing Lecture-21Lecture-21 Introduction to Data Quality Management (DQM)Introduction to Data Quality Management (DQM) Virtual University of PakistanVirtual University of Pakistan Ahsan Abdullah Assoc. Prof. & Head Center for Agro-Informatics Research www.nu.edu.pk/cairindex.asp National University of Computers & Emerging Sciences, Islamabad Email: ahsan101@yahoo.com
  • 2. DWH-Ahsan Abdullah 2 Introduction to Data QualityIntroduction to Data Quality Management (DQM)Management (DQM)
  • 3. DWH-Ahsan Abdullah 3 What is Quality? InformallyWhat is Quality? Informally Some things are better than others i.e. they are ofSome things are better than others i.e. they are of higher quality. How much “better” is better?higher quality. How much “better” is better? Is the right item the best item to purchase? HowIs the right item the best item to purchase? How about after the purchase?about after the purchase? What is quality of service? The bank exampleWhat is quality of service? The bank example
  • 4. DWH-Ahsan Abdullah 4 What is Quality? FormallyWhat is Quality? Formally “Quality is conformance to requirements” P. Crosby, “Quality is Free” 1979 “Degree of excellence” Webster’s Third New International Dictionary
  • 5. DWH-Ahsan Abdullah 5 What is Quality? Examples from Auto IndustryWhat is Quality? Examples from Auto Industry Quality means meeting customer’s needs, not necessarily exceeding them. Quality means improving things customers care about, because that makes their lives easier and more comfortable. Why example from auto-industry?
  • 6. DWH-Ahsan Abdullah 6 What is Data Quality?What is Data Quality? Muhammad Khan Height = 5’8” Weight = 160 lbs Gender = Male Age = 35 yrs Emp_ID = 440 All data is an abstraction of something real What is Data? Note Change the picture
  • 7. DWH-Ahsan Abdullah 7 What is Data Quality?What is Data Quality? Intrinsic Data Quality Electronic reproduction of reality. Realistic Data Quality Degree of utility or value of data to business.
  • 8. DWH-Ahsan Abdullah 8 Data Quality & OrganizationsData Quality & Organizations Intelligent Learning Organization: High-quality data is an open, shared resource with value- adding processes. The dysfunctional learning organization: Low-quality data is a proprietary resource with cost-adding processes. {Comment: Put picture of person in water holding round tube with data written on it}
  • 9. DWH-Ahsan Abdullah 9 Law #1 - “Data that is not used cannot be correct!” Law #2 - “Data quality is a function of its use, not its collection!” Law #3 - “Data will be no better than its most stringent use!” Law #4 - “Data quality problems increase with the age of the system!” Law #5 – “The less likely something is to occur, the more traumatic it will be when it happens!” Orr’s Laws of Data QualityOrr’s Laws of Data Quality
  • 10. DWH-Ahsan Abdullah 10 Total Quality Control (TQM)Total Quality Control (TQM) Philosophy of involving all forPhilosophy of involving all for systematicsystematic andand continuouscontinuous improvement.improvement. It is customer oriented. Why?It is customer oriented. Why? TQM incorporates the concept of product quality,TQM incorporates the concept of product quality, process control, quality assurance, and qualityprocess control, quality assurance, and quality improvement.improvement. Quality assurance isQuality assurance is NOTNOT Quality improvementQuality improvement
  • 11. DWH-Ahsan Abdullah 11 Co$t of fixing data qualityCo$t of fixing data quality Lowest Quality Highest quality Costofachievingquality  Defect minimization is economical.  Defect elimination is very very expensive. Exponential rise in cost
  • 12. DWH-Ahsan Abdullah 12 Co$t of Data Quality DefectsCo$t of Data Quality Defects  Controllable CostsControllable Costs  Recurring costs for analyzing, correcting, and preventingRecurring costs for analyzing, correcting, and preventing data errorsdata errors  Resultant CostsResultant Costs  Internal and external failure costs of business opportunitiesInternal and external failure costs of business opportunities missed.missed.  Equipment & Training CostsEquipment & Training Costs
  • 13. DWH-Ahsan Abdullah 13 Where data quality is critical?Where data quality is critical? Almost everywhere, some examples:Almost everywhere, some examples: Marketing communications.Marketing communications. Customer matching.Customer matching. Retail house-holding.Retail house-holding. Combining MIS systems after acquisition.Combining MIS systems after acquisition.
  • 14. DWH-Ahsan Abdullah 14 Characteristics or Dimensions of Data QualityCharacteristics or Dimensions of Data Quality Data Quality Characteristic Definition Accuracy Qualitatively assessing lack of error, high accuracy corresponding to small error. Completeness The degree to which values are present in the attributes that require them.
  • 15. DWH-Ahsan Abdullah 15 Completeness Vs AccuracyCompleteness Vs Accuracy 95% accurate and 100% complete OR 100% accurate and 95% complete Which is better? Depends on data quality (i) tolerances,Depends on data quality (i) tolerances, the (ii) corresponding application and the (iii) cost ofthe (ii) corresponding application and the (iii) cost of achieving that data quality vs. the (iv) business value.achieving that data quality vs. the (iv) business value.
  • 16. DWH-Ahsan Abdullah 16 Characteristics or Dimensions of Data QualityCharacteristics or Dimensions of Data Quality Data Quality Characteristic Definition Consistency A measure of the degree to which a set of data satisfies a set of constraints. Timeliness A measure of how current or up to date the data is. Uniqueness The state of being only one of its kind or being without an equal or parallel. Interpretability The extent to which data is in appropriate languages, symbols, and units, and the definitions are clear. Accessibility The extent to which data is available, or easily and quickly retrievable Objectivity The extent to which data is unbiased, unprejudiced, and impartial

Editor's Notes

  1. <number>
  2. <number>