SlideShare a Scribd company logo
1 of 40
1
QUALITYQUALITY
OFOF
DATADATA
2
LEARNING OBJECTIVESLEARNING OBJECTIVES
Realise importance of correct data for
program management
Realise distinction between random data
errors and falsified data
Understand causes of poor data quality
Being able to check data quality through
supervision and review of reports
Learn when and how to correct erroneous
data
3
Occurrence & importance of errorsOccurrence & importance of errors
In business context:
 Error rates of 1-5 % are not exceptional
 Estimated cost ≈ 10 % of revenue
 Problems with data quality ↑ when data originate from
multiple sources
 After initial enthusiasm to improve data quality, focus
on data quality generally slowly fades
In disease control context
 Error occurrence?
 Impact on program performance?
 Checking of errors: limited effort
4
Errors in RNTCP?Errors in RNTCP?
Based on pre-test carried in all countries:
Real possibility of errors in subdistrict
reports
Minor possibility in district reports
Little attention to checking for errors
5
Errors identified in 1039 TB patients
cohort review method in NY city: Munsiff et all IJTLD, 2006, 10 : 1133-9
• 41% of cases presented errors
• multiple errors per patient: 596 / 424 = 1.4
• What kind of errors?
- program info errors 55 %
- patient related errors 45 %
NB. Error rates in HMIS > 50 %
Gillies A. Methods Inf Med 2000, 39 : 208-12
6
DataData qualityquality: definitiondefinition
The state of
validity,validity,
reliability,reliability,
consistency,consistency,
timelinesstimeliness
and completenessand completeness
making data appropriate for a specific use
Problems with data quality do not only arise from
incorrect data
Inconsistent data is a problem as well
7
Data Quality ~ ManagementData Quality ~ Management
Quality AssuranceQuality Assurance
Activities to ensure
quality before data
collection
Quality AssuranceQuality Assurance
Activities to ensure
quality before data
collection
Quality ControlQuality Control
Monitoring and
maintaining quality of
data during RNTCP
implementation
Quality ControlQuality Control
Monitoring and
maintaining quality of
data during RNTCP
implementation
Data managementData management
Handling and analysis of data throughout
the RNTCP surveillance
Data managementData management
Handling and analysis of data throughout
the RNTCP surveillance
8
Quality assurance & controlQuality assurance & control
Quality assurance Quality control
- anticipates problems before they occur - responds to observed problems
- uses all available information to generate
improvements
- uses ongoing measurements to make
decisions on the processes or products
- is not tied to a specific quality standard - requires a pre-specified quality standard
for comparability
- is applicable mostly at the planning stage - is applicable mostly at the processing
stage
- is all-encompassing in its activities - is a set procedure that is a subset of
quality assurance
9
Quality controlQuality control
Quality control is a regulatory procedure
through which we:
 measure quality
 compare quality with pre-set standards
 act on the differences
The objective of quality control is to achieve
a given quality level with minimum cost
(ex. EQA sampling)
10
Dimensions of data qualityDimensions of data quality
1. Intrinsic data quality
 accuracy (validity and reliability)
2. Contextual data quality
 relevant
 timely
 complete
3. Representational data quality
 interpretability, easy to understand
4. Accessibility data quality
 accessibility, security
11
Intrinsic data qualityIntrinsic data quality
ACCURACYACCURACY
Exact conformity to the true value
WHY IMPORTANT?
Accurate data = precondition for
accurate decisions!!
Two concepts: validityvalidity and reliabilityreliability
QUESTION: is this guaranteed?
12
ValidityValidity
= the degree to which
a measurement reflects the truth
There should be no systematic error or bias
What is a valid sputum result for an open TB case?What is a valid sputum result for an open TB case?
A result is valid if it corresponds to the true value!A result is valid if it corresponds to the true value!
Open TB case = sputum positive!!Open TB case = sputum positive!!
13
ReliabilityReliability
The degree to which a measurement gives the same
result:
 each time it is used under the same condition
 with the same subject
A necessary but not sufficient condition for validity
because one can make the same errors twice
Reliability = repeatibility of measurementsReliability = repeatibility of measurements
Reliability is inversely related toReliability is inversely related to random errorrandom error
14
Dimensions of data qualityDimensions of data quality
1. Intrinsic data quality
 accuracy
2. Contextual data quality
 relevant
 timely
 complete
15
RELEVANCERELEVANCE
(usefulness)(usefulness)
Reflects the degree to which information
meets the real needs of clients.
Is concerned with whether the available
information sheds light on the issues that are
important to users.
16
RELEVANCERELEVANCE
A good information source should include all
relevant content and exclude all irrelevant content.
. Decision making for RNTCP management
Relevant for what?
.
Assessing relevance is subjective and depends upon the
varying needs of users!
17
TIMELINESSTIMELINESS
Refers to the moment data are compiled,
reported and analysed
Given RNTCP’s normalization of the data reporting
system, timeliness is not a major issue in India.
But it could be an issue in remote areas and in PPM
18
COMPLETENESSCOMPLETENESS
No missing data (records, items)
All data fields that have to be filled up,
should indeed contain data.
QUESTION: does this presently happen??
19
Missing records
• Annual report 2001 NTP Bangladesh
Reports DOTS areas non DOTS areas
-------------------------------------------------------
Received 2230 180
Missing 59 4
% missing 3% 2%
20
Dimensions of data qualityDimensions of data quality
1. Intrinsic data quality
 accuracy
2. Contextual data quality
 relevant
 timely
 complete
3. Representational data quality
 interpretability, easy to understand
21
Representational data qualityRepresentational data quality
Interpretability
Data must be in appropriate language and
units, and the data definitions must be
clear to all (language, jargon, concepts)
Ease of understanding
Data must be clear, without ambiguity, and
easily comprehended.
22
Dimensions of data qualityDimensions of data quality
1. Intrinsic data quality
 accuracy
2. Contextual data quality
 relevant
 timely
 complete
3. Representational data quality
 interpretability, easy to understand
4. Accessibility data quality
 accessibility, security
23
ACCESSIBILITYACCESSIBILITY
Essential element of any data quality assessment.Essential element of any data quality assessment.
If data is not accessible, then it has little or no valueIf data is not accessible, then it has little or no value..If data is not accessible, then it has little or no valueIf data is not accessible, then it has little or no value..
Accessibility = precondition for use, but no guarantee for use!
Data items should be easily obtainable and legal to collect.Data items should be easily obtainable and legal to collect.
In computer era, guidelines have to be established for whoIn computer era, guidelines have to be established for who
may access which datamay access which data
24
SECURITYSECURITY
The protection of data from:
☞unauthorized modification (accidental or
intentional)
☞equipment malfunction (computer crash),
☞natural disasters (fire, tsunami..) and crime
Be aware!
Security threats are more serious when HMIS is
computerized:
 unauthorized access to data
 damage to files (viruses…)
Be aware!
Security threats are more serious when HMIS is
computerized:
 unauthorized access to data
 damage to files (viruses…)
25
Data management covers the whole process, starting from data
recording to transcription, compilation, analysis & interpretation,
reporting, feedback and use.
TB CENTRE
(OPD or lab)
TB CENTRE
(OPD or lab)
TRANSCRIPTIONTRANSCRIPTIONRECORDINGRECORDING
COMPILATIONCOMPILATION
ANALYSIS
& INTERPRETATION
ANALYSIS
& INTERPRETATION
REPORTINGREPORTING
FEEDBACK & USEFEEDBACK & USE
26
Where can errors occur?Where can errors occur?
At each step, especially during:
Data recording
Manual data transcription
Data compilation
Data entry in computer
Analysis
Interpretation
27
Step in data flow Source of error
Data recording Information not registered
Wrong information (wrong address, etc )
Right information wrongly entered (in the wrong
place)
Missing records
Data compilation Wrong counts
Missed reports
Duplicate counting
Compterised data
entry
Wrong entry
Partial entry
Partial entry of records
Template based
computerised data
analysis
nil
28
Prevention of data errorsPrevention of data errors
 clarity of the instructions
 training and motivation of the staff
 honesty of the staff
 user-friendliness of the data supports,
such as data forms and templates
 supervision
29
Prevention of data errorsPrevention of data errors
 computerized data handling :
 improves the accuracy of the data
 prevents processing and analysis errors
 makes fudging less easy, once the data
have been entered in the computer
 use of independent double entry techniques
(and checking of inconsistencies between
the 2 entries)
 data entry formatted to acceptable ranges
and modalities only
30
How to proceed with the dataHow to proceed with the data
verification?verification?
1. Be alert
2. Routine checking of data
3. Quarterly report checking
31
BE ALERT!BE ALERT!
 Registers that look meticulously clean
 All data entered with the same pen
 Lack of variation / identically results every quarter
 A too nice performance:
 absence of initial defaulters
 too low death rates
 too high cure rates
 absence of defaulting in IP …
Be alert to the likelihood of intentional
falsification of data!!!
Do not accept data without checking
their veracity!!!
32
How to proceed with the dataHow to proceed with the data
verification?verification?
 Routine checking of the data through
supervision
 Completeness checking
 Consistency checking
 Quarterly report checking
 Range checking
 Modality checking
33
Completeness checkingCompleteness checking
Completeness of report = all data have been reported!
A minimal completeness check verifies if
all variables contain data.
A minimal completeness check verifies if
all variables contain data.
Example:
200 NSP cases and age information only for 187 casesInformation is incomplete!
How to solve?
Verify via the original reports.
34
Consistency checkingConsistency checking
Checks whether the values of data items are concordant
Example: CAT III and Sputum+
How to check for inconsistencies?How to check for inconsistencies?
By cross tabulation
CAT Sputum result
SP+ SP-
CAT I 1162 114
CAT II 300 148
CAT III 16 103016
Contradiction
35
Range checkingRange checking
Any method of detecting whether a quantitative
variable is within an acceptable range
Example 1: Height of an adult patient
Acceptable range = 1.00 m to 2,00 m
3.00 m is impossible
0.98 is possible, but needs verification
Example 2: Age of an adult patient
Acceptable range =15 to 100 years
150 years is impossible
Any “impossible” or “out of range” value should
be verified via the original record or the patient.
Any “impossible” or “out of range” value should
be verified via the original record or the patient.
36
Modality checkingModality checking
The data of a qualitative variable are classified in
groups or modalities.
Each data should belong to one modality only
Example : Sex
Two modalities: Male or Female
Other values are impossible!
“Not known” is sometimes entered but is not a valid
modality and should be verified and corrected!
37
Correction of errorsCorrection of errors
ERROR ERRORS ??
Go back to the original data source.
But what if the original data source is erroneous?
The best method is to go back to a previous step in
the data flow, and verify patient records, lab records,
etc.
 If correct data found, then modify the erroneous data
 If correct data not found, then report as “missing”.
38
Errors in dataErrors in data
Risk for wrong decisionsRisk for wrong decisions
Information has to be of good quality
• correct data
• correct data processing
ValidValid
ReliableReliable
CompleteComplete
ConsistentConsistent
TimelyTimely
39
Erroneous dataErroneous data
BadBad informationinformation
WrongWrong decisionsdecisions
Appropriate actions??Appropriate actions??
40
Don’t forget : there is more room for
error than shown in this picture

More Related Content

What's hot

Technology Assessment and Refinement for Its Adoption
Technology Assessment and  Refinement for Its AdoptionTechnology Assessment and  Refinement for Its Adoption
Technology Assessment and Refinement for Its AdoptionManoj Sharma
 
Seven steps for Use Routine Information to Improve HIV/AIDS Program_Snyder_5....
Seven steps for Use Routine Information to Improve HIV/AIDS Program_Snyder_5....Seven steps for Use Routine Information to Improve HIV/AIDS Program_Snyder_5....
Seven steps for Use Routine Information to Improve HIV/AIDS Program_Snyder_5....CORE Group
 
Quality improvement in reproductive, maternal, newborn and child health
Quality improvement in reproductive, maternal, newborn and child healthQuality improvement in reproductive, maternal, newborn and child health
Quality improvement in reproductive, maternal, newborn and child healthREACHOUTCONSORTIUMSLIDES
 
Roadmap to next generation digital lab
Roadmap to next generation digital labRoadmap to next generation digital lab
Roadmap to next generation digital labStephan Gürtler
 
Measurement Control Risk Based Test Cases Activities Latw09
Measurement Control Risk Based Test Cases Activities Latw09Measurement Control Risk Based Test Cases Activities Latw09
Measurement Control Risk Based Test Cases Activities Latw09Júlio Venâncio
 
Modern quality systems in pharmaceutical education and industries
Modern quality systems in pharmaceutical education and industriesModern quality systems in pharmaceutical education and industries
Modern quality systems in pharmaceutical education and industriesKoshish Gabhane
 
3rd alex marketing club (pharmaceutical forecasting) dr. ahmed sham'a
3rd  alex marketing club (pharmaceutical forecasting) dr. ahmed sham'a3rd  alex marketing club (pharmaceutical forecasting) dr. ahmed sham'a
3rd alex marketing club (pharmaceutical forecasting) dr. ahmed sham'aMahmoud Bahgat
 
Risk Assessment: Approach to enhance Network Security
Risk Assessment: Approach to enhance Network SecurityRisk Assessment: Approach to enhance Network Security
Risk Assessment: Approach to enhance Network SecurityIJCSIS Research Publications
 
Opportunities for data analytics in power generation affelt 2016
Opportunities for data analytics in power generation affelt 2016Opportunities for data analytics in power generation affelt 2016
Opportunities for data analytics in power generation affelt 2016Scott Affelt
 
Foundational Methodology for Data Science
Foundational Methodology for Data ScienceFoundational Methodology for Data Science
Foundational Methodology for Data ScienceJohn B. Rollins, Ph.D.
 
Jumpstart Success in Your Supply Chain: How Data Science and Modeling Can Sup...
Jumpstart Success in Your Supply Chain: How Data Science and Modeling Can Sup...Jumpstart Success in Your Supply Chain: How Data Science and Modeling Can Sup...
Jumpstart Success in Your Supply Chain: How Data Science and Modeling Can Sup...Aggregage
 
Evaluation in Audio Music Similarity
Evaluation in Audio Music SimilarityEvaluation in Audio Music Similarity
Evaluation in Audio Music SimilarityJulián Urbano
 
MAT 510 Inspiring Innovation/tutorialrank.com
 MAT 510 Inspiring Innovation/tutorialrank.com MAT 510 Inspiring Innovation/tutorialrank.com
MAT 510 Inspiring Innovation/tutorialrank.comjonhson139
 
MAT 510 Great Stories /newtonhelp.com
MAT 510 Great Stories /newtonhelp.comMAT 510 Great Stories /newtonhelp.com
MAT 510 Great Stories /newtonhelp.combellflower184
 
Industrial Internet
Industrial InternetIndustrial Internet
Industrial InternetDeepam Goyal
 
Mb0047 management information system
Mb0047  management information systemMb0047  management information system
Mb0047 management information systemsmumbahelp
 
A REVIEW ON PREDICTIVE ANALYTICS IN DATA MINING
A REVIEW ON PREDICTIVE ANALYTICS IN DATA MININGA REVIEW ON PREDICTIVE ANALYTICS IN DATA MINING
A REVIEW ON PREDICTIVE ANALYTICS IN DATA MININGijccmsjournal
 

What's hot (20)

Technology Assessment and Refinement for Its Adoption
Technology Assessment and  Refinement for Its AdoptionTechnology Assessment and  Refinement for Its Adoption
Technology Assessment and Refinement for Its Adoption
 
Seven steps for Use Routine Information to Improve HIV/AIDS Program_Snyder_5....
Seven steps for Use Routine Information to Improve HIV/AIDS Program_Snyder_5....Seven steps for Use Routine Information to Improve HIV/AIDS Program_Snyder_5....
Seven steps for Use Routine Information to Improve HIV/AIDS Program_Snyder_5....
 
Quality improvement in reproductive, maternal, newborn and child health
Quality improvement in reproductive, maternal, newborn and child healthQuality improvement in reproductive, maternal, newborn and child health
Quality improvement in reproductive, maternal, newborn and child health
 
Roadmap to next generation digital lab
Roadmap to next generation digital labRoadmap to next generation digital lab
Roadmap to next generation digital lab
 
Measurement Control Risk Based Test Cases Activities Latw09
Measurement Control Risk Based Test Cases Activities Latw09Measurement Control Risk Based Test Cases Activities Latw09
Measurement Control Risk Based Test Cases Activities Latw09
 
Modern quality systems in pharmaceutical education and industries
Modern quality systems in pharmaceutical education and industriesModern quality systems in pharmaceutical education and industries
Modern quality systems in pharmaceutical education and industries
 
3rd alex marketing club (pharmaceutical forecasting) dr. ahmed sham'a
3rd  alex marketing club (pharmaceutical forecasting) dr. ahmed sham'a3rd  alex marketing club (pharmaceutical forecasting) dr. ahmed sham'a
3rd alex marketing club (pharmaceutical forecasting) dr. ahmed sham'a
 
Risk Assessment: Approach to enhance Network Security
Risk Assessment: Approach to enhance Network SecurityRisk Assessment: Approach to enhance Network Security
Risk Assessment: Approach to enhance Network Security
 
Data analysis
Data analysisData analysis
Data analysis
 
Opportunities for data analytics in power generation affelt 2016
Opportunities for data analytics in power generation affelt 2016Opportunities for data analytics in power generation affelt 2016
Opportunities for data analytics in power generation affelt 2016
 
Foundational Methodology for Data Science
Foundational Methodology for Data ScienceFoundational Methodology for Data Science
Foundational Methodology for Data Science
 
Jumpstart Success in Your Supply Chain: How Data Science and Modeling Can Sup...
Jumpstart Success in Your Supply Chain: How Data Science and Modeling Can Sup...Jumpstart Success in Your Supply Chain: How Data Science and Modeling Can Sup...
Jumpstart Success in Your Supply Chain: How Data Science and Modeling Can Sup...
 
Evaluation in Audio Music Similarity
Evaluation in Audio Music SimilarityEvaluation in Audio Music Similarity
Evaluation in Audio Music Similarity
 
MAT 510 Inspiring Innovation/tutorialrank.com
 MAT 510 Inspiring Innovation/tutorialrank.com MAT 510 Inspiring Innovation/tutorialrank.com
MAT 510 Inspiring Innovation/tutorialrank.com
 
MAT 510 Great Stories /newtonhelp.com
MAT 510 Great Stories /newtonhelp.comMAT 510 Great Stories /newtonhelp.com
MAT 510 Great Stories /newtonhelp.com
 
Data Quality
Data QualityData Quality
Data Quality
 
Industrial Internet
Industrial InternetIndustrial Internet
Industrial Internet
 
RMMM Plan
RMMM PlanRMMM Plan
RMMM Plan
 
Mb0047 management information system
Mb0047  management information systemMb0047  management information system
Mb0047 management information system
 
A REVIEW ON PREDICTIVE ANALYTICS IN DATA MINING
A REVIEW ON PREDICTIVE ANALYTICS IN DATA MININGA REVIEW ON PREDICTIVE ANALYTICS IN DATA MINING
A REVIEW ON PREDICTIVE ANALYTICS IN DATA MINING
 

Similar to Data verification slides bangalore to t (4)

sources of data.ppt
sources of data.pptsources of data.ppt
sources of data.pptTeenaPS1
 
Developing Protocols & Procedures for CT Data Integrity
Developing Protocols & Procedures for CT Data Integrity Developing Protocols & Procedures for CT Data Integrity
Developing Protocols & Procedures for CT Data Integrity Bhaswat Chakraborty
 
Data Quality Presentation.ppt
Data Quality Presentation.pptData Quality Presentation.ppt
Data Quality Presentation.pptmusa_s
 
How To Optimize Your EDC Solution For Risk Based Monitoring
How To Optimize Your EDC Solution For Risk Based MonitoringHow To Optimize Your EDC Solution For Risk Based Monitoring
How To Optimize Your EDC Solution For Risk Based Monitoringwww.datatrak.com
 
Assessing M&E Systems For Data Quality
Assessing M&E Systems For Data QualityAssessing M&E Systems For Data Quality
Assessing M&E Systems For Data QualityMEASURE Evaluation
 
Review of the Implications of Uploading Unverified Dataset in A Data Banking ...
Review of the Implications of Uploading Unverified Dataset in A Data Banking ...Review of the Implications of Uploading Unverified Dataset in A Data Banking ...
Review of the Implications of Uploading Unverified Dataset in A Data Banking ...ssuser793b4e
 
Best Practices to Risk Based Data Integrity at Data Integrity Conference, Lon...
Best Practices to Risk Based Data Integrity at Data Integrity Conference, Lon...Best Practices to Risk Based Data Integrity at Data Integrity Conference, Lon...
Best Practices to Risk Based Data Integrity at Data Integrity Conference, Lon...Bhaswat Chakraborty
 
Data Cleaning and Quality Control: Techniques and Challenges
Data Cleaning and Quality Control: Techniques and ChallengesData Cleaning and Quality Control: Techniques and Challenges
Data Cleaning and Quality Control: Techniques and ChallengesClinosolIndia
 
Clinical data-management-overview
Clinical data-management-overviewClinical data-management-overview
Clinical data-management-overviewAcri India
 
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...Health Catalyst
 
How do you assess the quality and reliability of data sources in data analysi...
How do you assess the quality and reliability of data sources in data analysi...How do you assess the quality and reliability of data sources in data analysi...
How do you assess the quality and reliability of data sources in data analysi...Soumodeep Nanee Kundu
 
Retina Today (Nov-Dec 2014): The Clinical Data Management Process
Retina Today (Nov-Dec 2014): The Clinical Data Management ProcessRetina Today (Nov-Dec 2014): The Clinical Data Management Process
Retina Today (Nov-Dec 2014): The Clinical Data Management ProcessStatistics & Data Corporation
 
Keeping up with ICH E6(R2): Risk-Based Monitoring (RBM) Redefined
Keeping up with ICH E6(R2): Risk-Based Monitoring (RBM) RedefinedKeeping up with ICH E6(R2): Risk-Based Monitoring (RBM) Redefined
Keeping up with ICH E6(R2): Risk-Based Monitoring (RBM) RedefinedLife Sciences Network marcus evans
 
Survival Guide: Taming the Data Quality Beast
Survival Guide: Taming the Data Quality BeastSurvival Guide: Taming the Data Quality Beast
Survival Guide: Taming the Data Quality BeastTechWell
 
Machine Learning Approaches and its Challenges
Machine Learning Approaches and its ChallengesMachine Learning Approaches and its Challenges
Machine Learning Approaches and its Challengesijcnes
 

Similar to Data verification slides bangalore to t (4) (20)

sources of data.ppt
sources of data.pptsources of data.ppt
sources of data.ppt
 
Developing Protocols & Procedures for CT Data Integrity
Developing Protocols & Procedures for CT Data Integrity Developing Protocols & Procedures for CT Data Integrity
Developing Protocols & Procedures for CT Data Integrity
 
Data Quality Presentation.ppt
Data Quality Presentation.pptData Quality Presentation.ppt
Data Quality Presentation.ppt
 
Data Quality Presentation.ppt
Data Quality Presentation.pptData Quality Presentation.ppt
Data Quality Presentation.ppt
 
How To Optimize Your EDC Solution For Risk Based Monitoring
How To Optimize Your EDC Solution For Risk Based MonitoringHow To Optimize Your EDC Solution For Risk Based Monitoring
How To Optimize Your EDC Solution For Risk Based Monitoring
 
dimensions_of_data_quality.pptx
dimensions_of_data_quality.pptxdimensions_of_data_quality.pptx
dimensions_of_data_quality.pptx
 
Assessing M&E Systems For Data Quality
Assessing M&E Systems For Data QualityAssessing M&E Systems For Data Quality
Assessing M&E Systems For Data Quality
 
Review of the Implications of Uploading Unverified Dataset in A Data Banking ...
Review of the Implications of Uploading Unverified Dataset in A Data Banking ...Review of the Implications of Uploading Unverified Dataset in A Data Banking ...
Review of the Implications of Uploading Unverified Dataset in A Data Banking ...
 
Pmcf data quality challenges & best practices
Pmcf data quality challenges & best practicesPmcf data quality challenges & best practices
Pmcf data quality challenges & best practices
 
Best Practices to Risk Based Data Integrity at Data Integrity Conference, Lon...
Best Practices to Risk Based Data Integrity at Data Integrity Conference, Lon...Best Practices to Risk Based Data Integrity at Data Integrity Conference, Lon...
Best Practices to Risk Based Data Integrity at Data Integrity Conference, Lon...
 
Data Cleaning and Quality Control: Techniques and Challenges
Data Cleaning and Quality Control: Techniques and ChallengesData Cleaning and Quality Control: Techniques and Challenges
Data Cleaning and Quality Control: Techniques and Challenges
 
Clinical data-management-overview
Clinical data-management-overviewClinical data-management-overview
Clinical data-management-overview
 
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
Optimize Your Healthcare Data Quality Investment: Three Ways to Accelerate Ti...
 
How do you assess the quality and reliability of data sources in data analysi...
How do you assess the quality and reliability of data sources in data analysi...How do you assess the quality and reliability of data sources in data analysi...
How do you assess the quality and reliability of data sources in data analysi...
 
Retina Today (Nov-Dec 2014): The Clinical Data Management Process
Retina Today (Nov-Dec 2014): The Clinical Data Management ProcessRetina Today (Nov-Dec 2014): The Clinical Data Management Process
Retina Today (Nov-Dec 2014): The Clinical Data Management Process
 
PEDSnet DQA CHOP Symposium
PEDSnet DQA CHOP SymposiumPEDSnet DQA CHOP Symposium
PEDSnet DQA CHOP Symposium
 
Keeping up with ICH E6(R2): Risk-Based Monitoring (RBM) Redefined
Keeping up with ICH E6(R2): Risk-Based Monitoring (RBM) RedefinedKeeping up with ICH E6(R2): Risk-Based Monitoring (RBM) Redefined
Keeping up with ICH E6(R2): Risk-Based Monitoring (RBM) Redefined
 
Survival Guide: Taming the Data Quality Beast
Survival Guide: Taming the Data Quality BeastSurvival Guide: Taming the Data Quality Beast
Survival Guide: Taming the Data Quality Beast
 
Machine Learning Approaches and its Challenges
Machine Learning Approaches and its ChallengesMachine Learning Approaches and its Challenges
Machine Learning Approaches and its Challenges
 
RHINO Forum Presentation on DQR Framework
RHINO Forum Presentation on DQR FrameworkRHINO Forum Presentation on DQR Framework
RHINO Forum Presentation on DQR Framework
 

Recently uploaded

Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...Miss joya
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...astropune
 
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...narwatsonia7
 
Call Girls Service Bellary Road Just Call 7001305949 Enjoy College Girls Service
Call Girls Service Bellary Road Just Call 7001305949 Enjoy College Girls ServiceCall Girls Service Bellary Road Just Call 7001305949 Enjoy College Girls Service
Call Girls Service Bellary Road Just Call 7001305949 Enjoy College Girls Servicenarwatsonia7
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatorenarwatsonia7
 
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...Miss joya
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...indiancallgirl4rent
 
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...narwatsonia7
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiAlinaDevecerski
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableNehru place Escorts
 
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...Miss joya
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escortsaditipandeya
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
VIP Call Girls Pune Sanjana 9907093804 Short 1500 Night 6000 Best call girls ...
VIP Call Girls Pune Sanjana 9907093804 Short 1500 Night 6000 Best call girls ...VIP Call Girls Pune Sanjana 9907093804 Short 1500 Night 6000 Best call girls ...
VIP Call Girls Pune Sanjana 9907093804 Short 1500 Night 6000 Best call girls ...Miss joya
 
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Deliverynehamumbai
 
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoybabeytanya
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...Miss joya
 

Recently uploaded (20)

Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
Low Rate Call Girls Pune Esha 9907093804 Short 1500 Night 6000 Best call girl...
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
 
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...High Profile Call Girls Coimbatore Saanvi☎️  8250192130 Independent Escort Se...
High Profile Call Girls Coimbatore Saanvi☎️ 8250192130 Independent Escort Se...
 
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCREscort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
 
Call Girls Service Bellary Road Just Call 7001305949 Enjoy College Girls Service
Call Girls Service Bellary Road Just Call 7001305949 Enjoy College Girls ServiceCall Girls Service Bellary Road Just Call 7001305949 Enjoy College Girls Service
Call Girls Service Bellary Road Just Call 7001305949 Enjoy College Girls Service
 
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service CoimbatoreCall Girl Coimbatore Prisha☎️  8250192130 Independent Escort Service Coimbatore
Call Girl Coimbatore Prisha☎️ 8250192130 Independent Escort Service Coimbatore
 
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
 
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
(Rocky) Jaipur Call Girl - 9521753030 Escorts Service 50% Off with Cash ON De...
 
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...Bangalore Call Girls Hebbal Kempapura Number 7001035870  Meetin With Bangalor...
Bangalore Call Girls Hebbal Kempapura Number 7001035870 Meetin With Bangalor...
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
 
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
 
VIP Call Girls Pune Sanjana 9907093804 Short 1500 Night 6000 Best call girls ...
VIP Call Girls Pune Sanjana 9907093804 Short 1500 Night 6000 Best call girls ...VIP Call Girls Pune Sanjana 9907093804 Short 1500 Night 6000 Best call girls ...
VIP Call Girls Pune Sanjana 9907093804 Short 1500 Night 6000 Best call girls ...
 
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Servicesauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
 
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on DeliveryCall Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
Call Girls Colaba Mumbai ❤️ 9920874524 👈 Cash on Delivery
 
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night EnjoyCall Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
Call Girl Number in Vashi Mumbai📲 9833363713 💞 Full Night Enjoy
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
 

Data verification slides bangalore to t (4)

  • 2. 2 LEARNING OBJECTIVESLEARNING OBJECTIVES Realise importance of correct data for program management Realise distinction between random data errors and falsified data Understand causes of poor data quality Being able to check data quality through supervision and review of reports Learn when and how to correct erroneous data
  • 3. 3 Occurrence & importance of errorsOccurrence & importance of errors In business context:  Error rates of 1-5 % are not exceptional  Estimated cost ≈ 10 % of revenue  Problems with data quality ↑ when data originate from multiple sources  After initial enthusiasm to improve data quality, focus on data quality generally slowly fades In disease control context  Error occurrence?  Impact on program performance?  Checking of errors: limited effort
  • 4. 4 Errors in RNTCP?Errors in RNTCP? Based on pre-test carried in all countries: Real possibility of errors in subdistrict reports Minor possibility in district reports Little attention to checking for errors
  • 5. 5 Errors identified in 1039 TB patients cohort review method in NY city: Munsiff et all IJTLD, 2006, 10 : 1133-9 • 41% of cases presented errors • multiple errors per patient: 596 / 424 = 1.4 • What kind of errors? - program info errors 55 % - patient related errors 45 % NB. Error rates in HMIS > 50 % Gillies A. Methods Inf Med 2000, 39 : 208-12
  • 6. 6 DataData qualityquality: definitiondefinition The state of validity,validity, reliability,reliability, consistency,consistency, timelinesstimeliness and completenessand completeness making data appropriate for a specific use Problems with data quality do not only arise from incorrect data Inconsistent data is a problem as well
  • 7. 7 Data Quality ~ ManagementData Quality ~ Management Quality AssuranceQuality Assurance Activities to ensure quality before data collection Quality AssuranceQuality Assurance Activities to ensure quality before data collection Quality ControlQuality Control Monitoring and maintaining quality of data during RNTCP implementation Quality ControlQuality Control Monitoring and maintaining quality of data during RNTCP implementation Data managementData management Handling and analysis of data throughout the RNTCP surveillance Data managementData management Handling and analysis of data throughout the RNTCP surveillance
  • 8. 8 Quality assurance & controlQuality assurance & control Quality assurance Quality control - anticipates problems before they occur - responds to observed problems - uses all available information to generate improvements - uses ongoing measurements to make decisions on the processes or products - is not tied to a specific quality standard - requires a pre-specified quality standard for comparability - is applicable mostly at the planning stage - is applicable mostly at the processing stage - is all-encompassing in its activities - is a set procedure that is a subset of quality assurance
  • 9. 9 Quality controlQuality control Quality control is a regulatory procedure through which we:  measure quality  compare quality with pre-set standards  act on the differences The objective of quality control is to achieve a given quality level with minimum cost (ex. EQA sampling)
  • 10. 10 Dimensions of data qualityDimensions of data quality 1. Intrinsic data quality  accuracy (validity and reliability) 2. Contextual data quality  relevant  timely  complete 3. Representational data quality  interpretability, easy to understand 4. Accessibility data quality  accessibility, security
  • 11. 11 Intrinsic data qualityIntrinsic data quality ACCURACYACCURACY Exact conformity to the true value WHY IMPORTANT? Accurate data = precondition for accurate decisions!! Two concepts: validityvalidity and reliabilityreliability QUESTION: is this guaranteed?
  • 12. 12 ValidityValidity = the degree to which a measurement reflects the truth There should be no systematic error or bias What is a valid sputum result for an open TB case?What is a valid sputum result for an open TB case? A result is valid if it corresponds to the true value!A result is valid if it corresponds to the true value! Open TB case = sputum positive!!Open TB case = sputum positive!!
  • 13. 13 ReliabilityReliability The degree to which a measurement gives the same result:  each time it is used under the same condition  with the same subject A necessary but not sufficient condition for validity because one can make the same errors twice Reliability = repeatibility of measurementsReliability = repeatibility of measurements Reliability is inversely related toReliability is inversely related to random errorrandom error
  • 14. 14 Dimensions of data qualityDimensions of data quality 1. Intrinsic data quality  accuracy 2. Contextual data quality  relevant  timely  complete
  • 15. 15 RELEVANCERELEVANCE (usefulness)(usefulness) Reflects the degree to which information meets the real needs of clients. Is concerned with whether the available information sheds light on the issues that are important to users.
  • 16. 16 RELEVANCERELEVANCE A good information source should include all relevant content and exclude all irrelevant content. . Decision making for RNTCP management Relevant for what? . Assessing relevance is subjective and depends upon the varying needs of users!
  • 17. 17 TIMELINESSTIMELINESS Refers to the moment data are compiled, reported and analysed Given RNTCP’s normalization of the data reporting system, timeliness is not a major issue in India. But it could be an issue in remote areas and in PPM
  • 18. 18 COMPLETENESSCOMPLETENESS No missing data (records, items) All data fields that have to be filled up, should indeed contain data. QUESTION: does this presently happen??
  • 19. 19 Missing records • Annual report 2001 NTP Bangladesh Reports DOTS areas non DOTS areas ------------------------------------------------------- Received 2230 180 Missing 59 4 % missing 3% 2%
  • 20. 20 Dimensions of data qualityDimensions of data quality 1. Intrinsic data quality  accuracy 2. Contextual data quality  relevant  timely  complete 3. Representational data quality  interpretability, easy to understand
  • 21. 21 Representational data qualityRepresentational data quality Interpretability Data must be in appropriate language and units, and the data definitions must be clear to all (language, jargon, concepts) Ease of understanding Data must be clear, without ambiguity, and easily comprehended.
  • 22. 22 Dimensions of data qualityDimensions of data quality 1. Intrinsic data quality  accuracy 2. Contextual data quality  relevant  timely  complete 3. Representational data quality  interpretability, easy to understand 4. Accessibility data quality  accessibility, security
  • 23. 23 ACCESSIBILITYACCESSIBILITY Essential element of any data quality assessment.Essential element of any data quality assessment. If data is not accessible, then it has little or no valueIf data is not accessible, then it has little or no value..If data is not accessible, then it has little or no valueIf data is not accessible, then it has little or no value.. Accessibility = precondition for use, but no guarantee for use! Data items should be easily obtainable and legal to collect.Data items should be easily obtainable and legal to collect. In computer era, guidelines have to be established for whoIn computer era, guidelines have to be established for who may access which datamay access which data
  • 24. 24 SECURITYSECURITY The protection of data from: ☞unauthorized modification (accidental or intentional) ☞equipment malfunction (computer crash), ☞natural disasters (fire, tsunami..) and crime Be aware! Security threats are more serious when HMIS is computerized:  unauthorized access to data  damage to files (viruses…) Be aware! Security threats are more serious when HMIS is computerized:  unauthorized access to data  damage to files (viruses…)
  • 25. 25 Data management covers the whole process, starting from data recording to transcription, compilation, analysis & interpretation, reporting, feedback and use. TB CENTRE (OPD or lab) TB CENTRE (OPD or lab) TRANSCRIPTIONTRANSCRIPTIONRECORDINGRECORDING COMPILATIONCOMPILATION ANALYSIS & INTERPRETATION ANALYSIS & INTERPRETATION REPORTINGREPORTING FEEDBACK & USEFEEDBACK & USE
  • 26. 26 Where can errors occur?Where can errors occur? At each step, especially during: Data recording Manual data transcription Data compilation Data entry in computer Analysis Interpretation
  • 27. 27 Step in data flow Source of error Data recording Information not registered Wrong information (wrong address, etc ) Right information wrongly entered (in the wrong place) Missing records Data compilation Wrong counts Missed reports Duplicate counting Compterised data entry Wrong entry Partial entry Partial entry of records Template based computerised data analysis nil
  • 28. 28 Prevention of data errorsPrevention of data errors  clarity of the instructions  training and motivation of the staff  honesty of the staff  user-friendliness of the data supports, such as data forms and templates  supervision
  • 29. 29 Prevention of data errorsPrevention of data errors  computerized data handling :  improves the accuracy of the data  prevents processing and analysis errors  makes fudging less easy, once the data have been entered in the computer  use of independent double entry techniques (and checking of inconsistencies between the 2 entries)  data entry formatted to acceptable ranges and modalities only
  • 30. 30 How to proceed with the dataHow to proceed with the data verification?verification? 1. Be alert 2. Routine checking of data 3. Quarterly report checking
  • 31. 31 BE ALERT!BE ALERT!  Registers that look meticulously clean  All data entered with the same pen  Lack of variation / identically results every quarter  A too nice performance:  absence of initial defaulters  too low death rates  too high cure rates  absence of defaulting in IP … Be alert to the likelihood of intentional falsification of data!!! Do not accept data without checking their veracity!!!
  • 32. 32 How to proceed with the dataHow to proceed with the data verification?verification?  Routine checking of the data through supervision  Completeness checking  Consistency checking  Quarterly report checking  Range checking  Modality checking
  • 33. 33 Completeness checkingCompleteness checking Completeness of report = all data have been reported! A minimal completeness check verifies if all variables contain data. A minimal completeness check verifies if all variables contain data. Example: 200 NSP cases and age information only for 187 casesInformation is incomplete! How to solve? Verify via the original reports.
  • 34. 34 Consistency checkingConsistency checking Checks whether the values of data items are concordant Example: CAT III and Sputum+ How to check for inconsistencies?How to check for inconsistencies? By cross tabulation CAT Sputum result SP+ SP- CAT I 1162 114 CAT II 300 148 CAT III 16 103016 Contradiction
  • 35. 35 Range checkingRange checking Any method of detecting whether a quantitative variable is within an acceptable range Example 1: Height of an adult patient Acceptable range = 1.00 m to 2,00 m 3.00 m is impossible 0.98 is possible, but needs verification Example 2: Age of an adult patient Acceptable range =15 to 100 years 150 years is impossible Any “impossible” or “out of range” value should be verified via the original record or the patient. Any “impossible” or “out of range” value should be verified via the original record or the patient.
  • 36. 36 Modality checkingModality checking The data of a qualitative variable are classified in groups or modalities. Each data should belong to one modality only Example : Sex Two modalities: Male or Female Other values are impossible! “Not known” is sometimes entered but is not a valid modality and should be verified and corrected!
  • 37. 37 Correction of errorsCorrection of errors ERROR ERRORS ?? Go back to the original data source. But what if the original data source is erroneous? The best method is to go back to a previous step in the data flow, and verify patient records, lab records, etc.  If correct data found, then modify the erroneous data  If correct data not found, then report as “missing”.
  • 38. 38 Errors in dataErrors in data Risk for wrong decisionsRisk for wrong decisions Information has to be of good quality • correct data • correct data processing ValidValid ReliableReliable CompleteComplete ConsistentConsistent TimelyTimely
  • 39. 39 Erroneous dataErroneous data BadBad informationinformation WrongWrong decisionsdecisions Appropriate actions??Appropriate actions??
  • 40. 40 Don’t forget : there is more room for error than shown in this picture