SlideShare a Scribd company logo
ACCURACY
(AND OTHER VALIDATION MEASURES)
Adina L. Feldman, M.Sc.
Karolinska Institutet
Department of Medical Epidemiology and Biostatistics
e-mail: adina.feldman@ki.se
tel. 08 5248 2313

24 October 2013

Adina L. Feldman

1
SYSTEMATIC ERROR

24 October 2013

Low

High

High

RANDOM ERROR

Low

Adina L. Feldman

2
Validity
 Accuracy is a type of systematic error (potential bias)
 (Random error/precision is related to power, e.g. size of study sample)

 Validity is what we call the certainty (accuracy) of a proxy measure/test
 Why is knowing the validity of a measure important?
 Consider these examples:
 What is the validity of breast cancer screening (mammography)?
 What is the validity of home pregnancy tests?
 What is the validity of self-reported height? …weight?
 What is the validity of register-based Parkinson’s disease diagnoses?

24 October 2013

Adina L. Feldman

3
24 October 2013

Adina L. Feldman

4
24 October 2013

Adina L. Feldman

5
24 October 2013

Adina L. Feldman

6
24 October 2013

Adina L. Feldman

7
Gold Standard
 = The best possible available measure agianst which the measure under study is
validated

 Discuss: What gold standard was used in these validations?
 Breast cancer screening (mammography)?
 Home pregnancy tests?
 Self-reported height? …weight?
 Register-based Parkinson’s disease diagnoses?

24 October 2013

Adina L. Feldman

8
Gold Standard
Binary

Continuous

24 October 2013

Binary

 Breast cancer screening

(mammography)?
 Home pregnancy tests?

Contiuous

Test measure

Discuss: Where do these validations fit in?

 Self-reported height? …weight?
 Register-based Parkinson’s disease

diagnoses?

Adina L. Feldman

9
Gold Standard

24 October 2013

Binary
Contiuous

Test measure

Binary

Reg PDx

Continuous

X

Discuss: Where do these validations fit in?
 Breast cancer screening

(mammography)?
 Home pregnancy tests?
 Self-reported height? …weight?

Preg test
BC screening

Height
Weight

 Register-based Parkinson’s disease

diagnoses?

Adina L. Feldman

10
Gold Standard

24 October 2013

Binary
Contiuous

Test measure

Binary
Sensitivity,
Specificity,
etc.

ROC-curves

Continuous
Different validation methods are used for
different types of validation studies!

X

 These are covered (or at least
mentioned) today

Correlations,
BlandAltman plots

Adina L. Feldman

11
Outcome measure

Gold Standard
Positive
+
Positive
+

Negative
-

True Positive
(TP)

False Positive
(FP)

Positive
Predictive Value
(PPV)

=TP/
(TP+FP)

True Negative
(TN)

Negative
Predictive Value
(NPV)

=TN/
(TN+FN)

Negative False Negative
(FN)
-

Sensitivity

Specificity

=TP/
(TP+FN)
24 October 2013

Validity measures for
binary outcomes
(Print and pin to your
office wall!)

=TN/
(TN+FP)
Adina L. Feldman

12
Outcome measure

Gold Standard
Positive
+
Positive
+

Negative
-

True Positive
(TP)

False Positive
(FP)

Positive
Predictive Value
(PPV)

=TP/
(TP+FP)

True Negative
(TN)

Negative
Predictive Value
(NPV)

=TN/
(TN+FN)

Negative False Negative
(FN)
-

Sensitivity
=TP/
(TP+FN)
24 October 2013

Specificity
=TN/
(TN+FP)
Adina L. Feldman

13
Outcome measure

Gold Standard
Positive
+

Negative
-

These are less commonly
used measures, but
still good to know

24 October 2013

False Positive
(FP)

=FP/
(TP+FP)

True Negative
(TN)

False Negative
Rate (FNR),
cNPV
(=1-NPV)

=FN/
(TN+FN)

True Positive
Rate

FPR (OBS!!)
(=1-Spec.)

Accuracy

=Sens.

Positive
+

False Positive
Rate (FPR),
cPPV
(=1-PPV)

=FP/
(FP+TN)

=TP+TN/
(TP+TN+FP+FN)

True Positive
(TP)

Negative False Negative
(FN)
-

Adina L. Feldman

14
Misclassification
 FN and FP are misclassifications
 Consider cause of misclassification
 FN: Why are some cases not detected?
 FP: Why are some noncases given erroneous diagnoses?

 Differential misclassification:
 Non-random distribution of TP and FN (with regards to the exposure)

24 October 2013

Adina L. Feldman

15
Misclassification
 Discuss: What could be the cause of FP and FN in these validations?
What could be the consequences of misclassification here?
 Breast cancer screening (mammography)?
 Home pregnancy tests?
 Self-reported height? …weight?
 Register-based Parkinson’s disease diagnoses?

24 October 2013

Adina L. Feldman

16
 Fictional Example 1
 Cohort study of 10,000 participants (random population-based sample)
 Binary proxy measure

e.g. self-reported myocardial infarction (”heart attack”) ever/never
 Binary Gold Standard

e.g. myocardial infarction confirmed according to best clinical practice

24 October 2013

Adina L. Feldman

17
Gold Standard

Example 1

Negative
-

Positive
+

90

5

PPV?

Negative
-

10

9,895

NPV?

Sens.?

Outcome measure

Positive
+

Spec.?

GS prev.?
OM prev.?

24 October 2013

Adina L. Feldman

18
Gold Standard

Example 1
↓prevalence
↑PPV
↑Sens.

Positive
+

90

5

PPV

94.7%

Negative
-

10

9,895

NPV

100%
(99.899%)

Spec.

GS prev.

1.0%

90.0%

Outcome measure

Negative
-

Sens.

24 October 2013

Positive
+

100%
(99.95%)

OM prev.

0.95%

Adina L. Feldman

19
 Fictional Example 2
 Cohort study of 10,000 participants (random population-based sample)
 Binary proxy measure

e.g. self-reported influenza during one winter season yes/no
 Binary Gold Standard

e.g. laboratory-confirmed infection with influenza virus

24 October 2013

Adina L. Feldman

20
Gold Standard

Example 2

Negative
-

Positive
+

1950

1400

PPV?

Negative
-

50

6600

NPV?

Sens.?

Outcome measure

Positive
+

Spec.?

GS prev.?
OM prev.?

24 October 2013

Adina L. Feldman

21
Gold Standard

Example 2
↑prevalence
↓PPV
↑Sens.

Positive
+

1950

1400

PPV

58.2%

Negative
-

50

6600

NPV

99.2%

Spec.

GS prev.

20.0%

97.5%

Outcome measure

Negative
-

Sens.

24 October 2013

Positive
+

82.5%

OM prev.

33.5%

Adina L. Feldman

22
Discussion points
 Many validation study have only available either:
 Only Gold Standard positive cases
 Only proxy outcome positive cases

 What validity measures can be calculated in each instance?

 Two-phase screening is a very common approach to diagnosing disease,
e.g. Breast cancer (mammography followed by ultrasound, cytology)
 What type of validity is most important in each phase?

24 October 2013

Adina L. Feldman

23
24 October 2013

Adina L. Feldman

24
24 October 2013

Adina L. Feldman

25
Gold Standard

24 October 2013

Binary
Contiuous

Test measure

Binary
Sensitivity,
Specificity,
etc.

ROC-curves

Continuous
Different validation methods are used for
different types of validation studies!

X

 These are covered (or at least
mentioned) today

Correlations,
BlandAltman plots

Adina L. Feldman

26
Measures with discrimination threshold for binary outcomes

Frequency of cases

GS-

GS+

E.g. biomarker concentration in blood
24 October 2013

Adina L. Feldman

27
Measures with discrimination threshold for binary outcomes

Frequency of cases

GS-

GS+

TN

TP
FN FP

E.g. biomarker concentration in blood
24 October 2013

Adina L. Feldman

28
24 October 2013

Adina L. Feldman

29
 Gold Standard:

Reduced insulin sensitivity based
on established clinical index
cutoff
 Proxy test:

Appendicular lean body mass
(LBM) index (kg/m2)

 The threshold for LBM is varied

and for each step the sensitivity
and 1-specificity for the GS are
calculated and plotted

 The goal is to determine the

optimal threshold for LBM in
predicting reduced insulin
sensitivity

 AUC = Area Under the Curve (%)

(Bigger = Better)
24 October 2013

Adina L. Feldman

30
24 October 2013

Adina L. Feldman

31
24 October 2013

Adina L. Feldman

32
Gold Standard

24 October 2013

Binary
Contiuous

Test measure

Binary
Sensitivity,
Specificity,
etc.

ROC-curves

Continuous
Different validation methods are used for
different types of validation studies!

X

 These are covered (or at least
mentioned) today

Correlations,
BlandAltman plots

Adina L. Feldman

33
24 October 2013

Adina L. Feldman

34
24 October 2013

Adina L. Feldman

35
?

24 October 2013

Adina L. Feldman

36
24 October 2013

Adina L. Feldman

37
24 October 2013

Adina L. Feldman

38
24 October 2013

Adina L. Feldman

39
Pearson correlation coefficient overall = 0.61

24 October 2013

Adina L. Feldman

40
Afternoon group excercise:
Ad hoc study of the validity of self-reported height
 Define
 Gold Standard
 Method of ascertainment of self-reported height

 Collect data
 Proxy
 Gold Standard

 Using Excel
 Plot correlation (scatter plot)
 Brand-Altman plot

 Draw conclusion

24 October 2013

Adina L. Feldman

41
Thank You!
(See you this afternoon)
Welcome to my PhD dissertation defence
10 Januari 2014, at 9 am in Andreas Vesalius,
Karolinska Institutet Campus Solna
Dissertation title:
”If I Only Had a Brain
– Epidemiological Studies of Parkinson’s Disease”
24 October 2013

Adina L. Feldman

43

More Related Content

Similar to Accuracy lecture 131024

Validity and realibility.pptx
Validity and realibility.pptxValidity and realibility.pptx
Validity and realibility.pptx
briankash1
 
Screening and diagnostic testing
Screening and diagnostic  testingScreening and diagnostic  testing
Screening and diagnostic testing
amitakashyap1
 
Critical appraisal of diagnostic studies
Critical appraisal of diagnostic studiesCritical appraisal of diagnostic studies
Critical appraisal of diagnostic studies
Samir Haffar
 
Screening of diseases
Screening of diseasesScreening of diseases
Screening of diseases
PrateekGoyal67
 
Diagnostic testing 2009
Diagnostic testing 2009Diagnostic testing 2009
Diagnostic testing 2009
coolboy101pk
 
BIOSTATISTICS
BIOSTATISTICSBIOSTATISTICS
Dr swe swe latt screening for slideshare
Dr swe swe latt screening for slideshareDr swe swe latt screening for slideshare
Dr swe swe latt screening for slideshare
International Islamic University Malaysia
 
Disease screening
Disease screeningDisease screening
Disease screening
Amandeep Kaur
 
Case control studies..skp
Case control studies..skpCase control studies..skp
Case control studies..skp
sudhiramkcg
 
Describing the performance of a diagnostic test
Describing the performance of a diagnostic testDescribing the performance of a diagnostic test
Describing the performance of a diagnostic test
Amany El-seoud
 
Epidemiological Approaches for Evaluation of diagnostic tests.pptx
Epidemiological Approaches for Evaluation of diagnostic tests.pptxEpidemiological Approaches for Evaluation of diagnostic tests.pptx
Epidemiological Approaches for Evaluation of diagnostic tests.pptx
Bhoj Raj Singh
 
Evaluating a diagnostic test presentation www.eyenirvaan.com - part 1
Evaluating a diagnostic test presentation www.eyenirvaan.com - part 1Evaluating a diagnostic test presentation www.eyenirvaan.com - part 1
Evaluating a diagnostic test presentation www.eyenirvaan.com - part 1
Eyenirvaan
 
INFERENTIAL STATISTICS.pdf
INFERENTIAL STATISTICS.pdfINFERENTIAL STATISTICS.pdf
INFERENTIAL STATISTICS.pdf
Mandar Baviskar
 
CAT 1 -MPH 5101 - FOUNDATIONS OF EPIDEMIOLOGY (1).pptx
CAT 1 -MPH 5101 -  FOUNDATIONS OF EPIDEMIOLOGY  (1).pptxCAT 1 -MPH 5101 -  FOUNDATIONS OF EPIDEMIOLOGY  (1).pptx
CAT 1 -MPH 5101 - FOUNDATIONS OF EPIDEMIOLOGY (1).pptx
Shafici Almis
 
sta
stasta
Epcm l18-19 assessing tests
Epcm  l18-19 assessing testsEpcm  l18-19 assessing tests
Epcm l18-19 assessing tests
Dr Ghaiath Hussein
 
Overview of different statistical tests used in epidemiological
Overview of different  statistical tests used in epidemiologicalOverview of different  statistical tests used in epidemiological
Overview of different statistical tests used in epidemiological
shefali jain
 
Diagnostic test
Diagnostic test Diagnostic test
Diagnostic test
Zafar Equebal
 
El espionaje de datos y la falacia de las pruebas múltiples
El espionaje de datos y la falacia de las pruebas múltiplesEl espionaje de datos y la falacia de las pruebas múltiples
El espionaje de datos y la falacia de las pruebas múltiples
afgallegos1997
 
Variables confounding
Variables confoundingVariables confounding
Variables confounding
DRSPRAO
 

Similar to Accuracy lecture 131024 (20)

Validity and realibility.pptx
Validity and realibility.pptxValidity and realibility.pptx
Validity and realibility.pptx
 
Screening and diagnostic testing
Screening and diagnostic  testingScreening and diagnostic  testing
Screening and diagnostic testing
 
Critical appraisal of diagnostic studies
Critical appraisal of diagnostic studiesCritical appraisal of diagnostic studies
Critical appraisal of diagnostic studies
 
Screening of diseases
Screening of diseasesScreening of diseases
Screening of diseases
 
Diagnostic testing 2009
Diagnostic testing 2009Diagnostic testing 2009
Diagnostic testing 2009
 
BIOSTATISTICS
BIOSTATISTICSBIOSTATISTICS
BIOSTATISTICS
 
Dr swe swe latt screening for slideshare
Dr swe swe latt screening for slideshareDr swe swe latt screening for slideshare
Dr swe swe latt screening for slideshare
 
Disease screening
Disease screeningDisease screening
Disease screening
 
Case control studies..skp
Case control studies..skpCase control studies..skp
Case control studies..skp
 
Describing the performance of a diagnostic test
Describing the performance of a diagnostic testDescribing the performance of a diagnostic test
Describing the performance of a diagnostic test
 
Epidemiological Approaches for Evaluation of diagnostic tests.pptx
Epidemiological Approaches for Evaluation of diagnostic tests.pptxEpidemiological Approaches for Evaluation of diagnostic tests.pptx
Epidemiological Approaches for Evaluation of diagnostic tests.pptx
 
Evaluating a diagnostic test presentation www.eyenirvaan.com - part 1
Evaluating a diagnostic test presentation www.eyenirvaan.com - part 1Evaluating a diagnostic test presentation www.eyenirvaan.com - part 1
Evaluating a diagnostic test presentation www.eyenirvaan.com - part 1
 
INFERENTIAL STATISTICS.pdf
INFERENTIAL STATISTICS.pdfINFERENTIAL STATISTICS.pdf
INFERENTIAL STATISTICS.pdf
 
CAT 1 -MPH 5101 - FOUNDATIONS OF EPIDEMIOLOGY (1).pptx
CAT 1 -MPH 5101 -  FOUNDATIONS OF EPIDEMIOLOGY  (1).pptxCAT 1 -MPH 5101 -  FOUNDATIONS OF EPIDEMIOLOGY  (1).pptx
CAT 1 -MPH 5101 - FOUNDATIONS OF EPIDEMIOLOGY (1).pptx
 
sta
stasta
sta
 
Epcm l18-19 assessing tests
Epcm  l18-19 assessing testsEpcm  l18-19 assessing tests
Epcm l18-19 assessing tests
 
Overview of different statistical tests used in epidemiological
Overview of different  statistical tests used in epidemiologicalOverview of different  statistical tests used in epidemiological
Overview of different statistical tests used in epidemiological
 
Diagnostic test
Diagnostic test Diagnostic test
Diagnostic test
 
El espionaje de datos y la falacia de las pruebas múltiples
El espionaje de datos y la falacia de las pruebas múltiplesEl espionaje de datos y la falacia de las pruebas múltiples
El espionaje de datos y la falacia de las pruebas múltiples
 
Variables confounding
Variables confoundingVariables confounding
Variables confounding
 

Recently uploaded

Tests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptxTests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptx
taiba qazi
 
Efficacy of Avartana Sneha in Ayurveda
Efficacy of Avartana Sneha in AyurvedaEfficacy of Avartana Sneha in Ayurveda
Efficacy of Avartana Sneha in Ayurveda
Dr. Jyothirmai Paindla
 
Aortic Association CBL Pilot April 19 – 20 Bern
Aortic Association CBL Pilot April 19 – 20 BernAortic Association CBL Pilot April 19 – 20 Bern
Aortic Association CBL Pilot April 19 – 20 Bern
suvadeepdas911
 
Journal Article Review on Rasamanikya
Journal Article Review on RasamanikyaJournal Article Review on Rasamanikya
Journal Article Review on Rasamanikya
Dr. Jyothirmai Paindla
 
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
rishi2789
 
OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1
KafrELShiekh University
 
Complementary feeding in infant IAP PROTOCOLS
Complementary feeding in infant IAP PROTOCOLSComplementary feeding in infant IAP PROTOCOLS
Complementary feeding in infant IAP PROTOCOLS
chiranthgowda16
 
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
rightmanforbloodline
 
Chapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptxChapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptx
Earlene McNair
 
Diabetic nephropathy diagnosis treatment
Diabetic nephropathy diagnosis treatmentDiabetic nephropathy diagnosis treatment
Diabetic nephropathy diagnosis treatment
arahmanzai5
 
Ketone bodies and metabolism-biochemistry
Ketone bodies and metabolism-biochemistryKetone bodies and metabolism-biochemistry
Ketone bodies and metabolism-biochemistry
Dhayanithi C
 
The Electrocardiogram - Physiologic Principles
The Electrocardiogram - Physiologic PrinciplesThe Electrocardiogram - Physiologic Principles
The Electrocardiogram - Physiologic Principles
MedicoseAcademics
 
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptxREGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
LaniyaNasrink
 
Post-Menstrual Smell- When to Suspect Vaginitis.pptx
Post-Menstrual Smell- When to Suspect Vaginitis.pptxPost-Menstrual Smell- When to Suspect Vaginitis.pptx
Post-Menstrual Smell- When to Suspect Vaginitis.pptx
FFragrant
 
Cell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune DiseaseCell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune Disease
Health Advances
 
CBL Seminar 2024_Preliminary Program.pdf
CBL Seminar 2024_Preliminary Program.pdfCBL Seminar 2024_Preliminary Program.pdf
CBL Seminar 2024_Preliminary Program.pdf
suvadeepdas911
 
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
Donc Test
 
Osteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdfOsteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdf
Jim Jacob Roy
 
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
Holistified Wellness
 
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa CentralClinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
19various
 

Recently uploaded (20)

Tests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptxTests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptx
 
Efficacy of Avartana Sneha in Ayurveda
Efficacy of Avartana Sneha in AyurvedaEfficacy of Avartana Sneha in Ayurveda
Efficacy of Avartana Sneha in Ayurveda
 
Aortic Association CBL Pilot April 19 – 20 Bern
Aortic Association CBL Pilot April 19 – 20 BernAortic Association CBL Pilot April 19 – 20 Bern
Aortic Association CBL Pilot April 19 – 20 Bern
 
Journal Article Review on Rasamanikya
Journal Article Review on RasamanikyaJournal Article Review on Rasamanikya
Journal Article Review on Rasamanikya
 
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
CHEMOTHERAPY_RDP_CHAPTER 2 _LEPROSY.pdf1
 
OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1OCT Training Course for clinical practice Part 1
OCT Training Course for clinical practice Part 1
 
Complementary feeding in infant IAP PROTOCOLS
Complementary feeding in infant IAP PROTOCOLSComplementary feeding in infant IAP PROTOCOLS
Complementary feeding in infant IAP PROTOCOLS
 
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
 
Chapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptxChapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptx
 
Diabetic nephropathy diagnosis treatment
Diabetic nephropathy diagnosis treatmentDiabetic nephropathy diagnosis treatment
Diabetic nephropathy diagnosis treatment
 
Ketone bodies and metabolism-biochemistry
Ketone bodies and metabolism-biochemistryKetone bodies and metabolism-biochemistry
Ketone bodies and metabolism-biochemistry
 
The Electrocardiogram - Physiologic Principles
The Electrocardiogram - Physiologic PrinciplesThe Electrocardiogram - Physiologic Principles
The Electrocardiogram - Physiologic Principles
 
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptxREGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
 
Post-Menstrual Smell- When to Suspect Vaginitis.pptx
Post-Menstrual Smell- When to Suspect Vaginitis.pptxPost-Menstrual Smell- When to Suspect Vaginitis.pptx
Post-Menstrual Smell- When to Suspect Vaginitis.pptx
 
Cell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune DiseaseCell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune Disease
 
CBL Seminar 2024_Preliminary Program.pdf
CBL Seminar 2024_Preliminary Program.pdfCBL Seminar 2024_Preliminary Program.pdf
CBL Seminar 2024_Preliminary Program.pdf
 
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
 
Osteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdfOsteoporosis - Definition , Evaluation and Management .pdf
Osteoporosis - Definition , Evaluation and Management .pdf
 
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
 
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa CentralClinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
 

Accuracy lecture 131024

  • 1. ACCURACY (AND OTHER VALIDATION MEASURES) Adina L. Feldman, M.Sc. Karolinska Institutet Department of Medical Epidemiology and Biostatistics e-mail: adina.feldman@ki.se tel. 08 5248 2313 24 October 2013 Adina L. Feldman 1
  • 2. SYSTEMATIC ERROR 24 October 2013 Low High High RANDOM ERROR Low Adina L. Feldman 2
  • 3. Validity  Accuracy is a type of systematic error (potential bias)  (Random error/precision is related to power, e.g. size of study sample)  Validity is what we call the certainty (accuracy) of a proxy measure/test  Why is knowing the validity of a measure important?  Consider these examples:  What is the validity of breast cancer screening (mammography)?  What is the validity of home pregnancy tests?  What is the validity of self-reported height? …weight?  What is the validity of register-based Parkinson’s disease diagnoses? 24 October 2013 Adina L. Feldman 3
  • 4. 24 October 2013 Adina L. Feldman 4
  • 5. 24 October 2013 Adina L. Feldman 5
  • 6. 24 October 2013 Adina L. Feldman 6
  • 7. 24 October 2013 Adina L. Feldman 7
  • 8. Gold Standard  = The best possible available measure agianst which the measure under study is validated  Discuss: What gold standard was used in these validations?  Breast cancer screening (mammography)?  Home pregnancy tests?  Self-reported height? …weight?  Register-based Parkinson’s disease diagnoses? 24 October 2013 Adina L. Feldman 8
  • 9. Gold Standard Binary Continuous 24 October 2013 Binary  Breast cancer screening (mammography)?  Home pregnancy tests? Contiuous Test measure Discuss: Where do these validations fit in?  Self-reported height? …weight?  Register-based Parkinson’s disease diagnoses? Adina L. Feldman 9
  • 10. Gold Standard 24 October 2013 Binary Contiuous Test measure Binary Reg PDx Continuous X Discuss: Where do these validations fit in?  Breast cancer screening (mammography)?  Home pregnancy tests?  Self-reported height? …weight? Preg test BC screening Height Weight  Register-based Parkinson’s disease diagnoses? Adina L. Feldman 10
  • 11. Gold Standard 24 October 2013 Binary Contiuous Test measure Binary Sensitivity, Specificity, etc. ROC-curves Continuous Different validation methods are used for different types of validation studies! X  These are covered (or at least mentioned) today Correlations, BlandAltman plots Adina L. Feldman 11
  • 12. Outcome measure Gold Standard Positive + Positive + Negative - True Positive (TP) False Positive (FP) Positive Predictive Value (PPV) =TP/ (TP+FP) True Negative (TN) Negative Predictive Value (NPV) =TN/ (TN+FN) Negative False Negative (FN) - Sensitivity Specificity =TP/ (TP+FN) 24 October 2013 Validity measures for binary outcomes (Print and pin to your office wall!) =TN/ (TN+FP) Adina L. Feldman 12
  • 13. Outcome measure Gold Standard Positive + Positive + Negative - True Positive (TP) False Positive (FP) Positive Predictive Value (PPV) =TP/ (TP+FP) True Negative (TN) Negative Predictive Value (NPV) =TN/ (TN+FN) Negative False Negative (FN) - Sensitivity =TP/ (TP+FN) 24 October 2013 Specificity =TN/ (TN+FP) Adina L. Feldman 13
  • 14. Outcome measure Gold Standard Positive + Negative - These are less commonly used measures, but still good to know 24 October 2013 False Positive (FP) =FP/ (TP+FP) True Negative (TN) False Negative Rate (FNR), cNPV (=1-NPV) =FN/ (TN+FN) True Positive Rate FPR (OBS!!) (=1-Spec.) Accuracy =Sens. Positive + False Positive Rate (FPR), cPPV (=1-PPV) =FP/ (FP+TN) =TP+TN/ (TP+TN+FP+FN) True Positive (TP) Negative False Negative (FN) - Adina L. Feldman 14
  • 15. Misclassification  FN and FP are misclassifications  Consider cause of misclassification  FN: Why are some cases not detected?  FP: Why are some noncases given erroneous diagnoses?  Differential misclassification:  Non-random distribution of TP and FN (with regards to the exposure) 24 October 2013 Adina L. Feldman 15
  • 16. Misclassification  Discuss: What could be the cause of FP and FN in these validations? What could be the consequences of misclassification here?  Breast cancer screening (mammography)?  Home pregnancy tests?  Self-reported height? …weight?  Register-based Parkinson’s disease diagnoses? 24 October 2013 Adina L. Feldman 16
  • 17.  Fictional Example 1  Cohort study of 10,000 participants (random population-based sample)  Binary proxy measure e.g. self-reported myocardial infarction (”heart attack”) ever/never  Binary Gold Standard e.g. myocardial infarction confirmed according to best clinical practice 24 October 2013 Adina L. Feldman 17
  • 18. Gold Standard Example 1 Negative - Positive + 90 5 PPV? Negative - 10 9,895 NPV? Sens.? Outcome measure Positive + Spec.? GS prev.? OM prev.? 24 October 2013 Adina L. Feldman 18
  • 19. Gold Standard Example 1 ↓prevalence ↑PPV ↑Sens. Positive + 90 5 PPV 94.7% Negative - 10 9,895 NPV 100% (99.899%) Spec. GS prev. 1.0% 90.0% Outcome measure Negative - Sens. 24 October 2013 Positive + 100% (99.95%) OM prev. 0.95% Adina L. Feldman 19
  • 20.  Fictional Example 2  Cohort study of 10,000 participants (random population-based sample)  Binary proxy measure e.g. self-reported influenza during one winter season yes/no  Binary Gold Standard e.g. laboratory-confirmed infection with influenza virus 24 October 2013 Adina L. Feldman 20
  • 21. Gold Standard Example 2 Negative - Positive + 1950 1400 PPV? Negative - 50 6600 NPV? Sens.? Outcome measure Positive + Spec.? GS prev.? OM prev.? 24 October 2013 Adina L. Feldman 21
  • 22. Gold Standard Example 2 ↑prevalence ↓PPV ↑Sens. Positive + 1950 1400 PPV 58.2% Negative - 50 6600 NPV 99.2% Spec. GS prev. 20.0% 97.5% Outcome measure Negative - Sens. 24 October 2013 Positive + 82.5% OM prev. 33.5% Adina L. Feldman 22
  • 23. Discussion points  Many validation study have only available either:  Only Gold Standard positive cases  Only proxy outcome positive cases  What validity measures can be calculated in each instance?  Two-phase screening is a very common approach to diagnosing disease, e.g. Breast cancer (mammography followed by ultrasound, cytology)  What type of validity is most important in each phase? 24 October 2013 Adina L. Feldman 23
  • 24. 24 October 2013 Adina L. Feldman 24
  • 25. 24 October 2013 Adina L. Feldman 25
  • 26. Gold Standard 24 October 2013 Binary Contiuous Test measure Binary Sensitivity, Specificity, etc. ROC-curves Continuous Different validation methods are used for different types of validation studies! X  These are covered (or at least mentioned) today Correlations, BlandAltman plots Adina L. Feldman 26
  • 27. Measures with discrimination threshold for binary outcomes Frequency of cases GS- GS+ E.g. biomarker concentration in blood 24 October 2013 Adina L. Feldman 27
  • 28. Measures with discrimination threshold for binary outcomes Frequency of cases GS- GS+ TN TP FN FP E.g. biomarker concentration in blood 24 October 2013 Adina L. Feldman 28
  • 29. 24 October 2013 Adina L. Feldman 29
  • 30.  Gold Standard: Reduced insulin sensitivity based on established clinical index cutoff  Proxy test: Appendicular lean body mass (LBM) index (kg/m2)  The threshold for LBM is varied and for each step the sensitivity and 1-specificity for the GS are calculated and plotted  The goal is to determine the optimal threshold for LBM in predicting reduced insulin sensitivity  AUC = Area Under the Curve (%) (Bigger = Better) 24 October 2013 Adina L. Feldman 30
  • 31. 24 October 2013 Adina L. Feldman 31
  • 32. 24 October 2013 Adina L. Feldman 32
  • 33. Gold Standard 24 October 2013 Binary Contiuous Test measure Binary Sensitivity, Specificity, etc. ROC-curves Continuous Different validation methods are used for different types of validation studies! X  These are covered (or at least mentioned) today Correlations, BlandAltman plots Adina L. Feldman 33
  • 34. 24 October 2013 Adina L. Feldman 34
  • 35. 24 October 2013 Adina L. Feldman 35
  • 36. ? 24 October 2013 Adina L. Feldman 36
  • 37. 24 October 2013 Adina L. Feldman 37
  • 38. 24 October 2013 Adina L. Feldman 38
  • 39. 24 October 2013 Adina L. Feldman 39
  • 40. Pearson correlation coefficient overall = 0.61 24 October 2013 Adina L. Feldman 40
  • 41. Afternoon group excercise: Ad hoc study of the validity of self-reported height  Define  Gold Standard  Method of ascertainment of self-reported height  Collect data  Proxy  Gold Standard  Using Excel  Plot correlation (scatter plot)  Brand-Altman plot  Draw conclusion 24 October 2013 Adina L. Feldman 41
  • 42. Thank You! (See you this afternoon) Welcome to my PhD dissertation defence 10 Januari 2014, at 9 am in Andreas Vesalius, Karolinska Institutet Campus Solna Dissertation title: ”If I Only Had a Brain – Epidemiological Studies of Parkinson’s Disease” 24 October 2013 Adina L. Feldman 43