Lecture on validation measures in epidemiology for master students in publiv health and epidemiology at Karolinska Institutet in Stockholm, Sweden on 24 October 2013.
Young & Hot Surat ℂall Girls Vesu 8527049040 WhatsApp AnyTime Best Surat ℂall...
Accuracy lecture 131024
1. ACCURACY
(AND OTHER VALIDATION MEASURES)
Adina L. Feldman, M.Sc.
Karolinska Institutet
Department of Medical Epidemiology and Biostatistics
e-mail: adina.feldman@ki.se
tel. 08 5248 2313
24 October 2013
Adina L. Feldman
1
3. Validity
Accuracy is a type of systematic error (potential bias)
(Random error/precision is related to power, e.g. size of study sample)
Validity is what we call the certainty (accuracy) of a proxy measure/test
Why is knowing the validity of a measure important?
Consider these examples:
What is the validity of breast cancer screening (mammography)?
What is the validity of home pregnancy tests?
What is the validity of self-reported height? …weight?
What is the validity of register-based Parkinson’s disease diagnoses?
24 October 2013
Adina L. Feldman
3
8. Gold Standard
= The best possible available measure agianst which the measure under study is
validated
Discuss: What gold standard was used in these validations?
Breast cancer screening (mammography)?
Home pregnancy tests?
Self-reported height? …weight?
Register-based Parkinson’s disease diagnoses?
24 October 2013
Adina L. Feldman
8
9. Gold Standard
Binary
Continuous
24 October 2013
Binary
Breast cancer screening
(mammography)?
Home pregnancy tests?
Contiuous
Test measure
Discuss: Where do these validations fit in?
Self-reported height? …weight?
Register-based Parkinson’s disease
diagnoses?
Adina L. Feldman
9
10. Gold Standard
24 October 2013
Binary
Contiuous
Test measure
Binary
Reg PDx
Continuous
X
Discuss: Where do these validations fit in?
Breast cancer screening
(mammography)?
Home pregnancy tests?
Self-reported height? …weight?
Preg test
BC screening
Height
Weight
Register-based Parkinson’s disease
diagnoses?
Adina L. Feldman
10
11. Gold Standard
24 October 2013
Binary
Contiuous
Test measure
Binary
Sensitivity,
Specificity,
etc.
ROC-curves
Continuous
Different validation methods are used for
different types of validation studies!
X
These are covered (or at least
mentioned) today
Correlations,
BlandAltman plots
Adina L. Feldman
11
12. Outcome measure
Gold Standard
Positive
+
Positive
+
Negative
-
True Positive
(TP)
False Positive
(FP)
Positive
Predictive Value
(PPV)
=TP/
(TP+FP)
True Negative
(TN)
Negative
Predictive Value
(NPV)
=TN/
(TN+FN)
Negative False Negative
(FN)
-
Sensitivity
Specificity
=TP/
(TP+FN)
24 October 2013
Validity measures for
binary outcomes
(Print and pin to your
office wall!)
=TN/
(TN+FP)
Adina L. Feldman
12
13. Outcome measure
Gold Standard
Positive
+
Positive
+
Negative
-
True Positive
(TP)
False Positive
(FP)
Positive
Predictive Value
(PPV)
=TP/
(TP+FP)
True Negative
(TN)
Negative
Predictive Value
(NPV)
=TN/
(TN+FN)
Negative False Negative
(FN)
-
Sensitivity
=TP/
(TP+FN)
24 October 2013
Specificity
=TN/
(TN+FP)
Adina L. Feldman
13
14. Outcome measure
Gold Standard
Positive
+
Negative
-
These are less commonly
used measures, but
still good to know
24 October 2013
False Positive
(FP)
=FP/
(TP+FP)
True Negative
(TN)
False Negative
Rate (FNR),
cNPV
(=1-NPV)
=FN/
(TN+FN)
True Positive
Rate
FPR (OBS!!)
(=1-Spec.)
Accuracy
=Sens.
Positive
+
False Positive
Rate (FPR),
cPPV
(=1-PPV)
=FP/
(FP+TN)
=TP+TN/
(TP+TN+FP+FN)
True Positive
(TP)
Negative False Negative
(FN)
-
Adina L. Feldman
14
15. Misclassification
FN and FP are misclassifications
Consider cause of misclassification
FN: Why are some cases not detected?
FP: Why are some noncases given erroneous diagnoses?
Differential misclassification:
Non-random distribution of TP and FN (with regards to the exposure)
24 October 2013
Adina L. Feldman
15
16. Misclassification
Discuss: What could be the cause of FP and FN in these validations?
What could be the consequences of misclassification here?
Breast cancer screening (mammography)?
Home pregnancy tests?
Self-reported height? …weight?
Register-based Parkinson’s disease diagnoses?
24 October 2013
Adina L. Feldman
16
17. Fictional Example 1
Cohort study of 10,000 participants (random population-based sample)
Binary proxy measure
e.g. self-reported myocardial infarction (”heart attack”) ever/never
Binary Gold Standard
e.g. myocardial infarction confirmed according to best clinical practice
24 October 2013
Adina L. Feldman
17
20. Fictional Example 2
Cohort study of 10,000 participants (random population-based sample)
Binary proxy measure
e.g. self-reported influenza during one winter season yes/no
Binary Gold Standard
e.g. laboratory-confirmed infection with influenza virus
24 October 2013
Adina L. Feldman
20
23. Discussion points
Many validation study have only available either:
Only Gold Standard positive cases
Only proxy outcome positive cases
What validity measures can be calculated in each instance?
Two-phase screening is a very common approach to diagnosing disease,
e.g. Breast cancer (mammography followed by ultrasound, cytology)
What type of validity is most important in each phase?
24 October 2013
Adina L. Feldman
23
26. Gold Standard
24 October 2013
Binary
Contiuous
Test measure
Binary
Sensitivity,
Specificity,
etc.
ROC-curves
Continuous
Different validation methods are used for
different types of validation studies!
X
These are covered (or at least
mentioned) today
Correlations,
BlandAltman plots
Adina L. Feldman
26
27. Measures with discrimination threshold for binary outcomes
Frequency of cases
GS-
GS+
E.g. biomarker concentration in blood
24 October 2013
Adina L. Feldman
27
28. Measures with discrimination threshold for binary outcomes
Frequency of cases
GS-
GS+
TN
TP
FN FP
E.g. biomarker concentration in blood
24 October 2013
Adina L. Feldman
28
30. Gold Standard:
Reduced insulin sensitivity based
on established clinical index
cutoff
Proxy test:
Appendicular lean body mass
(LBM) index (kg/m2)
The threshold for LBM is varied
and for each step the sensitivity
and 1-specificity for the GS are
calculated and plotted
The goal is to determine the
optimal threshold for LBM in
predicting reduced insulin
sensitivity
AUC = Area Under the Curve (%)
(Bigger = Better)
24 October 2013
Adina L. Feldman
30
33. Gold Standard
24 October 2013
Binary
Contiuous
Test measure
Binary
Sensitivity,
Specificity,
etc.
ROC-curves
Continuous
Different validation methods are used for
different types of validation studies!
X
These are covered (or at least
mentioned) today
Correlations,
BlandAltman plots
Adina L. Feldman
33
41. Afternoon group excercise:
Ad hoc study of the validity of self-reported height
Define
Gold Standard
Method of ascertainment of self-reported height
Collect data
Proxy
Gold Standard
Using Excel
Plot correlation (scatter plot)
Brand-Altman plot
Draw conclusion
24 October 2013
Adina L. Feldman
41
42. Thank You!
(See you this afternoon)
Welcome to my PhD dissertation defence
10 Januari 2014, at 9 am in Andreas Vesalius,
Karolinska Institutet Campus Solna
Dissertation title:
”If I Only Had a Brain
– Epidemiological Studies of Parkinson’s Disease”
24 October 2013
Adina L. Feldman
43