SlideShare a Scribd company logo
1 of 25
A plea for good methodology:
the strengths and limitations of
approaches to developing prediction
models in obstetrics and gynecology
Ben Van Calster
Department of Development and Regeneration, KU Leuven (B)
Department of Biomedical Data Sciences, LUMC (NL)
Research Ethics Committee, University Hospitals Leuven (B)
Epi-Centre, KU Leuven (B)
Glasgow/Leuven, October 16th 2020
2
To explain or to predict?
DESCRIBE / EXPLAIN
• Study independent associations / predictors / risk factors
• Key: effect size per variable
• Not prediction modeling!
PREDICT
• Obtain a system that gives predictions (risk estimates)
• Aim is the use in NEW patients: it should work ‘tomorrow’, not now
• Key: quality of the predictions
3
Strengths of prediction models
• Help in (shared) clinical decision making
• Objectify predictions
• Patient counseling
• Effect on clinical workflow and outcomes
GOOD METHODOLOGY AND
GOOD REPORTING ARE ESSENTIAL!
4
Beam and Kohane. JAMA 2018;319:1317-8.
Get the objective right
5
Riley. Nature 2019;572:27-9.
Cronin & Vickers. Urology 2010;76:1298-301.
Get the objective right
• Is there a real clinical need for a new model?
• For which outcome, and for which management decision?
• When during the clinical workflow should the prediction be made?
• Does this match with the timing of the predictors?
• Do you have/can you collect data that is (really) fit for purpose?
6
Example
7
Tangiisuran et al. PLoS One 2014;9:e111254.
Too many models, too few validations
• 1060 models predicting outcomes after CVD (1990-2015) (Wessler et al, 2017)
• 363 models predicting CVD (Damen et al, 2016)
• 231 models related to Covid-19 (Wynants et al, 2020; living syst review)
ObGyn related:
• 263 models in obstetrics (Kleinrouweler et al, 2016)
• 116 models to diagnose ovarian malignancy (Kaijser et al, 2014)
 Perhaps academic CVs need help, but patients need help more
8
Thanks to @GSCollins
Wessler et al. Diagn Progn Res 2017;1:20. Damen et al. BMJ 2016;353:i2416. Wynants et al. BMJ 2020;369:m1328.
Kleinrouweler et al. AJOG 2016;214:79-90. Kaijser et al. Hum Reprod Update 2014;20:229-62.
Models in obstetrics
Only 23 of 263 models (9%) have been externally validated!
9
Kleinrouweler et al. AJOG 2016;214:79-90.
Knowledge is power (1)
Avoid dichotomization of continuous predictor variables
• Biologically implausible
• Deletes information, worse predictions (AUC ) (Collins 2016; Steyerberg 2018)
• Only clinical decisions should be binary
10
Collins et al. Stat Med 2016;35:4124-35.
Steyerberg et al. J Clin Epidemiol 2018;98:133-43.
Butts & Ng. Statistical and methodological myths and urban legends, p361-86. Routledge/Taylor & Francis, 2009.
Knowledge is power (2)
Use available knowledge, do not always ask the data!
11
Good & Hardin. Common errors in statistics (and how to avoid them). Wiley, 2006.
“Bypassing the brain to
compute by reflex is a
sure recipe for disaster”
Knowledge is power (3)
Explain how and when predictors are measured, standardize where
reasonably possible
- Units; e.g. progesterone in ng/ml or nmol/L
- How tumor volume or diameter is calculated
- What is meant by ‘hormonal therapy use’ (Which? When?)
- Smoking
- BMI: measured vs self-reported
If measurement varies across studies, model performance deteriorates
(Luijken, 2019; Luijken, 2020)
12
Luijken et al. Stat Med 2019;38:3444-59.
Luijken et al. J Clin Epidemiol 2020;119:7-18.
Knowledge is power (4): sample size
You think of buying a Porsche.
But if you do not want to pay for it,
you may get this.
The same applies for developing risk models.
13
The currency is sample size
The more complicated (or ‘fancy’) the modeling strategy,
the more you have to pay with sample size.
(counterfeit money does not help: we need good quality data)
In this respect, avoid train-test split, this reduces sample size for model
development: you’re burning your money
14
The currency is sample size
Many have heard of the “10 events per variable” rule
1. Often incorrect use: This is not about 10 patients per variable in the final model!
2. This is outdated, 10 EPV is often not enough. See new procedure (BMJ 2020).
3. Flexible algorithms are data hungry, EPV>>10 may be needed (van der Ploeg 2014).
15
Van der Ploeg et al. BMC Med Res Methodol 2014;14:137
Riley et al. BMJ 2020;368:m441.
Knowledge is power (5): missing data
Usually, “empty cells” are “full of information”!
Using only complete cases
- decreases sample size (less money)
- typically leaves a non-representative sample (biased risk estimates)
Presence of a test can be more predictive than the test result! See EHR data.
16
Agniel et al. BMJ 2018;360:k1479.
Model validation: assess calibration!
Key elements of model performance:
discrimination between patients with and without the event
calibration (correctness) of risk estimates
17
DISCRIMINATION
When it rained, was the
estimated chance of rain
higher (on average)?
CALIBRATION
For days with 80% estimated
chance of rain, did it rain on
8 out of 10 days?
Calibration: the Achilles heel
18
Van Calster & Vickers. Med Decis Making 2015;35:162-9.
Van Calster et al. BMC Med 2019;17:230.
Miscalibration: estimated risk is inaccurate
 Patient and clinician are misinformed, may lead to inappropriate decisions
(Van Calster & Vickers, 2015)
Performance depends on place and time
One external validation in one hospital does not tell much about a model!
“There is no such thing as a validated model”
 Study heterogeneity
19
Van Calster et al. BMJ 2020;370:m2614.
P-values and significance testing
Very small role in prediction modeling
- Focus is on robust predictions
- Focus is on precision of the performance estimates (e.g. AUC, calibration)
- Focus is on quantifying heterogeneity
- Focus is on qualitative difference between populations
- Focus is on a priori selection of predictors
- further data-driven selection can be based on p-values; high alpha recommended
(Steyerberg & Van Calster, 2020)
20
Steyerberg & Van Calster. Eur J Clin Invest 2020;50:e13229.
Machine learning popularity
21
“Typical machine learning algorithms are highly flexible
So will uncover associations we could not find before
Hence better predictions and management decisions”
→ One of the master keys, with guaranteed success!
Machine Learning: success guaranteed?
22
Christodoulou et al. J Clin Epidemiol 2019;110:12-22.
Poor methodology and reporting is common
23
Christodoulou et al (2019) – 71 studies:
- What was done about missing data? 100% poor or unclear
- How was performance validated? 68% unclear or biased approach
- Was calibration of risk estimates studied? 79% not at all
- Prognostic models: time horizon often ignored completely
Kleinrouweler et al (2016) – 263 models:
- Was calibration studied? 82% not at all
- Was the model fully presented so people can use it? Not for 38% of models
- Was the clinical use discussed? Not for 89% of models
FOLLOW TRIPOD GUIDELINES FOR REPORTING!
www.tripod-statement.org
Christodoulou et al. J Clin Epidemiol 2019;110:12-22.
Kleinrouweler et al. AJOG 2016;214:79-90.
Moons et al. Ann Intern Med 2015;162:w1-73.
The harm of poor methodology
24
Steyerberg et al. J Clin Epidemiol 2018;98:133-43.
Resources on prediction modeling
25
Involve a statistician with knowledge of prediction modeling!
Steyerberg EW. Clinical prediction models (2nd ed). Springer, 2019.
Riley RD et al. Prognosis research in healthcare. OUP, 2019.
Moons KGM et al. Transparent reporting of a multivariable prediction model for individual
prognosis and diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med
2015;162:W1-73.
Wynants L et al. Key steps and common pitfalls in developing and validating risk models.
BJOG 2017; 2017;124:423-432.
Prognosisresearch.com (newly launched website)

More Related Content

What's hot

NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...European School of Oncology
 
How to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkHow to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkStats Statswork
 
850 keynote savage_using his laptop
850 keynote savage_using his laptop850 keynote savage_using his laptop
850 keynote savage_using his laptopRising Media, Inc.
 
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...MEASURE Evaluation
 
Ijp volume 4 issue 3_pages 1465-1473
Ijp volume 4 issue 3_pages 1465-1473Ijp volume 4 issue 3_pages 1465-1473
Ijp volume 4 issue 3_pages 1465-1473Ajums
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IMaarten van Smeden
 
Emergency Department Triage and Digital Health
Emergency Department Triage and Digital HealthEmergency Department Triage and Digital Health
Emergency Department Triage and Digital HealthHughSingleton
 
Regression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questionsRegression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questionsMaarten van Smeden
 
Surrogate endpoints in global health research: still searching for killer app...
Surrogate endpoints in global health research: still searching for killer app...Surrogate endpoints in global health research: still searching for killer app...
Surrogate endpoints in global health research: still searching for killer app...SystemOne
 
Bioanalytical validation house of cards
Bioanalytical validation house of cardsBioanalytical validation house of cards
Bioanalytical validation house of cardsE. Dennis Bashaw
 
A cost effectiveness_model_of_screening_strategies.16
A cost effectiveness_model_of_screening_strategies.16A cost effectiveness_model_of_screening_strategies.16
A cost effectiveness_model_of_screening_strategies.16Yesenia Castillo Salinas
 
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906Mark Gusack
 
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1cPredicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1cDamian R. Mingle, MBA
 
Digital platforms could disrupts how pharma companies plan and excecute clini...
Digital platforms could disrupts how pharma companies plan and excecute clini...Digital platforms could disrupts how pharma companies plan and excecute clini...
Digital platforms could disrupts how pharma companies plan and excecute clini...Jayanthi Repalli, PhD
 
The absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problemThe absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problemMaarten van Smeden
 
Glymour aaai
Glymour aaaiGlymour aaai
Glymour aaaimglymour
 
Meta Analysis of Medical Device Data Applications for Designing Studies and R...
Meta Analysis of Medical Device Data Applications for Designing Studies and R...Meta Analysis of Medical Device Data Applications for Designing Studies and R...
Meta Analysis of Medical Device Data Applications for Designing Studies and R...NAMSA
 
Optimising sepsis treatment with reinforcement learning - Matthieu Komorowski
Optimising sepsis treatment with reinforcement learning - Matthieu KomorowskiOptimising sepsis treatment with reinforcement learning - Matthieu Komorowski
Optimising sepsis treatment with reinforcement learning - Matthieu KomorowskiMads Astvad
 

What's hot (20)

NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
NY Prostate Cancer Conference - A. Vickers - Session 1: Traditional statistic...
 
How to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - StatsworkHow to establish and evaluate clinical prediction models - Statswork
How to establish and evaluate clinical prediction models - Statswork
 
850 keynote savage_using his laptop
850 keynote savage_using his laptop850 keynote savage_using his laptop
850 keynote savage_using his laptop
 
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
Timeliness of Malaria Treatment in Children Under Five Years of Age in sub-Sa...
 
Ijp volume 4 issue 3_pages 1465-1473
Ijp volume 4 issue 3_pages 1465-1473Ijp volume 4 issue 3_pages 1465-1473
Ijp volume 4 issue 3_pages 1465-1473
 
Introduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part IIntroduction to prediction modelling - Berlin 2018 - Part I
Introduction to prediction modelling - Berlin 2018 - Part I
 
Emergency Department Triage and Digital Health
Emergency Department Triage and Digital HealthEmergency Department Triage and Digital Health
Emergency Department Triage and Digital Health
 
Regression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questionsRegression shrinkage: better answers to causal questions
Regression shrinkage: better answers to causal questions
 
Surrogate endpoints in global health research: still searching for killer app...
Surrogate endpoints in global health research: still searching for killer app...Surrogate endpoints in global health research: still searching for killer app...
Surrogate endpoints in global health research: still searching for killer app...
 
Bioanalytical validation house of cards
Bioanalytical validation house of cardsBioanalytical validation house of cards
Bioanalytical validation house of cards
 
A cost effectiveness_model_of_screening_strategies.16
A cost effectiveness_model_of_screening_strategies.16A cost effectiveness_model_of_screening_strategies.16
A cost effectiveness_model_of_screening_strategies.16
 
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
 
SgtSaraEdition
SgtSaraEditionSgtSaraEdition
SgtSaraEdition
 
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1cPredicting Diabetic Readmission Rates: Moving Beyond HbA1c
Predicting Diabetic Readmission Rates: Moving Beyond HbA1c
 
Digital platforms could disrupts how pharma companies plan and excecute clini...
Digital platforms could disrupts how pharma companies plan and excecute clini...Digital platforms could disrupts how pharma companies plan and excecute clini...
Digital platforms could disrupts how pharma companies plan and excecute clini...
 
The absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problemThe absence of a gold standard: a measurement error problem
The absence of a gold standard: a measurement error problem
 
Glymour aaai
Glymour aaaiGlymour aaai
Glymour aaai
 
Meta Analysis of Medical Device Data Applications for Designing Studies and R...
Meta Analysis of Medical Device Data Applications for Designing Studies and R...Meta Analysis of Medical Device Data Applications for Designing Studies and R...
Meta Analysis of Medical Device Data Applications for Designing Studies and R...
 
Optimising sepsis treatment with reinforcement learning - Matthieu Komorowski
Optimising sepsis treatment with reinforcement learning - Matthieu KomorowskiOptimising sepsis treatment with reinforcement learning - Matthieu Komorowski
Optimising sepsis treatment with reinforcement learning - Matthieu Komorowski
 
Austin Ophthalmology
Austin OphthalmologyAustin Ophthalmology
Austin Ophthalmology
 

Similar to A plea for good methodology when developing clinical prediction models

Make clinical prediction models great again
Make clinical prediction models great againMake clinical prediction models great again
Make clinical prediction models great againBenVanCalster
 
Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Evangelos Kritsotakis
 
Comparison of a fall risk assessment tool with nurses’ judgment alone
Comparison of a fall risk assessment tool with nurses’ judgment aloneComparison of a fall risk assessment tool with nurses’ judgment alone
Comparison of a fall risk assessment tool with nurses’ judgment aloneDanskSygeplejeraad
 
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...IRJET Journal
 
Common statistical pitfalls & errors in biomedical research (a top-5 list)
Common statistical pitfalls & errors in biomedical research (a top-5 list)Common statistical pitfalls & errors in biomedical research (a top-5 list)
Common statistical pitfalls & errors in biomedical research (a top-5 list)Evangelos Kritsotakis
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Ewout Steyerberg
 
Therapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
Therapeutic_Innovation_&_Regulatory_Science-2015-TantsyuraTherapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
Therapeutic_Innovation_&_Regulatory_Science-2015-TantsyuraVadim Tantsyura
 
David Haggstrom Slides from AHRQ Kick-Off Event
David Haggstrom Slides from AHRQ Kick-Off EventDavid Haggstrom Slides from AHRQ Kick-Off Event
David Haggstrom Slides from AHRQ Kick-Off EventShawnHoke
 
ISCB 2023 Sources of uncertainty b.pptx
ISCB 2023 Sources of uncertainty b.pptxISCB 2023 Sources of uncertainty b.pptx
ISCB 2023 Sources of uncertainty b.pptxBenVanCalster
 
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...mlaij
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...mlaij
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...mlaij
 
Evaluating the Medical Literature
Evaluating the Medical LiteratureEvaluating the Medical Literature
Evaluating the Medical LiteratureClista Clanton
 
Measuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net BenefitMeasuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net BenefitLaure Wynants
 
20160223 patient experience2
20160223 patient experience220160223 patient experience2
20160223 patient experience2jescarra
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianLaure Wynants
 
Pediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events PresentationPediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events PresentationJordan Gamart
 
Integrated ACO selected for the NAACOS Innovation Showcase
Integrated ACO selected for the NAACOS Innovation ShowcaseIntegrated ACO selected for the NAACOS Innovation Showcase
Integrated ACO selected for the NAACOS Innovation ShowcaseEric Weaver
 

Similar to A plea for good methodology when developing clinical prediction models (20)

Make clinical prediction models great again
Make clinical prediction models great againMake clinical prediction models great again
Make clinical prediction models great again
 
Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...Developing and validating statistical models for clinical prediction and prog...
Developing and validating statistical models for clinical prediction and prog...
 
Comparison of a fall risk assessment tool with nurses’ judgment alone
Comparison of a fall risk assessment tool with nurses’ judgment aloneComparison of a fall risk assessment tool with nurses’ judgment alone
Comparison of a fall risk assessment tool with nurses’ judgment alone
 
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
PREDICTION OF BREAST CANCER,COMPARATIVE REVIEW OF MACHINE LEARNING TECHNIQUES...
 
Common statistical pitfalls & errors in biomedical research (a top-5 list)
Common statistical pitfalls & errors in biomedical research (a top-5 list)Common statistical pitfalls & errors in biomedical research (a top-5 list)
Common statistical pitfalls & errors in biomedical research (a top-5 list)
 
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
Prediction, Big Data, and AI: Steyerberg, Basel Nov 1, 2019
 
Therapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
Therapeutic_Innovation_&_Regulatory_Science-2015-TantsyuraTherapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
Therapeutic_Innovation_&_Regulatory_Science-2015-Tantsyura
 
David Haggstrom Slides from AHRQ Kick-Off Event
David Haggstrom Slides from AHRQ Kick-Off EventDavid Haggstrom Slides from AHRQ Kick-Off Event
David Haggstrom Slides from AHRQ Kick-Off Event
 
Neal Lesh
Neal LeshNeal Lesh
Neal Lesh
 
ISCB 2023 Sources of uncertainty b.pptx
ISCB 2023 Sources of uncertainty b.pptxISCB 2023 Sources of uncertainty b.pptx
ISCB 2023 Sources of uncertainty b.pptx
 
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
BREAST TUMOR DETECTION USING EFFICIENT MACHINE LEARNING AND DEEP LEARNING TEC...
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
 
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
Breast Tumor Detection Using Efficient Machine Learning and Deep Learning Tec...
 
Evaluating the Medical Literature
Evaluating the Medical LiteratureEvaluating the Medical Literature
Evaluating the Medical Literature
 
Measuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net BenefitMeasuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net Benefit
 
20160223 patient experience2
20160223 patient experience220160223 patient experience2
20160223 patient experience2
 
Dichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatisticianDichotomania and other challenges for the collaborating biostatistician
Dichotomania and other challenges for the collaborating biostatistician
 
Clinical prediction models
Clinical prediction modelsClinical prediction models
Clinical prediction models
 
Pediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events PresentationPediatric Adverse Drug Events Presentation
Pediatric Adverse Drug Events Presentation
 
Integrated ACO selected for the NAACOS Innovation Showcase
Integrated ACO selected for the NAACOS Innovation ShowcaseIntegrated ACO selected for the NAACOS Innovation Showcase
Integrated ACO selected for the NAACOS Innovation Showcase
 

Recently uploaded

From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 

Recently uploaded (20)

From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 

A plea for good methodology when developing clinical prediction models

  • 1. A plea for good methodology: the strengths and limitations of approaches to developing prediction models in obstetrics and gynecology Ben Van Calster Department of Development and Regeneration, KU Leuven (B) Department of Biomedical Data Sciences, LUMC (NL) Research Ethics Committee, University Hospitals Leuven (B) Epi-Centre, KU Leuven (B) Glasgow/Leuven, October 16th 2020
  • 2. 2
  • 3. To explain or to predict? DESCRIBE / EXPLAIN • Study independent associations / predictors / risk factors • Key: effect size per variable • Not prediction modeling! PREDICT • Obtain a system that gives predictions (risk estimates) • Aim is the use in NEW patients: it should work ‘tomorrow’, not now • Key: quality of the predictions 3
  • 4. Strengths of prediction models • Help in (shared) clinical decision making • Objectify predictions • Patient counseling • Effect on clinical workflow and outcomes GOOD METHODOLOGY AND GOOD REPORTING ARE ESSENTIAL! 4 Beam and Kohane. JAMA 2018;319:1317-8.
  • 5. Get the objective right 5 Riley. Nature 2019;572:27-9. Cronin & Vickers. Urology 2010;76:1298-301.
  • 6. Get the objective right • Is there a real clinical need for a new model? • For which outcome, and for which management decision? • When during the clinical workflow should the prediction be made? • Does this match with the timing of the predictors? • Do you have/can you collect data that is (really) fit for purpose? 6
  • 7. Example 7 Tangiisuran et al. PLoS One 2014;9:e111254.
  • 8. Too many models, too few validations • 1060 models predicting outcomes after CVD (1990-2015) (Wessler et al, 2017) • 363 models predicting CVD (Damen et al, 2016) • 231 models related to Covid-19 (Wynants et al, 2020; living syst review) ObGyn related: • 263 models in obstetrics (Kleinrouweler et al, 2016) • 116 models to diagnose ovarian malignancy (Kaijser et al, 2014)  Perhaps academic CVs need help, but patients need help more 8 Thanks to @GSCollins Wessler et al. Diagn Progn Res 2017;1:20. Damen et al. BMJ 2016;353:i2416. Wynants et al. BMJ 2020;369:m1328. Kleinrouweler et al. AJOG 2016;214:79-90. Kaijser et al. Hum Reprod Update 2014;20:229-62.
  • 9. Models in obstetrics Only 23 of 263 models (9%) have been externally validated! 9 Kleinrouweler et al. AJOG 2016;214:79-90.
  • 10. Knowledge is power (1) Avoid dichotomization of continuous predictor variables • Biologically implausible • Deletes information, worse predictions (AUC ) (Collins 2016; Steyerberg 2018) • Only clinical decisions should be binary 10 Collins et al. Stat Med 2016;35:4124-35. Steyerberg et al. J Clin Epidemiol 2018;98:133-43. Butts & Ng. Statistical and methodological myths and urban legends, p361-86. Routledge/Taylor & Francis, 2009.
  • 11. Knowledge is power (2) Use available knowledge, do not always ask the data! 11 Good & Hardin. Common errors in statistics (and how to avoid them). Wiley, 2006. “Bypassing the brain to compute by reflex is a sure recipe for disaster”
  • 12. Knowledge is power (3) Explain how and when predictors are measured, standardize where reasonably possible - Units; e.g. progesterone in ng/ml or nmol/L - How tumor volume or diameter is calculated - What is meant by ‘hormonal therapy use’ (Which? When?) - Smoking - BMI: measured vs self-reported If measurement varies across studies, model performance deteriorates (Luijken, 2019; Luijken, 2020) 12 Luijken et al. Stat Med 2019;38:3444-59. Luijken et al. J Clin Epidemiol 2020;119:7-18.
  • 13. Knowledge is power (4): sample size You think of buying a Porsche. But if you do not want to pay for it, you may get this. The same applies for developing risk models. 13
  • 14. The currency is sample size The more complicated (or ‘fancy’) the modeling strategy, the more you have to pay with sample size. (counterfeit money does not help: we need good quality data) In this respect, avoid train-test split, this reduces sample size for model development: you’re burning your money 14
  • 15. The currency is sample size Many have heard of the “10 events per variable” rule 1. Often incorrect use: This is not about 10 patients per variable in the final model! 2. This is outdated, 10 EPV is often not enough. See new procedure (BMJ 2020). 3. Flexible algorithms are data hungry, EPV>>10 may be needed (van der Ploeg 2014). 15 Van der Ploeg et al. BMC Med Res Methodol 2014;14:137 Riley et al. BMJ 2020;368:m441.
  • 16. Knowledge is power (5): missing data Usually, “empty cells” are “full of information”! Using only complete cases - decreases sample size (less money) - typically leaves a non-representative sample (biased risk estimates) Presence of a test can be more predictive than the test result! See EHR data. 16 Agniel et al. BMJ 2018;360:k1479.
  • 17. Model validation: assess calibration! Key elements of model performance: discrimination between patients with and without the event calibration (correctness) of risk estimates 17 DISCRIMINATION When it rained, was the estimated chance of rain higher (on average)? CALIBRATION For days with 80% estimated chance of rain, did it rain on 8 out of 10 days?
  • 18. Calibration: the Achilles heel 18 Van Calster & Vickers. Med Decis Making 2015;35:162-9. Van Calster et al. BMC Med 2019;17:230. Miscalibration: estimated risk is inaccurate  Patient and clinician are misinformed, may lead to inappropriate decisions (Van Calster & Vickers, 2015)
  • 19. Performance depends on place and time One external validation in one hospital does not tell much about a model! “There is no such thing as a validated model”  Study heterogeneity 19 Van Calster et al. BMJ 2020;370:m2614.
  • 20. P-values and significance testing Very small role in prediction modeling - Focus is on robust predictions - Focus is on precision of the performance estimates (e.g. AUC, calibration) - Focus is on quantifying heterogeneity - Focus is on qualitative difference between populations - Focus is on a priori selection of predictors - further data-driven selection can be based on p-values; high alpha recommended (Steyerberg & Van Calster, 2020) 20 Steyerberg & Van Calster. Eur J Clin Invest 2020;50:e13229.
  • 21. Machine learning popularity 21 “Typical machine learning algorithms are highly flexible So will uncover associations we could not find before Hence better predictions and management decisions” → One of the master keys, with guaranteed success!
  • 22. Machine Learning: success guaranteed? 22 Christodoulou et al. J Clin Epidemiol 2019;110:12-22.
  • 23. Poor methodology and reporting is common 23 Christodoulou et al (2019) – 71 studies: - What was done about missing data? 100% poor or unclear - How was performance validated? 68% unclear or biased approach - Was calibration of risk estimates studied? 79% not at all - Prognostic models: time horizon often ignored completely Kleinrouweler et al (2016) – 263 models: - Was calibration studied? 82% not at all - Was the model fully presented so people can use it? Not for 38% of models - Was the clinical use discussed? Not for 89% of models FOLLOW TRIPOD GUIDELINES FOR REPORTING! www.tripod-statement.org Christodoulou et al. J Clin Epidemiol 2019;110:12-22. Kleinrouweler et al. AJOG 2016;214:79-90. Moons et al. Ann Intern Med 2015;162:w1-73.
  • 24. The harm of poor methodology 24 Steyerberg et al. J Clin Epidemiol 2018;98:133-43.
  • 25. Resources on prediction modeling 25 Involve a statistician with knowledge of prediction modeling! Steyerberg EW. Clinical prediction models (2nd ed). Springer, 2019. Riley RD et al. Prognosis research in healthcare. OUP, 2019. Moons KGM et al. Transparent reporting of a multivariable prediction model for individual prognosis and diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med 2015;162:W1-73. Wynants L et al. Key steps and common pitfalls in developing and validating risk models. BJOG 2017; 2017;124:423-432. Prognosisresearch.com (newly launched website)