SlideShare a Scribd company logo
1 of 45
E N A R 2 0 1 8
Exploring finite-sample
bias in propensity score
weights
LUCY D’AGOSTINO MCGOWAN
&
ROBERT GREEVY, JR.
E N A R 2 0 1 8
Overview
Finite sample bias
Unmeasured confounding – the problem
Unmeasured confounding – a solution
1
2
3
E N A R 2 0 1 8
treatmentcontrol
E N A R 2 0 1 8
trtcontrol
E N A R 2 0 1 8
trtcontrol
151
ATE
E N A R 2 0 1 8
treatmentcontrol
1
ATM
E N A R 2 0 1 8
treatmentcontrol
ATO
E N A R 2 0 1 8
ATE
ATM AT0
E N A R 2 0 1 8
ATE
ATM AT0
E N A R 2 0 1 8
ATE
ATM AT0
E N A R 2 0 1 8
Finite-sample bias
E N A R 2 0 1 8
E N A R 2 0 1 8
Freedman & Berk
Simulation
E N A R 2 0 1 8
yz
x1 x2
E N A R 2 0 1 8
yz
x1 x2
1
0.25 10.75 2
1
E N A R 2 0 1 8
yz
x1 x2
1
0.25 10.75 2
1
E N A R 2 0 1 8
yz
x1 x2
1
0.25 10.75 2
1
E N A R 2 0 1 8
yz
x1 x2
1
0.25 10.75 2
1
E N A R 2 0 1 8
z
x1 x2
Propensity score model
E N A R 2 0 1 8
yz
x1 x2
Outcome model
E N A R 2 0 1 8
treatmentctrl
E N A R 2 0 1 8
n = 1000
E N A R 2 0 1 8
E N A R 2 0 1 8
Standard errors
E N A R 2 0 1 8
yz
x1 x2
Outcome model
E N A R 2 0 1 8
E N A R 2 0 1 8
E N A R 2 0 1 8
Recap
• Replicated the finite-sample bias seen by
Freedman and Berk using the ATE weights
• ATM and ATO weights had improved finite-sample
properties
• The variance for the ATO and ATM is preferable to
that of the ATE
E N A R 2 0 1 8
Unmeasured confounding
the problem
E N A R 2 0 1 8
E N A R 2 0 1 8
z
x1 x2
Propensity score model
E N A R 2 0 1 8
yz
x1 x2
Outcome model
E N A R 2 0 1 8
ATE
E N A R 2 0 1 8
ATO
E N A R 2 0 1 8
Unmeasured confounding
a solution
E N A R 2 0 1 8
E-value
E − value = 𝐿𝐵 𝑜𝑏𝑠 + 𝐿𝐵 𝑜𝑏𝑠 × (𝐿𝐵 𝑜𝑏𝑠 − 1)
VanderWeele and Ding (2017)
E N A R 2 0 1 8
Adjusted E-value
E − value 𝑎𝑑𝑗 =
𝐿𝐵 𝑜𝑏𝑠
𝐿𝐵 𝑎𝑑𝑗
+
𝐿𝐵 𝑜𝑏𝑠
𝐿𝐵 𝑎𝑑𝑗
×
𝐿𝐵 𝑜𝑏𝑠
𝐿𝐵 𝑎𝑑𝑗
− 1
E N A R 2 0 1 8
Adjusted E-value
E − value 𝑎𝑑𝑗 =
𝐿𝐵 𝑜𝑏𝑠
𝐿𝐵 𝑎𝑑𝑗
+
𝐿𝐵 𝑜𝑏𝑠
𝐿𝐵 𝑎𝑑𝑗
×
𝐿𝐵 𝑜𝑏𝑠
𝐿𝐵 𝑎𝑑𝑗
− 1
E N A R 2 0 1 8
Right Heart Catheterization
Data
Connors et al (1996)
E N A R 2 0 1 8
Right Heart Catheterization
data
• We chose 20 covariates for demonstration purposes
• demographics
• comorbidities
• physiological measurements
• diagnosis categories
• APACHE score
• SUPPORT (probability of surviving 2 months)
• DNR status on day 1
Connors et al (1996)
E N A R 2 0 1 8
Right Heart Catheterization
data
•Fit a propensity score model
•Use ATO weights
•Fit weighted cox model for 30 day survival
E N A R 2 0 1 8
E N A R 2 0 1 8
E N A R 2 0 1 8
E N A R 2 0 1 8
@LucyStats
http://bit.ly/LucyStatsENAR2018
lucymcgowan.com
Thank you!

More Related Content

Recently uploaded

GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
Lokesh Kothari
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
Sérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
Sérgio Sacani
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 

Recently uploaded (20)

GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening Designs
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 

Featured

Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 

Featured (20)

Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 

Exploring finite-sample bias in propensity score weights

  • 1. E N A R 2 0 1 8 Exploring finite-sample bias in propensity score weights LUCY D’AGOSTINO MCGOWAN & ROBERT GREEVY, JR.
  • 2. E N A R 2 0 1 8 Overview Finite sample bias Unmeasured confounding – the problem Unmeasured confounding – a solution 1 2 3
  • 3. E N A R 2 0 1 8 treatmentcontrol
  • 4. E N A R 2 0 1 8 trtcontrol
  • 5. E N A R 2 0 1 8 trtcontrol 151 ATE
  • 6. E N A R 2 0 1 8 treatmentcontrol 1 ATM
  • 7. E N A R 2 0 1 8 treatmentcontrol ATO
  • 8. E N A R 2 0 1 8 ATE ATM AT0
  • 9. E N A R 2 0 1 8 ATE ATM AT0
  • 10. E N A R 2 0 1 8 ATE ATM AT0
  • 11. E N A R 2 0 1 8 Finite-sample bias
  • 12. E N A R 2 0 1 8
  • 13. E N A R 2 0 1 8 Freedman & Berk Simulation
  • 14. E N A R 2 0 1 8 yz x1 x2
  • 15. E N A R 2 0 1 8 yz x1 x2 1 0.25 10.75 2 1
  • 16. E N A R 2 0 1 8 yz x1 x2 1 0.25 10.75 2 1
  • 17. E N A R 2 0 1 8 yz x1 x2 1 0.25 10.75 2 1
  • 18. E N A R 2 0 1 8 yz x1 x2 1 0.25 10.75 2 1
  • 19. E N A R 2 0 1 8 z x1 x2 Propensity score model
  • 20. E N A R 2 0 1 8 yz x1 x2 Outcome model
  • 21. E N A R 2 0 1 8 treatmentctrl
  • 22. E N A R 2 0 1 8 n = 1000
  • 23. E N A R 2 0 1 8
  • 24. E N A R 2 0 1 8 Standard errors
  • 25. E N A R 2 0 1 8 yz x1 x2 Outcome model
  • 26. E N A R 2 0 1 8
  • 27. E N A R 2 0 1 8
  • 28. E N A R 2 0 1 8 Recap • Replicated the finite-sample bias seen by Freedman and Berk using the ATE weights • ATM and ATO weights had improved finite-sample properties • The variance for the ATO and ATM is preferable to that of the ATE
  • 29. E N A R 2 0 1 8 Unmeasured confounding the problem
  • 30. E N A R 2 0 1 8
  • 31. E N A R 2 0 1 8 z x1 x2 Propensity score model
  • 32. E N A R 2 0 1 8 yz x1 x2 Outcome model
  • 33. E N A R 2 0 1 8 ATE
  • 34. E N A R 2 0 1 8 ATO
  • 35. E N A R 2 0 1 8 Unmeasured confounding a solution
  • 36. E N A R 2 0 1 8 E-value E − value = 𝐿𝐵 𝑜𝑏𝑠 + 𝐿𝐵 𝑜𝑏𝑠 × (𝐿𝐵 𝑜𝑏𝑠 − 1) VanderWeele and Ding (2017)
  • 37. E N A R 2 0 1 8 Adjusted E-value E − value 𝑎𝑑𝑗 = 𝐿𝐵 𝑜𝑏𝑠 𝐿𝐵 𝑎𝑑𝑗 + 𝐿𝐵 𝑜𝑏𝑠 𝐿𝐵 𝑎𝑑𝑗 × 𝐿𝐵 𝑜𝑏𝑠 𝐿𝐵 𝑎𝑑𝑗 − 1
  • 38. E N A R 2 0 1 8 Adjusted E-value E − value 𝑎𝑑𝑗 = 𝐿𝐵 𝑜𝑏𝑠 𝐿𝐵 𝑎𝑑𝑗 + 𝐿𝐵 𝑜𝑏𝑠 𝐿𝐵 𝑎𝑑𝑗 × 𝐿𝐵 𝑜𝑏𝑠 𝐿𝐵 𝑎𝑑𝑗 − 1
  • 39. E N A R 2 0 1 8 Right Heart Catheterization Data Connors et al (1996)
  • 40. E N A R 2 0 1 8 Right Heart Catheterization data • We chose 20 covariates for demonstration purposes • demographics • comorbidities • physiological measurements • diagnosis categories • APACHE score • SUPPORT (probability of surviving 2 months) • DNR status on day 1 Connors et al (1996)
  • 41. E N A R 2 0 1 8 Right Heart Catheterization data •Fit a propensity score model •Use ATO weights •Fit weighted cox model for 30 day survival
  • 42. E N A R 2 0 1 8
  • 43. E N A R 2 0 1 8
  • 44. E N A R 2 0 1 8
  • 45. E N A R 2 0 1 8 @LucyStats http://bit.ly/LucyStatsENAR2018 lucymcgowan.com Thank you!

Editor's Notes

  1. We’ve shown these exciting new weighting methods that have dispelled myths about propensity score weighting. Weighting is dangerous when it comes to unmeasured confounding – it is true that it is theoretically possible for weighting to make things worse – if that unmeasured confounder has some correlation, weighting does better than covariate adjustment alone Misconceptions: Weighting has finite sample problems It is inefficient It makes unmeasured confounding worse WRONG-WRONG relative to OLS – increase correlation between X2 X1 – one effect size, but what we vary is the correlation
  2. This talk will have 3 main parts, the first part follows up nicely after Laine’s discussion of the ATO weights – we use a simulation setting from a paper published in 2008 that suggests that weighting provides biased estimates, even when the propensity score is correctly specified replicate these simulations to explore this finite sample bias using ATO and ATM weights This simulation is under the setting where both a propensity model and an outcome model are fit, and as I mentioned the propensity score is correctly specified. We then examine what would happen under these simulation conditions if rather than correctly specifying the propensity score model, both models are misspecified – this introduces “the problem” Finally, we demonstrate a potential solution to be able to get a handle on this potential for unmeasured confounding
  3. The dark portion, the grey, are the propensity scores the top part of the mirrored histogram is the treated population and the bottom, the upside-down histogram is the control population. Then the green represents the weighted propensity scores for the treated and the blue is the weighted propensity scores for the controls.
  4. The dark portion, the grey, are the propensity scores the top part of the mirrored histogram is the treated population and the bottom, the upside-down histogram is the control population. Then the green represents the weighted propensity scores for the treated and the blue is the weighted propensity scores for the controls. - Frequency counts for a sudo population based on the weights _click_ This guy at the far end has a propensity score greater than .99, and therefore ends up accounting for 151 people in the sudo cohort
  5. Population: based on the table 1, the exposed and unexposed in the weighted Table 1 will look like each other, but they will look different than the overall cohort – they will look like if you had done a matched cohort (1:1 matching) This one person now is only representing 1 person in the sudo cohort, meanwhile the 20 treated people right above are each account for 1/20th of a person in the sudo cohort
  6. Population: will look a lot like a Table 1 from a 1:1 matched trial, but shifts a bit to improve the variance
  7. This is one of the several papers has cast shade on IPW weighting that made us think that weighting is inherently flawed. We’re going to show that it is not inherently flawed In their setting they have set it us such that they are fitting a propensity score model and and a weighted outcome model -- in the continuous case this is essentially a doubly robust estimator They used ATE weights
  8. At n = 1000, Freedman and Berk report a bias of 0.13 and we replicate this, we observing a bias of 0.14. Even when n is 10,000, the result is still biased. So they of course were correct, these weights do lead to bias in finite samples.
  9. However it turns out to be less of a story about weighting in general, and more of a story about these particular weights.
  10. We replicate what Freedman and Berk show, that this difference in the true standard error of these estimates is 0.12.
  11. Correct-correct Minimum variance unbiased estimator
  12. The wisdom we've taken from these papers in rooted in teh ATE, we need to revisit this for these new weights
  13. We varied the correlation between x1 and x2 to visualize how this impacts the bias observed
  14. show ATE first, then show ATO
  15. We are excited about quantified and contextualized sensitivity analysis
  16. Vanderweele and Ding’s E-value answers a similar question as this quantified sensitivity analysis, it is the point that minimizes the strength of association, on the risk ratio scale, that an unmeasured confounder would need to have with both the exposure and outcome, conditional on the measured covariates, to explain away an observed exposure-outcome association. Again this is certainly one way to look at the analysis, but can be difficult contextualize without more information.
  17. we’ve used similar methodology to create an adjusted e-value, where we allow you to input the value you are trying to adjust to, rather than always setting the adjusted limiting bound to one. I will demonstrate soon how we can use this with our observed covariates
  18. Rather than tipping to see where the lower bound crosses 1, we allow you to specify a value you’d like to tip to – this allows us calculate “observed e-values” for our measured covariates
  19. This dataset is assessing the effectiveness of right heart catheterization (RHC) in the initial care of critically ill patients. Publically available
  20. We chose 20 covariates to include for demonstration purposes Demographics Comorbidities Physiological measurements Diagnosis categories APACHE score SUPPORT model estimate of the probability of surviving 2 months DNR status on day 1
  21. looking at relationship between 30 day survival and RHC
  22. I’m going to spend a minute on this plot -- this is a plot we’ve come up with that we’re calling an ”observed bias plot” the light blue shaded region shows the confidence interval for the observed exposure – outcome effect with all covariates included in the analysis, so here the effect between RHC and 30 day survival, in our case it is about (1.11 to 1.37). The blue line down the center is the observed point estimate, in this case it is a hazard ratio around 1.24. Each black line is the exposure-outcome effect that would have been observed had the covariate indicated not been observed. For example this top one here _click_ is the observed effect between RHC and 30 day survival if we had not observed DNR status on day 1 _click_. The purple stars represent the “adjusted E-values” These represent what the E-value would be if the *only* unmeasured confounder that were missing is the one that is dropped. So this E-value here _click_ represents what the E-value would be if DNR status on day 1 were the only missing covariate. This gives some contextualization to the E-value, allowing you to imagine it in the context of your observed covariates
  23. We also include additional values at the bottom _click_ where we can observe what happens if we leave out groups of variables, such as Labs or physiological measurements. This begins to get at the question “what if we are missing a lot of unmeasured confounders”. We then also plot what an unmeasured confounder that would tip the analysis at the Lower Bound and at the point estimate would look like, along with the associated E-value.
  24. Given the observed lower bound of 1.11, the associated E-value is 1.46. Examining Observed bias plot we can add some context to this value. The only associated E-value close to this is that for DNR status on day 1. This implies that we would need to be missing an additional independent covariate akin to DNR status on day 1 in order to tip our analysis. Even dropping all physiological measurements would not reach an E-value great enough to tip this study to inconclusive.