Absolute risk estimation in a case cohort study of prostate cancer

Background
Methods
Results
Summary
Absolute Risk Estimation in a Case Cohort Study
of Prostate Cancer
Sahir Rai Bhatnagar
Queen’s University
sahir.bhatnagar@queensu.ca
August 30, 2013
1 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Incidence vs. Mortality of Prostate Cancer in Canada
prostate
25
50
75
100
125
1990 2000 2010
year
age−standardizedrate
incidence mortality
2 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Incidence vs. Mortality of Prostate Cancer in Canada
prostate
25
50
75
100
125
1990 2000 2010
year
age−standardizedrate
incidence mortality
Possible reasons for gap:
Better treatment outcomes
2 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Incidence vs. Mortality of Prostate Cancer in Canada
prostate
25
50
75
100
125
1990 2000 2010
year
age−standardizedrate
incidence mortality
Possible reasons for gap:
Better treatment outcomes
Death from other causes
(comorbidities)
2 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Incidence vs. Mortality of Prostate Cancer in Canada
prostate
25
50
75
100
125
1990 2000 2010
year
age−standardizedrate
incidence mortality
Possible reasons for gap:
Better treatment outcomes
Death from other causes
(comorbidities)
Overdiagnosis
2 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Why is this Gap a Problem?
3 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Why is this Gap a Problem?
Overdiagnosis
1 Screen detected cancer that would have otherwise not been
diagnosed before death from other causes
3 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Why is this Gap a Problem?
Overdiagnosis
1 Screen detected cancer that would have otherwise not been
diagnosed before death from other causes
2 Can lead to overtreatment
3 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Why is this Gap a Problem?
Overdiagnosis
1 Screen detected cancer that would have otherwise not been
diagnosed before death from other causes
2 Can lead to overtreatment
Overtreatment
Treatment for a cancer that is clinically not significant → low
tumour grade
3 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Why is this Gap a Problem?
Overdiagnosis
1 Screen detected cancer that would have otherwise not been
diagnosed before death from other causes
2 Can lead to overtreatment
Overtreatment
Treatment for a cancer that is clinically not significant → low
tumour grade
Morbidity associated with being diagnosed with cancer
3 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Why is this Gap a Problem?
Overdiagnosis
1 Screen detected cancer that would have otherwise not been
diagnosed before death from other causes
2 Can lead to overtreatment
Overtreatment
Treatment for a cancer that is clinically not significant → low
tumour grade
Morbidity associated with being diagnosed with cancer
Quality of life effects, incontinence, erectile dysfunction
3 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Why is this Gap a Problem?
Overdiagnosis
1 Screen detected cancer that would have otherwise not been
diagnosed before death from other causes
2 Can lead to overtreatment
Overtreatment
Treatment for a cancer that is clinically not significant → low
tumour grade
Morbidity associated with being diagnosed with cancer
Quality of life effects, incontinence, erectile dysfunction
Lower life expectancy
3 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Solution to Overdiagnosis and Overtreatment
4 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Solution to Overdiagnosis and Overtreatment
Individualised Decision Making
4 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Solution to Overdiagnosis and Overtreatment
Individualised Decision Making
Active surveillance for patients with favorable risk disease
(Gleason score ≤ 6)
4 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Solution to Overdiagnosis and Overtreatment
Individualised Decision Making
Active surveillance for patients with favorable risk disease
(Gleason score ≤ 6)
Clinical tools that calculate life expectancy based on
predisposing factors such as age and comorbidities
4 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Objectives
Research Statement
1 Using a semiparametric model based on both age and
comorbidity, obtain absolute risk estimates of death from
prostate cancer in a population based case cohort study
5 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Objectives
Research Statement
1 Using a semiparametric model based on both age and
comorbidity, obtain absolute risk estimates of death from
prostate cancer in a population based case cohort study
2 Create a web tool for life expectancy based on this model, for
use in a clinical setting
5 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Wykes (2011)
Wykes (2011) proposed an ad hoc approach to estimating the
relative risk and baseline hazard function for the case cohort design:
6 / 35
Background
Methods
Results
Summary
Rationale
Objective
Previous Work
Wykes (2011)
Wykes (2011) proposed an ad hoc approach to estimating the
relative risk and baseline hazard function for the case cohort design:
Modified PH Model
S(t) = (S0(t))α
exp {αβ1 (age − µage) +
β2 CIRS-Gpros − µCIRS-Gpros
S0(t), β1, µage are derived from the full cohort
β2, µCIRS-Gpros are from case cohort
α = β1 from case cohort
β1 from full cohort
6 / 35
Case Cohort Study Design
40 45 50 55 60 65 70
(a) Cohort, no exposure info
40 45 50 55 60 65 70
(b) Case cohort sampling
40 45 50 55 60 65 70
(c) Add back failures
CIRS=10
CIRS=2
CIRS=7
CIRS=6
CIRS=14
CIRS=13
CIRS=1
CIRS=9
CIRS=9
CIRS=4
CIRS=4
CIRS=10
CIRS=1
CIRS=3
CIRS=6
CIRS=7
40 45 50 55 60 65 70
(d) Get exposure information
Figure: Each line represents how long a subject is at risk. A failure is represented by a
red dot. Subjects without a red dot are censored individuals. The axis represents age
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Study Population
8 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Study Population
Population-based case cohort study of 2,740 patients
diagnosed with prostate cancer between 1990 and 1998 in
Ontario
8 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Study Population
Population-based case cohort study of 2,740 patients
diagnosed with prostate cancer between 1990 and 1998 in
Ontario
Stratified by CCOR random sample
8 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Study Population
Population-based case cohort study of 2,740 patients
diagnosed with prostate cancer between 1990 and 1998 in
Ontario
Stratified by CCOR random sample
Cases were sampled independently of subcohort
8 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Study Population
Population-based case cohort study of 2,740 patients
diagnosed with prostate cancer between 1990 and 1998 in
Ontario
Stratified by CCOR random sample
Cases were sampled independently of subcohort
Death from prostate cancer and death from other causes
assessed → death clearance date December 31, 1999
8 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Study Population
Population-based case cohort study of 2,740 patients
diagnosed with prostate cancer between 1990 and 1998 in
Ontario
Stratified by CCOR random sample
Cases were sampled independently of subcohort
Death from prostate cancer and death from other causes
assessed → death clearance date December 31, 1999
Death from any cause updated for subcohort only → death
clearance date December 31, 2008
8 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Improvements
Stratified by CCOR Cox PH
Sj (t) = S0j (t)exp(β1·Age+β2·CIRS-Gpros )
, j = 1, . . . , 8
9 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Improvements
Stratified by CCOR Cox PH
Sj (t) = S0j (t)exp(β1·Age+β2·CIRS-Gpros )
, j = 1, . . . , 8
We improve on Wykes (2011) work in two ways:
9 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Improvements
Stratified by CCOR Cox PH
Sj (t) = S0j (t)exp(β1·Age+β2·CIRS-Gpros )
, j = 1, . . . , 8
We improve on Wykes (2011) work in two ways:
we estimate both the baseline survival function and relative
risk parameters using the case cohort population
9 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Improvements
Stratified by CCOR Cox PH
Sj (t) = S0j (t)exp(β1·Age+β2·CIRS-Gpros )
, j = 1, . . . , 8
We improve on Wykes (2011) work in two ways:
we estimate both the baseline survival function and relative
risk parameters using the case cohort population
we use a stratified by CCOR analysis, which assumes a
different baseline hazard for each region
9 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Relative Risk Estimation
Prentice (1986) proposed a pseudolikelihood given by
˜L(β) =
D
j=1
exp zT
(j)β
k∈˜Rtj
exp zT
k β wk
10 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Relative Risk Estimation
Prentice (1986) proposed a pseudolikelihood given by
˜L(β) =
D
j=1
exp zT
(j)β
k∈˜Rtj
exp zT
k β wk
t1, . . . , tD: D distinct failure times
10 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Relative Risk Estimation
Prentice (1986) proposed a pseudolikelihood given by
˜L(β) =
D
j=1
exp zT
(j)β
k∈˜Rtj
exp zT
k β wk
t1, . . . , tD: D distinct failure times
˜Rtj : the risk set at failure time tj
10 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Relative Risk Estimation
Prentice (1986) proposed a pseudolikelihood given by
˜L(β) =
D
j=1
exp zT
(j)β
k∈˜Rtj
exp zT
k β wk
t1, . . . , tD: D distinct failure times
˜Rtj : the risk set at failure time tj
wk: the weight applied to the risk set
10 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Weighting Schemes
exp zT
i β
Yi (tj )wi (tj ) exp zT
i β +
k∈SC,k=i
Yk(tj )wk(tj ) exp zT
k β
Table: Comparing different values of wk
Outcome type and timing Prentice Self & Prentice Barlow
(1986) (1988) (1994)
Case outside SC before failure 0 0 0
Case outside SC at failure 1 0 1
Case in SC before failure 1 1 1/α
Case in SC at failure 1 1 1
SC control 1 1 1/α
11 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Relative Risk Estimation
Therneau & Li (1999) showed that Var(˜β) can be computed by
adjusting the outputted variance from standard Cox model
programs:
˜I−1
+ (1 − α)DT
SC DSC
12 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Relative Risk Estimation
Therneau & Li (1999) showed that Var(˜β) can be computed by
adjusting the outputted variance from standard Cox model
programs:
˜I−1
+ (1 − α)DT
SC DSC
˜I−1: variance returned by the Cox model program
12 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Relative Risk Estimation
Therneau & Li (1999) showed that Var(˜β) can be computed by
adjusting the outputted variance from standard Cox model
programs:
˜I−1
+ (1 − α)DT
SC DSC
˜I−1: variance returned by the Cox model program
DSC : subset of matrix of dfbeta residuals that contains
subcohort members only
12 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Absolute Risk Estimation
A weighted version of the Breslow estimator for the full cohort
with covariate history z0(u) is given by
d ˆΛ(u, z0(u)) = d ˆΛ0(u) exp zT
0 (u) ˆβ
=
n
i=1
dNi (u)
( n
m ) j∈˜Ri (u) exp zT
j (u) ˆβ
exp zT
0 (u) ˆβ
=
n
i=1
dNi (u)
( n
m ) j∈˜Ri (u) exp (zj(u) − z0(u))T ˆβ
=
n
i=1
m
n
dNi (u)
j∈˜Ri (u) exp (zj(u) − z0(u))T ˆβ
13 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Absolute Risk Estimation
The sum of these increments can be used to calculate the risk
between two time points, say s and t, for s < t:
ˆΛ(s, t, z0) = ˆΛ(0, t, z0) − ˆΛ(0, s, z0) =
t
s
d ˆΛ(u, z0(u))
14 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Absolute Risk Estimation
The sum of these increments can be used to calculate the risk
between two time points, say s and t, for s < t:
ˆΛ(s, t, z0) = ˆΛ(0, t, z0) − ˆΛ(0, s, z0) =
t
s
d ˆΛ(u, z0(u))
For this project, we will be only considering the case when s = 0.
The corresponding survival probability is
ˆS(0, t, z0) = exp − ˆΛ(0, t, z0)
14 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Software
PROC PHREG in SAS/STAT software and coxph in R
15 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Software
PROC PHREG in SAS/STAT software and coxph in R
SAS macro developed by Langholz & Jiao (2007)
15 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Computational Methods for the Case Cohort Design
Standard Cox PH functions in SAS and R cannot directly
compute the relative and absolute risk estimates
16 / 35
Background
Methods
Results
Summary
Case Cohort Study Design
Population
Model
Software
Computational Methods for the Case Cohort Design
Standard Cox PH functions in SAS and R cannot directly
compute the relative and absolute risk estimates
Modifications must be made to the dataset to make use of
PROC PHREG and coxph functions
16 / 35
Computation of Pseudolikelihood
Manually
Subcohort failure
Non-Subcohort failure
Subcohort failure
40 45 50 55 60 65 70
7
6
5
4
3
2
1
˜L(β) =
exp(Z7β)
exp(Z1β) + exp(Z2β) + exp(Z6β) + exp(Z7β)
×
exp(Z5β)
exp(Z1β) + exp(Z2β) + exp(Z4β) + exp(Z5β) + exp(Z6β)
×
exp(Z1β)
exp(Z1β) + exp(Z3β) + exp(Z4β)
Computation of Pseudolikelihood
Manually
Subcohort failure
Non-Subcohort failure
Subcohort failure
40 45 50 55 60 65 70
7
6
5
4
3
2
1
˜L(β) =
exp(Z7β)
exp(Z1β) + exp(Z2β) + exp(Z6β) + exp(Z7β)
×
exp(Z5β)
exp(Z1β) + exp(Z2β) + exp(Z4β) + exp(Z5β) + exp(Z6β)
×
exp(Z1β)
exp(Z1β) + exp(Z3β) + exp(Z4β)
Computationally
40 45 50 55 60 65 70
7.2
7.1
6
5
4
3
2
1.2
1.1
˜L(β) =
exp(Z7.2β)
exp(Z1.1β) + exp(Z2β) + exp(Z6β) + exp(Z7.2β)
×
exp(Z5β)
exp(Z1.1β) + exp(Z2β) + exp(Z4β) + exp(Z5β) + exp(Z6β)
×
exp(Z1.2β)
exp(Z1.2β) + exp(Z3β) + exp(Z4β)
Background
Methods
Results
Summary
Relative Risk Estimates
Absolute Risk Estimates
Web Life Expectancy Tool
Table: All Cause mortality
All Cause
Covariate 1999 2008 Wykes (2011)
Age 1.04 (1.02, 1.05) 1.05 (1.04, 1.06) 1.02 (1.01, 1.03)
CIRS-Gpros 1.14 (1.10, 1.18) 1.13 (1.10, 1.16) 1.12 (1.09, 1.15)
18 / 35
Background
Methods
Results
Summary
Relative Risk Estimates
Absolute Risk Estimates
Web Life Expectancy Tool
Table: All Cause mortality
All Cause
Covariate 1999 2008 Wykes (2011)
Age 1.04 (1.02, 1.05) 1.05 (1.04, 1.06) 1.02 (1.01, 1.03)
CIRS-Gpros 1.14 (1.10, 1.18) 1.13 (1.10, 1.16) 1.12 (1.09, 1.15)
Table: Cause Specific mortality
Cause Specific
Covariate PCa (1999)a OC (1999)
Age 1.00 (0.99, 1.02) 1.07 (1.05, 1.09)
CIRS-Gpros 1.032 (0.99, 1.07) 1.25 (1.20, 1.30)
a
predictors not significant
18 / 35
Background
Methods
Results
Summary
Relative Risk Estimates
Absolute Risk Estimates
Web Life Expectancy Tool
qqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqq
q
qqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqq
q
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
q
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
q
qqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
q
q
qqqqqqqqq
q
qq
qqqqqqqqqqqqqqqq
q
qqqqqqqqqqqqqqqqqqqqqqq
q
q
q
q
q
qqqqqqqqqqqqqqqqqqqqqqqqq
q
q
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqq
qqqqqqqqqqqqqq
qqqqqqqqqqqqqqq
qqqqqqqqqqqqqq
qqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq
All cause (1999) All cause (2008) OC (1999)
PCa (1999) Wykes (2011)
10
20
30
10
20
30
0 3 4 5 6 7 8 9101112 0 3 4 5 6 7 8 9101112
CIRS−G_pros
lifeexpectancy(years)
50
60
70
80
Age
Figure: Estimates from East Region (Wykes estimates based on all cause unstratified
subcohort at 2008) 19 / 35
Background
Methods
Results
Summary
Relative Risk Estimates
Absolute Risk Estimates
Web Life Expectancy Tool
5
10
15
20
25
0 3 4 5 6 7 8 9 10 11 12
CIRS−G_pros
lifeexpectancy(years)
Age
55
65
75
Model
All cause (2008)
Wykes (2011)
Figure: Estimates from East Region (Wykes estimates based on all cause unstratified
subcohort at 2008) 20 / 35
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95
age (years)
survivalprobability
CCO North West region
(a) All cause (1999)
0.4
0.5
0.6
0.7
0.8
0.9
1.0
0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95
age (years)
survivalprobability
CCO North West region
(b) Other cause(1999)
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90
age (years)
survivalprobability
CCO North West region
(c) Prostate cancer(1999)
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100
age (years)
survivalprobability
CCO North West region
(d) All cause (2008)
Background
Methods
Results
Summary
Relative Risk Estimates
Absolute Risk Estimates
Web Life Expectancy Tool
Tool for Use in a Clinical Setting
Built using shiny package for R
22 / 35
Background
Methods
Results
Summary
Relative Risk Estimates
Absolute Risk Estimates
Web Life Expectancy Tool
Tool for Use in a Clinical Setting
Built using shiny package for R
Requires R and an internet connection
22 / 35
Background
Methods
Results
Summary
Relative Risk Estimates
Absolute Risk Estimates
Web Life Expectancy Tool
Tool for Use in a Clinical Setting
Built using shiny package for R
Requires R and an internet connection
Three simple lines of code
22 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Contributions
1 Life expectancy estimates based on age and comorbidity score
from a population based case cohort study
23 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Contributions
1 Life expectancy estimates based on age and comorbidity score
from a population based case cohort study
2 Improves previous work done with this dataset by using
appropriate statistical methodology for case cohort design
23 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Contributions
1 Life expectancy estimates based on age and comorbidity score
from a population based case cohort study
2 Improves previous work done with this dataset by using
appropriate statistical methodology for case cohort design
3 Easy-to-use interactive web tool for clinical use
23 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Future Work
Internal validation → risk prediction accuracy, PH
assumption, compare expected values from 1999 model to
observed 2008 subcohort
24 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Future Work
Internal validation → risk prediction accuracy, PH
assumption, compare expected values from 1999 model to
observed 2008 subcohort
External validation → generalizability
24 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Future Work
Internal validation → risk prediction accuracy, PH
assumption, compare expected values from 1999 model to
observed 2008 subcohort
External validation → generalizability
Convert Langholz & Jiao (2007) SAS macro to R for absolute
risk estimates
24 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Alternative Analysis
Treat the case cohort design as a cohort study with missing
data (NestedCohort and Survey packages for R) (Mark &
Katki (2008), Breslow & Lumley (2009))
25 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Alternative Analysis
Treat the case cohort design as a cohort study with missing
data (NestedCohort and Survey packages for R) (Mark &
Katki (2008), Breslow & Lumley (2009))
Competing Risk analysis using the subdistribution hazard of
Fine & Gray
25 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
References I
M. Center et al.
International Variation in Prostate Cancer Incidence and
Mortality Rates
European Association of Urology, 61:2012
Bryan Langholz and Jenny Jiao
Computational methods for case cohort studies
Computational Statistics & Data Analysis, 51:2007
Therneau, T and Li, H
Computing the Cox model for case cohort designs
Lifetime data analysis, 2(5):1999
26 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
References II
PRENTICE, R. L.
A case-cohort design for epidemiologic cohort studies and
disease prevention trials
Biometrika, 73(1):1986
RStudio and Inc.
shiny: Web Application Framework for R
R package version 0.6.0, 17(25):2012
T. Lumley
survey: analysis of complex survey samples
R package version 3.28-2,2012
27 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
References III
Katki, H.A. and Mark, S.D.
Survival Analysis for Cohorts with Missing Covariate
Information
The R Journal,2008
28 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Influential Analysis
Dfbeta residuals
Are the approximate changes in the parameter estimates (ˆβ − ˆβ(j))
when the jth observation is omitted. These variables are a
weighted transform of the score residual variables and are useful in
assessing local influence and in computing approximate and robust
variance estimates
Standardized dfbeta residuals:
(ˆβ − ˆβ(j))
sd(ˆβ − ˆβ(j))
29 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Why is the PsL not a Partial Likelihood
Not a PL → Asymptotic distribution theory for the PL
estimators breaks down because cases outside the subcohort
induce non-nesting of the σ-fields, or sets on which the
counting process probability measure is defined
cannot use Likelihood ratio tests since PsL are not real
likelihoods
30 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Consistent Estimators
An estimator ˆθ will perform better and better as we obtain more
samples. If at the limit n → ∞ the estimator tends to be always
right, it is said to be consistent. This notion is equivalent to
convergence in probability
Convergence in Probability
Let X1, . . . , Xn be a sequence of iid RVs drawn from a distribution
with parameter θ and ˆθ an estimator for θ. ˆθ is consistent as an
estimator of θ if ˆθ
p
→ θ
lim
n→∞
P(|ˆθ − θ| > ) = 0, ∀ > 0
31 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Competing Risks
when disease of interest has a delayed occurrence, other
events may preclude observation of disease of interest giving
rise to a competing risk situation
to analyze the event of interest taking into account the
competing risks, one has to model the hazard of
subdistribution. In the presence of competing risks, the
probability to observe the event of interest spans the interval
[0, p], p < 1. Because p does not reach 1, this function is not
a proper distribution function and it is called a subdistribution
Modify the partial likelihood in the Cox model to model
hazard of subdistribution, such that the individuals
experiencing competing risks are always in the risk set with
their contribution regulated by a weight
32 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Competing Risks
Fine and Gray Partial Likelihood:
PL(β) =
n
j=1




exp {βxj }
r∈˜Rj
wrj exp {βxr }




δj
(1)
wrj =
ˆG(tj )
ˆG(min(tj , tr ))
(2)
Cj =



1 if the event of interest was observed at time tj
2 if the competing risk event was observed at time tj
0 if no event was observed at time tj
δj =
1 Cj = 1
0 Cj = 0 or Cj = 2
33 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Competing Risks
ˆG(tj ): estimated probability of being censored at time tj by
either the event of interest or a competing risks event
˜Rj = {r; tr ≥ tj or Cr = 2}
individuals experiencing competing risk event are always
considered in the risk set at all time points
The weight controls how much a specific observation
participates in the sum in the denominator
If the observation at time tr is censored (Cr = 0) or has the
event of interest (Cr = 1), the weight is either 1 or 0
depending whether the observation is in the risk set (tr ≥ tj )
or it is not in the risk set (tr < tj )
If the observation has a competing risk event, then it is always
in the risk set and the weight is 1 if (tr ≥ tj ) or a quantity less
than 1 if tr < tj
This quantity decreases as the distance between tr and tj
34 / 35
Background
Methods
Results
Summary
Contributions
Future Work
References
Case-cohort in the presence of competing risks
PsL(β) =
n
j=1




exp {βxj }
r∈˜Rj
Irj wrj exp {βxr }




δj
δj =
1 Cj = 1
0 Cj = 0 or Cj = 2
Irj =
1 r is in subcohort or r is a case outside the subcohort and tr
0 r is a case outside the subcohort and tr = tj
wrj =
ˆG(tj )
ˆG(min(tj ,tr ))
, where ˆG(tj ) is the estimation of the probability
of censoring regardless of the fact that in a case-cohort study the
cases are oversampled
35 / 35
1 of 74

Recommended

Computational methods for case-cohort studies by
Computational methods for case-cohort studiesComputational methods for case-cohort studies
Computational methods for case-cohort studiessahirbhatnagar
1.6K views78 slides
Malimu cohort studies by
Malimu cohort studiesMalimu cohort studies
Malimu cohort studiesMiharbi Ignasm
631 views29 slides
Cohort ppt by
Cohort pptCohort ppt
Cohort pptPokhara University, Pokhara, Nepal
27.7K views47 slides
Cohort, case control & survival studies-2014 by
Cohort, case control & survival studies-2014Cohort, case control & survival studies-2014
Cohort, case control & survival studies-2014Ramnath Takiar
1.9K views38 slides
06 cohort studies by
06 cohort studies06 cohort studies
06 cohort studiesAbdiwali Abdullahi Abdiwali
217 views79 slides
SAMS EBM Online Course: Observational Study Designs by
SAMS EBM Online Course: Observational Study DesignsSAMS EBM Online Course: Observational Study Designs
SAMS EBM Online Course: Observational Study DesignsAhmad Al-Moujahed
3.4K views89 slides

More Related Content

What's hot

EPIDEMIOLOGICAL RESEARCH DESIGN by
EPIDEMIOLOGICAL RESEARCH DESIGNEPIDEMIOLOGICAL RESEARCH DESIGN
EPIDEMIOLOGICAL RESEARCH DESIGNMAHESWARI JAIKUMAR
4.9K views12 slides
Cohort dr.wah by
Cohort dr.wahCohort dr.wah
Cohort dr.wahMmedsc Hahm
328 views31 slides
COHORT STUDY by
COHORT STUDY COHORT STUDY
COHORT STUDY Rafiul Basher Rabby
1.2K views23 slides
Cohort study by
Cohort studyCohort study
Cohort studyDr. Ankit Gaur
834 views10 slides
5. cohort studies by
5. cohort  studies5. cohort  studies
5. cohort studiesNaveen Phuyal
13.3K views54 slides
case control study by
case control study case control study
case control study Mahmood Wazeer
586 views24 slides

What's hot(20)

Steps in cohort study by Sachin Patne
Steps in cohort studySteps in cohort study
Steps in cohort study
Sachin Patne9.9K views
Nested case control study by Prayas Gautam
Nested case control studyNested case control study
Nested case control study
Prayas Gautam8.7K views
Cohort studies by DrKHReddy
Cohort studiesCohort studies
Cohort studies
DrKHReddy206 views
7. experimental epidemiology by Naveen Phuyal
7. experimental epidemiology7. experimental epidemiology
7. experimental epidemiology
Naveen Phuyal16.9K views
Research Methodology - Case control study by Rizwan S A
Research Methodology - Case control studyResearch Methodology - Case control study
Research Methodology - Case control study
Rizwan S A6.1K views
Case control study by Abhijit Das
Case control studyCase control study
Case control study
Abhijit Das396 views
Types of clinical studies by Samir Haffar
Types of clinical studiesTypes of clinical studies
Types of clinical studies
Samir Haffar12K views

Viewers also liked

Cohort Study by
Cohort StudyCohort Study
Cohort StudyAchyut Raj Pandey
147.5K views75 slides
Peter Elwood - The success of the caerphilly cohort study by
Peter Elwood - The success of the caerphilly cohort studyPeter Elwood - The success of the caerphilly cohort study
Peter Elwood - The success of the caerphilly cohort studyangewatkins
848 views44 slides
e-Health & e-epidemiology: the E4N cohort study by
e-Health & e-epidemiology: the E4N cohort studye-Health & e-epidemiology: the E4N cohort study
e-Health & e-epidemiology: the E4N cohort studyGuy Fagherazzi
375 views21 slides
Study Designs_YL by
Study Designs_YLStudy Designs_YL
Study Designs_YLYvonne Lee
2.6K views26 slides
cohort study by
cohort studycohort study
cohort studybunu lama
4.7K views12 slides
Cohort study and case control study by
Cohort study and case control studyCohort study and case control study
Cohort study and case control studyNiv09
5.5K views28 slides

Viewers also liked(12)

Peter Elwood - The success of the caerphilly cohort study by angewatkins
Peter Elwood - The success of the caerphilly cohort studyPeter Elwood - The success of the caerphilly cohort study
Peter Elwood - The success of the caerphilly cohort study
angewatkins848 views
e-Health & e-epidemiology: the E4N cohort study by Guy Fagherazzi
e-Health & e-epidemiology: the E4N cohort studye-Health & e-epidemiology: the E4N cohort study
e-Health & e-epidemiology: the E4N cohort study
Guy Fagherazzi375 views
Study Designs_YL by Yvonne Lee
Study Designs_YLStudy Designs_YL
Study Designs_YL
Yvonne Lee2.6K views
cohort study by bunu lama
cohort studycohort study
cohort study
bunu lama4.7K views
Cohort study and case control study by Niv09
Cohort study and case control studyCohort study and case control study
Cohort study and case control study
Niv095.5K views
Bab vi studi desain epidemiologi by NajMah Usman
Bab vi studi desain epidemiologiBab vi studi desain epidemiologi
Bab vi studi desain epidemiologi
NajMah Usman5.1K views
Cohort study - basics by Rizwan S A
Cohort study - basicsCohort study - basics
Cohort study - basics
Rizwan S A12.8K views
Epidemiological studies by Bruno Mmassy
Epidemiological studiesEpidemiological studies
Epidemiological studies
Bruno Mmassy35.4K views
Case control & cohort study by Bhumika Bhatt
Case control & cohort studyCase control & cohort study
Case control & cohort study
Bhumika Bhatt67.8K views
Cohort and case-controls studies by apfortis
Cohort and case-controls studiesCohort and case-controls studies
Cohort and case-controls studies
apfortis20.1K views

Similar to Absolute risk estimation in a case cohort study of prostate cancer

ASEE-GSW_2015_submission_75 by
ASEE-GSW_2015_submission_75ASEE-GSW_2015_submission_75
ASEE-GSW_2015_submission_75Sam Yang
123 views7 slides
Health economic modelling in the diagnostics development process by
Health economic modelling in the diagnostics development processHealth economic modelling in the diagnostics development process
Health economic modelling in the diagnostics development processcheweb1
1.5K views45 slides
NY Prostate Cancer Conference - C. Bangma - Session 3: Predicting indolent an... by
NY Prostate Cancer Conference - C. Bangma - Session 3: Predicting indolent an...NY Prostate Cancer Conference - C. Bangma - Session 3: Predicting indolent an...
NY Prostate Cancer Conference - C. Bangma - Session 3: Predicting indolent an...European School of Oncology
2.7K views20 slides
Ca prostata by
Ca prostataCa prostata
Ca prostataHugo Robles Gomez
67 views9 slides
Harnessing Data to Improve Health Equity - Dr. Ali Mokdad by
Harnessing Data to Improve Health Equity - Dr. Ali MokdadHarnessing Data to Improve Health Equity - Dr. Ali Mokdad
Harnessing Data to Improve Health Equity - Dr. Ali MokdadLauren Johnson
1.1K views70 slides
Prof Richard Peto on Breast Cancer Screening by
Prof Richard Peto on Breast Cancer ScreeningProf Richard Peto on Breast Cancer Screening
Prof Richard Peto on Breast Cancer Screeningecancer - the leading oncology channel
1.2K views32 slides

Similar to Absolute risk estimation in a case cohort study of prostate cancer(20)

ASEE-GSW_2015_submission_75 by Sam Yang
ASEE-GSW_2015_submission_75ASEE-GSW_2015_submission_75
ASEE-GSW_2015_submission_75
Sam Yang123 views
Health economic modelling in the diagnostics development process by cheweb1
Health economic modelling in the diagnostics development processHealth economic modelling in the diagnostics development process
Health economic modelling in the diagnostics development process
cheweb11.5K views
NY Prostate Cancer Conference - C. Bangma - Session 3: Predicting indolent an... by European School of Oncology
NY Prostate Cancer Conference - C. Bangma - Session 3: Predicting indolent an...NY Prostate Cancer Conference - C. Bangma - Session 3: Predicting indolent an...
NY Prostate Cancer Conference - C. Bangma - Session 3: Predicting indolent an...
Harnessing Data to Improve Health Equity - Dr. Ali Mokdad by Lauren Johnson
Harnessing Data to Improve Health Equity - Dr. Ali MokdadHarnessing Data to Improve Health Equity - Dr. Ali Mokdad
Harnessing Data to Improve Health Equity - Dr. Ali Mokdad
Lauren Johnson1.1K views
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE by IJCSEIT Journal
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASEUSING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
USING DATA MINING TECHNIQUES FOR DIAGNOSIS AND PROGNOSIS OF CANCER DISEASE
IJCSEIT Journal372 views
Epidemiology and trends of asbestos-related diseases at Helsinki Asbestos 2014 by Työterveyslaitos
Epidemiology and trends of asbestos-related diseases at Helsinki Asbestos 2014Epidemiology and trends of asbestos-related diseases at Helsinki Asbestos 2014
Epidemiology and trends of asbestos-related diseases at Helsinki Asbestos 2014
Työterveyslaitos2.3K views
Mastectomy Vs BCT by AnneSaputra
Mastectomy Vs BCTMastectomy Vs BCT
Mastectomy Vs BCT
AnneSaputra136 views
Mastectomi Vs BCT by AnneSaputra
Mastectomi Vs BCTMastectomi Vs BCT
Mastectomi Vs BCT
AnneSaputra168 views
Dr. Frank Sullivan - Early diagnosis of lung cancer by pincomm
Dr. Frank Sullivan - Early diagnosis of lung cancerDr. Frank Sullivan - Early diagnosis of lung cancer
Dr. Frank Sullivan - Early diagnosis of lung cancer
pincomm820 views
Federal Research & Development for the Florida system Sept 2014 by Warren Kibbe
Federal Research & Development for the Florida system Sept 2014 Federal Research & Development for the Florida system Sept 2014
Federal Research & Development for the Florida system Sept 2014
Warren Kibbe647 views
Data Science Meets Healthcare: The Advent of Personalized Medicine - Jacomo C... by CityAge
Data Science Meets Healthcare: The Advent of Personalized Medicine - Jacomo C...Data Science Meets Healthcare: The Advent of Personalized Medicine - Jacomo C...
Data Science Meets Healthcare: The Advent of Personalized Medicine - Jacomo C...
CityAge2.1K views
Endoscopy in Gastrointestinal Oncology - Slide 15 - D. Fisher - Colorectal ca... by European School of Oncology
Endoscopy in Gastrointestinal Oncology - Slide 15 - D. Fisher - Colorectal ca...Endoscopy in Gastrointestinal Oncology - Slide 15 - D. Fisher - Colorectal ca...
Endoscopy in Gastrointestinal Oncology - Slide 15 - D. Fisher - Colorectal ca...
Oncovue jorge power point presentation.ppt(2) by mariuslehaci
Oncovue jorge power point presentation.ppt(2)Oncovue jorge power point presentation.ppt(2)
Oncovue jorge power point presentation.ppt(2)
mariuslehaci1.2K views
Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ... by Cirdan
Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...
Morphologomics - Challenges for Surgical Pathology in the Genomic Age by Dr. ...
Cirdan145 views
Cancer treatment.edited by Okoth Morgan
Cancer treatment.editedCancer treatment.edited
Cancer treatment.edited
Okoth Morgan22 views
2016-Crawford-BMC Pulm Med published by Ji-Youn Yeo
2016-Crawford-BMC Pulm Med published2016-Crawford-BMC Pulm Med published
2016-Crawford-BMC Pulm Med published
Ji-Youn Yeo235 views
16 by vanney9
1616
16
vanney9314 views

More from sahirbhatnagar

Strong Heredity Models in High Dimensional Data by
Strong Heredity Models in High Dimensional DataStrong Heredity Models in High Dimensional Data
Strong Heredity Models in High Dimensional Datasahirbhatnagar
173 views53 slides
Methods for High Dimensional Interactions by
Methods for High Dimensional InteractionsMethods for High Dimensional Interactions
Methods for High Dimensional Interactionssahirbhatnagar
682 views94 slides
An introduction to knitr and R Markdown by
An introduction to knitr and R MarkdownAn introduction to knitr and R Markdown
An introduction to knitr and R Markdownsahirbhatnagar
2.7K views40 slides
Atelier r-gerad by
Atelier r-geradAtelier r-gerad
Atelier r-geradsahirbhatnagar
1.9K views153 slides
Reproducible Research: An Introduction to knitr by
Reproducible Research: An Introduction to knitrReproducible Research: An Introduction to knitr
Reproducible Research: An Introduction to knitrsahirbhatnagar
839 views43 slides
Analysis of DNA methylation and Gene expression to predict childhood obesity by
Analysis of DNA methylation and Gene expression to predict childhood obesityAnalysis of DNA methylation and Gene expression to predict childhood obesity
Analysis of DNA methylation and Gene expression to predict childhood obesitysahirbhatnagar
926 views33 slides

More from sahirbhatnagar(11)

Strong Heredity Models in High Dimensional Data by sahirbhatnagar
Strong Heredity Models in High Dimensional DataStrong Heredity Models in High Dimensional Data
Strong Heredity Models in High Dimensional Data
sahirbhatnagar173 views
Methods for High Dimensional Interactions by sahirbhatnagar
Methods for High Dimensional InteractionsMethods for High Dimensional Interactions
Methods for High Dimensional Interactions
sahirbhatnagar682 views
An introduction to knitr and R Markdown by sahirbhatnagar
An introduction to knitr and R MarkdownAn introduction to knitr and R Markdown
An introduction to knitr and R Markdown
sahirbhatnagar2.7K views
Reproducible Research: An Introduction to knitr by sahirbhatnagar
Reproducible Research: An Introduction to knitrReproducible Research: An Introduction to knitr
Reproducible Research: An Introduction to knitr
sahirbhatnagar839 views
Analysis of DNA methylation and Gene expression to predict childhood obesity by sahirbhatnagar
Analysis of DNA methylation and Gene expression to predict childhood obesityAnalysis of DNA methylation and Gene expression to predict childhood obesity
Analysis of DNA methylation and Gene expression to predict childhood obesity
sahirbhatnagar926 views
Estimation and Accuracy after Model Selection by sahirbhatnagar
Estimation and Accuracy after Model SelectionEstimation and Accuracy after Model Selection
Estimation and Accuracy after Model Selection
sahirbhatnagar1.1K views
Factors influencing participation in cancer screening by sahirbhatnagar
Factors influencing participation in cancer screeningFactors influencing participation in cancer screening
Factors influencing participation in cancer screening
sahirbhatnagar485 views
Methylation and Expression data integration by sahirbhatnagar
Methylation and Expression data integrationMethylation and Expression data integration
Methylation and Expression data integration
sahirbhatnagar643 views

Recently uploaded

Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe... by
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...Anmol Vishnu Gupta
26 views12 slides
scopus cited journals.pdf by
scopus cited journals.pdfscopus cited journals.pdf
scopus cited journals.pdfKSAravindSrivastava
10 views15 slides
RemeOs science and clinical evidence by
RemeOs science and clinical evidenceRemeOs science and clinical evidence
RemeOs science and clinical evidencePetrusViitanen1
47 views96 slides
POSTER IV LAWCN_ROVER_IUE.pdf by
POSTER IV LAWCN_ROVER_IUE.pdfPOSTER IV LAWCN_ROVER_IUE.pdf
POSTER IV LAWCN_ROVER_IUE.pdfSOCIEDAD JULIO GARAVITO
11 views1 slide
Conventional and non-conventional methods for improvement of cucurbits.pptx by
Conventional and non-conventional methods for improvement of cucurbits.pptxConventional and non-conventional methods for improvement of cucurbits.pptx
Conventional and non-conventional methods for improvement of cucurbits.pptxgandhi976
20 views35 slides
vitamine B1.pptx by
vitamine B1.pptxvitamine B1.pptx
vitamine B1.pptxajithkilpart
27 views22 slides

Recently uploaded(20)

Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe... by Anmol Vishnu Gupta
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
RemeOs science and clinical evidence by PetrusViitanen1
RemeOs science and clinical evidenceRemeOs science and clinical evidence
RemeOs science and clinical evidence
PetrusViitanen147 views
Conventional and non-conventional methods for improvement of cucurbits.pptx by gandhi976
Conventional and non-conventional methods for improvement of cucurbits.pptxConventional and non-conventional methods for improvement of cucurbits.pptx
Conventional and non-conventional methods for improvement of cucurbits.pptx
gandhi97620 views
Pollination By Nagapradheesh.M.pptx by MNAGAPRADHEESH
Pollination By Nagapradheesh.M.pptxPollination By Nagapradheesh.M.pptx
Pollination By Nagapradheesh.M.pptx
MNAGAPRADHEESH19 views
Light Pollution for LVIS students by CWBarthlmew
Light Pollution for LVIS studentsLight Pollution for LVIS students
Light Pollution for LVIS students
CWBarthlmew9 views
Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor... by Trustlife
Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor...Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor...
Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor...
Trustlife100 views
Experimental animal Guinea pigs.pptx by Mansee Arya
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptx
Mansee Arya35 views
Exploring the nature and synchronicity of early cluster formation in the Larg... by Sérgio Sacani
Exploring the nature and synchronicity of early cluster formation in the Larg...Exploring the nature and synchronicity of early cluster formation in the Larg...
Exploring the nature and synchronicity of early cluster formation in the Larg...
Sérgio Sacani910 views
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... by InsideScientific
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
InsideScientific78 views
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... by ILRI
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
ILRI5 views
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... by ILRI
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
ILRI7 views
A giant thin stellar stream in the Coma Galaxy Cluster by Sérgio Sacani
A giant thin stellar stream in the Coma Galaxy ClusterA giant thin stellar stream in the Coma Galaxy Cluster
A giant thin stellar stream in the Coma Galaxy Cluster
Sérgio Sacani17 views
Applications of Large Language Models in Materials Discovery and Design by Anubhav Jain
Applications of Large Language Models in Materials Discovery and DesignApplications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and Design
Anubhav Jain13 views

Absolute risk estimation in a case cohort study of prostate cancer

  • 1. Background Methods Results Summary Absolute Risk Estimation in a Case Cohort Study of Prostate Cancer Sahir Rai Bhatnagar Queen’s University sahir.bhatnagar@queensu.ca August 30, 2013 1 / 35
  • 2. Background Methods Results Summary Rationale Objective Previous Work Incidence vs. Mortality of Prostate Cancer in Canada prostate 25 50 75 100 125 1990 2000 2010 year age−standardizedrate incidence mortality 2 / 35
  • 3. Background Methods Results Summary Rationale Objective Previous Work Incidence vs. Mortality of Prostate Cancer in Canada prostate 25 50 75 100 125 1990 2000 2010 year age−standardizedrate incidence mortality Possible reasons for gap: Better treatment outcomes 2 / 35
  • 4. Background Methods Results Summary Rationale Objective Previous Work Incidence vs. Mortality of Prostate Cancer in Canada prostate 25 50 75 100 125 1990 2000 2010 year age−standardizedrate incidence mortality Possible reasons for gap: Better treatment outcomes Death from other causes (comorbidities) 2 / 35
  • 5. Background Methods Results Summary Rationale Objective Previous Work Incidence vs. Mortality of Prostate Cancer in Canada prostate 25 50 75 100 125 1990 2000 2010 year age−standardizedrate incidence mortality Possible reasons for gap: Better treatment outcomes Death from other causes (comorbidities) Overdiagnosis 2 / 35
  • 7. Background Methods Results Summary Rationale Objective Previous Work Why is this Gap a Problem? Overdiagnosis 1 Screen detected cancer that would have otherwise not been diagnosed before death from other causes 3 / 35
  • 8. Background Methods Results Summary Rationale Objective Previous Work Why is this Gap a Problem? Overdiagnosis 1 Screen detected cancer that would have otherwise not been diagnosed before death from other causes 2 Can lead to overtreatment 3 / 35
  • 9. Background Methods Results Summary Rationale Objective Previous Work Why is this Gap a Problem? Overdiagnosis 1 Screen detected cancer that would have otherwise not been diagnosed before death from other causes 2 Can lead to overtreatment Overtreatment Treatment for a cancer that is clinically not significant → low tumour grade 3 / 35
  • 10. Background Methods Results Summary Rationale Objective Previous Work Why is this Gap a Problem? Overdiagnosis 1 Screen detected cancer that would have otherwise not been diagnosed before death from other causes 2 Can lead to overtreatment Overtreatment Treatment for a cancer that is clinically not significant → low tumour grade Morbidity associated with being diagnosed with cancer 3 / 35
  • 11. Background Methods Results Summary Rationale Objective Previous Work Why is this Gap a Problem? Overdiagnosis 1 Screen detected cancer that would have otherwise not been diagnosed before death from other causes 2 Can lead to overtreatment Overtreatment Treatment for a cancer that is clinically not significant → low tumour grade Morbidity associated with being diagnosed with cancer Quality of life effects, incontinence, erectile dysfunction 3 / 35
  • 12. Background Methods Results Summary Rationale Objective Previous Work Why is this Gap a Problem? Overdiagnosis 1 Screen detected cancer that would have otherwise not been diagnosed before death from other causes 2 Can lead to overtreatment Overtreatment Treatment for a cancer that is clinically not significant → low tumour grade Morbidity associated with being diagnosed with cancer Quality of life effects, incontinence, erectile dysfunction Lower life expectancy 3 / 35
  • 14. Background Methods Results Summary Rationale Objective Previous Work Solution to Overdiagnosis and Overtreatment Individualised Decision Making 4 / 35
  • 15. Background Methods Results Summary Rationale Objective Previous Work Solution to Overdiagnosis and Overtreatment Individualised Decision Making Active surveillance for patients with favorable risk disease (Gleason score ≤ 6) 4 / 35
  • 16. Background Methods Results Summary Rationale Objective Previous Work Solution to Overdiagnosis and Overtreatment Individualised Decision Making Active surveillance for patients with favorable risk disease (Gleason score ≤ 6) Clinical tools that calculate life expectancy based on predisposing factors such as age and comorbidities 4 / 35
  • 17. Background Methods Results Summary Rationale Objective Previous Work Objectives Research Statement 1 Using a semiparametric model based on both age and comorbidity, obtain absolute risk estimates of death from prostate cancer in a population based case cohort study 5 / 35
  • 18. Background Methods Results Summary Rationale Objective Previous Work Objectives Research Statement 1 Using a semiparametric model based on both age and comorbidity, obtain absolute risk estimates of death from prostate cancer in a population based case cohort study 2 Create a web tool for life expectancy based on this model, for use in a clinical setting 5 / 35
  • 19. Background Methods Results Summary Rationale Objective Previous Work Wykes (2011) Wykes (2011) proposed an ad hoc approach to estimating the relative risk and baseline hazard function for the case cohort design: 6 / 35
  • 20. Background Methods Results Summary Rationale Objective Previous Work Wykes (2011) Wykes (2011) proposed an ad hoc approach to estimating the relative risk and baseline hazard function for the case cohort design: Modified PH Model S(t) = (S0(t))α exp {αβ1 (age − µage) + β2 CIRS-Gpros − µCIRS-Gpros S0(t), β1, µage are derived from the full cohort β2, µCIRS-Gpros are from case cohort α = β1 from case cohort β1 from full cohort 6 / 35
  • 21. Case Cohort Study Design 40 45 50 55 60 65 70 (a) Cohort, no exposure info 40 45 50 55 60 65 70 (b) Case cohort sampling 40 45 50 55 60 65 70 (c) Add back failures CIRS=10 CIRS=2 CIRS=7 CIRS=6 CIRS=14 CIRS=13 CIRS=1 CIRS=9 CIRS=9 CIRS=4 CIRS=4 CIRS=10 CIRS=1 CIRS=3 CIRS=6 CIRS=7 40 45 50 55 60 65 70 (d) Get exposure information Figure: Each line represents how long a subject is at risk. A failure is represented by a red dot. Subjects without a red dot are censored individuals. The axis represents age
  • 22. Background Methods Results Summary Case Cohort Study Design Population Model Software Study Population 8 / 35
  • 23. Background Methods Results Summary Case Cohort Study Design Population Model Software Study Population Population-based case cohort study of 2,740 patients diagnosed with prostate cancer between 1990 and 1998 in Ontario 8 / 35
  • 24. Background Methods Results Summary Case Cohort Study Design Population Model Software Study Population Population-based case cohort study of 2,740 patients diagnosed with prostate cancer between 1990 and 1998 in Ontario Stratified by CCOR random sample 8 / 35
  • 25. Background Methods Results Summary Case Cohort Study Design Population Model Software Study Population Population-based case cohort study of 2,740 patients diagnosed with prostate cancer between 1990 and 1998 in Ontario Stratified by CCOR random sample Cases were sampled independently of subcohort 8 / 35
  • 26. Background Methods Results Summary Case Cohort Study Design Population Model Software Study Population Population-based case cohort study of 2,740 patients diagnosed with prostate cancer between 1990 and 1998 in Ontario Stratified by CCOR random sample Cases were sampled independently of subcohort Death from prostate cancer and death from other causes assessed → death clearance date December 31, 1999 8 / 35
  • 27. Background Methods Results Summary Case Cohort Study Design Population Model Software Study Population Population-based case cohort study of 2,740 patients diagnosed with prostate cancer between 1990 and 1998 in Ontario Stratified by CCOR random sample Cases were sampled independently of subcohort Death from prostate cancer and death from other causes assessed → death clearance date December 31, 1999 Death from any cause updated for subcohort only → death clearance date December 31, 2008 8 / 35
  • 28. Background Methods Results Summary Case Cohort Study Design Population Model Software Improvements Stratified by CCOR Cox PH Sj (t) = S0j (t)exp(β1·Age+β2·CIRS-Gpros ) , j = 1, . . . , 8 9 / 35
  • 29. Background Methods Results Summary Case Cohort Study Design Population Model Software Improvements Stratified by CCOR Cox PH Sj (t) = S0j (t)exp(β1·Age+β2·CIRS-Gpros ) , j = 1, . . . , 8 We improve on Wykes (2011) work in two ways: 9 / 35
  • 30. Background Methods Results Summary Case Cohort Study Design Population Model Software Improvements Stratified by CCOR Cox PH Sj (t) = S0j (t)exp(β1·Age+β2·CIRS-Gpros ) , j = 1, . . . , 8 We improve on Wykes (2011) work in two ways: we estimate both the baseline survival function and relative risk parameters using the case cohort population 9 / 35
  • 31. Background Methods Results Summary Case Cohort Study Design Population Model Software Improvements Stratified by CCOR Cox PH Sj (t) = S0j (t)exp(β1·Age+β2·CIRS-Gpros ) , j = 1, . . . , 8 We improve on Wykes (2011) work in two ways: we estimate both the baseline survival function and relative risk parameters using the case cohort population we use a stratified by CCOR analysis, which assumes a different baseline hazard for each region 9 / 35
  • 32. Background Methods Results Summary Case Cohort Study Design Population Model Software Relative Risk Estimation Prentice (1986) proposed a pseudolikelihood given by ˜L(β) = D j=1 exp zT (j)β k∈˜Rtj exp zT k β wk 10 / 35
  • 33. Background Methods Results Summary Case Cohort Study Design Population Model Software Relative Risk Estimation Prentice (1986) proposed a pseudolikelihood given by ˜L(β) = D j=1 exp zT (j)β k∈˜Rtj exp zT k β wk t1, . . . , tD: D distinct failure times 10 / 35
  • 34. Background Methods Results Summary Case Cohort Study Design Population Model Software Relative Risk Estimation Prentice (1986) proposed a pseudolikelihood given by ˜L(β) = D j=1 exp zT (j)β k∈˜Rtj exp zT k β wk t1, . . . , tD: D distinct failure times ˜Rtj : the risk set at failure time tj 10 / 35
  • 35. Background Methods Results Summary Case Cohort Study Design Population Model Software Relative Risk Estimation Prentice (1986) proposed a pseudolikelihood given by ˜L(β) = D j=1 exp zT (j)β k∈˜Rtj exp zT k β wk t1, . . . , tD: D distinct failure times ˜Rtj : the risk set at failure time tj wk: the weight applied to the risk set 10 / 35
  • 36. Background Methods Results Summary Case Cohort Study Design Population Model Software Weighting Schemes exp zT i β Yi (tj )wi (tj ) exp zT i β + k∈SC,k=i Yk(tj )wk(tj ) exp zT k β Table: Comparing different values of wk Outcome type and timing Prentice Self & Prentice Barlow (1986) (1988) (1994) Case outside SC before failure 0 0 0 Case outside SC at failure 1 0 1 Case in SC before failure 1 1 1/α Case in SC at failure 1 1 1 SC control 1 1 1/α 11 / 35
  • 37. Background Methods Results Summary Case Cohort Study Design Population Model Software Relative Risk Estimation Therneau & Li (1999) showed that Var(˜β) can be computed by adjusting the outputted variance from standard Cox model programs: ˜I−1 + (1 − α)DT SC DSC 12 / 35
  • 38. Background Methods Results Summary Case Cohort Study Design Population Model Software Relative Risk Estimation Therneau & Li (1999) showed that Var(˜β) can be computed by adjusting the outputted variance from standard Cox model programs: ˜I−1 + (1 − α)DT SC DSC ˜I−1: variance returned by the Cox model program 12 / 35
  • 39. Background Methods Results Summary Case Cohort Study Design Population Model Software Relative Risk Estimation Therneau & Li (1999) showed that Var(˜β) can be computed by adjusting the outputted variance from standard Cox model programs: ˜I−1 + (1 − α)DT SC DSC ˜I−1: variance returned by the Cox model program DSC : subset of matrix of dfbeta residuals that contains subcohort members only 12 / 35
  • 40. Background Methods Results Summary Case Cohort Study Design Population Model Software Absolute Risk Estimation A weighted version of the Breslow estimator for the full cohort with covariate history z0(u) is given by d ˆΛ(u, z0(u)) = d ˆΛ0(u) exp zT 0 (u) ˆβ = n i=1 dNi (u) ( n m ) j∈˜Ri (u) exp zT j (u) ˆβ exp zT 0 (u) ˆβ = n i=1 dNi (u) ( n m ) j∈˜Ri (u) exp (zj(u) − z0(u))T ˆβ = n i=1 m n dNi (u) j∈˜Ri (u) exp (zj(u) − z0(u))T ˆβ 13 / 35
  • 41. Background Methods Results Summary Case Cohort Study Design Population Model Software Absolute Risk Estimation The sum of these increments can be used to calculate the risk between two time points, say s and t, for s < t: ˆΛ(s, t, z0) = ˆΛ(0, t, z0) − ˆΛ(0, s, z0) = t s d ˆΛ(u, z0(u)) 14 / 35
  • 42. Background Methods Results Summary Case Cohort Study Design Population Model Software Absolute Risk Estimation The sum of these increments can be used to calculate the risk between two time points, say s and t, for s < t: ˆΛ(s, t, z0) = ˆΛ(0, t, z0) − ˆΛ(0, s, z0) = t s d ˆΛ(u, z0(u)) For this project, we will be only considering the case when s = 0. The corresponding survival probability is ˆS(0, t, z0) = exp − ˆΛ(0, t, z0) 14 / 35
  • 43. Background Methods Results Summary Case Cohort Study Design Population Model Software Software PROC PHREG in SAS/STAT software and coxph in R 15 / 35
  • 44. Background Methods Results Summary Case Cohort Study Design Population Model Software Software PROC PHREG in SAS/STAT software and coxph in R SAS macro developed by Langholz & Jiao (2007) 15 / 35
  • 45. Background Methods Results Summary Case Cohort Study Design Population Model Software Computational Methods for the Case Cohort Design Standard Cox PH functions in SAS and R cannot directly compute the relative and absolute risk estimates 16 / 35
  • 46. Background Methods Results Summary Case Cohort Study Design Population Model Software Computational Methods for the Case Cohort Design Standard Cox PH functions in SAS and R cannot directly compute the relative and absolute risk estimates Modifications must be made to the dataset to make use of PROC PHREG and coxph functions 16 / 35
  • 47. Computation of Pseudolikelihood Manually Subcohort failure Non-Subcohort failure Subcohort failure 40 45 50 55 60 65 70 7 6 5 4 3 2 1 ˜L(β) = exp(Z7β) exp(Z1β) + exp(Z2β) + exp(Z6β) + exp(Z7β) × exp(Z5β) exp(Z1β) + exp(Z2β) + exp(Z4β) + exp(Z5β) + exp(Z6β) × exp(Z1β) exp(Z1β) + exp(Z3β) + exp(Z4β)
  • 48. Computation of Pseudolikelihood Manually Subcohort failure Non-Subcohort failure Subcohort failure 40 45 50 55 60 65 70 7 6 5 4 3 2 1 ˜L(β) = exp(Z7β) exp(Z1β) + exp(Z2β) + exp(Z6β) + exp(Z7β) × exp(Z5β) exp(Z1β) + exp(Z2β) + exp(Z4β) + exp(Z5β) + exp(Z6β) × exp(Z1β) exp(Z1β) + exp(Z3β) + exp(Z4β) Computationally 40 45 50 55 60 65 70 7.2 7.1 6 5 4 3 2 1.2 1.1 ˜L(β) = exp(Z7.2β) exp(Z1.1β) + exp(Z2β) + exp(Z6β) + exp(Z7.2β) × exp(Z5β) exp(Z1.1β) + exp(Z2β) + exp(Z4β) + exp(Z5β) + exp(Z6β) × exp(Z1.2β) exp(Z1.2β) + exp(Z3β) + exp(Z4β)
  • 49. Background Methods Results Summary Relative Risk Estimates Absolute Risk Estimates Web Life Expectancy Tool Table: All Cause mortality All Cause Covariate 1999 2008 Wykes (2011) Age 1.04 (1.02, 1.05) 1.05 (1.04, 1.06) 1.02 (1.01, 1.03) CIRS-Gpros 1.14 (1.10, 1.18) 1.13 (1.10, 1.16) 1.12 (1.09, 1.15) 18 / 35
  • 50. Background Methods Results Summary Relative Risk Estimates Absolute Risk Estimates Web Life Expectancy Tool Table: All Cause mortality All Cause Covariate 1999 2008 Wykes (2011) Age 1.04 (1.02, 1.05) 1.05 (1.04, 1.06) 1.02 (1.01, 1.03) CIRS-Gpros 1.14 (1.10, 1.18) 1.13 (1.10, 1.16) 1.12 (1.09, 1.15) Table: Cause Specific mortality Cause Specific Covariate PCa (1999)a OC (1999) Age 1.00 (0.99, 1.02) 1.07 (1.05, 1.09) CIRS-Gpros 1.032 (0.99, 1.07) 1.25 (1.20, 1.30) a predictors not significant 18 / 35
  • 51. Background Methods Results Summary Relative Risk Estimates Absolute Risk Estimates Web Life Expectancy Tool qqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqq q qqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqq q qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq q qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq q qqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq q q qqqqqqqqq q qq qqqqqqqqqqqqqqqq q qqqqqqqqqqqqqqqqqqqqqqq q q q q q qqqqqqqqqqqqqqqqqqqqqqqqq q q qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqq qqqqqqqqqqqqqq qqqqqqqqqqqqqqq qqqqqqqqqqqqqq qqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq qqqqqqqqqqqqqqqqqqqqqqqqqqqqqq All cause (1999) All cause (2008) OC (1999) PCa (1999) Wykes (2011) 10 20 30 10 20 30 0 3 4 5 6 7 8 9101112 0 3 4 5 6 7 8 9101112 CIRS−G_pros lifeexpectancy(years) 50 60 70 80 Age Figure: Estimates from East Region (Wykes estimates based on all cause unstratified subcohort at 2008) 19 / 35
  • 52. Background Methods Results Summary Relative Risk Estimates Absolute Risk Estimates Web Life Expectancy Tool 5 10 15 20 25 0 3 4 5 6 7 8 9 10 11 12 CIRS−G_pros lifeexpectancy(years) Age 55 65 75 Model All cause (2008) Wykes (2011) Figure: Estimates from East Region (Wykes estimates based on all cause unstratified subcohort at 2008) 20 / 35
  • 53. 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 age (years) survivalprobability CCO North West region (a) All cause (1999) 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 age (years) survivalprobability CCO North West region (b) Other cause(1999) 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 age (years) survivalprobability CCO North West region (c) Prostate cancer(1999) 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 age (years) survivalprobability CCO North West region (d) All cause (2008)
  • 54. Background Methods Results Summary Relative Risk Estimates Absolute Risk Estimates Web Life Expectancy Tool Tool for Use in a Clinical Setting Built using shiny package for R 22 / 35
  • 55. Background Methods Results Summary Relative Risk Estimates Absolute Risk Estimates Web Life Expectancy Tool Tool for Use in a Clinical Setting Built using shiny package for R Requires R and an internet connection 22 / 35
  • 56. Background Methods Results Summary Relative Risk Estimates Absolute Risk Estimates Web Life Expectancy Tool Tool for Use in a Clinical Setting Built using shiny package for R Requires R and an internet connection Three simple lines of code 22 / 35
  • 57. Background Methods Results Summary Contributions Future Work References Contributions 1 Life expectancy estimates based on age and comorbidity score from a population based case cohort study 23 / 35
  • 58. Background Methods Results Summary Contributions Future Work References Contributions 1 Life expectancy estimates based on age and comorbidity score from a population based case cohort study 2 Improves previous work done with this dataset by using appropriate statistical methodology for case cohort design 23 / 35
  • 59. Background Methods Results Summary Contributions Future Work References Contributions 1 Life expectancy estimates based on age and comorbidity score from a population based case cohort study 2 Improves previous work done with this dataset by using appropriate statistical methodology for case cohort design 3 Easy-to-use interactive web tool for clinical use 23 / 35
  • 60. Background Methods Results Summary Contributions Future Work References Future Work Internal validation → risk prediction accuracy, PH assumption, compare expected values from 1999 model to observed 2008 subcohort 24 / 35
  • 61. Background Methods Results Summary Contributions Future Work References Future Work Internal validation → risk prediction accuracy, PH assumption, compare expected values from 1999 model to observed 2008 subcohort External validation → generalizability 24 / 35
  • 62. Background Methods Results Summary Contributions Future Work References Future Work Internal validation → risk prediction accuracy, PH assumption, compare expected values from 1999 model to observed 2008 subcohort External validation → generalizability Convert Langholz & Jiao (2007) SAS macro to R for absolute risk estimates 24 / 35
  • 63. Background Methods Results Summary Contributions Future Work References Alternative Analysis Treat the case cohort design as a cohort study with missing data (NestedCohort and Survey packages for R) (Mark & Katki (2008), Breslow & Lumley (2009)) 25 / 35
  • 64. Background Methods Results Summary Contributions Future Work References Alternative Analysis Treat the case cohort design as a cohort study with missing data (NestedCohort and Survey packages for R) (Mark & Katki (2008), Breslow & Lumley (2009)) Competing Risk analysis using the subdistribution hazard of Fine & Gray 25 / 35
  • 65. Background Methods Results Summary Contributions Future Work References References I M. Center et al. International Variation in Prostate Cancer Incidence and Mortality Rates European Association of Urology, 61:2012 Bryan Langholz and Jenny Jiao Computational methods for case cohort studies Computational Statistics & Data Analysis, 51:2007 Therneau, T and Li, H Computing the Cox model for case cohort designs Lifetime data analysis, 2(5):1999 26 / 35
  • 66. Background Methods Results Summary Contributions Future Work References References II PRENTICE, R. L. A case-cohort design for epidemiologic cohort studies and disease prevention trials Biometrika, 73(1):1986 RStudio and Inc. shiny: Web Application Framework for R R package version 0.6.0, 17(25):2012 T. Lumley survey: analysis of complex survey samples R package version 3.28-2,2012 27 / 35
  • 67. Background Methods Results Summary Contributions Future Work References References III Katki, H.A. and Mark, S.D. Survival Analysis for Cohorts with Missing Covariate Information The R Journal,2008 28 / 35
  • 68. Background Methods Results Summary Contributions Future Work References Influential Analysis Dfbeta residuals Are the approximate changes in the parameter estimates (ˆβ − ˆβ(j)) when the jth observation is omitted. These variables are a weighted transform of the score residual variables and are useful in assessing local influence and in computing approximate and robust variance estimates Standardized dfbeta residuals: (ˆβ − ˆβ(j)) sd(ˆβ − ˆβ(j)) 29 / 35
  • 69. Background Methods Results Summary Contributions Future Work References Why is the PsL not a Partial Likelihood Not a PL → Asymptotic distribution theory for the PL estimators breaks down because cases outside the subcohort induce non-nesting of the σ-fields, or sets on which the counting process probability measure is defined cannot use Likelihood ratio tests since PsL are not real likelihoods 30 / 35
  • 70. Background Methods Results Summary Contributions Future Work References Consistent Estimators An estimator ˆθ will perform better and better as we obtain more samples. If at the limit n → ∞ the estimator tends to be always right, it is said to be consistent. This notion is equivalent to convergence in probability Convergence in Probability Let X1, . . . , Xn be a sequence of iid RVs drawn from a distribution with parameter θ and ˆθ an estimator for θ. ˆθ is consistent as an estimator of θ if ˆθ p → θ lim n→∞ P(|ˆθ − θ| > ) = 0, ∀ > 0 31 / 35
  • 71. Background Methods Results Summary Contributions Future Work References Competing Risks when disease of interest has a delayed occurrence, other events may preclude observation of disease of interest giving rise to a competing risk situation to analyze the event of interest taking into account the competing risks, one has to model the hazard of subdistribution. In the presence of competing risks, the probability to observe the event of interest spans the interval [0, p], p < 1. Because p does not reach 1, this function is not a proper distribution function and it is called a subdistribution Modify the partial likelihood in the Cox model to model hazard of subdistribution, such that the individuals experiencing competing risks are always in the risk set with their contribution regulated by a weight 32 / 35
  • 72. Background Methods Results Summary Contributions Future Work References Competing Risks Fine and Gray Partial Likelihood: PL(β) = n j=1     exp {βxj } r∈˜Rj wrj exp {βxr }     δj (1) wrj = ˆG(tj ) ˆG(min(tj , tr )) (2) Cj =    1 if the event of interest was observed at time tj 2 if the competing risk event was observed at time tj 0 if no event was observed at time tj δj = 1 Cj = 1 0 Cj = 0 or Cj = 2 33 / 35
  • 73. Background Methods Results Summary Contributions Future Work References Competing Risks ˆG(tj ): estimated probability of being censored at time tj by either the event of interest or a competing risks event ˜Rj = {r; tr ≥ tj or Cr = 2} individuals experiencing competing risk event are always considered in the risk set at all time points The weight controls how much a specific observation participates in the sum in the denominator If the observation at time tr is censored (Cr = 0) or has the event of interest (Cr = 1), the weight is either 1 or 0 depending whether the observation is in the risk set (tr ≥ tj ) or it is not in the risk set (tr < tj ) If the observation has a competing risk event, then it is always in the risk set and the weight is 1 if (tr ≥ tj ) or a quantity less than 1 if tr < tj This quantity decreases as the distance between tr and tj 34 / 35
  • 74. Background Methods Results Summary Contributions Future Work References Case-cohort in the presence of competing risks PsL(β) = n j=1     exp {βxj } r∈˜Rj Irj wrj exp {βxr }     δj δj = 1 Cj = 1 0 Cj = 0 or Cj = 2 Irj = 1 r is in subcohort or r is a case outside the subcohort and tr 0 r is a case outside the subcohort and tr = tj wrj = ˆG(tj ) ˆG(min(tj ,tr )) , where ˆG(tj ) is the estimation of the probability of censoring regardless of the fact that in a case-cohort study the cases are oversampled 35 / 35