PMED Opening Workshop - Measurement Error and Precision Medicine - Michael Wallace, August 14, 2018

Measurement error and precision medicine
Michael Wallace
Assistant Professor, Biostatistics
Department of Statistics and Actuarial Science
University of Waterloo
michael.wallace@uwaterloo.ca
August 14, 2018

Outline
1. Crash course 1: personalized medicine
2. Crash course 2: measurement error
3. Worlds collide/some specific problems
4. Application: STAR*D

Treating the patient, not the diagnosis
Heterogeneity between patients:
[Figure: Patient 1 and Patient 2 on a timeline from 0 to 3 months, with treatments labelled Drug A and Drug B.]
Heterogeneity within patients:
[Figure: Patient 3 on a timeline from 0 to 6 months; Drug A initially, then Drug A or Drug B from 3 months.]

Dynamic treatment regimes
Dynamic treatment regimes (DTRs) ‘formalize’ the process of
personalized medicine:
“If patient BMI over 30 prescribe therapy A, otherwise provide
therapy B.”
[Diagram: Patient information → DTR → Treatment recommendation]
DTRs can lead to improved results over standard ‘one size fits
all’ approaches.

Notation
X: pre-treatment covariates (e.g., BMI)
A: treatment received (Drug A or Drug B?)
Y: outcome ('healthiness metric')
DTR: treatment Aopt that maximizes E[Y |X, Aopt]

Identifying the best treatment regime: multi-stage
More often, we work in stages.
[Diagram: Stage 1 (X1, A1), then Stage 2 (X2, A2), then outcome Y.]

Stage 2: history H2 = (X1, A1, X2), treatment A2, outcome Y.

Stage 1: covariates X1, treatment A1, outcome Y2^opt (the expected outcome if stage 2 treatment is optimal).

Lots of methods available:
Q-learning
G-estimation
IPTW
A-learning
dWOLS
MSMs
OWL
etc...

Identifying the best treatment regime
If only one treatment decision:
E[Y |X, A]
Expected outcome
(to be maximized)
We might propose the following model
E[Y |X, A; β, ψ] = β0 + β1BMI + A(ψ0 + ψ1BMI)

More generally, split the outcome into two components:
E[Y |X, A; β, ψ] = G(X; β) + γ(X, A; ψ)
E[Y |X, A; β, ψ]: expected outcome (to be maximized)
G(X; β): impact of patient history in the absence of treatment
γ(X, A; ψ): impact of treatment on outcome
Simplifies focus: find Aopt that maximizes γ(X, A; ψ).
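To make the decomposition concrete, here is a minimal sketch of the linear blip from the BMI model and the rule it implies for a binary treatment. The function names and the parameter values are hypothetical and chosen only for illustration; they are not estimates from the talk.

```python
import numpy as np

def blip(x, a, psi0, psi1):
    """Linear blip gamma(x, a; psi) = a * (psi0 + psi1 * x)."""
    return a * (psi0 + psi1 * np.asarray(x, dtype=float))

def optimal_treatment(x, psi0, psi1):
    """For binary A, the blip is maximized by A = 1 exactly when psi0 + psi1 * x > 0."""
    return (psi0 + psi1 * np.asarray(x, dtype=float) > 0).astype(int)

# Hypothetical blip parameters: treatment helps only when BMI exceeds 30.
bmi = np.array([24.0, 28.0, 36.0])
a_opt = optimal_treatment(bmi, psi0=-3.0, psi1=0.1)
print(a_opt)
```

Note that G(X; β) plays no role in the rule: only the blip parameters matter for choosing the treatment.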

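Suppose the true outcome model is:
$E[Y \mid X, A; \beta, \psi] = \beta_0 + \beta_1 \mathrm{BMI} + \beta_2 \mathrm{BMI}^2 + A(\psi_0 + \psi_1 \mathrm{BMI})$
But we propose:
$E[Y \mid X, A; \beta, \psi] = \beta_0 + \beta_1 \mathrm{BMI} + A(\psi_0 + \psi_1 \mathrm{BMI})$
Problem: what if A depends on BMI?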

Dynamic WOLS (dWOLS)
E[Y |X, A; β, ψ] = G(X; β) + γ(X, A; ψ)
Three models to specify:
1. Blip model: γ(X, A; ψ).
2. Treatment-free model: G(X; β).
3. Treatment model: P(A = 1|X; α).
Estimate ψ via weighted ordinary least squares (WOLS) of Y on the
covariates in the blip and treatment-free models, with weights
w = |A − P(A = 1|X; α̂)|.

Estimates are doubly robust: consistent if either G(X; β) or
P(A = 1|X; α) is correctly specified.
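The single-stage procedure can be sketched as follows. This is an illustrative implementation under stated assumptions, not the author's code: the helper names (`expit`, `fit_logistic`, `dwols`), the simulated data, and all parameter values are hypothetical. The treatment model is fitted by a plain Newton-Raphson logistic regression, and the treatment-free model is deliberately mis-specified (the quadratic term is omitted) while the treatment model is correct, so the blip estimates should still land near the true values (−1, 1).

```python
import numpy as np

def expit(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic(Z, a, n_iter=25):
    """Logistic regression of a on Z (Z includes an intercept column), via Newton-Raphson."""
    alpha = np.zeros(Z.shape[1])
    for _ in range(n_iter):
        p = expit(Z @ alpha)
        grad = Z.T @ (a - p)
        hess = (Z * (p * (1.0 - p))[:, None]).T @ Z
        alpha = alpha + np.linalg.solve(hess, grad)
    return alpha

def dwols(y, a, X_tf, X_blip, Z_trt):
    """Weighted least squares of y on [X_tf, a * X_blip] with weights |a - P(A=1|X)|."""
    p_hat = expit(Z_trt @ fit_logistic(Z_trt, a))
    w = np.abs(a - p_hat)
    design = np.hstack([X_tf, a[:, None] * X_blip])
    sw = np.sqrt(w)
    coef, *_ = np.linalg.lstsq(design * sw[:, None], y * sw, rcond=None)
    return coef[X_tf.shape[1]:]  # blip parameters psi

# Simulated data (all values are assumptions made for this sketch).
rng = np.random.default_rng(0)
n = 50000
x = rng.normal(size=n)
a = rng.binomial(1, expit(0.5 * x)).astype(float)
y = 1.0 + x + x**2 + a * (-1.0 + x) + rng.normal(size=n)

lin = np.column_stack([np.ones(n), x])  # intercept and x only: omits x**2
psi_hat = dwols(y, a, X_tf=lin, X_blip=lin, Z_trt=lin)
print(psi_hat)
```

The square-root-of-weights trick turns the weighted problem into an ordinary least-squares problem, which keeps the sketch free of any dependency beyond NumPy.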

Multi-stage recursion
In the multi-stage setting we conduct a single-stage analysis at
each stage by forming pseudo-outcomes:
$$\tilde{Y}_j = Y + \sum_{k=j+1}^{J} \left[\gamma_k(X_k, A_k^{opt}; \hat\psi_k) - \gamma_k(X_k, A_k; \hat\psi_k)\right]$$
$\tilde{Y}_j$ is the expected outcome assuming optimal treatment from
stage j + 1 onwards.
We plug $\tilde{Y}_j$ into our dWOLS procedure and proceed similarly.
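The pseudo-outcome can be sketched in a few lines. The function name and the toy numbers below are hypothetical, and the sketch assumes binary treatments with linear blips of the form a(ψ0 + ψ1x) at every later stage.

```python
import numpy as np

def pseudo_outcome(y, later_stages):
    """Add, for each later stage k, the blip under optimal treatment minus the blip
    under the treatment actually received.

    later_stages: list of (x_k, a_k, (psi0_k, psi1_k)) for k = j+1, ..., J.
    """
    y_tilde = np.asarray(y, dtype=float).copy()
    for x_k, a_k, (psi0, psi1) in later_stages:
        lin = psi0 + psi1 * np.asarray(x_k, dtype=float)
        a_opt = (lin > 0).astype(float)          # optimal binary treatment
        y_tilde += a_opt * lin - np.asarray(a_k, dtype=float) * lin
    return y_tilde

# Toy two-stage example: form the stage 1 pseudo-outcome from stage 2 quantities.
y = np.array([10.0, 10.0])
x2 = np.array([2.0, 0.0])
a2 = np.array([0.0, 1.0])   # both patients received the non-optimal treatment
y_tilde_1 = pseudo_outcome(y, [(x2, a2, (-1.0, 1.0))])
print(y_tilde_1)
```

Each patient's adjustment is non-negative: it is zero when the received treatment was already optimal and positive otherwise.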

Summary
Our dWOLS analysis can be broken down into three separate
components:
1. Find blip parameter estimates $\hat\psi$ via dWOLS.
2. Implement the rule “treat if $\hat\psi X > 0$”.
3. (If multi-stage) form pseudo-outcomes
$$\tilde{Y}_j = Y + \sum_{k=j+1}^{J} \left[\gamma_k(X_k, A_k^{opt}; \hat\psi_k) - \gamma_k(X_k, A_k; \hat\psi_k)\right].$$
Easy!

Measurement Error
[Diagram: True history, Treatment, Outcome, with Reported history added alongside True history.]

Measurement Error
Classical additive measurement error:
Observed = True + Error
W = X + U
Key assumptions:
$U \sim N(0, \sigma_u^2)$
Non-differential: Y ⊥ W |X

Correction Methods
To account for measurement error we need to know about U.
i W X
1 5 -
2 8 -
3 7 -
4 3 -
5 4 -
... ... ...

Correction Methods
We’ll assume replicate data are available:
i W1 W2
1 5 4
2 8 6
3 7 -
4 3 -
5 4 4
... ... ...
We then have numerous error correction methods to choose from.
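With replicates, the error variance can be estimated from within-patient differences, since under the classical model W1 − W2 = U1 − U2 has variance 2σ_u². The sketch below is an illustration under that assumption (independent, mean-zero errors with equal variance); the function name is hypothetical, and the data are the illustrative values from the table above, with missing replicates coded as NaN.

```python
import numpy as np

def error_variance_from_replicates(w1, w2):
    """Estimate sigma_u^2 as mean((W1 - W2)^2) / 2 over patients with both replicates."""
    w1 = np.asarray(w1, dtype=float)
    w2 = np.asarray(w2, dtype=float)
    both = ~np.isnan(w1) & ~np.isnan(w2)
    d = w1[both] - w2[both]
    return np.mean(d ** 2) / 2.0

w1 = [5, 8, 7, 3, 4]
w2 = [4, 6, np.nan, np.nan, 4]
sigma2_u_hat = error_variance_from_replicates(w1, w2)
print(sigma2_u_hat)
```

Only the three patients with both measurements contribute here; patients with a single measurement carry no information about U in this estimator.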

Regression Calibration
Simple correction method: Regression Calibration.
Principle:
1. Use additional data to estimate E[X|W , A] = Xrc.
2. Replace X with Xrc and carry out a standard analysis.
3. Adjust the resulting standard errors to account for the
estimation in step 1.
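Steps 1 and 2 can be sketched for the simplest case. This is a simplified illustration, not the procedure used in the talk: it assumes every patient has two replicates, uses the best linear predictor of X given the replicate mean, and does not condition on A. Step 3 (standard error adjustment, for example by bootstrap) is omitted. The function name and simulated values are hypothetical.

```python
import numpy as np

def regression_calibration(w1, w2):
    """Return X_rc = mu + lambda * (W_bar - mu) and the estimated shrinkage factor lambda."""
    w1 = np.asarray(w1, dtype=float)
    w2 = np.asarray(w2, dtype=float)
    w_bar = (w1 + w2) / 2.0
    sigma2_u = np.mean((w1 - w2) ** 2) / 2.0            # error variance of one replicate
    mu = w_bar.mean()
    # Var(W_bar) = sigma_x^2 + sigma_u^2 / 2 with two replicates.
    sigma2_x = max(w_bar.var(ddof=1) - sigma2_u / 2.0, 0.0)
    lam = sigma2_x / (sigma2_x + sigma2_u / 2.0)
    return mu + lam * (w_bar - mu), lam

# Simulated check (assumed X ~ N(0, 1) and U ~ N(0, 1)).
rng = np.random.default_rng(1)
x = rng.normal(size=20000)
w1 = x + rng.normal(size=x.size)
w2 = x + rng.normal(size=x.size)
x_rc, lam = regression_calibration(w1, w2)
slope = np.polyfit(x_rc, x, 1)[0]   # regressing the truth on X_rc should give a slope near 1
print(lam, slope)
```

In this simulated setting the true shrinkage factor is 1/(1 + 0.5) = 2/3, so observed values are pulled about a third of the way back toward the mean.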

$E[Y \mid \cdot] = \beta_0 + \beta_1 X + \beta_2 X^2 + A(\psi_0 + \psi_1 X)$
If we have RC estimates $X_{rc}$ then we could fit
$E[Y \mid \cdot] = \beta_0 + \beta_1 X_{rc} + \beta_2 X_{rc}^2 + A(\psi_0 + \psi_1 X_{rc})$
But we might mis-specify the model as
$E[Y \mid \cdot] = \beta_0 + \beta_1 X_{rc} + A(\psi_0 + \psi_1 X_{rc})$
A depends on W. Establish (approximate) covariate balance
in $X_{rc}$ by regressing A on $X_{rc}$.

Simulation Studies
Measurement error: $W_r = X + U$; $U \sim N(0, 1)$; $r = 1, 2$
Outcome: $E[Y \mid X, A; \beta, \psi] = \beta_0 + \beta_1 X + \beta_2 X^2 + A(\psi_0 + \psi_1 X)$
Treatment: $P(A = 1 \mid W_1) = \left[1 + \exp\{-(0.5 W_1 + 0.5 W_1^2)\}\right]^{-1}$
We set $\psi_0 = -1$; $\psi_1 = 1$ ← our target
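A data-generating sketch for this setup is given below. The slide does not state the distribution of X, the values of β, the residual variance, or the sample size, so X ~ N(0, 1), β = (1, 1, 1), unit-variance normal residuals, and n = 1000 are assumptions made purely for illustration; the function name is also hypothetical.

```python
import numpy as np

def simulate(n, psi=(-1.0, 1.0), beta=(1.0, 1.0, 1.0), seed=0):
    rng = np.random.default_rng(seed)
    x = rng.normal(size=n)                        # assumed: X ~ N(0, 1)
    w = x[:, None] + rng.normal(size=(n, 2))      # W_r = X + U_r, U_r ~ N(0, 1), r = 1, 2
    lin = 0.5 * w[:, 0] + 0.5 * w[:, 0] ** 2      # treatment depends on W_1, not on X
    p = 1.0 / (1.0 + np.exp(-lin))
    a = rng.binomial(1, p)
    mean_y = beta[0] + beta[1] * x + beta[2] * x**2 + a * (psi[0] + psi[1] * x)
    y = mean_y + rng.normal(size=n)               # assumed: unit-variance normal residuals
    return {"x": x, "w": w, "a": a, "y": y, "p": p}

data = simulate(1000)
print(data["a"].mean(), data["p"].min())
```

Because 0.5w + 0.5w² is never below −0.125, the treatment probability in this design never falls below about 0.47.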

Simulation studies: truth (X)
[Figure: blip parameter estimates for ψ0 and ψ1 (vertical axis −2 to 2) by which models are correct: Neither, Treatment, Treatment-free, Both.]

Simulation studies: naive (W )
[Figure: blip parameter estimates for ψ0 and ψ1 (vertical axis −2 to 2) by which models are correct.]

Truth: “treat if X > 1”; Estimated: “treat if X > 2”
[Figure: blip parameter estimates for ψ0 and ψ1 (vertical axis −2 to 2) by which models are correct.]

Simulation studies: corrected (Xrc)
[Figure: blip parameter estimates for ψ0 and ψ1 (vertical axis −2 to 2) by which models are correct.]

Simulation studies
[Figure: blip parameter estimates for ψ0 and ψ1 (vertical axis −2 to 2) from the Naive, True, and RC analyses, with both models correctly specified.]

Recursion
Recall that the multi-stage case requires the computation of
pseudo-outcomes:
$$\tilde{Y}_j = Y + \sum_{k=j+1}^{J} \left[\gamma_k(X_k, A_k^{opt}; \hat\psi_k) - \gamma_k(X_k, A_k; \hat\psi_k)\right]$$
Question: What happens if we use Wk instead of Xk? And what
should we do about it?
Answer: We’re not sure (yet).

Future treatment
After estimating ψ, we have “Aopt = 1 if ψ̂0 + ψ̂1X > 0”
Suppose “Aopt = 1 if X > τ” (or vice-versa) for ‘threshold’ τ = −ψ̂0/ψ̂1
We observe W = X + U
Questions of practical interest:
P(X < τ|W = w > τ) P(X > τ|W = w < τ)
(e.g., if observed BMI = 31, the probability true BMI < 30)

Future treatment
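If $X \sim N(\mu_x, \sigma_x^2)$, $U \sim N(0, \sigma_u^2)$, $W = X + U$ then
$$X \mid W = w \sim N\left(\frac{\sigma_x^2 w + \sigma_u^2 \mu_x}{\sigma_x^2 + \sigma_u^2},\ \frac{\sigma_x^2 \sigma_u^2}{\sigma_x^2 + \sigma_u^2}\right)$$
Can easily estimate
P(X < τ|W = w > τ) P(X > τ|W = w < τ)
for any given patient.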
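These probabilities follow directly from the conditional distribution above. The sketch below assumes the normal model holds and that μx, σx², and σu² are known or have been estimated; the function names are hypothetical.

```python
from math import erf, sqrt

def normal_cdf(z):
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def prob_mistreatment(w, tau, mu_x, sigma2_x, sigma2_u):
    """P(X < tau | W = w) if w > tau, otherwise P(X > tau | W = w)."""
    total = sigma2_x + sigma2_u
    cond_mean = (sigma2_x * w + sigma2_u * mu_x) / total
    cond_var = sigma2_x * sigma2_u / total
    p_below = normal_cdf((tau - cond_mean) / sqrt(cond_var))
    return p_below if w > tau else 1.0 - p_below

# X ~ N(0, 1), U ~ N(0, 1), threshold tau = 0, observed W = 1.
p = prob_mistreatment(w=1.0, tau=0.0, mu_x=0.0, sigma2_x=1.0, sigma2_u=1.0)
print(round(p, 4))
```

With τ = 0 the result is symmetric in w, and it approaches 0.5 as the observed value approaches the threshold.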

Future treatment
In some settings, results fairly intuitive:
[Figure: probability of mis-treatment against W (−3 to 3; vertical axis 0.0 to 0.5), with X ~ N(0,1), U ~ N(0,1), τ = 0.]

Future treatment
In others, perhaps more of a surprise (to some):
[Figure: probability of mis-treatment against W (−3 to 3; vertical axis 0.0 to 0.6), with X ~ N(0,1), U ~ N(0,1), τ = 1.]

STAR*D Study
Sequenced Treatment Alternatives to Relieve Depression
Multi-stage: treatment ‘switching’ vs. ‘augmentation’
Outcome: Quick Inventory of Depressive Symptomatology
(QIDS) score (integers from 0-27).
Key variable: patient preference for different types of therapy.

STAR*D
QIDS score: clinician- and patient-rated at each stage
⇒ replicates?
[Figure: clinician-rated and patient-rated symptom scores plotted against each other, one panel for Stage 1 and one for Stage 2.]
(Clinician ratings 0.60 higher at stage 1, 0.25 higher at stage 2.)

STAR*D
Treatment: SSRI* (A = 1) vs. non-SSRI (A = 0) therapies.
*selective serotonin reuptake inhibitor
Variables we’ll consider:
q1, q2: QIDS score at start of stage 1, 2
s1, s2: QIDS ‘slope’ for the previous stage (rate of change of q1, q2)
p1: patient treatment preference for stage 1
[Diagram: Stage 0 (X0, A0, y0); Stage 1 (q1, s1, p1, a1, y1); Stage 2 (q2, s2, a2); sample sizes shown: n = 1027 and n = 273.]

STAR*D
Consider ‘full’ blip models at each stage:
stage 1: γ1(hψ1, a1; ψ1) = a1(ψ10 + q1ψ11 + s1ψ12 + p1ψ13)
stage 2: γ2(hψ2, a2; ψ2) = a2(ψ20 + q2ψ21 + s2ψ22)
Model selection: Wald-type or (new!) quasilikelihood information
criterion.

STAR*D
                  Stage 1                Stage 2
Variable          ψ̂ (Naive)   ψ̂ (RC)    ψ̂ (Naive)   ψ̂ (RC)
Intercept         -1.498      -0.792     -2.231      -5.073
QIDS score (q)     0.117       0.066      0.263       0.491*
QIDS slope (s)    -0.512      -0.031     -0.020      -1.393
Preference (p)    -2.040      -1.726      -           -
Bold: recommended variables in final model.
*May not survive contact with the bootstrap...

Summary/Future Work
So far:
Measurement error poses unique challenges in the
personalized medicine setting.
Biased/incorrect treatment rules.
Theoretical issues (double robustness, recursion).
Consequences for future treatments tailored on error-prone
observations.

Moving forward:
Other correction methods (SIMEX, conditional score, etc.).
New methodological work specific to DTR/personalized
framework.
Diagnostics for extant analyses/datasets.

References/Acknowledgments
A comprehensive guide to DTRs: B. Chakraborty and E. E.
M. Moodie (2013). Statistical Methods for Dynamic
Treatment Regimens. Springer: New York.
dWOLS: M. P. Wallace and E. E. M. Moodie (2015).
Doubly-robust dynamic treatment regimen estimation via
weighted least squares. Biometrics 71(3) 636-644.
michael.wallace@uwaterloo.ca
@statacake

[Figure: True and Observed curves plotted against X or W (15 to 45; vertical axis 0.00 to 0.20).]

Choosing replicates
[Figure: error variance estimate (0.5 to 2.0) against observed value W (27 to 33).]
But: non-random re-sampling can lead to poor error estimates.

Measurement Error Effects
[Figure: scatterplot of y against x or w (horizontal axis −2 to 1, vertical axis −2 to 2), showing True and Observed points.]
