SlideShare a Scribd company logo
1 of 41
Download to read offline
New Developments in East:
Design and Simulation of Trials with
Multiple Treatment Arms
Cytel Webinar Series
October 4-5, 2011
Cyrus R. Mehta, Ph.D
Cytel Inc., Cambridge, MA
email: mehta@cytel.com – web: www.cytel.com – tel: 617-661-2011
1 Cytel Webinar Series. October 4-5, 2011
Outline of Presentation
Multiple Comparisons Procedures in SiZ(TM)
• Overview of SiZ(TM)
• Brief introduction to MCP and MCP module in SiZ(TM)
• Motivating example: Alzheimers disease
• Parametric tests (Dunnett, Step-Down Dunnett)
• Nonparametric tests (Bonferroni, Holms, Hochberg, Hommel,
Fixed Sequence, Fall Back)
• Principle of closed testing
• Short-cuts to closed testing
New Architecture for East-6
• Integration of SiZ(TM)
, East(R)
and modules for adaptive multi-arm
trials into one unified platform
2 Cytel Webinar Series. October 4-5, 2011
Quick Overview of SiZ
3 Cytel Webinar Series. October 4-5, 2011
Some Sources of Multiplicity
• Repeated significance tests
• Multiple treatment arms
• Multiple endpoints
• Subgroup analysis
• Variable selection in regression models
4 Cytel Webinar Series. October 4-5, 2011
Error Rates for Multiplicity Problems
• There is a ‘family’ of m inferences
• Parameters are δ1, δ2, . . . δm
• Null hypotheses are H1, H2, . . . Hm
• Comparisonwise error rate applies to an individual
hypothesis; offers ‘local control’ of type-1 error
CERj = P (reject Hj|Hj is true)
• Familywise error rate applies to the entire family
FWER = P (reject at least one true null hypotheses)
• Usually wish to control FWER at some level α
5 Cytel Webinar Series. October 4-5, 2011
Strong and Weak Control of FWER
• Control of FWER at level α means that
P (reject at least one true null hypotheses) ≤ α (1)
• Strong Control of FWER means that (1) is satisfied under
all partial null hypotheses of the type HI = ∩i∈IHi for all
subsets I ⊆ {1, 2, . . . m}
• Weak Control of FWER means that (1) is satisfied only
under some HI, typically I = {1, 2, . . . m}
• Strong control of FWER is a regulatory requirement if
multiple statements about product efficacy are to be
included in the product label
6 Cytel Webinar Series. October 4-5, 2011
Motivation for MCP Design
Software
• Many confirmatory trials have multiple arms or multiple
endpoints
• Software to perform multiplicity adjusted analyses for such
trials exists (in R and SAS)
• Sample size software for such designs is, however, limited
• SiZ can evaluate the operating characteristics of the
different MCPs and choose the best one for the study
7 Cytel Webinar Series. October 4-5, 2011
Motivating Example: Alzheimer’s
Disease
• Randomized, double-blind, placebo controlled, parallel
group trial
• Three doses (0.3 mg, 1 mg, 2 mg) compared to placebo
• Primary endpoint: change from baseline in ADAS-cog-11
at week 24
• Difference from placebo expected to be between 1.5 and
2.5 units with common standard deviation σ = 5
Multiplicity arises because the trial is considered successful if
at least one dose is declared statistically significant
8 Cytel Webinar Series. October 4-5, 2011
The MCP Module in SiZ
• Simulation based sample size calculations
• Parametric Tests: Single-step Dunnett and step-down Dunnett
• Nonparametric Tests: Bonferroni, Sidak, Weighted Bonferroni, Holm,
Hochberg, Hommel, Fixed-sequence, Fall-back
9 Cytel Webinar Series. October 4-5, 2011
Dunnett’s Procedure
• Let Yij ∼ N(μi, σ2
) be response of subject j = 1, 2, . . . n on
treatment i = 0, 1, 2, . . . m, where i = 0 denotes the control arm
• The marginal t-statistic for ith treatment effect is
ti =
yi − y0
s 2/n
• Denote the cumulative distribution function for the maximum ti by
F (x|m, ν) = P {max(T1, T2, . . . Tm) ≤ x}
• Under H0: μi − μ0 = 0 for all i, F (x|m, ν) is multivariate-t with
ν = (m + 1)(n − 1) degrees of freedom
• Compute qα,m, the (1 − α) quantile of F , defined by
F (qα,m|m, ν) = 1 − α
The critical value qα,m is evaluated by numerical integration
10 Cytel Webinar Series. October 4-5, 2011
Single-Step Dunnett
Reject every null hypothesis
Hi: μi − μ0 = 0
for which ti ≥ qα,m
11 Cytel Webinar Series. October 4-5, 2011
Simulations of Single-Step Dunnett
• A total sample size of 360 patients produces the desired 90% global
power with Dunnett’s single-step procedure
12 Cytel Webinar Series. October 4-5, 2011
Power Definitions for Multiplicity Problems
Global Power: Probability of rejecting at least one null
hypothesis
Disjunctive Power: Probability of rejecting at least one null
hypothesis that is false
Conjunctive Power: Probability of rejecting all null
hypotheses that are false
Global and Disjunctive power are usually almost the same
13 Cytel Webinar Series. October 4-5, 2011
Step-Down Dunnett
Key Idea: Because the test statistics T1, T2, . . . Tm are
positively correlated, if you reject a hypothesis, the rejection
criteria for the remaining hypotheses can be weakened
Accordingly let:
qα,m satisfy F (qα,m|m, ν) = α; denote it by c1
qα,m−1 satisfy F (qα,m−1|m − 1, ν) = α; denote it by c2
...
qα,1 satisfy F (qα,1|1, ν) = α; denote it by cm
14 Cytel Webinar Series. October 4-5, 2011
Step-Down Dunnett, contd.
Let T(1) ≥ T(2) ≥ · · · ≥ T(m) denote the m order statistics
• Step 1. If t(1) ≥ c1, reject H(1) and go to the next step.
Otherwise retain all hypotheses and stop.
• Steps i = 2, . . . , m − 1. If t(i) ≥ ci, reject H(i) and go to
the next step. Otherwise retain H(i), . . . H(m) and stop.
• Step m. If t(m) ≥ cm reject H(m). Otherwise retain H(m).
Since c1 > c2 > · · · > cm, step-down Dunnett rejects at least
as many (and sometimes more) hypotheses as single step
15 Cytel Webinar Series. October 4-5, 2011
Simulations of Step-Down Dunnett
• The step-down test improves the conjunctive power
• Global and disjunctive power are the same in this example
since all null hypotheses are false
16 Cytel Webinar Series. October 4-5, 2011
Advantages and Limitations of
Dunnett
Advantages
• More powerful than nonparametric procedures if
assumptions are met
• Generates multiplicity adjusted confidence intervals for
individual treatment effects
Limitations
• Relies on normality assumption
• Relies on homoscedasticity assumption
17 Cytel Webinar Series. October 4-5, 2011
Dunnett’s FWER under unequal
variance
In 100,000 simulations FWER was not preserved for Dunnett but was
preserved for Bonferroni and Sidak (two non-parametric MCPs)
18 Cytel Webinar Series. October 4-5, 2011
Nonparametric MCPs
• Do not require any distributional assumptions
• Bonferroni and Sidak are single-step MCPs
• Holms is a step-down MCP (start with the biggest effect
and work your way down)
• Hochberg and Hommel are step-up MCPs (start with the
smallest effect and work your way up)
• Fixed Sequence and Fall Back procedures test each
individual hypothesis in a pre-specified order
19 Cytel Webinar Series. October 4-5, 2011
Bonferroni Procedure
• Let p1, p2, . . . pm be the m marginal
p-values
• The Bonferroni procedure rejects any Hi
for which
pi ≤
α
m
20 Cytel Webinar Series. October 4-5, 2011
Comparing Dunnett and Bonferroni
Dunnett loses power because of heteroscedasticity
21 Cytel Webinar Series. October 4-5, 2011
The Weighted Bonferroni
• Let wi < 1 be the fraction of α allocated to testing Hi,
where m
i=1 wi = 1
• The weighted Bonferroni procedure rejects any Hi for
which
pi ≤ wiα
Note: The regular Bonferroni is a special case of the
weighted Bonferroni in which wi = 1
m
for all i
22 Cytel Webinar Series. October 4-5, 2011
Can we improve on Bonferroni for testing
individual hypotheses?
• Suppose we are testing m individual hypotheses
• Let p1, p2, . . . pi be the individual p-values
• The Bonferrioni procedure rejects each Hi for which
pi ≤ α
m
. Can we do better than a single cut-off?
• Yes! By applying closed testing, we don’t require the same
strict criterion for every individual hypothesis
23 Cytel Webinar Series. October 4-5, 2011
Closed Testing of Individual Hypotheses
Given individual hypotheses H1, H2, . . . Hm:
1. Construct the closed set consisting of all possible intersections of the
individual hypotheses of the form Hi1 ∩ Hi2 ∩ · · · ∩ Hiq for all
q = 1, 2, . . . m
2. Specify a local level-α test for each member of the closed set
3. An individual Hi may be rejected with strong control of FWER at
level α if both these conditions hold:
• Hi is rejected by its local level-α test
• All intersection hypotheses that contain Hi are also rejected by
their local level-α tests
24 Cytel Webinar Series. October 4-5, 2011
Example of Closed Testing
Acknowledgement: This slide has been taken from Peter Westfall’s notes
25 Cytel Webinar Series. October 4-5, 2011
Testing the Intersection Hypotheses
The key difference between one closed testing procedure and
another is the method used to test intersection hypotheses of
the form Hi1
∩ Hi2
∩ · · · ∩ Hiq
. There are many candidates
• Use Dunnetts test for the intersection hypotheses
Short Cut: Step-down Dunnett procedure
• Use the Bonferroni test for the intersection hypotheses
Short Cut: Step-down Holms procedure
• Using Simes test for the intersection hypotheses
Short Cut: Step-up Hommel procedure
• Using Hochberg’s test for the intersection hypotheses
Short Cut: Step-up Hochberg procedure
26 Cytel Webinar Series. October 4-5, 2011
Level-α Tests of H1 ∩ H2 ∩ · · · Hm
Let p(1) ≤ p(2) ≤ . . . ≤ p(m) be the ordered p-values
Bonferroni: Reject if p(1) ≤ α/m
Hochberg: Reject if
p(m) ≤ α or p(m−1) ≤
α
2
or p(m−2) ≤
α
3
or . . . or p(1) ≤
α
m
Simes: Reject if
p(m) ≤ α or p(m−1) ≤
(m − 1)α
m
or p(m−2) ≤
(m − 2)α
m
or . . . or p(1) ≤
α
m
Hochberg is more powerful than Bonferroni. Simes is more powerful than
Hochberg. Simes is only valid if the p-values are positively correlated
27 Cytel Webinar Series. October 4-5, 2011
Holm’s Procedure
• Order the p-values for the individual hypotheses in
ascending order as p(1) ≤ p(2) ≤ · · · ≤ p(m)
• If p(1) ≤ α/m reject H(1) and continue testing; else stop
• If p(2) ≤ α/(m − 1) reject H(2) and continue testing; else
stop
• In general, reject p(i) if p(i) ≤ α/(m − i + 1) and
H(1), H(2), . . . H(i−1) have already been rejected
28 Cytel Webinar Series. October 4-5, 2011
Power Comparison: Bonferroni vs Holms
Global and disjunctive power are equal; global power for Bonferroni and
Holm are equal but big differences in conjunctive and individual power
29 Cytel Webinar Series. October 4-5, 2011
Hochberg’s Procedure
• Order the p-values in ascending order as p(1) ≤ p(2) · · · ≤ p(m)
• If p(m) ≤ α, reject all m hypotheses and stop; otherwise retain H(m)
and continue testing
• If p(m−1) ≤ α/2, reject all m − 1 hypotheses and stop; otherwise
retain H(m−1) and continue testing
• If p(m−2) ≤ α/3, reject all m − 2 hypotheses and stop; otherwise
retain H(m−1) and continue testing
• In general, at any step i = 1, 2, . . . m, if
p(m−i+1) ≤
α
i
reject all m − i + 1 hypotheses and stop; otherwise retain H(m−i+1)
and continue testing
30 Cytel Webinar Series. October 4-5, 2011
Hommel’s Procedure
• Order the p-values in ascending order as p(1) ≤ p(2) · · · ≤ p(m)
• If p(m) ≤ α, reject all m hypotheses and stop; otherwise retain H(m)
and continue testing
• If p(m) ≤ α or p(m−1) ≤ α/2, reject all m − 1 hypotheses and stop;
otherwise retain H(m−1) and continue testing
• If p(m) ≤ α or p(m−1) ≤ 2α/3 or p(m−2) ≤ α/3, reject all m − 2
hypotheses and stop; otherwise retain H(m−2) and continue testing
• In general, at any step i = 1, 2, . . . m, if
p(m) ≤
iα
i
or p(m−1) ≤
(i − 1)α
i
or p(m−2) ≤
(i − 2)α
i
. . . p(m−i+1) ≤
α
i
reject all m − i + 1 hypotheses and stop; otherwise retain H(m−i+1)
and continue testing
31 Cytel Webinar Series. October 4-5, 2011
Hommel versus Hochberg
• Hommel’s procedure is clearly more powerful than
Hochberg’s since it gives more chances for rejection at
each step
• But it is rarely used in practice because it is perceived to
be too complicated
• Hochberg is very simple to explain and use, hence more
popular
32 Cytel Webinar Series. October 4-5, 2011
Comparing Power for Bonferroni, Holms,
Hochberg and Hommel Procedures
33 Cytel Webinar Series. October 4-5, 2011
Fixed Sequence Testing
• Assume H1, H2, ..., Hm are ordered hypotheses; i.e., μi ≥ μi−1,
i = 1, 2, . . . m
• Let p1, p2, ..., pm be the associated raw p-values
– Step 1. If p1 < α, reject H1 and go to the next step. Otherwise
retain all hypotheses and stop
– Step i = 2, . . . , m − 1. If pi < α, reject Hi and go to the next
step. Otherwise retain all the remaining hypotheses and stop
– Step m. If pm < α, reject Hm; otherwise retain it.
• More powerful than other procedures if ordering is correct
• Closed under fixed sequence testing of each intersection hypotheses
34 Cytel Webinar Series. October 4-5, 2011
Example 1: Testing sequence is correct
If the order of testing is correctly specified, the fixed
sequence procedure is the most powerful
35 Cytel Webinar Series. October 4-5, 2011
Example 2: Testing sequence is incorrect
Fixed sequence procedure loses considerable power if testing
sequence is incorrect
36 Cytel Webinar Series. October 4-5, 2011
The Fall Back Procedure
• Assume H1, H2, ..., Hm are ordered hypotheses, w1, w2, . . . wm are
pre-specified weights with m
i=1 wi = 1, and p1, p2, ..., pm are the
associated raw p-values. The testing proceeds as follows:
– Step 1. Test H1 at α1 = w1α. If p1 ≤ α1, reject H1; otherwise
retain it and go to the next step.
– Step i = 2, . . . , m. Test Hi at αi = αi−1 + wiα if Hi−1 is
rejected and at αi = wiα if Hi−1 is retained. If pi ≤ αi, reject
Hi; otherwise retain it and go to the next step.
• Provides option to continue even if a hypothesis is retained. Hence
good insurance policy in case incorrect testing order was specified
• Specializes to fixed sequence test if w1 = 1 and w2 = · · · = wm = 0
• Was shown by Wiens and Dmitreinko (2005) to be a closed test
37 Cytel Webinar Series. October 4-5, 2011
Example 3: Performance of Fall Back Test
under Incorrect Testing Sequence
Test H1 at level α1 = α/3. If H1 is rejected, test H2 at level α2 = α1 + α/3,
otherwise test H2 at level α2 = α/3. If H2 is rejected test H3 at level
α2 + α/3, otherwise test H3 at level α/3
Fall back procedure is almost as good as Hommel despite
guessing treatment order incorrectly
38 Cytel Webinar Series. October 4-5, 2011
Example 4: Performance of Fall Back Test
under Correct Testing Sequence
In this example the fall back test is almost as good as the
fixed sequence test when the ordering is correctly specified
and is superior to the fixed sequence test when the ordering is
incorrectly specified
39 Cytel Webinar Series. October 4-5, 2011
Which Test to Use?
40 Cytel Webinar Series. October 4-5, 2011
Download SiZ 2.0 30-day demo version
www.cytel.com/Software/SiZ.aspx
or
Request a CD:
info@cytel.com
617-661-2011
41 Cytel Webinar Series. October 4-5, 2011

More Related Content

Similar to Si z slides-multiple comparisons procedures (east) 41pg

EUGM 2011 | JEHL | group sequential designs with 2 time to event endpoints
EUGM 2011 | JEHL | group sequential designs with 2 time to event endpointsEUGM 2011 | JEHL | group sequential designs with 2 time to event endpoints
EUGM 2011 | JEHL | group sequential designs with 2 time to event endpointsCytel USA
 
20150603 establishing theoretical minimal sets of mutants (Lab seminar)
20150603 establishing theoretical minimal sets of mutants (Lab seminar)20150603 establishing theoretical minimal sets of mutants (Lab seminar)
20150603 establishing theoretical minimal sets of mutants (Lab seminar)Donghwan Shin
 
Performance analysis of logic
Performance analysis of logicPerformance analysis of logic
Performance analysis of logicijcsa
 
A GARCH analysis of dark-pool trades - Philippe de Peretti, Oren J. Tapiero ....
A GARCH analysis of dark-pool trades - Philippe de Peretti, Oren J. Tapiero ....A GARCH analysis of dark-pool trades - Philippe de Peretti, Oren J. Tapiero ....
A GARCH analysis of dark-pool trades - Philippe de Peretti, Oren J. Tapiero ....SYRTO Project
 
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...Aalto University
 
Experimental Design
Experimental  DesignExperimental  Design
Experimental DesignGreenAvatar
 
Chernick.Michael.ppt
Chernick.Michael.pptChernick.Michael.ppt
Chernick.Michael.pptABINASHPADHY6
 
Chernick.Michael.ppt
Chernick.Michael.pptChernick.Michael.ppt
Chernick.Michael.pptalizain9604
 
Chernick.Michael (1).ppt
Chernick.Michael (1).pptChernick.Michael (1).ppt
Chernick.Michael (1).pptalizain9604
 
The stepped wedge cluster randomised trial workshop: session 3
The stepped wedge cluster randomised trial workshop: session 3The stepped wedge cluster randomised trial workshop: session 3
The stepped wedge cluster randomised trial workshop: session 3Karla hemming
 
Research and reporting methods for the stepped wedge cluster randomized trial...
Research and reporting methods for the stepped wedge cluster randomized trial...Research and reporting methods for the stepped wedge cluster randomized trial...
Research and reporting methods for the stepped wedge cluster randomized trial...NIHR CLAHRC West Midlands
 
Take it to the Limit: quantitation, likelihood, modelling and other matters
Take it to the Limit: quantitation, likelihood, modelling and other mattersTake it to the Limit: quantitation, likelihood, modelling and other matters
Take it to the Limit: quantitation, likelihood, modelling and other mattersStephen Senn
 
1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inferenceDev Pandey
 
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...niranjan kumar
 
Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Simple Variance Ana...
Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Simple Variance Ana...Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Simple Variance Ana...
Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Simple Variance Ana...J. García - Verdugo
 

Similar to Si z slides-multiple comparisons procedures (east) 41pg (20)

Stat sample test ch 11
Stat sample test ch 11Stat sample test ch 11
Stat sample test ch 11
 
EUGM 2011 | JEHL | group sequential designs with 2 time to event endpoints
EUGM 2011 | JEHL | group sequential designs with 2 time to event endpointsEUGM 2011 | JEHL | group sequential designs with 2 time to event endpoints
EUGM 2011 | JEHL | group sequential designs with 2 time to event endpoints
 
20150603 establishing theoretical minimal sets of mutants (Lab seminar)
20150603 establishing theoretical minimal sets of mutants (Lab seminar)20150603 establishing theoretical minimal sets of mutants (Lab seminar)
20150603 establishing theoretical minimal sets of mutants (Lab seminar)
 
Performance analysis of logic
Performance analysis of logicPerformance analysis of logic
Performance analysis of logic
 
A GARCH analysis of dark-pool trades - Philippe de Peretti, Oren J. Tapiero ....
A GARCH analysis of dark-pool trades - Philippe de Peretti, Oren J. Tapiero ....A GARCH analysis of dark-pool trades - Philippe de Peretti, Oren J. Tapiero ....
A GARCH analysis of dark-pool trades - Philippe de Peretti, Oren J. Tapiero ....
 
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
Model-Based User Interface Optimization: Part IV: ADVANCED TOPICS - At SICSA ...
 
STATISTIC ESTIMATION
STATISTIC ESTIMATIONSTATISTIC ESTIMATION
STATISTIC ESTIMATION
 
Experimental Design
Experimental  DesignExperimental  Design
Experimental Design
 
Chernick.Michael.ppt
Chernick.Michael.pptChernick.Michael.ppt
Chernick.Michael.ppt
 
Chernick.Michael.ppt
Chernick.Michael.pptChernick.Michael.ppt
Chernick.Michael.ppt
 
Chernick.Michael (1).ppt
Chernick.Michael (1).pptChernick.Michael (1).ppt
Chernick.Michael (1).ppt
 
The stepped wedge cluster randomised trial workshop: session 3
The stepped wedge cluster randomised trial workshop: session 3The stepped wedge cluster randomised trial workshop: session 3
The stepped wedge cluster randomised trial workshop: session 3
 
Research and reporting methods for the stepped wedge cluster randomized trial...
Research and reporting methods for the stepped wedge cluster randomized trial...Research and reporting methods for the stepped wedge cluster randomized trial...
Research and reporting methods for the stepped wedge cluster randomized trial...
 
Why do a designed experiment
Why do a designed experimentWhy do a designed experiment
Why do a designed experiment
 
Take it to the Limit: quantitation, likelihood, modelling and other matters
Take it to the Limit: quantitation, likelihood, modelling and other mattersTake it to the Limit: quantitation, likelihood, modelling and other matters
Take it to the Limit: quantitation, likelihood, modelling and other matters
 
Design of experiments(
Design of experiments(Design of experiments(
Design of experiments(
 
1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference1192012 155942 f023_=_statistical_inference
1192012 155942 f023_=_statistical_inference
 
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
VOICE PASSWORD BASED SPEAKER VERIFICATION SYSTEM USING VOWEL AND NON VOWEL RE...
 
Tests of significance
Tests of significance  Tests of significance
Tests of significance
 
Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Simple Variance Ana...
Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Simple Variance Ana...Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Simple Variance Ana...
Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Simple Variance Ana...
 

More from Cytel

StatXact 10 Procedures Roadmap
StatXact 10 Procedures RoadmapStatXact 10 Procedures Roadmap
StatXact 10 Procedures RoadmapCytel
 
Stat xact10 infosheet
Stat xact10 infosheetStat xact10 infosheet
Stat xact10 infosheetCytel
 
Si z mc-datasheet-
Si z mc-datasheet-Si z mc-datasheet-
Si z mc-datasheet-Cytel
 
Si z mc-datasheet- 2
Si z mc-datasheet- 2Si z mc-datasheet- 2
Si z mc-datasheet- 2Cytel
 
Si z siz vs. nquery
Si z siz vs. nquerySi z siz vs. nquery
Si z siz vs. nqueryCytel
 
Si z multi arm trials
Si z multi arm trialsSi z multi arm trials
Si z multi arm trialsCytel
 
East ugm-2012-presentation-east-future-mehta
East ugm-2012-presentation-east-future-mehtaEast ugm-2012-presentation-east-future-mehta
East ugm-2012-presentation-east-future-mehtaCytel
 
East Infosheet
East InfosheetEast Infosheet
East InfosheetCytel
 
East Sample Size Re-Estimation Infosheet
East Sample Size Re-Estimation InfosheetEast Sample Size Re-Estimation Infosheet
East Sample Size Re-Estimation InfosheetCytel
 
East Bayesian Assurance Infosheet
East Bayesian Assurance InfosheetEast Bayesian Assurance Infosheet
East Bayesian Assurance InfosheetCytel
 
East 6.2 Architect Infosheet
East 6.2 Architect InfosheetEast 6.2 Architect Infosheet
East 6.2 Architect InfosheetCytel
 
East 6.2 architect-brochure
East 6.2 architect-brochureEast 6.2 architect-brochure
East 6.2 architect-brochureCytel
 
East sample size re estimation
East sample size re estimationEast sample size re estimation
East sample size re estimationCytel
 
East bayesian power calculations
East bayesian power calculationsEast bayesian power calculations
East bayesian power calculationsCytel
 
East architect brochure
East architect brochureEast architect brochure
East architect brochureCytel
 
Compass Infosheet
Compass InfosheetCompass Infosheet
Compass InfosheetCytel
 
Clinical outsourcing
Clinical outsourcingClinical outsourcing
Clinical outsourcingCytel
 
ACES - What is ACES And Who is Using It
ACES - What is ACES And Who is Using ItACES - What is ACES And Who is Using It
ACES - What is ACES And Who is Using ItCytel
 
Aces overview & features
Aces overview & featuresAces overview & features
Aces overview & featuresCytel
 
Therapeutic Area Standards: Reflections on Oncology Standards and What is Nee...
Therapeutic Area Standards: Reflections on Oncology Standards and What is Nee...Therapeutic Area Standards: Reflections on Oncology Standards and What is Nee...
Therapeutic Area Standards: Reflections on Oncology Standards and What is Nee...Cytel
 

More from Cytel (20)

StatXact 10 Procedures Roadmap
StatXact 10 Procedures RoadmapStatXact 10 Procedures Roadmap
StatXact 10 Procedures Roadmap
 
Stat xact10 infosheet
Stat xact10 infosheetStat xact10 infosheet
Stat xact10 infosheet
 
Si z mc-datasheet-
Si z mc-datasheet-Si z mc-datasheet-
Si z mc-datasheet-
 
Si z mc-datasheet- 2
Si z mc-datasheet- 2Si z mc-datasheet- 2
Si z mc-datasheet- 2
 
Si z siz vs. nquery
Si z siz vs. nquerySi z siz vs. nquery
Si z siz vs. nquery
 
Si z multi arm trials
Si z multi arm trialsSi z multi arm trials
Si z multi arm trials
 
East ugm-2012-presentation-east-future-mehta
East ugm-2012-presentation-east-future-mehtaEast ugm-2012-presentation-east-future-mehta
East ugm-2012-presentation-east-future-mehta
 
East Infosheet
East InfosheetEast Infosheet
East Infosheet
 
East Sample Size Re-Estimation Infosheet
East Sample Size Re-Estimation InfosheetEast Sample Size Re-Estimation Infosheet
East Sample Size Re-Estimation Infosheet
 
East Bayesian Assurance Infosheet
East Bayesian Assurance InfosheetEast Bayesian Assurance Infosheet
East Bayesian Assurance Infosheet
 
East 6.2 Architect Infosheet
East 6.2 Architect InfosheetEast 6.2 Architect Infosheet
East 6.2 Architect Infosheet
 
East 6.2 architect-brochure
East 6.2 architect-brochureEast 6.2 architect-brochure
East 6.2 architect-brochure
 
East sample size re estimation
East sample size re estimationEast sample size re estimation
East sample size re estimation
 
East bayesian power calculations
East bayesian power calculationsEast bayesian power calculations
East bayesian power calculations
 
East architect brochure
East architect brochureEast architect brochure
East architect brochure
 
Compass Infosheet
Compass InfosheetCompass Infosheet
Compass Infosheet
 
Clinical outsourcing
Clinical outsourcingClinical outsourcing
Clinical outsourcing
 
ACES - What is ACES And Who is Using It
ACES - What is ACES And Who is Using ItACES - What is ACES And Who is Using It
ACES - What is ACES And Who is Using It
 
Aces overview & features
Aces overview & featuresAces overview & features
Aces overview & features
 
Therapeutic Area Standards: Reflections on Oncology Standards and What is Nee...
Therapeutic Area Standards: Reflections on Oncology Standards and What is Nee...Therapeutic Area Standards: Reflections on Oncology Standards and What is Nee...
Therapeutic Area Standards: Reflections on Oncology Standards and What is Nee...
 

Si z slides-multiple comparisons procedures (east) 41pg

  • 1. New Developments in East: Design and Simulation of Trials with Multiple Treatment Arms Cytel Webinar Series October 4-5, 2011 Cyrus R. Mehta, Ph.D Cytel Inc., Cambridge, MA email: mehta@cytel.com – web: www.cytel.com – tel: 617-661-2011 1 Cytel Webinar Series. October 4-5, 2011
  • 2. Outline of Presentation Multiple Comparisons Procedures in SiZ(TM) • Overview of SiZ(TM) • Brief introduction to MCP and MCP module in SiZ(TM) • Motivating example: Alzheimers disease • Parametric tests (Dunnett, Step-Down Dunnett) • Nonparametric tests (Bonferroni, Holms, Hochberg, Hommel, Fixed Sequence, Fall Back) • Principle of closed testing • Short-cuts to closed testing New Architecture for East-6 • Integration of SiZ(TM) , East(R) and modules for adaptive multi-arm trials into one unified platform 2 Cytel Webinar Series. October 4-5, 2011
  • 3. Quick Overview of SiZ 3 Cytel Webinar Series. October 4-5, 2011
  • 4. Some Sources of Multiplicity • Repeated significance tests • Multiple treatment arms • Multiple endpoints • Subgroup analysis • Variable selection in regression models 4 Cytel Webinar Series. October 4-5, 2011
  • 5. Error Rates for Multiplicity Problems • There is a ‘family’ of m inferences • Parameters are δ1, δ2, . . . δm • Null hypotheses are H1, H2, . . . Hm • Comparisonwise error rate applies to an individual hypothesis; offers ‘local control’ of type-1 error CERj = P (reject Hj|Hj is true) • Familywise error rate applies to the entire family FWER = P (reject at least one true null hypotheses) • Usually wish to control FWER at some level α 5 Cytel Webinar Series. October 4-5, 2011
  • 6. Strong and Weak Control of FWER • Control of FWER at level α means that P (reject at least one true null hypotheses) ≤ α (1) • Strong Control of FWER means that (1) is satisfied under all partial null hypotheses of the type HI = ∩i∈IHi for all subsets I ⊆ {1, 2, . . . m} • Weak Control of FWER means that (1) is satisfied only under some HI, typically I = {1, 2, . . . m} • Strong control of FWER is a regulatory requirement if multiple statements about product efficacy are to be included in the product label 6 Cytel Webinar Series. October 4-5, 2011
  • 7. Motivation for MCP Design Software • Many confirmatory trials have multiple arms or multiple endpoints • Software to perform multiplicity adjusted analyses for such trials exists (in R and SAS) • Sample size software for such designs is, however, limited • SiZ can evaluate the operating characteristics of the different MCPs and choose the best one for the study 7 Cytel Webinar Series. October 4-5, 2011
  • 8. Motivating Example: Alzheimer’s Disease • Randomized, double-blind, placebo controlled, parallel group trial • Three doses (0.3 mg, 1 mg, 2 mg) compared to placebo • Primary endpoint: change from baseline in ADAS-cog-11 at week 24 • Difference from placebo expected to be between 1.5 and 2.5 units with common standard deviation σ = 5 Multiplicity arises because the trial is considered successful if at least one dose is declared statistically significant 8 Cytel Webinar Series. October 4-5, 2011
  • 9. The MCP Module in SiZ • Simulation based sample size calculations • Parametric Tests: Single-step Dunnett and step-down Dunnett • Nonparametric Tests: Bonferroni, Sidak, Weighted Bonferroni, Holm, Hochberg, Hommel, Fixed-sequence, Fall-back 9 Cytel Webinar Series. October 4-5, 2011
  • 10. Dunnett’s Procedure • Let Yij ∼ N(μi, σ2 ) be response of subject j = 1, 2, . . . n on treatment i = 0, 1, 2, . . . m, where i = 0 denotes the control arm • The marginal t-statistic for ith treatment effect is ti = yi − y0 s 2/n • Denote the cumulative distribution function for the maximum ti by F (x|m, ν) = P {max(T1, T2, . . . Tm) ≤ x} • Under H0: μi − μ0 = 0 for all i, F (x|m, ν) is multivariate-t with ν = (m + 1)(n − 1) degrees of freedom • Compute qα,m, the (1 − α) quantile of F , defined by F (qα,m|m, ν) = 1 − α The critical value qα,m is evaluated by numerical integration 10 Cytel Webinar Series. October 4-5, 2011
  • 11. Single-Step Dunnett Reject every null hypothesis Hi: μi − μ0 = 0 for which ti ≥ qα,m 11 Cytel Webinar Series. October 4-5, 2011
  • 12. Simulations of Single-Step Dunnett • A total sample size of 360 patients produces the desired 90% global power with Dunnett’s single-step procedure 12 Cytel Webinar Series. October 4-5, 2011
  • 13. Power Definitions for Multiplicity Problems Global Power: Probability of rejecting at least one null hypothesis Disjunctive Power: Probability of rejecting at least one null hypothesis that is false Conjunctive Power: Probability of rejecting all null hypotheses that are false Global and Disjunctive power are usually almost the same 13 Cytel Webinar Series. October 4-5, 2011
  • 14. Step-Down Dunnett Key Idea: Because the test statistics T1, T2, . . . Tm are positively correlated, if you reject a hypothesis, the rejection criteria for the remaining hypotheses can be weakened Accordingly let: qα,m satisfy F (qα,m|m, ν) = α; denote it by c1 qα,m−1 satisfy F (qα,m−1|m − 1, ν) = α; denote it by c2 ... qα,1 satisfy F (qα,1|1, ν) = α; denote it by cm 14 Cytel Webinar Series. October 4-5, 2011
  • 15. Step-Down Dunnett, contd. Let T(1) ≥ T(2) ≥ · · · ≥ T(m) denote the m order statistics • Step 1. If t(1) ≥ c1, reject H(1) and go to the next step. Otherwise retain all hypotheses and stop. • Steps i = 2, . . . , m − 1. If t(i) ≥ ci, reject H(i) and go to the next step. Otherwise retain H(i), . . . H(m) and stop. • Step m. If t(m) ≥ cm reject H(m). Otherwise retain H(m). Since c1 > c2 > · · · > cm, step-down Dunnett rejects at least as many (and sometimes more) hypotheses as single step 15 Cytel Webinar Series. October 4-5, 2011
  • 16. Simulations of Step-Down Dunnett • The step-down test improves the conjunctive power • Global and disjunctive power are the same in this example since all null hypotheses are false 16 Cytel Webinar Series. October 4-5, 2011
  • 17. Advantages and Limitations of Dunnett Advantages • More powerful than nonparametric procedures if assumptions are met • Generates multiplicity adjusted confidence intervals for individual treatment effects Limitations • Relies on normality assumption • Relies on homoscedasticity assumption 17 Cytel Webinar Series. October 4-5, 2011
  • 18. Dunnett’s FWER under unequal variance In 100,000 simulations FWER was not preserved for Dunnett but was preserved for Bonferroni and Sidak (two non-parametric MCPs) 18 Cytel Webinar Series. October 4-5, 2011
  • 19. Nonparametric MCPs • Do not require any distributional assumptions • Bonferroni and Sidak are single-step MCPs • Holms is a step-down MCP (start with the biggest effect and work your way down) • Hochberg and Hommel are step-up MCPs (start with the smallest effect and work your way up) • Fixed Sequence and Fall Back procedures test each individual hypothesis in a pre-specified order 19 Cytel Webinar Series. October 4-5, 2011
  • 20. Bonferroni Procedure • Let p1, p2, . . . pm be the m marginal p-values • The Bonferroni procedure rejects any Hi for which pi ≤ α m 20 Cytel Webinar Series. October 4-5, 2011
  • 21. Comparing Dunnett and Bonferroni Dunnett loses power because of heteroscedasticity 21 Cytel Webinar Series. October 4-5, 2011
  • 22. The Weighted Bonferroni • Let wi < 1 be the fraction of α allocated to testing Hi, where m i=1 wi = 1 • The weighted Bonferroni procedure rejects any Hi for which pi ≤ wiα Note: The regular Bonferroni is a special case of the weighted Bonferroni in which wi = 1 m for all i 22 Cytel Webinar Series. October 4-5, 2011
  • 23. Can we improve on Bonferroni for testing individual hypotheses? • Suppose we are testing m individual hypotheses • Let p1, p2, . . . pi be the individual p-values • The Bonferrioni procedure rejects each Hi for which pi ≤ α m . Can we do better than a single cut-off? • Yes! By applying closed testing, we don’t require the same strict criterion for every individual hypothesis 23 Cytel Webinar Series. October 4-5, 2011
  • 24. Closed Testing of Individual Hypotheses Given individual hypotheses H1, H2, . . . Hm: 1. Construct the closed set consisting of all possible intersections of the individual hypotheses of the form Hi1 ∩ Hi2 ∩ · · · ∩ Hiq for all q = 1, 2, . . . m 2. Specify a local level-α test for each member of the closed set 3. An individual Hi may be rejected with strong control of FWER at level α if both these conditions hold: • Hi is rejected by its local level-α test • All intersection hypotheses that contain Hi are also rejected by their local level-α tests 24 Cytel Webinar Series. October 4-5, 2011
  • 25. Example of Closed Testing Acknowledgement: This slide has been taken from Peter Westfall’s notes 25 Cytel Webinar Series. October 4-5, 2011
  • 26. Testing the Intersection Hypotheses The key difference between one closed testing procedure and another is the method used to test intersection hypotheses of the form Hi1 ∩ Hi2 ∩ · · · ∩ Hiq . There are many candidates • Use Dunnetts test for the intersection hypotheses Short Cut: Step-down Dunnett procedure • Use the Bonferroni test for the intersection hypotheses Short Cut: Step-down Holms procedure • Using Simes test for the intersection hypotheses Short Cut: Step-up Hommel procedure • Using Hochberg’s test for the intersection hypotheses Short Cut: Step-up Hochberg procedure 26 Cytel Webinar Series. October 4-5, 2011
  • 27. Level-α Tests of H1 ∩ H2 ∩ · · · Hm Let p(1) ≤ p(2) ≤ . . . ≤ p(m) be the ordered p-values Bonferroni: Reject if p(1) ≤ α/m Hochberg: Reject if p(m) ≤ α or p(m−1) ≤ α 2 or p(m−2) ≤ α 3 or . . . or p(1) ≤ α m Simes: Reject if p(m) ≤ α or p(m−1) ≤ (m − 1)α m or p(m−2) ≤ (m − 2)α m or . . . or p(1) ≤ α m Hochberg is more powerful than Bonferroni. Simes is more powerful than Hochberg. Simes is only valid if the p-values are positively correlated 27 Cytel Webinar Series. October 4-5, 2011
  • 28. Holm’s Procedure • Order the p-values for the individual hypotheses in ascending order as p(1) ≤ p(2) ≤ · · · ≤ p(m) • If p(1) ≤ α/m reject H(1) and continue testing; else stop • If p(2) ≤ α/(m − 1) reject H(2) and continue testing; else stop • In general, reject p(i) if p(i) ≤ α/(m − i + 1) and H(1), H(2), . . . H(i−1) have already been rejected 28 Cytel Webinar Series. October 4-5, 2011
  • 29. Power Comparison: Bonferroni vs Holms Global and disjunctive power are equal; global power for Bonferroni and Holm are equal but big differences in conjunctive and individual power 29 Cytel Webinar Series. October 4-5, 2011
  • 30. Hochberg’s Procedure • Order the p-values in ascending order as p(1) ≤ p(2) · · · ≤ p(m) • If p(m) ≤ α, reject all m hypotheses and stop; otherwise retain H(m) and continue testing • If p(m−1) ≤ α/2, reject all m − 1 hypotheses and stop; otherwise retain H(m−1) and continue testing • If p(m−2) ≤ α/3, reject all m − 2 hypotheses and stop; otherwise retain H(m−1) and continue testing • In general, at any step i = 1, 2, . . . m, if p(m−i+1) ≤ α i reject all m − i + 1 hypotheses and stop; otherwise retain H(m−i+1) and continue testing 30 Cytel Webinar Series. October 4-5, 2011
  • 31. Hommel’s Procedure • Order the p-values in ascending order as p(1) ≤ p(2) · · · ≤ p(m) • If p(m) ≤ α, reject all m hypotheses and stop; otherwise retain H(m) and continue testing • If p(m) ≤ α or p(m−1) ≤ α/2, reject all m − 1 hypotheses and stop; otherwise retain H(m−1) and continue testing • If p(m) ≤ α or p(m−1) ≤ 2α/3 or p(m−2) ≤ α/3, reject all m − 2 hypotheses and stop; otherwise retain H(m−2) and continue testing • In general, at any step i = 1, 2, . . . m, if p(m) ≤ iα i or p(m−1) ≤ (i − 1)α i or p(m−2) ≤ (i − 2)α i . . . p(m−i+1) ≤ α i reject all m − i + 1 hypotheses and stop; otherwise retain H(m−i+1) and continue testing 31 Cytel Webinar Series. October 4-5, 2011
  • 32. Hommel versus Hochberg • Hommel’s procedure is clearly more powerful than Hochberg’s since it gives more chances for rejection at each step • But it is rarely used in practice because it is perceived to be too complicated • Hochberg is very simple to explain and use, hence more popular 32 Cytel Webinar Series. October 4-5, 2011
  • 33. Comparing Power for Bonferroni, Holms, Hochberg and Hommel Procedures 33 Cytel Webinar Series. October 4-5, 2011
  • 34. Fixed Sequence Testing • Assume H1, H2, ..., Hm are ordered hypotheses; i.e., μi ≥ μi−1, i = 1, 2, . . . m • Let p1, p2, ..., pm be the associated raw p-values – Step 1. If p1 < α, reject H1 and go to the next step. Otherwise retain all hypotheses and stop – Step i = 2, . . . , m − 1. If pi < α, reject Hi and go to the next step. Otherwise retain all the remaining hypotheses and stop – Step m. If pm < α, reject Hm; otherwise retain it. • More powerful than other procedures if ordering is correct • Closed under fixed sequence testing of each intersection hypotheses 34 Cytel Webinar Series. October 4-5, 2011
  • 35. Example 1: Testing sequence is correct If the order of testing is correctly specified, the fixed sequence procedure is the most powerful 35 Cytel Webinar Series. October 4-5, 2011
  • 36. Example 2: Testing sequence is incorrect Fixed sequence procedure loses considerable power if testing sequence is incorrect 36 Cytel Webinar Series. October 4-5, 2011
  • 37. The Fall Back Procedure • Assume H1, H2, ..., Hm are ordered hypotheses, w1, w2, . . . wm are pre-specified weights with m i=1 wi = 1, and p1, p2, ..., pm are the associated raw p-values. The testing proceeds as follows: – Step 1. Test H1 at α1 = w1α. If p1 ≤ α1, reject H1; otherwise retain it and go to the next step. – Step i = 2, . . . , m. Test Hi at αi = αi−1 + wiα if Hi−1 is rejected and at αi = wiα if Hi−1 is retained. If pi ≤ αi, reject Hi; otherwise retain it and go to the next step. • Provides option to continue even if a hypothesis is retained. Hence good insurance policy in case incorrect testing order was specified • Specializes to fixed sequence test if w1 = 1 and w2 = · · · = wm = 0 • Was shown by Wiens and Dmitreinko (2005) to be a closed test 37 Cytel Webinar Series. October 4-5, 2011
  • 38. Example 3: Performance of Fall Back Test under Incorrect Testing Sequence Test H1 at level α1 = α/3. If H1 is rejected, test H2 at level α2 = α1 + α/3, otherwise test H2 at level α2 = α/3. If H2 is rejected test H3 at level α2 + α/3, otherwise test H3 at level α/3 Fall back procedure is almost as good as Hommel despite guessing treatment order incorrectly 38 Cytel Webinar Series. October 4-5, 2011
  • 39. Example 4: Performance of Fall Back Test under Correct Testing Sequence In this example the fall back test is almost as good as the fixed sequence test when the ordering is correctly specified and is superior to the fixed sequence test when the ordering is incorrectly specified 39 Cytel Webinar Series. October 4-5, 2011
  • 40. Which Test to Use? 40 Cytel Webinar Series. October 4-5, 2011
  • 41. Download SiZ 2.0 30-day demo version www.cytel.com/Software/SiZ.aspx or Request a CD: info@cytel.com 617-661-2011 41 Cytel Webinar Series. October 4-5, 2011