SlideShare a Scribd company logo
1 of 54
Download to read offline
Introduction Methodology Results Conclusions Future Work Contributions
B.Tech. Project
Attaining maximum power under cost constraints
in single gene experiments
P Jishnu Jaykumar (201352005)
Abhijeet Singh Panwar (201351005)
under the supervision of
Dr. Bhargab Chattopadhyay
Indian Institute of Information Technology,
Vadodara
May 5, 2017
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 1/27
Introduction Methodology Results Conclusions Future Work Contributions
Introduction
Single gene experiments are assays for querying ribonu-
cleic acid species abundance in individual or pools of
cells.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
Introduction Methodology Results Conclusions Future Work Contributions
Introduction
Single gene experiments are assays for querying ribonu-
cleic acid species abundance in individual or pools of
cells.
They have enormous importance in varied fields like evo-
lution, pathology and drug development.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
Introduction Methodology Results Conclusions Future Work Contributions
Introduction
Single gene experiments are assays for querying ribonu-
cleic acid species abundance in individual or pools of
cells.
They have enormous importance in varied fields like evo-
lution, pathology and drug development.
However, these experiments are usually performed by
pre-specifying the sample size during experimental de-
sign, which may lead to under powered or over powered
hypothesis tests.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
Introduction Methodology Results Conclusions Future Work Contributions
Introduction
Single gene experiments are assays for querying ribonu-
cleic acid species abundance in individual or pools of
cells.
They have enormous importance in varied fields like evo-
lution, pathology and drug development.
However, these experiments are usually performed by
pre-specifying the sample size during experimental de-
sign, which may lead to under powered or over powered
hypothesis tests.
This resulted in a shift to experimental design
paradigms with minimal number of replicates to maxi-
mize power.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
Introduction Methodology Results Conclusions Future Work Contributions
Introduction
Single gene experiments are assays for querying ribonu-
cleic acid species abundance in individual or pools of
cells.
They have enormous importance in varied fields like evo-
lution, pathology and drug development.
However, these experiments are usually performed by
pre-specifying the sample size during experimental de-
sign, which may lead to under powered or over powered
hypothesis tests.
This resulted in a shift to experimental design
paradigms with minimal number of replicates to maxi-
mize power.
Several studies have analyzed sample size required to
maximize power in hypothesis testing in RNA abun-
dance experiments.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
Introduction Methodology Results Conclusions Future Work Contributions
Introduction - Contd.
Recently, for the two-stage experimental design frame-
work, algorithms like SCOTTY have been proposed.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 3/27
Introduction Methodology Results Conclusions Future Work Contributions
Introduction - Contd.
Recently, for the two-stage experimental design frame-
work, algorithms like SCOTTY have been proposed.
But SCOTTY fails, if there are multiple (> 2) cell types.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 3/27
Introduction Methodology Results Conclusions Future Work Contributions
Introduction - Contd.
Recently, for the two-stage experimental design frame-
work, algorithms like SCOTTY have been proposed.
But SCOTTY fails, if there are multiple (> 2) cell types.
Also, two-stage procedures are imperfect if pilot sample
is unrepresentative.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 3/27
Introduction Methodology Results Conclusions Future Work Contributions
Introduction - Contd.
Recently, for the two-stage experimental design frame-
work, algorithms like SCOTTY have been proposed.
But SCOTTY fails, if there are multiple (> 2) cell types.
Also, two-stage procedures are imperfect if pilot sample
is unrepresentative.
Thus the focus is to maximize the power of the hypoth-
esis test related to a single gene belonging to more than
two cell types under a cost constraint.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 3/27
Introduction Methodology Results Conclusions Future Work Contributions
Some facts
In experimental research, limited funding is allotted be-
forehand to carry out the sampling process.
Thus under a cost constraint we have to carry out the
test of equivalence or inferiority (or superiority) with
minimum errors.
Note that we fix the probability of type-I error to α (0.05
in our case),
So the idea is to minimize the probability of type-II
error or equivalently maximizing the power under cost
constraints.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 4/27
Introduction Methodology Results Conclusions Future Work Contributions
Formulation
Suppose there are K cell types belonging to a particular
gene.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 5/27
Introduction Methodology Results Conclusions Future Work Contributions
Formulation
Suppose there are K cell types belonging to a particular
gene.
For the ith
cell type, suppose Xi1, . . . , Xini
be iid ran-
dom variables, not necessary normal, with means µi and
variances σ2
i .
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 5/27
Introduction Methodology Results Conclusions Future Work Contributions
Formulation
Suppose there are K cell types belonging to a particular
gene.
For the ith
cell type, suppose Xi1, . . . , Xini
be iid ran-
dom variables, not necessary normal, with means µi and
variances σ2
i .
Consider δ = c µ K
i=1 ciµi , c = (c1, . . . , cK) is
known.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 5/27
Introduction Methodology Results Conclusions Future Work Contributions
Formulation
Suppose there are K cell types belonging to a particular
gene.
For the ith
cell type, suppose Xi1, . . . , Xini
be iid ran-
dom variables, not necessary normal, with means µi and
variances σ2
i .
Consider δ = c µ K
i=1 ciµi , c = (c1, . . . , cK) is
known.
Estimator of δ is δn = K
i=1 ci
¯Xni
.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 5/27
Introduction Methodology Results Conclusions Future Work Contributions
Formulation Contd.
γi’s: sample size allocation ratios (unknown), ni = γin1,
γ1 = 1.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 6/27
Introduction Methodology Results Conclusions Future Work Contributions
Formulation Contd.
γi’s: sample size allocation ratios (unknown), ni = γin1,
γ1 = 1.
V ar(δn) = K
i=1 c2
i σ2
i /ni = 1
n1
K
i=1 c2
i σ2
i /γi.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 6/27
Introduction Methodology Results Conclusions Future Work Contributions
Formulation Contd.
γi’s: sample size allocation ratios (unknown), ni = γin1,
γ1 = 1.
V ar(δn) = K
i=1 c2
i σ2
i /ni = 1
n1
K
i=1 c2
i σ2
i /γi.
Using CLT,
√
n1(δn − δ)
K
i=1 c2
i σ2
i /γi
→
D
N(0, 1)
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 6/27
Introduction Methodology Results Conclusions Future Work Contributions
Test for Equivalence
Compare δ, to two equivalence margins with δ1 < δ < δ2
H0 : δ ≤ δ1 or δ ≥ δ2 against HA : δ1 < δ < δ2
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 7/27
Introduction Methodology Results Conclusions Future Work Contributions
Test for Equivalence
Compare δ, to two equivalence margins with δ1 < δ < δ2
H0 : δ ≤ δ1 or δ ≥ δ2 against HA : δ1 < δ < δ2
Consider the test for equivalence. Then the null hypoth-
esis H0 is rejected at α% if
δ1 + zα V ar(δn) < δn < δ2 + zα V ar(δn)
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 7/27
Introduction Methodology Results Conclusions Future Work Contributions
Test for Equivalence
Compare δ, to two equivalence margins with δ1 < δ < δ2
H0 : δ ≤ δ1 or δ ≥ δ2 against HA : δ1 < δ < δ2
Consider the test for equivalence. Then the null hypoth-
esis H0 is rejected at α% if
δ1 + zα V ar(δn) < δn < δ2 + zα V ar(δn)
The approximate power function is
PTE(δ) = Φ

−zα +
δ2 − δ
V ar(ˆδn)

−Φ

zα −
δ − δ1
V ar(ˆδn)


P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 7/27
Introduction Methodology Results Conclusions Future Work Contributions
Test for Inferiority
Compare δ, to δ0, we have,
H0 : δ ≤ −|δ0| against HA : δ > −|δ0|
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 8/27
Introduction Methodology Results Conclusions Future Work Contributions
Test for Inferiority
Compare δ, to δ0, we have,
H0 : δ ≤ −|δ0| against HA : δ > −|δ0|
Consider the inferiority test. Then the null hypothesis
H0 is rejected at α% if
δn > −|δ0| + zα V ar(δn)
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 8/27
Introduction Methodology Results Conclusions Future Work Contributions
Test for Inferiority
Compare δ, to δ0, we have,
H0 : δ ≤ −|δ0| against HA : δ > −|δ0|
Consider the inferiority test. Then the null hypothesis
H0 is rejected at α% if
δn > −|δ0| + zα V ar(δn)
The approximate power function
PTI(δ) = Φ

−zα +
δ + | δ0 |
V ar(ˆδn)


P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 8/27
Introduction Methodology Results Conclusions Future Work Contributions
Maximizing Power Under Cost Constraints
Suppose, we have a certain amount of money to carry
out sampling, say |A0.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 9/27
Introduction Methodology Results Conclusions Future Work Contributions
Maximizing Power Under Cost Constraints
Suppose, we have a certain amount of money to carry
out sampling, say |A0.
If it costs |ai to sample each observation from ith
cell
type, then A0 = K
i=1 aini =⇒ n1 = A0
K
i=1 aiγi
.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 9/27
Introduction Methodology Results Conclusions Future Work Contributions
Maximizing Power Under Cost Constraints
Suppose, we have a certain amount of money to carry
out sampling, say |A0.
If it costs |ai to sample each observation from ith
cell
type, then A0 = K
i=1 aini =⇒ n1 = A0
K
i=1 aiγi
.
In order to maximize power, the optimal allocation ratio
γi =
| ci | σi
| c1 | σ1
a1
ai
(1)
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 9/27
Introduction Methodology Results Conclusions Future Work Contributions
Maximizing Power Under Cost Constraints
Suppose, we have a certain amount of money to carry
out sampling, say |A0.
If it costs |ai to sample each observation from ith
cell
type, then A0 = K
i=1 aini =⇒ n1 = A0
K
i=1 aiγi
.
In order to maximize power, the optimal allocation ratio
γi =
| ci | σi
| c1 | σ1
a1
ai
(1)
The optimal sample size for ith
cell type is,
nio =
A0 | ci | σi
K
i=1 | cl | σl
√
alai
(2)
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 9/27
Introduction Methodology Results Conclusions Future Work Contributions
Sequential Procedure
Step 1: First obtain Nio = m(≥ 2) observations from
each of the cell types and for ith
cell type ,check
m ≥
A0 | Ci | SiNio
K
i=1 | cl | slNlo
√
alai
If for ith
cell type, the above condition gets satisfied, no
further observations should be obtained from that cell
type. Else go to step 2 after carrying out step 1 for all
cell types.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 10/27
Introduction Methodology Results Conclusions Future Work Contributions
Sequential Procedure Contd.
Step 2: Add m observations corresponding to the ith
cell type and set Nio = m + m and check,
m + m ≥
A0 | Ci | SiNio
K
i=1 | cl | slNlo
√
alai
If for ith
cell type, the above condition gets satisfied,
no further sampling needs to be done for that cell type.
Else repeat step 2 and continue this until for all cell
types, the condition gets satisfied.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 11/27
Introduction Methodology Results Conclusions Future Work Contributions
Sequential Procedure and Characteristics
One can find optimal sample size using the stopping
rule. Nio, is the smallest integer, such that
nio ≥
A0 | Ci | Sinio
K
i=1 | cl | slnlo
√
alai
(3)
The total cost of sampling K
i=1 Nio observations, Nio
being the estimated final optimal sample size for ith
cell
type computed using Equation 4, is |A0 .
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 12/27
Introduction Methodology Results Conclusions Future Work Contributions
Contd.
The optimal allocation ratio as defined in Equation 2 to
derive minimum variance depends on population variances
of the K cell types. In practice, the value of the population
variance of each cell type is unknown, therefore, the sample
size required to obtain maximum power of the test cannot
be computed.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 13/27
Introduction Methodology Results Conclusions Future Work Contributions
Graphical Representation
Flowchart that describes the sequential procedure developed.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 14/27
Introduction Methodology Results Conclusions Future Work Contributions
One of the simulation results
Cell Type |ci| ai Nio s(Nio) nio Nio/nio
1 N(µ=1, σ=2) 3 4 467.01 0.23 469 0.99
2 N(µ=2, σ=2) 3 4 233.49 0.78 235 0.99
3 N(µ=3, σ=4) 3 4 1168.9 0.48 1171 0.998
4 N(µ=4, σ=3) 1 2 440.3 0.70 442 0.996
5 N(µ=5, σ=2) 2 3 1080.92 0.80 1082 0.998
6 N(µ=6, σ=4) 3 4 1873.8 0.66 1873 1
7 N(µ=7, σ=5) 4 4 2187.02 0.72 2185 1
8 N(µ=8, σ=6) 5 3 1351.93 1.04 1352 1
9 N(µ=9, σ=3) 6 2 5968.11 1.09 5959 1.001
10 N(µ=10, σ=5) 7 2 3092.26 0.45 3090 1
Final Sample Sizes for Normal Distribution, simulated 5000
times when A0=|50000
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 15/27
Introduction Methodology Results Conclusions Future Work Contributions
One of the simulation results
Cell Type |ci| ai Nio s(Nio) nio Nio/nio
1 .95N(1,2)+.05N(20,4) 3 4 59 0.141 60 0.98
2 .95N(2,2)+.05N(20,4) 3 4 58 0.140 60 0.96
3 .95N(3,4)+.05N(20,4) 3 4 118 0.210 120 0.98
4 .95N(4,3)+.05N(20,4) 1 2 41 0.122 43 0.953
5 .95N(5,2)+.05N(20,4) 2 3 45 0.123 47 0.957
6 .95N(6,4)+.05N(20,4) 3 4 118 0.209 120 0.98
7 .95N(7,5)+.05N(20,4) 4 4 200 0.271 200 1
8 .95N(8,6)+.05N(20,4) 5 3 338 0.331 347 0.974
9 .95N(9,3)+.05N(20,4) 6 2 259 0.303 255 1.01
10 .95N(10,5)+.05N(20,4) 7 2 493 0.448 495 0.995
Final Sample Sizes for Mixture-Normal Distribution, simulated
5000 times Distribution when A0=|50000
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 16/27
Introduction Methodology Results Conclusions Future Work Contributions
Graph
Figure: Accuracy w.r.t. to all the distributions taken into account
for the simulation study.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 17/27
Introduction Methodology Results Conclusions Future Work Contributions
About the toolkit
A toolkit (web-application) for the sequential procedure
was also developed.
Input: A xlsx file containing data in a predefined format.
Output: Optimal sample sizes for each cell type speci-
fied in the input file.
Intermediate stages of the procedure can also be viewed.
It is temporarily hosted at http://139.59.74.69/
alpha_version/
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 18/27
Introduction Methodology Results Conclusions Future Work Contributions
Conclusions
Using the sequential procedure we found the optimal
sample sizes of the cell types, which are required to ob-
tain maximum power under cost constraints.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 19/27
Introduction Methodology Results Conclusions Future Work Contributions
Conclusions
Using the sequential procedure we found the optimal
sample sizes of the cell types, which are required to ob-
tain maximum power under cost constraints.
The simulation study showed that on an average, using
our procedure, the estimated optimal sample size and
theoretical sample size are close for most of the situa-
tions, except for the mixture of normal, weibull and chi
distribution.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 19/27
Introduction Methodology Results Conclusions Future Work Contributions
Conclusions
Using the sequential procedure we found the optimal
sample sizes of the cell types, which are required to ob-
tain maximum power under cost constraints.
The simulation study showed that on an average, using
our procedure, the estimated optimal sample size and
theoretical sample size are close for most of the situa-
tions, except for the mixture of normal, weibull and chi
distribution.
The procedure developed is independent of distribu-
tions.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 19/27
Introduction Methodology Results Conclusions Future Work Contributions
Conclusions Contd.
SCOTTY is limited to normal distribution and hence
can’t handle the datasets following other distributions.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 20/27
Introduction Methodology Results Conclusions Future Work Contributions
Conclusions Contd.
SCOTTY is limited to normal distribution and hence
can’t handle the datasets following other distributions.
Due to this factor, our procedure gains an advantage
over SCOTTY.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 20/27
Introduction Methodology Results Conclusions Future Work Contributions
Conclusions Contd.
SCOTTY is limited to normal distribution and hence
can’t handle the datasets following other distributions.
Due to this factor, our procedure gains an advantage
over SCOTTY.
We have developed a toolkit for estimating optimal sam-
ple sizes of different cell types corresponding to a single
gene and for making an inference about hypothesis test.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 20/27
Introduction Methodology Results Conclusions Future Work Contributions
Conclusions Contd.
SCOTTY is limited to normal distribution and hence
can’t handle the datasets following other distributions.
Due to this factor, our procedure gains an advantage
over SCOTTY.
We have developed a toolkit for estimating optimal sam-
ple sizes of different cell types corresponding to a single
gene and for making an inference about hypothesis test.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 20/27
Introduction Methodology Results Conclusions Future Work Contributions
Future Work
The project consisted of simulation study and toolkit
development for the sequential procedure with a single
gene having multiple cell types.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 21/27
Introduction Methodology Results Conclusions Future Work Contributions
Future Work
The project consisted of simulation study and toolkit
development for the sequential procedure with a single
gene having multiple cell types.
The future activities include developing a procedure for
a set of genes with multiple cell types.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 21/27
Introduction Methodology Results Conclusions Future Work Contributions
Future Work
The project consisted of simulation study and toolkit
development for the sequential procedure with a single
gene having multiple cell types.
The future activities include developing a procedure for
a set of genes with multiple cell types.
In addition to this, the toolkit will be also added up
with extra features and would be made to run for set of
genes scenario as well.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 21/27
Introduction Methodology Results Conclusions Future Work Contributions
Future Work
The project consisted of simulation study and toolkit
development for the sequential procedure with a single
gene having multiple cell types.
The future activities include developing a procedure for
a set of genes with multiple cell types.
In addition to this, the toolkit will be also added up
with extra features and would be made to run for set of
genes scenario as well.
The final outcome would be a rich web and desktop
application.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 21/27
Introduction Methodology Results Conclusions Future Work Contributions
Contributions
Abhijeet
Writing the R code for
the sequential procedure.
Simulation study using
Gamma Distribution
Weibull Distribution
Normal-Exponential
Distribution
Chi Distribution
Chi Square (Non-
Central) Distribution
Logistic Distribution
Jishnu
R code reviewing and bug
fixing.
Transforming the R code
to Python code.
Simulation study using
Normal distribution
Mixture-normal distri-
butions
Mixture of Normal +
Weibull + Chi Distri-
bution
Toolkit development.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 22/27
Introduction Methodology Results Conclusions Future Work Contributions
References I
[BSM+
13] Michele A Busby, Chip Stewart, Chase A Miller,
Krzysztof R Grzeda, and Gabor T Marth, Scotty:
a web tool for designing rna-seq experiments to
measure differential gene expression, Bioinfor-
matics 29 (2013), no. 5, 656–657.
[GCL11] Jiin-Huarng Guo, Hubert J Chen, and Wei-Ming
Luh, Sample size planning with the cost constraint
for testing superiority and equivalence of two in-
dependent groups, British Journal of Mathemat-
ical and Statistical Psychology 64 (2011), no. 3,
439–461.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 23/27
Introduction Methodology Results Conclusions Future Work Contributions
References II
[GS91] Bhaskar Kumar Ghosh and Pranab Kumar Sen,
Handbook of sequential analysis, CRC Press,
1991.
[JT99] Christopher Jennison and Bruce W Turnbull,
Group sequential methods with applications to
clinical trials, CRC Press, 1999.
[LG16] Wei-Ming Luh and Jiin-Huarng Guo, Sample size
planning for the noninferiority or equivalence of
a linear contrast with cost considerations., Psy-
chological methods 21 (2016), no. 1, 13.
[Lip90] Mark W Lipsey, Design sensitivity: Statistical
power for experimental research, vol. 19, Sage,
1990.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 24/27
Introduction Methodology Results Conclusions Future Work Contributions
References III
[LR06] Erich L Lehmann and Joseph P Romano, Testing
statistical hypotheses, Springer Science & Busi-
ness Media, 2006.
[MDS08] Nitis Mukhopadhyay and Basil M De Silva, Se-
quential methods and their applications, CRC
press, 2008.
[SS81] Pranab Kumar Sen and Pranab K Sen, Sequential
nonparametrics: invariance principles and statis-
tical inference, Wiley New York, 1981.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 25/27
Introduction Methodology Results Conclusions Future Work Contributions
Any Questions?
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 26/27
Introduction Methodology Results Conclusions Future Work Contributions
Statistics, likelihoods, and probabilities
mean everything to men, nothing to God.
- Richelle E. Goodrich
Thank You.
P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 27/27

More Related Content

Similar to Btp 2017 presentation

Correlational analysis, Basics, Assumptions for Pearson, Spearman Tests
Correlational analysis, Basics, Assumptions for Pearson, Spearman TestsCorrelational analysis, Basics, Assumptions for Pearson, Spearman Tests
Correlational analysis, Basics, Assumptions for Pearson, Spearman TestsMichael J Leo
 
Confirmatory Bayesian Online Change Point Detection in the Covariance Structu...
Confirmatory Bayesian Online Change Point Detection in the Covariance Structu...Confirmatory Bayesian Online Change Point Detection in the Covariance Structu...
Confirmatory Bayesian Online Change Point Detection in the Covariance Structu...JeeyeonHan
 
Rules Reduction using Evolutionary Meta-Heuristics
Rules Reduction using  Evolutionary Meta-HeuristicsRules Reduction using  Evolutionary Meta-Heuristics
Rules Reduction using Evolutionary Meta-HeuristicsAboul Ella Hassanien
 
Use of Definitive Screening Designs to Optimize an Analytical Method
Use of Definitive Screening Designs to Optimize an Analytical MethodUse of Definitive Screening Designs to Optimize an Analytical Method
Use of Definitive Screening Designs to Optimize an Analytical MethodPhilip Ramsey
 
Self-Control in Chimpanzees Relates to General Intelligence
Self-Control in Chimpanzees Relates to General IntelligenceSelf-Control in Chimpanzees Relates to General Intelligence
Self-Control in Chimpanzees Relates to General IntelligenceSachin Eldho
 
Computational Pool-Testing with Retesting Strategy
Computational Pool-Testing with Retesting StrategyComputational Pool-Testing with Retesting Strategy
Computational Pool-Testing with Retesting StrategyWaqas Tariq
 
Meta-analysis when the normality assumptions are violated (2008)
Meta-analysis when the normality assumptions are violated (2008)Meta-analysis when the normality assumptions are violated (2008)
Meta-analysis when the normality assumptions are violated (2008)Evangelos Kontopantelis
 
Hypothesis Testing: Relationships (Compare 1:1)
Hypothesis Testing: Relationships (Compare 1:1)Hypothesis Testing: Relationships (Compare 1:1)
Hypothesis Testing: Relationships (Compare 1:1)Matt Hansen
 
Data Analysis and Interpretation.pptx
Data Analysis and Interpretation.pptxData Analysis and Interpretation.pptx
Data Analysis and Interpretation.pptxRizzaLeenJoySubaan
 
Opticon 2017 Experimenting with Stats Engine
Opticon 2017 Experimenting with Stats EngineOpticon 2017 Experimenting with Stats Engine
Opticon 2017 Experimenting with Stats EngineOptimizely
 
Bayesian solutions to high-dimensional challenges using hybrid search
Bayesian solutions to high-dimensional challenges using hybrid searchBayesian solutions to high-dimensional challenges using hybrid search
Bayesian solutions to high-dimensional challenges using hybrid searchCaleb (Shiqiang) Jin
 
Innovative sample size methods for adaptive clinical trials webinar web ver...
Innovative sample size methods for adaptive clinical trials webinar   web ver...Innovative sample size methods for adaptive clinical trials webinar   web ver...
Innovative sample size methods for adaptive clinical trials webinar web ver...nQuery
 
Innovative Sample Size Methods For Clinical Trials
Innovative Sample Size Methods For Clinical Trials Innovative Sample Size Methods For Clinical Trials
Innovative Sample Size Methods For Clinical Trials nQuery
 
Screening of new strains of sugarcane using augmented block designs
Screening of new strains of sugarcane using augmented block designsScreening of new strains of sugarcane using augmented block designs
Screening of new strains of sugarcane using augmented block designsAlexander Decker
 
Sampling techniques and size
Sampling techniques and sizeSampling techniques and size
Sampling techniques and sizeDr. Keerti Jain
 
Business Research Methods PPT - III
Business Research Methods PPT - IIIBusiness Research Methods PPT - III
Business Research Methods PPT - IIIRavinder Singh
 
Protein Distance Map Prediction based on a Nearest Neighbors Approach
Protein Distance Map Prediction based on a Nearest Neighbors ApproachProtein Distance Map Prediction based on a Nearest Neighbors Approach
Protein Distance Map Prediction based on a Nearest Neighbors ApproachGualberto Asencio Cortés
 
Week 6 Assignment 2Application Chi-Square Study.docx
Week 6 Assignment 2Application Chi-Square Study.docxWeek 6 Assignment 2Application Chi-Square Study.docx
Week 6 Assignment 2Application Chi-Square Study.docxmelbruce90096
 

Similar to Btp 2017 presentation (20)

Correlational analysis, Basics, Assumptions for Pearson, Spearman Tests
Correlational analysis, Basics, Assumptions for Pearson, Spearman TestsCorrelational analysis, Basics, Assumptions for Pearson, Spearman Tests
Correlational analysis, Basics, Assumptions for Pearson, Spearman Tests
 
Confirmatory Bayesian Online Change Point Detection in the Covariance Structu...
Confirmatory Bayesian Online Change Point Detection in the Covariance Structu...Confirmatory Bayesian Online Change Point Detection in the Covariance Structu...
Confirmatory Bayesian Online Change Point Detection in the Covariance Structu...
 
Rules Reduction using Evolutionary Meta-Heuristics
Rules Reduction using  Evolutionary Meta-HeuristicsRules Reduction using  Evolutionary Meta-Heuristics
Rules Reduction using Evolutionary Meta-Heuristics
 
Use of Definitive Screening Designs to Optimize an Analytical Method
Use of Definitive Screening Designs to Optimize an Analytical MethodUse of Definitive Screening Designs to Optimize an Analytical Method
Use of Definitive Screening Designs to Optimize an Analytical Method
 
Self-Control in Chimpanzees Relates to General Intelligence
Self-Control in Chimpanzees Relates to General IntelligenceSelf-Control in Chimpanzees Relates to General Intelligence
Self-Control in Chimpanzees Relates to General Intelligence
 
Computational Pool-Testing with Retesting Strategy
Computational Pool-Testing with Retesting StrategyComputational Pool-Testing with Retesting Strategy
Computational Pool-Testing with Retesting Strategy
 
Meta-analysis when the normality assumptions are violated (2008)
Meta-analysis when the normality assumptions are violated (2008)Meta-analysis when the normality assumptions are violated (2008)
Meta-analysis when the normality assumptions are violated (2008)
 
Hypothesis Testing: Relationships (Compare 1:1)
Hypothesis Testing: Relationships (Compare 1:1)Hypothesis Testing: Relationships (Compare 1:1)
Hypothesis Testing: Relationships (Compare 1:1)
 
Data Analysis and Interpretation.pptx
Data Analysis and Interpretation.pptxData Analysis and Interpretation.pptx
Data Analysis and Interpretation.pptx
 
Opticon 2017 Experimenting with Stats Engine
Opticon 2017 Experimenting with Stats EngineOpticon 2017 Experimenting with Stats Engine
Opticon 2017 Experimenting with Stats Engine
 
Bayesian solutions to high-dimensional challenges using hybrid search
Bayesian solutions to high-dimensional challenges using hybrid searchBayesian solutions to high-dimensional challenges using hybrid search
Bayesian solutions to high-dimensional challenges using hybrid search
 
Innovative sample size methods for adaptive clinical trials webinar web ver...
Innovative sample size methods for adaptive clinical trials webinar   web ver...Innovative sample size methods for adaptive clinical trials webinar   web ver...
Innovative sample size methods for adaptive clinical trials webinar web ver...
 
Innovative Sample Size Methods For Clinical Trials
Innovative Sample Size Methods For Clinical Trials Innovative Sample Size Methods For Clinical Trials
Innovative Sample Size Methods For Clinical Trials
 
CHI Square
CHI SquareCHI Square
CHI Square
 
Statistical Analysis
Statistical AnalysisStatistical Analysis
Statistical Analysis
 
Screening of new strains of sugarcane using augmented block designs
Screening of new strains of sugarcane using augmented block designsScreening of new strains of sugarcane using augmented block designs
Screening of new strains of sugarcane using augmented block designs
 
Sampling techniques and size
Sampling techniques and sizeSampling techniques and size
Sampling techniques and size
 
Business Research Methods PPT - III
Business Research Methods PPT - IIIBusiness Research Methods PPT - III
Business Research Methods PPT - III
 
Protein Distance Map Prediction based on a Nearest Neighbors Approach
Protein Distance Map Prediction based on a Nearest Neighbors ApproachProtein Distance Map Prediction based on a Nearest Neighbors Approach
Protein Distance Map Prediction based on a Nearest Neighbors Approach
 
Week 6 Assignment 2Application Chi-Square Study.docx
Week 6 Assignment 2Application Chi-Square Study.docxWeek 6 Assignment 2Application Chi-Square Study.docx
Week 6 Assignment 2Application Chi-Square Study.docx
 

More from Jishnu P

SinGAN - Learning a Generative Model from a Single Natural Image
SinGAN - Learning a Generative Model from a Single Natural ImageSinGAN - Learning a Generative Model from a Single Natural Image
SinGAN - Learning a Generative Model from a Single Natural ImageJishnu P
 
Breaking CAPTCHAs using ML
Breaking CAPTCHAs using MLBreaking CAPTCHAs using ML
Breaking CAPTCHAs using MLJishnu P
 
Stencil computation research project presentation #1
Stencil computation research project presentation #1Stencil computation research project presentation #1
Stencil computation research project presentation #1Jishnu P
 
Ir mcq-answering-system
Ir mcq-answering-systemIr mcq-answering-system
Ir mcq-answering-systemJishnu P
 
Cs403 Parellel Programming Travelling Salesman Problem
Cs403   Parellel Programming Travelling Salesman ProblemCs403   Parellel Programming Travelling Salesman Problem
Cs403 Parellel Programming Travelling Salesman ProblemJishnu P
 
Ansible Overview - System Administration and Maintenance
Ansible Overview - System Administration and MaintenanceAnsible Overview - System Administration and Maintenance
Ansible Overview - System Administration and MaintenanceJishnu P
 
CS404 Pattern Recognition - Locality Preserving Projections
CS404   Pattern Recognition - Locality Preserving ProjectionsCS404   Pattern Recognition - Locality Preserving Projections
CS404 Pattern Recognition - Locality Preserving ProjectionsJishnu P
 

More from Jishnu P (7)

SinGAN - Learning a Generative Model from a Single Natural Image
SinGAN - Learning a Generative Model from a Single Natural ImageSinGAN - Learning a Generative Model from a Single Natural Image
SinGAN - Learning a Generative Model from a Single Natural Image
 
Breaking CAPTCHAs using ML
Breaking CAPTCHAs using MLBreaking CAPTCHAs using ML
Breaking CAPTCHAs using ML
 
Stencil computation research project presentation #1
Stencil computation research project presentation #1Stencil computation research project presentation #1
Stencil computation research project presentation #1
 
Ir mcq-answering-system
Ir mcq-answering-systemIr mcq-answering-system
Ir mcq-answering-system
 
Cs403 Parellel Programming Travelling Salesman Problem
Cs403   Parellel Programming Travelling Salesman ProblemCs403   Parellel Programming Travelling Salesman Problem
Cs403 Parellel Programming Travelling Salesman Problem
 
Ansible Overview - System Administration and Maintenance
Ansible Overview - System Administration and MaintenanceAnsible Overview - System Administration and Maintenance
Ansible Overview - System Administration and Maintenance
 
CS404 Pattern Recognition - Locality Preserving Projections
CS404   Pattern Recognition - Locality Preserving ProjectionsCS404   Pattern Recognition - Locality Preserving Projections
CS404 Pattern Recognition - Locality Preserving Projections
 

Recently uploaded

Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 

Recently uploaded (20)

Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 

Btp 2017 presentation

  • 1. Introduction Methodology Results Conclusions Future Work Contributions B.Tech. Project Attaining maximum power under cost constraints in single gene experiments P Jishnu Jaykumar (201352005) Abhijeet Singh Panwar (201351005) under the supervision of Dr. Bhargab Chattopadhyay Indian Institute of Information Technology, Vadodara May 5, 2017 P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 1/27
  • 2. Introduction Methodology Results Conclusions Future Work Contributions Introduction Single gene experiments are assays for querying ribonu- cleic acid species abundance in individual or pools of cells. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
  • 3. Introduction Methodology Results Conclusions Future Work Contributions Introduction Single gene experiments are assays for querying ribonu- cleic acid species abundance in individual or pools of cells. They have enormous importance in varied fields like evo- lution, pathology and drug development. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
  • 4. Introduction Methodology Results Conclusions Future Work Contributions Introduction Single gene experiments are assays for querying ribonu- cleic acid species abundance in individual or pools of cells. They have enormous importance in varied fields like evo- lution, pathology and drug development. However, these experiments are usually performed by pre-specifying the sample size during experimental de- sign, which may lead to under powered or over powered hypothesis tests. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
  • 5. Introduction Methodology Results Conclusions Future Work Contributions Introduction Single gene experiments are assays for querying ribonu- cleic acid species abundance in individual or pools of cells. They have enormous importance in varied fields like evo- lution, pathology and drug development. However, these experiments are usually performed by pre-specifying the sample size during experimental de- sign, which may lead to under powered or over powered hypothesis tests. This resulted in a shift to experimental design paradigms with minimal number of replicates to maxi- mize power. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
  • 6. Introduction Methodology Results Conclusions Future Work Contributions Introduction Single gene experiments are assays for querying ribonu- cleic acid species abundance in individual or pools of cells. They have enormous importance in varied fields like evo- lution, pathology and drug development. However, these experiments are usually performed by pre-specifying the sample size during experimental de- sign, which may lead to under powered or over powered hypothesis tests. This resulted in a shift to experimental design paradigms with minimal number of replicates to maxi- mize power. Several studies have analyzed sample size required to maximize power in hypothesis testing in RNA abun- dance experiments. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 2/27
  • 7. Introduction Methodology Results Conclusions Future Work Contributions Introduction - Contd. Recently, for the two-stage experimental design frame- work, algorithms like SCOTTY have been proposed. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 3/27
  • 8. Introduction Methodology Results Conclusions Future Work Contributions Introduction - Contd. Recently, for the two-stage experimental design frame- work, algorithms like SCOTTY have been proposed. But SCOTTY fails, if there are multiple (> 2) cell types. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 3/27
  • 9. Introduction Methodology Results Conclusions Future Work Contributions Introduction - Contd. Recently, for the two-stage experimental design frame- work, algorithms like SCOTTY have been proposed. But SCOTTY fails, if there are multiple (> 2) cell types. Also, two-stage procedures are imperfect if pilot sample is unrepresentative. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 3/27
  • 10. Introduction Methodology Results Conclusions Future Work Contributions Introduction - Contd. Recently, for the two-stage experimental design frame- work, algorithms like SCOTTY have been proposed. But SCOTTY fails, if there are multiple (> 2) cell types. Also, two-stage procedures are imperfect if pilot sample is unrepresentative. Thus the focus is to maximize the power of the hypoth- esis test related to a single gene belonging to more than two cell types under a cost constraint. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 3/27
  • 11. Introduction Methodology Results Conclusions Future Work Contributions Some facts In experimental research, limited funding is allotted be- forehand to carry out the sampling process. Thus under a cost constraint we have to carry out the test of equivalence or inferiority (or superiority) with minimum errors. Note that we fix the probability of type-I error to α (0.05 in our case), So the idea is to minimize the probability of type-II error or equivalently maximizing the power under cost constraints. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 4/27
  • 12. Introduction Methodology Results Conclusions Future Work Contributions Formulation Suppose there are K cell types belonging to a particular gene. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 5/27
  • 13. Introduction Methodology Results Conclusions Future Work Contributions Formulation Suppose there are K cell types belonging to a particular gene. For the ith cell type, suppose Xi1, . . . , Xini be iid ran- dom variables, not necessary normal, with means µi and variances σ2 i . P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 5/27
  • 14. Introduction Methodology Results Conclusions Future Work Contributions Formulation Suppose there are K cell types belonging to a particular gene. For the ith cell type, suppose Xi1, . . . , Xini be iid ran- dom variables, not necessary normal, with means µi and variances σ2 i . Consider δ = c µ K i=1 ciµi , c = (c1, . . . , cK) is known. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 5/27
  • 15. Introduction Methodology Results Conclusions Future Work Contributions Formulation Suppose there are K cell types belonging to a particular gene. For the ith cell type, suppose Xi1, . . . , Xini be iid ran- dom variables, not necessary normal, with means µi and variances σ2 i . Consider δ = c µ K i=1 ciµi , c = (c1, . . . , cK) is known. Estimator of δ is δn = K i=1 ci ¯Xni . P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 5/27
  • 16. Introduction Methodology Results Conclusions Future Work Contributions Formulation Contd. γi’s: sample size allocation ratios (unknown), ni = γin1, γ1 = 1. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 6/27
  • 17. Introduction Methodology Results Conclusions Future Work Contributions Formulation Contd. γi’s: sample size allocation ratios (unknown), ni = γin1, γ1 = 1. V ar(δn) = K i=1 c2 i σ2 i /ni = 1 n1 K i=1 c2 i σ2 i /γi. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 6/27
  • 18. Introduction Methodology Results Conclusions Future Work Contributions Formulation Contd. γi’s: sample size allocation ratios (unknown), ni = γin1, γ1 = 1. V ar(δn) = K i=1 c2 i σ2 i /ni = 1 n1 K i=1 c2 i σ2 i /γi. Using CLT, √ n1(δn − δ) K i=1 c2 i σ2 i /γi → D N(0, 1) P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 6/27
  • 19. Introduction Methodology Results Conclusions Future Work Contributions Test for Equivalence Compare δ, to two equivalence margins with δ1 < δ < δ2 H0 : δ ≤ δ1 or δ ≥ δ2 against HA : δ1 < δ < δ2 P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 7/27
  • 20. Introduction Methodology Results Conclusions Future Work Contributions Test for Equivalence Compare δ, to two equivalence margins with δ1 < δ < δ2 H0 : δ ≤ δ1 or δ ≥ δ2 against HA : δ1 < δ < δ2 Consider the test for equivalence. Then the null hypoth- esis H0 is rejected at α% if δ1 + zα V ar(δn) < δn < δ2 + zα V ar(δn) P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 7/27
  • 21. Introduction Methodology Results Conclusions Future Work Contributions Test for Equivalence Compare δ, to two equivalence margins with δ1 < δ < δ2 H0 : δ ≤ δ1 or δ ≥ δ2 against HA : δ1 < δ < δ2 Consider the test for equivalence. Then the null hypoth- esis H0 is rejected at α% if δ1 + zα V ar(δn) < δn < δ2 + zα V ar(δn) The approximate power function is PTE(δ) = Φ  −zα + δ2 − δ V ar(ˆδn)  −Φ  zα − δ − δ1 V ar(ˆδn)   P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 7/27
  • 22. Introduction Methodology Results Conclusions Future Work Contributions Test for Inferiority Compare δ, to δ0, we have, H0 : δ ≤ −|δ0| against HA : δ > −|δ0| P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 8/27
  • 23. Introduction Methodology Results Conclusions Future Work Contributions Test for Inferiority Compare δ, to δ0, we have, H0 : δ ≤ −|δ0| against HA : δ > −|δ0| Consider the inferiority test. Then the null hypothesis H0 is rejected at α% if δn > −|δ0| + zα V ar(δn) P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 8/27
  • 24. Introduction Methodology Results Conclusions Future Work Contributions Test for Inferiority Compare δ, to δ0, we have, H0 : δ ≤ −|δ0| against HA : δ > −|δ0| Consider the inferiority test. Then the null hypothesis H0 is rejected at α% if δn > −|δ0| + zα V ar(δn) The approximate power function PTI(δ) = Φ  −zα + δ + | δ0 | V ar(ˆδn)   P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 8/27
  • 25. Introduction Methodology Results Conclusions Future Work Contributions Maximizing Power Under Cost Constraints Suppose, we have a certain amount of money to carry out sampling, say |A0. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 9/27
  • 26. Introduction Methodology Results Conclusions Future Work Contributions Maximizing Power Under Cost Constraints Suppose, we have a certain amount of money to carry out sampling, say |A0. If it costs |ai to sample each observation from ith cell type, then A0 = K i=1 aini =⇒ n1 = A0 K i=1 aiγi . P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 9/27
  • 27. Introduction Methodology Results Conclusions Future Work Contributions Maximizing Power Under Cost Constraints Suppose, we have a certain amount of money to carry out sampling, say |A0. If it costs |ai to sample each observation from ith cell type, then A0 = K i=1 aini =⇒ n1 = A0 K i=1 aiγi . In order to maximize power, the optimal allocation ratio γi = | ci | σi | c1 | σ1 a1 ai (1) P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 9/27
  • 28. Introduction Methodology Results Conclusions Future Work Contributions Maximizing Power Under Cost Constraints Suppose, we have a certain amount of money to carry out sampling, say |A0. If it costs |ai to sample each observation from ith cell type, then A0 = K i=1 aini =⇒ n1 = A0 K i=1 aiγi . In order to maximize power, the optimal allocation ratio γi = | ci | σi | c1 | σ1 a1 ai (1) The optimal sample size for ith cell type is, nio = A0 | ci | σi K i=1 | cl | σl √ alai (2) P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 9/27
  • 29. Introduction Methodology Results Conclusions Future Work Contributions Sequential Procedure Step 1: First obtain Nio = m(≥ 2) observations from each of the cell types and for ith cell type ,check m ≥ A0 | Ci | SiNio K i=1 | cl | slNlo √ alai If for ith cell type, the above condition gets satisfied, no further observations should be obtained from that cell type. Else go to step 2 after carrying out step 1 for all cell types. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 10/27
  • 30. Introduction Methodology Results Conclusions Future Work Contributions Sequential Procedure Contd. Step 2: Add m observations corresponding to the ith cell type and set Nio = m + m and check, m + m ≥ A0 | Ci | SiNio K i=1 | cl | slNlo √ alai If for ith cell type, the above condition gets satisfied, no further sampling needs to be done for that cell type. Else repeat step 2 and continue this until for all cell types, the condition gets satisfied. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 11/27
  • 31. Introduction Methodology Results Conclusions Future Work Contributions Sequential Procedure and Characteristics One can find optimal sample size using the stopping rule. Nio, is the smallest integer, such that nio ≥ A0 | Ci | Sinio K i=1 | cl | slnlo √ alai (3) The total cost of sampling K i=1 Nio observations, Nio being the estimated final optimal sample size for ith cell type computed using Equation 4, is |A0 . P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 12/27
  • 32. Introduction Methodology Results Conclusions Future Work Contributions Contd. The optimal allocation ratio as defined in Equation 2 to derive minimum variance depends on population variances of the K cell types. In practice, the value of the population variance of each cell type is unknown, therefore, the sample size required to obtain maximum power of the test cannot be computed. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 13/27
  • 33. Introduction Methodology Results Conclusions Future Work Contributions Graphical Representation Flowchart that describes the sequential procedure developed. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 14/27
  • 34. Introduction Methodology Results Conclusions Future Work Contributions One of the simulation results Cell Type |ci| ai Nio s(Nio) nio Nio/nio 1 N(µ=1, σ=2) 3 4 467.01 0.23 469 0.99 2 N(µ=2, σ=2) 3 4 233.49 0.78 235 0.99 3 N(µ=3, σ=4) 3 4 1168.9 0.48 1171 0.998 4 N(µ=4, σ=3) 1 2 440.3 0.70 442 0.996 5 N(µ=5, σ=2) 2 3 1080.92 0.80 1082 0.998 6 N(µ=6, σ=4) 3 4 1873.8 0.66 1873 1 7 N(µ=7, σ=5) 4 4 2187.02 0.72 2185 1 8 N(µ=8, σ=6) 5 3 1351.93 1.04 1352 1 9 N(µ=9, σ=3) 6 2 5968.11 1.09 5959 1.001 10 N(µ=10, σ=5) 7 2 3092.26 0.45 3090 1 Final Sample Sizes for Normal Distribution, simulated 5000 times when A0=|50000 P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 15/27
  • 35. Introduction Methodology Results Conclusions Future Work Contributions One of the simulation results Cell Type |ci| ai Nio s(Nio) nio Nio/nio 1 .95N(1,2)+.05N(20,4) 3 4 59 0.141 60 0.98 2 .95N(2,2)+.05N(20,4) 3 4 58 0.140 60 0.96 3 .95N(3,4)+.05N(20,4) 3 4 118 0.210 120 0.98 4 .95N(4,3)+.05N(20,4) 1 2 41 0.122 43 0.953 5 .95N(5,2)+.05N(20,4) 2 3 45 0.123 47 0.957 6 .95N(6,4)+.05N(20,4) 3 4 118 0.209 120 0.98 7 .95N(7,5)+.05N(20,4) 4 4 200 0.271 200 1 8 .95N(8,6)+.05N(20,4) 5 3 338 0.331 347 0.974 9 .95N(9,3)+.05N(20,4) 6 2 259 0.303 255 1.01 10 .95N(10,5)+.05N(20,4) 7 2 493 0.448 495 0.995 Final Sample Sizes for Mixture-Normal Distribution, simulated 5000 times Distribution when A0=|50000 P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 16/27
  • 36. Introduction Methodology Results Conclusions Future Work Contributions Graph Figure: Accuracy w.r.t. to all the distributions taken into account for the simulation study. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 17/27
  • 37. Introduction Methodology Results Conclusions Future Work Contributions About the toolkit A toolkit (web-application) for the sequential procedure was also developed. Input: A xlsx file containing data in a predefined format. Output: Optimal sample sizes for each cell type speci- fied in the input file. Intermediate stages of the procedure can also be viewed. It is temporarily hosted at http://139.59.74.69/ alpha_version/ P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 18/27
  • 38. Introduction Methodology Results Conclusions Future Work Contributions Conclusions Using the sequential procedure we found the optimal sample sizes of the cell types, which are required to ob- tain maximum power under cost constraints. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 19/27
  • 39. Introduction Methodology Results Conclusions Future Work Contributions Conclusions Using the sequential procedure we found the optimal sample sizes of the cell types, which are required to ob- tain maximum power under cost constraints. The simulation study showed that on an average, using our procedure, the estimated optimal sample size and theoretical sample size are close for most of the situa- tions, except for the mixture of normal, weibull and chi distribution. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 19/27
  • 40. Introduction Methodology Results Conclusions Future Work Contributions Conclusions Using the sequential procedure we found the optimal sample sizes of the cell types, which are required to ob- tain maximum power under cost constraints. The simulation study showed that on an average, using our procedure, the estimated optimal sample size and theoretical sample size are close for most of the situa- tions, except for the mixture of normal, weibull and chi distribution. The procedure developed is independent of distribu- tions. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 19/27
  • 41. Introduction Methodology Results Conclusions Future Work Contributions Conclusions Contd. SCOTTY is limited to normal distribution and hence can’t handle the datasets following other distributions. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 20/27
  • 42. Introduction Methodology Results Conclusions Future Work Contributions Conclusions Contd. SCOTTY is limited to normal distribution and hence can’t handle the datasets following other distributions. Due to this factor, our procedure gains an advantage over SCOTTY. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 20/27
  • 43. Introduction Methodology Results Conclusions Future Work Contributions Conclusions Contd. SCOTTY is limited to normal distribution and hence can’t handle the datasets following other distributions. Due to this factor, our procedure gains an advantage over SCOTTY. We have developed a toolkit for estimating optimal sam- ple sizes of different cell types corresponding to a single gene and for making an inference about hypothesis test. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 20/27
  • 44. Introduction Methodology Results Conclusions Future Work Contributions Conclusions Contd. SCOTTY is limited to normal distribution and hence can’t handle the datasets following other distributions. Due to this factor, our procedure gains an advantage over SCOTTY. We have developed a toolkit for estimating optimal sam- ple sizes of different cell types corresponding to a single gene and for making an inference about hypothesis test. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 20/27
  • 45. Introduction Methodology Results Conclusions Future Work Contributions Future Work The project consisted of simulation study and toolkit development for the sequential procedure with a single gene having multiple cell types. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 21/27
  • 46. Introduction Methodology Results Conclusions Future Work Contributions Future Work The project consisted of simulation study and toolkit development for the sequential procedure with a single gene having multiple cell types. The future activities include developing a procedure for a set of genes with multiple cell types. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 21/27
  • 47. Introduction Methodology Results Conclusions Future Work Contributions Future Work The project consisted of simulation study and toolkit development for the sequential procedure with a single gene having multiple cell types. The future activities include developing a procedure for a set of genes with multiple cell types. In addition to this, the toolkit will be also added up with extra features and would be made to run for set of genes scenario as well. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 21/27
  • 48. Introduction Methodology Results Conclusions Future Work Contributions Future Work The project consisted of simulation study and toolkit development for the sequential procedure with a single gene having multiple cell types. The future activities include developing a procedure for a set of genes with multiple cell types. In addition to this, the toolkit will be also added up with extra features and would be made to run for set of genes scenario as well. The final outcome would be a rich web and desktop application. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 21/27
  • 49. Introduction Methodology Results Conclusions Future Work Contributions Contributions Abhijeet Writing the R code for the sequential procedure. Simulation study using Gamma Distribution Weibull Distribution Normal-Exponential Distribution Chi Distribution Chi Square (Non- Central) Distribution Logistic Distribution Jishnu R code reviewing and bug fixing. Transforming the R code to Python code. Simulation study using Normal distribution Mixture-normal distri- butions Mixture of Normal + Weibull + Chi Distri- bution Toolkit development. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 22/27
  • 50. Introduction Methodology Results Conclusions Future Work Contributions References I [BSM+ 13] Michele A Busby, Chip Stewart, Chase A Miller, Krzysztof R Grzeda, and Gabor T Marth, Scotty: a web tool for designing rna-seq experiments to measure differential gene expression, Bioinfor- matics 29 (2013), no. 5, 656–657. [GCL11] Jiin-Huarng Guo, Hubert J Chen, and Wei-Ming Luh, Sample size planning with the cost constraint for testing superiority and equivalence of two in- dependent groups, British Journal of Mathemat- ical and Statistical Psychology 64 (2011), no. 3, 439–461. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 23/27
  • 51. Introduction Methodology Results Conclusions Future Work Contributions References II [GS91] Bhaskar Kumar Ghosh and Pranab Kumar Sen, Handbook of sequential analysis, CRC Press, 1991. [JT99] Christopher Jennison and Bruce W Turnbull, Group sequential methods with applications to clinical trials, CRC Press, 1999. [LG16] Wei-Ming Luh and Jiin-Huarng Guo, Sample size planning for the noninferiority or equivalence of a linear contrast with cost considerations., Psy- chological methods 21 (2016), no. 1, 13. [Lip90] Mark W Lipsey, Design sensitivity: Statistical power for experimental research, vol. 19, Sage, 1990. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 24/27
  • 52. Introduction Methodology Results Conclusions Future Work Contributions References III [LR06] Erich L Lehmann and Joseph P Romano, Testing statistical hypotheses, Springer Science & Busi- ness Media, 2006. [MDS08] Nitis Mukhopadhyay and Basil M De Silva, Se- quential methods and their applications, CRC press, 2008. [SS81] Pranab Kumar Sen and Pranab K Sen, Sequential nonparametrics: invariance principles and statis- tical inference, Wiley New York, 1981. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 25/27
  • 53. Introduction Methodology Results Conclusions Future Work Contributions Any Questions? P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 26/27
  • 54. Introduction Methodology Results Conclusions Future Work Contributions Statistics, likelihoods, and probabilities mean everything to men, nothing to God. - Richelle E. Goodrich Thank You. P Jishnu Jaykumar Abhijeet Singh Panwar BTP-2017 27/27