SlideShare a Scribd company logo
Inferring parameters in a large stochastic model using only proportions of cell
death: insights from the birth-death model
Holly Ainsworth, Richard Boys & Colin Gillespie∗
Newcastle University, UK
REFERENCES
[1] Tang, M.Y., Proctor, C.J., Woulfe, J., Gray, D.A.
Experimental and Computational Analysis of
Polyglutamine-Mediated Cytotoxicity In PLoS Com-
put Biol., 2010 .
INTRODUCTION →
• Expanded polyglutamine (PolyQ) proteins
are known to be the causative agents in a
number of neurodegenerative diseases, but
they are still poorly understood.
• Aggregation of specific proteins are part of
the normal process of ageing in the brain
as well as in many age-related diseases.
• A causative link between aggregation and
disease is not universally accepted.
STOCHASTIC MODEL ↓
Stochastic kinetic model with uncertain pa-
rameters:
• 27 (chemical) species
• 70 reactions
• 40 rate constants, denoted θ
The model aims to explore the relationship
between PolyQ, p38MAPK activation, genera-
tion of reactive oxygen species (ROS), protea-
some inhibition and inclusion body formation.
DATA ←
Data are proportions (of cell death) - not quan-
titative trait measurements
Scenario 24hrs 36hrs 48hrs
GFP 15.03 14.55 26.08
H25 18.97 18.07 22.50
H103 21.68 23.44 36.44
SIMULATION ↓
Denote the probability of cell death at time t, pt(θ). Note the probability of cell death depends
on parameters θ. Given θ, the Gillespie algorithm can be used to simulate the time evolution of
a particular cell:
• The simulation gives us a binary time series for the cell, with 1 = death and 0 = no death.
• Repeating the above for n cells gives us a handle on pt(θ) via the observed proportion of
cell death ˆpt(θ), where
ˆpt(θ) ∼
1
n
Bin(n, pt(θ)).
BIRTH-DEATH MODEL →
Let x denote the number of individuals
present in the population. In chemical kinetic
notation, this system is represented as
R1 : x
λ
−→ 2x (birth)
R2 : 2x
µ
−→ x (death)
Example simulations from the model:
q qq0
5
10
15
0 4 8 12
Time
Population
0.00
0.25
0.50
0.75
1.00
0 4 8 12
Time
Probabilityofextinction
Simulator n = 10 n = 100 n = 1000 n = 10000
Comparisons with PolyQ model:
• Compare a single population governed by
a birth-death process with a cell governed
by the PolyQ model.
• A cell becoming extinct in the birth-death
process can be likened to a cell dying in
the PolyQ model.
• Proportions of extinction from the birth-
death model are comparible to propor-
tions of cell death from the PolyQ model.
An analytic expression for the probability of
extinction in the birth-death process for given
t, λ, µ and initial population level is available.
INFERENCE ↓
Work with proportions on the logit scale and assume data model:
yt = logit xt = logit pt(θ) + σ t, t = 1, . . . , T
where t ∼ N(0, 1) independently. The posterior of interest is
π(θ, σ|y) ∝ π(θ)π(σ)π(y|θ, σ).
Approaches to inference:
• Vanilla MCMC - approximate the distribution of ˆpt(θ) and account for the uncertainty using
elogit ˆpt(θ) ∼ N logit pt(θ),
1
npt(θ)[1 − pt(θ)]
approximately.
• Pseudo marginal 1 - construct a Monte Carlo estimate of the marginal likelihood.
• Pseudo marginal 2 - at each iteration of the MCMC scheme, use a (SIR) particle filter to
construct a SMC approximation to the marginal likelihood.
RESULTS
log(λ) log(µ) log(σ)
0
1
2
3
4
5
0
1
2
3
4
5
0
1
2
3
4
5
n=10n=100n=1000
−3 −2 −1 0 1 −0.5 0.0 0.5 1.0 −3 −2 −1 0 1
Parameter value
Density
Vanilla MCMC Pseudo−marginal MCMC 1 Pseudo−marginal MCMC 2
Inference using exact
probability of death
• Both pseudo-marginal schemes involve
more work. They use n×#particles runs
of the simulator at each iteration, com-
pared to n runs for the original scheme.
• The pseudo-marginal approach has the
advantage that it performs exact infer-
ence (for a particular choice of n).
• This will be useful particularly when
the asymptotic distributional result for
elogit ˆpt(θ) is poor for small n.
• The SMC approach to estimating the
marginal likelihood appears to mix better
than the MC approach.

More Related Content

Similar to Poster for Information, probability and inference in systems biology (IPISB 2013)

Integration of biological annotations using hierarchical modeling
Integration of biological annotations using hierarchical modelingIntegration of biological annotations using hierarchical modeling
Integration of biological annotations using hierarchical modeling
USC
 
International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI), International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions www.ijeijournal.com
 
MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...
MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...
MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...
The Statistical and Applied Mathematical Sciences Institute
 
Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmission
JeremyHeng10
 
Igor Segota: PhD thesis presentation
Igor Segota: PhD thesis presentationIgor Segota: PhD thesis presentation
Igor Segota: PhD thesis presentation
IgorSegota3
 
Looking Inside Mechanistic Models of Carcinogenesis
Looking Inside Mechanistic Models of CarcinogenesisLooking Inside Mechanistic Models of Carcinogenesis
Looking Inside Mechanistic Models of Carcinogenesis
Sascha Zöllner
 
Survival analysis 1
Survival analysis 1Survival analysis 1
Survival analysis 1
KyusonLim
 
Markov chain and SIR epidemic model (Greenwood model)
Markov chain and SIR epidemic model (Greenwood model)Markov chain and SIR epidemic model (Greenwood model)
Markov chain and SIR epidemic model (Greenwood model)
writwik mandal
 
NeurSciACone
NeurSciAConeNeurSciACone
NeurSciACone
Adam Cone
 
Ldb Convergenze Parallele_De barros_01
Ldb Convergenze Parallele_De barros_01Ldb Convergenze Parallele_De barros_01
Ldb Convergenze Parallele_De barros_01
laboratoridalbasso
 
Causal Effect Inference with Deep Latent-Variable Models
Causal Effect Inference with Deep Latent-Variable ModelsCausal Effect Inference with Deep Latent-Variable Models
Causal Effect Inference with Deep Latent-Variable Models
Jeongmin Cha
 
Talk 5
Talk 5Talk 5
American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1
Double Check ĆŐNSULTING
 
lecture1 on survival analysis HRP 262 class
lecture1 on survival analysis HRP 262 classlecture1 on survival analysis HRP 262 class
lecture1 on survival analysis HRP 262 class
TroyTeo1
 
Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmission
JeremyHeng10
 
population genetics of gene function (talk)
population genetics of gene function (talk)population genetics of gene function (talk)
population genetics of gene function (talk)
Ignacio Gallo
 
JMM_Poster_2015
JMM_Poster_2015JMM_Poster_2015
JMM_Poster_2015
Jennifer Kiser
 
Stochastic Modeling Neutral Evolution by an Iambp of Cortisol Secretion of Br...
Stochastic Modeling Neutral Evolution by an Iambp of Cortisol Secretion of Br...Stochastic Modeling Neutral Evolution by an Iambp of Cortisol Secretion of Br...
Stochastic Modeling Neutral Evolution by an Iambp of Cortisol Secretion of Br...
IJERA Editor
 
Survival analysis
Survival analysisSurvival analysis
Survival analysis
IbraahimAli3
 
Computational Intelligence for Time Series Prediction
Computational Intelligence for Time Series PredictionComputational Intelligence for Time Series Prediction
Computational Intelligence for Time Series Prediction
Gianluca Bontempi
 

Similar to Poster for Information, probability and inference in systems biology (IPISB 2013) (20)

Integration of biological annotations using hierarchical modeling
Integration of biological annotations using hierarchical modelingIntegration of biological annotations using hierarchical modeling
Integration of biological annotations using hierarchical modeling
 
International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI), International Journal of Engineering Inventions (IJEI),
International Journal of Engineering Inventions (IJEI),
 
MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...
MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...
MUMS Undergraduate Workshop - Introduction to Bayesian Inference & Uncertaint...
 
Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmission
 
Igor Segota: PhD thesis presentation
Igor Segota: PhD thesis presentationIgor Segota: PhD thesis presentation
Igor Segota: PhD thesis presentation
 
Looking Inside Mechanistic Models of Carcinogenesis
Looking Inside Mechanistic Models of CarcinogenesisLooking Inside Mechanistic Models of Carcinogenesis
Looking Inside Mechanistic Models of Carcinogenesis
 
Survival analysis 1
Survival analysis 1Survival analysis 1
Survival analysis 1
 
Markov chain and SIR epidemic model (Greenwood model)
Markov chain and SIR epidemic model (Greenwood model)Markov chain and SIR epidemic model (Greenwood model)
Markov chain and SIR epidemic model (Greenwood model)
 
NeurSciACone
NeurSciAConeNeurSciACone
NeurSciACone
 
Ldb Convergenze Parallele_De barros_01
Ldb Convergenze Parallele_De barros_01Ldb Convergenze Parallele_De barros_01
Ldb Convergenze Parallele_De barros_01
 
Causal Effect Inference with Deep Latent-Variable Models
Causal Effect Inference with Deep Latent-Variable ModelsCausal Effect Inference with Deep Latent-Variable Models
Causal Effect Inference with Deep Latent-Variable Models
 
Talk 5
Talk 5Talk 5
Talk 5
 
American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1American Statistical Association October 23 2009 Presentation Part 1
American Statistical Association October 23 2009 Presentation Part 1
 
lecture1 on survival analysis HRP 262 class
lecture1 on survival analysis HRP 262 classlecture1 on survival analysis HRP 262 class
lecture1 on survival analysis HRP 262 class
 
Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmission
 
population genetics of gene function (talk)
population genetics of gene function (talk)population genetics of gene function (talk)
population genetics of gene function (talk)
 
JMM_Poster_2015
JMM_Poster_2015JMM_Poster_2015
JMM_Poster_2015
 
Stochastic Modeling Neutral Evolution by an Iambp of Cortisol Secretion of Br...
Stochastic Modeling Neutral Evolution by an Iambp of Cortisol Secretion of Br...Stochastic Modeling Neutral Evolution by an Iambp of Cortisol Secretion of Br...
Stochastic Modeling Neutral Evolution by an Iambp of Cortisol Secretion of Br...
 
Survival analysis
Survival analysisSurvival analysis
Survival analysis
 
Computational Intelligence for Time Series Prediction
Computational Intelligence for Time Series PredictionComputational Intelligence for Time Series Prediction
Computational Intelligence for Time Series Prediction
 

More from Colin Gillespie

Bayesian Experimental Design for Stochastic Kinetic Models
Bayesian Experimental Design for Stochastic Kinetic ModelsBayesian Experimental Design for Stochastic Kinetic Models
Bayesian Experimental Design for Stochastic Kinetic Models
Colin Gillespie
 
The tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic modelsThe tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic models
Colin Gillespie
 
Reference classes: a case study with the poweRlaw package
Reference classes: a case study with the poweRlaw packageReference classes: a case study with the poweRlaw package
Reference classes: a case study with the poweRlaw package
Colin Gillespie
 
Introduction to power laws
Introduction to power lawsIntroduction to power laws
Introduction to power laws
Colin Gillespie
 
Moment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic ModelsMoment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic Models
Colin Gillespie
 
An introduction to moment closure techniques
An introduction to moment closure techniquesAn introduction to moment closure techniques
An introduction to moment closure techniques
Colin Gillespie
 
Speeding up the Gillespie algorithm
Speeding up the Gillespie algorithmSpeeding up the Gillespie algorithm
Speeding up the Gillespie algorithm
Colin Gillespie
 
Moment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic modelsMoment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic models
Colin Gillespie
 
WCSB 2012
WCSB 2012 WCSB 2012
WCSB 2012
Colin Gillespie
 
Bayesian inference for stochastic population models with application to aphids
Bayesian inference for stochastic population models with application to aphidsBayesian inference for stochastic population models with application to aphids
Bayesian inference for stochastic population models with application to aphids
Colin Gillespie
 

More from Colin Gillespie (10)

Bayesian Experimental Design for Stochastic Kinetic Models
Bayesian Experimental Design for Stochastic Kinetic ModelsBayesian Experimental Design for Stochastic Kinetic Models
Bayesian Experimental Design for Stochastic Kinetic Models
 
The tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic modelsThe tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic models
 
Reference classes: a case study with the poweRlaw package
Reference classes: a case study with the poweRlaw packageReference classes: a case study with the poweRlaw package
Reference classes: a case study with the poweRlaw package
 
Introduction to power laws
Introduction to power lawsIntroduction to power laws
Introduction to power laws
 
Moment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic ModelsMoment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic Models
 
An introduction to moment closure techniques
An introduction to moment closure techniquesAn introduction to moment closure techniques
An introduction to moment closure techniques
 
Speeding up the Gillespie algorithm
Speeding up the Gillespie algorithmSpeeding up the Gillespie algorithm
Speeding up the Gillespie algorithm
 
Moment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic modelsMoment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic models
 
WCSB 2012
WCSB 2012 WCSB 2012
WCSB 2012
 
Bayesian inference for stochastic population models with application to aphids
Bayesian inference for stochastic population models with application to aphidsBayesian inference for stochastic population models with application to aphids
Bayesian inference for stochastic population models with application to aphids
 

Poster for Information, probability and inference in systems biology (IPISB 2013)

  • 1. Inferring parameters in a large stochastic model using only proportions of cell death: insights from the birth-death model Holly Ainsworth, Richard Boys & Colin Gillespie∗ Newcastle University, UK REFERENCES [1] Tang, M.Y., Proctor, C.J., Woulfe, J., Gray, D.A. Experimental and Computational Analysis of Polyglutamine-Mediated Cytotoxicity In PLoS Com- put Biol., 2010 . INTRODUCTION → • Expanded polyglutamine (PolyQ) proteins are known to be the causative agents in a number of neurodegenerative diseases, but they are still poorly understood. • Aggregation of specific proteins are part of the normal process of ageing in the brain as well as in many age-related diseases. • A causative link between aggregation and disease is not universally accepted. STOCHASTIC MODEL ↓ Stochastic kinetic model with uncertain pa- rameters: • 27 (chemical) species • 70 reactions • 40 rate constants, denoted θ The model aims to explore the relationship between PolyQ, p38MAPK activation, genera- tion of reactive oxygen species (ROS), protea- some inhibition and inclusion body formation. DATA ← Data are proportions (of cell death) - not quan- titative trait measurements Scenario 24hrs 36hrs 48hrs GFP 15.03 14.55 26.08 H25 18.97 18.07 22.50 H103 21.68 23.44 36.44 SIMULATION ↓ Denote the probability of cell death at time t, pt(θ). Note the probability of cell death depends on parameters θ. Given θ, the Gillespie algorithm can be used to simulate the time evolution of a particular cell: • The simulation gives us a binary time series for the cell, with 1 = death and 0 = no death. • Repeating the above for n cells gives us a handle on pt(θ) via the observed proportion of cell death ˆpt(θ), where ˆpt(θ) ∼ 1 n Bin(n, pt(θ)). BIRTH-DEATH MODEL → Let x denote the number of individuals present in the population. In chemical kinetic notation, this system is represented as R1 : x λ −→ 2x (birth) R2 : 2x µ −→ x (death) Example simulations from the model: q qq0 5 10 15 0 4 8 12 Time Population 0.00 0.25 0.50 0.75 1.00 0 4 8 12 Time Probabilityofextinction Simulator n = 10 n = 100 n = 1000 n = 10000 Comparisons with PolyQ model: • Compare a single population governed by a birth-death process with a cell governed by the PolyQ model. • A cell becoming extinct in the birth-death process can be likened to a cell dying in the PolyQ model. • Proportions of extinction from the birth- death model are comparible to propor- tions of cell death from the PolyQ model. An analytic expression for the probability of extinction in the birth-death process for given t, λ, µ and initial population level is available. INFERENCE ↓ Work with proportions on the logit scale and assume data model: yt = logit xt = logit pt(θ) + σ t, t = 1, . . . , T where t ∼ N(0, 1) independently. The posterior of interest is π(θ, σ|y) ∝ π(θ)π(σ)π(y|θ, σ). Approaches to inference: • Vanilla MCMC - approximate the distribution of ˆpt(θ) and account for the uncertainty using elogit ˆpt(θ) ∼ N logit pt(θ), 1 npt(θ)[1 − pt(θ)] approximately. • Pseudo marginal 1 - construct a Monte Carlo estimate of the marginal likelihood. • Pseudo marginal 2 - at each iteration of the MCMC scheme, use a (SIR) particle filter to construct a SMC approximation to the marginal likelihood. RESULTS log(λ) log(µ) log(σ) 0 1 2 3 4 5 0 1 2 3 4 5 0 1 2 3 4 5 n=10n=100n=1000 −3 −2 −1 0 1 −0.5 0.0 0.5 1.0 −3 −2 −1 0 1 Parameter value Density Vanilla MCMC Pseudo−marginal MCMC 1 Pseudo−marginal MCMC 2 Inference using exact probability of death • Both pseudo-marginal schemes involve more work. They use n×#particles runs of the simulator at each iteration, com- pared to n runs for the original scheme. • The pseudo-marginal approach has the advantage that it performs exact infer- ence (for a particular choice of n). • This will be useful particularly when the asymptotic distributional result for elogit ˆpt(θ) is poor for small n. • The SMC approach to estimating the marginal likelihood appears to mix better than the MC approach.