SlideShare a Scribd company logo
1 of 27
Download to read offline
Bayesian inference for the chemical
master equation using approximate
                            models

Colin Gillespie

 Joint work with Andrew Golightly
Modelling
Represent a biochemical network with a set of (pseudo-)biochemical
reactions
    Specify the rate laws and rate constants of the reactions
    Run some stochastic or deterministic computer simulator of the
    system dynamics
    Straightforward using the Gillespie algorithm. The reverse problem is
    trickier – given time course data, and a set of reactions, can we
    recover the rate constants?




                                                                            2/19
Modelling
Generically,

k species and r reactions with a typical reaction
                                               i c
           Ri :   ui1 Y1 + . . . + uik Yk    −→ vi1 Y1 + . . . + vik Yk
                                              −


Stochastic rate constant: ci

Hazard / instantaneous rate: hi (Y (t ), ci ) where
Y (t ) = (Y1 (t ), . . . , Yk (t )) is the current state of the system and

                                                     k
                                                         Yj (t )
                         hi (Y (t ), ci ) = ci   ∏        uij
                                                 j =1


                                                                             3/19
Modelling
Some remarks:
    This setup describes a Markov jump process (MJP)
    The effect of reaction Ri is to change the value of each Yj by vij − uij
    It can be shown that the time to the next reaction is
                                                                         r
    τ ∼ Exp {h0 (Y (t ), c )}        where       h0 ( Y ( t ) , c ) =   ∑ hi (Y (t ), ci )
                                                                        i =1


    and the reaction is of type i with probability hi (Y (t ), ci )/h0 (Y (t ), ci )
    Hence, the process is easily simulated (and this technique is known
    as the Gillespie algorithm)



                                                                                         4/19
Inference for the exact model
Aim: infer the ci given time-course biochemical data. Following Boys et al.
(2008) and Wilkinson (2006):
     Suppose we observe the entire process Y over [0, T ]
     The i th unit interval contains ni reactions with times and types
     (tij , kij ), j = 1, 2, . . . , ni
     Hence, the likelihood for c is

                  T −1 ni                                                T
     π (Y|c ) =    ∏ ∏ hk      ij
                                     Y (ti ,j −1 ), ckij   exp   −
                                                                     0
                                                                             h0 (Y (t ), c ) dt
                   i =0 j =1




                                                                                                  5/19
Inference for the exact model
Aim: infer the ci given time-course biochemical data. Following Boys et al.
(2008) and Wilkinson (2006):
     Suppose we observe the entire process Y over [0, T ]
     The i th unit interval contains ni reactions with times and types
     (tij , kij ), j = 1, 2, . . . , ni
     Hence, the likelihood for c is

                  T −1 ni                                                T
     π (Y|c ) =    ∏ ∏ hk      ij
                                     Y (ti ,j −1 ), ckij   exp   −
                                                                     0
                                                                             h0 (Y (t ), c ) dt
                   i =0 j =1




                                                                                                  5/19
Inference for the exact model
Problem: it is not feasible to observe all reaction times and types
     Assume data are observed on a regular grid with

       Y0:T =    Y (t ) = (Y1 (t ), Y2 (t ), . . . , Yk (t )) : t = 0, 1, 2, . . . , T

Idea: use a Gibbs sampler to alternate between draws of
 1. times and types of reactions in (0, T ] conditional on c and the
    observations,
 2. each ci conditional on the augmented data

Note that step 1 can be performed for each interval (i , i + 1] in turn, due to
the factorisation of π (Y|Y0:T , c )


                                                                                         6/19
Inference for the exact model
Problem: it is not feasible to observe all reaction times and types
     Assume data are observed on a regular grid with

       Y0:T =    Y (t ) = (Y1 (t ), Y2 (t ), . . . , Yk (t )) : t = 0, 1, 2, . . . , T

Idea: use a Gibbs sampler to alternate between draws of
 1. times and types of reactions in (0, T ] conditional on c and the
    observations,
 2. each ci conditional on the augmented data

Note that step 1 can be performed for each interval (i , i + 1] in turn, due to
the factorisation of π (Y|Y0:T , c )


                                                                                         6/19
Inference for the exact model
Problem: it is not feasible to observe all reaction times and types
     Assume data are observed on a regular grid with

       Y0:T =    Y (t ) = (Y1 (t ), Y2 (t ), . . . , Yk (t )) : t = 0, 1, 2, . . . , T

Idea: use a Gibbs sampler to alternate between draws of
 1. times and types of reactions in (0, T ] conditional on c and the
    observations,
 2. each ci conditional on the augmented data

Note that step 1 can be performed for each interval (i , i + 1] in turn, due to
the factorisation of π (Y|Y0:T , c )


                                                                                         6/19
Inference for the exact model
Problem: it is not feasible to observe all reaction times and types
     Assume data are observed on a regular grid with

       Y0:T =    Y (t ) = (Y1 (t ), Y2 (t ), . . . , Yk (t )) : t = 0, 1, 2, . . . , T

Idea: use a Gibbs sampler to alternate between draws of
 1. times and types of reactions in (0, T ] conditional on c and the
    observations,
 2. each ci conditional on the augmented data

Note that step 1 can be performed for each interval (i , i + 1] in turn, due to
the factorisation of π (Y|Y0:T , c )


                                                                                         6/19
Difficulties
These techniques do not scale well to problems of realistic size and
complexity...
True process is discrete and stochastic — stochasticity is vital — what
about discreteness?
Treating molecule numbers as continuous and performing exact
inference for the resulting approximate model appears to be
promising...




                                                                       7/19
Bayesian filtering
Suppose we have noisy observations X0:(i −1) = {X (t ) : t = 0, 1, . . . , i − 1}
where for example,

                            X (t ) = Y (t ) + ,          ∼ N(0, Σ)

Goal: generate a sample from π [c , Y (i )|X0:i ] given a new datum X (i )

π [c , Y (i )|X0:i ] ∝   π [c , Y (i − 1)|X0:i −1 ] πh Y(i −1,i ] |c π [Y (i )|X (i )] dY[i −1,i )
                   ∝               prior     ×     likelihood

where Y(i −1,i ] = {Y (t ) : t ∈ (i − 1, i ]} is the latent path in (i − 1, i ] and πh [·|c ]
is the associated density under the simulator

Idea: if we can sample π [c , Y (i − 1)|X0:i −1 ] then we can use MCMC to sample
the target π [c , Y (i )|X0:i ]

                                                                                                 8/19
Bayesian filtering
Suppose we have noisy observations X0:(i −1) = {X (t ) : t = 0, 1, . . . , i − 1}
where for example,

                            X (t ) = Y (t ) + ,          ∼ N(0, Σ)

Goal: generate a sample from π [c , Y (i )|X0:i ] given a new datum X (i )

π [c , Y (i )|X0:i ] ∝   π [c , Y (i − 1)|X0:i −1 ] πh Y(i −1,i ] |c π [Y (i )|X (i )] dY[i −1,i )
                   ∝               prior     ×     likelihood

where Y(i −1,i ] = {Y (t ) : t ∈ (i − 1, i ]} is the latent path in (i − 1, i ] and πh [·|c ]
is the associated density under the simulator

Idea: if we can sample π [c , Y (i − 1)|X0:i −1 ] then we can use MCMC to sample
the target π [c , Y (i )|X0:i ]

                                                                                                 8/19
Bayesian filtering
Suppose we have noisy observations X0:(i −1) = {X (t ) : t = 0, 1, . . . , i − 1}
where for example,

                            X (t ) = Y (t ) + ,          ∼ N(0, Σ)

Goal: generate a sample from π [c , Y (i )|X0:i ] given a new datum X (i )

π [c , Y (i )|X0:i ] ∝   π [c , Y (i − 1)|X0:i −1 ] πh Y(i −1,i ] |c π [Y (i )|X (i )] dY[i −1,i )
                   ∝               prior     ×     likelihood

where Y(i −1,i ] = {Y (t ) : t ∈ (i − 1, i ]} is the latent path in (i − 1, i ] and πh [·|c ]
is the associated density under the simulator

Idea: if we can sample π [c , Y (i − 1)|X0:i −1 ] then we can use MCMC to sample
the target π [c , Y (i )|X0:i ]

                                                                                                 8/19
Idealised MCMC scheme
 1. Propose (c ∗ , Y (i − 1)∗ ) ∼ π [ · |X0:i −1 ]
 2. Draw Y(∗−1,i ] ∼ πh [ · |c ∗ ] by forward simulation
            i
 3. Accept/reject with probability

                                       π [Y (i )∗ |X (i )]
                           min    1,                         .
                                       π [Y (i )|X (i )]

Since the simulator is used as a proposal process, we don’t need to
evaluate its associated density.

Similar to Approximate Bayesian Computation (ABC)




                                                                      9/19
A particle approach
Since π [c , Y (i − 1)|X0:i −1 ] does not have analytic form (typically)
we approximate this density with an equally weighted cloud of points
or particles, hence the term particle filter
The scheme is initialised with a sample from the prior and a posterior
sample calculated at time i − 1 is used as the prior at time i
To avoid sample impoverishment, in step 1 of the scheme we sample
the kernel density estimate (KDE) of π [c ∗ , Y (i − 1)∗ |X0:i −1 ] by
adding zero mean Normal noise to each proposed particle




                                                                           10/19
Three types of approximation
The distribution π [c , Y (i − 1)|X0:i −1 ] is represented as a discrete
distribution - in theory as the number of particles N increases, the
approximation improves
We perturb particles - as N increases, perturbation decreases
Simulator
     Exact: Gillespie
     Approximate: SDE, but can be perform poorly for low molecule
     numbers
     Approximate: ODE, but ignores stochasticity




                                                                           11/19
Three types of approximation
The distribution π [c , Y (i − 1)|X0:i −1 ] is represented as a discrete
distribution - in theory as the number of particles N increases, the
approximation improves
We perturb particles - as N increases, perturbation decreases
Simulator
     Exact: Gillespie
     Approximate: SDE, but can be perform poorly for low molecule
     numbers
     Approximate: ODE, but ignores stochasticity




                                                                           11/19
Three types of approximation
The distribution π [c , Y (i − 1)|X0:i −1 ] is represented as a discrete
distribution - in theory as the number of particles N increases, the
approximation improves
We perturb particles - as N increases, perturbation decreases
Simulator
     Exact: Gillespie
     Approximate: SDE, but can be perform poorly for low molecule
     numbers
     Approximate: ODE, but ignores stochasticity




                                                                           11/19
Case study
    Wish to compare the hybrid scheme with inferences made assuming the
    “true” discrete, stochastic model
    To allow such inferences, consider a toy example:

Autoregulatory network
                                          c1                               c2
    Immigration                 R1 : ∅ − → Y1
                                        −                       R2 : ∅ − → Y2
                                                                        −
                                           c3                              c4
         Death                  R 3 : Y1 − → ∅
                                          −                     R4 : Y2 − → ∅
                                                                         −
                                                 c5
     Interaction                R5 : Y1 + Y2 − → 2Y2 .
                                              −

    Corrupt the data by constructing

                                       Poisson (Yi (t )) if Yi (t ) > 0,
                   Xi (t )|Yi (t ) ∼
                                       Bernoulli(0.1) if Yi (t ) = 0.

    for each i = 1, 2 so that Y (t ) is not observed anywhere
                                                                                12/19
Stochastic vs deterministic
                                    c2 = 10                                       c2 = 1.0


             1500




             1000




              500
Population




               0

                                    c2 = 0.1                                  c2 = 0.01


             1500




             1000




              500




               0

                    0   200   400              600   800   1000   0   200   400              600   800   1000
                                                             Time




                                                                                                                13/19
Stochastic vs deterministic
                                            ODE steady state                                  Stochastic steady state

                         104


                                                                                                                           q
                                                                                                                       q
                                                                                                                   q
                                                                                                               q
c2 = 10 × scale          103
                                                                                                           q
                                                                                                                                     Scale

                                                                                                                                     q   10−5
                                                                                                q
(Y2 imm’ rate)                                                                                                                       q   10−4
                    Y2

                                                                                               q                                     q   10−3
                                                                                              q
c3 = 0.01 × scale                                                                            q
                                                                                                                                     q
                                                                                                                                     q
                                                                                                                                         10−2

                                                                                                                                         10−1
                         102
                                                                                            q
(Y1 death rate)                                                                            q
                                                                                          q
                                                                                          q
                                                                                         q
                                     q                                                q qq
                                                                                         q
                                      qq
                                      qq
                                       q                                               qq
                                                                                       qq
                                                                                        q
                         101


                               102    103         104          105   106        102     103            104              105    106
                                                                           Y1




                                                                                                                                     14/19
Autoregulatory network

Initial conditions:                                     120

                                                                                                                      q



Y1 (0) = Y2 (0) = 10
                                                                                               q
                                                                                                              q
                                                                                                        q         q
                                                                              q                                               q
                                                                                                                          q

Rate constants: c1 = 10, c2 = 0.1,                                                     q           q




                                     Population Level
                                                         80                                q
                                                                                  q                                                q
                                                                          q

c3 = 0.1, c4 = 0.7 and c5 = 0.008.                                                                                                                 q



                                                                                                                                               q
                                                                                                                                       q

Y1 : solid red line                                                   q



                                                         40
                                                                  q


Y2 : blue dashed lines
                                                                                                                                           q




                                                                                                                                           q


Solid points: twenty one noised                               q

                                                              q                            q
                                                                                                                                       q       q

                                                                                       q                q
                                                         0        q   q   q   q   q            q   q          q   q   q   q   q    q               q

points used for inference                                     0                   25                    50                    75                   100
                                                                                                       Time




                                                                                                                                                   15/19
Results: N = 30, 000 particles
                         0.5
                                                                                          30                                                               30



                         0.4                                                              25                                                               25



                                                                                          20                                                               20
                         0.3




                                                                                                                                                 Density
               Density




                                                                                Density
                                                                                          15                                                               15

                         0.2


Vague priors                                                                              10                                                               10


                         0.1
                                                                                                                                                            5

used
                                                                                           5



                         0.0                                                               0                                                                0

                                 0   5         10    15   20         25   30                       0.0     0.1           0.2       0.3    0.4                   0.0   0.1   0.2   0.3   0.4
                                                     c1                                                                  c2                                                 c3


 Gillespie               8                                                                400



 SDE
                         6                                                                300


 ODE
                                                                                Density
               Density




                         4                                                                200




                         2                                                                100




                         0                                                                     0

                               0.0       0.5        1.0        1.5        2.0                      0.000         0.015         0.030     0.045
                                                    c4                                                                   c5




                                                                                                                                                                                              16/19
Summary
Inferring rate constants that govern discrete stochastic kinetic models
is computationally challenging
It appears promising to consider an approximation of the model and
perform exact inference using the approximate model
In this case study, the SDE and ODE models performed surprisingly
well
     Although the ODE model had poor MCMC mixing properties




                                                                      17/19
Future
A hybrid forwards simulator (or indeed any forwards simulator) can be
used as a proposal process inside a particle filter
    However, for this model the Salis & Kaznessis hybrid simulator is
    slower than the standard Gillespie algorithm
A particle MCMC approach provides an exact inference procedure
(see recent Andrieu et al JRSS B read paper)
    Preprint available using this PMCMC. This paper also considers the
    hybrid simulator and the linear noise approximation




                                                                         18/19
Boys, R. J., Wilkinson, D.J. and T.B.L. Kirkwood (2008). Bayesian inference for a discretely
    observed stochastic kinetic model. Statistics and Computing 18.
    Gillespie, C. S. and Golightly, A. and (2010). Bayesian inference for generalized stochastic
    population growth models with application to aphids. JRSS Series C 59(2).
    Heron, E. A., Finkenstadt, B. and D. A. Hand (2007). Bayesian inference for dynamic
    transcriptional regulation; the Hes1 system as a case study. Bioinformatics 23.
    Salis, H. and Y. Kaznessis (2005). Accurate hybrid simulation of a system of coupled
    biochemical reactions. Journal of Chemical Physics 122.
    Wilkinson, D. J. (2006). Stochastic Modelling for Systems Biology. Chapman & Hall/CRC Press.

Contact details...
email: colin.gillespie@ncl.ac.uk
www: http://www.mas.ncl.ac.uk/~ncsg3/




                                                                                                   19/19

More Related Content

What's hot

Lecture on nk [compatibility mode]
Lecture on nk [compatibility mode]Lecture on nk [compatibility mode]
Lecture on nk [compatibility mode]NBER
 
Paris2012 session1
Paris2012 session1Paris2012 session1
Paris2012 session1Cdiscount
 
Paris2012 session2
Paris2012 session2Paris2012 session2
Paris2012 session2Cdiscount
 
State Space Model
State Space ModelState Space Model
State Space ModelCdiscount
 
Lesson 13: Related Rates Problems
Lesson 13: Related Rates ProblemsLesson 13: Related Rates Problems
Lesson 13: Related Rates ProblemsMatthew Leingang
 
Prévision de consommation électrique avec adaptive GAM
Prévision de consommation électrique avec adaptive GAMPrévision de consommation électrique avec adaptive GAM
Prévision de consommation électrique avec adaptive GAMCdiscount
 
Boris Blagov. Financial Crises and Time-Varying Risk Premia in a Small Open E...
Boris Blagov. Financial Crises and Time-Varying Risk Premia in a Small Open E...Boris Blagov. Financial Crises and Time-Varying Risk Premia in a Small Open E...
Boris Blagov. Financial Crises and Time-Varying Risk Premia in a Small Open E...Eesti Pank
 
Lesson15 -exponential_growth_and_decay_021_slides
Lesson15  -exponential_growth_and_decay_021_slidesLesson15  -exponential_growth_and_decay_021_slides
Lesson15 -exponential_growth_and_decay_021_slidesMatthew Leingang
 
On the solvability of a system of forward-backward linear equations with unbo...
On the solvability of a system of forward-backward linear equations with unbo...On the solvability of a system of forward-backward linear equations with unbo...
On the solvability of a system of forward-backward linear equations with unbo...Nikita V. Artamonov
 
Olivier Cappé's talk at BigMC March 2011
Olivier Cappé's talk at BigMC March 2011Olivier Cappé's talk at BigMC March 2011
Olivier Cappé's talk at BigMC March 2011BigMC
 
5. cem granger causality ecm
5. cem granger causality  ecm 5. cem granger causality  ecm
5. cem granger causality ecm Quang Hoang
 
2003 Ames.Models
2003 Ames.Models2003 Ames.Models
2003 Ames.Modelspinchung
 
Speeding up the Gillespie algorithm
Speeding up the Gillespie algorithmSpeeding up the Gillespie algorithm
Speeding up the Gillespie algorithmColin Gillespie
 
New Mathematical Tools for the Financial Sector
New Mathematical Tools for the Financial SectorNew Mathematical Tools for the Financial Sector
New Mathematical Tools for the Financial SectorSSA KPI
 
Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4Fabian Pedregosa
 

What's hot (18)

Rouviere
RouviereRouviere
Rouviere
 
Dynamic Tumor Growth Modelling
Dynamic Tumor Growth ModellingDynamic Tumor Growth Modelling
Dynamic Tumor Growth Modelling
 
Lecture on nk [compatibility mode]
Lecture on nk [compatibility mode]Lecture on nk [compatibility mode]
Lecture on nk [compatibility mode]
 
Paris2012 session1
Paris2012 session1Paris2012 session1
Paris2012 session1
 
Paris2012 session2
Paris2012 session2Paris2012 session2
Paris2012 session2
 
State Space Model
State Space ModelState Space Model
State Space Model
 
Lesson 13: Related Rates Problems
Lesson 13: Related Rates ProblemsLesson 13: Related Rates Problems
Lesson 13: Related Rates Problems
 
Prévision de consommation électrique avec adaptive GAM
Prévision de consommation électrique avec adaptive GAMPrévision de consommation électrique avec adaptive GAM
Prévision de consommation électrique avec adaptive GAM
 
Boris Blagov. Financial Crises and Time-Varying Risk Premia in a Small Open E...
Boris Blagov. Financial Crises and Time-Varying Risk Premia in a Small Open E...Boris Blagov. Financial Crises and Time-Varying Risk Premia in a Small Open E...
Boris Blagov. Financial Crises and Time-Varying Risk Premia in a Small Open E...
 
Lesson15 -exponential_growth_and_decay_021_slides
Lesson15  -exponential_growth_and_decay_021_slidesLesson15  -exponential_growth_and_decay_021_slides
Lesson15 -exponential_growth_and_decay_021_slides
 
On the solvability of a system of forward-backward linear equations with unbo...
On the solvability of a system of forward-backward linear equations with unbo...On the solvability of a system of forward-backward linear equations with unbo...
On the solvability of a system of forward-backward linear equations with unbo...
 
Olivier Cappé's talk at BigMC March 2011
Olivier Cappé's talk at BigMC March 2011Olivier Cappé's talk at BigMC March 2011
Olivier Cappé's talk at BigMC March 2011
 
5. cem granger causality ecm
5. cem granger causality  ecm 5. cem granger causality  ecm
5. cem granger causality ecm
 
2003 Ames.Models
2003 Ames.Models2003 Ames.Models
2003 Ames.Models
 
Speeding up the Gillespie algorithm
Speeding up the Gillespie algorithmSpeeding up the Gillespie algorithm
Speeding up the Gillespie algorithm
 
New Mathematical Tools for the Financial Sector
New Mathematical Tools for the Financial SectorNew Mathematical Tools for the Financial Sector
New Mathematical Tools for the Financial Sector
 
Midsem sol 2013
Midsem sol 2013Midsem sol 2013
Midsem sol 2013
 
Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4Random Matrix Theory and Machine Learning - Part 4
Random Matrix Theory and Machine Learning - Part 4
 

Viewers also liked

Apprendre python3 arab
Apprendre python3 arabApprendre python3 arab
Apprendre python3 arabzan
 
Scilabisnotnaive
ScilabisnotnaiveScilabisnotnaive
Scilabisnotnaivezan
 
Bayesian inference for stochastic population models with application to aphids
Bayesian inference for stochastic population models with application to aphidsBayesian inference for stochastic population models with application to aphids
Bayesian inference for stochastic population models with application to aphidsColin Gillespie
 
An introduction to moment closure techniques
An introduction to moment closure techniquesAn introduction to moment closure techniques
An introduction to moment closure techniquesColin Gillespie
 
Bayesian Experimental Design for Stochastic Kinetic Models
Bayesian Experimental Design for Stochastic Kinetic ModelsBayesian Experimental Design for Stochastic Kinetic Models
Bayesian Experimental Design for Stochastic Kinetic ModelsColin Gillespie
 
Moment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic modelsMoment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic modelsColin Gillespie
 
BMW from the future
BMW from the futureBMW from the future
BMW from the futurecateof
 
Moment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic ModelsMoment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic ModelsColin Gillespie
 
Reference classes: a case study with the poweRlaw package
Reference classes: a case study with the poweRlaw packageReference classes: a case study with the poweRlaw package
Reference classes: a case study with the poweRlaw packageColin Gillespie
 
Gnu linux advanced administration arabic
Gnu linux advanced administration arabicGnu linux advanced administration arabic
Gnu linux advanced administration arabiczan
 
Metro Louisville Urban Development: SoBro
Metro Louisville Urban Development: SoBroMetro Louisville Urban Development: SoBro
Metro Louisville Urban Development: SoBroBrent Collins
 
Introduction to power laws
Introduction to power lawsIntroduction to power laws
Introduction to power lawsColin Gillespie
 
77232345 cours-ip-mobile
77232345 cours-ip-mobile77232345 cours-ip-mobile
77232345 cours-ip-mobilezan
 

Viewers also liked (19)

Dallas walking tours
Dallas walking toursDallas walking tours
Dallas walking tours
 
November2009
November2009November2009
November2009
 
Apprendre python3 arab
Apprendre python3 arabApprendre python3 arab
Apprendre python3 arab
 
Scilabisnotnaive
ScilabisnotnaiveScilabisnotnaive
Scilabisnotnaive
 
Bayesian inference for stochastic population models with application to aphids
Bayesian inference for stochastic population models with application to aphidsBayesian inference for stochastic population models with application to aphids
Bayesian inference for stochastic population models with application to aphids
 
An introduction to moment closure techniques
An introduction to moment closure techniquesAn introduction to moment closure techniques
An introduction to moment closure techniques
 
venkat cv
venkat cvvenkat cv
venkat cv
 
Bayesian Experimental Design for Stochastic Kinetic Models
Bayesian Experimental Design for Stochastic Kinetic ModelsBayesian Experimental Design for Stochastic Kinetic Models
Bayesian Experimental Design for Stochastic Kinetic Models
 
Marketing Tips
Marketing TipsMarketing Tips
Marketing Tips
 
Moment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic modelsMoment closure inference for stochastic kinetic models
Moment closure inference for stochastic kinetic models
 
BMW from the future
BMW from the futureBMW from the future
BMW from the future
 
Active passive
Active passiveActive passive
Active passive
 
Moment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic ModelsMoment Closure Based Parameter Inference of Stochastic Kinetic Models
Moment Closure Based Parameter Inference of Stochastic Kinetic Models
 
Reference classes: a case study with the poweRlaw package
Reference classes: a case study with the poweRlaw packageReference classes: a case study with the poweRlaw package
Reference classes: a case study with the poweRlaw package
 
Gnu linux advanced administration arabic
Gnu linux advanced administration arabicGnu linux advanced administration arabic
Gnu linux advanced administration arabic
 
Metro Louisville Urban Development: SoBro
Metro Louisville Urban Development: SoBroMetro Louisville Urban Development: SoBro
Metro Louisville Urban Development: SoBro
 
Introduction to power laws
Introduction to power lawsIntroduction to power laws
Introduction to power laws
 
Active voice - passive voice
Active voice - passive voice Active voice - passive voice
Active voice - passive voice
 
77232345 cours-ip-mobile
77232345 cours-ip-mobile77232345 cours-ip-mobile
77232345 cours-ip-mobile
 

Similar to WCSB 2012

The tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic modelsThe tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic modelsColin Gillespie
 
Parameter Estimation in Stochastic Differential Equations by Continuous Optim...
Parameter Estimation in Stochastic Differential Equations by Continuous Optim...Parameter Estimation in Stochastic Differential Equations by Continuous Optim...
Parameter Estimation in Stochastic Differential Equations by Continuous Optim...SSA KPI
 
1. Consider experiments with the following censoring mechanism A gr.docx
1. Consider experiments with the following censoring mechanism A gr.docx1. Consider experiments with the following censoring mechanism A gr.docx
1. Consider experiments with the following censoring mechanism A gr.docxstilliegeorgiana
 
On estimating the integrated co volatility using
On estimating the integrated co volatility usingOn estimating the integrated co volatility using
On estimating the integrated co volatility usingkkislas
 
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...Chiheb Ben Hammouda
 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsJulyan Arbel
 
Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...Umberto Picchini
 
Volatility derivatives and default risk
Volatility derivatives and default riskVolatility derivatives and default risk
Volatility derivatives and default riskVolatility
 
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...Chiheb Ben Hammouda
 
Monash University short course, part II
Monash University short course, part IIMonash University short course, part II
Monash University short course, part IIChristian Robert
 
Mcmc & lkd free II
Mcmc & lkd free IIMcmc & lkd free II
Mcmc & lkd free IIDeb Roy
 
Scalable inference for a full multivariate stochastic volatility
Scalable inference for a full multivariate stochastic volatilityScalable inference for a full multivariate stochastic volatility
Scalable inference for a full multivariate stochastic volatilitySYRTO Project
 
Sequential experimentation in clinical trials
Sequential experimentation in clinical trialsSequential experimentation in clinical trials
Sequential experimentation in clinical trialsSpringer
 
kinks and cusps in the transition dynamics of a bloch state
kinks and cusps in the transition dynamics of a bloch statekinks and cusps in the transition dynamics of a bloch state
kinks and cusps in the transition dynamics of a bloch statejiang-min zhang
 
Comparing estimation algorithms for block clustering models
Comparing estimation algorithms for block clustering modelsComparing estimation algorithms for block clustering models
Comparing estimation algorithms for block clustering modelsBigMC
 
Stat150 spring06 markov_cts
Stat150 spring06 markov_ctsStat150 spring06 markov_cts
Stat150 spring06 markov_ctsmbfrosh
 

Similar to WCSB 2012 (20)

PMED Transition Workshop - A Bayesian Model for Joint Longitudinal and Surviv...
PMED Transition Workshop - A Bayesian Model for Joint Longitudinal and Surviv...PMED Transition Workshop - A Bayesian Model for Joint Longitudinal and Surviv...
PMED Transition Workshop - A Bayesian Model for Joint Longitudinal and Surviv...
 
The tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic modelsThe tau-leap method for simulating stochastic kinetic models
The tau-leap method for simulating stochastic kinetic models
 
Parameter Estimation in Stochastic Differential Equations by Continuous Optim...
Parameter Estimation in Stochastic Differential Equations by Continuous Optim...Parameter Estimation in Stochastic Differential Equations by Continuous Optim...
Parameter Estimation in Stochastic Differential Equations by Continuous Optim...
 
1. Consider experiments with the following censoring mechanism A gr.docx
1. Consider experiments with the following censoring mechanism A gr.docx1. Consider experiments with the following censoring mechanism A gr.docx
1. Consider experiments with the following censoring mechanism A gr.docx
 
On estimating the integrated co volatility using
On estimating the integrated co volatility usingOn estimating the integrated co volatility using
On estimating the integrated co volatility using
 
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
Seminar Talk: Multilevel Hybrid Split Step Implicit Tau-Leap for Stochastic R...
 
Congrès SMAI 2019
Congrès SMAI 2019Congrès SMAI 2019
Congrès SMAI 2019
 
Dependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian NonparametricsDependent processes in Bayesian Nonparametrics
Dependent processes in Bayesian Nonparametrics
 
Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...Bayesian inference for mixed-effects models driven by SDEs and other stochast...
Bayesian inference for mixed-effects models driven by SDEs and other stochast...
 
Volatility derivatives and default risk
Volatility derivatives and default riskVolatility derivatives and default risk
Volatility derivatives and default risk
 
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
MCQMC 2020 talk: Importance Sampling for a Robust and Efficient Multilevel Mo...
 
Monash University short course, part II
Monash University short course, part IIMonash University short course, part II
Monash University short course, part II
 
Mcmc & lkd free II
Mcmc & lkd free IIMcmc & lkd free II
Mcmc & lkd free II
 
Lecture_9.pdf
Lecture_9.pdfLecture_9.pdf
Lecture_9.pdf
 
lecture3_2.pdf
lecture3_2.pdflecture3_2.pdf
lecture3_2.pdf
 
Scalable inference for a full multivariate stochastic volatility
Scalable inference for a full multivariate stochastic volatilityScalable inference for a full multivariate stochastic volatility
Scalable inference for a full multivariate stochastic volatility
 
Sequential experimentation in clinical trials
Sequential experimentation in clinical trialsSequential experimentation in clinical trials
Sequential experimentation in clinical trials
 
kinks and cusps in the transition dynamics of a bloch state
kinks and cusps in the transition dynamics of a bloch statekinks and cusps in the transition dynamics of a bloch state
kinks and cusps in the transition dynamics of a bloch state
 
Comparing estimation algorithms for block clustering models
Comparing estimation algorithms for block clustering modelsComparing estimation algorithms for block clustering models
Comparing estimation algorithms for block clustering models
 
Stat150 spring06 markov_cts
Stat150 spring06 markov_ctsStat150 spring06 markov_cts
Stat150 spring06 markov_cts
 

Recently uploaded

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 

Recently uploaded (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 

WCSB 2012

  • 1. Bayesian inference for the chemical master equation using approximate models Colin Gillespie Joint work with Andrew Golightly
  • 2. Modelling Represent a biochemical network with a set of (pseudo-)biochemical reactions Specify the rate laws and rate constants of the reactions Run some stochastic or deterministic computer simulator of the system dynamics Straightforward using the Gillespie algorithm. The reverse problem is trickier – given time course data, and a set of reactions, can we recover the rate constants? 2/19
  • 3. Modelling Generically, k species and r reactions with a typical reaction i c Ri : ui1 Y1 + . . . + uik Yk −→ vi1 Y1 + . . . + vik Yk − Stochastic rate constant: ci Hazard / instantaneous rate: hi (Y (t ), ci ) where Y (t ) = (Y1 (t ), . . . , Yk (t )) is the current state of the system and k Yj (t ) hi (Y (t ), ci ) = ci ∏ uij j =1 3/19
  • 4. Modelling Some remarks: This setup describes a Markov jump process (MJP) The effect of reaction Ri is to change the value of each Yj by vij − uij It can be shown that the time to the next reaction is r τ ∼ Exp {h0 (Y (t ), c )} where h0 ( Y ( t ) , c ) = ∑ hi (Y (t ), ci ) i =1 and the reaction is of type i with probability hi (Y (t ), ci )/h0 (Y (t ), ci ) Hence, the process is easily simulated (and this technique is known as the Gillespie algorithm) 4/19
  • 5. Inference for the exact model Aim: infer the ci given time-course biochemical data. Following Boys et al. (2008) and Wilkinson (2006): Suppose we observe the entire process Y over [0, T ] The i th unit interval contains ni reactions with times and types (tij , kij ), j = 1, 2, . . . , ni Hence, the likelihood for c is T −1 ni T π (Y|c ) = ∏ ∏ hk ij Y (ti ,j −1 ), ckij exp − 0 h0 (Y (t ), c ) dt i =0 j =1 5/19
  • 6. Inference for the exact model Aim: infer the ci given time-course biochemical data. Following Boys et al. (2008) and Wilkinson (2006): Suppose we observe the entire process Y over [0, T ] The i th unit interval contains ni reactions with times and types (tij , kij ), j = 1, 2, . . . , ni Hence, the likelihood for c is T −1 ni T π (Y|c ) = ∏ ∏ hk ij Y (ti ,j −1 ), ckij exp − 0 h0 (Y (t ), c ) dt i =0 j =1 5/19
  • 7. Inference for the exact model Problem: it is not feasible to observe all reaction times and types Assume data are observed on a regular grid with Y0:T = Y (t ) = (Y1 (t ), Y2 (t ), . . . , Yk (t )) : t = 0, 1, 2, . . . , T Idea: use a Gibbs sampler to alternate between draws of 1. times and types of reactions in (0, T ] conditional on c and the observations, 2. each ci conditional on the augmented data Note that step 1 can be performed for each interval (i , i + 1] in turn, due to the factorisation of π (Y|Y0:T , c ) 6/19
  • 8. Inference for the exact model Problem: it is not feasible to observe all reaction times and types Assume data are observed on a regular grid with Y0:T = Y (t ) = (Y1 (t ), Y2 (t ), . . . , Yk (t )) : t = 0, 1, 2, . . . , T Idea: use a Gibbs sampler to alternate between draws of 1. times and types of reactions in (0, T ] conditional on c and the observations, 2. each ci conditional on the augmented data Note that step 1 can be performed for each interval (i , i + 1] in turn, due to the factorisation of π (Y|Y0:T , c ) 6/19
  • 9. Inference for the exact model Problem: it is not feasible to observe all reaction times and types Assume data are observed on a regular grid with Y0:T = Y (t ) = (Y1 (t ), Y2 (t ), . . . , Yk (t )) : t = 0, 1, 2, . . . , T Idea: use a Gibbs sampler to alternate between draws of 1. times and types of reactions in (0, T ] conditional on c and the observations, 2. each ci conditional on the augmented data Note that step 1 can be performed for each interval (i , i + 1] in turn, due to the factorisation of π (Y|Y0:T , c ) 6/19
  • 10. Inference for the exact model Problem: it is not feasible to observe all reaction times and types Assume data are observed on a regular grid with Y0:T = Y (t ) = (Y1 (t ), Y2 (t ), . . . , Yk (t )) : t = 0, 1, 2, . . . , T Idea: use a Gibbs sampler to alternate between draws of 1. times and types of reactions in (0, T ] conditional on c and the observations, 2. each ci conditional on the augmented data Note that step 1 can be performed for each interval (i , i + 1] in turn, due to the factorisation of π (Y|Y0:T , c ) 6/19
  • 11. Difficulties These techniques do not scale well to problems of realistic size and complexity... True process is discrete and stochastic — stochasticity is vital — what about discreteness? Treating molecule numbers as continuous and performing exact inference for the resulting approximate model appears to be promising... 7/19
  • 12. Bayesian filtering Suppose we have noisy observations X0:(i −1) = {X (t ) : t = 0, 1, . . . , i − 1} where for example, X (t ) = Y (t ) + , ∼ N(0, Σ) Goal: generate a sample from π [c , Y (i )|X0:i ] given a new datum X (i ) π [c , Y (i )|X0:i ] ∝ π [c , Y (i − 1)|X0:i −1 ] πh Y(i −1,i ] |c π [Y (i )|X (i )] dY[i −1,i ) ∝ prior × likelihood where Y(i −1,i ] = {Y (t ) : t ∈ (i − 1, i ]} is the latent path in (i − 1, i ] and πh [·|c ] is the associated density under the simulator Idea: if we can sample π [c , Y (i − 1)|X0:i −1 ] then we can use MCMC to sample the target π [c , Y (i )|X0:i ] 8/19
  • 13. Bayesian filtering Suppose we have noisy observations X0:(i −1) = {X (t ) : t = 0, 1, . . . , i − 1} where for example, X (t ) = Y (t ) + , ∼ N(0, Σ) Goal: generate a sample from π [c , Y (i )|X0:i ] given a new datum X (i ) π [c , Y (i )|X0:i ] ∝ π [c , Y (i − 1)|X0:i −1 ] πh Y(i −1,i ] |c π [Y (i )|X (i )] dY[i −1,i ) ∝ prior × likelihood where Y(i −1,i ] = {Y (t ) : t ∈ (i − 1, i ]} is the latent path in (i − 1, i ] and πh [·|c ] is the associated density under the simulator Idea: if we can sample π [c , Y (i − 1)|X0:i −1 ] then we can use MCMC to sample the target π [c , Y (i )|X0:i ] 8/19
  • 14. Bayesian filtering Suppose we have noisy observations X0:(i −1) = {X (t ) : t = 0, 1, . . . , i − 1} where for example, X (t ) = Y (t ) + , ∼ N(0, Σ) Goal: generate a sample from π [c , Y (i )|X0:i ] given a new datum X (i ) π [c , Y (i )|X0:i ] ∝ π [c , Y (i − 1)|X0:i −1 ] πh Y(i −1,i ] |c π [Y (i )|X (i )] dY[i −1,i ) ∝ prior × likelihood where Y(i −1,i ] = {Y (t ) : t ∈ (i − 1, i ]} is the latent path in (i − 1, i ] and πh [·|c ] is the associated density under the simulator Idea: if we can sample π [c , Y (i − 1)|X0:i −1 ] then we can use MCMC to sample the target π [c , Y (i )|X0:i ] 8/19
  • 15. Idealised MCMC scheme 1. Propose (c ∗ , Y (i − 1)∗ ) ∼ π [ · |X0:i −1 ] 2. Draw Y(∗−1,i ] ∼ πh [ · |c ∗ ] by forward simulation i 3. Accept/reject with probability π [Y (i )∗ |X (i )] min 1, . π [Y (i )|X (i )] Since the simulator is used as a proposal process, we don’t need to evaluate its associated density. Similar to Approximate Bayesian Computation (ABC) 9/19
  • 16. A particle approach Since π [c , Y (i − 1)|X0:i −1 ] does not have analytic form (typically) we approximate this density with an equally weighted cloud of points or particles, hence the term particle filter The scheme is initialised with a sample from the prior and a posterior sample calculated at time i − 1 is used as the prior at time i To avoid sample impoverishment, in step 1 of the scheme we sample the kernel density estimate (KDE) of π [c ∗ , Y (i − 1)∗ |X0:i −1 ] by adding zero mean Normal noise to each proposed particle 10/19
  • 17. Three types of approximation The distribution π [c , Y (i − 1)|X0:i −1 ] is represented as a discrete distribution - in theory as the number of particles N increases, the approximation improves We perturb particles - as N increases, perturbation decreases Simulator Exact: Gillespie Approximate: SDE, but can be perform poorly for low molecule numbers Approximate: ODE, but ignores stochasticity 11/19
  • 18. Three types of approximation The distribution π [c , Y (i − 1)|X0:i −1 ] is represented as a discrete distribution - in theory as the number of particles N increases, the approximation improves We perturb particles - as N increases, perturbation decreases Simulator Exact: Gillespie Approximate: SDE, but can be perform poorly for low molecule numbers Approximate: ODE, but ignores stochasticity 11/19
  • 19. Three types of approximation The distribution π [c , Y (i − 1)|X0:i −1 ] is represented as a discrete distribution - in theory as the number of particles N increases, the approximation improves We perturb particles - as N increases, perturbation decreases Simulator Exact: Gillespie Approximate: SDE, but can be perform poorly for low molecule numbers Approximate: ODE, but ignores stochasticity 11/19
  • 20. Case study Wish to compare the hybrid scheme with inferences made assuming the “true” discrete, stochastic model To allow such inferences, consider a toy example: Autoregulatory network c1 c2 Immigration R1 : ∅ − → Y1 − R2 : ∅ − → Y2 − c3 c4 Death R 3 : Y1 − → ∅ − R4 : Y2 − → ∅ − c5 Interaction R5 : Y1 + Y2 − → 2Y2 . − Corrupt the data by constructing Poisson (Yi (t )) if Yi (t ) > 0, Xi (t )|Yi (t ) ∼ Bernoulli(0.1) if Yi (t ) = 0. for each i = 1, 2 so that Y (t ) is not observed anywhere 12/19
  • 21. Stochastic vs deterministic c2 = 10 c2 = 1.0 1500 1000 500 Population 0 c2 = 0.1 c2 = 0.01 1500 1000 500 0 0 200 400 600 800 1000 0 200 400 600 800 1000 Time 13/19
  • 22. Stochastic vs deterministic ODE steady state Stochastic steady state 104 q q q q c2 = 10 × scale 103 q Scale q 10−5 q (Y2 imm’ rate) q 10−4 Y2 q q 10−3 q c3 = 0.01 × scale q q q 10−2 10−1 102 q (Y1 death rate) q q q q q q qq q qq qq q qq qq q 101 102 103 104 105 106 102 103 104 105 106 Y1 14/19
  • 23. Autoregulatory network Initial conditions: 120 q Y1 (0) = Y2 (0) = 10 q q q q q q q Rate constants: c1 = 10, c2 = 0.1, q q Population Level 80 q q q q c3 = 0.1, c4 = 0.7 and c5 = 0.008. q q q Y1 : solid red line q 40 q Y2 : blue dashed lines q q Solid points: twenty one noised q q q q q q q 0 q q q q q q q q q q q q q q points used for inference 0 25 50 75 100 Time 15/19
  • 24. Results: N = 30, 000 particles 0.5 30 30 0.4 25 25 20 20 0.3 Density Density Density 15 15 0.2 Vague priors 10 10 0.1 5 used 5 0.0 0 0 0 5 10 15 20 25 30 0.0 0.1 0.2 0.3 0.4 0.0 0.1 0.2 0.3 0.4 c1 c2 c3 Gillespie 8 400 SDE 6 300 ODE Density Density 4 200 2 100 0 0 0.0 0.5 1.0 1.5 2.0 0.000 0.015 0.030 0.045 c4 c5 16/19
  • 25. Summary Inferring rate constants that govern discrete stochastic kinetic models is computationally challenging It appears promising to consider an approximation of the model and perform exact inference using the approximate model In this case study, the SDE and ODE models performed surprisingly well Although the ODE model had poor MCMC mixing properties 17/19
  • 26. Future A hybrid forwards simulator (or indeed any forwards simulator) can be used as a proposal process inside a particle filter However, for this model the Salis & Kaznessis hybrid simulator is slower than the standard Gillespie algorithm A particle MCMC approach provides an exact inference procedure (see recent Andrieu et al JRSS B read paper) Preprint available using this PMCMC. This paper also considers the hybrid simulator and the linear noise approximation 18/19
  • 27. Boys, R. J., Wilkinson, D.J. and T.B.L. Kirkwood (2008). Bayesian inference for a discretely observed stochastic kinetic model. Statistics and Computing 18. Gillespie, C. S. and Golightly, A. and (2010). Bayesian inference for generalized stochastic population growth models with application to aphids. JRSS Series C 59(2). Heron, E. A., Finkenstadt, B. and D. A. Hand (2007). Bayesian inference for dynamic transcriptional regulation; the Hes1 system as a case study. Bioinformatics 23. Salis, H. and Y. Kaznessis (2005). Accurate hybrid simulation of a system of coupled biochemical reactions. Journal of Chemical Physics 122. Wilkinson, D. J. (2006). Stochastic Modelling for Systems Biology. Chapman & Hall/CRC Press. Contact details... email: colin.gillespie@ncl.ac.uk www: http://www.mas.ncl.ac.uk/~ncsg3/ 19/19