SlideShare a Scribd company logo
Background Pre-computation Experimental Results Conclusion
Pre-processing for SMC-ABC
with undirected graphical models
Matt Moores1,2 Chris Drovandi1 Kerrie Mengersen1,2
Antonietta Mira3 Christian Robert4,5
1Mathematical Sciences School, Queensland University of Technology,
Brisbane, Australia
2Institute for Health and Biomedical Innovation, QUT Kelvin Grove
3Institute of Finance, Università della Svizzera italiana, Switzerland
4CEREMADE, Université Paris Dauphine, France
5Department of Statistics, University of Warwick, UK
ABC in Sydney
July 4, 2014
Background Pre-computation Experimental Results Conclusion
Outline
1 Background
Hidden Potts model
2 Pre-computation
3 Experimental Results
Simulation Study
Satellite Remote Sensing
Background Pre-computation Experimental Results Conclusion
Background
Image analysis often involves:
Large datasets, with millions of pixels
Multiple images with similar characteristics
For example: satellite remote sensing (Landsat), computed
tomography (CT)
Table: Scale of common types of images
Number Landsat CT slices
of pixels (90m2/px) (512×512)
26 0.06km2
. . .
56 14.06km2
0.1
106 900.00km2
3.8
156 10251.56km2
43.5
Background Pre-computation Experimental Results Conclusion
Motivation
Computational cost is dominated by simulation of pseudo-data
e.g. Hidden Potts model in image analysis
(Grelaud et al. 2009, Everitt 2012)
Model fitting with ABC can be separated into:
Learning about the summary statistic, given the parameter
f (s(w) | θ) π(θ)
Choosing parameter values, given a summary statistic
π (θ | δ(s(y), s(w))  )
For latent models, an additional step of learning about the
summary statistic, given the data: s(z) | y, θ
Grelaud, Robert, Marin, Rodolphe  Taly (2009) Bayesian Analysis 4(2)
Everitt (2012) JCGS 21(4)
Background Pre-computation Experimental Results Conclusion
hidden Markov random field
Joint distribution of observed pixel intensities yi ∈ y
and latent labels zi ∈ z:
Pr(y, z|µ, σ2
, β) ∝ L(y|µ, σ2
, z)π(z|β) (1)
Additive Gaussian noise:
yi|zi =j
iid
∼ N µj, σ2
j

(2)
Potts model:
π(zi|zi∼`, β) =
exp {β
P
i∼` δ(zi, z`)}
Pk
j=1 exp {β
P
i∼` δ(j, z`)}
(3)
Potts (1952) Proceedings of the Cambridge Philosophical Society 48(1)
Background Pre-computation Experimental Results Conclusion
Inverse Temperature
Background Pre-computation Experimental Results Conclusion
Doubly-intractable likelihood
p(β|z) = C(β)−1
π(β) exp {β S(z)} (4)
The normalising constant of the Potts model has computational
complexity of O(n2kn), since it involves a sum over all possible
combinations of the labels z ∈ Z:
C(β) =
X
z∈Z
exp {β S(z)} (5)
S(z) is the sufficient statistic of the Potts model:
S(z) =
X
i∼`∈L
δ(zi, z`) (6)
where L is the set of all unique neighbour pairs.
Background Pre-computation Experimental Results Conclusion
Expectation of S(z)
exact expectation of S(z) for n=12 and k=
β
E
(
S
(
z
))
5
10
15
1 2 3 4
2
3
4
(a) n = 12  k ∈ 2, 3, 4
exact expectation of S(z) for k=3 and n=
β
E
(
S
(
z
))
5
10
15
1 2 3 4
4
6
9
12
(b) k = 3  n ∈ 4, 6, 9, 12
Figure: Distribution of Ez|β[S(z)]
Background Pre-computation Experimental Results Conclusion
Standard deviation of S(z)
exact standard deviation of S(z) for n=12 and k=
β
σ
(
S
(
z
))
0.0
0.5
1.0
1.5
2.0
2.5
3.0
1 2 3 4
2
3
4
(a) n = 12  k ∈ 2, 3, 4
exact standard deviation of S(z) for k=3 and n=
β
σ
(
S
(
z
))
0.0
0.5
1.0
1.5
2.0
2.5
1 2 3 4
4
6
9
12
(b) k = 3  n ∈ 4, 6, 9, 12
Figure: Distribution of σz|β[S(z)]
Background Pre-computation Experimental Results Conclusion
Pre-computation
The distribution of the summary statistics f(s(w)|θ) is
independent of the observed data y
By simulating pseudo-data for values of θ, we can create a
binding function φ(θ) for an auxiliary model fA(s(w)|φ(θ))
This binding function can be reused across multiple datasets,
amortising its computational cost
By replacing s(w) with pseudo summary statistics drawn from our
auxiliary model, we avoid the need to simulate pseudo-data during
model fitting.
Wood (2010) Nature 466
Cabras, Castellanos  Ruli (2014) Metron (to appear)
Background Pre-computation Experimental Results Conclusion
Simulation from f(·|β)
0.0 0.5 1.0 1.5 2.0 2.5 3.0
10
15
20
25
30
β
E
(
S
(
z
))
(a) Ez|β (S(w))
0.0 0.5 1.0 1.5 2.0 2.5 3.0
0
1
2
3
4
β
σ
(
S
(
z
))
(b) σz|β (S(w))
Figure: Approximation of S(w)|β using 1000 iterations of
Swendsen-Wang (discarding 500 as burn-in)
Swendsen  Wang (1987) Physical Review Letters 58
Background Pre-computation Experimental Results Conclusion
Piecewise linear model
0.0 0.5 1.0 1.5 2.0 2.5 3.0
10000
15000
20000
25000
30000
β
E
S
(
z
)
(a) φ̂µ(β)
0.0 0.5 1.0 1.5 2.0 2.5 3.0
0
50
100
150
200
250
300
350
β
σ
S
(
z
)
(b) φ̂σ(β)
Figure: Binding functions for S(w) | β with n = 56
, k = 3
Background Pre-computation Experimental Results Conclusion
Approximations (A2
BC?)
This method introduces additional layers of approximation error:
finite sampling points for β
(even more of a problem in higher dimensions)
piecewise linear model for the binding functions φ̂µ(β) and
φ̂σ(β)
Gaussian kernel for Ŝ(w)|β ∼ N

φ̂µ(β), φ̂σ(β)2

But it enables us to scale up ABC for real world problems, for
which it was previously infeasible
Background Pre-computation Experimental Results Conclusion
Scalable SMC-ABC for the hidden Potts model
Algorithm 1 SMC-ABC using precomputed fA(s(w)|φ(θ))
1: Draw N particles β0
i ∼ π0(β)
2: Draw N × M statistics Ŝ(wi,m) ∼ N

φ̂µ(β0
i), φ̂σ(β0
i)2

3: repeat
4: Update S(zt)|y, πt(β)
5: Adaptively select ABC tolerance t
6: Update importance weights ωi for each particle
7: if effective sample size (ESS)  Nmin then
8: Resample particles according to their weights
9: end if
10: Update particles using random walk proposal
(with adaptive RWMH bandwidth σ2
t )
11: until
naccept
N  0.015 or t  10−9 or t ≥ 100
Background Pre-computation Experimental Results Conclusion
Additive Gaussian noise
SMC iteration
S
(
z
)
0 20 40 60 80 100
15000
15500
16000
16500
17000
17500
18000
Figure: Change in the value of S(z) according to the current distribution
of πt(β|z) and L(y|µ, σ2
, z)
Background Pre-computation Experimental Results Conclusion
Stopping criterion
Iteration
0 20 40 60 80 100
0
2000
4000
6000
ε
σ
(a) t
Iteration
proportion
accepted
0 20 40 60 80 100
0.0
0.2
0.4
0.6
0.8
1.0
(b) RWMH acceptance
Iteration
ESS
0 20 40 60 80 100
0
2000
4000
6000
8000
10000
(c) ESSt
Background Pre-computation Experimental Results Conclusion
Simulation Study
20 images, n = 125 × 125, k = 3:
β ∼ U(0, 1.005)
z ∼ f(·|β) using 2000 iterations of Swendsen-Wang
µj ∼ N 0, 1002

1
σ2
j
∼ Γ (1, 100)
Comparison of 2 ABC algorithms:
Scalable SMC-ABC using precomputed ˆ
f(β)
Standard SMC-ABC using 500 iterations of Gibbs sampling
Del Moral, Doucet,  Jasra (2012) Stat. Comput. 22(5)
Background Pre-computation Experimental Results Conclusion
Accuracy of posterior estimates for β
0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
β
posterior
distribution
(a) pseudo-data (M=50)
0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
β
posterior
distribution
(b) pre-computed (M=200)
Background Pre-computation Experimental Results Conclusion
Distribution of posterior sampling error for β
algorithm
error
−0.1
0.0
0.1
0.2
0.3
0.4
0.5
Pseudo−data Pre−computed
Background Pre-computation Experimental Results Conclusion
Improvement in runtime
Pseudo−data Pre−computed
0.5
1.0
2.0
5.0
10.0
20.0
50.0
100.0
algorithm
elapsed
time
(hours)
(a) elapsed (wall clock) time
Pseudo−data Pre−computed
5
10
20
50
100
200
500
algorithm
CPU
time
(hours)
(b) CPU time
Background Pre-computation Experimental Results Conclusion
Satellite image of southwest Brisbane
Figure: Normalised difference vegetation index (NDVI)
Background Pre-computation Experimental Results Conclusion
Auxiliary Model
0.0 0.5 1.0 1.5 2.0 2.5 3.0
500000
1000000
1500000
2000000
β
(a) φ̂µ(β)
0.0 0.5 1.0 1.5 2.0 2.5 3.0
0
1000
2000
3000
4000
5000
β
(b) φ̂σ(β)
Figure: Binding functions for n = 978380, k = 6
Background Pre-computation Experimental Results Conclusion
Results
Precomputation of fA(s(w)|φ(θ)) took 13h 23m
for 987 values of β
Model fitting took 1h 42m for 40 SMC iterations
In contrast:
Generating M = 50 pseudo-datasets per particle
takes 89 hours for a single iteration
(∴  20 weeks for 40 iterations)
Background Pre-computation Experimental Results Conclusion
Posterior Density Estimate
β
Density
0
10
20
30
1.24 1.26 1.28 1.30
Background Pre-computation Experimental Results Conclusion
Summary
Scalability of SMC-ABC can be improved by pre-computing an
auxiliary model fA(s(w)|φ(θ))
Pre-computation took 1.4 hours on a 16 core Xeon server
for 987 values of β with 15,625 pixels
(13.4 hours for 978,380 pixels)
Average runtime for SMC-ABC improved from 71.4 hours to
39 minutes with 15,625 pixels
(1.7 hours for 978,380 pixels)
The binding functions represent the nonlinear, heteroskedastic
relationship between the parameter and the summary statistic.
This method could be extended to multivariate applications, such
as estimating both β and k for the hidden Potts model, or
estimating θ for an ERGM.
Appendix
For Further Reading I
Matt Moores, Chris Drovandi, Kerrie Mengersen  Christian Robert
Pre-processing for approximate Bayesian computation in image analysis.
arXiv:1403.4359 [stat.CO], 2014.
Simon Wood
Statistical inference for noisy nonlinear ecological dynamic systems.
Nature, 466: 1102–04, 2010.
Stefano Cabras, Marı́a Eugenia Castellanos  Erlis Ruli
A Quasi likelihood approximation of posterior distributions for
likelihood-intractable complex models.
To appear in Metron, 2014.
Christopher Drovandi, Anthony Pettitt  Malcolm Faddy
Approximate Bayesian computation using indirect inference.
J. R. Stat. Soc. Ser. C 60(3): 317–37, 2011.
Richard Everitt
Bayesian Parameter Estimation for Latent Markov Random Fields and
Social Networks.
J. Comput. Graph. Stat., 21(4): 940–60, 2012.
Appendix
For Further Reading II
Christopher Drovandi, Anthony Pettitt  Anthony Lee
Bayesian indirect inference using a parametric auxiliary model.
http://eprints.qut.edu.au/63767/3/63767.pdf
Pierre Del Moral, Arnaud Doucet  Ajay Jasra
An adaptive sequential Monte Carlo method for approximate Bayesian
computation.
Statistics  Computing, 22(5): 1009–20, 2012.
A. Grelaud, C. P. Robert, J.-M. Marin, F. Rodolphe  J.-F. Taly
ABC likelihood-free methods for model choice in Gibbs random fields.
Bayesian Analysis, 4(2): 317–36, 2009.
Renfrey B. Potts
Some generalized order-disorder transformations.
Proc. Cambridge Philosophical Society, 48(1): 106–9, 1952.
R. H. Swendsen  J.-S. Wang
Nonuniversal critical dynamics in Monte Carlo simulations.
Physical Review Letters, 58: 86–8, 1987.

More Related Content

What's hot

Monte Carlo in Montréal 2017
Monte Carlo in Montréal 2017Monte Carlo in Montréal 2017
Monte Carlo in Montréal 2017
Christian Robert
 
Pre-computation for ABC in image analysis
Pre-computation for ABC in image analysisPre-computation for ABC in image analysis
Pre-computation for ABC in image analysis
Matt Moores
 
NCE, GANs & VAEs (and maybe BAC)
NCE, GANs & VAEs (and maybe BAC)NCE, GANs & VAEs (and maybe BAC)
NCE, GANs & VAEs (and maybe BAC)
Christian Robert
 
Patch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective DivergencesPatch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective Divergences
Frank Nielsen
 
accurate ABC Oliver Ratmann
accurate ABC Oliver Ratmannaccurate ABC Oliver Ratmann
accurate ABC Oliver Ratmannolli0601
 
comments on exponential ergodicity of the bouncy particle sampler
comments on exponential ergodicity of the bouncy particle samplercomments on exponential ergodicity of the bouncy particle sampler
comments on exponential ergodicity of the bouncy particle sampler
Christian Robert
 
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Frank Nielsen
 
On learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihoodOn learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihood
Frank Nielsen
 
Optimal L-shaped matrix reordering, aka graph's core-periphery
Optimal L-shaped matrix reordering, aka graph's core-peripheryOptimal L-shaped matrix reordering, aka graph's core-periphery
Optimal L-shaped matrix reordering, aka graph's core-periphery
Francesco Tudisco
 
Core–periphery detection in networks with nonlinear Perron eigenvectors
Core–periphery detection in networks with nonlinear Perron eigenvectorsCore–periphery detection in networks with nonlinear Perron eigenvectors
Core–periphery detection in networks with nonlinear Perron eigenvectors
Francesco Tudisco
 
A series of maximum entropy upper bounds of the differential entropy
A series of maximum entropy upper bounds of the differential entropyA series of maximum entropy upper bounds of the differential entropy
A series of maximum entropy upper bounds of the differential entropy
Frank Nielsen
 
Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...
Valentin De Bortoli
 
Small updates of matrix functions used for network centrality
Small updates of matrix functions used for network centralitySmall updates of matrix functions used for network centrality
Small updates of matrix functions used for network centrality
Francesco Tudisco
 
Bayesian model choice in cosmology
Bayesian model choice in cosmologyBayesian model choice in cosmology
Bayesian model choice in cosmology
Christian Robert
 
Divergence clustering
Divergence clusteringDivergence clustering
Divergence clustering
Frank Nielsen
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
The Statistical and Applied Mathematical Sciences Institute
 
Optimal interval clustering: Application to Bregman clustering and statistica...
Optimal interval clustering: Application to Bregman clustering and statistica...Optimal interval clustering: Application to Bregman clustering and statistica...
Optimal interval clustering: Application to Bregman clustering and statistica...
Frank Nielsen
 
Bregman divergences from comparative convexity
Bregman divergences from comparative convexityBregman divergences from comparative convexity
Bregman divergences from comparative convexity
Frank Nielsen
 
A nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formulaA nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formula
Alexander Litvinenko
 
Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Slides: The dual Voronoi diagrams with respect to representational Bregman di...Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Frank Nielsen
 

What's hot (20)

Monte Carlo in Montréal 2017
Monte Carlo in Montréal 2017Monte Carlo in Montréal 2017
Monte Carlo in Montréal 2017
 
Pre-computation for ABC in image analysis
Pre-computation for ABC in image analysisPre-computation for ABC in image analysis
Pre-computation for ABC in image analysis
 
NCE, GANs & VAEs (and maybe BAC)
NCE, GANs & VAEs (and maybe BAC)NCE, GANs & VAEs (and maybe BAC)
NCE, GANs & VAEs (and maybe BAC)
 
Patch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective DivergencesPatch Matching with Polynomial Exponential Families and Projective Divergences
Patch Matching with Polynomial Exponential Families and Projective Divergences
 
accurate ABC Oliver Ratmann
accurate ABC Oliver Ratmannaccurate ABC Oliver Ratmann
accurate ABC Oliver Ratmann
 
comments on exponential ergodicity of the bouncy particle sampler
comments on exponential ergodicity of the bouncy particle samplercomments on exponential ergodicity of the bouncy particle sampler
comments on exponential ergodicity of the bouncy particle sampler
 
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
 
On learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihoodOn learning statistical mixtures maximizing the complete likelihood
On learning statistical mixtures maximizing the complete likelihood
 
Optimal L-shaped matrix reordering, aka graph's core-periphery
Optimal L-shaped matrix reordering, aka graph's core-peripheryOptimal L-shaped matrix reordering, aka graph's core-periphery
Optimal L-shaped matrix reordering, aka graph's core-periphery
 
Core–periphery detection in networks with nonlinear Perron eigenvectors
Core–periphery detection in networks with nonlinear Perron eigenvectorsCore–periphery detection in networks with nonlinear Perron eigenvectors
Core–periphery detection in networks with nonlinear Perron eigenvectors
 
A series of maximum entropy upper bounds of the differential entropy
A series of maximum entropy upper bounds of the differential entropyA series of maximum entropy upper bounds of the differential entropy
A series of maximum entropy upper bounds of the differential entropy
 
Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...
 
Small updates of matrix functions used for network centrality
Small updates of matrix functions used for network centralitySmall updates of matrix functions used for network centrality
Small updates of matrix functions used for network centrality
 
Bayesian model choice in cosmology
Bayesian model choice in cosmologyBayesian model choice in cosmology
Bayesian model choice in cosmology
 
Divergence clustering
Divergence clusteringDivergence clustering
Divergence clustering
 
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
 
Optimal interval clustering: Application to Bregman clustering and statistica...
Optimal interval clustering: Application to Bregman clustering and statistica...Optimal interval clustering: Application to Bregman clustering and statistica...
Optimal interval clustering: Application to Bregman clustering and statistica...
 
Bregman divergences from comparative convexity
Bregman divergences from comparative convexityBregman divergences from comparative convexity
Bregman divergences from comparative convexity
 
A nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formulaA nonlinear approximation of the Bayesian Update formula
A nonlinear approximation of the Bayesian Update formula
 
Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Slides: The dual Voronoi diagrams with respect to representational Bregman di...Slides: The dual Voronoi diagrams with respect to representational Bregman di...
Slides: The dual Voronoi diagrams with respect to representational Bregman di...
 

Similar to Precomputation for SMC-ABC with undirected graphical models

Final PhD Seminar
Final PhD SeminarFinal PhD Seminar
Final PhD SeminarMatt Moores
 
Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...
Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...
Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...
Alexander Litvinenko
 
R package bayesImageS: Scalable Inference for Intractable Likelihoods
R package bayesImageS: Scalable Inference for Intractable LikelihoodsR package bayesImageS: Scalable Inference for Intractable Likelihoods
R package bayesImageS: Scalable Inference for Intractable Likelihoods
Matt Moores
 
Learning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsLearning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for Graphs
pione30
 
Propagation of Uncertainties in Density Driven Groundwater Flow
Propagation of Uncertainties in Density Driven Groundwater FlowPropagation of Uncertainties in Density Driven Groundwater Flow
Propagation of Uncertainties in Density Driven Groundwater Flow
Alexander Litvinenko
 
Workshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael MartinWorkshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael Martin
Christian Robert
 
cheb_conf_aksenov.pdf
cheb_conf_aksenov.pdfcheb_conf_aksenov.pdf
cheb_conf_aksenov.pdf
Alexey Vasyukov
 
ABC-Gibbs
ABC-GibbsABC-Gibbs
ABC-Gibbs
Christian Robert
 
fb69b412-97cb-4e8d-8a28-574c09557d35-160618025920
fb69b412-97cb-4e8d-8a28-574c09557d35-160618025920fb69b412-97cb-4e8d-8a28-574c09557d35-160618025920
fb69b412-97cb-4e8d-8a28-574c09557d35-160618025920Karl Rudeen
 
SIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithmsSIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithms
Jagadeeswaran Rathinavel
 
bayesImageS: Bayesian computation for medical Image Segmentation using a hidd...
bayesImageS: Bayesian computation for medical Image Segmentation using a hidd...bayesImageS: Bayesian computation for medical Image Segmentation using a hidd...
bayesImageS: Bayesian computation for medical Image Segmentation using a hidd...
Matt Moores
 
Adaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte CarloAdaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte Carlo
Christian Robert
 
Uncertainty quantification of groundwater contamination
Uncertainty quantification of groundwater contaminationUncertainty quantification of groundwater contamination
Uncertainty quantification of groundwater contamination
Alexander Litvinenko
 
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CHENNAKESAVA KADAPA
 
ABC workshop: 17w5025
ABC workshop: 17w5025ABC workshop: 17w5025
ABC workshop: 17w5025
Christian Robert
 
Hyperon and charm baryons masses from twisted mass Lattice QCD
Hyperon and charm baryons masses from twisted mass Lattice QCDHyperon and charm baryons masses from twisted mass Lattice QCD
Hyperon and charm baryons masses from twisted mass Lattice QCD
Christos Kallidonis
 
Absorbing Random Walk Centrality
Absorbing Random Walk CentralityAbsorbing Random Walk Centrality
Absorbing Random Walk Centrality
Michael Mathioudakis
 
Talk Alexander Litvinenko on SIAM GS Conference in Houston
Talk Alexander Litvinenko on SIAM GS Conference in HoustonTalk Alexander Litvinenko on SIAM GS Conference in Houston
Talk Alexander Litvinenko on SIAM GS Conference in Houston
Alexander Litvinenko
 
Input analysis
Input analysisInput analysis
Input analysis
Bhavik A Shah
 

Similar to Precomputation for SMC-ABC with undirected graphical models (20)

Final PhD Seminar
Final PhD SeminarFinal PhD Seminar
Final PhD Seminar
 
Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...
Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...
Efficient Simulations for Contamination of Groundwater Aquifers under Uncerta...
 
R package bayesImageS: Scalable Inference for Intractable Likelihoods
R package bayesImageS: Scalable Inference for Intractable LikelihoodsR package bayesImageS: Scalable Inference for Intractable Likelihoods
R package bayesImageS: Scalable Inference for Intractable Likelihoods
 
Learning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for GraphsLearning Convolutional Neural Networks for Graphs
Learning Convolutional Neural Networks for Graphs
 
Propagation of Uncertainties in Density Driven Groundwater Flow
Propagation of Uncertainties in Density Driven Groundwater FlowPropagation of Uncertainties in Density Driven Groundwater Flow
Propagation of Uncertainties in Density Driven Groundwater Flow
 
Workshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael MartinWorkshop in honour of Don Poskitt and Gael Martin
Workshop in honour of Don Poskitt and Gael Martin
 
cheb_conf_aksenov.pdf
cheb_conf_aksenov.pdfcheb_conf_aksenov.pdf
cheb_conf_aksenov.pdf
 
ABC-Gibbs
ABC-GibbsABC-Gibbs
ABC-Gibbs
 
fb69b412-97cb-4e8d-8a28-574c09557d35-160618025920
fb69b412-97cb-4e8d-8a28-574c09557d35-160618025920fb69b412-97cb-4e8d-8a28-574c09557d35-160618025920
fb69b412-97cb-4e8d-8a28-574c09557d35-160618025920
 
Project Paper
Project PaperProject Paper
Project Paper
 
SIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithmsSIAM - Minisymposium on Guaranteed numerical algorithms
SIAM - Minisymposium on Guaranteed numerical algorithms
 
bayesImageS: Bayesian computation for medical Image Segmentation using a hidd...
bayesImageS: Bayesian computation for medical Image Segmentation using a hidd...bayesImageS: Bayesian computation for medical Image Segmentation using a hidd...
bayesImageS: Bayesian computation for medical Image Segmentation using a hidd...
 
Adaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte CarloAdaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte Carlo
 
Uncertainty quantification of groundwater contamination
Uncertainty quantification of groundwater contaminationUncertainty quantification of groundwater contamination
Uncertainty quantification of groundwater contamination
 
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
 
ABC workshop: 17w5025
ABC workshop: 17w5025ABC workshop: 17w5025
ABC workshop: 17w5025
 
Hyperon and charm baryons masses from twisted mass Lattice QCD
Hyperon and charm baryons masses from twisted mass Lattice QCDHyperon and charm baryons masses from twisted mass Lattice QCD
Hyperon and charm baryons masses from twisted mass Lattice QCD
 
Absorbing Random Walk Centrality
Absorbing Random Walk CentralityAbsorbing Random Walk Centrality
Absorbing Random Walk Centrality
 
Talk Alexander Litvinenko on SIAM GS Conference in Houston
Talk Alexander Litvinenko on SIAM GS Conference in HoustonTalk Alexander Litvinenko on SIAM GS Conference in Houston
Talk Alexander Litvinenko on SIAM GS Conference in Houston
 
Input analysis
Input analysisInput analysis
Input analysis
 

More from Matt Moores

Bayesian Inference and Uncertainty Quantification for Inverse Problems
Bayesian Inference and Uncertainty Quantification for Inverse ProblemsBayesian Inference and Uncertainty Quantification for Inverse Problems
Bayesian Inference and Uncertainty Quantification for Inverse Problems
Matt Moores
 
bayesImageS: an R package for Bayesian image analysis
bayesImageS: an R package for Bayesian image analysisbayesImageS: an R package for Bayesian image analysis
bayesImageS: an R package for Bayesian image analysis
Matt Moores
 
Exploratory Analysis of Multivariate Data
Exploratory Analysis of Multivariate DataExploratory Analysis of Multivariate Data
Exploratory Analysis of Multivariate Data
Matt Moores
 
Importing satellite imagery into R from NASA and the U.S. Geological Survey
Importing satellite imagery into R from NASA and the U.S. Geological SurveyImporting satellite imagery into R from NASA and the U.S. Geological Survey
Importing satellite imagery into R from NASA and the U.S. Geological Survey
Matt Moores
 
Accelerating Pseudo-Marginal MCMC using Gaussian Processes
Accelerating Pseudo-Marginal MCMC using Gaussian ProcessesAccelerating Pseudo-Marginal MCMC using Gaussian Processes
Accelerating Pseudo-Marginal MCMC using Gaussian Processes
Matt Moores
 
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
Matt Moores
 
Variational Bayes
Variational BayesVariational Bayes
Variational Bayes
Matt Moores
 
Informative Priors for Segmentation of Medical Images
Informative Priors for Segmentation of Medical ImagesInformative Priors for Segmentation of Medical Images
Informative Priors for Segmentation of Medical Images
Matt Moores
 

More from Matt Moores (9)

Bayesian Inference and Uncertainty Quantification for Inverse Problems
Bayesian Inference and Uncertainty Quantification for Inverse ProblemsBayesian Inference and Uncertainty Quantification for Inverse Problems
Bayesian Inference and Uncertainty Quantification for Inverse Problems
 
bayesImageS: an R package for Bayesian image analysis
bayesImageS: an R package for Bayesian image analysisbayesImageS: an R package for Bayesian image analysis
bayesImageS: an R package for Bayesian image analysis
 
Exploratory Analysis of Multivariate Data
Exploratory Analysis of Multivariate DataExploratory Analysis of Multivariate Data
Exploratory Analysis of Multivariate Data
 
Importing satellite imagery into R from NASA and the U.S. Geological Survey
Importing satellite imagery into R from NASA and the U.S. Geological SurveyImporting satellite imagery into R from NASA and the U.S. Geological Survey
Importing satellite imagery into R from NASA and the U.S. Geological Survey
 
Accelerating Pseudo-Marginal MCMC using Gaussian Processes
Accelerating Pseudo-Marginal MCMC using Gaussian ProcessesAccelerating Pseudo-Marginal MCMC using Gaussian Processes
Accelerating Pseudo-Marginal MCMC using Gaussian Processes
 
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
R package 'bayesImageS': a case study in Bayesian computation using Rcpp and ...
 
Variational Bayes
Variational BayesVariational Bayes
Variational Bayes
 
Parallel R
Parallel RParallel R
Parallel R
 
Informative Priors for Segmentation of Medical Images
Informative Priors for Segmentation of Medical ImagesInformative Priors for Segmentation of Medical Images
Informative Priors for Segmentation of Medical Images
 

Recently uploaded

Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
vcaxypu
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
alex933524
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
theahmadsaood
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
StarCompliance.io
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
AlejandraGmez176757
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
DOT TECH
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
correoyaya
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 

Recently uploaded (20)

Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
tapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive datatapal brand analysis PPT slide for comptetive data
tapal brand analysis PPT slide for comptetive data
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Investigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_CrimesInvestigate & Recover / StarCompliance.io / Crypto_Crimes
Investigate & Recover / StarCompliance.io / Crypto_Crimes
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 

Precomputation for SMC-ABC with undirected graphical models

  • 1. Background Pre-computation Experimental Results Conclusion Pre-processing for SMC-ABC with undirected graphical models Matt Moores1,2 Chris Drovandi1 Kerrie Mengersen1,2 Antonietta Mira3 Christian Robert4,5 1Mathematical Sciences School, Queensland University of Technology, Brisbane, Australia 2Institute for Health and Biomedical Innovation, QUT Kelvin Grove 3Institute of Finance, Università della Svizzera italiana, Switzerland 4CEREMADE, Université Paris Dauphine, France 5Department of Statistics, University of Warwick, UK ABC in Sydney July 4, 2014
  • 2. Background Pre-computation Experimental Results Conclusion Outline 1 Background Hidden Potts model 2 Pre-computation 3 Experimental Results Simulation Study Satellite Remote Sensing
  • 3. Background Pre-computation Experimental Results Conclusion Background Image analysis often involves: Large datasets, with millions of pixels Multiple images with similar characteristics For example: satellite remote sensing (Landsat), computed tomography (CT) Table: Scale of common types of images Number Landsat CT slices of pixels (90m2/px) (512×512) 26 0.06km2 . . . 56 14.06km2 0.1 106 900.00km2 3.8 156 10251.56km2 43.5
  • 4. Background Pre-computation Experimental Results Conclusion Motivation Computational cost is dominated by simulation of pseudo-data e.g. Hidden Potts model in image analysis (Grelaud et al. 2009, Everitt 2012) Model fitting with ABC can be separated into: Learning about the summary statistic, given the parameter f (s(w) | θ) π(θ) Choosing parameter values, given a summary statistic π (θ | δ(s(y), s(w)) ) For latent models, an additional step of learning about the summary statistic, given the data: s(z) | y, θ Grelaud, Robert, Marin, Rodolphe Taly (2009) Bayesian Analysis 4(2) Everitt (2012) JCGS 21(4)
  • 5. Background Pre-computation Experimental Results Conclusion hidden Markov random field Joint distribution of observed pixel intensities yi ∈ y and latent labels zi ∈ z: Pr(y, z|µ, σ2 , β) ∝ L(y|µ, σ2 , z)π(z|β) (1) Additive Gaussian noise: yi|zi =j iid ∼ N µj, σ2 j (2) Potts model: π(zi|zi∼`, β) = exp {β P i∼` δ(zi, z`)} Pk j=1 exp {β P i∼` δ(j, z`)} (3) Potts (1952) Proceedings of the Cambridge Philosophical Society 48(1)
  • 6. Background Pre-computation Experimental Results Conclusion Inverse Temperature
  • 7. Background Pre-computation Experimental Results Conclusion Doubly-intractable likelihood p(β|z) = C(β)−1 π(β) exp {β S(z)} (4) The normalising constant of the Potts model has computational complexity of O(n2kn), since it involves a sum over all possible combinations of the labels z ∈ Z: C(β) = X z∈Z exp {β S(z)} (5) S(z) is the sufficient statistic of the Potts model: S(z) = X i∼`∈L δ(zi, z`) (6) where L is the set of all unique neighbour pairs.
  • 8. Background Pre-computation Experimental Results Conclusion Expectation of S(z) exact expectation of S(z) for n=12 and k= β E ( S ( z )) 5 10 15 1 2 3 4 2 3 4 (a) n = 12 k ∈ 2, 3, 4 exact expectation of S(z) for k=3 and n= β E ( S ( z )) 5 10 15 1 2 3 4 4 6 9 12 (b) k = 3 n ∈ 4, 6, 9, 12 Figure: Distribution of Ez|β[S(z)]
  • 9. Background Pre-computation Experimental Results Conclusion Standard deviation of S(z) exact standard deviation of S(z) for n=12 and k= β σ ( S ( z )) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 1 2 3 4 2 3 4 (a) n = 12 k ∈ 2, 3, 4 exact standard deviation of S(z) for k=3 and n= β σ ( S ( z )) 0.0 0.5 1.0 1.5 2.0 2.5 1 2 3 4 4 6 9 12 (b) k = 3 n ∈ 4, 6, 9, 12 Figure: Distribution of σz|β[S(z)]
  • 10. Background Pre-computation Experimental Results Conclusion Pre-computation The distribution of the summary statistics f(s(w)|θ) is independent of the observed data y By simulating pseudo-data for values of θ, we can create a binding function φ(θ) for an auxiliary model fA(s(w)|φ(θ)) This binding function can be reused across multiple datasets, amortising its computational cost By replacing s(w) with pseudo summary statistics drawn from our auxiliary model, we avoid the need to simulate pseudo-data during model fitting. Wood (2010) Nature 466 Cabras, Castellanos Ruli (2014) Metron (to appear)
  • 11. Background Pre-computation Experimental Results Conclusion Simulation from f(·|β) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 10 15 20 25 30 β E ( S ( z )) (a) Ez|β (S(w)) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 β σ ( S ( z )) (b) σz|β (S(w)) Figure: Approximation of S(w)|β using 1000 iterations of Swendsen-Wang (discarding 500 as burn-in) Swendsen Wang (1987) Physical Review Letters 58
  • 12. Background Pre-computation Experimental Results Conclusion Piecewise linear model 0.0 0.5 1.0 1.5 2.0 2.5 3.0 10000 15000 20000 25000 30000 β E S ( z ) (a) φ̂µ(β) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 50 100 150 200 250 300 350 β σ S ( z ) (b) φ̂σ(β) Figure: Binding functions for S(w) | β with n = 56 , k = 3
  • 13. Background Pre-computation Experimental Results Conclusion Approximations (A2 BC?) This method introduces additional layers of approximation error: finite sampling points for β (even more of a problem in higher dimensions) piecewise linear model for the binding functions φ̂µ(β) and φ̂σ(β) Gaussian kernel for Ŝ(w)|β ∼ N φ̂µ(β), φ̂σ(β)2 But it enables us to scale up ABC for real world problems, for which it was previously infeasible
  • 14. Background Pre-computation Experimental Results Conclusion Scalable SMC-ABC for the hidden Potts model Algorithm 1 SMC-ABC using precomputed fA(s(w)|φ(θ)) 1: Draw N particles β0 i ∼ π0(β) 2: Draw N × M statistics Ŝ(wi,m) ∼ N φ̂µ(β0 i), φ̂σ(β0 i)2 3: repeat 4: Update S(zt)|y, πt(β) 5: Adaptively select ABC tolerance t 6: Update importance weights ωi for each particle 7: if effective sample size (ESS) Nmin then 8: Resample particles according to their weights 9: end if 10: Update particles using random walk proposal (with adaptive RWMH bandwidth σ2 t ) 11: until naccept N 0.015 or t 10−9 or t ≥ 100
  • 15. Background Pre-computation Experimental Results Conclusion Additive Gaussian noise SMC iteration S ( z ) 0 20 40 60 80 100 15000 15500 16000 16500 17000 17500 18000 Figure: Change in the value of S(z) according to the current distribution of πt(β|z) and L(y|µ, σ2 , z)
  • 16. Background Pre-computation Experimental Results Conclusion Stopping criterion Iteration 0 20 40 60 80 100 0 2000 4000 6000 ε σ (a) t Iteration proportion accepted 0 20 40 60 80 100 0.0 0.2 0.4 0.6 0.8 1.0 (b) RWMH acceptance Iteration ESS 0 20 40 60 80 100 0 2000 4000 6000 8000 10000 (c) ESSt
  • 17. Background Pre-computation Experimental Results Conclusion Simulation Study 20 images, n = 125 × 125, k = 3: β ∼ U(0, 1.005) z ∼ f(·|β) using 2000 iterations of Swendsen-Wang µj ∼ N 0, 1002 1 σ2 j ∼ Γ (1, 100) Comparison of 2 ABC algorithms: Scalable SMC-ABC using precomputed ˆ f(β) Standard SMC-ABC using 500 iterations of Gibbs sampling Del Moral, Doucet, Jasra (2012) Stat. Comput. 22(5)
  • 18. Background Pre-computation Experimental Results Conclusion Accuracy of posterior estimates for β 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 β posterior distribution (a) pseudo-data (M=50) 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 β posterior distribution (b) pre-computed (M=200)
  • 19. Background Pre-computation Experimental Results Conclusion Distribution of posterior sampling error for β algorithm error −0.1 0.0 0.1 0.2 0.3 0.4 0.5 Pseudo−data Pre−computed
  • 20. Background Pre-computation Experimental Results Conclusion Improvement in runtime Pseudo−data Pre−computed 0.5 1.0 2.0 5.0 10.0 20.0 50.0 100.0 algorithm elapsed time (hours) (a) elapsed (wall clock) time Pseudo−data Pre−computed 5 10 20 50 100 200 500 algorithm CPU time (hours) (b) CPU time
  • 21. Background Pre-computation Experimental Results Conclusion Satellite image of southwest Brisbane Figure: Normalised difference vegetation index (NDVI)
  • 22. Background Pre-computation Experimental Results Conclusion Auxiliary Model 0.0 0.5 1.0 1.5 2.0 2.5 3.0 500000 1000000 1500000 2000000 β (a) φ̂µ(β) 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1000 2000 3000 4000 5000 β (b) φ̂σ(β) Figure: Binding functions for n = 978380, k = 6
  • 23. Background Pre-computation Experimental Results Conclusion Results Precomputation of fA(s(w)|φ(θ)) took 13h 23m for 987 values of β Model fitting took 1h 42m for 40 SMC iterations In contrast: Generating M = 50 pseudo-datasets per particle takes 89 hours for a single iteration (∴ 20 weeks for 40 iterations)
  • 24. Background Pre-computation Experimental Results Conclusion Posterior Density Estimate β Density 0 10 20 30 1.24 1.26 1.28 1.30
  • 25. Background Pre-computation Experimental Results Conclusion Summary Scalability of SMC-ABC can be improved by pre-computing an auxiliary model fA(s(w)|φ(θ)) Pre-computation took 1.4 hours on a 16 core Xeon server for 987 values of β with 15,625 pixels (13.4 hours for 978,380 pixels) Average runtime for SMC-ABC improved from 71.4 hours to 39 minutes with 15,625 pixels (1.7 hours for 978,380 pixels) The binding functions represent the nonlinear, heteroskedastic relationship between the parameter and the summary statistic. This method could be extended to multivariate applications, such as estimating both β and k for the hidden Potts model, or estimating θ for an ERGM.
  • 26. Appendix For Further Reading I Matt Moores, Chris Drovandi, Kerrie Mengersen Christian Robert Pre-processing for approximate Bayesian computation in image analysis. arXiv:1403.4359 [stat.CO], 2014. Simon Wood Statistical inference for noisy nonlinear ecological dynamic systems. Nature, 466: 1102–04, 2010. Stefano Cabras, Marı́a Eugenia Castellanos Erlis Ruli A Quasi likelihood approximation of posterior distributions for likelihood-intractable complex models. To appear in Metron, 2014. Christopher Drovandi, Anthony Pettitt Malcolm Faddy Approximate Bayesian computation using indirect inference. J. R. Stat. Soc. Ser. C 60(3): 317–37, 2011. Richard Everitt Bayesian Parameter Estimation for Latent Markov Random Fields and Social Networks. J. Comput. Graph. Stat., 21(4): 940–60, 2012.
  • 27. Appendix For Further Reading II Christopher Drovandi, Anthony Pettitt Anthony Lee Bayesian indirect inference using a parametric auxiliary model. http://eprints.qut.edu.au/63767/3/63767.pdf Pierre Del Moral, Arnaud Doucet Ajay Jasra An adaptive sequential Monte Carlo method for approximate Bayesian computation. Statistics Computing, 22(5): 1009–20, 2012. A. Grelaud, C. P. Robert, J.-M. Marin, F. Rodolphe J.-F. Taly ABC likelihood-free methods for model choice in Gibbs random fields. Bayesian Analysis, 4(2): 317–36, 2009. Renfrey B. Potts Some generalized order-disorder transformations. Proc. Cambridge Philosophical Society, 48(1): 106–9, 1952. R. H. Swendsen J.-S. Wang Nonuniversal critical dynamics in Monte Carlo simulations. Physical Review Letters, 58: 86–8, 1987.