SlideShare a Scribd company logo
Spatial Statistics for Epidemiology
—Spatial Point Processes
By Liu Xu U086105E
Supervisor: Prof Loh Wei Liem
Department of Statistics and Applied Probability
National University of Singapore
15 March 2012
1
Outline
Application in epidemiology
Theory: descriptive statistics
Models: spatial point processes
Data: spatial point patterns
2
A spatial point pattern is …
Space → Rd, d ≥ 2
Points → data values
Pattern → arrangement
Intro Models Theory Application 3
Example: tropical rainforest
Intro Models Theory Application 4
Example: cancer cases
Intro Models Theory Application
5
marks: extra information
attached to points,
categorical/ continuous
Example : the milky way galaxy
Intro Models Theory Application 6
Types of point patterns
Intro Models Theory Application
7
regularityCSRclustering repulsionattraction
  
Aim: describe and model “pattern”
Are points randomly located?
• If so, find a statistical model to
describe the “randomness”;
• If not, …
Intro Models Theory Application
8
Models: spatial point processes
• A spatial point process is a stochastic process X which
generates a countable set of events in defined space.
• A spatial pattern x = {x1, x2, …, xn} on an observational region
W generated from a spatial point process is a realization of
the process.
• Only consider point processes in 2-D space.
• The locations of any object can be modelled−plants, animals,
cells, stars, disease cases, earthquakes, …
Intro Models Theory Application
9
Models: spatial point processes
Intro Models Theory Application
10
• Notation:
W: study region in R2
N(A): number of events inside subregion A, A W.
|A|: area of region A
s: random locations in W
ds: infinitesimal region centered at s
• Assumptions on spatial point processes:
i. Locally finite: the number of events in any bounded region is bounded
ii. At any point location s, there is either one event or no events at all

HPP
A spatial point process in a bounded region W in R2 is a
homogeneous Poisson process (HPP) if:
i. For all subregion A in W, N(A) ~ Poi(λ|A|), where 0 < λ < ∞
is a constant, called intensity (homogeneity).
ii. If A1 and A2 are two disjoint subregions in W, then N(A1) and
N(A2) are independent (independence).
• Standard model for complete spatial randomness (CSR);
• Can be generalized to more complicated models;
• A reference process when analyzing spatial characteristics of
a specific pattern.
Intro Models Theory Application
11
IPP
A spatial point process in a bounded region W in R2 is a
inhomogeneous Poisson process (IPP) if:
i. For any subregion A in W, N(A) ~ Poi(∫Aλ(s)ds), where 0 <
λ(s) < ∞ is the intensity at s .
ii. If A1 and A2 are two disjoint subregions in W, then N(A1) and
N(A2) are independent (independence).
Intro Models Theory Application
12
HPPIPP
tiongeneraliza
casespecial
 
 
Simulation from Poisson processes
Intro Models Theory Application
13
Two Poisson process realizations on the unit square having the same
expected number of events = 100.
Summary statistics: first-order
First-order intensity of a spatial point process is:
• Interpretation: expected number of events per unit area. For
small region ds, λ(s)|ds| describes the probability for an event
in ds.
• Intensity may be constant (homogeneous) or may vary from
location to location (inhomogeneous). If the process is
homogeneous, estimate intensity by
Intro Models Theory Application 14
s
s
s
s d
dNE
d
))((
lim)(
0

W
WN )(ˆ 
Estimate λ(s) in inhomogeneous case
• Estimating the intensity of a spatial point pattern is similar to estimating a
bivariate probability density
• How to estimate bivariate density?
Given an i.i.d. sample (y1, . . . , yn) of a bivariate random variable Y, an estimate of
the density f (·) of Y at y is
where K(·) is the kernel and h is the bandwidth.
• The expression for kernel smoothing of the intensity function of a point
pattern x = {x1, …, xn} at location s is
the bandwidth h is chosen based on some cross-validation criterion.
Intro Models Theory Application
15



n
i
)
h
K(
nh
)(f
1
2
1ˆ i
yy
y



n
i
)
h
K(
h
)(λ
1
2
1ˆ sx
s i
Kernel smoothed intensity of IPP
Intro Models Theory Application
16
Kernel estimated intensity for the point pattern simulated from HPP
with λ(s) = 400xy on [0, 1] * [0, 1].
Summary statistics: second-order
The second-order properties of a point process involve
relationship between number of events at different locations.
• The second-order intensity of a spatial point process is
• A point process is called stationary if
• A stationary point process is isotropic if
Intro Models Theory Application 17










ji
ji
ss
ji
ss
ss
ss
ji dd
dNdNE
dd
)]()([
lim),(
0,
2





)(),(
,)(
22 jiji ssss
ss


)(),( 22 jiji ssss  
If a point process is stationary and isotropic, the K-function of
the process is defined by:
λK(r) = E[number of further events within distance r from an
arbitrary event]
Two properties of K-function:
• For a HPP, λK(r) = λπr2 , thus Kp(r) = πr2
• K(r) is invariant to random thinning.
Intro Models Theory Application
18
K-function
Def. random thinning: each event of a point process X is either
retained or deleted with retention probability p, independently of
other events. The resulting point process X’ contains a subset of
events of the original process X.
Comparing estimated K-functions of simulated point patterns
Intro Models Theory Application 19
CSR: K(r) = πr2
clustered: K(r) > πr2
regular: K(r) < πr2
Estimation of K(r):Ê(# further events…)/λ̂
Intro Models Theory Application
20
negatively biased edge correction
Application in epidemiology
John Snow (15 March 1813 – 16 June
1858) is considered to be one of the
fathers of epidemiology, because of his
work in tracing the source of a cholera
outbreak in Soho, England, in 1854
Intro Models Theory Application
21
Case-control study
Goal: compare the spatial distribution
of disease cases with the underlying
population
• Null hypothesis :
equal spatial distribution
• Controls:
selected to represent population
heterogeneity
Intro Models Theory Application
22
Incidence
of disease
Population
density
Overall
risk of
disease
Other risk
factors, e.g.
distance from
point source
Do disease cases occur randomly
among population?
Case-control data consist of two point patterns:
• the locations of n1 cases of particular disease {x1, x2, …, xn1}
• the locations n0 controls {xn1+1, …, xn1+n0}
in a study region W over a defined period of time. Total
number of data points n = n1 + n0.
Assumption:
• Cases from an IPP with intensity λ1(s)
• Controls from another independent IPP with intensity λ0(s)
Intro Models Theory Application
23
Spatial risk
relative risk:
estimated relative risk:
H0:
test statistic:
estimated test statistic:
significance: Monte Carlo test
Intro Models Theory Application
24
)(
)(
)(
0
1
s
s
s


 
0
1
0)(
n
n
  s


n
i
T
1
2
0 ])([  ix


n
i
T
1
2
0 ])(ˆ[ˆ  ix
)(ˆ
)(ˆ
)(ˆ
0
1
s
s
s


 
Spatial clustering
K0(r)→ amount of clustering due to population
K1(r)→ amount of clustering due to population plus effect
of other possible risk factors
D(r) = K1(r) - K0(r) → the amount of clustering that is not
due to population
estimate:
H0:
Test statistic:
significance: Monte Carlo test
Intro Models Theory Application
25


m
k k
k
rD
rD
D
1 )](var[
)(
)(ˆ)(ˆ)(ˆ
01 rKrKrD 
0D(r)=
Monte Carlo test
1). simulation with random labelling at jth iteration, j=1, 2, …, 99
• randomly select n1 points from n data points and label the selected points as “case”, label
the remaining n0 points as “control”
• with the relabelled data, estimate kernel smoother and at every data point.
• estimate K1j(r) and K0j(r) and compute D̂j(r) at a set of discrete distances {r1, r2, …, rm} .
2). test statistic
• for each j, compute
• compute the variance of D̂(rk) for each k=1, 2, …, m. then get
3). p-value
Intro Models Theory Application
26
)(ˆ),(ˆ
01 xx jj  )(ˆ xj
2
1 0 ])(ˆ[ˆ 

n
i ijjT  x


m
k
k
kj
j
rD
rD
D
1 )](ˆvar[
)(ˆ
ˆ
)199/(]1}ˆˆ{[
)199/(]1}ˆˆ{[
99
1
2
99
1
1






j
j
j
j
DDIp
TTIp
Case study-the chorley data
Intro Models Theory Application
27
58 cases 978 controls
Lots of graphs
Intro Models Theory Application
28
Intro Models Theory Application
29
Monte Carlo test gives p-value = 0.64 →there is no significant spatial variation in
the relative risk.
graph
Intro Models Theory Application
30
p-value = 0.91
→ no significant
relative spatial
clustering
Summary
summary 31
Spatial point
patterns
Spatial point
processes
HPP
IPP
λ(s)
K(r)
CSR
Application in
epidemiology
Thank you for your attention!
Time for Q&A
The end 32

More Related Content

What's hot

Frontiers of data-driven property prediction: molecular machine learning
Frontiers of data-driven property prediction: molecular machine learningFrontiers of data-driven property prediction: molecular machine learning
Frontiers of data-driven property prediction: molecular machine learning
Ichigaku Takigawa
 
Robust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object DetectionRobust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object Detection
ActiveEon
 
Graph Neural Networks.pptx
Graph Neural Networks.pptxGraph Neural Networks.pptx
Graph Neural Networks.pptx
Kumar Iyer
 
Extraction of common conceptual components from multiple ontologies
Extraction of common conceptual components from multiple ontologiesExtraction of common conceptual components from multiple ontologies
Extraction of common conceptual components from multiple ontologies
Valentina Carriero
 
Unit 2: All
Unit 2: AllUnit 2: All
Unit 2: All
Hector Zenil
 
Open and closed queueing network
Open and closed queueing networkOpen and closed queueing network
Open and closed queueing network
Fahmida Afrin
 
Path Planning for Mobile Robots
Path Planning for Mobile RobotsPath Planning for Mobile Robots
Path Planning for Mobile Robots
sriraj317
 
Gnn overview
Gnn overviewGnn overview
Gnn overview
Louis (Yufeng) Wang
 
Geometric Deep Learning
Geometric Deep Learning Geometric Deep Learning
Geometric Deep Learning
PetteriTeikariPhD
 
36 กลยุทธิ์ชนะศึก
36 กลยุทธิ์ชนะศึก36 กลยุทธิ์ชนะศึก
36 กลยุทธิ์ชนะศึกtommy
 
Introduction to Graph neural networks @ Vienna Deep Learning meetup
Introduction to Graph neural networks @  Vienna Deep Learning meetupIntroduction to Graph neural networks @  Vienna Deep Learning meetup
Introduction to Graph neural networks @ Vienna Deep Learning meetup
Liad Magen
 
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Universitat Politècnica de Catalunya
 
Probability Statistics and Random Processes for Electrical Engineering 3rd Ed...
Probability Statistics and Random Processes for Electrical Engineering 3rd Ed...Probability Statistics and Random Processes for Electrical Engineering 3rd Ed...
Probability Statistics and Random Processes for Electrical Engineering 3rd Ed...
JasmineBriggss
 
Graph Analytics for big data
Graph Analytics for big dataGraph Analytics for big data
Graph Analytics for big data
Sigmoid
 
introduction to graph theory
introduction to graph theoryintroduction to graph theory
introduction to graph theoryChuckie Balbuena
 
Tutorial of topological data analysis part 3(Mapper algorithm)
Tutorial of topological data analysis part 3(Mapper algorithm)Tutorial of topological data analysis part 3(Mapper algorithm)
Tutorial of topological data analysis part 3(Mapper algorithm)
Ha Phuong
 
Manifold learning with application to object recognition
Manifold learning with application to object recognitionManifold learning with application to object recognition
Manifold learning with application to object recognitionzukun
 
Graph Representation Learning
Graph Representation LearningGraph Representation Learning
Graph Representation Learning
Jure Leskovec
 
MATCHING GRAPH THEORY
MATCHING GRAPH THEORYMATCHING GRAPH THEORY
MATCHING GRAPH THEORY
garishma bhatia
 
Introduction to mathematical control theory - Dr. Purnima Pandit
Introduction to mathematical control theory - Dr. Purnima PanditIntroduction to mathematical control theory - Dr. Purnima Pandit
Introduction to mathematical control theory - Dr. Purnima Pandit
Purnima Pandit
 

What's hot (20)

Frontiers of data-driven property prediction: molecular machine learning
Frontiers of data-driven property prediction: molecular machine learningFrontiers of data-driven property prediction: molecular machine learning
Frontiers of data-driven property prediction: molecular machine learning
 
Robust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object DetectionRobust Low-rank and Sparse Decomposition for Moving Object Detection
Robust Low-rank and Sparse Decomposition for Moving Object Detection
 
Graph Neural Networks.pptx
Graph Neural Networks.pptxGraph Neural Networks.pptx
Graph Neural Networks.pptx
 
Extraction of common conceptual components from multiple ontologies
Extraction of common conceptual components from multiple ontologiesExtraction of common conceptual components from multiple ontologies
Extraction of common conceptual components from multiple ontologies
 
Unit 2: All
Unit 2: AllUnit 2: All
Unit 2: All
 
Open and closed queueing network
Open and closed queueing networkOpen and closed queueing network
Open and closed queueing network
 
Path Planning for Mobile Robots
Path Planning for Mobile RobotsPath Planning for Mobile Robots
Path Planning for Mobile Robots
 
Gnn overview
Gnn overviewGnn overview
Gnn overview
 
Geometric Deep Learning
Geometric Deep Learning Geometric Deep Learning
Geometric Deep Learning
 
36 กลยุทธิ์ชนะศึก
36 กลยุทธิ์ชนะศึก36 กลยุทธิ์ชนะศึก
36 กลยุทธิ์ชนะศึก
 
Introduction to Graph neural networks @ Vienna Deep Learning meetup
Introduction to Graph neural networks @  Vienna Deep Learning meetupIntroduction to Graph neural networks @  Vienna Deep Learning meetup
Introduction to Graph neural networks @ Vienna Deep Learning meetup
 
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
 
Probability Statistics and Random Processes for Electrical Engineering 3rd Ed...
Probability Statistics and Random Processes for Electrical Engineering 3rd Ed...Probability Statistics and Random Processes for Electrical Engineering 3rd Ed...
Probability Statistics and Random Processes for Electrical Engineering 3rd Ed...
 
Graph Analytics for big data
Graph Analytics for big dataGraph Analytics for big data
Graph Analytics for big data
 
introduction to graph theory
introduction to graph theoryintroduction to graph theory
introduction to graph theory
 
Tutorial of topological data analysis part 3(Mapper algorithm)
Tutorial of topological data analysis part 3(Mapper algorithm)Tutorial of topological data analysis part 3(Mapper algorithm)
Tutorial of topological data analysis part 3(Mapper algorithm)
 
Manifold learning with application to object recognition
Manifold learning with application to object recognitionManifold learning with application to object recognition
Manifold learning with application to object recognition
 
Graph Representation Learning
Graph Representation LearningGraph Representation Learning
Graph Representation Learning
 
MATCHING GRAPH THEORY
MATCHING GRAPH THEORYMATCHING GRAPH THEORY
MATCHING GRAPH THEORY
 
Introduction to mathematical control theory - Dr. Purnima Pandit
Introduction to mathematical control theory - Dr. Purnima PanditIntroduction to mathematical control theory - Dr. Purnima Pandit
Introduction to mathematical control theory - Dr. Purnima Pandit
 

Viewers also liked

Geoestatística
GeoestatísticaGeoestatística
Geoestatística
Eduardo Cabral
 
의사들의 통계맹(Statistically Illiteracy in Doctors)
의사들의 통계맹(Statistically Illiteracy in Doctors)의사들의 통계맹(Statistically Illiteracy in Doctors)
의사들의 통계맹(Statistically Illiteracy in Doctors)
Inha University
 
역학 연구에서 위험 요인 교호작용 평가
역학 연구에서 위험 요인 교호작용 평가역학 연구에서 위험 요인 교호작용 평가
역학 연구에서 위험 요인 교호작용 평가
Inha University
 
Fundamentos de geoestatística
Fundamentos de geoestatísticaFundamentos de geoestatística
Fundamentos de geoestatística
WALTER DE SOUZA FERRAZ NETO
 
건강정책 연구를 위한 R 소프트웨어 소개
건강정책 연구를 위한 R 소프트웨어 소개건강정책 연구를 위한 R 소프트웨어 소개
건강정책 연구를 위한 R 소프트웨어 소개
Inha University
 
أخطاء اللغة العربية المعاصرة عند الكتاب و الإذاعيين
أخطاء اللغة العربية المعاصرة عند الكتاب و الإذاعيينأخطاء اللغة العربية المعاصرة عند الكتاب و الإذاعيين
أخطاء اللغة العربية المعاصرة عند الكتاب و الإذاعيين
Remas Mohamed
 
Poisson distribution
Poisson distributionPoisson distribution
Poisson distributionAntiqNyke
 
Poisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric DistributionPoisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric Distribution
DataminingTools Inc
 

Viewers also liked (8)

Geoestatística
GeoestatísticaGeoestatística
Geoestatística
 
의사들의 통계맹(Statistically Illiteracy in Doctors)
의사들의 통계맹(Statistically Illiteracy in Doctors)의사들의 통계맹(Statistically Illiteracy in Doctors)
의사들의 통계맹(Statistically Illiteracy in Doctors)
 
역학 연구에서 위험 요인 교호작용 평가
역학 연구에서 위험 요인 교호작용 평가역학 연구에서 위험 요인 교호작용 평가
역학 연구에서 위험 요인 교호작용 평가
 
Fundamentos de geoestatística
Fundamentos de geoestatísticaFundamentos de geoestatística
Fundamentos de geoestatística
 
건강정책 연구를 위한 R 소프트웨어 소개
건강정책 연구를 위한 R 소프트웨어 소개건강정책 연구를 위한 R 소프트웨어 소개
건강정책 연구를 위한 R 소프트웨어 소개
 
أخطاء اللغة العربية المعاصرة عند الكتاب و الإذاعيين
أخطاء اللغة العربية المعاصرة عند الكتاب و الإذاعيينأخطاء اللغة العربية المعاصرة عند الكتاب و الإذاعيين
أخطاء اللغة العربية المعاصرة عند الكتاب و الإذاعيين
 
Poisson distribution
Poisson distributionPoisson distribution
Poisson distribution
 
Poisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric DistributionPoisson Distribution, Poisson Process & Geometric Distribution
Poisson Distribution, Poisson Process & Geometric Distribution
 

Similar to Spatial Point Processes and Their Applications in Epidemiology

Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmission
JeremyHeng10
 
Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmission
JeremyHeng10
 
Lecture: Monte Carlo Methods
Lecture: Monte Carlo MethodsLecture: Monte Carlo Methods
Lecture: Monte Carlo Methods
Frank Kienle
 
On non-negative unbiased estimators
On non-negative unbiased estimatorsOn non-negative unbiased estimators
On non-negative unbiased estimators
Pierre Jacob
 
Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...
Frank Nielsen
 
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdfStatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
Anna Carbone
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Chapter-4 combined.pptx
Chapter-4 combined.pptxChapter-4 combined.pptx
Chapter-4 combined.pptx
HamzaHaji6
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
The Statistical and Applied Mathematical Sciences Institute
 
First paper with the NITheCS affiliation
First paper with the NITheCS affiliationFirst paper with the NITheCS affiliation
First paper with the NITheCS affiliation
Rene Kotze
 
2012 mdsp pr04 monte carlo
2012 mdsp pr04 monte carlo2012 mdsp pr04 monte carlo
2012 mdsp pr04 monte carlonozomuhamada
 
Advanced Econometrics L5-6.pptx
Advanced Econometrics L5-6.pptxAdvanced Econometrics L5-6.pptx
Advanced Econometrics L5-6.pptx
akashayosha
 
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference CompilationAccelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Feynman Liang
 
Jere Koskela slides
Jere Koskela slidesJere Koskela slides
Jere Koskela slides
Christian Robert
 
MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...
MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...
MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...
The Statistical and Applied Mathematical Sciences Institute
 
Formulas statistics
Formulas statisticsFormulas statistics
Formulas statistics
Prashi_Jain
 
Richard Everitt's slides
Richard Everitt's slidesRichard Everitt's slides
Richard Everitt's slides
Christian Robert
 
PTSP PPT.pdf
PTSP PPT.pdfPTSP PPT.pdf
PTSP PPT.pdf
goutamkrsahoo
 
Nonlinear Stochastic Optimization by the Monte-Carlo Method
Nonlinear Stochastic Optimization by the Monte-Carlo MethodNonlinear Stochastic Optimization by the Monte-Carlo Method
Nonlinear Stochastic Optimization by the Monte-Carlo Method
SSA KPI
 

Similar to Spatial Point Processes and Their Applications in Epidemiology (20)

Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmission
 
PhysicsSIG2008-01-Seneviratne
PhysicsSIG2008-01-SeneviratnePhysicsSIG2008-01-Seneviratne
PhysicsSIG2008-01-Seneviratne
 
Sequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmissionSequential Monte Carlo algorithms for agent-based models of disease transmission
Sequential Monte Carlo algorithms for agent-based models of disease transmission
 
Lecture: Monte Carlo Methods
Lecture: Monte Carlo MethodsLecture: Monte Carlo Methods
Lecture: Monte Carlo Methods
 
On non-negative unbiased estimators
On non-negative unbiased estimatorsOn non-negative unbiased estimators
On non-negative unbiased estimators
 
Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...Pattern learning and recognition on statistical manifolds: An information-geo...
Pattern learning and recognition on statistical manifolds: An information-geo...
 
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdfStatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
StatPhysPerspectives_AMALEA_Cetraro_AnnaCarbone.pdf
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 
Chapter-4 combined.pptx
Chapter-4 combined.pptxChapter-4 combined.pptx
Chapter-4 combined.pptx
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Model Selection in the...
 
First paper with the NITheCS affiliation
First paper with the NITheCS affiliationFirst paper with the NITheCS affiliation
First paper with the NITheCS affiliation
 
2012 mdsp pr04 monte carlo
2012 mdsp pr04 monte carlo2012 mdsp pr04 monte carlo
2012 mdsp pr04 monte carlo
 
Advanced Econometrics L5-6.pptx
Advanced Econometrics L5-6.pptxAdvanced Econometrics L5-6.pptx
Advanced Econometrics L5-6.pptx
 
Accelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference CompilationAccelerating Metropolis Hastings with Lightweight Inference Compilation
Accelerating Metropolis Hastings with Lightweight Inference Compilation
 
Jere Koskela slides
Jere Koskela slidesJere Koskela slides
Jere Koskela slides
 
MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...
MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...
MUMS Opening Workshop - Model Uncertainty in Data Fusion for Remote Sensing -...
 
Formulas statistics
Formulas statisticsFormulas statistics
Formulas statistics
 
Richard Everitt's slides
Richard Everitt's slidesRichard Everitt's slides
Richard Everitt's slides
 
PTSP PPT.pdf
PTSP PPT.pdfPTSP PPT.pdf
PTSP PPT.pdf
 
Nonlinear Stochastic Optimization by the Monte-Carlo Method
Nonlinear Stochastic Optimization by the Monte-Carlo MethodNonlinear Stochastic Optimization by the Monte-Carlo Method
Nonlinear Stochastic Optimization by the Monte-Carlo Method
 

Spatial Point Processes and Their Applications in Epidemiology

  • 1. Spatial Statistics for Epidemiology —Spatial Point Processes By Liu Xu U086105E Supervisor: Prof Loh Wei Liem Department of Statistics and Applied Probability National University of Singapore 15 March 2012 1
  • 2. Outline Application in epidemiology Theory: descriptive statistics Models: spatial point processes Data: spatial point patterns 2
  • 3. A spatial point pattern is … Space → Rd, d ≥ 2 Points → data values Pattern → arrangement Intro Models Theory Application 3
  • 4. Example: tropical rainforest Intro Models Theory Application 4
  • 5. Example: cancer cases Intro Models Theory Application 5 marks: extra information attached to points, categorical/ continuous
  • 6. Example : the milky way galaxy Intro Models Theory Application 6
  • 7. Types of point patterns Intro Models Theory Application 7 regularityCSRclustering repulsionattraction   
  • 8. Aim: describe and model “pattern” Are points randomly located? • If so, find a statistical model to describe the “randomness”; • If not, … Intro Models Theory Application 8
  • 9. Models: spatial point processes • A spatial point process is a stochastic process X which generates a countable set of events in defined space. • A spatial pattern x = {x1, x2, …, xn} on an observational region W generated from a spatial point process is a realization of the process. • Only consider point processes in 2-D space. • The locations of any object can be modelled−plants, animals, cells, stars, disease cases, earthquakes, … Intro Models Theory Application 9
  • 10. Models: spatial point processes Intro Models Theory Application 10 • Notation: W: study region in R2 N(A): number of events inside subregion A, A W. |A|: area of region A s: random locations in W ds: infinitesimal region centered at s • Assumptions on spatial point processes: i. Locally finite: the number of events in any bounded region is bounded ii. At any point location s, there is either one event or no events at all 
  • 11. HPP A spatial point process in a bounded region W in R2 is a homogeneous Poisson process (HPP) if: i. For all subregion A in W, N(A) ~ Poi(λ|A|), where 0 < λ < ∞ is a constant, called intensity (homogeneity). ii. If A1 and A2 are two disjoint subregions in W, then N(A1) and N(A2) are independent (independence). • Standard model for complete spatial randomness (CSR); • Can be generalized to more complicated models; • A reference process when analyzing spatial characteristics of a specific pattern. Intro Models Theory Application 11
  • 12. IPP A spatial point process in a bounded region W in R2 is a inhomogeneous Poisson process (IPP) if: i. For any subregion A in W, N(A) ~ Poi(∫Aλ(s)ds), where 0 < λ(s) < ∞ is the intensity at s . ii. If A1 and A2 are two disjoint subregions in W, then N(A1) and N(A2) are independent (independence). Intro Models Theory Application 12 HPPIPP tiongeneraliza casespecial    
  • 13. Simulation from Poisson processes Intro Models Theory Application 13 Two Poisson process realizations on the unit square having the same expected number of events = 100.
  • 14. Summary statistics: first-order First-order intensity of a spatial point process is: • Interpretation: expected number of events per unit area. For small region ds, λ(s)|ds| describes the probability for an event in ds. • Intensity may be constant (homogeneous) or may vary from location to location (inhomogeneous). If the process is homogeneous, estimate intensity by Intro Models Theory Application 14 s s s s d dNE d ))(( lim)( 0  W WN )(ˆ 
  • 15. Estimate λ(s) in inhomogeneous case • Estimating the intensity of a spatial point pattern is similar to estimating a bivariate probability density • How to estimate bivariate density? Given an i.i.d. sample (y1, . . . , yn) of a bivariate random variable Y, an estimate of the density f (·) of Y at y is where K(·) is the kernel and h is the bandwidth. • The expression for kernel smoothing of the intensity function of a point pattern x = {x1, …, xn} at location s is the bandwidth h is chosen based on some cross-validation criterion. Intro Models Theory Application 15    n i ) h K( nh )(f 1 2 1ˆ i yy y    n i ) h K( h )(λ 1 2 1ˆ sx s i
  • 16. Kernel smoothed intensity of IPP Intro Models Theory Application 16 Kernel estimated intensity for the point pattern simulated from HPP with λ(s) = 400xy on [0, 1] * [0, 1].
  • 17. Summary statistics: second-order The second-order properties of a point process involve relationship between number of events at different locations. • The second-order intensity of a spatial point process is • A point process is called stationary if • A stationary point process is isotropic if Intro Models Theory Application 17           ji ji ss ji ss ss ss ji dd dNdNE dd )]()([ lim),( 0, 2      )(),( ,)( 22 jiji ssss ss   )(),( 22 jiji ssss  
  • 18. If a point process is stationary and isotropic, the K-function of the process is defined by: λK(r) = E[number of further events within distance r from an arbitrary event] Two properties of K-function: • For a HPP, λK(r) = λπr2 , thus Kp(r) = πr2 • K(r) is invariant to random thinning. Intro Models Theory Application 18 K-function Def. random thinning: each event of a point process X is either retained or deleted with retention probability p, independently of other events. The resulting point process X’ contains a subset of events of the original process X.
  • 19. Comparing estimated K-functions of simulated point patterns Intro Models Theory Application 19 CSR: K(r) = πr2 clustered: K(r) > πr2 regular: K(r) < πr2
  • 20. Estimation of K(r):Ê(# further events…)/λ̂ Intro Models Theory Application 20 negatively biased edge correction
  • 21. Application in epidemiology John Snow (15 March 1813 – 16 June 1858) is considered to be one of the fathers of epidemiology, because of his work in tracing the source of a cholera outbreak in Soho, England, in 1854 Intro Models Theory Application 21
  • 22. Case-control study Goal: compare the spatial distribution of disease cases with the underlying population • Null hypothesis : equal spatial distribution • Controls: selected to represent population heterogeneity Intro Models Theory Application 22 Incidence of disease Population density Overall risk of disease Other risk factors, e.g. distance from point source Do disease cases occur randomly among population?
  • 23. Case-control data consist of two point patterns: • the locations of n1 cases of particular disease {x1, x2, …, xn1} • the locations n0 controls {xn1+1, …, xn1+n0} in a study region W over a defined period of time. Total number of data points n = n1 + n0. Assumption: • Cases from an IPP with intensity λ1(s) • Controls from another independent IPP with intensity λ0(s) Intro Models Theory Application 23
  • 24. Spatial risk relative risk: estimated relative risk: H0: test statistic: estimated test statistic: significance: Monte Carlo test Intro Models Theory Application 24 )( )( )( 0 1 s s s     0 1 0)( n n   s   n i T 1 2 0 ])([  ix   n i T 1 2 0 ])(ˆ[ˆ  ix )(ˆ )(ˆ )(ˆ 0 1 s s s    
  • 25. Spatial clustering K0(r)→ amount of clustering due to population K1(r)→ amount of clustering due to population plus effect of other possible risk factors D(r) = K1(r) - K0(r) → the amount of clustering that is not due to population estimate: H0: Test statistic: significance: Monte Carlo test Intro Models Theory Application 25   m k k k rD rD D 1 )](var[ )( )(ˆ)(ˆ)(ˆ 01 rKrKrD  0D(r)=
  • 26. Monte Carlo test 1). simulation with random labelling at jth iteration, j=1, 2, …, 99 • randomly select n1 points from n data points and label the selected points as “case”, label the remaining n0 points as “control” • with the relabelled data, estimate kernel smoother and at every data point. • estimate K1j(r) and K0j(r) and compute D̂j(r) at a set of discrete distances {r1, r2, …, rm} . 2). test statistic • for each j, compute • compute the variance of D̂(rk) for each k=1, 2, …, m. then get 3). p-value Intro Models Theory Application 26 )(ˆ),(ˆ 01 xx jj  )(ˆ xj 2 1 0 ])(ˆ[ˆ   n i ijjT  x   m k k kj j rD rD D 1 )](ˆvar[ )(ˆ ˆ )199/(]1}ˆˆ{[ )199/(]1}ˆˆ{[ 99 1 2 99 1 1       j j j j DDIp TTIp
  • 27. Case study-the chorley data Intro Models Theory Application 27 58 cases 978 controls
  • 28. Lots of graphs Intro Models Theory Application 28
  • 29. Intro Models Theory Application 29 Monte Carlo test gives p-value = 0.64 →there is no significant spatial variation in the relative risk.
  • 30. graph Intro Models Theory Application 30 p-value = 0.91 → no significant relative spatial clustering
  • 31. Summary summary 31 Spatial point patterns Spatial point processes HPP IPP λ(s) K(r) CSR Application in epidemiology
  • 32. Thank you for your attention! Time for Q&A The end 32