SlideShare a Scribd company logo
1 of 64
Download to read offline
GIS in Public Health Research:
Understanding Spatial Analysis &
Interpreting Outcomes
Kristin Osiecki PhD
Houston Aerosol Characterization &
Health Experiment (HACHE)
• UT Health Science Center School of
Biomedical Informatics
• University of Houston Department
of Earth and Atmospheric Sciences
• Rice University Department of
Sociology and Department of Civil &
Environmental Engineering
Applications in Public Health Research
• Space matters
– communities,census tracts, counties, states

• Multidisciplinary and Interdisciplinary
• Collaborative
• Simple and Complex Models
What research questions are we
trying to answer?
• Do we need visualizations or maps? OR
• Are we interested in investigating possible
spatial relationships within the data?
ArcGIS Toolbox
Handyman’s Dream
or
Do-it-yourself nightmare?
Objectives
•
•
•
•

Traditional Statistics & Spatial Analysis
Permutations
Spatial Weights
EDA & ESDA
"Spatial Statistics" does not mean
applying traditional (non-spatial)
statistical methods to data that just
happens to be spatial (has X and Y
coordinates).
Source: ESRI
http://resources.esri.com/help/9.3/arcgisen
gine/java/gp_toolref/spatial_statistics_tools
/how_generate_spatial_weights_matrix_spa
tial_statistics_works.htm
Spatial Analysis

Traditional
Statistical
Methodology

Spatial
Methodology
Global & Local
Global
Model

EDA
ESDA

Global autocorrelation
Local autocorrelation

Local
Model
The most crucial step in the process
Exploring the Data: EDA & ESDA
Scatter Plot Matrix
1
0.8
0.6
0.4

pct_pov

0.2
0

p_FHH

0

p_blck

pct_pov

0.2

0.4
0.6
p_blck x p_FHH

0.8

1
Exploratory Spatial Data Analysis
• Interactively visualize and explore data
where space matter
• Detect patterns
• Hypothesis generation
• spatial modeling is needed to test
hypotheses
• Works on point feature and polygon
features (i.e. census, epidemiology,
demographic layers)
What is Spatial Randomness?
• Observed spatial pattern of value is equally as
likely as any other spatial pattern
• Value at one location does not depend on
values at neighboring locations under spatial
randomness, the location of values may be
altered without affecting the information
content of the data
• random permutation or reshuffling of values
Dr. Luc Anselin 2012
Spatial Randomness
• Spatial Randomness Null Hypothesis
– Spatial randomness is absence in any pattern
– If rejected, evidence of spatial structure

Dr. Luc Anselin 2012
ArcGIS Spatial Autocorrelation
• The Randomization Null Hypothesis: Where appropriate, the tools in the
Spatial Statistics toolbox use the randomization null hypothesis as the
basis for statistical significance testing. The randomization null hypothesis
postulates that the observed spatial pattern of your data represents one
of many (n!) possible spatial arrangements. If you could pick up your data
values and throw them down onto the features in your study area, you
would have one possible spatial arrangement of those values. (Note that
picking up your data values and throwing them down arbitrarily is an
example of a random spatial process). The randomization null hypothesis
states that if you could do this exercise (pick them up, throw them down)
infinite times, most of the time you would produce a pattern that would
not be markedly different from the observed pattern (your real data).
Once in a while you might accidentally throw all the highest values into
the same corner of your study area, but the probability of doing that is
small. The randomization null hypothesis states that your data is one of
many, many, many possible versions of complete spatial randomness. The
data values are fixed; only their spatial arrangement could vary.
http://resources.arcgis.com/en/help/main/10.
1/index.html#//005p00000006000000
Permutations
• A numerical approach to testing for statistical
significance (in contrast to analytical
approaches)
• It is data-driven and makes no assumptions
(such as normality) about the data
Permutations in Geoda
• Permutation inference is shuffling values around
and re-computing statistics each time with a
different set of random numbers to construct a
reference distribution.
• Permutations are used to determine how likely it
would be to observe the Moran’s I value of an
actual distribution under conditions of spatial
randomness.
• P-values are dependent on the number of
permutations so they are “pseudo p-values”
Permutations
Spatial Weights
The first step in the analysis of spatial
autocorrelation is to construct a spatial weights
file that contains information on the
“neighborhood” structure for each location
(luc anselin)
Generation of Spatial Weights ESRI
• For binary strategies (fixed distance, K nearest
neighbors, or contiguity) a feature is either a
neighbor (1) or it is not (0).
• For weighted strategies (inverse distance or
zone of indifference) neighboring features
have a varying amount of impact (or
influence) and weights are computed to
reflect that variation.
Row Standardization
• Adjusts the weights in a spatial weights matrix
• Each weight is divided by its row sum
• The row sum is the sum of weights for a
feature’s neighbors.
• A weights matrix is row-standardized when
the values of each of its rows sum to one.
Binary vs. row-standardized
• A binary weights matrix looks like:
0

1

0

0

0

0

1

1

1

1

0

0

0

1

1

1

• A row-standardized matrix it looks like:
0

1

0

0

0

0

.5

.5

.5

.5

0

0

0

.33

.33

.33
Spatial Weights

• Formal expression of locational similarity
Distance Models
• Inverse distance – all features influence all
other features, but the closer something is,
the more influence it has
• Distance band – features outside a specified
distance do not influence the features within
the area
• Zone of indifference – combines inverse
distance and distance band
Inverse Distance (impedance) (ArcGIS)
• features impact/influence all other features
– farther away something is, the smaller the impact

• specify a Distance Band/Threshold Distance value
to reduce the number of required computations
– especially with large datasets.
– If not specified, a default threshold
value is computed for you

• Choosing an appropriate distance is important
– Some spatial statistics require each feature to have at
least one neighbor for the analysis to be reliable.
Distance band (sphere of influence)
• impose a sphere of influence, or moving window
conceptual model of spatial interactions onto the data
• Neighbors within the specified distance are weighted
equally. Features outside have no influence (weight = 0)
• Evaluate the statistical properties of your data at a
particular (fixed) spatial scale
• have at least one neighbor, or results will not be valid
• if the input data is skewed make sure that your distance
band is neither too small (only one or two neighbors) nor
too large (include all other features as neighbors)
– resultant z-scores less reliable.
Adjacency Models
• K Nearest Neighbors – a specified number of
neighboring features are included in
calculations
• Polygon Contiguity – polygons that share an
edge or node influence each other
K-nearest neighbors

• each feature assessed in the spatial context of a
specified number of its closest neighbors. If K (t is
8, then eight closest neighbors to the target
feature will be included If feature density is high spatial context of the analysis will be smaller.
• If feature density is sparse, the spatial context for
the analysis will be larger.
• method is available using the Generate Spatial
Weights Matrix tool
Polygon contiguity (first order)
• polygons that share an edge (that have
coincident boundaries) are included in
computations for the target polygon
• modeling some type of contagious process or
are dealing with continuous data represented
as polygons.
Binary Contiguity Weights
• contiguity = common border
• i and j share a border, then wij = 1
• i and j are not neighbors, then wij = 0
• weights are 0 or 1, hence binary
Distance-Based Weights
• distance between points
• distance between polygon
centroids or central points
• distance-band weights:
wij nonzero for dij < d
less than a critical distance d
• k-nearest neighbor weights:
same number of neighbors for all
observations
potential problems with ties
Global vs. Local Statistics
• Global statistics (Clustering) – identify and
measure the pattern of the entire study area
– Do not indicate where specific patterns occur

• Local Statistics (Clusters) – identify variation
across the study area, focusing on individual
features and their relationships to nearby
features (i.e. specific areas of clustering)
Spatial Autocorrelation (Moran’s I)
• Global statistic
• Measures whether the pattern of feature values is clustered,
dispersed, or random.
• Compares the difference between the mean of the target
feature and the mean for all features to the difference
between the mean for each neighbor and the mean for all
features.
Mean of Target
Feature

Mean of each
neighbor
Mean of
all
features
Z-Score & P-value (ArcGIS)
• Very high or very low (negative) z-scores,
associated with very small p-values, are found in
the tails of the normal distribution
• it is unlikely that the observed spatial pattern
reflects the theoretical random pattern
represented by your null hypothesis (CSR)
• The null hypothesis for the pattern analysis tools
is Complete Spatial Randomness (CSR), either of
the features themselves or of the values
associated with those features.
http://resources.arcgis.com/en/help/main/10.
1/index.html#//005p00000006000000
Pseudo P-Value
• significance levels are dependent on the
number of permutations
• One-sided significance test
• For instance, if an observed Moran's I value is
higher than any of the randomly generated
Moran's I values, the pseudo p-value would be
1/100=0.01 for 99 permutations or
1/1,000=0.001 for 999 permutations
Spatial Autocorrelation (Moran’s I)
Polygon Contiguity (first order)
Spatial Autocorrelation (Moran’s I)
Polygon Contiguity (first order)
Percent Black Population, Cook County, IL
Generate Spatial Weights Matrix
K-Nearest Neighbor
Spatial Autocorrelation (Moran’s I)
K-Nearest Neighbor
Percent Black Population, Cook County, IL
Spatial Autocorrelation (Moran’s I)
K-Nearest Neighbor
Percent Black Population, Cook County, IL
Spatial Autocorrelation (Getis –Ord General G High/Low Clustering)
Polygon Contiguity
Percent Black Population, Cook County, IL

If the z-score value is positive, the observed General G index is larger than the expected
General G index, indicating high values for the attribute are clustered in the study area
Geoda Spatial Autocorrelation (Moran’s I)
Percent Black Population, Cook County, IL
Geoda Spatial Autocorrelation (Moran’s I)
Queen Contiguity Weight (1st order)
Percent Black Population, Cook County, IL
Geoda Spatial Autocorrelation (Moran’s I)
K-Nearest Neighbor (eight)
Percent Black Population, Cook County, IL
Geoda Spatial Autocorrelation (Moran’s I)
K-Nearest Neighbor (four)
Percent Black Population, Cook County, IL
Anselin Local Moran’s I
• Local statistic
• Measures the strength of patterns for
each specific feature.
• Compares the value of each feature in a
pair to the mean value for all features in
the study area.
Anselin Local Moran’s I
• Positive I value:
– Feature is surrounded by features with similar values, either high or low.
– Feature is part of a cluster.
– Statistically significant clusters can consist of high values (HH) or low
values (LL)

• Negative I value:
– Feature is surrounded by features with dissimilar values.
– Feature is an outlier.
– Statistically significant outliers can be a feature with a high value
surrounded by features with low values (HL) or a feature with a low
value surrounded by features with high values (LH).
Anselin Local Moran’s I

• The z- scores and p-values are measures of statistical
significance which tell you whether or not to reject the
null hypothesis, feature by feature.
• Indicate whether the apparent similarity (or
dissimilarity) in values for a feature and its neighbors is
greater than one would expect in a random distribution.
http://resources.esri.com/help/9.3/arcgisengine/java/gp_toolref/spatial_statistics_tools/clu
ster_and_outlier_analysis_colon_anselin_local_moran_s_i_spatial_statistics_.htm
index

z-score

p-value
Anselin’s Local Moran’s I
Polygon Contiguity Weight
Percent Black Population
Cook County, IL

HH LH
Geoda Univariate LISA
Queen Contiguity Weight
Percent Black Population, Cook County, IL
p-values 499 Permutations

p-values 999 Permutations
Geoda Univariate LISA
Queen Contiguity Weight
Percent Black Population, Cook County, IL
HH HL 999 Permutations
Comparison ArcGIS & Geoda Results
Queen Contiguity Weight
Percent Black Population, Cook County, IL
p-values
Comparison ArcGIS & Geoda Univariate LISA
Queen Contiguity Weight
Percent Black Population, Cook County, IL
HH HL

HH HL 999 Permutations
Bivariate LISA Scatterplot
High - High

Low-Low

High - Low

Non-point Source
Cancer Risk

Low-High

Percent Poverty

Chow test for selected/unselected regression subsets distribution F(2,1339)
ratio=214.6 p-value=0
INTERCEPT
# of

R^2

Constant

Observations

Std

t-statistic

SLOPE
p-value

Slope

Error

Std

t-statistic

p-value

Error

1343

0.209

0.00442

0.0176

0.251

0.802

0.332

0.0176

18.8

0

80

0.1116

1.58

0.0797

19.8

0

0.045

0.0475

0.957

0.342

1263

0.118

-0.0794

0.0161

-4.92

0

0.223

0.0172

13

0
Global
Model

EDA
ESDA

Local
Model

More Related Content

What's hot

Drone flight data processing
Drone flight data processingDrone flight data processing
Drone flight data processingDany Laksono
 
Introduction to arc gis
Introduction to arc gisIntroduction to arc gis
Introduction to arc gisMohamed Hamed
 
TYBSC IT PGIS Unit III Chapter I Spatial Referencing and Positioning
TYBSC IT PGIS Unit III Chapter I Spatial Referencing and PositioningTYBSC IT PGIS Unit III Chapter I Spatial Referencing and Positioning
TYBSC IT PGIS Unit III Chapter I Spatial Referencing and PositioningArti Parab Academics
 
DATA in GIS and DATA Query
DATA in GIS and DATA QueryDATA in GIS and DATA Query
DATA in GIS and DATA QueryKU Leuven
 
Introduction to GIS
Introduction to GISIntroduction to GIS
Introduction to GISEhsan Hamzei
 
Gis (geographic information system)
Gis (geographic information system)Gis (geographic information system)
Gis (geographic information system)Saad Bare
 
Getting started with GIS
Getting started with GISGetting started with GIS
Getting started with GISEsri India
 
An introduction to geographic information systems (gis) m goulbourne 2007
An introduction to geographic information systems (gis)   m goulbourne 2007An introduction to geographic information systems (gis)   m goulbourne 2007
An introduction to geographic information systems (gis) m goulbourne 2007Michelle Goulbourne @ DiaMind Health
 
Digital Image Classification.pptx
Digital Image Classification.pptxDigital Image Classification.pptx
Digital Image Classification.pptxHline Win
 
Introduction and Application of GIS
Introduction and Application of GISIntroduction and Application of GIS
Introduction and Application of GISSatish Taji
 
Land information system in Nepal
Land information system in NepalLand information system in Nepal
Land information system in NepalQust04
 
Image classification and land cover mapping
Image classification and land cover mappingImage classification and land cover mapping
Image classification and land cover mappingKabir Uddin
 
Remote sensing GIS and public policy
Remote sensing GIS and public policyRemote sensing GIS and public policy
Remote sensing GIS and public policyprasun_70
 
Data base management system
Data base management systemData base management system
Data base management systemPramoda Raj
 

What's hot (20)

Gis
GisGis
Gis
 
Introduction to GIS
Introduction to GISIntroduction to GIS
Introduction to GIS
 
Drone flight data processing
Drone flight data processingDrone flight data processing
Drone flight data processing
 
Introduction to arc gis
Introduction to arc gisIntroduction to arc gis
Introduction to arc gis
 
TYBSC IT PGIS Unit III Chapter I Spatial Referencing and Positioning
TYBSC IT PGIS Unit III Chapter I Spatial Referencing and PositioningTYBSC IT PGIS Unit III Chapter I Spatial Referencing and Positioning
TYBSC IT PGIS Unit III Chapter I Spatial Referencing and Positioning
 
DATA in GIS and DATA Query
DATA in GIS and DATA QueryDATA in GIS and DATA Query
DATA in GIS and DATA Query
 
Introduction to GIS
Introduction to GISIntroduction to GIS
Introduction to GIS
 
Gis (geographic information system)
Gis (geographic information system)Gis (geographic information system)
Gis (geographic information system)
 
Getting started with GIS
Getting started with GISGetting started with GIS
Getting started with GIS
 
An introduction to geographic information systems (gis) m goulbourne 2007
An introduction to geographic information systems (gis)   m goulbourne 2007An introduction to geographic information systems (gis)   m goulbourne 2007
An introduction to geographic information systems (gis) m goulbourne 2007
 
Principles of photogrammetry
Principles of photogrammetryPrinciples of photogrammetry
Principles of photogrammetry
 
Digital Image Classification.pptx
Digital Image Classification.pptxDigital Image Classification.pptx
Digital Image Classification.pptx
 
Introduction and Application of GIS
Introduction and Application of GISIntroduction and Application of GIS
Introduction and Application of GIS
 
Introduction to gis
Introduction to gisIntroduction to gis
Introduction to gis
 
Land information system in Nepal
Land information system in NepalLand information system in Nepal
Land information system in Nepal
 
GIS Geographical Information System
GIS Geographical Information SystemGIS Geographical Information System
GIS Geographical Information System
 
Digital Cartography
Digital CartographyDigital Cartography
Digital Cartography
 
Image classification and land cover mapping
Image classification and land cover mappingImage classification and land cover mapping
Image classification and land cover mapping
 
Remote sensing GIS and public policy
Remote sensing GIS and public policyRemote sensing GIS and public policy
Remote sensing GIS and public policy
 
Data base management system
Data base management systemData base management system
Data base management system
 

Viewers also liked

Geographic information system
Geographic information systemGeographic information system
Geographic information systemSumanta Das
 
GIS presentation
GIS presentationGIS presentation
GIS presentationarniontech
 
What is GIS
What is GISWhat is GIS
What is GISEsri
 
Integrating Virtual Environment and GIS for 3D Virtual City.ppt
Integrating Virtual Environment and GIS for 3D Virtual City.pptIntegrating Virtual Environment and GIS for 3D Virtual City.ppt
Integrating Virtual Environment and GIS for 3D Virtual City.pptgrssieee
 
Esri Health GIS Conference
Esri Health GIS ConferenceEsri Health GIS Conference
Esri Health GIS ConferenceEsri
 
Applications of GIS in Public Health Engineering
Applications of GIS in Public Health EngineeringApplications of GIS in Public Health Engineering
Applications of GIS in Public Health EngineeringVignesh Sekar
 
Open source health gis presentation final
Open source health gis  presentation finalOpen source health gis  presentation final
Open source health gis presentation finalJISC GECO
 
Spatial data analysis 2
Spatial data analysis 2Spatial data analysis 2
Spatial data analysis 2Johan Blomme
 
Geographic data in public health: Lessons from the field
Geographic data in public health: Lessons from the fieldGeographic data in public health: Lessons from the field
Geographic data in public health: Lessons from the fieldMEASURE Evaluation
 
Virginia Gis Health Presentation
Virginia Gis Health PresentationVirginia Gis Health Presentation
Virginia Gis Health PresentationKoren M
 
GIS Day 2015 - New Light Technologies, Inc.
GIS Day 2015 - New Light Technologies, Inc.GIS Day 2015 - New Light Technologies, Inc.
GIS Day 2015 - New Light Technologies, Inc.Daniel Sandhaus
 
Mosquera assign ch1_1_1
Mosquera assign ch1_1_1Mosquera assign ch1_1_1
Mosquera assign ch1_1_1Gordi Hatch
 
ABSTRACT Public Health GIS- DST NATIONAL PROJECT
ABSTRACT Public Health GIS- DST NATIONAL PROJECTABSTRACT Public Health GIS- DST NATIONAL PROJECT
ABSTRACT Public Health GIS- DST NATIONAL PROJECTIyyanki V Muralikrishna
 

Viewers also liked (20)

GIS in Health
GIS in HealthGIS in Health
GIS in Health
 
Introduction to Health GIS
Introduction  to Health GIS Introduction  to Health GIS
Introduction to Health GIS
 
Geographic information system
Geographic information systemGeographic information system
Geographic information system
 
What Is GIS?
What Is GIS?What Is GIS?
What Is GIS?
 
GIS presentation
GIS presentationGIS presentation
GIS presentation
 
My ppt on gis
My ppt on gisMy ppt on gis
My ppt on gis
 
What is GIS
What is GISWhat is GIS
What is GIS
 
Integrating Virtual Environment and GIS for 3D Virtual City.ppt
Integrating Virtual Environment and GIS for 3D Virtual City.pptIntegrating Virtual Environment and GIS for 3D Virtual City.ppt
Integrating Virtual Environment and GIS for 3D Virtual City.ppt
 
Esri Health GIS Conference
Esri Health GIS ConferenceEsri Health GIS Conference
Esri Health GIS Conference
 
Applications of GIS in Public Health Engineering
Applications of GIS in Public Health EngineeringApplications of GIS in Public Health Engineering
Applications of GIS in Public Health Engineering
 
Gis dr rahul
Gis dr rahulGis dr rahul
Gis dr rahul
 
Open source health gis presentation final
Open source health gis  presentation finalOpen source health gis  presentation final
Open source health gis presentation final
 
Spatial data analysis 2
Spatial data analysis 2Spatial data analysis 2
Spatial data analysis 2
 
Geographic data in public health: Lessons from the field
Geographic data in public health: Lessons from the fieldGeographic data in public health: Lessons from the field
Geographic data in public health: Lessons from the field
 
Nepal Earthquake 2015 ICIMOD’s focus on reconstruction
Nepal Earthquake 2015 ICIMOD’s focus on reconstructionNepal Earthquake 2015 ICIMOD’s focus on reconstruction
Nepal Earthquake 2015 ICIMOD’s focus on reconstruction
 
Virginia Gis Health Presentation
Virginia Gis Health PresentationVirginia Gis Health Presentation
Virginia Gis Health Presentation
 
GIS Day 2015 - New Light Technologies, Inc.
GIS Day 2015 - New Light Technologies, Inc.GIS Day 2015 - New Light Technologies, Inc.
GIS Day 2015 - New Light Technologies, Inc.
 
Mosquera assign ch1_1_1
Mosquera assign ch1_1_1Mosquera assign ch1_1_1
Mosquera assign ch1_1_1
 
Geography and Health
Geography and HealthGeography and Health
Geography and Health
 
ABSTRACT Public Health GIS- DST NATIONAL PROJECT
ABSTRACT Public Health GIS- DST NATIONAL PROJECTABSTRACT Public Health GIS- DST NATIONAL PROJECT
ABSTRACT Public Health GIS- DST NATIONAL PROJECT
 

Similar to GIS in Public Health Research: Understanding Spatial Analysis and Interpreting Outcomes 1-31-14

Spatial analysis & interpolation in ARC GIS
Spatial analysis & interpolation in ARC GISSpatial analysis & interpolation in ARC GIS
Spatial analysis & interpolation in ARC GISKU Leuven
 
Building maps with analysis
Building maps with analysisBuilding maps with analysis
Building maps with analysisLindaBeale
 
Sa Presentation 20070917111 Thomas
Sa Presentation 20070917111 ThomasSa Presentation 20070917111 Thomas
Sa Presentation 20070917111 Thomasnspiropo
 
SPATIAL POINT PATTERNS
SPATIAL POINT PATTERNSSPATIAL POINT PATTERNS
SPATIAL POINT PATTERNSLiemNguyenDuy
 
report2.doc
report2.docreport2.doc
report2.docbutest
 
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RFinding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RRevolution Analytics
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersAlbert Y. C. Chen
 
ODSC India 2018: Topological space creation &amp; Clustering at BigData scale
ODSC India 2018: Topological space creation &amp; Clustering at BigData scaleODSC India 2018: Topological space creation &amp; Clustering at BigData scale
ODSC India 2018: Topological space creation &amp; Clustering at BigData scaleKuldeep Jiwani
 
Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis Baivab Nag
 
DMTM 2015 - 06 Introduction to Clustering
DMTM 2015 - 06 Introduction to ClusteringDMTM 2015 - 06 Introduction to Clustering
DMTM 2015 - 06 Introduction to ClusteringPier Luca Lanzi
 
Lect4 principal component analysis-I
Lect4 principal component analysis-ILect4 principal component analysis-I
Lect4 principal component analysis-Ihktripathy
 
Spatial Data Mining : Seminar
Spatial Data Mining : SeminarSpatial Data Mining : Seminar
Spatial Data Mining : SeminarIpsit Dash
 
DMTM Lecture 11 Clustering
DMTM Lecture 11 ClusteringDMTM Lecture 11 Clustering
DMTM Lecture 11 ClusteringPier Luca Lanzi
 
Developing a Tutorial for Grouping Analysis in ArcGIS
Developing a Tutorial for Grouping Analysis in ArcGISDeveloping a Tutorial for Grouping Analysis in ArcGIS
Developing a Tutorial for Grouping Analysis in ArcGISCOGS Presentations
 
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringAbility Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringKamleshKumar394
 
Spatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSpatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSwapnil Shrivastav
 
Updating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling MethodologiesUpdating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling MethodologiesTown Peterson
 

Similar to GIS in Public Health Research: Understanding Spatial Analysis and Interpreting Outcomes 1-31-14 (20)

Spatial analysis & interpolation in ARC GIS
Spatial analysis & interpolation in ARC GISSpatial analysis & interpolation in ARC GIS
Spatial analysis & interpolation in ARC GIS
 
Building maps with analysis
Building maps with analysisBuilding maps with analysis
Building maps with analysis
 
Sa Presentation 20070917111 Thomas
Sa Presentation 20070917111 ThomasSa Presentation 20070917111 Thomas
Sa Presentation 20070917111 Thomas
 
GEOSTATISTICAL_ANALYST
GEOSTATISTICAL_ANALYSTGEOSTATISTICAL_ANALYST
GEOSTATISTICAL_ANALYST
 
SPATIAL POINT PATTERNS
SPATIAL POINT PATTERNSSPATIAL POINT PATTERNS
SPATIAL POINT PATTERNS
 
report2.doc
report2.docreport2.doc
report2.doc
 
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in RFinding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
Finding Meaning in Points, Areas and Surfaces: Spatial Analysis in R
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional Managers
 
ODSC India 2018: Topological space creation &amp; Clustering at BigData scale
ODSC India 2018: Topological space creation &amp; Clustering at BigData scaleODSC India 2018: Topological space creation &amp; Clustering at BigData scale
ODSC India 2018: Topological space creation &amp; Clustering at BigData scale
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Data analysis05 clustering
Data analysis05 clusteringData analysis05 clustering
Data analysis05 clustering
 
Cluster Analysis
Cluster Analysis Cluster Analysis
Cluster Analysis
 
DMTM 2015 - 06 Introduction to Clustering
DMTM 2015 - 06 Introduction to ClusteringDMTM 2015 - 06 Introduction to Clustering
DMTM 2015 - 06 Introduction to Clustering
 
Lect4 principal component analysis-I
Lect4 principal component analysis-ILect4 principal component analysis-I
Lect4 principal component analysis-I
 
Spatial Data Mining : Seminar
Spatial Data Mining : SeminarSpatial Data Mining : Seminar
Spatial Data Mining : Seminar
 
DMTM Lecture 11 Clustering
DMTM Lecture 11 ClusteringDMTM Lecture 11 Clustering
DMTM Lecture 11 Clustering
 
Developing a Tutorial for Grouping Analysis in ArcGIS
Developing a Tutorial for Grouping Analysis in ArcGISDeveloping a Tutorial for Grouping Analysis in ArcGIS
Developing a Tutorial for Grouping Analysis in ArcGIS
 
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringAbility Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
 
Spatial analysis and Analysis Tools
Spatial analysis and Analysis ToolsSpatial analysis and Analysis Tools
Spatial analysis and Analysis Tools
 
Updating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling MethodologiesUpdating Ecological Niche Modeling Methodologies
Updating Ecological Niche Modeling Methodologies
 

Recently uploaded

VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...Miss joya
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...narwatsonia7
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Modelssonalikaur4
 
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy GirlsCall Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy Girlsnehamumbai
 
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...narwatsonia7
 
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service BangaloreCall Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalorenarwatsonia7
 
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment BookingHousewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Bookingnarwatsonia7
 
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowRiya Pathan
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...Miss joya
 
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service MumbaiLow Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbaisonalikaur4
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Serviceparulsinha
 
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort ServiceCollege Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort ServiceNehru place Escorts
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...narwatsonia7
 
Call Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service SuratCall Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service Suratnarwatsonia7
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableNehru place Escorts
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 

Recently uploaded (20)

VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
VIP Call Girls Pune Vrinda 9907093804 Short 1500 Night 6000 Best call girls S...
 
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
Call Girls Service in Bommanahalli - 7001305949 with real photos and phone nu...
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
 
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy GirlsCall Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
Call Girls In Andheri East Call 9920874524 Book Hot And Sexy Girls
 
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Kanakapura Road Just Call 7001305949 Top Class Call Girl Service A...
 
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service BangaloreCall Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
Call Girl Bangalore Nandini 7001305949 Independent Escort Service Bangalore
 
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment BookingHousewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
 
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hsr Layout Just Call 7001305949 Top Class Call Girl Service Available
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
 
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
College Call Girls Pune Mira 9907093804 Short 1500 Night 6000 Best call girls...
 
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service MumbaiLow Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
Low Rate Call Girls Mumbai Suman 9910780858 Independent Escort Service Mumbai
 
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Whitefield Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Hebbal Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
 
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort ServiceCollege Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
College Call Girls Vyasarpadi Whatsapp 7001305949 Independent Escort Service
 
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Majestic 📞 9907093804 High Profile Service 100% Safe
 
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
Russian Call Girl Brookfield - 7001305949 Escorts Service 50% Off with Cash O...
 
Call Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service SuratCall Girl Surat Madhuri 7001305949 Independent Escort Service Surat
Call Girl Surat Madhuri 7001305949 Independent Escort Service Surat
 
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls AvailableVip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
Vip Call Girls Anna Salai Chennai 👉 8250192130 ❣️💯 Top Class Girls Available
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
 

GIS in Public Health Research: Understanding Spatial Analysis and Interpreting Outcomes 1-31-14

  • 1. GIS in Public Health Research: Understanding Spatial Analysis & Interpreting Outcomes Kristin Osiecki PhD
  • 2.
  • 3. Houston Aerosol Characterization & Health Experiment (HACHE)
  • 4. • UT Health Science Center School of Biomedical Informatics • University of Houston Department of Earth and Atmospheric Sciences • Rice University Department of Sociology and Department of Civil & Environmental Engineering
  • 5.
  • 6.
  • 7. Applications in Public Health Research • Space matters – communities,census tracts, counties, states • Multidisciplinary and Interdisciplinary • Collaborative • Simple and Complex Models
  • 8. What research questions are we trying to answer? • Do we need visualizations or maps? OR • Are we interested in investigating possible spatial relationships within the data?
  • 10. Objectives • • • • Traditional Statistics & Spatial Analysis Permutations Spatial Weights EDA & ESDA
  • 11. "Spatial Statistics" does not mean applying traditional (non-spatial) statistical methods to data that just happens to be spatial (has X and Y coordinates). Source: ESRI http://resources.esri.com/help/9.3/arcgisen gine/java/gp_toolref/spatial_statistics_tools /how_generate_spatial_weights_matrix_spa tial_statistics_works.htm
  • 13.
  • 14. Global & Local Global Model EDA ESDA Global autocorrelation Local autocorrelation Local Model
  • 15. The most crucial step in the process
  • 16. Exploring the Data: EDA & ESDA
  • 17.
  • 18.
  • 20.
  • 21.
  • 22. Exploratory Spatial Data Analysis • Interactively visualize and explore data where space matter • Detect patterns • Hypothesis generation • spatial modeling is needed to test hypotheses • Works on point feature and polygon features (i.e. census, epidemiology, demographic layers)
  • 23. What is Spatial Randomness? • Observed spatial pattern of value is equally as likely as any other spatial pattern • Value at one location does not depend on values at neighboring locations under spatial randomness, the location of values may be altered without affecting the information content of the data • random permutation or reshuffling of values Dr. Luc Anselin 2012
  • 24. Spatial Randomness • Spatial Randomness Null Hypothesis – Spatial randomness is absence in any pattern – If rejected, evidence of spatial structure Dr. Luc Anselin 2012
  • 25. ArcGIS Spatial Autocorrelation • The Randomization Null Hypothesis: Where appropriate, the tools in the Spatial Statistics toolbox use the randomization null hypothesis as the basis for statistical significance testing. The randomization null hypothesis postulates that the observed spatial pattern of your data represents one of many (n!) possible spatial arrangements. If you could pick up your data values and throw them down onto the features in your study area, you would have one possible spatial arrangement of those values. (Note that picking up your data values and throwing them down arbitrarily is an example of a random spatial process). The randomization null hypothesis states that if you could do this exercise (pick them up, throw them down) infinite times, most of the time you would produce a pattern that would not be markedly different from the observed pattern (your real data). Once in a while you might accidentally throw all the highest values into the same corner of your study area, but the probability of doing that is small. The randomization null hypothesis states that your data is one of many, many, many possible versions of complete spatial randomness. The data values are fixed; only their spatial arrangement could vary. http://resources.arcgis.com/en/help/main/10. 1/index.html#//005p00000006000000
  • 26. Permutations • A numerical approach to testing for statistical significance (in contrast to analytical approaches) • It is data-driven and makes no assumptions (such as normality) about the data
  • 27. Permutations in Geoda • Permutation inference is shuffling values around and re-computing statistics each time with a different set of random numbers to construct a reference distribution. • Permutations are used to determine how likely it would be to observe the Moran’s I value of an actual distribution under conditions of spatial randomness. • P-values are dependent on the number of permutations so they are “pseudo p-values”
  • 29. Spatial Weights The first step in the analysis of spatial autocorrelation is to construct a spatial weights file that contains information on the “neighborhood” structure for each location (luc anselin)
  • 30. Generation of Spatial Weights ESRI • For binary strategies (fixed distance, K nearest neighbors, or contiguity) a feature is either a neighbor (1) or it is not (0). • For weighted strategies (inverse distance or zone of indifference) neighboring features have a varying amount of impact (or influence) and weights are computed to reflect that variation.
  • 31. Row Standardization • Adjusts the weights in a spatial weights matrix • Each weight is divided by its row sum • The row sum is the sum of weights for a feature’s neighbors. • A weights matrix is row-standardized when the values of each of its rows sum to one.
  • 32. Binary vs. row-standardized • A binary weights matrix looks like: 0 1 0 0 0 0 1 1 1 1 0 0 0 1 1 1 • A row-standardized matrix it looks like: 0 1 0 0 0 0 .5 .5 .5 .5 0 0 0 .33 .33 .33
  • 33. Spatial Weights • Formal expression of locational similarity
  • 34. Distance Models • Inverse distance – all features influence all other features, but the closer something is, the more influence it has • Distance band – features outside a specified distance do not influence the features within the area • Zone of indifference – combines inverse distance and distance band
  • 35. Inverse Distance (impedance) (ArcGIS) • features impact/influence all other features – farther away something is, the smaller the impact • specify a Distance Band/Threshold Distance value to reduce the number of required computations – especially with large datasets. – If not specified, a default threshold value is computed for you • Choosing an appropriate distance is important – Some spatial statistics require each feature to have at least one neighbor for the analysis to be reliable.
  • 36. Distance band (sphere of influence) • impose a sphere of influence, or moving window conceptual model of spatial interactions onto the data • Neighbors within the specified distance are weighted equally. Features outside have no influence (weight = 0) • Evaluate the statistical properties of your data at a particular (fixed) spatial scale • have at least one neighbor, or results will not be valid • if the input data is skewed make sure that your distance band is neither too small (only one or two neighbors) nor too large (include all other features as neighbors) – resultant z-scores less reliable.
  • 37. Adjacency Models • K Nearest Neighbors – a specified number of neighboring features are included in calculations • Polygon Contiguity – polygons that share an edge or node influence each other
  • 38. K-nearest neighbors • each feature assessed in the spatial context of a specified number of its closest neighbors. If K (t is 8, then eight closest neighbors to the target feature will be included If feature density is high spatial context of the analysis will be smaller. • If feature density is sparse, the spatial context for the analysis will be larger. • method is available using the Generate Spatial Weights Matrix tool
  • 39. Polygon contiguity (first order) • polygons that share an edge (that have coincident boundaries) are included in computations for the target polygon • modeling some type of contagious process or are dealing with continuous data represented as polygons.
  • 40. Binary Contiguity Weights • contiguity = common border • i and j share a border, then wij = 1 • i and j are not neighbors, then wij = 0 • weights are 0 or 1, hence binary Distance-Based Weights • distance between points • distance between polygon centroids or central points • distance-band weights: wij nonzero for dij < d less than a critical distance d • k-nearest neighbor weights: same number of neighbors for all observations potential problems with ties
  • 41. Global vs. Local Statistics • Global statistics (Clustering) – identify and measure the pattern of the entire study area – Do not indicate where specific patterns occur • Local Statistics (Clusters) – identify variation across the study area, focusing on individual features and their relationships to nearby features (i.e. specific areas of clustering)
  • 42. Spatial Autocorrelation (Moran’s I) • Global statistic • Measures whether the pattern of feature values is clustered, dispersed, or random. • Compares the difference between the mean of the target feature and the mean for all features to the difference between the mean for each neighbor and the mean for all features. Mean of Target Feature Mean of each neighbor Mean of all features
  • 43. Z-Score & P-value (ArcGIS) • Very high or very low (negative) z-scores, associated with very small p-values, are found in the tails of the normal distribution • it is unlikely that the observed spatial pattern reflects the theoretical random pattern represented by your null hypothesis (CSR) • The null hypothesis for the pattern analysis tools is Complete Spatial Randomness (CSR), either of the features themselves or of the values associated with those features. http://resources.arcgis.com/en/help/main/10. 1/index.html#//005p00000006000000
  • 44. Pseudo P-Value • significance levels are dependent on the number of permutations • One-sided significance test • For instance, if an observed Moran's I value is higher than any of the randomly generated Moran's I values, the pseudo p-value would be 1/100=0.01 for 99 permutations or 1/1,000=0.001 for 999 permutations
  • 45. Spatial Autocorrelation (Moran’s I) Polygon Contiguity (first order)
  • 46. Spatial Autocorrelation (Moran’s I) Polygon Contiguity (first order) Percent Black Population, Cook County, IL
  • 47. Generate Spatial Weights Matrix K-Nearest Neighbor
  • 48. Spatial Autocorrelation (Moran’s I) K-Nearest Neighbor Percent Black Population, Cook County, IL
  • 49. Spatial Autocorrelation (Moran’s I) K-Nearest Neighbor Percent Black Population, Cook County, IL
  • 50. Spatial Autocorrelation (Getis –Ord General G High/Low Clustering) Polygon Contiguity Percent Black Population, Cook County, IL If the z-score value is positive, the observed General G index is larger than the expected General G index, indicating high values for the attribute are clustered in the study area
  • 51. Geoda Spatial Autocorrelation (Moran’s I) Percent Black Population, Cook County, IL
  • 52. Geoda Spatial Autocorrelation (Moran’s I) Queen Contiguity Weight (1st order) Percent Black Population, Cook County, IL
  • 53. Geoda Spatial Autocorrelation (Moran’s I) K-Nearest Neighbor (eight) Percent Black Population, Cook County, IL
  • 54. Geoda Spatial Autocorrelation (Moran’s I) K-Nearest Neighbor (four) Percent Black Population, Cook County, IL
  • 55. Anselin Local Moran’s I • Local statistic • Measures the strength of patterns for each specific feature. • Compares the value of each feature in a pair to the mean value for all features in the study area.
  • 56. Anselin Local Moran’s I • Positive I value: – Feature is surrounded by features with similar values, either high or low. – Feature is part of a cluster. – Statistically significant clusters can consist of high values (HH) or low values (LL) • Negative I value: – Feature is surrounded by features with dissimilar values. – Feature is an outlier. – Statistically significant outliers can be a feature with a high value surrounded by features with low values (HL) or a feature with a low value surrounded by features with high values (LH).
  • 57. Anselin Local Moran’s I • The z- scores and p-values are measures of statistical significance which tell you whether or not to reject the null hypothesis, feature by feature. • Indicate whether the apparent similarity (or dissimilarity) in values for a feature and its neighbors is greater than one would expect in a random distribution. http://resources.esri.com/help/9.3/arcgisengine/java/gp_toolref/spatial_statistics_tools/clu ster_and_outlier_analysis_colon_anselin_local_moran_s_i_spatial_statistics_.htm
  • 58. index z-score p-value Anselin’s Local Moran’s I Polygon Contiguity Weight Percent Black Population Cook County, IL HH LH
  • 59. Geoda Univariate LISA Queen Contiguity Weight Percent Black Population, Cook County, IL p-values 499 Permutations p-values 999 Permutations
  • 60. Geoda Univariate LISA Queen Contiguity Weight Percent Black Population, Cook County, IL HH HL 999 Permutations
  • 61. Comparison ArcGIS & Geoda Results Queen Contiguity Weight Percent Black Population, Cook County, IL p-values
  • 62. Comparison ArcGIS & Geoda Univariate LISA Queen Contiguity Weight Percent Black Population, Cook County, IL HH HL HH HL 999 Permutations
  • 63. Bivariate LISA Scatterplot High - High Low-Low High - Low Non-point Source Cancer Risk Low-High Percent Poverty Chow test for selected/unselected regression subsets distribution F(2,1339) ratio=214.6 p-value=0 INTERCEPT # of R^2 Constant Observations Std t-statistic SLOPE p-value Slope Error Std t-statistic p-value Error 1343 0.209 0.00442 0.0176 0.251 0.802 0.332 0.0176 18.8 0 80 0.1116 1.58 0.0797 19.8 0 0.045 0.0475 0.957 0.342 1263 0.118 -0.0794 0.0161 -4.92 0 0.223 0.0172 13 0