Developing a Tutorial for
Grouping Analysis in ArcGIS
Daniel Pierre
May 29, 2014
1. Introduction
2. Data
3. Grouping Analysis Workflows
4. Tutorial Exercises
5. Conclusions: Recommendations
Presentation Outline
Lauren Rosenshein Bennett, MS
Geoprocessing Product Engineer, Esri
Lbennett@esri.com
Dr. Konrad Dramowicz
Faculty, Centre of Geographic Sciences
Konrad.Dramowicz@nscc.ca
Dr. Ela Dramowicz
Faculty, Centre of Geographic Sciences
Ela.Dramowicz@nscc.ca
Introduction
Project Sponsor & Supervisors
Introduction
• Experimental testing of tool with
multiple datasets
• Incorporation of Grouping
Analysis with other tools
• Review of technical literature
on clustering algorithms
• Review of existing tutorials
Project Overview
Introduction
• Introduced at ArcGIS 10.1
• Available with Basic, Standard and
Advanced license levels
• Found in the Spatial Statistics
toolbox, within the Mapping
Clusters toolset
• Script tool
Grouping Analysis Tool
Introduction
• “...Performs a classification
procedure that tries to find natural
clusters in your data.” - Esri
• An aid for data comprehension
• Feature similarity is based on
attributes specified as analysis fields
and optionally, spatial constraints
• Given a number of groups, features
within each output group are as
similar as possible while groups are
as different as possible
Grouping Analysis Tool
Introduction
• Two algorithm types: cluster
analysis (traditional K-means) and
regionalization (spatial K-means)
• Thirteen parameters
(six required)
• Grouping results contingent on
the number of groups, analysis
fields, and type of spatial
constraint
Grouping Analysis Tool
Data
Features:
• Esri
• City of Vancouver
Multivariate Data:
• World Bank
• BBC
• Weatherbase
• Statistics Canada
Data Sources
Data
• Data Enrichment (ArcGIS Online)
• HTML table import
• Spreadsheet reformatting
• Table joins
• Feature class edits
Data Preparation
Data
Selection Criteria:
• Two scales of analysis
• Illustration of various spatial
constraint effects on results
• Sufficient number of features
• Visible spatial patterns in results
Tutorial Datasets
General Steps:
• Exploratory data analysis
• Preprocessing
• Determining appropriate Grouping
Analysis settings
• Postprocessing, interpretation
and evaluation of results
Grouping Analysis Workflows
Exploratory Data Analysis
1. Distribution of variable values
• Thematic mapping
• Spatial autocorrelation
2. Spatial relationships among
features
• Contiguity of features and number
of neighbours
• Spatial autocorrelation
Exploratory Data Analysis
Exploratory Data Analysis
• Explore distribution of dataset
variables
• Choropleth maps and graduated
symbol maps
• Identify set of variables to be used
for Grouping Analysis
Thematic Mapping
Exploratory Data Analysis
• Analyze contiguity relationships
among features
• Polygon Neighbors tool
• Determine relative connectivity of
features by counting number
of neighbours
• Frequency tool
Spatial Relationships
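The counting step on this slide can be sketched in plain Python. This is only an illustration: the adjacency pairs below are hypothetical stand-ins for rows of a Polygon Neighbors output table, and the function names are not part of ArcGIS.

```python
from collections import Counter

# Hypothetical adjacency pairs (source feature, neighbouring feature),
# standing in for the rows of a Polygon Neighbors output table.
neighbour_pairs = [
    ("A", "B"), ("B", "A"),
    ("B", "C"), ("C", "B"),
    ("B", "D"), ("D", "B"),
    ("C", "D"), ("D", "C"),
]

def count_neighbours(pairs):
    """Return {feature: number of distinct neighbours}."""
    neighbours = {}
    for src, nbr in pairs:
        neighbours.setdefault(src, set()).add(nbr)
    return {feature: len(nbrs) for feature, nbrs in neighbours.items()}

def connectivity_frequency(counts):
    """Return {neighbour count: number of features with that count}."""
    return dict(Counter(counts.values()))

counts = count_neighbours(neighbour_pairs)
frequency = connectivity_frequency(counts)
```

Features with no neighbours never appear in the pair list, so they would have to be added with a count of zero before reading the frequency table.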
Exploratory Data Analysis
• Analyze contiguity and/or proximity
relationships among features using
GeoDa
• Create spatial weights
• Display histogram of feature
connectivity according to
defined spatial relationships
• Histogram linked to map and
attribute table
Alternative Approach
Exploratory Data Analysis
• Considers attribute values and
location of features simultaneously
• Moran’s I statistic determines
whether spatial pattern of values is
dispersed, random or clustered
• Significance of pattern evaluated
with corresponding z-score
• One variable at a time
Spatial Autocorrelation
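As a rough illustration of the statistic, the standard global Moran's I formula can be written in a few lines of plain Python. The weights matrix and values below are made up for the example, and the z-score step mentioned on the slide is not included.

```python
def morans_i(values, weights):
    """Global Moran's I for one variable.

    values  -- list of attribute values, one per feature
    weights -- square matrix (list of lists); weights[i][j] > 0 when
               features i and j are treated as neighbours
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in weights)          # sum of all weights
    cross = sum(weights[i][j] * dev[i] * dev[j]
                for i in range(n) for j in range(n))
    total = sum(d * d for d in dev)
    return (n / s0) * (cross / total)

# Hypothetical example: four features in a row, each adjacent to the next.
chain_weights = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
clustered = morans_i([1, 2, 3, 4], chain_weights)   # similar values adjacent
dispersed = morans_i([1, 4, 1, 4], chain_weights)   # dissimilar values adjacent
```

Positive values point toward clustering and negative values toward dispersion; whether either is significant still depends on the z-score.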
Preprocessing
Use hot spots to limit study
area for Grouping Analysis:
• Calculate incremental spatial
autocorrelation
• Identify distance band of most
intense clustering
• Create hot spot map
• Select features from original
dataset based on location
of hot spots
Preprocessing
Grouping Analysis Settings
1. How many groups should be created?
2. Which analysis fields should be used?
3. Is a spatial constraint necessary?
If so, which type is appropriate?
Grouping Analysis Settings:
Key Considerations
Grouping Analysis Settings
• Default number is 2
• Sturges' rule:
C = 1 + 3.3 log10(n), where
C is the number of groups and
n is the number of features
• Evaluate the optimal number of groups
(up to a maximum of 15)
Number of Groups
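The rule of thumb above is easy to try out. The sketch below assumes a base-10 logarithm; the helper that rounds the result and clamps it to the range 2 to 15 is an illustrative assumption, not something the tool does.

```python
import math

def sturges_groups(n_features):
    """Sturges' rule: C = 1 + 3.3 * log10(n)."""
    if n_features < 1:
        raise ValueError("n_features must be at least 1")
    return 1 + 3.3 * math.log10(n_features)

def suggest_group_count(n_features, lower=2, upper=15):
    """Round the Sturges estimate and clamp it (clamping is an assumption)."""
    estimate = round(sturges_groups(n_features))
    return max(lower, min(upper, estimate))

raw_estimate = sturges_groups(100)        # 1 + 3.3 * 2
suggested = suggest_group_count(100)
```

The result is a starting point only; the optimal number of groups still needs to be evaluated against the data.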
Grouping Analysis Settings
Two vs. Three Groups
Grouping Analysis Settings
• Generally driven by research purpose
and objectives of grouping
• Guide selection of analysis fields with
exploratory data analysis findings
• Spatial variables may be used as
indirect spatial constraints
• Assess effectiveness of fields to
distinguish features with output report
Analysis Fields
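One way to see what a per-field R² expresses is to compute it by hand. The sketch below assumes the usual definition, the share of a variable's total sum of squares that remains after subtracting within-group variation; the sample values and group labels are hypothetical.

```python
def r_squared(values, groups):
    """(total SS - within-group SS) / total SS for one analysis field."""
    overall_mean = sum(values) / len(values)
    total_ss = sum((v - overall_mean) ** 2 for v in values)
    if total_ss == 0:
        raise ValueError("field has no variation")

    members = {}
    for value, group in zip(values, groups):
        members.setdefault(group, []).append(value)

    within_ss = 0.0
    for vals in members.values():
        group_mean = sum(vals) / len(vals)
        within_ss += sum((v - group_mean) ** 2 for v in vals)

    return (total_ss - within_ss) / total_ss

group_ids = [1, 1, 2, 2]
useful_field = r_squared([1, 2, 9, 10], group_ids)   # groups separate the values
weak_field = r_squared([1, 9, 2, 10], group_ids)     # groups mix the values
```

A field with a value near 1 distinguishes the groups well; a field near 0 contributes little and is a candidate for removal.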
Grouping Analysis Settings
Temperature: Spatial Variable
Grouping Analysis Settings
• Choice of spatial constraint or no
spatial constraint determines which
algorithm is used for grouping
• No spatial constraint – traditional
K-Means (data space only)
• Any spatial constraint – Spatial ‘K’luster
Analysis by Tree Edge Removal (SKATER)
method (spatial K-Means)
Spatial Constraints
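For readers unfamiliar with the unconstrained case, a minimal K-means loop in data space might look like the sketch below. It is a generic textbook version with caller-supplied starting centres, not the tool's implementation, and the sample points are invented.

```python
import math

def k_means(points, seeds, max_iter=100):
    """Plain K-means: assign each point to its nearest centre, then
    recompute centres, until assignments stop changing."""
    centers = [tuple(s) for s in seeds]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(len(centers)), key=lambda k: math.dist(p, centers[k]))
            for p in points
        ]
        if new_labels == labels:
            break
        labels = new_labels
        for k in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old centre if a group empties out
                centers[k] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centers

# Hypothetical attribute vectors (two analysis fields per feature).
sample_points = [(1, 1), (1.5, 2), (1, 0.5), (8, 8), (9, 9.5), (8.5, 9)]
labels, centers = k_means(sample_points, seeds=[(1, 1), (8, 8)])
```

Nothing in this loop looks at feature location, which is why groups formed without a spatial constraint can be scattered across the map.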
Grouping Analysis Settings
No Spatial Constraint vs.
Spatial Constraint
Grouping Analysis Settings
• Contiguity – edges only (“rook” type) or
edges and corners (“queen” type)
• Delaunay triangulation – contiguity of
representations of features as Voronoi
polygons
• Proximity – K nearest neighbours
• Spatial weights
Spatial Constraint Types
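The difference between the two contiguity types is easiest to see on a regular grid. The sketch below uses grid cells as hypothetical stand-ins for polygons; real polygon contiguity is determined from shared boundaries and vertices.

```python
def grid_neighbours(row, col, n_rows, n_cols, contiguity="rook"):
    """Return the neighbouring cells of (row, col) on an n_rows x n_cols grid."""
    if contiguity == "rook":        # shared edges only
        offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    elif contiguity == "queen":     # shared edges and corners
        offsets = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)
                   if (dr, dc) != (0, 0)]
    else:
        raise ValueError("contiguity must be 'rook' or 'queen'")
    return [(row + dr, col + dc) for dr, dc in offsets
            if 0 <= row + dr < n_rows and 0 <= col + dc < n_cols]

centre_rook = grid_neighbours(1, 1, 3, 3, "rook")
centre_queen = grid_neighbours(1, 1, 3, 3, "queen")
corner_rook = grid_neighbours(0, 0, 3, 3, "rook")
corner_queen = grid_neighbours(0, 0, 3, 3, "queen")
```

Queen contiguity always yields at least as many neighbours as rook, so it tends to give the grouping more freedom.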
Grouping Analysis Settings
• Evaluate optimal number of groups
• Guide selection of analysis fields with
calculated R² values
• Visually assess results of specified
spatial constraint
Iterative Process for Optimizing
Grouping Analysis
Interpretation & Evaluation
• Spatial distribution of groups (map)
• Global statistics (output report)
• Group and variable statistics
(output report)
• Group profiles
Interpretation of Results
Interpretation & Evaluation
• Compare group means with each
other and global range
Group Profiles
Interpretation & Evaluation
• Compare group means and ranges
for each variable
Group Profiles (2)
Interpretation & Evaluation
• Consider global mean, median and
range for each variable
Group Profiles (3)
Interpretation & Evaluation
• Global Moran’s I statistic
• Determine spatial pattern of group
membership
• Measure spatial compactness of
group membership
• Clustered groups generally desired
Evaluation of Results:
Spatial Autocorrelation
Pattern types illustrated: dispersed, clustered, random
Interpretation & Evaluation
• Smallest to largest group
• Indicator of balance in group
membership
• Balanced number of group
members generally desired for
comparison of statistics
• Frequency tool
Evaluation of Results:
Cluster Size Ratio
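A small sketch of the ratio, assuming it is taken as the size of the smallest group divided by the size of the largest. The group ID lists are hypothetical stand-ins for the group field of the output feature class.

```python
from collections import Counter

def cluster_size_ratio(group_ids):
    """Smallest group size divided by largest group size (1.0 = balanced)."""
    sizes = Counter(group_ids)
    return min(sizes.values()) / max(sizes.values())

unbalanced = cluster_size_ratio([1, 1, 1, 1, 2, 2, 3, 3])   # sizes 4, 2, 2
balanced = cluster_size_ratio([1, 1, 2, 2])                  # sizes 2, 2
```

Values close to 1 indicate evenly sized groups; values close to 0 indicate that one group dominates.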
Interpretation & Evaluation
• Goodness measure that combines
concepts of cohesion and separation
• Adapted from cluster analysis to
consider attribute data and location
• Silhouette coefficient is calculated
for every feature and the average is
taken for the entire dataset
Evaluation of Results:
Silhouette
Interpretation & Evaluation
(B – A) / max(A, B) where
A is the distance between a
feature and its group center
B is the distance between the
feature and its neighbouring
group center
Silhouette Coefficient
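The formula above can be sketched as follows. The code follows the centre-based definition given on the slide rather than the classic silhouette, which uses mean distances to other features, and it reads "neighbouring group" as the nearest other group centre, which is an assumption. Points are hypothetical tuples of (standardized) values.

```python
import math

def centroid(points):
    """Mean position of a list of equal-length tuples."""
    return tuple(sum(axis) / len(points) for axis in zip(*points))

def simplified_silhouette(points, groups):
    """Average of (B - A) / max(A, B) over all features, where
    A = distance to the feature's own group centre and
    B = distance to the nearest other group centre.
    Requires at least two groups."""
    members = {}
    for point, group in zip(points, groups):
        members.setdefault(group, []).append(point)
    centers = {g: centroid(pts) for g, pts in members.items()}

    scores = []
    for point, group in zip(points, groups):
        a = math.dist(point, centers[group])
        b = min(math.dist(point, c)
                for other, c in centers.items() if other != group)
        largest = max(a, b)
        scores.append(0.0 if largest == 0 else (b - a) / largest)
    return sum(scores) / len(scores)

sample = [(0, 0), (0, 1), (10, 0), (10, 1)]
good_partition = simplified_silhouette(sample, [1, 1, 2, 2])
poor_partition = simplified_silhouette(sample, [1, 2, 1, 2])
```

Grouping the two nearby pairs together scores close to 1, while splitting each pair across groups scores close to 0.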
Interpretation & Evaluation
• Range between –1 (poor)
and 1 (excellent)
• < 0.2 indicates poor clustering
• > 0.5 indicates good partition
of the data
Silhouette Coefficient Values
Tutorial Exercises
• Six exercises
• Two scenarios (3 exercises for each)
• Suitable for users at all levels of
experience
• Exercises take the user through the
steps of preprocessing, group
creation, interpretation and
evaluation of results outlined here
Grouping Analysis Tutorial
Tutorial Exercises
Exercises:
1. Data exploration
2. Grouping for exploratory data
analysis
3. Using Spatial Statistics tools to
target areas of interest
Scenario 1:
Analysis of Crime in Chicago
Tutorial Exercises
Exercises:
4. Create groups and use results to
write profiles
5. Explore effects of spatial
constraints
6. Evaluation of results
Scenario 2:
Analysis of Olympic Results
Tutorial Exercises
1. All tutorial exercises use polygon
data exclusively; point features not
covered
2. Space-time constraints using
spatial weights matrix file not
covered
3. Aimed at general users; no
exercises specifically target
advanced users
Limitations
Recommendations
1. Exploratory data analysis
2. Grouping Analysis
3. Evaluation of results
Recommendations:
Enhancements and Additional Tools
Recommendations
• Multi-step process using Polygon
Neighbors, Frequency and table
joins could be simplified
• Dynamic linking of objects
can make use of existing
ArcGIS functionality
Determining Spatial Relationships
Among Features
Recommendations
• Expand types of spatial
relationships that can be analyzed
• Enable the analysis of higher order
relationships
Determining Spatial Relationships
Among Features (continued)
Recommendations
• Tools for determining most useful
diagnostic or predictor variables
• Guide selection of analysis fields for
data partitioning
• Adapt neural networks or other
data mining tools to work with
spatial constraints
Identification of Useful
Diagnostic Variables
Recommendations
Grouping Analysis Tool
Enhancements
• Create unique identifier
• Replace null values
Recommendations
• Spatial weights matrix can be
used as the spatial constraint
for creating groups
• Custom weights require
either manual table creation
or programming
• Solution: interactive feature
selection
User-defined spatial relationships
among features
Recommendations
• Expand beyond R² and F-statistic
values in output report
• Adapt methods used to evaluate
cluster analysis algorithms
(e.g. Silhouette)
• Challenge: universally applicable
evaluation methods may not be
feasible
Evaluation of Results
THANK YOU!

  • 4. Introduction • Experimental testing of tool with multiple datasets • Incorporation of Grouping Analysis with other tools • Review of technical literature on clustering algorithms • Review of existing tutorials Project Overview
  • 5. Introduction • Introduced at ArcGIS 10.1 • Available with Basic, Standard and Advanced license levels • Found in the Spatial Statistics toolbox, within the Mapping Clusters toolset • Script tool Grouping Analysis Tool
  • 6. Introduction • “...Performs a classification procedure that tries to find natural clusters in your data.” - Esri • An aid for data comprehension • Feature similarity is based on attributes specified as analysis fields and optionally, spatial constraints • Given a number of groups, features within each output group are as similar as possible while groups are as different as possible Grouping Analysis Tool
  • 7. Introduction • Two algorithm types: cluster analysis (traditional K-means) and regionalization (spatial K-means) • Thirteen parameters (six required) • Grouping results contingent on the number of groups, analysis fields, and type of spatial constraint Grouping Analysis Tool
  • 8. Data Features: • Esri • City of Vancouver Multivariate Data: • World Bank • BBC • Weatherbase • Statistics Canada Data Sources
  • 9. Data • Data Enrichment (ArcGIS Online) • HTML table import • Spreadsheet reformatting • Table joins • Feature class edits Data Preparation
  • 10. Data Selection Criteria: • Two scales of analysis • Illustration of various spatial constraint effects on results • Sufficient number of features • Visible spatial patterns in results Tutorial Datasets
  • 11. General Steps: • Exploratory data analysis • Preprocessing • Determining appropriate Grouping Analysis settings • Postprocessing, interpretation and evaluation of results Grouping Analysis Workflows
  • 12. Exploratory Data Analysis 1. Distribution of variable values • Thematic mapping • Spatial autocorrelation 2. Spatial relationships among features • Contiguity of features and number of neighbours • Spatial autocorrelation Exploratory Data Analysis
  • 13. Exploratory Data Analysis • Explore distribution of dataset variables • Choropleth maps and graduated symbol maps • Identify set of variables to be used for Grouping Analysis Thematic Mapping
  • 14. Exploratory Data Analysis • Analyze contiguity relationships among features • Polygon Neighbors tool • Determine relative connectivity of features by counting number of neighbours • Frequency tool Spatial Relationships
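The neighbour-counting step on slide 14 can be sketched in plain Python. This is only an illustration, not the ArcGIS tools themselves: the `pairs` list is a hypothetical stand-in for a table of (source, neighbour) records, and the function names are invented for this example. Features with no neighbours never appear in such a pair table, so they would need to be added separately.

```python
from collections import Counter


def neighbour_counts(pairs):
    """Count distinct neighbours per feature from (source, neighbour) pairs."""
    neighbours = {}
    for src, nbr in pairs:
        neighbours.setdefault(src, set()).add(nbr)
    return {src: len(nbrs) for src, nbrs in neighbours.items()}


def connectivity_frequency(pairs):
    """Tabulate how many features have 1, 2, 3, ... neighbours."""
    return dict(Counter(neighbour_counts(pairs).values()))


# Hypothetical pair table: A borders B, and B borders C (both directions listed).
pairs = [("A", "B"), ("B", "A"), ("B", "C"), ("C", "B")]
counts = neighbour_counts(pairs)
frequency = connectivity_frequency(pairs)
```

Here `counts` maps each feature to its number of neighbours, and `frequency` summarizes how connected the dataset is overall.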
  • 15. Exploratory Data Analysis • Analyze contiguity and/or proximity relationships among features using GeoDa • Create spatial weights • Display histogram of feature connectivity according to defined spatial relationships • Histogram linked to map and attribute table Alternative Approach
  • 16. Exploratory Data Analysis • Considers attribute values and location of features simultaneously • Moran’s I statistic determines whether spatial pattern of values is dispersed, random or clustered • Significance of pattern evaluated with corresponding z-score • One variable at a time Spatial Autocorrelation
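As a rough illustration of the statistic on slide 16, the sketch below computes global Moran's I for one variable using the standard formula I = (n / S0) · Σ w_ij z_i z_j / Σ z_i², where z is the deviation from the mean and S0 is the sum of all weights. The `chain_weights` helper is a hypothetical toy (features arranged in a line, binary weights). The z-score used to judge significance is not computed here.

```python
def morans_i(values, weights):
    """Global Moran's I for one variable.

    values  -- list of attribute values, one per feature
    weights -- dict mapping (i, j) index pairs to a spatial weight
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(weights.values())          # sum of all spatial weights
    ss = sum(d * d for d in dev)        # sum of squared deviations
    if s0 == 0 or ss == 0:
        raise ValueError("need non-zero weights and non-constant values")
    cross = sum(w * dev[i] * dev[j] for (i, j), w in weights.items())
    return (n / s0) * (cross / ss)


def chain_weights(n):
    """Toy binary weights: each feature neighbours the next one in a line."""
    weights = {}
    for i in range(n - 1):
        weights[(i, i + 1)] = 1.0
        weights[(i + 1, i)] = 1.0
    return weights


clustered = morans_i([1, 1, 5, 5], chain_weights(4))   # similar values adjacent
dispersed = morans_i([1, 5, 1, 5], chain_weights(4))   # alternating values
```

Positive values point towards clustering and negative values towards dispersion, which is why the toy clustered arrangement scores higher than the alternating one.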
  • 17. Preprocessing Use hot spots to limit study area for Grouping Analysis: • Calculate incremental spatial autocorrelation • Identify distance band of most intense clustering • Create hot spot map • Select features from original dataset based on location of hot spots Preprocessing
  • 18. Grouping Analysis Settings 1. How many groups should be created? 2. Which analysis fields should be used? 3. Is a spatial constraint necessary? If so, which type is appropriate? Grouping Analysis Settings: Key Considerations
  • 19. Grouping Analysis Settings • Default number is 2 • Sturges' rule: C = 1 + 3.3 log₁₀(n), where C is the number of groups and n is the number of features • Evaluate the optimal number of groups (up to a maximum of 15) Number of Groups
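The rule of thumb on slide 19 can be turned into a small helper. This is a minimal sketch under stated assumptions: the logarithm is taken as base 10 (the 3.3 factor is the usual conversion from log₂), and the rounding and the clamping to the 2 to 15 range mentioned on the slide are choices made here, not something the slides prescribe. The function name is hypothetical.

```python
import math


def sturges_groups(n_features, min_groups=2, max_groups=15):
    """Suggest a number of groups: C = 1 + 3.3 * log10(n), rounded and clamped."""
    if n_features < 1:
        raise ValueError("n_features must be at least 1")
    c = 1 + 3.3 * math.log10(n_features)
    # Clamp to the range discussed on the slide (assumed: default 2, maximum 15).
    return max(min_groups, min(max_groups, round(c)))


suggested_100 = sturges_groups(100)        # 1 + 3.3 * 2 = 7.6
suggested_large = sturges_groups(10 ** 6)  # 20.8 before clamping
```

The result is only a starting point; the slides still recommend evaluating the optimal number of groups.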
  • 20. Grouping Analysis Settings Two vs. Three Groups
  • 21. Grouping Analysis Settings • Generally driven by research purpose and objectives of grouping • Guide selection of analysis fields with exploratory data analysis findings • Spatial variables may be used as indirect spatial constraints • Assess effectiveness of fields to distinguish features with output report Analysis Fields
  • 23. Grouping Analysis Settings • Choice of spatial constraint or no spatial constraint determines which algorithm is used for grouping • No spatial constraint – traditional K-Means (data space only) • Any spatial constraint – Spatial ‘K’luster Analysis by Tree Edge Removal (SKATER) method (spatial K-Means) Spatial Constraints
  • 24. Grouping Analysis Settings No Spatial Constraint vs. Spatial Constraint
  • 25. Grouping Analysis Settings • Contiguity – edges only (“rook” type) or edges and corners (“queen” type) • Delaunay triangulation – contiguity of representations of features as Voronoi polygons • Proximity – K nearest neighbours • Spatial weights Spatial Constraint Types
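The difference between the two contiguity types on slide 25 is easiest to see on a regular grid. The sketch below is an illustrative assumption: real polygon data needs actual geometry tests for shared edges and corners, and `grid_neighbours` is a hypothetical helper, not part of any ArcGIS API.

```python
def grid_neighbours(row, col, n_rows, n_cols, kind="rook"):
    """Return neighbouring cells of (row, col) on an n_rows x n_cols grid.

    kind="rook"  -- cells sharing an edge only
    kind="queen" -- cells sharing an edge or a corner
    """
    if kind == "rook":
        offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    elif kind == "queen":
        offsets = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)
                   if (dr, dc) != (0, 0)]
    else:
        raise ValueError("kind must be 'rook' or 'queen'")
    return [(row + dr, col + dc) for dr, dc in offsets
            if 0 <= row + dr < n_rows and 0 <= col + dc < n_cols]


centre_rook = grid_neighbours(1, 1, 3, 3, kind="rook")
centre_queen = grid_neighbours(1, 1, 3, 3, kind="queen")
corner_queen = grid_neighbours(0, 0, 3, 3, kind="queen")
```

Queen contiguity always yields at least as many neighbours as rook contiguity, so it tends to give the grouping more freedom in how features are joined.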
  • 26. Grouping Analysis Settings • Evaluate optimal number of groups • Guide selection of analysis fields with calculated R² values • Visually assess results of specified spatial constraint Iterative Process for Optimizing Grouping Analysis
  • 27. Interpretation & Evaluation • Spatial distribution of groups (map) • Global statistics (output report) • Group and variable statistics (output report) • Group profiles Interpretation of Results
  • 28. Interpretation & Evaluation • Compare group means with each other and global range Group Profiles
  • 29. Interpretation & Evaluation • Compare group means and ranges for each variable Group Profiles (2)
  • 30. Interpretation & Evaluation • Consider global mean, median and range for each variable Group Profiles (3)
  • 31. Interpretation & Evaluation • Global Moran’s I statistic • Determine spatial pattern of group membership • Measure spatial compactness of group membership • Clustered groups generally desired Evaluation of Results: Spatial Autocorrelation [Figure: dispersed, clustered and random patterns]
  • 32. Interpretation & Evaluation • Smallest to largest group • Indicator of balance in group membership • Balanced number of group members generally desired for comparison of statistics • Frequency tool Evaluation of Results: Cluster Size Ratio
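The cluster size ratio on slide 32 is simple to compute from a list of group assignments. A minimal sketch, assuming the group IDs have already been read from the output feature class into a Python list; the function name is hypothetical.

```python
from collections import Counter


def cluster_size_ratio(group_ids):
    """Ratio of the smallest group's size to the largest group's size (0 to 1]."""
    sizes = Counter(group_ids)
    if not sizes:
        raise ValueError("group_ids must not be empty")
    return min(sizes.values()) / max(sizes.values())


unbalanced = cluster_size_ratio([1, 1, 1, 2, 2, 3])  # group sizes 3, 2 and 1
balanced = cluster_size_ratio([1, 2, 1, 2])          # group sizes 2 and 2
```

A ratio close to 1 indicates balanced group membership; a ratio close to 0 indicates that one group dominates.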
  • 33. Interpretation & Evaluation • Goodness measure that combines concepts of cohesion and separation • Adapted from cluster analysis to consider attribute data and location • Silhouette coefficient is calculated for every feature and the average is taken for the entire dataset Evaluation of Results: Silhouette
  • 34. Interpretation & Evaluation (B – A) / max(A, B), where A is the distance between a feature and its group center, and B is the distance between the feature and its neighbouring group center Silhouette Coefficient
  • 35. Interpretation & Evaluation • Range between –1 (poor) and 1 (excellent) • < 0.2 indicates poor clustering • > 0.5 indicates good partition of the data Silhouette Coefficient Values
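The silhouette measure from slides 33 to 35 can be sketched as follows. Several assumptions are made here: distances are Euclidean in attribute space only (the slides describe an adaptation that also considers location, which is not reproduced), the "neighbouring group center" is taken to be the nearest center of any other group, and at least two groups exist. All function names are hypothetical. `math.dist` requires Python 3.8 or later.

```python
import math


def group_centers(points, labels):
    """Mean attribute vector of each group."""
    sums, counts = {}, {}
    for point, group in zip(points, labels):
        if group not in sums:
            sums[group] = [0.0] * len(point)
            counts[group] = 0
        for k, value in enumerate(point):
            sums[group][k] += value
        counts[group] += 1
    return {g: tuple(s / counts[g] for s in sums[g]) for g in sums}


def silhouette(point, own_center, other_centers):
    """(B - A) / max(A, B) for one feature."""
    a = math.dist(point, own_center)                       # own group center
    b = min(math.dist(point, c) for c in other_centers)    # nearest other center
    denom = max(a, b)
    return 0.0 if denom == 0 else (b - a) / denom


def mean_silhouette(points, labels):
    """Average silhouette coefficient over all features."""
    centers = group_centers(points, labels)
    scores = []
    for point, group in zip(points, labels):
        others = [c for g, c in centers.items() if g != group]
        scores.append(silhouette(point, centers[group], others))
    return sum(scores) / len(scores)


# Two well-separated toy groups in a two-variable attribute space.
points = [(0, 0), (0, 2), (10, 0), (10, 2)]
labels = [0, 0, 1, 1]
score = mean_silhouette(points, labels)
```

For this toy data the score lands well above the 0.5 threshold that slide 35 associates with a good partition, as expected for two clearly separated groups.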
  • 36. Tutorial Exercises • Six exercises • Two scenarios (3 exercises for each) • Suitable for users at all levels of experience • Exercises take the user through the steps of preprocessing, group creation, interpretation and evaluation of results outlined here Grouping Analysis Tutorial
  • 37. Tutorial Exercises Exercises: 1. Data exploration 2. Grouping for exploratory data analysis 3. Using Spatial Statistics tools to target areas of interest Scenario 1: Analysis of Crime in Chicago
  • 38. Tutorial Exercises Exercises: 4. Create groups and use results to write profiles 5. Explore effects of spatial constraints 6. Evaluation of results Scenario 2: Analysis of Olympic Results
  • 39. Tutorial Exercises 1. All tutorial exercises use polygon data exclusively; point features not covered 2. Space-time constraints using spatial weights matrix file not covered 3. Catered to general user; no exercises specifically target advanced users Limitations
  • 40. Recommendations 1. Exploratory data analysis 2. Grouping Analysis 3. Evaluation of results Recommendations: Enhancements and Additional Tools
  • 41. Recommendations • Multi-step process using Polygon Neighbors, Frequency and table joins could be simplified • Dynamic linking of objects can make use of existing ArcGIS functionality Determining Spatial Relationships Among Features
  • 42. Recommendations • Expand types of spatial relationships that can be analyzed • Enable the analysis of higher order relationships Determining Spatial Relationships Among Features (continued)
  • 43. Recommendations • Tools for determining most useful diagnostic or predictor variables • Guide selection of analysis fields for data partitioning • Adapt neural networks or other data mining tools to work with spatial constraints Identification of Useful Diagnostic Variables
  • 44. Recommendations Grouping Analysis Tool Enhancements • Create unique identifier • Replace null values
  • 45. Recommendations • Spatial weights matrix can be used as the spatial constraint for creating groups • Custom weights require either manual table creation or programming • Solution: interactive feature selection User-defined spatial relationships among features
  • 46. Recommendations • Expand beyond R² and F-statistic values in output report • Adapt methods used to evaluate cluster analysis algorithms (e.g. Silhouette) • Challenge: universally applicable evaluation methods may not be feasible Evaluation of Results