SlideShare a Scribd company logo
Margaret Furr
MINING CRIME INCIDENT DATA FOR SPACE-TIME
PATTERNS AROUND WASHINGTON, DC CAMPUSES
Photo Source: http://www.thecollegefix.com/post/19184/
DATA
• 2012 Crime Incidents; 2013 Crime Incidents; 2014 Crime Incidents
• File type: shapefile, points
• Variables of interest: X coordinates, Y coordinates, Offense, Report Date, Start Date, End Date
• Observations: 109,656
• Projection: Spatial reference = 26985
• Source: Open DC Data
• University and College Campuses; Campus Areas Zoning
• File type: shapefile, polygons
• Variables of interest: Campus names, Campus areas, Campus lengths
• Observations: 8 college/university campus zoning areas,
30 college/universities
• Projection: Spatial reference = 26985
• Source: Open DC Data
• USStates
• File type: shapefile, polygons
• Observations: 1 for each state + 1 for Washington, DC
• Projection: GCS_WGS_1984 GCS
• Source: United States Geodatabase
1Photo Source: http://opendata.dc.gov/
RESEARCH MOTIVATIONS
• Crime -- less likely to occur on campuses
• Washington, DC -- not a city with the most crime
• 2 DC universities - ranked as the most dangerous campuses in some news reports
• Gallaudet University
• Howard University (7 forcible rapes, 90 robberies, 27 aggravated assaults, 160 burglaries, 43 car thefts)
• In general -- a growing concerns about students’ safety on campuses since 1990s
• Understanding crime frequencies and trends across time helps (1) police departments,
(2) administrators, (3) policymakers, (4) journalists and (5) students make decisions
2
RESEARCH MOTIVATIONS
• Researchers have analyzed space-time crime patterns
• None have looked at DC crimes as they relate to campuses’
locations
• I have conducted spatial-regression analyses using Chicago crime data
and R spatial packages
• I have not conducted space-time analyses, using the time series
component of crime data, but I find this component to be an important
one
• I have not analyzed crime data that occurs in the DC area, but this is
where 2 universities are reported to be unsafe
• I have not used ArcGIS tools yet!
3
RESEARCH QUESTIONS
• What is the frequency of each crime type at locations
near campuses, and how do these frequencies compare to
the frequencies of each type far from campuses?
• Looking at the start datetimes of 2012-2014 crime
reports, are there any patterns in when each incident
occurs near campuses?
• Do these patterns reveal emerging trends? If so, how do
trends differ by crime type?
• What trends are emerging near the two most
dangerously ranked DC campuses?
4
RESEARCH APPROACH
• Space-Time Analysis of crime within buffered campus areas
• Space-Time Cube
• Emerging Hot Spot Analysis
5
• Define Projection and Project
• Selection
• Merge
• Buffer
• Frequency Analysis
• Convert Time
• Create Space-Time Cube
• Emerging Hot Spot Analysis
TOOLS
6
Photo Sources: https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining-toolbox/learnmorecreatecube.htm
https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining-toolbox/learnmoreemerging.htm
GEODATABASE
• FinalProjectMargaretFurr.gdb
• Crime Incident data
• Campus Area data
• DC polygon
7
PROJECTION
• Crime Incident data and Campus Area data had no projection
• Metadata said this data was 26985
• Defined data as NAD83-Maryland, Projection
• Reprojected data to be NAD83 17N, UTM
• DC data had WGS_1985 Coordinates and WGS_1984 datum
• Reprojected data to be NAD83 17, UTM
8
MERGE
• Merged Crime Incidents
• 2012 Crime Incidents (NAD83-17N)
• 2013 Crime Incidents (NAD83-17N)
• 2014 Crime Incidents (NAD83-17N)
• Merged Campus Areas
• University College Campuses (NAD83-17N)
• Campus Areas Zoning (NAD83-17N)
9
DC LAYER: SELECT
• Data from DC Open data does not have any shapefiles for the
general DC area
• UnitedStates.gdb has a USStates shapefile, which includes
DC as one state
• Selected DC from USStates
• Exported selection as its own DC layer (DC_NAD8317)
10
INITIAL DATA VISUALIZATIONS
11
INITIAL DATA VISUALIZATIONS
12
FREQUENCIES BY TYPE
13
CREATE BUFFERS
• Create buffer around campus areas
• 1000 ft. is a standard buffer for schools
• 1000 ft = 305m
• 305m as the first buffer distance
• 1500 ft, 458m and 2000 ft, 610m* as second and third buffer distances
• SELECT crime points within buffers
• 40,873 incidents with 2000 ft. or 610m* of campus areas
14
BUFFERED AREAS
15
INITIAL TIME SERIES VISUALIZATION
16
Buffer Distance: 2000 ft./610 m
BUFFERED FREQUENCIES BY TYPE
17
BUFFERED FREQUENCIES BY TYPE
Offense Frequency - all data Frequency - data in buffer 1 Frequency - data in buffer 2 Frequency - data in buffer 3
Arson 95 11 15 21
Assault w/ dangerous weapon 7224 783 1174 1533
Burglary 10151 1470 2247 2991
Homicide 296 28 39 52
Motor vehicle theft 8633 967 1573 2070
Robbery 11470 1568 2437 3341
Sex abuse 862 171 232 269
Theft f/ auto 31299 6283 9650 13162
Theft / other 39269 8540 13561 17414
0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
50%
Arson Assault w/
dangerous
weapon
Burglary Homicide Motor
vehicle theft
Robbery Sex abuse Theft f/ auto Theft /
other
Frequency - data in buffer 1
Frequency - data in buffer 2
Frequency - data in buffer 3
Frequency Differences: Buffered Data Type Frequencies/All Data Type Frequencies
18
• “Summarizes a set of points into a netCDF data structure by aggregating
them into space-time bins.”
• “Within each bin, the points are counted.”
• “For all bin locations, the trend for counts over time are evaluated.”
SPACE-TIME PATTERN MINING TOOLBOX:
CREATE SPACE TIME CUBE
19
Photo Sources: https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining-
toolbox/learnmorecreatecube.htm
• “Identifies trends in the clustering of point densities (counts) or
summary fields in a space time cube created.”
• “Categories: (1) new, (2) consecutive, (3) intensifying, (4) persistent,
(5) diminishing, (6) sporadic, (7) oscillating, and (8) historical hot and
cold spots.”
SPACE-TIME PATTERN MINING TOOLBOX:
EMERGING HOT SPOT ANALYSIS
20
Photo Sources: https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining-
toolbox/learnmorecreatecube.htm
CONVERT TIME
• Original date variable = String type
• Date type is required for Space-Time Pattern Mining Tools
• Report date, Start date, End date
21
INCORRECT VALUES
22
CREATE SPACE TIME CUBE
• Time Step Alignment: start time, end time, reference time
• End time; eliminates bias from choosing a reference time
• Time Step Interval: 1 Weeks
• Distance Interval:
• Calculated optimal interval based on algorithm that considers
spatial distribution (histogram bin-width optimization)
• Template Cube:
• did not use
23
Photo Sources: https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining-
toolbox/learnmorecreatecube.htm
STATISTICS FROM SPACE TIME CUBE
• Mann-Kendall Statistic
• Statistical question: “are the events represented by the input points increasing or
decreasing over time?”
• Answer: “the number of points for all locations in each time-step interval,
analyzed as a time series of count values”
• Rank correlation analysis for the bin count or value and their time sequence
• +1 if 1st bin < 2nd bin
• -1 if 1st bin > 2nd bin
• 0 if 1st bin = 2nd bin)
• Results are summed
• Observed sum compared to expected sum
• p-value
• small p-value: the trend is statistically significant
24
STATISTICS FROM SPACE TIME CUBE
All Data Buffered Data 3
Total Number of Locations 8989 8400
Locations with at least one point 2945 1814
Associated Bins 5289220 3198082
% Non-zero Sparseness 1.54 1.03
Time Step Interval 1 week 1 week
Distance Interval 201m 134m
Number of Time Steps 1796
Cube Extent Across Space
Min Y 836904.0313m 837838.3256m
Min X 4303610.9737m 4309021.9271m
Max X 854777.2083m 849063.8549m
Max Y 4323770.1568m 4322418.4232m
Rows 101 100
Columns 89 84
Total bins 16144244 14809200
Overall Data Trend
Trend Direction Increasing
Trend Statistic 24.3788 23.9058
Trend p-value 0 0
25
• Theft/Other: Increasing Trend, Significant
• Theft f/Auto: Increasing Trend, Significant
• Robbery: Decreasing Trend (>-5), Significant
• Burglary: Increasing Trend (<5 though unlike others >18), Significant (> 0
though unlike 0)
• Motor Vehicle Theft: Increasing Trend (<6), Significant
• Assault with Dangerous Weapon: Decreasing Trend, Insignificant
• Sex Assault: Increasing Trend, Significant
• Homicide: too few records for analysis
• Arson: too few records for analysis
STATISTICS FROM SPACE TIME CUBES BY TYPE
26
EMERGING HOT SPOTS
27
EMERGING HOT SPOTS
28
EMERGING HOT SPOTS
29
EMERGING HOT SPOTS
30
CONCLUSIONS
• Around Howard University there are significant patterns of theft/auto crimes and also sexual
assault
• Theft/auto incidents are emerging consecutively
• Sexual Assault incidents emerging sporadically
• These types are two reported (among with others) in the media
• Around George Washington University and Georgetown University there are also significant
patterns of theft/other crimes
• Theft/other incidents are emerging consecutively to the non-river, and also Howard University
side of George Washington University
• Theft/other incidents are emerging sporadically on the river-side of Georgetown University
and George Washington University
• Both types of theft are emerging consecutively around other US colleges’ and universities’ DC
campuses
• Cornell in Washington, University of California Washington Center
31
FUTURE WORK
32
• More years of crime data?
• Experiment with ArcGIS Pro or ArcGIS’s multidimensional toolbox
to visualize the space time box and understand it better
• Develop a Python script to tune, or optimize the time and distance
intervals as well as the buffer distance for more accurate results
• Experiment with the template cube for potentially more consistent
results across types
33
QUESTIONS?

More Related Content

Similar to ArcGIS Space-Time Mining of Crime Data

HunchLab 2.0 Getting Started
HunchLab 2.0 Getting StartedHunchLab 2.0 Getting Started
HunchLab 2.0 Getting Started
Azavea
 
Surveys - Urban And Regional Planning
Surveys - Urban And Regional PlanningSurveys - Urban And Regional Planning
Surveys - Urban And Regional Planning
vinayak1211
 
A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...
A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...
A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...
Nandana Mihindukulasooriya
 
Big data
Big dataBig data
Big data
Zeeshan Khan
 
IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079
ibankuk
 
DSD-INT 2019 How machine learning will change flood risk and impact assessmen...
DSD-INT 2019 How machine learning will change flood risk and impact assessmen...DSD-INT 2019 How machine learning will change flood risk and impact assessmen...
DSD-INT 2019 How machine learning will change flood risk and impact assessmen...
Deltares
 
Big data
Big dataBig data
Big data
Big dataBig data
Big data
Harshit Namdev
 
Hadoop PDF
Hadoop PDFHadoop PDF
Hadoop PDF
1904saikrishna
 
Cobweb: Using citizen science data to support flood risk modelling
Cobweb: Using citizen science data to support flood risk modellingCobweb: Using citizen science data to support flood risk modelling
Cobweb: Using citizen science data to support flood risk modelling
COBWEB Project
 
Geodemographics: Open tools and mehtods
Geodemographics: Open tools and mehtodsGeodemographics: Open tools and mehtods
Geodemographics: Open tools and mehtods
Dr Muhammad Adnan
 
Skillwise Big data
Skillwise Big dataSkillwise Big data
Skillwise Big data
Skillwise Group
 
How we built a global search engine for genetic data
How we built a global search engine for genetic dataHow we built a global search engine for genetic data
How we built a global search engine for genetic data
Miro Cupak
 
Revealing spatial and temporal patterns from Flickr photography: a case study...
Revealing spatial and temporal patterns from Flickr photography: a case study...Revealing spatial and temporal patterns from Flickr photography: a case study...
Revealing spatial and temporal patterns from Flickr photography: a case study...
Sander van der Drift
 
How the Web of Data Will be Won
How the Web of Data Will be WonHow the Web of Data Will be Won
How the Web of Data Will be Won
Jeni Tennison
 
From Virtual Museums to Peacebuilding: Creating and Using Linked Knowledge
From Virtual Museums to Peacebuilding: Creating and Using Linked KnowledgeFrom Virtual Museums to Peacebuilding: Creating and Using Linked Knowledge
From Virtual Museums to Peacebuilding: Creating and Using Linked Knowledge
Craig Knoblock
 
Big data
Big dataBig data
wang.pptx
wang.pptxwang.pptx
wang.pptx
SteveIrwin25
 
HunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the HoodHunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the HoodAzavea
 
Crime rate data analysis in Los Angeles
Crime rate data analysis in Los AngelesCrime rate data analysis in Los Angeles
Crime rate data analysis in Los Angeles
Ramdharan Donda
 

Similar to ArcGIS Space-Time Mining of Crime Data (20)

HunchLab 2.0 Getting Started
HunchLab 2.0 Getting StartedHunchLab 2.0 Getting Started
HunchLab 2.0 Getting Started
 
Surveys - Urban And Regional Planning
Surveys - Urban And Regional PlanningSurveys - Urban And Regional Planning
Surveys - Urban And Regional Planning
 
A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...
A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...
A Two-Fold Quality Assurance Approach for Dynamic Knowledge Bases : The 3cixt...
 
Big data
Big dataBig data
Big data
 
IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079IBANK - Big data www.ibank.uk.com 07474222079
IBANK - Big data www.ibank.uk.com 07474222079
 
DSD-INT 2019 How machine learning will change flood risk and impact assessmen...
DSD-INT 2019 How machine learning will change flood risk and impact assessmen...DSD-INT 2019 How machine learning will change flood risk and impact assessmen...
DSD-INT 2019 How machine learning will change flood risk and impact assessmen...
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Hadoop PDF
Hadoop PDFHadoop PDF
Hadoop PDF
 
Cobweb: Using citizen science data to support flood risk modelling
Cobweb: Using citizen science data to support flood risk modellingCobweb: Using citizen science data to support flood risk modelling
Cobweb: Using citizen science data to support flood risk modelling
 
Geodemographics: Open tools and mehtods
Geodemographics: Open tools and mehtodsGeodemographics: Open tools and mehtods
Geodemographics: Open tools and mehtods
 
Skillwise Big data
Skillwise Big dataSkillwise Big data
Skillwise Big data
 
How we built a global search engine for genetic data
How we built a global search engine for genetic dataHow we built a global search engine for genetic data
How we built a global search engine for genetic data
 
Revealing spatial and temporal patterns from Flickr photography: a case study...
Revealing spatial and temporal patterns from Flickr photography: a case study...Revealing spatial and temporal patterns from Flickr photography: a case study...
Revealing spatial and temporal patterns from Flickr photography: a case study...
 
How the Web of Data Will be Won
How the Web of Data Will be WonHow the Web of Data Will be Won
How the Web of Data Will be Won
 
From Virtual Museums to Peacebuilding: Creating and Using Linked Knowledge
From Virtual Museums to Peacebuilding: Creating and Using Linked KnowledgeFrom Virtual Museums to Peacebuilding: Creating and Using Linked Knowledge
From Virtual Museums to Peacebuilding: Creating and Using Linked Knowledge
 
Big data
Big dataBig data
Big data
 
wang.pptx
wang.pptxwang.pptx
wang.pptx
 
HunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the HoodHunchLab 2.0 Predictive Missions: Under the Hood
HunchLab 2.0 Predictive Missions: Under the Hood
 
Crime rate data analysis in Los Angeles
Crime rate data analysis in Los AngelesCrime rate data analysis in Los Angeles
Crime rate data analysis in Los Angeles
 

Recently uploaded

Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
eddie19851
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Subhajit Sahu
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
mbawufebxi
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
vikram sood
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 

Recently uploaded (20)

Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
Nanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdfNanandann Nilekani's ppt On India's .pdf
Nanandann Nilekani's ppt On India's .pdf
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
Algorithmic optimizations for Dynamic Levelwise PageRank (from STICD) : SHORT...
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
一比一原版(Bradford毕业证书)布拉德福德大学毕业证如何办理
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 

ArcGIS Space-Time Mining of Crime Data

  • 1. Margaret Furr MINING CRIME INCIDENT DATA FOR SPACE-TIME PATTERNS AROUND WASHINGTON, DC CAMPUSES Photo Source: http://www.thecollegefix.com/post/19184/
  • 2. DATA • 2012 Crime Incidents; 2013 Crime Incidents; 2014 Crime Incidents • File type: shapefile, points • Variables of interest: X coordinates, Y coordinates, Offense, Report Date, Start Date, End Date • Observations: 109,656 • Projection: Spatial reference = 26985 • Source: Open DC Data • University and College Campuses; Campus Areas Zoning • File type: shapefile, polygons • Variables of interest: Campus names, Campus areas, Campus lengths • Observations: 8 college/university campus zoning areas, 30 college/universities • Projection: Spatial reference = 26985 • Source: Open DC Data • USStates • File type: shapefile, polygons • Observations: 1 for each state + 1 for Washington, DC • Projection: GCS_WGS_1984 GCS • Source: United States Geodatabase 1Photo Source: http://opendata.dc.gov/
  • 3. RESEARCH MOTIVATIONS • Crime -- less likely to occur on campuses • Washington, DC -- not a city with the most crime • 2 DC universities - ranked as the most dangerous campuses in some news reports • Gallaudet University • Howard University (7 forcible rapes, 90 robberies, 27 aggravated assaults, 160 burglaries, 43 car thefts) • In general -- a growing concerns about students’ safety on campuses since 1990s • Understanding crime frequencies and trends across time helps (1) police departments, (2) administrators, (3) policymakers, (4) journalists and (5) students make decisions 2
  • 4. RESEARCH MOTIVATIONS • Researchers have analyzed space-time crime patterns • None have looked at DC crimes as they relate to campuses’ locations • I have conducted spatial-regression analyses using Chicago crime data and R spatial packages • I have not conducted space-time analyses, using the time series component of crime data, but I find this component to be an important one • I have not analyzed crime data that occurs in the DC area, but this is where 2 universities are reported to be unsafe • I have not used ArcGIS tools yet! 3
  • 5. RESEARCH QUESTIONS • What is the frequency of each crime type at locations near campuses, and how do these frequencies compare to the frequencies of each type far from campuses? • Looking at the start datetimes of 2012-2014 crime reports, are there any patterns in when each incident occurs near campuses? • Do these patterns reveal emerging trends? If so, how do trends differ by crime type? • What trends are emerging near the two most dangerously ranked DC campuses? 4
  • 6. RESEARCH APPROACH • Space-Time Analysis of crime within buffered campus areas • Space-Time Cube • Emerging Hot Spot Analysis 5
  • 7. • Define Projection and Project • Selection • Merge • Buffer • Frequency Analysis • Convert Time • Create Space-Time Cube • Emerging Hot Spot Analysis TOOLS 6 Photo Sources: https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining-toolbox/learnmorecreatecube.htm https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining-toolbox/learnmoreemerging.htm
  • 8. GEODATABASE • FinalProjectMargaretFurr.gdb • Crime Incident data • Campus Area data • DC polygon 7
  • 9. PROJECTION • Crime Incident data and Campus Area data had no projection • Metadata said this data was 26985 • Defined data as NAD83-Maryland, Projection • Reprojected data to be NAD83 17N, UTM • DC data had WGS_1985 Coordinates and WGS_1984 datum • Reprojected data to be NAD83 17, UTM 8
  • 10. MERGE • Merged Crime Incidents • 2012 Crime Incidents (NAD83-17N) • 2013 Crime Incidents (NAD83-17N) • 2014 Crime Incidents (NAD83-17N) • Merged Campus Areas • University College Campuses (NAD83-17N) • Campus Areas Zoning (NAD83-17N) 9
  • 11. DC LAYER: SELECT • Data from DC Open data does not have any shapefiles for the general DC area • UnitedStates.gdb has a USStates shapefile, which includes DC as one state • Selected DC from USStates • Exported selection as its own DC layer (DC_NAD8317) 10
  • 15. CREATE BUFFERS • Create buffer around campus areas • 1000 ft. is a standard buffer for schools • 1000 ft = 305m • 305m as the first buffer distance • 1500 ft, 458m and 2000 ft, 610m* as second and third buffer distances • SELECT crime points within buffers • 40,873 incidents with 2000 ft. or 610m* of campus areas 14
  • 17. INITIAL TIME SERIES VISUALIZATION 16 Buffer Distance: 2000 ft./610 m
  • 19. BUFFERED FREQUENCIES BY TYPE Offense Frequency - all data Frequency - data in buffer 1 Frequency - data in buffer 2 Frequency - data in buffer 3 Arson 95 11 15 21 Assault w/ dangerous weapon 7224 783 1174 1533 Burglary 10151 1470 2247 2991 Homicide 296 28 39 52 Motor vehicle theft 8633 967 1573 2070 Robbery 11470 1568 2437 3341 Sex abuse 862 171 232 269 Theft f/ auto 31299 6283 9650 13162 Theft / other 39269 8540 13561 17414 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50% Arson Assault w/ dangerous weapon Burglary Homicide Motor vehicle theft Robbery Sex abuse Theft f/ auto Theft / other Frequency - data in buffer 1 Frequency - data in buffer 2 Frequency - data in buffer 3 Frequency Differences: Buffered Data Type Frequencies/All Data Type Frequencies 18
  • 20. • “Summarizes a set of points into a netCDF data structure by aggregating them into space-time bins.” • “Within each bin, the points are counted.” • “For all bin locations, the trend for counts over time are evaluated.” SPACE-TIME PATTERN MINING TOOLBOX: CREATE SPACE TIME CUBE 19 Photo Sources: https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining- toolbox/learnmorecreatecube.htm
  • 21. • “Identifies trends in the clustering of point densities (counts) or summary fields in a space time cube created.” • “Categories: (1) new, (2) consecutive, (3) intensifying, (4) persistent, (5) diminishing, (6) sporadic, (7) oscillating, and (8) historical hot and cold spots.” SPACE-TIME PATTERN MINING TOOLBOX: EMERGING HOT SPOT ANALYSIS 20 Photo Sources: https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining- toolbox/learnmorecreatecube.htm
  • 22. CONVERT TIME • Original date variable = String type • Date type is required for Space-Time Pattern Mining Tools • Report date, Start date, End date 21
  • 24. CREATE SPACE TIME CUBE • Time Step Alignment: start time, end time, reference time • End time; eliminates bias from choosing a reference time • Time Step Interval: 1 Weeks • Distance Interval: • Calculated optimal interval based on algorithm that considers spatial distribution (histogram bin-width optimization) • Template Cube: • did not use 23 Photo Sources: https://desktop.arcgis.com/en/desktop/latest/tools/space-time-pattern-mining- toolbox/learnmorecreatecube.htm
  • 25. STATISTICS FROM SPACE TIME CUBE • Mann-Kendall Statistic • Statistical question: “are the events represented by the input points increasing or decreasing over time?” • Answer: “the number of points for all locations in each time-step interval, analyzed as a time series of count values” • Rank correlation analysis for the bin count or value and their time sequence • +1 if 1st bin < 2nd bin • -1 if 1st bin > 2nd bin • 0 if 1st bin = 2nd bin) • Results are summed • Observed sum compared to expected sum • p-value • small p-value: the trend is statistically significant 24
  • 26. STATISTICS FROM SPACE TIME CUBE All Data Buffered Data 3 Total Number of Locations 8989 8400 Locations with at least one point 2945 1814 Associated Bins 5289220 3198082 % Non-zero Sparseness 1.54 1.03 Time Step Interval 1 week 1 week Distance Interval 201m 134m Number of Time Steps 1796 Cube Extent Across Space Min Y 836904.0313m 837838.3256m Min X 4303610.9737m 4309021.9271m Max X 854777.2083m 849063.8549m Max Y 4323770.1568m 4322418.4232m Rows 101 100 Columns 89 84 Total bins 16144244 14809200 Overall Data Trend Trend Direction Increasing Trend Statistic 24.3788 23.9058 Trend p-value 0 0 25
  • 27. • Theft/Other: Increasing Trend, Significant • Theft f/Auto: Increasing Trend, Significant • Robbery: Decreasing Trend (>-5), Significant • Burglary: Increasing Trend (<5 though unlike others >18), Significant (> 0 though unlike 0) • Motor Vehicle Theft: Increasing Trend (<6), Significant • Assault with Dangerous Weapon: Decreasing Trend, Insignificant • Sex Assault: Increasing Trend, Significant • Homicide: too few records for analysis • Arson: too few records for analysis STATISTICS FROM SPACE TIME CUBES BY TYPE 26
  • 32. CONCLUSIONS • Around Howard University there are significant patterns of theft/auto crimes and also sexual assault • Theft/auto incidents are emerging consecutively • Sexual Assault incidents emerging sporadically • These types are two reported (among with others) in the media • Around George Washington University and Georgetown University there are also significant patterns of theft/other crimes • Theft/other incidents are emerging consecutively to the non-river, and also Howard University side of George Washington University • Theft/other incidents are emerging sporadically on the river-side of Georgetown University and George Washington University • Both types of theft are emerging consecutively around other US colleges’ and universities’ DC campuses • Cornell in Washington, University of California Washington Center 31
  • 33. FUTURE WORK 32 • More years of crime data? • Experiment with ArcGIS Pro or ArcGIS’s multidimensional toolbox to visualize the space time box and understand it better • Develop a Python script to tune, or optimize the time and distance intervals as well as the buffer distance for more accurate results • Experiment with the template cube for potentially more consistent results across types