SlideShare a Scribd company logo
TEAM DC: Georgetown Data Analytics:
Exploring DC crime with neighborhood cluster
analysis, interactive mapping tool, and regression
analysis.
Problem
DC property owners and metro area residents are at greater risk of
becoming victims of crime if they are unaware of the frequency
and types of crime in their neighborhoods of interest.
Imperfect information - A person visiting or buying a home in an
unfamiliar neighborhood faces imperfect information regarding
crime and its impact on safety and property values.
Data Science Pipeline
Began with:
-DC Crime Data - 2011-2014
-DC Neighborhood Data (US Census Data) - 2000-2012
Based on given data, we decided to go with the most up to date property values
and population numbers (2012) from DC Neighborhood data to be most
representative of current state.
Data Munging/Exploration
DC Crime Data:
1. Did not have neighborhood names, but rather cluster, block number,
and latitude/longitude coordinates as far as geographic location.
2. Through visualization (Tableau), we discovered that the
latitude/longitude coordinates were out of date and needed to be
updated to be read by our mapping tool.
3. Only provided a long date (MM/DD/YYYY TT:TT:TT).
Data Munging/Exploration:
Problems
Data Munging/Exploration:
Solutions to problems
Long Date problem:
• MonthName function (Excel or Access) to populate month field
• DayName function (Excel or Access) to populate day field
• Right function (Excel or Access) to parse out time of day
• Updated Year field by individual year conversion (SQL)
SQL Example to update year field from long date:
UPDATE [CrimeData_2011-2014]
SET [CrimeData_2011-2014].REPORT_YEAR = "2014"
WHERE ((([CrimeData_2011-2014].REPORT_DAT) Like "*2014*"));
Data Munging/Exploration -
Solutions to problems
Neighborhood Names problem:
• Neighborhood Data contained cluster number and neighborhood
name, while Crime data only included cluster. An inner join on the
cluster fields brought in Neighborhood Names.
SQL Code for INNER JOIN:
UPDATE [CrimeData_2011-2014]
INNER JOIN [CensusbyNeighborhood-CSV1]
ON [CrimeData_2011-2014].NEIGHBORHOOD_CLUSTER = [CensusbyNeighborhood-
CSV1].NeighClst
SET [CrimeData_2011-2014].NEIGHBORHOOD = [CensusbyNeighborhood-
CSV1].[Neighborhood];
Data Munging/Exploration -
Solutions to problems
Latitude/Longitude problem:
• Latitude/Longitude coordinates were outdated, and Tableau couldn’t
read them for mapping.
• We used Python (pyproj and proj.4 libraries) to bring in up-to-date
latitude/longitude coordinates.
Code can be viewed on Github.
Data Analysis & Exploration
Methods/Tools
• Using Tableau, R and Excel explored the data visually and with regression.
• Tableau mapping and charts – getting an understanding of the data, the
neighborhood clusters and crime incidence.
• Linear/multiple regressions on current 39 Cluster data (2012 median
property value, total per crime category).
• Exploring time series median home price data on violent crime per 1000
residents 2000-2011 in R.
• Exploring regression coefficients between crime areas to determine
impact of violent crime events on property value and if there is marginal
negative return to violent crime – as hypothesized.
Data exploration of raw counts of crime by neighborhood cluster. Shows
heavy crime incidence in commercial areas (dominated by theft). Needs
further exploration.
Total Crime per Capita (000)
Map of Clusters: Tableau (hyperlink)
Comparing Neighborhoods - Crime Types
AdMo/Kalorama- Prop: $1.1M - Crime: 2863 Foxhall/Palisades- Prop: $1.0M - Crime: 602
Fairlawn- Prop: $243K - Crime: 2763Union St/E. Market- Prop: $549K - Crime: 6178
Adams Morgan/Kalorama
Comparing Neighborhoods - Day of Week
Foxhall/Palisades
Union Station/Eastern Market Fairlawn
Products for Residents/Business Owners
http://danieljwood.github.io/DC-crime/
https://www.google.com/maps/d/edit?mid=zUgxKwpr9tdQ.kygX4j2r7scg
Property Value plotted against Violent
Crime Rate, by neighborhood cluster,
2000-2011
Total crimes (count) show little relationship to property value –
explained mainly due to the slightly positive correlation (and
aggregate number) of non-violent crime events.
Correlations: Negative correlations for violent crime indicate impact on
property values – flat/positive correlations for non-violent theft
ARSON
ASSAUL
T
W/DAN
GEROUS
WEAPO
N
BURGL
ARY
HOMICI
DE
MOTOR
VEHICLE
THEFT
ROBBER
Y
SEX
ABUSE
THEFT
F/AUTO
THEFT/
OTHER
Correlation:
Property Value -0.34 -0.5981 -0.41 -0.494 -0.615527 -0.3761 -0.3893 0.11122 0.191
Analysis of Correlation and Map
Clustering
• Maps are excellent for quickly looking through categories in geographic
proximity, but don’t provide much insight without neighborhood specifics
such as commercial activity(JP will elaborate). Of particularly interest are
population density and # liquor licenses (as a proxy for commercial
activity)
• Correlations show that the only two crime groups that are weakly
positively correlated with property value are Theft and Theft from Motor
Vehicle (not Motor Vehicle Theft). This conforms to our expectations
about violent crime impact property value. The correlation itself is fairly
weak, but its sign is interesting.
• Other inferences from the map: trees commit very few crimes ☺
Model1=lm(formula =
Median2012 ~ VCrime2010,
data = dccrimereg)
Coefficients:
(Intercept) VCrime2010
940372 -31296
lm(formula = Prop09 ~ VC07 * VC08, data = dccrimereg2)
Coefficients:
(Intercept) VC07 VC08 VC07:VC08
1065111 -22563 -38566 1290
Average % violent crime per 1000 plotted against % change in housing
asset value over time – not what I expected!
Or is it! Perhaps what we’re observing is the decline in overall violent
crime is showing a massive rebound for the higher crime (asset price
depressed) neighborhood clusters. Gentrification is occurring, but what
is also observed is that housing prices seem to be playing catch up in
those high crime neighborhoods.
Regression Discussion
• Non-violent crime is neutral to weakly associated with increasing property values
(unsurprising).
• Asset value accumulation is an important part of breaking cycle of poverty and
violent crime thus has a detrimental effect on the ability to accumulate
wealth….HOWEVER, high violent crime areas are experiencing much growth in
asset values which indicates that falling crime rates in otherwise desirable areas
are an instrumental factor. Decay rate seems fast. The perception of falling
crime rates may be key factor.
• Further thoughts on where the data might go….
– Drilling down to one number for the impact of violent crime on property value
– Decay rate of violent crime negative impact on property value growth rate.
– Quantifying negative wealth accumulation effect over time of violent crime.
Conclusion
Original Problem: DC property owners and metro area residents are at greater risk of
becoming victims of crime if they are unaware of the frequency and types of crime in
their neighborhoods of interest.
Solution: By implementing effective data analytic techniques, we were able to expose
important trends and relationships regarding crime activity in relation to the different
neighborhoods and associated property values throughout the DC metro area. This
information can provide important insight for business owners, potential
homebuyers, as well as the DC Metro Police Department.
Additional Development: Create a web-based application which can graphically
represent trends, patterns, and crime frequencies in near-real time.
https://www.google.com/maps/d/edit?mid=zUgxKwpr9tdQ.kygX4j2r7scg
Appendix
Crime Type Percentage per Capita
Data Analysis
• Neighborhoods which have more retail businesses, restaurants
and bars are more affected by certain types of crime (e.g.
theft) than residential neighborhoods.
• Residential neighborhoods with higher average incomes and
property values generally have a lower occurrence of violent
crimes vs residential neighborhoods with lower average
incomes and property values
• Day of the week appears to impact certain types of crime
(robbery/theft) in high-traffic neighborhoods but it is not a
direct correlation in every case.

More Related Content

What's hot

ACT 2014 How We Move Around New Research on Transportation Impacts of Urban M...
ACT 2014 How We Move Around New Research on Transportation Impacts of Urban M...ACT 2014 How We Move Around New Research on Transportation Impacts of Urban M...
ACT 2014 How We Move Around New Research on Transportation Impacts of Urban M...
Association for Commuter Transportation (ACT)
 
Sowo570
Sowo570 Sowo570
The Changing Face of Government IT
The Changing Face of Government ITThe Changing Face of Government IT
The Changing Face of Government IT
Dustin Haisler
 
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
Istituto nazionale di statistica
 
El arte veraz: visualización e infografía para comunicación
El arte veraz: visualización e infografía para comunicaciónEl arte veraz: visualización e infografía para comunicación
El arte veraz: visualización e infografía para comunicación
SocialBiblio
 
Project Benson: MTA Turnstile Data Analysis
Project Benson: MTA Turnstile Data AnalysisProject Benson: MTA Turnstile Data Analysis
Project Benson: MTA Turnstile Data Analysis
Rosanne Hoyem
 
Francesca Froy "What is the role of spatial configuration and urban morpholog...
Francesca Froy "What is the role of spatial configuration and urban morpholog...Francesca Froy "What is the role of spatial configuration and urban morpholog...
Francesca Froy "What is the role of spatial configuration and urban morpholog...
HannahParr3
 
SpiesFinalUPP461
SpiesFinalUPP461SpiesFinalUPP461
SpiesFinalUPP461
Benjamin Spies
 
Route Performance VNO - FRA
Route Performance VNO - FRA Route Performance VNO - FRA
Route Performance VNO - FRA
Mohammed Awad
 
Using Maps to Support the Road to Reopening in Rural Communities
Using Maps to Support the Road to Reopening in Rural CommunitiesUsing Maps to Support the Road to Reopening in Rural Communities
Using Maps to Support the Road to Reopening in Rural Communities
CARTO
 

What's hot (10)

ACT 2014 How We Move Around New Research on Transportation Impacts of Urban M...
ACT 2014 How We Move Around New Research on Transportation Impacts of Urban M...ACT 2014 How We Move Around New Research on Transportation Impacts of Urban M...
ACT 2014 How We Move Around New Research on Transportation Impacts of Urban M...
 
Sowo570
Sowo570 Sowo570
Sowo570
 
The Changing Face of Government IT
The Changing Face of Government ITThe Changing Face of Government IT
The Changing Face of Government IT
 
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
 
El arte veraz: visualización e infografía para comunicación
El arte veraz: visualización e infografía para comunicaciónEl arte veraz: visualización e infografía para comunicación
El arte veraz: visualización e infografía para comunicación
 
Project Benson: MTA Turnstile Data Analysis
Project Benson: MTA Turnstile Data AnalysisProject Benson: MTA Turnstile Data Analysis
Project Benson: MTA Turnstile Data Analysis
 
Francesca Froy "What is the role of spatial configuration and urban morpholog...
Francesca Froy "What is the role of spatial configuration and urban morpholog...Francesca Froy "What is the role of spatial configuration and urban morpholog...
Francesca Froy "What is the role of spatial configuration and urban morpholog...
 
SpiesFinalUPP461
SpiesFinalUPP461SpiesFinalUPP461
SpiesFinalUPP461
 
Route Performance VNO - FRA
Route Performance VNO - FRA Route Performance VNO - FRA
Route Performance VNO - FRA
 
Using Maps to Support the Road to Reopening in Rural Communities
Using Maps to Support the Road to Reopening in Rural CommunitiesUsing Maps to Support the Road to Reopening in Rural Communities
Using Maps to Support the Road to Reopening in Rural Communities
 

Similar to Georgetown Data Analytics Project (Team DC)

Crime Dataset Analysis for City of Chicago
Crime Dataset Analysis for City of ChicagoCrime Dataset Analysis for City of Chicago
Crime Dataset Analysis for City of Chicago
Stuti Deshpande
 
Crime analysis
Crime analysisCrime analysis
Crime analysis
Colin Bartram
 
John MitchellMazala MoodyProfessor BrownMath 36April 1.docx
John MitchellMazala MoodyProfessor BrownMath 36April 1.docxJohn MitchellMazala MoodyProfessor BrownMath 36April 1.docx
John MitchellMazala MoodyProfessor BrownMath 36April 1.docx
jesssueann
 
Chicago Crime Dataset Project Proposal
Chicago Crime Dataset Project ProposalChicago Crime Dataset Project Proposal
Chicago Crime Dataset Project Proposal
Aashri Tandon
 
Crime analysis
Crime analysisCrime analysis
Crime analysis
Frank Smilda
 
Crime Data Analysis and Prediction for city of Los Angeles
Crime Data Analysis and Prediction for city of Los AngelesCrime Data Analysis and Prediction for city of Los Angeles
Crime Data Analysis and Prediction for city of Los Angeles
Heta Parekh
 
Crime prediction based on crime types
Crime prediction based on crime typesCrime prediction based on crime types
Crime prediction based on crime types
IJDKP
 
Analytics-Based Crime Prediction
Analytics-Based Crime PredictionAnalytics-Based Crime Prediction
Analytics-Based Crime Prediction
Prodapt Solutions
 
Assignment Presentation of Criminal Justice Research and Geogra.docx
Assignment Presentation of Criminal Justice Research and Geogra.docxAssignment Presentation of Criminal Justice Research and Geogra.docx
Assignment Presentation of Criminal Justice Research and Geogra.docx
lynettearnold46882
 
DSDP Demographic Study 2016
DSDP Demographic Study 2016DSDP Demographic Study 2016
DSDP Demographic Study 2016
Caroline Stevens
 
An Intelligence Analysis of Crime Data for Law Enforcement Using Data Mining
An Intelligence Analysis of Crime Data for Law Enforcement Using Data MiningAn Intelligence Analysis of Crime Data for Law Enforcement Using Data Mining
An Intelligence Analysis of Crime Data for Law Enforcement Using Data Mining
Waqas Tariq
 
Crime analysis and Prediction Crime analysis and Prediction
Crime analysis and Prediction Crime analysis and PredictionCrime analysis and Prediction Crime analysis and Prediction
Crime analysis and Prediction Crime analysis and Prediction
Fahad Almassoud
 
Game Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime DetectionGame Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime Detection
IOSR Journals
 
Final%20Analysis%20Code%20Displayed.html
Final%20Analysis%20Code%20Displayed.htmlFinal%20Analysis%20Code%20Displayed.html
Final%20Analysis%20Code%20Displayed.html
Ryan Haeri
 
CRIME RATE IN SAN FRANCISCO
CRIME RATE IN SAN FRANCISCOCRIME RATE IN SAN FRANCISCO
CRIME RATE IN SAN FRANCISCO
Muhammad Ozair
 
Am I Safe in My Home? Fear of Crime Analyzed with Spatial Statistics Methods ...
Am I Safe in My Home? Fear of Crime Analyzed with Spatial Statistics Methods ...Am I Safe in My Home? Fear of Crime Analyzed with Spatial Statistics Methods ...
Am I Safe in My Home? Fear of Crime Analyzed with Spatial Statistics Methods ...
Beniamino Murgante
 
Fundamentalsof Crime Mapping 6
Fundamentalsof Crime Mapping 6Fundamentalsof Crime Mapping 6
Fundamentalsof Crime Mapping 6
Osokop
 
Building an urban theft map by analyzing newspaper - SMAP 2018
Building an urban theft map by analyzing newspaper - SMAP 2018Building an urban theft map by analyzing newspaper - SMAP 2018
Building an urban theft map by analyzing newspaper - SMAP 2018
Laura Po
 
GEOSPATIAL DATA SOURCES
GEOSPATIAL DATA SOURCESGEOSPATIAL DATA SOURCES
GEOSPATIAL DATA SOURCES
Expert Writing Help
 
IRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data AnalyticsIRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data Analytics
IRJET Journal
 

Similar to Georgetown Data Analytics Project (Team DC) (20)

Crime Dataset Analysis for City of Chicago
Crime Dataset Analysis for City of ChicagoCrime Dataset Analysis for City of Chicago
Crime Dataset Analysis for City of Chicago
 
Crime analysis
Crime analysisCrime analysis
Crime analysis
 
John MitchellMazala MoodyProfessor BrownMath 36April 1.docx
John MitchellMazala MoodyProfessor BrownMath 36April 1.docxJohn MitchellMazala MoodyProfessor BrownMath 36April 1.docx
John MitchellMazala MoodyProfessor BrownMath 36April 1.docx
 
Chicago Crime Dataset Project Proposal
Chicago Crime Dataset Project ProposalChicago Crime Dataset Project Proposal
Chicago Crime Dataset Project Proposal
 
Crime analysis
Crime analysisCrime analysis
Crime analysis
 
Crime Data Analysis and Prediction for city of Los Angeles
Crime Data Analysis and Prediction for city of Los AngelesCrime Data Analysis and Prediction for city of Los Angeles
Crime Data Analysis and Prediction for city of Los Angeles
 
Crime prediction based on crime types
Crime prediction based on crime typesCrime prediction based on crime types
Crime prediction based on crime types
 
Analytics-Based Crime Prediction
Analytics-Based Crime PredictionAnalytics-Based Crime Prediction
Analytics-Based Crime Prediction
 
Assignment Presentation of Criminal Justice Research and Geogra.docx
Assignment Presentation of Criminal Justice Research and Geogra.docxAssignment Presentation of Criminal Justice Research and Geogra.docx
Assignment Presentation of Criminal Justice Research and Geogra.docx
 
DSDP Demographic Study 2016
DSDP Demographic Study 2016DSDP Demographic Study 2016
DSDP Demographic Study 2016
 
An Intelligence Analysis of Crime Data for Law Enforcement Using Data Mining
An Intelligence Analysis of Crime Data for Law Enforcement Using Data MiningAn Intelligence Analysis of Crime Data for Law Enforcement Using Data Mining
An Intelligence Analysis of Crime Data for Law Enforcement Using Data Mining
 
Crime analysis and Prediction Crime analysis and Prediction
Crime analysis and Prediction Crime analysis and PredictionCrime analysis and Prediction Crime analysis and Prediction
Crime analysis and Prediction Crime analysis and Prediction
 
Game Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime DetectionGame Theory Approach for Identity Crime Detection
Game Theory Approach for Identity Crime Detection
 
Final%20Analysis%20Code%20Displayed.html
Final%20Analysis%20Code%20Displayed.htmlFinal%20Analysis%20Code%20Displayed.html
Final%20Analysis%20Code%20Displayed.html
 
CRIME RATE IN SAN FRANCISCO
CRIME RATE IN SAN FRANCISCOCRIME RATE IN SAN FRANCISCO
CRIME RATE IN SAN FRANCISCO
 
Am I Safe in My Home? Fear of Crime Analyzed with Spatial Statistics Methods ...
Am I Safe in My Home? Fear of Crime Analyzed with Spatial Statistics Methods ...Am I Safe in My Home? Fear of Crime Analyzed with Spatial Statistics Methods ...
Am I Safe in My Home? Fear of Crime Analyzed with Spatial Statistics Methods ...
 
Fundamentalsof Crime Mapping 6
Fundamentalsof Crime Mapping 6Fundamentalsof Crime Mapping 6
Fundamentalsof Crime Mapping 6
 
Building an urban theft map by analyzing newspaper - SMAP 2018
Building an urban theft map by analyzing newspaper - SMAP 2018Building an urban theft map by analyzing newspaper - SMAP 2018
Building an urban theft map by analyzing newspaper - SMAP 2018
 
GEOSPATIAL DATA SOURCES
GEOSPATIAL DATA SOURCESGEOSPATIAL DATA SOURCES
GEOSPATIAL DATA SOURCES
 
IRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data AnalyticsIRJET- Crime Analysis using Data Mining and Data Analytics
IRJET- Crime Analysis using Data Mining and Data Analytics
 

Recently uploaded

一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
1tyxnjpia
 
A gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented GenerationA gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented Generation
dataschool1
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
TeukuEriSyahputra
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理
ugydym
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Kaxil Naik
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
Vietnam Cotton & Spinning Association
 
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
oaxefes
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
Márton Kodok
 
一比一原版加拿大麦吉尔大学毕业证(mcgill毕业证书)如何办理
一比一原版加拿大麦吉尔大学毕业证(mcgill毕业证书)如何办理一比一原版加拿大麦吉尔大学毕业证(mcgill毕业证书)如何办理
一比一原版加拿大麦吉尔大学毕业证(mcgill毕业证书)如何办理
agdhot
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
Alireza Kamrani
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
ihavuls
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
Vietnam Cotton & Spinning Association
 
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理 原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
tzu5xla
 
一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理
keesa2
 
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
osoyvvf
 
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
hqfek
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 

Recently uploaded (20)

一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
 
A gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented GenerationA gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented Generation
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理
 
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics May 2024
 
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
 
Build applications with generative AI on Google Cloud
Build applications with generative AI on Google CloudBuild applications with generative AI on Google Cloud
Build applications with generative AI on Google Cloud
 
一比一原版加拿大麦吉尔大学毕业证(mcgill毕业证书)如何办理
一比一原版加拿大麦吉尔大学毕业证(mcgill毕业证书)如何办理一比一原版加拿大麦吉尔大学毕业证(mcgill毕业证书)如何办理
一比一原版加拿大麦吉尔大学毕业证(mcgill毕业证书)如何办理
 
How To Control IO Usage using Resource Manager
How To Control IO Usage using Resource ManagerHow To Control IO Usage using Resource Manager
How To Control IO Usage using Resource Manager
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
 
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理 原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
 
一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理一比一原版悉尼大学毕业证如何办理
一比一原版悉尼大学毕业证如何办理
 
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
一比一原版(uom毕业证书)曼彻斯特大学毕业证如何办理
 
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 

Georgetown Data Analytics Project (Team DC)

  • 1. TEAM DC: Georgetown Data Analytics: Exploring DC crime with neighborhood cluster analysis, interactive mapping tool, and regression analysis.
  • 2. Problem DC property owners and metro area residents are at greater risk of becoming victims of crime if they are unaware of the frequency and types of crime in their neighborhoods of interest. Imperfect information - A person visiting or buying a home in an unfamiliar neighborhood faces imperfect information regarding crime and its impact on safety and property values.
  • 4. Began with: -DC Crime Data - 2011-2014 -DC Neighborhood Data (US Census Data) - 2000-2012 Based on given data, we decided to go with the most up to date property values and population numbers (2012) from DC Neighborhood data to be most representative of current state. Data Munging/Exploration
  • 5. DC Crime Data: 1. Did not have neighborhood names, but rather cluster, block number, and latitude/longitude coordinates as far as geographic location. 2. Through visualization (Tableau), we discovered that the latitude/longitude coordinates were out of date and needed to be updated to be read by our mapping tool. 3. Only provided a long date (MM/DD/YYYY TT:TT:TT). Data Munging/Exploration: Problems
  • 6. Data Munging/Exploration: Solutions to problems Long Date problem: • MonthName function (Excel or Access) to populate month field • DayName function (Excel or Access) to populate day field • Right function (Excel or Access) to parse out time of day • Updated Year field by individual year conversion (SQL) SQL Example to update year field from long date: UPDATE [CrimeData_2011-2014] SET [CrimeData_2011-2014].REPORT_YEAR = "2014" WHERE ((([CrimeData_2011-2014].REPORT_DAT) Like "*2014*"));
  • 7. Data Munging/Exploration - Solutions to problems Neighborhood Names problem: • Neighborhood Data contained cluster number and neighborhood name, while Crime data only included cluster. An inner join on the cluster fields brought in Neighborhood Names. SQL Code for INNER JOIN: UPDATE [CrimeData_2011-2014] INNER JOIN [CensusbyNeighborhood-CSV1] ON [CrimeData_2011-2014].NEIGHBORHOOD_CLUSTER = [CensusbyNeighborhood- CSV1].NeighClst SET [CrimeData_2011-2014].NEIGHBORHOOD = [CensusbyNeighborhood- CSV1].[Neighborhood];
  • 8. Data Munging/Exploration - Solutions to problems Latitude/Longitude problem: • Latitude/Longitude coordinates were outdated, and Tableau couldn’t read them for mapping. • We used Python (pyproj and proj.4 libraries) to bring in up-to-date latitude/longitude coordinates. Code can be viewed on Github.
  • 9. Data Analysis & Exploration Methods/Tools • Using Tableau, R and Excel explored the data visually and with regression. • Tableau mapping and charts – getting an understanding of the data, the neighborhood clusters and crime incidence. • Linear/multiple regressions on current 39 Cluster data (2012 median property value, total per crime category). • Exploring time series median home price data on violent crime per 1000 residents 2000-2011 in R. • Exploring regression coefficients between crime areas to determine impact of violent crime events on property value and if there is marginal negative return to violent crime – as hypothesized.
  • 10. Data exploration of raw counts of crime by neighborhood cluster. Shows heavy crime incidence in commercial areas (dominated by theft). Needs further exploration.
  • 11. Total Crime per Capita (000)
  • 12. Map of Clusters: Tableau (hyperlink)
  • 13.
  • 14.
  • 15.
  • 16. Comparing Neighborhoods - Crime Types AdMo/Kalorama- Prop: $1.1M - Crime: 2863 Foxhall/Palisades- Prop: $1.0M - Crime: 602 Fairlawn- Prop: $243K - Crime: 2763Union St/E. Market- Prop: $549K - Crime: 6178
  • 17. Adams Morgan/Kalorama Comparing Neighborhoods - Day of Week Foxhall/Palisades Union Station/Eastern Market Fairlawn
  • 18. Products for Residents/Business Owners http://danieljwood.github.io/DC-crime/ https://www.google.com/maps/d/edit?mid=zUgxKwpr9tdQ.kygX4j2r7scg
  • 19.
  • 20.
  • 21. Property Value plotted against Violent Crime Rate, by neighborhood cluster, 2000-2011
  • 22.
  • 23.
  • 24. Total crimes (count) show little relationship to property value – explained mainly due to the slightly positive correlation (and aggregate number) of non-violent crime events.
  • 25. Correlations: Negative correlations for violent crime indicate impact on property values – flat/positive correlations for non-violent theft ARSON ASSAUL T W/DAN GEROUS WEAPO N BURGL ARY HOMICI DE MOTOR VEHICLE THEFT ROBBER Y SEX ABUSE THEFT F/AUTO THEFT/ OTHER Correlation: Property Value -0.34 -0.5981 -0.41 -0.494 -0.615527 -0.3761 -0.3893 0.11122 0.191
  • 26. Analysis of Correlation and Map Clustering • Maps are excellent for quickly looking through categories in geographic proximity, but don’t provide much insight without neighborhood specifics such as commercial activity(JP will elaborate). Of particularly interest are population density and # liquor licenses (as a proxy for commercial activity) • Correlations show that the only two crime groups that are weakly positively correlated with property value are Theft and Theft from Motor Vehicle (not Motor Vehicle Theft). This conforms to our expectations about violent crime impact property value. The correlation itself is fairly weak, but its sign is interesting. • Other inferences from the map: trees commit very few crimes ☺
  • 27. Model1=lm(formula = Median2012 ~ VCrime2010, data = dccrimereg) Coefficients: (Intercept) VCrime2010 940372 -31296
  • 28. lm(formula = Prop09 ~ VC07 * VC08, data = dccrimereg2) Coefficients: (Intercept) VC07 VC08 VC07:VC08 1065111 -22563 -38566 1290
  • 29. Average % violent crime per 1000 plotted against % change in housing asset value over time – not what I expected! Or is it! Perhaps what we’re observing is the decline in overall violent crime is showing a massive rebound for the higher crime (asset price depressed) neighborhood clusters. Gentrification is occurring, but what is also observed is that housing prices seem to be playing catch up in those high crime neighborhoods.
  • 30. Regression Discussion • Non-violent crime is neutral to weakly associated with increasing property values (unsurprising). • Asset value accumulation is an important part of breaking cycle of poverty and violent crime thus has a detrimental effect on the ability to accumulate wealth….HOWEVER, high violent crime areas are experiencing much growth in asset values which indicates that falling crime rates in otherwise desirable areas are an instrumental factor. Decay rate seems fast. The perception of falling crime rates may be key factor. • Further thoughts on where the data might go…. – Drilling down to one number for the impact of violent crime on property value – Decay rate of violent crime negative impact on property value growth rate. – Quantifying negative wealth accumulation effect over time of violent crime.
  • 31. Conclusion Original Problem: DC property owners and metro area residents are at greater risk of becoming victims of crime if they are unaware of the frequency and types of crime in their neighborhoods of interest. Solution: By implementing effective data analytic techniques, we were able to expose important trends and relationships regarding crime activity in relation to the different neighborhoods and associated property values throughout the DC metro area. This information can provide important insight for business owners, potential homebuyers, as well as the DC Metro Police Department. Additional Development: Create a web-based application which can graphically represent trends, patterns, and crime frequencies in near-real time. https://www.google.com/maps/d/edit?mid=zUgxKwpr9tdQ.kygX4j2r7scg
  • 33. Crime Type Percentage per Capita
  • 34. Data Analysis • Neighborhoods which have more retail businesses, restaurants and bars are more affected by certain types of crime (e.g. theft) than residential neighborhoods. • Residential neighborhoods with higher average incomes and property values generally have a lower occurrence of violent crimes vs residential neighborhoods with lower average incomes and property values • Day of the week appears to impact certain types of crime (robbery/theft) in high-traffic neighborhoods but it is not a direct correlation in every case.