SlideShare a Scribd company logo
1 of 16
Download to read offline
Agenda
★ Introduction & Context
★ What? - Research Question
★ Data Description
★ Exploratory Data Analysis
★ Data Cleaning and Curation
★ Regression Analysis
★ Conclusion
WHYthis project
8,277,829
$14.3 billion
property crime offenses
in the nation in 2014.
estimated losses caused by
property crimes in 2014.
Source: https://www.fbi.gov/about-us/cjis/ucr/crime-in-the-u.s/2014/crime-in-the-u.s.-2014/offenses-known-to-law-enforcement/property-crime
WHATare we doing
Does crime in a neighborhood
affect the home values over time?
RESEARCH QUESTION
DATAdescription
★ 1.14 million records
★ Timeline 2010-15
★ 19 variables
★ 46 crime types
★ Time-series data
★ Dimensions: 5692 x 88
★ Year, Month, ZHVI,
Neighborhood, Region
Seattle 911 Response Zillow Data Research
EXPLORATORY
Data Analysis
EXPLORATORY
Data Analysis
DATACleaning & Curation Methods
Getting rid of unwanted records and columns
Reverse Geocoding Latitudes & Longitudes ~ High
Scale Conversion using Shapefiles in R
Data Transformation ~ Tidy Data (Method applied
Hadley Wickham - Journal of Statistical Software,
vol. 59, 2014.)
Data Aggregation by Year, Month & Neighborhood
Merging the clean datasets by Aggregated Matching
Ready for
Regression Analysis
DATACleaning - R Packages used
gpclib - General Polygon Clipping Library for R
rgeos - Interface to Geometry Engine - Open Source
(GEOS)
maptools - Reading / Handling Spatial Objects
rgdal - Bindings for the Geospatial Data Abstraction
Library
spatialEco - Spatial Analysis and Modelling
tidyr dplyrggplot2
Over 300 lines of R code written for the
project:
DATACleaning & Curation numbers
1.14 Million records narrowed down to 229K values
Reverse geocoded 229K different Latitude-
Longitude combinations and mapped them to 82
neighborhood zones
Zillow data transformed from dim [5692 x 88] to
[20592 x 4] dataframes (Reduce & Transpose)
229K values aggregated to 5904 rows (5 Years, 60
Months, 82 Neighborhoods, 5 Crime types)
Final Merged Dataset to append the Zillow Home
Value to each of the corresponding aggregated 911
value
Ready for
Regression Analysis
Data Analysis
Phew, Finally
Predicting home values as a function of
property crimes
WHAT IS OUR MODEL?
Crimes aggregated by month, year and
neighborhood with corresponding value
of house during the same period and
neighborhood
✔ Linear Relationship between
data
✔ Nearly normally-distributed
residuals
✔ Constant variability
✔ Independent observations
Data Analysis
Collinearity
Data Analysis
Multiple Regression
home.value = β0
+ β1
* Assault.Count + β2
* Arrest.Count + β3
* Burglary.Count + β4
* Disturbance.Count + β5
* Robbery.Count
Estimate Pr (>|t|) 95% Confidence Interval
(Intercept) 469028.28 < 2e-16 461599.3, 476457.2
Assault.Count 1401.19 5.18e-09 931.7, 1870.6
Arrest.Count -2278.03 < 2e-16 -2531.6, -2024.4
Burglary.Count -114.53 0.2973 -329.9, 100.87
Disturbance.Count 56.38 0.0273 6.3, 106.4
Robbery.Count -2413.51 < 2e-16 -2876.40, -1950.6
Multiple R-squared: 0.1106
Assault Count $ 1401.19
Arrest Count $ 2278.03
Burglary Count $ 114.53
Disturbance Count $ 56.38
Robbery Count $ 2413.51
Data Analysis
Model Interpretation
Data Analysis
Special Inferences
Since 2010, 4471 of total 17523 assault
cases reported in Downtown Seattle
Disturbance complaints have less or no correlation
with time or neighborhood ~ evenly distributed
SeattlersliketoParty!
Burglary - Unlawful entry to a structure to commit theft or a felony. In
order for burglary to take place, a victim does not have to be present.
Robbery - Take something from someone that has value by utilizing
intimidation, force or threat. In order for robbery to take place, a
victim must be present at the scene
You are less likely to be
arrested if you stay in
an expensive house!
Other Factors include:
■ Distance from Workplace,
UW and SeaTac Airport.
■ Access to Transportation &
Departmental Stores.
■ History of neighborhood
and past data.
Project Scope
Limitations and Key Learnings
Key Learnings
✔ Data Cleaning
✔ R Programming
✔ Open-ended
questions
✔ Data Science
Thank you!

More Related Content

Similar to Home Value Patrol

Analyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyAnalyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyCOGS Presentations
 
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseForecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseAzavea
 
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...Azavea
 
10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime AnalysisAzavea
 
Data Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataData Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataAzavea
 

Similar to Home Value Patrol (6)

Analyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyAnalyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis County
 
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseForecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
 
HackerOne_Report
HackerOne_ReportHackerOne_Report
HackerOne_Report
 
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
 
10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis
 
Data Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataData Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) Data
 

Recently uploaded

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 

Recently uploaded (20)

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 

Home Value Patrol

  • 1.
  • 2. Agenda ★ Introduction & Context ★ What? - Research Question ★ Data Description ★ Exploratory Data Analysis ★ Data Cleaning and Curation ★ Regression Analysis ★ Conclusion
  • 3. WHYthis project 8,277,829 $14.3 billion property crime offenses in the nation in 2014. estimated losses caused by property crimes in 2014. Source: https://www.fbi.gov/about-us/cjis/ucr/crime-in-the-u.s/2014/crime-in-the-u.s.-2014/offenses-known-to-law-enforcement/property-crime
  • 4. WHATare we doing Does crime in a neighborhood affect the home values over time? RESEARCH QUESTION
  • 5. DATAdescription ★ 1.14 million records ★ Timeline 2010-15 ★ 19 variables ★ 46 crime types ★ Time-series data ★ Dimensions: 5692 x 88 ★ Year, Month, ZHVI, Neighborhood, Region Seattle 911 Response Zillow Data Research
  • 8. DATACleaning & Curation Methods Getting rid of unwanted records and columns Reverse Geocoding Latitudes & Longitudes ~ High Scale Conversion using Shapefiles in R Data Transformation ~ Tidy Data (Method applied Hadley Wickham - Journal of Statistical Software, vol. 59, 2014.) Data Aggregation by Year, Month & Neighborhood Merging the clean datasets by Aggregated Matching Ready for Regression Analysis
  • 9. DATACleaning - R Packages used gpclib - General Polygon Clipping Library for R rgeos - Interface to Geometry Engine - Open Source (GEOS) maptools - Reading / Handling Spatial Objects rgdal - Bindings for the Geospatial Data Abstraction Library spatialEco - Spatial Analysis and Modelling tidyr dplyrggplot2 Over 300 lines of R code written for the project:
  • 10. DATACleaning & Curation numbers 1.14 Million records narrowed down to 229K values Reverse geocoded 229K different Latitude- Longitude combinations and mapped them to 82 neighborhood zones Zillow data transformed from dim [5692 x 88] to [20592 x 4] dataframes (Reduce & Transpose) 229K values aggregated to 5904 rows (5 Years, 60 Months, 82 Neighborhoods, 5 Crime types) Final Merged Dataset to append the Zillow Home Value to each of the corresponding aggregated 911 value Ready for Regression Analysis
  • 11. Data Analysis Phew, Finally Predicting home values as a function of property crimes WHAT IS OUR MODEL? Crimes aggregated by month, year and neighborhood with corresponding value of house during the same period and neighborhood
  • 12. ✔ Linear Relationship between data ✔ Nearly normally-distributed residuals ✔ Constant variability ✔ Independent observations Data Analysis Collinearity
  • 13. Data Analysis Multiple Regression home.value = β0 + β1 * Assault.Count + β2 * Arrest.Count + β3 * Burglary.Count + β4 * Disturbance.Count + β5 * Robbery.Count Estimate Pr (>|t|) 95% Confidence Interval (Intercept) 469028.28 < 2e-16 461599.3, 476457.2 Assault.Count 1401.19 5.18e-09 931.7, 1870.6 Arrest.Count -2278.03 < 2e-16 -2531.6, -2024.4 Burglary.Count -114.53 0.2973 -329.9, 100.87 Disturbance.Count 56.38 0.0273 6.3, 106.4 Robbery.Count -2413.51 < 2e-16 -2876.40, -1950.6 Multiple R-squared: 0.1106
  • 14. Assault Count $ 1401.19 Arrest Count $ 2278.03 Burglary Count $ 114.53 Disturbance Count $ 56.38 Robbery Count $ 2413.51 Data Analysis Model Interpretation
  • 15. Data Analysis Special Inferences Since 2010, 4471 of total 17523 assault cases reported in Downtown Seattle Disturbance complaints have less or no correlation with time or neighborhood ~ evenly distributed SeattlersliketoParty! Burglary - Unlawful entry to a structure to commit theft or a felony. In order for burglary to take place, a victim does not have to be present. Robbery - Take something from someone that has value by utilizing intimidation, force or threat. In order for robbery to take place, a victim must be present at the scene You are less likely to be arrested if you stay in an expensive house!
  • 16. Other Factors include: ■ Distance from Workplace, UW and SeaTac Airport. ■ Access to Transportation & Departmental Stores. ■ History of neighborhood and past data. Project Scope Limitations and Key Learnings Key Learnings ✔ Data Cleaning ✔ R Programming ✔ Open-ended questions ✔ Data Science Thank you!