SlideShare a Scribd company logo
DSO 510 Business Analytics | Group Project
1
Crime in San Francisco
DSO 510 Business Analytics | Group Project
Phase 4 Presentation
Andrew Chen | Yile Wu | Chi Zhang | Chulsoon Pak
DSO 510 Business Analytics | Group Project
2
DSO 510 Business Analytics | Group Project
3
PHASE I
Define business analytics proposal, data required, data analysis approach,
and decision making and innovation framework
DSO 510 Business Analytics | Group Project
4
• 2014 Population: 852,4691
• 13th most populous city in the nation
• Separated into 10 districts
• Top global innovation center, with
highest concentration of technology-
related jobs in the U.S.
• Crime in San Francisco has historically
been higher than the U.S. average
• Crime Index of SF is rated 3 out of 100
(safer than 3% of other U.S. cities)2
BACKGROUND: SAN FRANCISCO
1. United States Census Bureau, July 2014
2. "Crime rates for San Francisco, CA", NeighborhoodScout, 2013
DSO 510 Business Analytics | Group Project
5
In order to make San Francisco a safer place,
we aim identify factors that promote criminal
behavior to predict crime more accurately.
GOAL DEFINITION
DSO 510 Business Analytics | Group Project
6
DEFINING OUR VARIABLES
Dependent Variables
1. Number of Crimes
2. Time of Crime
3. Date of Crime
4. Severity of Crime
5. Type of Crime
6. Location of Crime
Independent Variables
1. Day of Week
2. Season of Year
3. Weather
4. Daylight
5. Income Level of District
6. Average Housing Price of District
7. Age Composition of District
8. Population density
9. Degree of Urbanization
10. Modes of Transportation
11. Level of Education
12. Divorce rate of Families
DSO 510 Business Analytics | Group Project
7
• San Francisco Crime Data:
• https://data.sfgov.org/Public-Safety/
• 700,000+ data points (5 years of data)
• San Francisco Weather, Population,
Housing, Hazard Risk, and Demographics
• www.sfclimatehealth.org/
• http://aa.usno.navy.mil/data/docs/RS_
OneYear.php
• San Francisco Housing, Income Level,
Employment, and Transportation by
District
• www.sf-planning.org
DATA COLLECTION
DSO 510 Business Analytics | Group Project
8
INTERPRETATION & ACTION
1. Identify to what degree different variables contribute to crime
2. Predict probability and severity of crimes in terms of time and location
Implementation
1. Assist SFPD in efficient deployment of its police force
2. Integrate data with mapping algorithms to provide the safest real-time routes
3. Organize anti-crime education in high crime areas (how to handle crimes under different
situations)
4. Utilize data in product development and marketing of security-related products
5. Enhance San Francisco city-planning to reduce crimes
DSO 510 Business Analytics | Group Project
9
PHASE II
DSO 510 Business Analytics | Group Project
10
In order to make San Francisco a safer place,
we aim identify factors that promote criminal
behavior to predict crime more accurately.
GOAL DEFINITION
DSO 510 Business Analytics | Group Project
11
DEFINING OUR VARIABLES
Dependent Variables
1. Number of Crimes per Day
2. Number of Crimes per Month
3. Time Slot of Crime
4. Date of Crime
5. Severity of Crime
6. Location of Crime
Independent Variables
1. Day of Week
2. Month
3. Weather
4. Daylight
5. Income Level of District
6. Age Composition of District
7. Modes of Transportation
8. Level of Education
9. Employment of District
DSO 510 Business Analytics | Group Project
12
SUMMARY STATISTICS
• 5 Years of Data
• From August 2010
• To August 2015
• 726,245 Crimes
Reported
Monthly Statistics
DSO 510 Business Analytics | Group Project
13
MONTHLY CRIME DATA (2010 – 2015)
DSO 510 Business Analytics | Group Project
14
CRIME PATTERNS BY MONTH OF YEAR
DSO 510 Business Analytics | Group Project
15
WHICH CRIMES ARE MOST FREQUENTLY
COMMITTED?
Top 5 Crimes*
1. Theft
2. Assault
3. Vandalism
4. Drug Violation
5. Vehicle Theft
*Other Offenses, Non-Criminal
Offenses, and Warrants are excluded
DSO 510 Business Analytics | Group Project
16
CRIMES THAT DEMAND GREATER ATTENTION
Assault,
Robbery,
Missing Person
Theft,
Vandalism
Forcible Sex Offenses,
Murder,
Kidnapping
Disorderly Conduct,
Gambling,
Loitering
HIGH FREQUENCY
LOW FREQUENCY
HIGH
SEVERITY
LOW
SEVERITY
DSO 510 Business Analytics | Group Project
17
CRIMES PER DAY OF THE WEEK
• Friday and Saturday’s have
the most crimes committed
– Late night parties/Events
• Sunday and Monday’s have
the least crimes committed
– Church, Family gatherings
– Back to Work/School
DSO 510 Business Analytics | Group Project
18
CRIME PER DISTRICT
• Number of Crimes per District
• Some districts have significantly higher crime than others
• A good indicator to help SFPD deploy police forces by districts
DSO 510 Business Analytics | Group Project
19
INNER JOIN WITH DAYLIGHT DATA
Crime Data Sunrise and Sunset Data
Inner join by date
DSO 510 Business Analytics | Group Project
20
DAYLIGHT AFFECTS SOME TYPES OF CRIMES
• Crime breakdown based on day or nighttime (in percentages)
– Data eliminated our initial hypothesis that crimes are more likely committed during the night
DSO 510 Business Analytics | Group Project
21
LOOKING AHEAD
Data Manipulation
• Clean up and join other demographic data to existing data
• Categorize meaningful variables into numeric values in order to
run further statistical models
• Assign values for severity and frequency of each crime
Further Insights
• Dig deeper into crimes by district, day of week, and time of day
• Produce a spatial map of crime
DSO 510 Business Analytics | Group Project
22
PHASE III
DSO 510 Business Analytics | Group Project
23
DEFINING OUR VARIABLES
Dependent Variables
1. Number of Crimes per Day
2. Number of Crimes during the Day
3. Number of Crimes during the Night
4. Number of Crimes per Month
5. Time Slot of Crime
Independent Variables
1. Day of Week
2. Month
3. Average Temperature
4. Precipitation
5. Daylight
6. Income Level of District
7. Age Composition of District
8. Modes of Transportation
9. Level of Education
10. Employment of District
DSO 510 Business Analytics | Group Project
24
TEN CRIMES TO FOCUS ON
• Weighted based on frequency and severity of crime sentence
Frequency Low.yr High.yr Avg.yr Weight
LARCENY/THEFT 168,901 0 25 13 2,136,598
ASSAULT 62,449 1 25 13 811,837
DRUG/NARCOTIC 31,180 1 40 20 631,395
ROBBERY 18,652 15 30 23 419,670
BURGLARY 29,020 3 20 12 333,730
SEX OFFENSES,
FORCIBLE
3,927 20 100 60 235,620
FRAUD 14,237 1 25 13 185,081
VEHICLE THEFT 31,002 5 5 5 155,010
KIDNAPPING 2,162 0 100 50 108,208
WEAPON LAWS 7,444 0 20 10 74,812
DSO 510 Business Analytics | Group Project
25
CORRELATIONS
DSO 510 Business Analytics | Group Project
26
LINEAR REGRESSION MODEL
DSO 510 Business Analytics | Group Project
27
LINEAR REGRESSION MODEL
• Dependent Variable:
• Total Daily Crime
• Independent Variables:
• Day of Week
• Average Temperature
• Precipitation
• Significance level: <.0001
• R-Squared Value: 0.1956
DSO 510 Business Analytics | Group Project
28
RESIDUALS ANALYSIS
DSO 510 Business Analytics | Group Project
29
ANOVA
DSO 510 Business Analytics | Group Project
30
TEN CRIMES TO FOCUS ON
• Weighted based on frequency and severity of crime sentence
Frequency Low.yr High.yr Avg.yr Weight
LARCENY/THEFT 168,901 0 25 13 2,136,598
ASSAULT 62,449 1 25 13 811,837
DRUG/NARCOTIC 31,180 1 40 20 631,395
ROBBERY 18,652 15 30 23 419,670
BURGLARY 29,020 3 20 12 333,730
SEX OFFENSES,
FORCIBLE
3,927 20 100 60 235,620
FRAUD 14,237 1 25 13 185,081
VEHICLE THEFT 31,002 5 5 5 155,010
KIDNAPPING 2,162 0 100 50 108,208
WEAPON LAWS 7,444 0 20 10 74,812
DSO 510 Business Analytics | Group Project
31
PHASE IV
DSO 510 Business Analytics | Group Project
32
BINARY LOGISTIC REGRESSION
DSO 510 Business Analytics | Group Project
33
BINARY LOGISTIC REGRESSION
DSO 510 Business Analytics | Group Project
34
BINARY LOGISTIC REGRESSION
DSO 510 Business Analytics | Group Project
35
PREDICTIVE MODELING
DSO 510 Business Analytics | Group Project
36
PREDICTIVE MODELING
DSO 510 Business Analytics | Group Project
37
PREDICTIVE MODELING
DSO 510 Business Analytics | Group Project
38
PREDICTIVE MODELING
DSO 510 Business Analytics | Group Project
39
PREDICTIVE MODELING

More Related Content

Similar to San Francisco Crime

Crime Dataset Analysis for City of Chicago
Crime Dataset Analysis for City of ChicagoCrime Dataset Analysis for City of Chicago
Crime Dataset Analysis for City of Chicago
Stuti Deshpande
 
Police CRJ630 San Diego Police DepartmentSan Diego.docx
Police   CRJ630 San Diego Police DepartmentSan Diego.docxPolice   CRJ630 San Diego Police DepartmentSan Diego.docx
Police CRJ630 San Diego Police DepartmentSan Diego.docx
harrisonhoward80223
 
Analyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyAnalyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyCOGS Presentations
 
Database and Analytics Programming - Project report
Database and Analytics Programming - Project reportDatabase and Analytics Programming - Project report
Database and Analytics Programming - Project report
sarthakkhare3
 
NC4 Public Safety Solutions
NC4 Public Safety SolutionsNC4 Public Safety Solutions
NC4 Public Safety SolutionsRob Brandenberg
 
Survey on Crime Interpretation and Forecasting Using Machine Learning
Survey on Crime Interpretation and Forecasting Using Machine LearningSurvey on Crime Interpretation and Forecasting Using Machine Learning
Survey on Crime Interpretation and Forecasting Using Machine Learning
IRJET Journal
 
Webinar - Location Intelligence Police.pdf
Webinar - Location Intelligence Police.pdfWebinar - Location Intelligence Police.pdf
Webinar - Location Intelligence Police.pdf
Galigeo
 
DWBI - Criminalytics: Entities affecting the Rate of Crime in Republic of Ireland
DWBI - Criminalytics: Entities affecting the Rate of Crime in Republic of IrelandDWBI - Criminalytics: Entities affecting the Rate of Crime in Republic of Ireland
DWBI - Criminalytics: Entities affecting the Rate of Crime in Republic of Ireland
Shrikant Samarth
 
Hello Criminals! Meet Big Data: Preventing Crime in San Francisco by Predicti...
Hello Criminals! Meet Big Data: Preventing Crime in San Francisco by Predicti...Hello Criminals! Meet Big Data: Preventing Crime in San Francisco by Predicti...
Hello Criminals! Meet Big Data: Preventing Crime in San Francisco by Predicti...
Tarun Amarnath
 
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...Azavea
 
Crime Data Analysis and Prediction for city of Los Angeles
Crime Data Analysis and Prediction for city of Los AngelesCrime Data Analysis and Prediction for city of Los Angeles
Crime Data Analysis and Prediction for city of Los Angeles
Heta Parekh
 
Analytics-Based Crime Prediction
Analytics-Based Crime PredictionAnalytics-Based Crime Prediction
Analytics-Based Crime Prediction
Prodapt Solutions
 
Peelian Principle in a Data-Driven World, by The R Simmons Group
Peelian Principle in a Data-Driven World, by The R Simmons GroupPeelian Principle in a Data-Driven World, by The R Simmons Group
Peelian Principle in a Data-Driven World, by The R Simmons Group
The R Simmons Group
 
Peelian Principle in a Data-Driven World, By The R Simmons group
Peelian Principle in a Data-Driven World, By The R Simmons groupPeelian Principle in a Data-Driven World, By The R Simmons group
Peelian Principle in a Data-Driven World, By The R Simmons group
Rufus Simmons III,MBA
 
Smart policing and technology applications presentation
Smart policing and technology applications presentationSmart policing and technology applications presentation
Smart policing and technology applications presentation
Roel Palmaers
 
4.1Updated April-09Lecture NotesChapter 4Enterpr.docx
4.1Updated April-09Lecture NotesChapter 4Enterpr.docx4.1Updated April-09Lecture NotesChapter 4Enterpr.docx
4.1Updated April-09Lecture NotesChapter 4Enterpr.docx
lorainedeserre
 
Crime analysis and Prediction Crime analysis and Prediction
Crime analysis and Prediction Crime analysis and PredictionCrime analysis and Prediction Crime analysis and Prediction
Crime analysis and Prediction Crime analysis and Prediction
Fahad Almassoud
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
dawnrk
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
dawnrk
 

Similar to San Francisco Crime (20)

Crime Dataset Analysis for City of Chicago
Crime Dataset Analysis for City of ChicagoCrime Dataset Analysis for City of Chicago
Crime Dataset Analysis for City of Chicago
 
Police CRJ630 San Diego Police DepartmentSan Diego.docx
Police   CRJ630 San Diego Police DepartmentSan Diego.docxPolice   CRJ630 San Diego Police DepartmentSan Diego.docx
Police CRJ630 San Diego Police DepartmentSan Diego.docx
 
Analyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyAnalyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis County
 
Database and Analytics Programming - Project report
Database and Analytics Programming - Project reportDatabase and Analytics Programming - Project report
Database and Analytics Programming - Project report
 
NC4 Public Safety Solutions
NC4 Public Safety SolutionsNC4 Public Safety Solutions
NC4 Public Safety Solutions
 
Survey on Crime Interpretation and Forecasting Using Machine Learning
Survey on Crime Interpretation and Forecasting Using Machine LearningSurvey on Crime Interpretation and Forecasting Using Machine Learning
Survey on Crime Interpretation and Forecasting Using Machine Learning
 
Webinar - Location Intelligence Police.pdf
Webinar - Location Intelligence Police.pdfWebinar - Location Intelligence Police.pdf
Webinar - Location Intelligence Police.pdf
 
DWBI - Criminalytics: Entities affecting the Rate of Crime in Republic of Ireland
DWBI - Criminalytics: Entities affecting the Rate of Crime in Republic of IrelandDWBI - Criminalytics: Entities affecting the Rate of Crime in Republic of Ireland
DWBI - Criminalytics: Entities affecting the Rate of Crime in Republic of Ireland
 
Hello Criminals! Meet Big Data: Preventing Crime in San Francisco by Predicti...
Hello Criminals! Meet Big Data: Preventing Crime in San Francisco by Predicti...Hello Criminals! Meet Big Data: Preventing Crime in San Francisco by Predicti...
Hello Criminals! Meet Big Data: Preventing Crime in San Francisco by Predicti...
 
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
 
Crime Data Analysis and Prediction for city of Los Angeles
Crime Data Analysis and Prediction for city of Los AngelesCrime Data Analysis and Prediction for city of Los Angeles
Crime Data Analysis and Prediction for city of Los Angeles
 
Analytics-Based Crime Prediction
Analytics-Based Crime PredictionAnalytics-Based Crime Prediction
Analytics-Based Crime Prediction
 
Peelian Principle in a Data-Driven World, by The R Simmons Group
Peelian Principle in a Data-Driven World, by The R Simmons GroupPeelian Principle in a Data-Driven World, by The R Simmons Group
Peelian Principle in a Data-Driven World, by The R Simmons Group
 
Peelian Principle in a Data-Driven World, By The R Simmons group
Peelian Principle in a Data-Driven World, By The R Simmons groupPeelian Principle in a Data-Driven World, By The R Simmons group
Peelian Principle in a Data-Driven World, By The R Simmons group
 
Smart policing and technology applications presentation
Smart policing and technology applications presentationSmart policing and technology applications presentation
Smart policing and technology applications presentation
 
4.1Updated April-09Lecture NotesChapter 4Enterpr.docx
4.1Updated April-09Lecture NotesChapter 4Enterpr.docx4.1Updated April-09Lecture NotesChapter 4Enterpr.docx
4.1Updated April-09Lecture NotesChapter 4Enterpr.docx
 
Crime analysis and Prediction Crime analysis and Prediction
Crime analysis and Prediction Crime analysis and PredictionCrime analysis and Prediction Crime analysis and Prediction
Crime analysis and Prediction Crime analysis and Prediction
 
report
reportreport
report
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
 

Recently uploaded

一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
ocavb
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
NABLAS株式会社
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 

Recently uploaded (20)

一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单一比一原版(TWU毕业证)西三一大学毕业证成绩单
一比一原版(TWU毕业证)西三一大学毕业证成绩单
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .社内勉強会資料_LLM Agents                              .
社内勉強会資料_LLM Agents                              .
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 

San Francisco Crime

  • 1. DSO 510 Business Analytics | Group Project 1 Crime in San Francisco DSO 510 Business Analytics | Group Project Phase 4 Presentation Andrew Chen | Yile Wu | Chi Zhang | Chulsoon Pak
  • 2. DSO 510 Business Analytics | Group Project 2
  • 3. DSO 510 Business Analytics | Group Project 3 PHASE I Define business analytics proposal, data required, data analysis approach, and decision making and innovation framework
  • 4. DSO 510 Business Analytics | Group Project 4 • 2014 Population: 852,4691 • 13th most populous city in the nation • Separated into 10 districts • Top global innovation center, with highest concentration of technology- related jobs in the U.S. • Crime in San Francisco has historically been higher than the U.S. average • Crime Index of SF is rated 3 out of 100 (safer than 3% of other U.S. cities)2 BACKGROUND: SAN FRANCISCO 1. United States Census Bureau, July 2014 2. "Crime rates for San Francisco, CA", NeighborhoodScout, 2013
  • 5. DSO 510 Business Analytics | Group Project 5 In order to make San Francisco a safer place, we aim identify factors that promote criminal behavior to predict crime more accurately. GOAL DEFINITION
  • 6. DSO 510 Business Analytics | Group Project 6 DEFINING OUR VARIABLES Dependent Variables 1. Number of Crimes 2. Time of Crime 3. Date of Crime 4. Severity of Crime 5. Type of Crime 6. Location of Crime Independent Variables 1. Day of Week 2. Season of Year 3. Weather 4. Daylight 5. Income Level of District 6. Average Housing Price of District 7. Age Composition of District 8. Population density 9. Degree of Urbanization 10. Modes of Transportation 11. Level of Education 12. Divorce rate of Families
  • 7. DSO 510 Business Analytics | Group Project 7 • San Francisco Crime Data: • https://data.sfgov.org/Public-Safety/ • 700,000+ data points (5 years of data) • San Francisco Weather, Population, Housing, Hazard Risk, and Demographics • www.sfclimatehealth.org/ • http://aa.usno.navy.mil/data/docs/RS_ OneYear.php • San Francisco Housing, Income Level, Employment, and Transportation by District • www.sf-planning.org DATA COLLECTION
  • 8. DSO 510 Business Analytics | Group Project 8 INTERPRETATION & ACTION 1. Identify to what degree different variables contribute to crime 2. Predict probability and severity of crimes in terms of time and location Implementation 1. Assist SFPD in efficient deployment of its police force 2. Integrate data with mapping algorithms to provide the safest real-time routes 3. Organize anti-crime education in high crime areas (how to handle crimes under different situations) 4. Utilize data in product development and marketing of security-related products 5. Enhance San Francisco city-planning to reduce crimes
  • 9. DSO 510 Business Analytics | Group Project 9 PHASE II
  • 10. DSO 510 Business Analytics | Group Project 10 In order to make San Francisco a safer place, we aim identify factors that promote criminal behavior to predict crime more accurately. GOAL DEFINITION
  • 11. DSO 510 Business Analytics | Group Project 11 DEFINING OUR VARIABLES Dependent Variables 1. Number of Crimes per Day 2. Number of Crimes per Month 3. Time Slot of Crime 4. Date of Crime 5. Severity of Crime 6. Location of Crime Independent Variables 1. Day of Week 2. Month 3. Weather 4. Daylight 5. Income Level of District 6. Age Composition of District 7. Modes of Transportation 8. Level of Education 9. Employment of District
  • 12. DSO 510 Business Analytics | Group Project 12 SUMMARY STATISTICS • 5 Years of Data • From August 2010 • To August 2015 • 726,245 Crimes Reported Monthly Statistics
  • 13. DSO 510 Business Analytics | Group Project 13 MONTHLY CRIME DATA (2010 – 2015)
  • 14. DSO 510 Business Analytics | Group Project 14 CRIME PATTERNS BY MONTH OF YEAR
  • 15. DSO 510 Business Analytics | Group Project 15 WHICH CRIMES ARE MOST FREQUENTLY COMMITTED? Top 5 Crimes* 1. Theft 2. Assault 3. Vandalism 4. Drug Violation 5. Vehicle Theft *Other Offenses, Non-Criminal Offenses, and Warrants are excluded
  • 16. DSO 510 Business Analytics | Group Project 16 CRIMES THAT DEMAND GREATER ATTENTION Assault, Robbery, Missing Person Theft, Vandalism Forcible Sex Offenses, Murder, Kidnapping Disorderly Conduct, Gambling, Loitering HIGH FREQUENCY LOW FREQUENCY HIGH SEVERITY LOW SEVERITY
  • 17. DSO 510 Business Analytics | Group Project 17 CRIMES PER DAY OF THE WEEK • Friday and Saturday’s have the most crimes committed – Late night parties/Events • Sunday and Monday’s have the least crimes committed – Church, Family gatherings – Back to Work/School
  • 18. DSO 510 Business Analytics | Group Project 18 CRIME PER DISTRICT • Number of Crimes per District • Some districts have significantly higher crime than others • A good indicator to help SFPD deploy police forces by districts
  • 19. DSO 510 Business Analytics | Group Project 19 INNER JOIN WITH DAYLIGHT DATA Crime Data Sunrise and Sunset Data Inner join by date
  • 20. DSO 510 Business Analytics | Group Project 20 DAYLIGHT AFFECTS SOME TYPES OF CRIMES • Crime breakdown based on day or nighttime (in percentages) – Data eliminated our initial hypothesis that crimes are more likely committed during the night
  • 21. DSO 510 Business Analytics | Group Project 21 LOOKING AHEAD Data Manipulation • Clean up and join other demographic data to existing data • Categorize meaningful variables into numeric values in order to run further statistical models • Assign values for severity and frequency of each crime Further Insights • Dig deeper into crimes by district, day of week, and time of day • Produce a spatial map of crime
  • 22. DSO 510 Business Analytics | Group Project 22 PHASE III
  • 23. DSO 510 Business Analytics | Group Project 23 DEFINING OUR VARIABLES Dependent Variables 1. Number of Crimes per Day 2. Number of Crimes during the Day 3. Number of Crimes during the Night 4. Number of Crimes per Month 5. Time Slot of Crime Independent Variables 1. Day of Week 2. Month 3. Average Temperature 4. Precipitation 5. Daylight 6. Income Level of District 7. Age Composition of District 8. Modes of Transportation 9. Level of Education 10. Employment of District
  • 24. DSO 510 Business Analytics | Group Project 24 TEN CRIMES TO FOCUS ON • Weighted based on frequency and severity of crime sentence Frequency Low.yr High.yr Avg.yr Weight LARCENY/THEFT 168,901 0 25 13 2,136,598 ASSAULT 62,449 1 25 13 811,837 DRUG/NARCOTIC 31,180 1 40 20 631,395 ROBBERY 18,652 15 30 23 419,670 BURGLARY 29,020 3 20 12 333,730 SEX OFFENSES, FORCIBLE 3,927 20 100 60 235,620 FRAUD 14,237 1 25 13 185,081 VEHICLE THEFT 31,002 5 5 5 155,010 KIDNAPPING 2,162 0 100 50 108,208 WEAPON LAWS 7,444 0 20 10 74,812
  • 25. DSO 510 Business Analytics | Group Project 25 CORRELATIONS
  • 26. DSO 510 Business Analytics | Group Project 26 LINEAR REGRESSION MODEL
  • 27. DSO 510 Business Analytics | Group Project 27 LINEAR REGRESSION MODEL • Dependent Variable: • Total Daily Crime • Independent Variables: • Day of Week • Average Temperature • Precipitation • Significance level: <.0001 • R-Squared Value: 0.1956
  • 28. DSO 510 Business Analytics | Group Project 28 RESIDUALS ANALYSIS
  • 29. DSO 510 Business Analytics | Group Project 29 ANOVA
  • 30. DSO 510 Business Analytics | Group Project 30 TEN CRIMES TO FOCUS ON • Weighted based on frequency and severity of crime sentence Frequency Low.yr High.yr Avg.yr Weight LARCENY/THEFT 168,901 0 25 13 2,136,598 ASSAULT 62,449 1 25 13 811,837 DRUG/NARCOTIC 31,180 1 40 20 631,395 ROBBERY 18,652 15 30 23 419,670 BURGLARY 29,020 3 20 12 333,730 SEX OFFENSES, FORCIBLE 3,927 20 100 60 235,620 FRAUD 14,237 1 25 13 185,081 VEHICLE THEFT 31,002 5 5 5 155,010 KIDNAPPING 2,162 0 100 50 108,208 WEAPON LAWS 7,444 0 20 10 74,812
  • 31. DSO 510 Business Analytics | Group Project 31 PHASE IV
  • 32. DSO 510 Business Analytics | Group Project 32 BINARY LOGISTIC REGRESSION
  • 33. DSO 510 Business Analytics | Group Project 33 BINARY LOGISTIC REGRESSION
  • 34. DSO 510 Business Analytics | Group Project 34 BINARY LOGISTIC REGRESSION
  • 35. DSO 510 Business Analytics | Group Project 35 PREDICTIVE MODELING
  • 36. DSO 510 Business Analytics | Group Project 36 PREDICTIVE MODELING
  • 37. DSO 510 Business Analytics | Group Project 37 PREDICTIVE MODELING
  • 38. DSO 510 Business Analytics | Group Project 38 PREDICTIVE MODELING
  • 39. DSO 510 Business Analytics | Group Project 39 PREDICTIVE MODELING

Editor's Notes

  1. 10 districts BAYVIEW CENTRAL INGLESIDE MISSION NORTHERN PARK RICHMOND SOUTHERN TARAVAL TENDERLOIN
  2. 10 districts BAYVIEW CENTRAL INGLESIDE MISSION NORTHERN PARK RICHMOND SOUTHERN TARAVAL TENDERLOIN
  3. Interpretation, Action, and Feedback Describe interpretation of the data analysis and modeling in the context of the business goal Outline the possible options and/or decisions available to the business based on the data analysis and modeling - Adjustments/Improvements to SFPD police deployment - Connect with map/navigation tools to provide safest routes in real-time - Contribute to city-planning of SF municipal (by identifying factors that add to- vs. reduce- chances of crime) 1. the analysis could help San Francisco city to launch anticrime education in high crime rate areas. Citizens would realize how to handle crime under different situation. 2. the analysis could help certain anticrime products company to locate their potential customers. According to different crime type, companies could sell products related to "house security", "vehicle alarm", and "self-defense weapon". (aka Amazon)
  4. John
  5. John
  6. Andrew
  7. Charles These crimes would be weighted higher in terms of our model.
  8. Charles
  9. Dylan
  10. Dylan
  11. Types of Crime to Analyze Charles
  12. Types of Crime to Analyze Charles
  13. MISSING – TESTING global null hypothesis Analysis of Maximum Likelihood Estimates
  14. Only significant for Friday, Saturday, and Sunday
  15. MISSING Association of Predicted Probabilities and Observed Responses ROC Plot
  16. MISSING
  17. Cumulative Lift is Missing and Bottom Two Statistics are missing as well ROC Plot
  18. Cumulative Lift is Missing and Bottom Two Statistics are missing as well ROC Plot