Safety is a major concern everywhere, and many crimes occur every day. Analyzing crime data can reveal the frequency of crimes, the types of crimes, and the areas with higher numbers of crimes. These insights have the potential to support proactive preventive measures by police, increasing the level of safety in certain areas. To add a different dimension to the analysis, we took California State University Los Angeles as our focal point and projected the data on parameters such as time and distance. This yields key findings about crimes occurring around California State University Los Angeles and in Los Angeles as a whole.
1. Jongwook Woo
HiPIC
CSULA
Crime rate data analysis in Los Angeles
Presented by,
Donda, Ram Dharan
Puli, Sridhar Reddy
Advised by,
Dr Jongwook Woo
2. High Performance Information Computing Center
Jongwook Woo
CSULA
Contents
Introduction
Microsoft Azure HDInsight Cluster Details
Raw data projection
Detailed analysis of Crime Data
Conclusion
Data set
3. Introduction
Day-to-day exponential growth in crime
The US ranks 44th with a crime index of 50.15%, despite the technology available
Total reported crimes in the USA were 377.76 million in 2012-15
4. Specifications of Data Set
Data was collected from the Los Angeles Police Department (LAPD)
Offenses such as Criminal, Vandalism, Burglary, Assault, Traffic and Theft that occurred in 2012-15 are analysed
File Size – 151 MB
Number of Files – 1
File Format – CSV (Comma Separated Values)
Total Number of Offenses – 8.94 million
5. Microsoft Azure HDInsight Cluster Details
Number of data nodes – 2
CPU – 4 cores
Memory – 14 GB
Operating system – Windows Server 2012
6. Projection of Raw Data
[Chart: series year2012, year2013, year2014, year2015; value axis 0 to 90,000]
7. Total No. of Crimes in 2012-15
[Chart: series year2012, year2013, year2014, year2015; value axis 0 to 25,000]
8. Map of crimes that occurred within 5 miles of CSULA, UCLA and USC in 2015
9. Map of crimes that occurred within 5 miles of CSULA, UCLA and USC in 2014
10. Map of crimes that occurred within 5 miles of CSULA, UCLA and USC in 2013
11. No. of crimes for every 5 miles from CSULA
[Chart: distance bands in miles 0-5, 5-10, 10-15, 15-20, 20-25, 25-30, 30-35, >35; series csula_2012, csula_2013, csula_2014, csula_2015; value axis 0 to 90,000]
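Counts per 5-mile band like those charted here can be derived from each incident's coordinates. The following is a minimal Python sketch, not the method used in the slides: it assumes each record carries a latitude and longitude, and it measures great-circle (haversine) distance, since the slides do not say how distance was calculated. The campus coordinates are approximate, and all function names are hypothetical.

```python
import math
from collections import Counter

EARTH_RADIUS_MILES = 3958.8  # mean Earth radius

# Approximate campus location; an assumption for illustration only.
CSULA = (34.0668, -118.1684)


def haversine_miles(lat1, lon1, lat2, lon2):
    """Great-circle distance in miles between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_MILES * math.asin(math.sqrt(a))


def distance_band(miles, width=5, last_edge=35):
    """Return a band label such as '0-5', or '>35' at or beyond last_edge."""
    if miles >= last_edge:
        return f">{last_edge}"
    low = int(miles // width) * width
    return f"{low}-{low + width}"


def count_by_band(points, center=CSULA):
    """Count (lat, lon) points per distance band measured from center."""
    return Counter(
        distance_band(haversine_miles(center[0], center[1], lat, lon))
        for lat, lon in points
    )
```

Running `count_by_band` once per year and once per campus would give one series per legend entry; the band width and the cutoff of the last band are parameters because the charts use different cutoffs.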
12. No. of crimes for every 5 miles from UCLA
[Chart: distance bands in miles 0-5, 5-10, 10-15, 15-20, 20-25, 25-30, 30-35, 35-40, >40; series ucla_2012, ucla_2013, ucla_2014, ucla_2015; value axis 0 to 120,000]
13. No. of crimes for every 5 miles from USC
[Chart: distance bands in miles 0-5, 5-10, 10-15, 15-20, 20-25, 25-30, 30-35, 35-40, >40; series usc_2012, usc_2013, usc_2014, usc_2015; value axis 0 to 120,000]
14. Comparison of crimes for every 5 miles from CSULA, UCLA and USC in 2012
[Chart: series csula_2012, ucla_2012, usc_2012; value axis 0 to 120,000]
15. Comparison of crimes for every 5 miles from CSULA, UCLA and USC in 2013
[Chart: distance bands in miles 0-5, 5-10, 10-15, 15-20, 20-25, 25-30, 30-35, 35-40, >40; series csula_2013, ucla_2013, usc_2013; value axis 0 to 120,000]
16. Comparison of crimes for every 5 miles from CSULA, UCLA and USC in 2014
[Chart: distance bands in miles 0-5, 5-10, 10-15, 15-20, 20-25, 25-30, 30-35, >35; series csula_2014, ucla_2014, usc_2014; value axis 0 to 120,000]
17. Comparison of crimes for every 5 miles from CSULA, UCLA and USC in 2015
[Chart: distance bands in miles 0-5, 5-10, 10-15, 15-20, 20-25, 25-30, 30-35, 35-40, 40-50, >50; series csula_2015, ucla_2015, usc_2015; value axis 0 to 120,000]
18. No. of crimes per area in LA
[Chart: areas 77th Street, Mission, Newton, Rampart, Southwest, Topanga, Van Nuys, Wilshire, Central, Devonshire, Foothill, Harbor, Hollenbeck, Hollywood, N Hollywood, Pacific, West Valley, Northeast, Olympic, Southeast, West LA; series in2012, in2013, in2014, in2015; value axis 0 to 18,000]
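A per-area, per-year tally of this kind can be sketched in Python as follows. This is an illustration under stated assumptions, not the HQL used in the project: the column names `area` and `date`, the `MM/DD/YYYY` date format, and the three sample rows are all made up for the example and may not match the LAPD file.

```python
import csv
import io
from collections import Counter


def crimes_per_area_and_year(csv_file, area_col="area", date_col="date"):
    """Count rows per (area, year).

    Column names are hypothetical; dates are assumed to be MM/DD/YYYY.
    """
    counts = Counter()
    for row in csv.DictReader(csv_file):
        # The year is the part after the last '/' in an MM/DD/YYYY date.
        year = int(row[date_col].rsplit("/", 1)[1])
        counts[(row[area_col], year)] += 1
    return counts


# Made-up rows, for illustration only.
sample = io.StringIO(
    "area,date\n"
    "Central,01/15/2014\n"
    "Central,03/02/2014\n"
    "Pacific,07/04/2015\n"
)
counts = crimes_per_area_and_year(sample)
```

With the real file, the same function could be called on an open file handle instead of the in-memory sample.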
19. Total no. of crimes for every 2 hours in LA
[Chart: areas 77th Street, Mission, Newton, Rampart, Southwest, Topanga, Van Nuys, Wilshire, Central, Devonshire, Foothill, Harbor, Hollenbeck, Hollywood, N Hollywood, Pacific, West Valley, Northeast, Olympic, Southeast, West LA; series in2012, in2013, in2014, in2015; value axis 0 to 18,000]
20. No. of crimes for every 2 hours within 5 miles of CSULA, UCLA and USC in 2012
[Chart: series csula, ucla, usc; value axis 0 to 10,000]
21. No. of crimes for every 2 hours within 5 miles of CSULA, UCLA and USC in 2013
[Chart: series csula, ucla, usc; value axis 0 to 12,000]
22. No. of crimes for every 2 hours within 5 miles of CSULA, UCLA and USC in 2014
[Chart: series Series1, Series2, Series3; value axis 0 to 12,000]
23. No. of crimes for every 2 hours within 5 miles of CSULA, UCLA and USC in 2015
[Chart: 2-hour windows 00:00-02:00, 02:00-04:00, 04:00-06:00, 06:00-08:00, 08:00-10:00, 10:00-12:00, 12:00-14:00, 14:00-16:00, 16:00-18:00, 18:00-20:00, 20:00-22:00, 22:00-24:00; series usc, ucla, csula; value axis 0 to 12,000]
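The 2-hour windows used in these charts can be produced by bucketing each incident's time of occurrence. A minimal Python sketch follows; it assumes times are available as 24-hour `HH:MM` strings, which may differ from how the LAPD file stores them, and the function names are hypothetical.

```python
from collections import Counter


def two_hour_window(hhmm):
    """Map a 24-hour 'HH:MM' time to a label such as '06:00-08:00'."""
    hour = int(hhmm.split(":")[0])
    if not 0 <= hour <= 23:
        raise ValueError(f"hour out of range in {hhmm!r}")
    start = hour - hour % 2  # round down to the nearest even hour
    return f"{start:02d}:00-{start + 2:02d}:00"


def count_by_window(times):
    """Count 'HH:MM' times per 2-hour window."""
    return Counter(two_hour_window(t) for t in times)
```

Filtering incidents to those within 5 miles of a campus before calling `count_by_window` would give one series per campus.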
24. Conclusion
The crime rate is higher in the heart of the city than in other regions of LA
The average crime rate in 2014 is 20312.5
Nearly 119 kinds of crimes are reported
Microsoft Azure allowed us to process the entire data set at minimal cost
The large storage space in the cloud let Hadoop store the data without any data loss
HQL made it simple to extract the data from HDFS
25. Data Set details
https://data.lacity.org/A-Safe-City/LAPD-Crime-and-Collision-Raw-Data-2014/eta5-h8qx