Crime project (Big Data Certification Course #6)


Work of the Crime Project group in the Big Data Certification course, class 6, presented on 20 January 2018 (B.E. 2561).

Published in: Technology

  1. Business Case • Data Understanding • Data Preparation • Model Analysis • Model Evaluation
  2. • Explore potential determinants of the arrest rate • Predict the probability of arrest • Provide insights on arrest performance • Build a prediction model of arrest
  3. Column Name: Description
     • Date: Date when the incident occurred. This is sometimes a best estimate.
     • Primary Type: The primary description of the crime.
     • Description: The secondary description of the crime.
     • Location Description: Description of the location where the incident occurred.
     • Arrest: Indicates whether an arrest was made.
     • Domestic: Indicates whether the incident was domestic-related.
     • Beat: Indicates the beat where the incident occurred.
     • District: Indicates the police district where the incident occurred.
     • Ward: The ward (City Council district) where the incident occurred.
     • Community Area: Indicates the community area where the incident occurred.
     • Year: Year the incident occurred.
     • Updated On: Date and time the record was last updated.
     • Latitude: The latitude of the location where the incident occurred.
     • Longitude: The longitude of the location where the incident occurred.
     • Location: The location where the incident occurred.
     * Data source: CSV file
     * Data characteristics: Volume (6 million records)
     * Data format: Structured / Batch
  4. • Derive the day of week of each crime • Split the data 80:20 • Transform arrest data into an arrest ratio • Transform categorical variables into numerical ones (Primary Type, Location) • Index the arrest-rate label
  5. • Data: features (Day of week, Primary Type, Location Description) and label (Arrest) • Category: Classification • Algorithm: Decision Tree • Confusion matrix for model evaluation
  6. Presentation • Relation of crime factors • Danger & Safe Zone • Prediction Model Proportion • Arrest • Planning day for safety • Location analysis
  7. Location Description Map
  8. Location Description Type
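
The data preparation and evaluation steps named in slides 4 and 5 can be sketched roughly as follows. This is only an illustration in standard-library Python, not the group's actual pipeline. The slides do not say which tools were used. The sample rows, the timestamp format, the function names, and the random seed are all assumptions made for the example. The decision tree itself is not shown.

```python
import random
from collections import Counter
from datetime import datetime


def day_of_week(date_text, fmt="%m/%d/%Y %I:%M:%S %p"):
    # Assumed timestamp layout; returns 0 = Monday ... 6 = Sunday.
    return datetime.strptime(date_text, fmt).weekday()


def build_index(values):
    # Categorical -> numerical: the most frequent category gets 0,
    # ties are broken alphabetically.
    counts = Counter(values)
    ordered = sorted(counts, key=lambda v: (-counts[v], v))
    return {value: i for i, value in enumerate(ordered)}


def prepare(rows):
    # Build (day of week, primary type index, location index) features
    # and a 0/1 arrest label for every row.
    type_index = build_index(r["Primary Type"] for r in rows)
    place_index = build_index(r["Location Description"] for r in rows)
    features = [
        (
            day_of_week(r["Date"]),
            type_index[r["Primary Type"]],
            place_index[r["Location Description"]],
        )
        for r in rows
    ]
    labels = [1 if r["Arrest"] else 0 for r in rows]
    return features, labels


def arrest_ratio(labels):
    # Share of incidents that ended in an arrest.
    return sum(labels) / len(labels)


def split_80_20(items, seed=42):
    # Shuffle reproducibly, then cut at 80% of the records.
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 8 // 10
    return shuffled[:cut], shuffled[cut:]


def confusion_matrix(actual, predicted):
    # Count true/false positives and negatives for binary labels.
    matrix = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for a, p in zip(actual, predicted):
        if a == 1:
            matrix["tp" if p == 1 else "fn"] += 1
        else:
            matrix["fp" if p == 1 else "tn"] += 1
    return matrix


# Hypothetical rows; the real dataset has about 6 million records.
rows = [
    {"Date": "01/20/2018 11:30:00 PM", "Primary Type": "THEFT",
     "Location Description": "STREET", "Arrest": False},
    {"Date": "01/21/2018 09:15:00 AM", "Primary Type": "BATTERY",
     "Location Description": "APARTMENT", "Arrest": True},
    {"Date": "01/22/2018 02:00:00 PM", "Primary Type": "THEFT",
     "Location Description": "STREET", "Arrest": True},
    {"Date": "01/23/2018 08:45:00 PM", "Primary Type": "NARCOTICS",
     "Location Description": "SIDEWALK", "Arrest": True},
    {"Date": "01/24/2018 01:10:00 AM", "Primary Type": "THEFT",
     "Location Description": "APARTMENT", "Arrest": False},
]

features, labels = prepare(rows)
train, test = split_80_20(list(zip(features, labels)))
```

In a full workflow, a classifier would be trained on `train`, and `confusion_matrix` would then compare its predictions on `test` with the true arrest labels.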