SlideShare a Scribd company logo
1 of 23
FATAL OR INJURY- A CASE OF DECIDING
ON PRIORITIZING RESPONDER RESOURCES
By
Piyush Lohana
Maximum accidents in the year 2007 happened due to motor vehicles.
WHY THIS PROJECT
• “Every 12 minutes someone dies in a car crash in the United States due to a car accident or a collision
between two motor vehicles.” (-NCIPC)
• Most of times the accidents are fatal or involve serious injuries and by the time the help arrives at the crash
site, a lot of loss has been done.
• We attempt to build a model that can predict the seriousness of an accident case (i.e. if an accident is fatal
or results in injury) based on the various predictors like rush or no rush hour, work zone, weather
conditions, speed limits, interstate etc.
• This helps to prioritize situations and allocates resources in scenarios where there is a high possibility of an
accident resulting in fatalities or serious injury.
• This will enable the emergency care provider on focusing on the measures and resource that can be taken
when they arrive at the scene. The accuracy of pre-hospital crash scene details and crash victim assessment
has important implications on the care that can be provided at the time of the crash scene.
WHAT ARE WE CONSIDERING
• We will be looking at the characteristics of the environment in which the accident
occurred (weather, road condition, type of road, time of day, the day of the week, and
month of the year) and the characteristics of the crash (direction of accident, speed
limit on the road, work zone area, and how many vehicles were involved).
• All of these variables can effect in what kind of accident has occurred (no injury,
injury or fatal). This can further help the medic’s team to come prepared for the
necessary actions that need to be taken at the scene.
DATA SOURCE
• http://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=1158
• It has 24 different attributes and 42,183 records
• Identified Predictor and Outcome Variables
CLEAR DESCRIPTION OF DATA SET
Sl. No Variables Description
1 HOUR_I_R 1=rush hour, 0=not (rush = 6-9 am, 4-7 pm)
2 ALIGN_I 1 = straight, 2 = curve
3
STRATUM_R
1= NASS Crashes Involving At Least One Passenger
Vehicle towed due to damage from the crash scene and no
medium or heavy trucks are Involved, 0=not
4 WRK_ZONE 1= yes, 0= no
5 WKDY_I_R 1=weekday, 0=weekend
6 INT_HWY Interstate? 1=yes, 0=no
7
LGTCON_I_R
Light conditions - 1=day, 2=dark (including dawn/dusk),
3=dark, but lighted,4=dawn or dusk
8 MAN_COL_I 0=no collision, 1=head-on, 2=other form of collision
9 PED_ACC_R 1=pedestrian/cyclist involved, 0=not
10
REL_JCT_I_R
1=accident at intersection/interchange, 0=not at
intersection
CLEAR DESCRIPTION OF DATA SET
Sl. No Variables Description
11 SPD_LIM Speed limit, miles per hour
12
SUR_CON
Surface conditions (1=dry, 2=wet, 3=snow/slush, 4=ice,
5=sand/dirt/oil, 8=other, 9=unknown)
13 TRAF_WAY 1=two-way traffic, 2=divided hwy, 3=one-way road
14 VEH_INVL Number of vehicles involved
15
WEATHER_R
1=no adverse conditions, 2= rain, snow or other adverse
condition
16 INJURY_CRASH 1=yes, 0= no
17 NO_INJ_I Number of injuries
18 FATALITIES 1= yes, 0= no
19 MAX_SEV_IR 0=no injury, 1=non-fatal inj., 2=fatal inj.
FILTERING DATA
• Filtering method used is "Standard Deviations from the
Mean",
• This will eliminate the observations that are farther than
three standard deviations from their means.
DATA PARTITIONING
• We build the model with Training Data
• Test its correctness with Test Data
• Validate it with Validation Data
PREDICT, CLASSIFY OR CLUSTER ?
As we are trying to predict the categorical class label MAX_SER_INJ, our analysis is
supervised classification.
Our model intends to discover relationships between the attributes that would make it
possible to predict the outcome variable.
MODEL
The following three models are used for our analysis
• Memory Based Reasoning(MBR)
• Decision Trees
• Logistic Regression
FINAL MODEL
RESULTS AND DISCUSSION
BASELINE MISCLASSIFICATION
• MAX_SEV_IR- 0=no injury, 1=non-fatal inj., 2=fatal inj.
• Class 0 (No injury): 4949
• Class 1(Non-fatal injury): 4900
• Class 2 (Fatal Injury): 150
• The majority class is 0 (No injury)
• The percentage of majority class in the dataset is: 49.49 % (4949/9999)
• The baseline misclassification rate: 50.51 %
• This is the baseline, the model that we build will make any sense if its
misclassification rate is less than baseline misclassification.
OUR DEFINITION OF BEST MODEL AS PER BUSINESS
REQUIREMENT
• Decision Tree : A supervised learning data driven method for classification
• It is based on separating observations into more homogeneous subgroups by creating splits
on predictors.
• As Per our business requirement , this model is best in classifying the event of accident into
three cases to prioritize resources.
RESULTS
The _MISC_ Misclassification rate :
• Training: 0.40945
• Validation: 0.4113
• Test: 0.42305
NODE RULES
INTERPRETATION AND IMPLEMENTATION
• Based on this rules, an application/website can be created which upon
entering all the 5 most important factors(Predictors) will give an idea of the
percentage of chances of an accident resulting in Fatality/Injury/No Injury.
• The emergency service provider can then take a decision and send the
response team to the site of an accident accordingly.
BLUE PRINT OF IMPLEMENTATION
OUTCOME
• Depending on the Node Rule, it will predict the outcome
• Red Cross predict’s there are 80% chances of Injury
• Red Cross predict’s there are 10 % chances of Fatality
• Red Cross predict’s there are 10 % chances of No injury
SCOPE FOR IMPROVEMENT
• In order to build more focused and rigorous model, we are working on identifying more predictors that
can help determine the status of accident and a more clean model that has a less misclassification.
• In order to achieve this, we intend to try Neural Network data mining algorithm.
THANK YOU

More Related Content

Similar to Data Mining Project-Predicting Injury or Fatality in case of an accident

Storm Prediction data analysis using R/SAS
Storm Prediction data analysis using R/SASStorm Prediction data analysis using R/SAS
Storm Prediction data analysis using R/SASGautam Sawant
 
IRJET- Road Traffic Prediction using Machine Learning
IRJET- Road Traffic Prediction using Machine LearningIRJET- Road Traffic Prediction using Machine Learning
IRJET- Road Traffic Prediction using Machine LearningIRJET Journal
 
Ieeepro techno solutions 2013 ieee embedded project decision making in coll...
Ieeepro techno solutions   2013 ieee embedded project decision making in coll...Ieeepro techno solutions   2013 ieee embedded project decision making in coll...
Ieeepro techno solutions 2013 ieee embedded project decision making in coll...srinivasanece7
 
EVALUATION OF PARTICLE SWARM OPTIMIZATION ALGORITHM IN PREDICTION OF THE CAR ...
EVALUATION OF PARTICLE SWARM OPTIMIZATION ALGORITHM IN PREDICTION OF THE CAR ...EVALUATION OF PARTICLE SWARM OPTIMIZATION ALGORITHM IN PREDICTION OF THE CAR ...
EVALUATION OF PARTICLE SWARM OPTIMIZATION ALGORITHM IN PREDICTION OF THE CAR ...ijcsa
 
Machine Learning statistical model using Transportation data
Machine Learning statistical model using Transportation dataMachine Learning statistical model using Transportation data
Machine Learning statistical model using Transportation datajagan477830
 
Federal Highway Administration Initiative and Proven Countermeasures
Federal Highway Administration Initiative and Proven CountermeasuresFederal Highway Administration Initiative and Proven Countermeasures
Federal Highway Administration Initiative and Proven CountermeasuresTexas A&M Transportation Institute
 
Modeling Road Traffic Accidents
Modeling Road Traffic AccidentsModeling Road Traffic Accidents
Modeling Road Traffic AccidentsJulius Nyerere
 
Decentralized system to compute safest route
Decentralized system to compute safest routeDecentralized system to compute safest route
Decentralized system to compute safest routeAnushka Patil
 
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...IRJET Journal
 
IRJET- Measuring The Driver's Perception Error in the Traffic Accident Risk E...
IRJET- Measuring The Driver's Perception Error in the Traffic Accident Risk E...IRJET- Measuring The Driver's Perception Error in the Traffic Accident Risk E...
IRJET- Measuring The Driver's Perception Error in the Traffic Accident Risk E...IRJET Journal
 
An efficient automotive collision avoidance system for indian traffic conditions
An efficient automotive collision avoidance system for indian traffic conditionsAn efficient automotive collision avoidance system for indian traffic conditions
An efficient automotive collision avoidance system for indian traffic conditionseSAT Journals
 
Accident Prediction System Using Machine Learning
Accident Prediction System Using Machine LearningAccident Prediction System Using Machine Learning
Accident Prediction System Using Machine LearningIRJET Journal
 
RISK-ACADEMY’s guide on compliance risk in non-financial companies. Free down...
RISK-ACADEMY’s guide on compliance risk in non-financial companies. Free down...RISK-ACADEMY’s guide on compliance risk in non-financial companies. Free down...
RISK-ACADEMY’s guide on compliance risk in non-financial companies. Free down...Alexei Sidorenko, CRMP
 
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...IRJET Journal
 

Similar to Data Mining Project-Predicting Injury or Fatality in case of an accident (20)

Storm Prediction data analysis using R/SAS
Storm Prediction data analysis using R/SASStorm Prediction data analysis using R/SAS
Storm Prediction data analysis using R/SAS
 
IRJET- Road Traffic Prediction using Machine Learning
IRJET- Road Traffic Prediction using Machine LearningIRJET- Road Traffic Prediction using Machine Learning
IRJET- Road Traffic Prediction using Machine Learning
 
Ieeepro techno solutions 2013 ieee embedded project decision making in coll...
Ieeepro techno solutions   2013 ieee embedded project decision making in coll...Ieeepro techno solutions   2013 ieee embedded project decision making in coll...
Ieeepro techno solutions 2013 ieee embedded project decision making in coll...
 
EVALUATION OF PARTICLE SWARM OPTIMIZATION ALGORITHM IN PREDICTION OF THE CAR ...
EVALUATION OF PARTICLE SWARM OPTIMIZATION ALGORITHM IN PREDICTION OF THE CAR ...EVALUATION OF PARTICLE SWARM OPTIMIZATION ALGORITHM IN PREDICTION OF THE CAR ...
EVALUATION OF PARTICLE SWARM OPTIMIZATION ALGORITHM IN PREDICTION OF THE CAR ...
 
Presentation 5.pptx
Presentation 5.pptxPresentation 5.pptx
Presentation 5.pptx
 
Machine Learning statistical model using Transportation data
Machine Learning statistical model using Transportation dataMachine Learning statistical model using Transportation data
Machine Learning statistical model using Transportation data
 
Summer Program on Transportation Statistics, Why Highway Crashes Have Recurri...
Summer Program on Transportation Statistics, Why Highway Crashes Have Recurri...Summer Program on Transportation Statistics, Why Highway Crashes Have Recurri...
Summer Program on Transportation Statistics, Why Highway Crashes Have Recurri...
 
Summer Program on Transportation Statistics, What governs Highway Crashes Rec...
Summer Program on Transportation Statistics, What governs Highway Crashes Rec...Summer Program on Transportation Statistics, What governs Highway Crashes Rec...
Summer Program on Transportation Statistics, What governs Highway Crashes Rec...
 
Federal Highway Administration Initiative and Proven Countermeasures
Federal Highway Administration Initiative and Proven CountermeasuresFederal Highway Administration Initiative and Proven Countermeasures
Federal Highway Administration Initiative and Proven Countermeasures
 
Modeling Road Traffic Accidents
Modeling Road Traffic AccidentsModeling Road Traffic Accidents
Modeling Road Traffic Accidents
 
Decentralized system to compute safest route
Decentralized system to compute safest routeDecentralized system to compute safest route
Decentralized system to compute safest route
 
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
 
SunGard Risk Assessment Module
SunGard Risk Assessment ModuleSunGard Risk Assessment Module
SunGard Risk Assessment Module
 
IRJET- Measuring The Driver's Perception Error in the Traffic Accident Risk E...
IRJET- Measuring The Driver's Perception Error in the Traffic Accident Risk E...IRJET- Measuring The Driver's Perception Error in the Traffic Accident Risk E...
IRJET- Measuring The Driver's Perception Error in the Traffic Accident Risk E...
 
An efficient automotive collision avoidance system for indian traffic conditions
An efficient automotive collision avoidance system for indian traffic conditionsAn efficient automotive collision avoidance system for indian traffic conditions
An efficient automotive collision avoidance system for indian traffic conditions
 
Accident Prediction System Using Machine Learning
Accident Prediction System Using Machine LearningAccident Prediction System Using Machine Learning
Accident Prediction System Using Machine Learning
 
RISK-ACADEMY’s guide on compliance risk in non-financial companies. Free down...
RISK-ACADEMY’s guide on compliance risk in non-financial companies. Free down...RISK-ACADEMY’s guide on compliance risk in non-financial companies. Free down...
RISK-ACADEMY’s guide on compliance risk in non-financial companies. Free down...
 
DataMining_CA2-4
DataMining_CA2-4DataMining_CA2-4
DataMining_CA2-4
 
Summer Program on Transportation Statistics, Assessing Crash Risk for Highly ...
Summer Program on Transportation Statistics, Assessing Crash Risk for Highly ...Summer Program on Transportation Statistics, Assessing Crash Risk for Highly ...
Summer Program on Transportation Statistics, Assessing Crash Risk for Highly ...
 
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
 

Recently uploaded

VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 

Recently uploaded (20)

VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 

Data Mining Project-Predicting Injury or Fatality in case of an accident

  • 1. FATAL OR INJURY- A CASE OF DECIDING ON PRIORITIZING RESPONDER RESOURCES By Piyush Lohana
  • 2. Maximum accidents in the year 2007 happened due to motor vehicles.
  • 3. WHY THIS PROJECT • “Every 12 minutes someone dies in a car crash in the United States due to a car accident or a collision between two motor vehicles.” (-NCIPC) • Most of times the accidents are fatal or involve serious injuries and by the time the help arrives at the crash site, a lot of loss has been done. • We attempt to build a model that can predict the seriousness of an accident case (i.e. if an accident is fatal or results in injury) based on the various predictors like rush or no rush hour, work zone, weather conditions, speed limits, interstate etc. • This helps to prioritize situations and allocates resources in scenarios where there is a high possibility of an accident resulting in fatalities or serious injury. • This will enable the emergency care provider on focusing on the measures and resource that can be taken when they arrive at the scene. The accuracy of pre-hospital crash scene details and crash victim assessment has important implications on the care that can be provided at the time of the crash scene.
  • 4. WHAT ARE WE CONSIDERING • We will be looking at the characteristics of the environment in which the accident occurred (weather, road condition, type of road, time of day, the day of the week, and month of the year) and the characteristics of the crash (direction of accident, speed limit on the road, work zone area, and how many vehicles were involved). • All of these variables can effect in what kind of accident has occurred (no injury, injury or fatal). This can further help the medic’s team to come prepared for the necessary actions that need to be taken at the scene.
  • 5. DATA SOURCE • http://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=1158 • It has 24 different attributes and 42,183 records • Identified Predictor and Outcome Variables
  • 6. CLEAR DESCRIPTION OF DATA SET Sl. No Variables Description 1 HOUR_I_R 1=rush hour, 0=not (rush = 6-9 am, 4-7 pm) 2 ALIGN_I 1 = straight, 2 = curve 3 STRATUM_R 1= NASS Crashes Involving At Least One Passenger Vehicle towed due to damage from the crash scene and no medium or heavy trucks are Involved, 0=not 4 WRK_ZONE 1= yes, 0= no 5 WKDY_I_R 1=weekday, 0=weekend 6 INT_HWY Interstate? 1=yes, 0=no 7 LGTCON_I_R Light conditions - 1=day, 2=dark (including dawn/dusk), 3=dark, but lighted,4=dawn or dusk 8 MAN_COL_I 0=no collision, 1=head-on, 2=other form of collision 9 PED_ACC_R 1=pedestrian/cyclist involved, 0=not 10 REL_JCT_I_R 1=accident at intersection/interchange, 0=not at intersection
  • 7. CLEAR DESCRIPTION OF DATA SET Sl. No Variables Description 11 SPD_LIM Speed limit, miles per hour 12 SUR_CON Surface conditions (1=dry, 2=wet, 3=snow/slush, 4=ice, 5=sand/dirt/oil, 8=other, 9=unknown) 13 TRAF_WAY 1=two-way traffic, 2=divided hwy, 3=one-way road 14 VEH_INVL Number of vehicles involved 15 WEATHER_R 1=no adverse conditions, 2= rain, snow or other adverse condition 16 INJURY_CRASH 1=yes, 0= no 17 NO_INJ_I Number of injuries 18 FATALITIES 1= yes, 0= no 19 MAX_SEV_IR 0=no injury, 1=non-fatal inj., 2=fatal inj.
  • 8. FILTERING DATA • Filtering method used is "Standard Deviations from the Mean", • This will eliminate the observations that are farther than three standard deviations from their means.
  • 9. DATA PARTITIONING • We build the model with Training Data • Test its correctness with Test Data • Validate it with Validation Data
  • 10. PREDICT, CLASSIFY OR CLUSTER ? As we are trying to predict the categorical class label MAX_SER_INJ, our analysis is supervised classification. Our model intends to discover relationships between the attributes that would make it possible to predict the outcome variable.
  • 11. MODEL The following three models are used for our analysis • Memory Based Reasoning(MBR) • Decision Trees • Logistic Regression
  • 14. BASELINE MISCLASSIFICATION • MAX_SEV_IR- 0=no injury, 1=non-fatal inj., 2=fatal inj. • Class 0 (No injury): 4949 • Class 1(Non-fatal injury): 4900 • Class 2 (Fatal Injury): 150 • The majority class is 0 (No injury) • The percentage of majority class in the dataset is: 49.49 % (4949/9999) • The baseline misclassification rate: 50.51 % • This is the baseline, the model that we build will make any sense if its misclassification rate is less than baseline misclassification.
  • 15. OUR DEFINITION OF BEST MODEL AS PER BUSINESS REQUIREMENT • Decision Tree : A supervised learning data driven method for classification • It is based on separating observations into more homogeneous subgroups by creating splits on predictors. • As Per our business requirement , this model is best in classifying the event of accident into three cases to prioritize resources.
  • 16. RESULTS The _MISC_ Misclassification rate : • Training: 0.40945 • Validation: 0.4113 • Test: 0.42305
  • 17.
  • 19. INTERPRETATION AND IMPLEMENTATION • Based on this rules, an application/website can be created which upon entering all the 5 most important factors(Predictors) will give an idea of the percentage of chances of an accident resulting in Fatality/Injury/No Injury. • The emergency service provider can then take a decision and send the response team to the site of an accident accordingly.
  • 20. BLUE PRINT OF IMPLEMENTATION
  • 21. OUTCOME • Depending on the Node Rule, it will predict the outcome • Red Cross predict’s there are 80% chances of Injury • Red Cross predict’s there are 10 % chances of Fatality • Red Cross predict’s there are 10 % chances of No injury
  • 22. SCOPE FOR IMPROVEMENT • In order to build more focused and rigorous model, we are working on identifying more predictors that can help determine the status of accident and a more clean model that has a less misclassification. • In order to achieve this, we intend to try Neural Network data mining algorithm.