SlideShare a Scribd company logo
1 of 22
Madhu Kiran Chiti
Radhika Patel
Surabhi Parab
PREDICTING TRAFFIC ACCIDENT
SEVERITY AND VEHICLE COUNT USING
MACHINE LEARNING
TABLE OF CONTENT
0 2
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab
01
02
03
04
05
06
07
Introduction
Literature review
Methodology
Model training algorithms
Result and Analysis
Conclusion
References
INTRODUCTION
• Road traffic accidents result in 1.3 million deaths and 50 million injuries worldwide
each year.
• In the US alone, there were over 36,000 traffic fatalities in 2019.
• Economic costs of traffic accidents range from $230 billion to $900 billion globally.
• This project aims to build a multiclass classification model to predict whether an
accident is likely to result in a severe or minor injury.
• We use a subset of features from the US Accidents dataset, including distance,
temperature, humidity, pressure, precipitation, and weather condition.
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 3
LITERATURE REVIEW
• Conducted a literature review before project implementation
• This helped in precisely formulating the project objective and tailoring models
as per project requirements.
• Several user-created notebooks on Kaggle [4]
• An IEEE paper on Comparison of Machine Learning algorithms [5]
• Article on the impact of rainy weather on road accidents. (based on Weather) [6]
• Our project's unique aspect is focusing on weather conditions and location to
provide a severity report, which is different from the existing literature that we
came across.
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 4
METHODOLOGY
• Kaggle Dataset
• Features like - Severity, Time, Latitude, Longitude,
Distance(mi), Street, Number, Side, Zipcode,
Temperature(F), Wind_Chill(F), Humidity(%), Pressure(in),
Precipitation(in), Weather_Condition.
DATASET
DATASET
EVALUATION
• Feature Selection for analysis
• Two separate analysis - one according to the time and
location, second according to the Weather Conditions
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 5
METHODOLOGY
• Finding missing values and missing columns
• Replacing missing numerical values with mean (tried, but
dropped column itself)
• Scaling features using PCA and Standardization
DATA CLEANING
MODEL
TRAINING
• Logistic Regression (LR), Random Forest Classifier
(RF), Support Vector Classifier (SVC), K Nearest
Neighbor (KNN), Gaussian Naive Bayes (GNB) and the
Decision Tree models.
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 6
WORK FLOWCHART
Input Raw Data
jmjj
Remove null and noisy data
Input all road condition related
features
Remove the unwanted
features
Perform features selection
techniques
Top 11 Features
Perform the Classification techniques
(Naive Bayes, Logistic Regression,
Decision Tree, Random Forest, Support
Vector Machine)
Select the Best Prediction
Method
Resultant Final Model
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 7
PCA WITH NAÏVE BAYES ALGORITHM
Load the dataset and display its summary statistics.
Check for missing values and drop columns with more than
50% missing data.
Rename columns and convert DateTime variables to DateTime
format.
Impute missing numerical values with mean imputation.
encoding categorical variables
Split the dataset into training and testing
sets.
Scale the features using StandardScaler.
Perform feature selection using PCA and plot the
first two principal components.
Train a Gaussian Naive Bayes Classifier on the
training set.
Evaluate the performance of the model
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 8
PCA WITH LR, RF,
AND SVC
M A D H U K I R A N C H I T I , R a d h i k a P a t e L , S u r a b h i P a r a b 0 9
KNN ALGORITHM
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 0
DECISION TREE MODEL
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 11
GAUSSIAN NAIVE BAYES
(GNB)
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 2
RESULTS AND ANALYSIS
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 3
RESULTS AND ANALYSIS
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 4
CONFUSION MATRIX
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 5
CONFUSION MATRIX
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 6
ACCURACY PRECISION RECALL
CONCLUSION
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 7
CONCLUSION
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 8
FUTURE WORK
• Due to the limited time constraints, we were unable to fine-tune our models to bring more
accurate results. As part of the future work, we can experiment more with the learning rates and
parameters of the code; and make our model even more robust.
• In the SVC model, we can try changing the kernel to poly (Polynomial) and rbf (Radial Basis
Function).
• A deeper analysis of the relationships between different variables and accident severity or vehicle
count can provide valuable insights. We can explore more performance metrics to analyze the
model in future.
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 9
REFERENCES
• [1] S. Moosavi, M. Ardalan, and M. K. H. Shafiei, "US Accidents (3.5 million records): A Countrywide Traffic Accident Dataset," Kaggle,
2019. https://www.kaggle.com/sobhanmoosavi/us-accidents
• [2] Moosavi, Sobhan, Mohammad Hossein Samavatian, Srinivasan Parthasarathy, and Rajiv Ramnath. “A Countrywide Traffic Accident
Dataset.”, 2019. https://smoosavi.org/datasets/us_accidents
• [3] Moosavi, Sobhan, Mohammad Hossein Samavatian, Srinivasan Parthasarathy, Radu Teodorescu, and Rajiv Ramnath. "Accident Risk
Prediction based on Heterogeneous Sparse Data: New Dataset and Insights." In Proceedings of the 27th ACM SIGSPATIAL International
Conference on Advances in Geographic Information Systems, ACM, 2019. https://arxiv.org/abs/1909.09638
• [4] https://www.kaggle.com/code/satyabrataroy/60-insights-extraction-us?accident-analysis
• [5] R. E. AlMamlook, K. M. Kwayu, M. R. Alkasisbeh, and A. A. Frefer, "Comparison of Machine Learning Algorithms for Predicting Traffic
Accident Severity," 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT),
Amman, Jordan, 2019, pp. 272-276, doi: 10.1109/JEEIT.2019.8717393.
• [6] Harold Brodsky, A.Shalom Hakkert, “Risk of a road accident in rainy weather,” Accident Analysis & Prevention, Volume 20, Issue 3, 1988,
Pages 161-176, ISSN 0001-4575, https://doi.org/10.1016/0001- 4575(88)90001-2
• [7] Anna Trawén, Pia Maraste, Ulf Persson, “International comparison of costs of a fatal casualty of road accidents in 1990 and 1999,”
Accident Analysis & Prevention, Volume 34, Issue 3, 2002, Pages 323-332, ISSN 0001-4575, https://doi.org/10.1016/S0001-4575(01)00029-X
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 2 0
CODE LINK
[8] Code1: https://github.com/sparab2/ML_Project_Traffic/blob/main/Project_Final .ipynb
[9] Code2: https://github.com/Radhika-95/Project.gi
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab
Code
1
Code
2
2 1
Thank You!
MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 2 2

More Related Content

Similar to BiPredicting Traffic Accident Severity and Vehicle Count using Machine Learning.pptx

Review Paper for floodplain mapping with applications of HEC-HMS, HEC-RAS and...
Review Paper for floodplain mapping with applications of HEC-HMS, HEC-RAS and...Review Paper for floodplain mapping with applications of HEC-HMS, HEC-RAS and...
Review Paper for floodplain mapping with applications of HEC-HMS, HEC-RAS and...IRJET Journal
 
Rides Request Demand Forecast- OLA Bike
Rides Request Demand Forecast- OLA BikeRides Request Demand Forecast- OLA Bike
Rides Request Demand Forecast- OLA BikeIRJET Journal
 
Structuring EMR Data For Analytics
Structuring EMR Data For AnalyticsStructuring EMR Data For Analytics
Structuring EMR Data For AnalyticsBrandon Stange
 
A COMPARATIVE STUDY OF DIFFERENT INTEGRATED MULTIPLE CRITERIA DECISION MAKING...
A COMPARATIVE STUDY OF DIFFERENT INTEGRATED MULTIPLE CRITERIA DECISION MAKING...A COMPARATIVE STUDY OF DIFFERENT INTEGRATED MULTIPLE CRITERIA DECISION MAKING...
A COMPARATIVE STUDY OF DIFFERENT INTEGRATED MULTIPLE CRITERIA DECISION MAKING...Shankha Goswami
 
Adaptation Metrics: Community Based Measuring And Prioritizing Adaptation Act...
Adaptation Metrics: Community Based Measuring And Prioritizing Adaptation Act...Adaptation Metrics: Community Based Measuring And Prioritizing Adaptation Act...
Adaptation Metrics: Community Based Measuring And Prioritizing Adaptation Act...Prabhakar SVRK
 
Predicting Flood Impacts: Analyzing Flood Dataset using Machine Learning Algo...
Predicting Flood Impacts: Analyzing Flood Dataset using Machine Learning Algo...Predicting Flood Impacts: Analyzing Flood Dataset using Machine Learning Algo...
Predicting Flood Impacts: Analyzing Flood Dataset using Machine Learning Algo...IRJET Journal
 
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...Adel Sabour
 
IRJET- Road Traffic Prediction using Machine Learning
IRJET- Road Traffic Prediction using Machine LearningIRJET- Road Traffic Prediction using Machine Learning
IRJET- Road Traffic Prediction using Machine LearningIRJET Journal
 
Blood Transfusion success rate prediction using Artificial Intelligence
Blood Transfusion success rate prediction using Artificial IntelligenceBlood Transfusion success rate prediction using Artificial Intelligence
Blood Transfusion success rate prediction using Artificial IntelligenceIRJET Journal
 
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...IRJET Journal
 
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...Biplav Srivastava
 
Federal Highway Administration Initiative and Proven Countermeasures
Federal Highway Administration Initiative and Proven CountermeasuresFederal Highway Administration Initiative and Proven Countermeasures
Federal Highway Administration Initiative and Proven CountermeasuresTexas A&M Transportation Institute
 
7 ncdot&u senal deladvcurvahor 2010 150p
7   ncdot&u senal deladvcurvahor 2010 150p7   ncdot&u senal deladvcurvahor 2010 150p
7 ncdot&u senal deladvcurvahor 2010 150pSierra Francisco Justo
 
Applied research. Optimization of the Shuttle Services
Applied research. Optimization of the Shuttle ServicesApplied research. Optimization of the Shuttle Services
Applied research. Optimization of the Shuttle ServicesRAMON RIOS
 
New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...ijseajournal
 
Regression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network ApproachRegression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network ApproachKhulna University
 
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...IJDKP
 
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...IJDKP
 

Similar to BiPredicting Traffic Accident Severity and Vehicle Count using Machine Learning.pptx (20)

Review Paper for floodplain mapping with applications of HEC-HMS, HEC-RAS and...
Review Paper for floodplain mapping with applications of HEC-HMS, HEC-RAS and...Review Paper for floodplain mapping with applications of HEC-HMS, HEC-RAS and...
Review Paper for floodplain mapping with applications of HEC-HMS, HEC-RAS and...
 
Rides Request Demand Forecast- OLA Bike
Rides Request Demand Forecast- OLA BikeRides Request Demand Forecast- OLA Bike
Rides Request Demand Forecast- OLA Bike
 
20120140507001
2012014050700120120140507001
20120140507001
 
Structuring EMR Data For Analytics
Structuring EMR Data For AnalyticsStructuring EMR Data For Analytics
Structuring EMR Data For Analytics
 
A COMPARATIVE STUDY OF DIFFERENT INTEGRATED MULTIPLE CRITERIA DECISION MAKING...
A COMPARATIVE STUDY OF DIFFERENT INTEGRATED MULTIPLE CRITERIA DECISION MAKING...A COMPARATIVE STUDY OF DIFFERENT INTEGRATED MULTIPLE CRITERIA DECISION MAKING...
A COMPARATIVE STUDY OF DIFFERENT INTEGRATED MULTIPLE CRITERIA DECISION MAKING...
 
Adaptation Metrics: Community Based Measuring And Prioritizing Adaptation Act...
Adaptation Metrics: Community Based Measuring And Prioritizing Adaptation Act...Adaptation Metrics: Community Based Measuring And Prioritizing Adaptation Act...
Adaptation Metrics: Community Based Measuring And Prioritizing Adaptation Act...
 
Predicting Flood Impacts: Analyzing Flood Dataset using Machine Learning Algo...
Predicting Flood Impacts: Analyzing Flood Dataset using Machine Learning Algo...Predicting Flood Impacts: Analyzing Flood Dataset using Machine Learning Algo...
Predicting Flood Impacts: Analyzing Flood Dataset using Machine Learning Algo...
 
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
حلقة تكنولوجية 11 بحث علمى بعنوان A Systematic Mapping Study for Big Data Str...
 
IRJET- Road Traffic Prediction using Machine Learning
IRJET- Road Traffic Prediction using Machine LearningIRJET- Road Traffic Prediction using Machine Learning
IRJET- Road Traffic Prediction using Machine Learning
 
Blood Transfusion success rate prediction using Artificial Intelligence
Blood Transfusion success rate prediction using Artificial IntelligenceBlood Transfusion success rate prediction using Artificial Intelligence
Blood Transfusion success rate prediction using Artificial Intelligence
 
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
 
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
Case Studies in Managing Traffic in a Developing Country with Privacy-Preserv...
 
Federal Highway Administration Initiative and Proven Countermeasures
Federal Highway Administration Initiative and Proven CountermeasuresFederal Highway Administration Initiative and Proven Countermeasures
Federal Highway Administration Initiative and Proven Countermeasures
 
7 ncdot&u senal deladvcurvahor 2010 150p
7   ncdot&u senal deladvcurvahor 2010 150p7   ncdot&u senal deladvcurvahor 2010 150p
7 ncdot&u senal deladvcurvahor 2010 150p
 
Applied research. Optimization of the Shuttle Services
Applied research. Optimization of the Shuttle ServicesApplied research. Optimization of the Shuttle Services
Applied research. Optimization of the Shuttle Services
 
New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...New Research Articles 2020 November Issue International Journal of Software E...
New Research Articles 2020 November Issue International Journal of Software E...
 
Regression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network ApproachRegression and Classification: An Artificial Neural Network Approach
Regression and Classification: An Artificial Neural Network Approach
 
Masters Project
Masters ProjectMasters Project
Masters Project
 
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
 
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
DATA MINING ATTRIBUTE SELECTION APPROACH FOR DROUGHT MODELLING: A CASE STUDY ...
 

Recently uploaded

Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 

Recently uploaded (20)

Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 

BiPredicting Traffic Accident Severity and Vehicle Count using Machine Learning.pptx

  • 1. Madhu Kiran Chiti Radhika Patel Surabhi Parab PREDICTING TRAFFIC ACCIDENT SEVERITY AND VEHICLE COUNT USING MACHINE LEARNING
  • 2. TABLE OF CONTENT 0 2 MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 01 02 03 04 05 06 07 Introduction Literature review Methodology Model training algorithms Result and Analysis Conclusion References
  • 3. INTRODUCTION • Road traffic accidents result in 1.3 million deaths and 50 million injuries worldwide each year. • In the US alone, there were over 36,000 traffic fatalities in 2019. • Economic costs of traffic accidents range from $230 billion to $900 billion globally. • This project aims to build a multiclass classification model to predict whether an accident is likely to result in a severe or minor injury. • We use a subset of features from the US Accidents dataset, including distance, temperature, humidity, pressure, precipitation, and weather condition. MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 3
  • 4. LITERATURE REVIEW • Conducted a literature review before project implementation • This helped in precisely formulating the project objective and tailoring models as per project requirements. • Several user-created notebooks on Kaggle [4] • An IEEE paper on Comparison of Machine Learning algorithms [5] • Article on the impact of rainy weather on road accidents. (based on Weather) [6] • Our project's unique aspect is focusing on weather conditions and location to provide a severity report, which is different from the existing literature that we came across. MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 4
  • 5. METHODOLOGY • Kaggle Dataset • Features like - Severity, Time, Latitude, Longitude, Distance(mi), Street, Number, Side, Zipcode, Temperature(F), Wind_Chill(F), Humidity(%), Pressure(in), Precipitation(in), Weather_Condition. DATASET DATASET EVALUATION • Feature Selection for analysis • Two separate analysis - one according to the time and location, second according to the Weather Conditions MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 5
  • 6. METHODOLOGY • Finding missing values and missing columns • Replacing missing numerical values with mean (tried, but dropped column itself) • Scaling features using PCA and Standardization DATA CLEANING MODEL TRAINING • Logistic Regression (LR), Random Forest Classifier (RF), Support Vector Classifier (SVC), K Nearest Neighbor (KNN), Gaussian Naive Bayes (GNB) and the Decision Tree models. MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 6
  • 7. WORK FLOWCHART Input Raw Data jmjj Remove null and noisy data Input all road condition related features Remove the unwanted features Perform features selection techniques Top 11 Features Perform the Classification techniques (Naive Bayes, Logistic Regression, Decision Tree, Random Forest, Support Vector Machine) Select the Best Prediction Method Resultant Final Model MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 7
  • 8. PCA WITH NAÏVE BAYES ALGORITHM Load the dataset and display its summary statistics. Check for missing values and drop columns with more than 50% missing data. Rename columns and convert DateTime variables to DateTime format. Impute missing numerical values with mean imputation. encoding categorical variables Split the dataset into training and testing sets. Scale the features using StandardScaler. Perform feature selection using PCA and plot the first two principal components. Train a Gaussian Naive Bayes Classifier on the training set. Evaluate the performance of the model MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 0 8
  • 9. PCA WITH LR, RF, AND SVC M A D H U K I R A N C H I T I , R a d h i k a P a t e L , S u r a b h i P a r a b 0 9
  • 10. KNN ALGORITHM MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 0
  • 11. DECISION TREE MODEL MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 11
  • 12. GAUSSIAN NAIVE BAYES (GNB) MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 2
  • 13. RESULTS AND ANALYSIS MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 3
  • 14. RESULTS AND ANALYSIS MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 4
  • 15. CONFUSION MATRIX MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 5
  • 16. CONFUSION MATRIX MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 6
  • 17. ACCURACY PRECISION RECALL CONCLUSION MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 7
  • 18. CONCLUSION MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 8
  • 19. FUTURE WORK • Due to the limited time constraints, we were unable to fine-tune our models to bring more accurate results. As part of the future work, we can experiment more with the learning rates and parameters of the code; and make our model even more robust. • In the SVC model, we can try changing the kernel to poly (Polynomial) and rbf (Radial Basis Function). • A deeper analysis of the relationships between different variables and accident severity or vehicle count can provide valuable insights. We can explore more performance metrics to analyze the model in future. MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 1 9
  • 20. REFERENCES • [1] S. Moosavi, M. Ardalan, and M. K. H. Shafiei, "US Accidents (3.5 million records): A Countrywide Traffic Accident Dataset," Kaggle, 2019. https://www.kaggle.com/sobhanmoosavi/us-accidents • [2] Moosavi, Sobhan, Mohammad Hossein Samavatian, Srinivasan Parthasarathy, and Rajiv Ramnath. “A Countrywide Traffic Accident Dataset.”, 2019. https://smoosavi.org/datasets/us_accidents • [3] Moosavi, Sobhan, Mohammad Hossein Samavatian, Srinivasan Parthasarathy, Radu Teodorescu, and Rajiv Ramnath. "Accident Risk Prediction based on Heterogeneous Sparse Data: New Dataset and Insights." In Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM, 2019. https://arxiv.org/abs/1909.09638 • [4] https://www.kaggle.com/code/satyabrataroy/60-insights-extraction-us?accident-analysis • [5] R. E. AlMamlook, K. M. Kwayu, M. R. Alkasisbeh, and A. A. Frefer, "Comparison of Machine Learning Algorithms for Predicting Traffic Accident Severity," 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT), Amman, Jordan, 2019, pp. 272-276, doi: 10.1109/JEEIT.2019.8717393. • [6] Harold Brodsky, A.Shalom Hakkert, “Risk of a road accident in rainy weather,” Accident Analysis & Prevention, Volume 20, Issue 3, 1988, Pages 161-176, ISSN 0001-4575, https://doi.org/10.1016/0001- 4575(88)90001-2 • [7] Anna Trawén, Pia Maraste, Ulf Persson, “International comparison of costs of a fatal casualty of road accidents in 1990 and 1999,” Accident Analysis & Prevention, Volume 34, Issue 3, 2002, Pages 323-332, ISSN 0001-4575, https://doi.org/10.1016/S0001-4575(01)00029-X MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 2 0
  • 21. CODE LINK [8] Code1: https://github.com/sparab2/ML_Project_Traffic/blob/main/Project_Final .ipynb [9] Code2: https://github.com/Radhika-95/Project.gi MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab Code 1 Code 2 2 1
  • 22. Thank You! MADHU KIRAN CHITI, Radhika PateL, Surabhi Parab 2 2