SlideShare a Scribd company logo
1 of 20
DATA MINING FRAMEWORK TO ANALYZE
ROAD ACCIDENT DATA
PROJECT GUIDE
Prof .Bharani B.R.
(Asst. Professor)
TEAM MEMBERS
Aishwarya Saseendran(1CD13IS005)
Aishwarya Vaishali (1CD13IS006)
Ajitha AS (1CD13IS007)
Swetha D (1CD13IS053)
•ABSTRACT
•INTRODUCTION
•PROBLEM DEFINITION
•EXISTING SYSTEM
•LITERATURE SURVEY
•HARDWARE & SOFTWARE REQUIREMENTS
•DESIGN AND IMPLEMENTATION
•COMPILED RESULTS WITH COMPARISON
•CONCLUSION
•REFERENCE
•In the developed as well as developing countries, Infrastructure
development is one of the major investment by the government,
while safety of passengers on roads is of utmost importance.
• A road optimization during the construction or during
maintenance phase, requires that the engineers analyze all the
parameters that play a crucial role in ensuring safety for the
passengers and preventing accidents.
•One of the key objectives in accident data analysis is to identify the
main factors associated with road accidents.
•The data to be analyzed(both structured and
unstructured) is collected from various sources
and has several attributes.It is a challenge to
gather all such relevant data, detect and analyze
it together to generate decision trees that give
insights on previous accidents.
•For this purpose,we propose to harness the
power of Data Mining technologies like
Hadoop.The analysis will be represented in the
form of Decision tree which can be represented
graphically.
INTRODUTION
 Road accidents are uncertain and unpredictable incidents and their
analysis requires the knowledge of the factors affecting them.
 The major problem in the analysis of accident data is its Heterogenous
nature.
 Thus, heterogenity must be considered during analysis of the data,
otherwise some relationship between the data may remain hidden.
 Although, researchers used segmentation of the data to reduce this
heterogeneity using some measures such as expert knowledge, but
there is no guarantee that this will lead to an optimal segmentation which
consists of homogeneous groups of road accidents.
 Therefore, cluster analysis can assist the segmentation of road
accidents.
This is a research based data analysis
project in which we try to analyze a large
data set not capable of being analyzed by
typical database or data analysis software
like Excel.
To overcome this, we try to implement
distributed processing using Hadoop and
pipe the result with Apache Zepellin to
analyze and visualize the data set and
generate a decision tree.
•The traditional analysis method mainly
depends on database system and the
education of customers.
•The database system are limited to size ,
inaccurate and takes more time for huge
data set.
•Database systems can process only
structured data.
•Therefore using a traditional database will
not be efficient
PROPOSED SYSTEM
 The proposed system overcomes the
above mentioned issues in an efficient
way.
 The proposed system uses Data
mining approach to compare various
road accidents occurred in last 5 years
and identify highly accident prone
areas.
 Government agencies can run
instructional recommendation
systems based on this analysis.
LITERATURE SURVEY
Author Objective Data Mining Techniques Accuracy
Chaozhonget.al (2009) To identify the factors
significantly influencing
single vehicle crash severity.
Random Forest, Rough set theory 0.73%
Ali et.al (2010) To identify Most important
factors which affect injury
severity
Classification & Regression tree 72.49%
Liping et.al (2010) To predict Traffic accident
duration of incident and
driver information system
Artificial neural Networks 85.35%
DipoT.Akomolafe, Akinbola
Olutayo (2012)
To predict causes of
accidents and accident prone
locations.
Decision tree: Id3, Functional tree 70.27%
Tibebe et.al (2013) To Explore the possible
application of data mining
technology for developing a
classification model
Classification & Regression tree 87.47%
HARDWARE REQUIREMENTS
PROCESSOR TYPE: Intel® Core™ i5-6200U CPU
PROCESSOR SPEED: 2.30 GHz
HARD DISK: 1 TB
RAM: 8 GB
SOFTWARE REQUIREMENTS
OPERATING SYSTEM: UBUNTU
FRONT END: APACHE ZEPPELIN
BACK END: SCALA-SPARK
DESIGN AND IMPLEMENTATION
ARCHITECTURE OF THE SYSTEM
LOGIN MODULE
FLOWCHART OF LOGIN MODULE
Pre-processing is a huge
task in any data analysis
Some of the challenges
faced in this project
related to Data Pre-
processing are:
 Handling missing
values
 Handling noisy data
 Integrating the data
from 2 different
datasets
 Removing duplicate
records for multiple
index entries.
DATA PRE-PROCESSING
MODULE
CLUSTERING USING MAPREDUCE
COMPILED RESULTS (EXPECTED)
 Distribution of Accidents
across Attributes
 Decision Tree Visualization
 Severity Vs User Defined
Attribute
CONCLUSION
 The tree generated is pruned to large extent due to
memory restrictions and varied type of data.
 Further room for improvement exists by adding more
clusters to the distributed processing module & using
more user friendly visualizations.
THANK YOU

More Related Content

What's hot

Breast cancer diagnosis machine learning ppt
Breast cancer diagnosis machine learning pptBreast cancer diagnosis machine learning ppt
Breast cancer diagnosis machine learning pptAnkitGupta1476
 
Drowsiness State Detection of Driver using Eyelid Movement- IRE Journal Confe...
Drowsiness State Detection of Driver using Eyelid Movement- IRE Journal Confe...Drowsiness State Detection of Driver using Eyelid Movement- IRE Journal Confe...
Drowsiness State Detection of Driver using Eyelid Movement- IRE Journal Confe...Vignesh C
 
Drowsiness Detection using machine learning (1).pptx
Drowsiness Detection using machine learning (1).pptxDrowsiness Detection using machine learning (1).pptx
Drowsiness Detection using machine learning (1).pptxsathiyasowmi
 
IRJET- Diabetes Prediction using Machine Learning
IRJET- Diabetes Prediction using Machine LearningIRJET- Diabetes Prediction using Machine Learning
IRJET- Diabetes Prediction using Machine LearningIRJET Journal
 
Real Time Eye Blinking and Yawning Detection
Real Time Eye Blinking and Yawning DetectionReal Time Eye Blinking and Yawning Detection
Real Time Eye Blinking and Yawning Detectionijtsrd
 
DISEASE PREDICTION SYSTEM USING DATA MINING
DISEASE PREDICTION SYSTEM USING  DATA MININGDISEASE PREDICTION SYSTEM USING  DATA MINING
DISEASE PREDICTION SYSTEM USING DATA MININGshivaniyadav112
 
Driver drowsinees detection and alert.pptx slide
Driver drowsinees detection and alert.pptx slideDriver drowsinees detection and alert.pptx slide
Driver drowsinees detection and alert.pptx slidekavinakshi
 
Heart disease prediction system
Heart disease prediction systemHeart disease prediction system
Heart disease prediction systemSWAMI06
 
IRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning AlgorithmIRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning AlgorithmIRJET Journal
 
Predicting Diabetes Using Machine Learning
Predicting Diabetes Using Machine LearningPredicting Diabetes Using Machine Learning
Predicting Diabetes Using Machine LearningJohn Alex
 
Hot Topics in Machine Learning for Research and Thesis
Hot Topics in Machine Learning for Research and ThesisHot Topics in Machine Learning for Research and Thesis
Hot Topics in Machine Learning for Research and ThesisWriteMyThesis
 
Final thesis presentation
Final thesis presentationFinal thesis presentation
Final thesis presentationPawan Singh
 
Stock Price Prediction using Machine Learning Algorithms: ARIMA, LSTM & Linea...
Stock Price Prediction using Machine Learning Algorithms: ARIMA, LSTM & Linea...Stock Price Prediction using Machine Learning Algorithms: ARIMA, LSTM & Linea...
Stock Price Prediction using Machine Learning Algorithms: ARIMA, LSTM & Linea...IRJET Journal
 
CRIME ANALYSIS AND PREDICTION USING MACHINE LEARNING
CRIME ANALYSIS AND PREDICTION USING MACHINE LEARNINGCRIME ANALYSIS AND PREDICTION USING MACHINE LEARNING
CRIME ANALYSIS AND PREDICTION USING MACHINE LEARNINGIRJET Journal
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learningSandeep Garg
 
Heart disease prediction
Heart disease predictionHeart disease prediction
Heart disease predictionAriful Haque
 
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A SurveyPrediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A Surveyrahulmonikasharma
 
Traffic sign recognition
Traffic sign recognitionTraffic sign recognition
Traffic sign recognitionAKR Education
 

What's hot (20)

Breast cancer diagnosis machine learning ppt
Breast cancer diagnosis machine learning pptBreast cancer diagnosis machine learning ppt
Breast cancer diagnosis machine learning ppt
 
Drowsiness State Detection of Driver using Eyelid Movement- IRE Journal Confe...
Drowsiness State Detection of Driver using Eyelid Movement- IRE Journal Confe...Drowsiness State Detection of Driver using Eyelid Movement- IRE Journal Confe...
Drowsiness State Detection of Driver using Eyelid Movement- IRE Journal Confe...
 
Machine Learning for dummies!
Machine Learning for dummies!Machine Learning for dummies!
Machine Learning for dummies!
 
Drowsiness Detection using machine learning (1).pptx
Drowsiness Detection using machine learning (1).pptxDrowsiness Detection using machine learning (1).pptx
Drowsiness Detection using machine learning (1).pptx
 
Machine learning
Machine learning Machine learning
Machine learning
 
IRJET- Diabetes Prediction using Machine Learning
IRJET- Diabetes Prediction using Machine LearningIRJET- Diabetes Prediction using Machine Learning
IRJET- Diabetes Prediction using Machine Learning
 
Real Time Eye Blinking and Yawning Detection
Real Time Eye Blinking and Yawning DetectionReal Time Eye Blinking and Yawning Detection
Real Time Eye Blinking and Yawning Detection
 
DISEASE PREDICTION SYSTEM USING DATA MINING
DISEASE PREDICTION SYSTEM USING  DATA MININGDISEASE PREDICTION SYSTEM USING  DATA MINING
DISEASE PREDICTION SYSTEM USING DATA MINING
 
Driver drowsinees detection and alert.pptx slide
Driver drowsinees detection and alert.pptx slideDriver drowsinees detection and alert.pptx slide
Driver drowsinees detection and alert.pptx slide
 
Heart disease prediction system
Heart disease prediction systemHeart disease prediction system
Heart disease prediction system
 
IRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning AlgorithmIRJET- Road Accident Prediction using Machine Learning Algorithm
IRJET- Road Accident Prediction using Machine Learning Algorithm
 
Predicting Diabetes Using Machine Learning
Predicting Diabetes Using Machine LearningPredicting Diabetes Using Machine Learning
Predicting Diabetes Using Machine Learning
 
Hot Topics in Machine Learning for Research and Thesis
Hot Topics in Machine Learning for Research and ThesisHot Topics in Machine Learning for Research and Thesis
Hot Topics in Machine Learning for Research and Thesis
 
Final thesis presentation
Final thesis presentationFinal thesis presentation
Final thesis presentation
 
Stock Price Prediction using Machine Learning Algorithms: ARIMA, LSTM & Linea...
Stock Price Prediction using Machine Learning Algorithms: ARIMA, LSTM & Linea...Stock Price Prediction using Machine Learning Algorithms: ARIMA, LSTM & Linea...
Stock Price Prediction using Machine Learning Algorithms: ARIMA, LSTM & Linea...
 
CRIME ANALYSIS AND PREDICTION USING MACHINE LEARNING
CRIME ANALYSIS AND PREDICTION USING MACHINE LEARNINGCRIME ANALYSIS AND PREDICTION USING MACHINE LEARNING
CRIME ANALYSIS AND PREDICTION USING MACHINE LEARNING
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learning
 
Heart disease prediction
Heart disease predictionHeart disease prediction
Heart disease prediction
 
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A SurveyPrediction of Heart Disease using Machine Learning Algorithms: A Survey
Prediction of Heart Disease using Machine Learning Algorithms: A Survey
 
Traffic sign recognition
Traffic sign recognitionTraffic sign recognition
Traffic sign recognition
 

Similar to DATA MINING FRAMEWORK TO ANALYZE ROAD ACCIDENT DATA

IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...IRJET Journal
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...ertekg
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Gurdal Ertek
 
A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...
A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...
A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...IJCSIS Research Publications
 
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...IRJET Journal
 
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...IRJET Journal
 
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...ijistjournal
 
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...ijistjournal
 
IRJET - Real Time Facial Analysis using Tensorflowand OpenCV
IRJET -  	  Real Time Facial Analysis using Tensorflowand OpenCVIRJET -  	  Real Time Facial Analysis using Tensorflowand OpenCV
IRJET - Real Time Facial Analysis using Tensorflowand OpenCVIRJET Journal
 
The TARDEC Advanced Systems Engineering Capability ASEC Framework(5)
The TARDEC Advanced Systems Engineering Capability ASEC Framework(5)The TARDEC Advanced Systems Engineering Capability ASEC Framework(5)
The TARDEC Advanced Systems Engineering Capability ASEC Framework(5)Pradeep Mendonza
 
SCCAI- A Student Career Counselling Artificial Intelligence
SCCAI- A Student Career Counselling Artificial IntelligenceSCCAI- A Student Career Counselling Artificial Intelligence
SCCAI- A Student Career Counselling Artificial Intelligencevivatechijri
 
CRIME EXPLORATION AND FORECAST
CRIME EXPLORATION AND FORECASTCRIME EXPLORATION AND FORECAST
CRIME EXPLORATION AND FORECASTIRJET Journal
 
YASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptxYASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptxYashShiva3
 
ISEN 613_Team3_Final Project Report
ISEN 613_Team3_Final Project ReportISEN 613_Team3_Final Project Report
ISEN 613_Team3_Final Project ReportNaman Kapoor
 
ISEN 613_Team3_Final Project Report
ISEN 613_Team3_Final Project ReportISEN 613_Team3_Final Project Report
ISEN 613_Team3_Final Project ReportRahul Garg, CSSGB
 
WR Based Opinion Mining on Traffic Sentiment Analysis on Social Media
WR Based Opinion Mining on Traffic Sentiment Analysis on Social MediaWR Based Opinion Mining on Traffic Sentiment Analysis on Social Media
WR Based Opinion Mining on Traffic Sentiment Analysis on Social MediaIRJET Journal
 

Similar to DATA MINING FRAMEWORK TO ANALYZE ROAD ACCIDENT DATA (20)

IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
IRJET- Smart Automated Modelling using ECLAT Algorithm for Traffic Accident P...
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...
 
A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...
A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...
A Software Infrastructure for Multidimensional Data Analysis: A Data Modellin...
 
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...IRJET -  	  An User Friendly Interface for Data Preprocessing and Visualizati...
IRJET - An User Friendly Interface for Data Preprocessing and Visualizati...
 
Dss
DssDss
Dss
 
Dss
DssDss
Dss
 
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
Sensor Based Detection & Classification of Actionable & Non-Actionable Condit...
 
Robust ai
Robust aiRobust ai
Robust ai
 
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
 
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
FUZZY ANALYTIC HIERARCHY BASED DBMS SELECTION IN TURKISH NATIONAL IDENTITY CA...
 
IRJET - Real Time Facial Analysis using Tensorflowand OpenCV
IRJET -  	  Real Time Facial Analysis using Tensorflowand OpenCVIRJET -  	  Real Time Facial Analysis using Tensorflowand OpenCV
IRJET - Real Time Facial Analysis using Tensorflowand OpenCV
 
The TARDEC Advanced Systems Engineering Capability ASEC Framework(5)
The TARDEC Advanced Systems Engineering Capability ASEC Framework(5)The TARDEC Advanced Systems Engineering Capability ASEC Framework(5)
The TARDEC Advanced Systems Engineering Capability ASEC Framework(5)
 
SCCAI- A Student Career Counselling Artificial Intelligence
SCCAI- A Student Career Counselling Artificial IntelligenceSCCAI- A Student Career Counselling Artificial Intelligence
SCCAI- A Student Career Counselling Artificial Intelligence
 
CRIME EXPLORATION AND FORECAST
CRIME EXPLORATION AND FORECASTCRIME EXPLORATION AND FORECAST
CRIME EXPLORATION AND FORECAST
 
YASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptxYASH DATA SCIENCE SEMINAR.pptx
YASH DATA SCIENCE SEMINAR.pptx
 
ISEN 613_Team3_Final Project Report
ISEN 613_Team3_Final Project ReportISEN 613_Team3_Final Project Report
ISEN 613_Team3_Final Project Report
 
ISEN 613_Team3_Final Project Report
ISEN 613_Team3_Final Project ReportISEN 613_Team3_Final Project Report
ISEN 613_Team3_Final Project Report
 
ONE HIDDEN LAYER ANFIS MODEL FOR OOS DEVELOPMENT EFFORT ESTIMATION
ONE HIDDEN LAYER ANFIS MODEL FOR OOS DEVELOPMENT EFFORT ESTIMATIONONE HIDDEN LAYER ANFIS MODEL FOR OOS DEVELOPMENT EFFORT ESTIMATION
ONE HIDDEN LAYER ANFIS MODEL FOR OOS DEVELOPMENT EFFORT ESTIMATION
 
WR Based Opinion Mining on Traffic Sentiment Analysis on Social Media
WR Based Opinion Mining on Traffic Sentiment Analysis on Social MediaWR Based Opinion Mining on Traffic Sentiment Analysis on Social Media
WR Based Opinion Mining on Traffic Sentiment Analysis on Social Media
 

More from Aishwarya Saseendran

More from Aishwarya Saseendran (7)

Web Analytics in 10 slides
Web  Analytics in 10 slidesWeb  Analytics in 10 slides
Web Analytics in 10 slides
 
Interview Puzzles
Interview PuzzlesInterview Puzzles
Interview Puzzles
 
Evolution of wireless technology
Evolution of wireless technologyEvolution of wireless technology
Evolution of wireless technology
 
Evolution Of Programming Languages
Evolution Of Programming LanguagesEvolution Of Programming Languages
Evolution Of Programming Languages
 
DATA MINING FRAMEWORK TO ANALYZE ROAD ACCIDENT DATA
DATA MINING FRAMEWORK TO ANALYZE ROAD ACCIDENT DATADATA MINING FRAMEWORK TO ANALYZE ROAD ACCIDENT DATA
DATA MINING FRAMEWORK TO ANALYZE ROAD ACCIDENT DATA
 
Gi fi
Gi fiGi fi
Gi fi
 
Gifi
GifiGifi
Gifi
 

Recently uploaded

Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringmulugeta48
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptNANDHAKUMARA10
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoordharasingh5698
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Arindam Chakraborty, Ph.D., P.E. (CA, TX)
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...tanu pandey
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapRishantSharmaFr
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 

Recently uploaded (20)

Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoorTop Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
Top Rated Call Girls In chittoor 📱 {7001035870} VIP Escorts chittoor
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bangalore ☎ 7737669865 🥵 Book Your One night Stand
 

DATA MINING FRAMEWORK TO ANALYZE ROAD ACCIDENT DATA

  • 1. DATA MINING FRAMEWORK TO ANALYZE ROAD ACCIDENT DATA
  • 2. PROJECT GUIDE Prof .Bharani B.R. (Asst. Professor) TEAM MEMBERS Aishwarya Saseendran(1CD13IS005) Aishwarya Vaishali (1CD13IS006) Ajitha AS (1CD13IS007) Swetha D (1CD13IS053)
  • 3. •ABSTRACT •INTRODUCTION •PROBLEM DEFINITION •EXISTING SYSTEM •LITERATURE SURVEY •HARDWARE & SOFTWARE REQUIREMENTS •DESIGN AND IMPLEMENTATION •COMPILED RESULTS WITH COMPARISON •CONCLUSION •REFERENCE
  • 4. •In the developed as well as developing countries, Infrastructure development is one of the major investment by the government, while safety of passengers on roads is of utmost importance. • A road optimization during the construction or during maintenance phase, requires that the engineers analyze all the parameters that play a crucial role in ensuring safety for the passengers and preventing accidents. •One of the key objectives in accident data analysis is to identify the main factors associated with road accidents.
  • 5. •The data to be analyzed(both structured and unstructured) is collected from various sources and has several attributes.It is a challenge to gather all such relevant data, detect and analyze it together to generate decision trees that give insights on previous accidents. •For this purpose,we propose to harness the power of Data Mining technologies like Hadoop.The analysis will be represented in the form of Decision tree which can be represented graphically.
  • 6. INTRODUTION  Road accidents are uncertain and unpredictable incidents and their analysis requires the knowledge of the factors affecting them.  The major problem in the analysis of accident data is its Heterogenous nature.  Thus, heterogenity must be considered during analysis of the data, otherwise some relationship between the data may remain hidden.  Although, researchers used segmentation of the data to reduce this heterogeneity using some measures such as expert knowledge, but there is no guarantee that this will lead to an optimal segmentation which consists of homogeneous groups of road accidents.  Therefore, cluster analysis can assist the segmentation of road accidents.
  • 7. This is a research based data analysis project in which we try to analyze a large data set not capable of being analyzed by typical database or data analysis software like Excel. To overcome this, we try to implement distributed processing using Hadoop and pipe the result with Apache Zepellin to analyze and visualize the data set and generate a decision tree.
  • 8. •The traditional analysis method mainly depends on database system and the education of customers. •The database system are limited to size , inaccurate and takes more time for huge data set. •Database systems can process only structured data. •Therefore using a traditional database will not be efficient
  • 9. PROPOSED SYSTEM  The proposed system overcomes the above mentioned issues in an efficient way.  The proposed system uses Data mining approach to compare various road accidents occurred in last 5 years and identify highly accident prone areas.  Government agencies can run instructional recommendation systems based on this analysis.
  • 10. LITERATURE SURVEY Author Objective Data Mining Techniques Accuracy Chaozhonget.al (2009) To identify the factors significantly influencing single vehicle crash severity. Random Forest, Rough set theory 0.73% Ali et.al (2010) To identify Most important factors which affect injury severity Classification & Regression tree 72.49% Liping et.al (2010) To predict Traffic accident duration of incident and driver information system Artificial neural Networks 85.35% DipoT.Akomolafe, Akinbola Olutayo (2012) To predict causes of accidents and accident prone locations. Decision tree: Id3, Functional tree 70.27% Tibebe et.al (2013) To Explore the possible application of data mining technology for developing a classification model Classification & Regression tree 87.47%
  • 11. HARDWARE REQUIREMENTS PROCESSOR TYPE: Intel® Core™ i5-6200U CPU PROCESSOR SPEED: 2.30 GHz HARD DISK: 1 TB RAM: 8 GB SOFTWARE REQUIREMENTS OPERATING SYSTEM: UBUNTU FRONT END: APACHE ZEPPELIN BACK END: SCALA-SPARK
  • 12.
  • 15. Pre-processing is a huge task in any data analysis Some of the challenges faced in this project related to Data Pre- processing are:  Handling missing values  Handling noisy data  Integrating the data from 2 different datasets  Removing duplicate records for multiple index entries. DATA PRE-PROCESSING MODULE
  • 17. COMPILED RESULTS (EXPECTED)  Distribution of Accidents across Attributes  Decision Tree Visualization  Severity Vs User Defined Attribute
  • 18. CONCLUSION  The tree generated is pruned to large extent due to memory restrictions and varied type of data.  Further room for improvement exists by adding more clusters to the distributed processing module & using more user friendly visualizations.
  • 19.