SlideShare a Scribd company logo
1 of 16
SCHOOL OF ELECTRICAL ENGINEERING AND COMPUTING
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
ARTICLE CRITICAL REVIEW
CLASSIFICATION OF HEADACHE DISORDER USING RANDOM FOREST ALGORITHM
PRESENTER: ALEMU GUDETA
1
About Article
Authors
Dhiyaussalam, Adi Wibowo, Fajar Agung Nugroho, Eko Adi Sarwoko, I Made Agus Setiawan
2020 4th International Conference on Informatics and Computational Sciences (ICICoS)
Publisher - IEEE
Classification of Headache Disorder Using Random Forest Algorithm
2
General Summary
 Headache disorder is one of the most common illnesses.
 The purpose of this article is to help anyone figure out what kind of headache they are experiencing
at home without seeking medical attention.
 The majority of headache sufferers prefer to treat themselves at home when they feel they have a
headache.
 Model for categorizing the different forms of headaches and generating feature importance was
created using the Random Forest algorithm.
 To create the best model, they used the Migbase dataset and adjusted numerous algorithmic
parameters. The 850 data in the Migbase dataset have 39 features. Also it has three class labels for
migraine, tension, and cluster, with respective percentages of 71.73%, 21.67%, and 6.60%.
3
Key Findings
 Random Forest - an ensemble learning method for classification, regression and other tasks
that operates by constructing a multitude of decision trees at training time.
 N_estimators - used to determine the number of trees contained in the Random Forest model.
 Max_features - used to determine the maximum number of features used.
 Max_depth - used to determine the maximum depth of a tree.
 Bagging - Bootstrapping(Feature extraction) + Aggregation (Combining result and picking the
dominant)
 Gini impurity - a measurement of how evenly class labels are distributed among nodes.
4
Research Gap
 Particularly in a developing nation, the patient does not need to see a doctor for a
headache. However, as the headache wears on, negative effects may result.
 However, utilizing a computer, those who are experiencing primary headaches can
independently analyze their condition and choose the best course of treatment.
 In some cases, it may be challenging for the doctors to diagnose the symptoms. In this
approach, a headache type can be identified by the random forest algorithm with greater
accuracy than by a doctor.
5
Objective / Purpose
 The article's goal is to provide the optimal model for classifying different types of headaches using
a dataset that contains information on experiencing symptoms.
 Different scholars used various techniques, including decision trees, to attempt to solve the same
issue. The authors of this research employed a random forest method to classify the different types
of headaches.
 The created model aids the person with a headache in determining its nature and initiating self-
care measures.
6
Methodology
 Data Collection - Migbase dataset is downloaded from the internet with 850 rows and 39 features.
 Algorithm – Random Forest. RF combines a number of Decision Trees to produce classification and
regression results. RF is less likely to overfitting.
 Gini Impurity - Calculate how class labels are distributed among a nodes.
 t = node j = number of children at node t, 𝑛𝑐𝑖 is the number of samples with the value 𝑥𝑖
belonging to class 𝑐, 𝑚𝑖 is the number of samples with the value 𝑥𝑖 at node 𝑡
7
Methodology … cntd
 Feature importance - calculated from the average impurity reduction of all the Decision Tree in a
Random Forest without assuming whether the data used is linearly separated or not.
 𝐹𝐼 is feature-𝑖 in a Decision Tree, and 𝑘 is represent all of node.
 Data Processing - feature extraction done manually and correlation matrix of the target label is
also used to calculate the feature importance. Correlation values outside the range between -0.5 to 0.5
are removed.
8
Methodology … cntd
 Model Development - The classification model using the Random Forest algorithm in this study
was built based on the Scikit-learn library documentation.
 Model Evaluation - The model will be evaluated using 10 data with the contents of 4 classes of
migraine, 3 classes of tension, and 3 classes of clusters.
9
Confusion matrix of test result on one of model
Methodology … cntd
 Parameter Optimization
 To get the best performance results from the Random Forest model, setting-ups some parameter
need to be done manually.
 Manually setting the number of trees or n_estimators can reduce the error rate on the performance
of a Random Forest model.
 Accordingly, for n_estimators the values to be used are 10, 20, 50, and 100, Max_depth the values
to be used are 4, 5, and none (no limitation), and for other parameters that will be set manually is
the Max_features parameter with 3 choices: 6, 14, or 33.
10
Methodology … cntd
11
Key Findings
 By adjusting N_estimators, Max_features and Max_depth authors achieved the following
performance accuracy.
12
Contributions
 The article assessed the effectiveness of the random forest algorithm to classify the types of
headaches that various writers were unable get the specified performance accuracy.
 Vandewiele et al., (2018) , Krawczyk et al., (2013) , Aljaaf et al., (2015) conducted a research ad
suggested decision tree. But these authors got better performance accuracy compared with those
authors.
 Practically, it made life easier and increased societal awareness of a person's ability to diagnose
their headaches at home.
13
Critique
 The majority of the data processing in machine learning, including the amount of features,
depth, and estimators to use, is done manually.
 Additionally, the authors made no suggestions or indicated what they would do to
enhance the paper in the future.
 They weren't anticipating any new symptoms, which may have appeared at any time. In
this situation, the model's performance will suffer.
14
Conclusion
 According to the authors the model performs better than the model trained before
using decision tree on same dataset, by researchers Vandewiele et al., (2018) ,
Krawczyk et al., (2013) , Aljaaf et al., (2015)
 The highest performance resulted in an accuracy rating of 99.56%, and the lowest
performance resulted in a value of 97.79% by adjusting n_estimators, max features,
and max depth were 100, 33, and 5 respectively.
15
 THANK YOU!
16

More Related Content

Similar to Classification of Headache Disorder Using Random Forest Algorithm.pptx

Diabetes Prediction using Machine Learning Algorithms
Diabetes Prediction using Machine Learning AlgorithmsDiabetes Prediction using Machine Learning Algorithms
Diabetes Prediction using Machine Learning AlgorithmsIRJET Journal
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision TreesSara Hooker
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
 
Machine learning session6(decision trees random forrest)
Machine learning   session6(decision trees random forrest)Machine learning   session6(decision trees random forrest)
Machine learning session6(decision trees random forrest)Abhimanyu Dwivedi
 
An efficient convolutional neural network-based classifier for an imbalanced ...
An efficient convolutional neural network-based classifier for an imbalanced ...An efficient convolutional neural network-based classifier for an imbalanced ...
An efficient convolutional neural network-based classifier for an imbalanced ...IAESIJAI
 
Prediction of Dengue, Diabetes and Swine Flu using Random Forest Classificati...
Prediction of Dengue, Diabetes and Swine Flu using Random Forest Classificati...Prediction of Dengue, Diabetes and Swine Flu using Random Forest Classificati...
Prediction of Dengue, Diabetes and Swine Flu using Random Forest Classificati...IRJET Journal
 
IRJET- Prediction of Autism Spectrum Disorder using Deep Learning: A Survey
IRJET- Prediction of Autism Spectrum Disorder using Deep Learning: A SurveyIRJET- Prediction of Autism Spectrum Disorder using Deep Learning: A Survey
IRJET- Prediction of Autism Spectrum Disorder using Deep Learning: A SurveyIRJET Journal
 
Alzheimer Disease Prediction using Machine Learning Algorithms
Alzheimer Disease Prediction using Machine Learning AlgorithmsAlzheimer Disease Prediction using Machine Learning Algorithms
Alzheimer Disease Prediction using Machine Learning AlgorithmsIRJET Journal
 
Health Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep LearningHealth Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep LearningIRJET Journal
 
first review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbh
first review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbhfirst review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbh
first review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbhmithun302002
 
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...IRJET Journal
 
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...IRJET Journal
 
Simplified Knowledge Prediction: Application of Machine Learning in Real Life
Simplified Knowledge Prediction: Application of Machine Learning in Real LifeSimplified Knowledge Prediction: Application of Machine Learning in Real Life
Simplified Knowledge Prediction: Application of Machine Learning in Real LifePeea Bal Chakraborty
 
Comparative study of various supervisedclassification methodsforanalysing def...
Comparative study of various supervisedclassification methodsforanalysing def...Comparative study of various supervisedclassification methodsforanalysing def...
Comparative study of various supervisedclassification methodsforanalysing def...eSAT Publishing House
 
IRJET- Comparative Study of Machine Learning Models for Alzheimer’s Detec...
IRJET-  	  Comparative Study of Machine Learning Models for Alzheimer’s Detec...IRJET-  	  Comparative Study of Machine Learning Models for Alzheimer’s Detec...
IRJET- Comparative Study of Machine Learning Models for Alzheimer’s Detec...IRJET Journal
 
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESVikash Kumar
 
Ai project report
Ai project reportAi project report
Ai project reportsanjeevtmk
 
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...gerogepatton
 

Similar to Classification of Headache Disorder Using Random Forest Algorithm.pptx (20)

Diabetes Prediction using Machine Learning Algorithms
Diabetes Prediction using Machine Learning AlgorithmsDiabetes Prediction using Machine Learning Algorithms
Diabetes Prediction using Machine Learning Algorithms
 
Disease Prediction Using Machine Learning
Disease Prediction Using Machine LearningDisease Prediction Using Machine Learning
Disease Prediction Using Machine Learning
 
Module 5: Decision Trees
Module 5: Decision TreesModule 5: Decision Trees
Module 5: Decision Trees
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
 
Machine learning session6(decision trees random forrest)
Machine learning   session6(decision trees random forrest)Machine learning   session6(decision trees random forrest)
Machine learning session6(decision trees random forrest)
 
An efficient convolutional neural network-based classifier for an imbalanced ...
An efficient convolutional neural network-based classifier for an imbalanced ...An efficient convolutional neural network-based classifier for an imbalanced ...
An efficient convolutional neural network-based classifier for an imbalanced ...
 
Prediction of Dengue, Diabetes and Swine Flu using Random Forest Classificati...
Prediction of Dengue, Diabetes and Swine Flu using Random Forest Classificati...Prediction of Dengue, Diabetes and Swine Flu using Random Forest Classificati...
Prediction of Dengue, Diabetes and Swine Flu using Random Forest Classificati...
 
IRJET- Prediction of Autism Spectrum Disorder using Deep Learning: A Survey
IRJET- Prediction of Autism Spectrum Disorder using Deep Learning: A SurveyIRJET- Prediction of Autism Spectrum Disorder using Deep Learning: A Survey
IRJET- Prediction of Autism Spectrum Disorder using Deep Learning: A Survey
 
ASA.pptx
ASA.pptxASA.pptx
ASA.pptx
 
Alzheimer Disease Prediction using Machine Learning Algorithms
Alzheimer Disease Prediction using Machine Learning AlgorithmsAlzheimer Disease Prediction using Machine Learning Algorithms
Alzheimer Disease Prediction using Machine Learning Algorithms
 
Health Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep LearningHealth Care Application using Machine Learning and Deep Learning
Health Care Application using Machine Learning and Deep Learning
 
first review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbh
first review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbhfirst review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbh
first review.pptxgghggggvvvvbbvvvvvhhjjjbbvvvvbbbbbhhhhhhhhhbbh
 
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
IRJET- Hybrid Architecture of Heart Disease Prediction System using Genetic N...
 
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
Utilizing Machine Learning, Detect Chronic Kidney Disease and Suggest A Healt...
 
Simplified Knowledge Prediction: Application of Machine Learning in Real Life
Simplified Knowledge Prediction: Application of Machine Learning in Real LifeSimplified Knowledge Prediction: Application of Machine Learning in Real Life
Simplified Knowledge Prediction: Application of Machine Learning in Real Life
 
Comparative study of various supervisedclassification methodsforanalysing def...
Comparative study of various supervisedclassification methodsforanalysing def...Comparative study of various supervisedclassification methodsforanalysing def...
Comparative study of various supervisedclassification methodsforanalysing def...
 
IRJET- Comparative Study of Machine Learning Models for Alzheimer’s Detec...
IRJET-  	  Comparative Study of Machine Learning Models for Alzheimer’s Detec...IRJET-  	  Comparative Study of Machine Learning Models for Alzheimer’s Detec...
IRJET- Comparative Study of Machine Learning Models for Alzheimer’s Detec...
 
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
 
Ai project report
Ai project reportAi project report
Ai project report
 
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
A MODIFIED MAXIMUM RELEVANCE MINIMUM REDUNDANCY FEATURE SELECTION METHOD BASE...
 

Recently uploaded

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 

Recently uploaded (20)

TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 

Classification of Headache Disorder Using Random Forest Algorithm.pptx

  • 1. SCHOOL OF ELECTRICAL ENGINEERING AND COMPUTING DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING ARTICLE CRITICAL REVIEW CLASSIFICATION OF HEADACHE DISORDER USING RANDOM FOREST ALGORITHM PRESENTER: ALEMU GUDETA 1
  • 2. About Article Authors Dhiyaussalam, Adi Wibowo, Fajar Agung Nugroho, Eko Adi Sarwoko, I Made Agus Setiawan 2020 4th International Conference on Informatics and Computational Sciences (ICICoS) Publisher - IEEE Classification of Headache Disorder Using Random Forest Algorithm 2
  • 3. General Summary  Headache disorder is one of the most common illnesses.  The purpose of this article is to help anyone figure out what kind of headache they are experiencing at home without seeking medical attention.  The majority of headache sufferers prefer to treat themselves at home when they feel they have a headache.  Model for categorizing the different forms of headaches and generating feature importance was created using the Random Forest algorithm.  To create the best model, they used the Migbase dataset and adjusted numerous algorithmic parameters. The 850 data in the Migbase dataset have 39 features. Also it has three class labels for migraine, tension, and cluster, with respective percentages of 71.73%, 21.67%, and 6.60%. 3
  • 4. Key Findings  Random Forest - an ensemble learning method for classification, regression and other tasks that operates by constructing a multitude of decision trees at training time.  N_estimators - used to determine the number of trees contained in the Random Forest model.  Max_features - used to determine the maximum number of features used.  Max_depth - used to determine the maximum depth of a tree.  Bagging - Bootstrapping(Feature extraction) + Aggregation (Combining result and picking the dominant)  Gini impurity - a measurement of how evenly class labels are distributed among nodes. 4
  • 5. Research Gap  Particularly in a developing nation, the patient does not need to see a doctor for a headache. However, as the headache wears on, negative effects may result.  However, utilizing a computer, those who are experiencing primary headaches can independently analyze their condition and choose the best course of treatment.  In some cases, it may be challenging for the doctors to diagnose the symptoms. In this approach, a headache type can be identified by the random forest algorithm with greater accuracy than by a doctor. 5
  • 6. Objective / Purpose  The article's goal is to provide the optimal model for classifying different types of headaches using a dataset that contains information on experiencing symptoms.  Different scholars used various techniques, including decision trees, to attempt to solve the same issue. The authors of this research employed a random forest method to classify the different types of headaches.  The created model aids the person with a headache in determining its nature and initiating self- care measures. 6
  • 7. Methodology  Data Collection - Migbase dataset is downloaded from the internet with 850 rows and 39 features.  Algorithm – Random Forest. RF combines a number of Decision Trees to produce classification and regression results. RF is less likely to overfitting.  Gini Impurity - Calculate how class labels are distributed among a nodes.  t = node j = number of children at node t, 𝑛𝑐𝑖 is the number of samples with the value 𝑥𝑖 belonging to class 𝑐, 𝑚𝑖 is the number of samples with the value 𝑥𝑖 at node 𝑡 7
  • 8. Methodology … cntd  Feature importance - calculated from the average impurity reduction of all the Decision Tree in a Random Forest without assuming whether the data used is linearly separated or not.  𝐹𝐼 is feature-𝑖 in a Decision Tree, and 𝑘 is represent all of node.  Data Processing - feature extraction done manually and correlation matrix of the target label is also used to calculate the feature importance. Correlation values outside the range between -0.5 to 0.5 are removed. 8
  • 9. Methodology … cntd  Model Development - The classification model using the Random Forest algorithm in this study was built based on the Scikit-learn library documentation.  Model Evaluation - The model will be evaluated using 10 data with the contents of 4 classes of migraine, 3 classes of tension, and 3 classes of clusters. 9 Confusion matrix of test result on one of model
  • 10. Methodology … cntd  Parameter Optimization  To get the best performance results from the Random Forest model, setting-ups some parameter need to be done manually.  Manually setting the number of trees or n_estimators can reduce the error rate on the performance of a Random Forest model.  Accordingly, for n_estimators the values to be used are 10, 20, 50, and 100, Max_depth the values to be used are 4, 5, and none (no limitation), and for other parameters that will be set manually is the Max_features parameter with 3 choices: 6, 14, or 33. 10
  • 12. Key Findings  By adjusting N_estimators, Max_features and Max_depth authors achieved the following performance accuracy. 12
  • 13. Contributions  The article assessed the effectiveness of the random forest algorithm to classify the types of headaches that various writers were unable get the specified performance accuracy.  Vandewiele et al., (2018) , Krawczyk et al., (2013) , Aljaaf et al., (2015) conducted a research ad suggested decision tree. But these authors got better performance accuracy compared with those authors.  Practically, it made life easier and increased societal awareness of a person's ability to diagnose their headaches at home. 13
  • 14. Critique  The majority of the data processing in machine learning, including the amount of features, depth, and estimators to use, is done manually.  Additionally, the authors made no suggestions or indicated what they would do to enhance the paper in the future.  They weren't anticipating any new symptoms, which may have appeared at any time. In this situation, the model's performance will suffer. 14
  • 15. Conclusion  According to the authors the model performs better than the model trained before using decision tree on same dataset, by researchers Vandewiele et al., (2018) , Krawczyk et al., (2013) , Aljaaf et al., (2015)  The highest performance resulted in an accuracy rating of 99.56%, and the lowest performance resulted in a value of 97.79% by adjusting n_estimators, max features, and max depth were 100, 33, and 5 respectively. 15