SlideShare a Scribd company logo
Adverse effects of COVID-19
vaccination: Machine
learning and statistical
approach
Rama Irsheidat
01 02 03
04 05
TABLE OF CONTENTS
SECTION
Introduction
SECTION
Goal
SECTION
Methodology
SECTION
Conclusion
SECTION
Critique
INTRODUCTION
01.
Vaccination
Is a well-accepted reliable approach to
preventing diseases. It has proven to be one
of the most effective strategies to control
pandemics, such as the SARS-CoV-2 outbreak.
All vaccines result in at least a small number
of patients that demonstrate some kind of
post-vaccination side effects. Although
vaccines are life-saving medications, they can
sometimes result in an after-effect,
sometimes even resulting in severe
symptoms, although with a low probability.
GOAL
02.
● Identify and analyze the most probable causes in
the individuals' medical history that resulted in
adverse reactions to vaccination.
● Identify the key symptoms that indicate the cause
or causes of the adverse reactions.
● Developing the ML models for the prediction and
classification of individuals most susceptible to
the adverse effects of vaccination and therefore,
may require high medical attention.
METHODOLOGY
03.
LET’S START
Dataset
&
Feature extraction
STEP 1
Statistical methods
&
ML models
STEP 2
Technology
&
Evaluation metrics
STEP 3
Discussion
STEP 4
Result
STEP 5
● Vaccine Adverse Event Reporting System (VAERS) dataset
○ General data (Such as: Age, Sex, Current illness, Medical
history, Allergic history, etc.)
○ Vaccination status
○ Post-vaccination symptoms
● More than 354 thousand samples
● Individuals who have been vaccinated between 1st
January 2021 to 11th June 2021 and also reported adverse
reactions.
Dataset
● Convert features in textual format into attributes by
employing the String matching technique.
Feature extraction
● Diseases with greater than 500 counts in medical history
have been considered as attributes and the rest
neglected.
● Based on the frequency of symptoms, a total of 49
symptoms have been used here for the analysis.
Feature extraction
85
different features for over 354 thousand samples.
● Chi-square ( χ2 ) Test
● The test of independence analyses the association
between various attributes of the dataset and the
outcome (target).
● Target variables (Outcome)
○ Died
○ Hospitalized
○ COVID-19 positive
Statistical methods
● Logistic Regression (LR)
● Random Forest (RF)
● Naive Bayes (NB)
● Light Grading Boosting Machine (LGBM)
● Multilayer feed-forward Perceptron (MLP)
ML models
Logistic Regression (LR)
● Logistic regression is a predictive
analysis.
● Used to describe data and to explain the
relationship between one dependent
binary variable and one or more nominal,
ordinal, interval or ratio-level
independent variables.
● The Line is fit to the data using maximum
likelihood.
Light Gradient Boosting
Machine (LGBM)
● LightGBM is a gradient boosting algorithm that employs a tree-
based learning framework. As compared to other tree-based
frameworks, it grew trees vertically (leaf-wise) whereas, others
horizontally (level-wise).
Multilayer feed-forward
Perceptron (MLP)
● The MLP is a type of neural network which may
have one input layer, multiple hidden layers, and
one output layer.
 All these layers contain several artificial
neurons that have been connected with each
other in a unidirectional manner by mesh
arrangements.
● It is a mathematical model which aims to mimic
the functioning of human brains.
● Python
● Keras (Keras is an open-source software library that provides a
Python interface for artificial neural networks.)
Technology
Evaluation metrics
● The investigation has been carried out in three
scenarios:
○ Based on medical history only
○ Based on the reaction of vaccination only
○ Based on both medical history and adverse reaction
● The important contributing features have been identified
by both statistical analysis and developed ML models
(LR, RF, and LGBM) in all scenarios.
● All the developed ML models have been used to predict
the important key outcomes of interest (Death,
Hospitalization, and COVID-19 positive).
Discussion
Result
Died Hospitalized
COVID-19 positive
Result
Died Hospitalized
COVID-19 positive
Result
Died Hospitalized
COVID-19 positive
CONCLUSION
04.
● The people in the age group of 50–70 have been found as most susceptible to the SARS-CoV-2.
● The male population has been identified as more vulnerable than to female population.
● The population with a history of life-threatening diseases should be vaccinated in close
observation.
● The most common post-vaccination symptoms have been identified. Most of these major
symptoms have been found normal and do not indicate towards any sign of serious and
immediate concern.
Conclusion
CRITIQUE
05.
● Strengths :
○ This has been the very first study that analyses the impact of COVID-19 vaccines
employing more than 354 thousand samples.
● Weaknesses:
○ In dataset there was twice the number of female participants compared to male
participants and almost half of them were recorded as regularly taking other
medications.
● Limitations:
○ The study is conducted on USA population.
Critique
CREDITS: This presentation template was created by
Slidesgo, including icons by Flaticon, and
infographics & images by Freepik
THANKS!

More Related Content

Similar to Vaccine hesitancy in the post‑vaccination COVID‑19 era: a machine learning and statistical analysis driven study

Journal Club: 2015 August; START study
Journal Club: 2015 August; START studyJournal Club: 2015 August; START study
Journal Club: 2015 August; START study
Sri Lanka College of Sexual Health and HIV Medicine
 
Introduction to Pharmacoepidemiology
Introduction to PharmacoepidemiologyIntroduction to Pharmacoepidemiology
Introduction to Pharmacoepidemiology
Perficient
 
RSS 2009 - Investigating the impact of the QOF on quality of primary care
RSS 2009 - Investigating the impact of the QOF on quality of primary careRSS 2009 - Investigating the impact of the QOF on quality of primary care
RSS 2009 - Investigating the impact of the QOF on quality of primary care
Evangelos Kontopantelis
 
Next-Generation Immuno-Oncology Biomarkers: Insights for Developing Companion...
Next-Generation Immuno-Oncology Biomarkers: Insights for Developing Companion...Next-Generation Immuno-Oncology Biomarkers: Insights for Developing Companion...
Next-Generation Immuno-Oncology Biomarkers: Insights for Developing Companion...
Andrew Aijian
 
Problem formulation considerations for externally applied dsRNA-based product...
Problem formulation considerations for externally applied dsRNA-based product...Problem formulation considerations for externally applied dsRNA-based product...
Problem formulation considerations for externally applied dsRNA-based product...
OECD Environment
 
Signal detection and their assessment in clinical trials
Signal detection and their assessment in clinical trialsSignal detection and their assessment in clinical trials
Signal detection and their assessment in clinical trials
ClinosolIndia
 
Progress in lupus trial design
Progress in lupus trial designProgress in lupus trial design
Progress in lupus trial design
Mark Abrahams, MD
 
Analytical epidemiology
Analytical  epidemiologyAnalytical  epidemiology
Analytical epidemiology
b_bhushan
 
12 Dr. Thomas Schreitmueller Roche
12 Dr. Thomas Schreitmueller   Roche12 Dr. Thomas Schreitmueller   Roche
Linked Open Data for Medical Guidelines Interactions
Linked Open Data for Medical  Guidelines InteractionsLinked Open Data for Medical  Guidelines Interactions
Linked Open Data for Medical Guidelines Interactions
Frank van Harmelen
 
Improving Patient Outcomes With Cancer Immunotherapies Throughout the Lung Ca...
Improving Patient Outcomes With Cancer Immunotherapies Throughout the Lung Ca...Improving Patient Outcomes With Cancer Immunotherapies Throughout the Lung Ca...
Improving Patient Outcomes With Cancer Immunotherapies Throughout the Lung Ca...
PVI, PeerView Institute for Medical Education
 
Evaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEvaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk prediction
Ewout Steyerberg
 
v2 3rd (11-13 June 2015) KNH and UON Conference-Research as a Driver for Scie...
v2 3rd (11-13 June 2015) KNH and UON Conference-Research as a Driver for Scie...v2 3rd (11-13 June 2015) KNH and UON Conference-Research as a Driver for Scie...
v2 3rd (11-13 June 2015) KNH and UON Conference-Research as a Driver for Scie...
Jomo Kenyatta University of Agriculture and Technology
 
Beating the Beast: Best Current Pharmacological Modalities for Treating Covid...
Beating the Beast: Best Current Pharmacological Modalities for Treating Covid...Beating the Beast: Best Current Pharmacological Modalities for Treating Covid...
Beating the Beast: Best Current Pharmacological Modalities for Treating Covid...
Imad Hassan
 
Oncol Lett Vol12 No6 Pg5043
Oncol Lett Vol12 No6 Pg5043Oncol Lett Vol12 No6 Pg5043
Oncol Lett Vol12 No6 Pg5043
Lenka Kellermann
 
Lung Cancer Risk Prediction Models
Lung Cancer Risk Prediction ModelsLung Cancer Risk Prediction Models
Lung Cancer Risk Prediction Models
ThaoNgo60
 
Blood cancer 19oct
Blood cancer 19octBlood cancer 19oct
Blood cancer 19oct
Zeena Nackerdien
 
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Thien Q. Tran
 
smoldering myeloma
smoldering myelomasmoldering myeloma
smoldering myeloma
spa718
 
Case control design
Case control designCase control design
Case control design
amitakashyap1
 

Similar to Vaccine hesitancy in the post‑vaccination COVID‑19 era: a machine learning and statistical analysis driven study (20)

Journal Club: 2015 August; START study
Journal Club: 2015 August; START studyJournal Club: 2015 August; START study
Journal Club: 2015 August; START study
 
Introduction to Pharmacoepidemiology
Introduction to PharmacoepidemiologyIntroduction to Pharmacoepidemiology
Introduction to Pharmacoepidemiology
 
RSS 2009 - Investigating the impact of the QOF on quality of primary care
RSS 2009 - Investigating the impact of the QOF on quality of primary careRSS 2009 - Investigating the impact of the QOF on quality of primary care
RSS 2009 - Investigating the impact of the QOF on quality of primary care
 
Next-Generation Immuno-Oncology Biomarkers: Insights for Developing Companion...
Next-Generation Immuno-Oncology Biomarkers: Insights for Developing Companion...Next-Generation Immuno-Oncology Biomarkers: Insights for Developing Companion...
Next-Generation Immuno-Oncology Biomarkers: Insights for Developing Companion...
 
Problem formulation considerations for externally applied dsRNA-based product...
Problem formulation considerations for externally applied dsRNA-based product...Problem formulation considerations for externally applied dsRNA-based product...
Problem formulation considerations for externally applied dsRNA-based product...
 
Signal detection and their assessment in clinical trials
Signal detection and their assessment in clinical trialsSignal detection and their assessment in clinical trials
Signal detection and their assessment in clinical trials
 
Progress in lupus trial design
Progress in lupus trial designProgress in lupus trial design
Progress in lupus trial design
 
Analytical epidemiology
Analytical  epidemiologyAnalytical  epidemiology
Analytical epidemiology
 
12 Dr. Thomas Schreitmueller Roche
12 Dr. Thomas Schreitmueller   Roche12 Dr. Thomas Schreitmueller   Roche
12 Dr. Thomas Schreitmueller Roche
 
Linked Open Data for Medical Guidelines Interactions
Linked Open Data for Medical  Guidelines InteractionsLinked Open Data for Medical  Guidelines Interactions
Linked Open Data for Medical Guidelines Interactions
 
Improving Patient Outcomes With Cancer Immunotherapies Throughout the Lung Ca...
Improving Patient Outcomes With Cancer Immunotherapies Throughout the Lung Ca...Improving Patient Outcomes With Cancer Immunotherapies Throughout the Lung Ca...
Improving Patient Outcomes With Cancer Immunotherapies Throughout the Lung Ca...
 
Evaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEvaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk prediction
 
v2 3rd (11-13 June 2015) KNH and UON Conference-Research as a Driver for Scie...
v2 3rd (11-13 June 2015) KNH and UON Conference-Research as a Driver for Scie...v2 3rd (11-13 June 2015) KNH and UON Conference-Research as a Driver for Scie...
v2 3rd (11-13 June 2015) KNH and UON Conference-Research as a Driver for Scie...
 
Beating the Beast: Best Current Pharmacological Modalities for Treating Covid...
Beating the Beast: Best Current Pharmacological Modalities for Treating Covid...Beating the Beast: Best Current Pharmacological Modalities for Treating Covid...
Beating the Beast: Best Current Pharmacological Modalities for Treating Covid...
 
Oncol Lett Vol12 No6 Pg5043
Oncol Lett Vol12 No6 Pg5043Oncol Lett Vol12 No6 Pg5043
Oncol Lett Vol12 No6 Pg5043
 
Lung Cancer Risk Prediction Models
Lung Cancer Risk Prediction ModelsLung Cancer Risk Prediction Models
Lung Cancer Risk Prediction Models
 
Blood cancer 19oct
Blood cancer 19octBlood cancer 19oct
Blood cancer 19oct
 
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
Deep learning in healthcare: Oppotunities and challenges with Electronic Medi...
 
smoldering myeloma
smoldering myelomasmoldering myeloma
smoldering myeloma
 
Case control design
Case control designCase control design
Case control design
 

Recently uploaded

Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 

Recently uploaded (20)

Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 

Vaccine hesitancy in the post‑vaccination COVID‑19 era: a machine learning and statistical analysis driven study

  • 1. Adverse effects of COVID-19 vaccination: Machine learning and statistical approach Rama Irsheidat
  • 2. 01 02 03 04 05 TABLE OF CONTENTS SECTION Introduction SECTION Goal SECTION Methodology SECTION Conclusion SECTION Critique
  • 4. Vaccination Is a well-accepted reliable approach to preventing diseases. It has proven to be one of the most effective strategies to control pandemics, such as the SARS-CoV-2 outbreak. All vaccines result in at least a small number of patients that demonstrate some kind of post-vaccination side effects. Although vaccines are life-saving medications, they can sometimes result in an after-effect, sometimes even resulting in severe symptoms, although with a low probability.
  • 6. ● Identify and analyze the most probable causes in the individuals' medical history that resulted in adverse reactions to vaccination. ● Identify the key symptoms that indicate the cause or causes of the adverse reactions. ● Developing the ML models for the prediction and classification of individuals most susceptible to the adverse effects of vaccination and therefore, may require high medical attention.
  • 8. LET’S START Dataset & Feature extraction STEP 1 Statistical methods & ML models STEP 2 Technology & Evaluation metrics STEP 3 Discussion STEP 4 Result STEP 5
  • 9. ● Vaccine Adverse Event Reporting System (VAERS) dataset ○ General data (Such as: Age, Sex, Current illness, Medical history, Allergic history, etc.) ○ Vaccination status ○ Post-vaccination symptoms ● More than 354 thousand samples ● Individuals who have been vaccinated between 1st January 2021 to 11th June 2021 and also reported adverse reactions. Dataset
  • 10. ● Convert features in textual format into attributes by employing the String matching technique. Feature extraction
  • 11. ● Diseases with greater than 500 counts in medical history have been considered as attributes and the rest neglected. ● Based on the frequency of symptoms, a total of 49 symptoms have been used here for the analysis. Feature extraction
  • 12. 85 different features for over 354 thousand samples.
  • 13.
  • 14. ● Chi-square ( χ2 ) Test ● The test of independence analyses the association between various attributes of the dataset and the outcome (target). ● Target variables (Outcome) ○ Died ○ Hospitalized ○ COVID-19 positive Statistical methods
  • 15. ● Logistic Regression (LR) ● Random Forest (RF) ● Naive Bayes (NB) ● Light Grading Boosting Machine (LGBM) ● Multilayer feed-forward Perceptron (MLP) ML models
  • 16. Logistic Regression (LR) ● Logistic regression is a predictive analysis. ● Used to describe data and to explain the relationship between one dependent binary variable and one or more nominal, ordinal, interval or ratio-level independent variables. ● The Line is fit to the data using maximum likelihood.
  • 17. Light Gradient Boosting Machine (LGBM) ● LightGBM is a gradient boosting algorithm that employs a tree- based learning framework. As compared to other tree-based frameworks, it grew trees vertically (leaf-wise) whereas, others horizontally (level-wise).
  • 18. Multilayer feed-forward Perceptron (MLP) ● The MLP is a type of neural network which may have one input layer, multiple hidden layers, and one output layer.  All these layers contain several artificial neurons that have been connected with each other in a unidirectional manner by mesh arrangements. ● It is a mathematical model which aims to mimic the functioning of human brains.
  • 19. ● Python ● Keras (Keras is an open-source software library that provides a Python interface for artificial neural networks.) Technology
  • 21. ● The investigation has been carried out in three scenarios: ○ Based on medical history only ○ Based on the reaction of vaccination only ○ Based on both medical history and adverse reaction ● The important contributing features have been identified by both statistical analysis and developed ML models (LR, RF, and LGBM) in all scenarios. ● All the developed ML models have been used to predict the important key outcomes of interest (Death, Hospitalization, and COVID-19 positive). Discussion
  • 22.
  • 25.
  • 28.
  • 32. ● The people in the age group of 50–70 have been found as most susceptible to the SARS-CoV-2. ● The male population has been identified as more vulnerable than to female population. ● The population with a history of life-threatening diseases should be vaccinated in close observation. ● The most common post-vaccination symptoms have been identified. Most of these major symptoms have been found normal and do not indicate towards any sign of serious and immediate concern. Conclusion
  • 34. ● Strengths : ○ This has been the very first study that analyses the impact of COVID-19 vaccines employing more than 354 thousand samples. ● Weaknesses: ○ In dataset there was twice the number of female participants compared to male participants and almost half of them were recorded as regularly taking other medications. ● Limitations: ○ The study is conducted on USA population. Critique
  • 35. CREDITS: This presentation template was created by Slidesgo, including icons by Flaticon, and infographics & images by Freepik THANKS!