SlideShare a Scribd company logo
1 of 14
InternationalConference On Distributed Computing And Electrical Circuits And Electronics
(ICDCECE-2022)
organizes
Ballari Institute Of Technology and Management, Ballari
(Autonomous Institute under VTU, Belagavi | Approved by AICTE, New Delhi | Recognized by Govt. of Karnataka )
Technical Co-Sponsor
Predicting Life Expectancy of
Hepatitis B Patients using Machine
Learning
1. Nabeel Ali BBDITM
2. Dolley Srivastava BBDITM
3. Aditya Tiwari BBDITM
4. Akash Pandey BBDITM
5. Akshat Sahu BBDITM
6. Abhay Kumar Pandey BBDITM
Contents
Abstract
Introduction
Literature Survey
Problem Definition
Proposed Work
Methodology & Implementation
Results & Discussions
Conclusion
References
• The goal of this work is to find the best tool for predicting the life
expectancy of people with Hepatitis B.
• Different Machine Learning methods have been completely studied and
various Machine Learning methods have been carried out by different
experimenters all over the world.
• The Machine Learning models and algorithms such as the Classification
model, Logistic Regression model, Recursive Feature Elimination
Algorithm, Cirrhosis Mortality model, Extreme Gradient Boosting,
Random Forest, Decision Tree have been utilized by different researchers
to predict the life expectancy of Hepatitis B patients.
Abstract
• Life expectation is the number of years a person is projected to live based
on the statistical normal.
• Hepatitis B, is one of the severe disorders that compromises the liver's
functions. The presence of infection in the liver is the main cause of
Hepatitis B symptoms.
• Hepatitis B symptoms include yellowing of the eyes, stomach pain, and
black urine, among others.
• The two most crucial elements for predicting the life expectancy of a
patient with any disease are:-
i. the selection of appropriate parameters and
ii. proper data analysis with skilled knowledge.
Introduction
• Various queries have been made in this field for opinions and prediction of
circumstances, and patient’s life expectancy.
• Tao Wang used ancient statistical methods to research and construct a model
of chronic hepatitis B carriers' life expectancy.
• Somaya et al. calculated various machine learning methods in the prediction of
advanced fibrosis in chronic Hepatitis C cases using serum biomarkers.
• Mingxue Yu et al. developed and validated a predictive model for the
prediction of Chronic liver failure in chronic Hepatitis B cases using a recursive
feature elimination technique.
• Xiaolu Tian et al. used multiple machine learning methods to predict the
possibility of Hepatitis B Surface Antigen Seroclearance in hepatitis B patients.
Literature Survey
• Supervised data mining techniques have been successful in hepatitis
disease diagnosis through a set of datasets.
• Many methods have been developed by the aids of data mining
techniques for hepatitis disease diagnosis.
• The majority of these methods are developed by single learning
techniques. In addition, these methods do not support the ensemble
learning of the data.
• Combining the outputs of several predictors can result in improved
accuracy in classification problems.
Problem Definition
• In this study, we will compare and evaluate the usefulness of different
machine learning techniques in predicting life expectancy of Hepatitis B
patients by developing classification models.
• Logical Regression (LR), Decision tree (DT), and K-Nearest Neighbour
(KNN) models for prediction were be developed.
• The proposed models should be easy to perform, inexpensive, and give
numerical and accurate results in real time. These models will predict the
life expectancy of patients with high accuracy.
Proposed Work
Tecniques Involved are:-
• EDA :-Exploratory Data Analysis is the process of investigating the dataset to discover
patterns, and anomalies (outliers), and form hypotheses based on our understanding of
the dataset.
• Outlier Detection:- Outliers are observations in a dataset that don't fit in some way.
They can skew statistical measures and data distributions andmislead representation of
the underlying data and relationships.
• Feature Selection:-Feature selection is the process of reducing the number of input
variables when developing a predictive model. It involve evaluating the relationship
between each input variable and the target variable using statistics and selecting those
input variables that have the strongest relationship with the target variable.
• RFE:-Recursive Feature Elimination is used for selecting those features (columns) in a
training dataset that are more or most relevant in predicting the target variable.
Methodology & Implementation
• Extra Tree Classifier :-Extremely Randomized Trees Classifier (Extra Trees Classifier) is a
type of ensemble learning technique which aggregates the results of multiple de-
correlated decision trees collected in a “forest” to output its classification result.
• Confusion Matrix :-The confusion matrix is a matrix used to determine the
performance of the classification models for a given set of test data. It can only be
determined if the true values for test data are known.
Cont.
• Data was collected from various online medical records and by surveying patients
suffering from Hepatitis B with different backgrounds. Data was thoroughly analyzed
and cleaned before using.
• Data collection included demographics, age, sex, use of steroid, antivirals, fatigue,
malaise, anorexia, liver big, liver firm, spleen palpable, spiders, ascites, varices,
bilirubin, alk phosphate, sgot, albumin, protime, histology.
• To explore the predictive power of individual variables, we first developed a
univariate logistic model for each variable.
• The Machine Learning algorithms such as Logistic Regression (LR), K Nearest
Neighbour (KNN), and Decision Tree were considered as the classification and
prediction tools for predicting the life expectancy of Hepatitis B patients.
• These models were trained and tested on the best 14 variables selected.
• The Logistic Regression showed an accuracy score of 0.72 while KNN and Decision
Tree showed similar accuracy score of 0.74.
Results & Discussions
• Logistic regression model is a classic statistical classification method. It investigates
the correlation between binary-dependent variable and -independent variables by
estimating probabilities using a logistic function.
• Decision tree is a nonparametric supervised learning method used for classification
and regression that uses a tree-like graph or model of decision to predict the value of
a target variable by learning simple decision rules inferred from the data features.
• K-nearest neighbors (KNN) algorithm uses ‘feature similarity’ to predict the values of
new datapoints which further means that the new data point will be assigned a value
based on how closely it matches the points in the training set.
• These models were trained and tested on the best 14 variables selected.
Contd.
• All of the models had reasonable estimations.
• All the three models Logistic Regression, KNN and Decision Tree showed almost
similar accuracy scores based on the best features available.
• Logistic regression model is a classic statistical classification method. It investigates
the correlation between binary-dependent variable and -independent variables by
estimating probabilities using a logistic function. It showed an accuracy of 72%.
• Decision tree is a nonparametric supervised learning method used for classification
and regression that uses a tree-like graph or model of decision to predict the value of
a target variable by learning simple decision rules inferred from the data features. It
showed an accuracy of 74%.
• K-nearest neighbors (KNN) algorithm uses ‘feature similarity’ to predict the values of
new datapoints which further means that the new data point will be assigned a value
based on how closely it matches the points in the training set. It also showed an
accuracy of 74%.
Conclusion
• Tao Wang [2009]. Model of Life
Expectancy of Chronic Hepatitis B
Carriers in an Endemic Region.
Journal of Epidemiology.[1]
• Brent C. Taylor [2009]. Clinical
Outcomes in Adults with Chronic
Hepatitis B in Association with
Patient and Viral Characteristics.
Hepatology Communications.[2]
• Mamta K. Jain [2009]. Mortality
in Patients Coinfected with
Hepatitis B Virus and HIV. Clinical
Infectious Diseases.[3]
• J.Wolfson [2015]. A Naïve Bayes
machine learning approach to
risk prediction using censored,
time-to-event data. US National
Library of Medicine National
Institutes of Health (NCBI).[4]
• Somaya Hashem, Gamal Esmat
[2017]. Comparison of Machine
Learning Approaches for
Prediction of Advanced Liver
Fibrosis in Chronic Hepatitis C
Patients. IEEE/ACM transactions
on computational biology and
bioinformatics / IEEE, ACM.[5]
• Yaming Zhang [2018]. Modeling
for the prediction of Hepatitis B
incidence based on integrated
online search indexes.
Informatics in Medicine
Unlocked.[6]
• Xiaolu Tian, Yutian Chong [2019].
Using Machine Learning
Algorithms to Predict Hepatitis B
Surface Antigen Seroclearance.
Hindawi Computational and
Mathematical Methods in
Medicine.[7]
• Hailemichael Desalegn [2019].
Predictors of mortality in patients
under treatment for chronic
hepatitis B in Ethiopia. BMC
Gastroenterology. [8]
• Fasiha Kanwal, MD, MSHS;
Thomas J. Taylor, PhD [2020].
Development, Validation, and
Evaluation of a Simple Machine
Learning Model to Predict
Cirrhosis Mortality. JAMA
Network Open.[9]
• Mingxue Yu, Xiangyong Li [2021].
Development and Validation of a
Novel Risk Prediction Model
UsingRecursive Feature
Elimination Algorithm for Acute-
on- Chronic Liver Failure in
Chronic Hepatitis B Patients with
Severe Acute Exacerbation.
Frontiers in Medicine.[10]
References
Thank You

More Related Content

Similar to Predicting Life Expectancy of Hepatitis B Patients

EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...IJDKP
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...IJDKP
 
methods of randomization.pptx
methods of randomization.pptxmethods of randomization.pptx
methods of randomization.pptxNehagurbani
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillanceJoel Saltz
 
Enhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEnhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEditor IJCATR
 
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...Editor IJCATR
 
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...Editor IJCATR
 
Evaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEvaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEwout Steyerberg
 
Narrative review | Systematic review | Data extraction
Narrative review | Systematic review | Data extractionNarrative review | Systematic review | Data extraction
Narrative review | Systematic review | Data extractionPubrica
 
Systematic Review & Meta Analysis.pptx
Systematic Review & Meta Analysis.pptxSystematic Review & Meta Analysis.pptx
Systematic Review & Meta Analysis.pptxDr. Anik Chakraborty
 
Hybrid filtering methods for feature selection in high-dimensional cancer data
Hybrid filtering methods for feature selection in high-dimensional cancer dataHybrid filtering methods for feature selection in high-dimensional cancer data
Hybrid filtering methods for feature selection in high-dimensional cancer dataIJECEIAES
 
Meta analysis
Meta analysisMeta analysis
Meta analysisJunaidAKG
 
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...IRJET Journal
 
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...mlaij
 
Leaf Disease Detection at Different Stages with Percentage estimation and Su...
Leaf Disease Detection at Different Stages with Percentage  estimation and Su...Leaf Disease Detection at Different Stages with Percentage  estimation and Su...
Leaf Disease Detection at Different Stages with Percentage estimation and Su...Hemant Kumar Gurjar
 
Data Integrity in Decentralized Clinical Trials (DCTs)
Data Integrity in Decentralized Clinical Trials (DCTs)Data Integrity in Decentralized Clinical Trials (DCTs)
Data Integrity in Decentralized Clinical Trials (DCTs)InsideScientific
 

Similar to Predicting Life Expectancy of Hepatitis B Patients (20)

EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
EFFICACY OF NON-NEGATIVE MATRIX FACTORIZATION FOR FEATURE SELECTION IN CANCER...
 
Comparison of breast cancer classification models on Wisconsin dataset
Comparison of breast cancer classification models on Wisconsin  datasetComparison of breast cancer classification models on Wisconsin  dataset
Comparison of breast cancer classification models on Wisconsin dataset
 
methods of randomization.pptx
methods of randomization.pptxmethods of randomization.pptx
methods of randomization.pptx
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
 
Enhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication NetworksEnhanced Detection System for Trust Aware P2P Communication Networks
Enhanced Detection System for Trust Aware P2P Communication Networks
 
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
Comparative Study of Diabetic Patient Data’s Using Classification Algorithm i...
 
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
C omparative S tudy of D iabetic P atient D ata’s U sing C lassification A lg...
 
Evaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEvaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk prediction
 
Narrative review | Systematic review | Data extraction
Narrative review | Systematic review | Data extractionNarrative review | Systematic review | Data extraction
Narrative review | Systematic review | Data extraction
 
Systematic Review & Meta Analysis.pptx
Systematic Review & Meta Analysis.pptxSystematic Review & Meta Analysis.pptx
Systematic Review & Meta Analysis.pptx
 
Hybrid filtering methods for feature selection in high-dimensional cancer data
Hybrid filtering methods for feature selection in high-dimensional cancer dataHybrid filtering methods for feature selection in high-dimensional cancer data
Hybrid filtering methods for feature selection in high-dimensional cancer data
 
cadd.pptx
cadd.pptxcadd.pptx
cadd.pptx
 
Meta analysis
Meta analysisMeta analysis
Meta analysis
 
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
IRJET - Breast Cancer Prediction using Supervised Machine Learning Algorithms...
 
Regenstrief WIP 07012015
Regenstrief WIP 07012015Regenstrief WIP 07012015
Regenstrief WIP 07012015
 
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
Machine Learning Based Approaches for Cancer Classification Using Gene Expres...
 
Leaf Disease Detection at Different Stages with Percentage estimation and Su...
Leaf Disease Detection at Different Stages with Percentage  estimation and Su...Leaf Disease Detection at Different Stages with Percentage  estimation and Su...
Leaf Disease Detection at Different Stages with Percentage estimation and Su...
 
Data Integrity in Decentralized Clinical Trials (DCTs)
Data Integrity in Decentralized Clinical Trials (DCTs)Data Integrity in Decentralized Clinical Trials (DCTs)
Data Integrity in Decentralized Clinical Trials (DCTs)
 
BFRG AI Investor Aug 2023
BFRG AI Investor Aug 2023BFRG AI Investor Aug 2023
BFRG AI Investor Aug 2023
 

Recently uploaded

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...ranjana rawat
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 

Recently uploaded (20)

Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
(SHREYA) Chakan Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Esc...
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINEDJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
DJARUM4D - SLOT GACOR ONLINE | SLOT DEMO ONLINE
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 

Predicting Life Expectancy of Hepatitis B Patients

  • 1. InternationalConference On Distributed Computing And Electrical Circuits And Electronics (ICDCECE-2022) organizes Ballari Institute Of Technology and Management, Ballari (Autonomous Institute under VTU, Belagavi | Approved by AICTE, New Delhi | Recognized by Govt. of Karnataka ) Technical Co-Sponsor Predicting Life Expectancy of Hepatitis B Patients using Machine Learning 1. Nabeel Ali BBDITM 2. Dolley Srivastava BBDITM 3. Aditya Tiwari BBDITM 4. Akash Pandey BBDITM 5. Akshat Sahu BBDITM 6. Abhay Kumar Pandey BBDITM
  • 2. Contents Abstract Introduction Literature Survey Problem Definition Proposed Work Methodology & Implementation Results & Discussions Conclusion References
  • 3. • The goal of this work is to find the best tool for predicting the life expectancy of people with Hepatitis B. • Different Machine Learning methods have been completely studied and various Machine Learning methods have been carried out by different experimenters all over the world. • The Machine Learning models and algorithms such as the Classification model, Logistic Regression model, Recursive Feature Elimination Algorithm, Cirrhosis Mortality model, Extreme Gradient Boosting, Random Forest, Decision Tree have been utilized by different researchers to predict the life expectancy of Hepatitis B patients. Abstract
  • 4. • Life expectation is the number of years a person is projected to live based on the statistical normal. • Hepatitis B, is one of the severe disorders that compromises the liver's functions. The presence of infection in the liver is the main cause of Hepatitis B symptoms. • Hepatitis B symptoms include yellowing of the eyes, stomach pain, and black urine, among others. • The two most crucial elements for predicting the life expectancy of a patient with any disease are:- i. the selection of appropriate parameters and ii. proper data analysis with skilled knowledge. Introduction
  • 5. • Various queries have been made in this field for opinions and prediction of circumstances, and patient’s life expectancy. • Tao Wang used ancient statistical methods to research and construct a model of chronic hepatitis B carriers' life expectancy. • Somaya et al. calculated various machine learning methods in the prediction of advanced fibrosis in chronic Hepatitis C cases using serum biomarkers. • Mingxue Yu et al. developed and validated a predictive model for the prediction of Chronic liver failure in chronic Hepatitis B cases using a recursive feature elimination technique. • Xiaolu Tian et al. used multiple machine learning methods to predict the possibility of Hepatitis B Surface Antigen Seroclearance in hepatitis B patients. Literature Survey
  • 6. • Supervised data mining techniques have been successful in hepatitis disease diagnosis through a set of datasets. • Many methods have been developed by the aids of data mining techniques for hepatitis disease diagnosis. • The majority of these methods are developed by single learning techniques. In addition, these methods do not support the ensemble learning of the data. • Combining the outputs of several predictors can result in improved accuracy in classification problems. Problem Definition
  • 7. • In this study, we will compare and evaluate the usefulness of different machine learning techniques in predicting life expectancy of Hepatitis B patients by developing classification models. • Logical Regression (LR), Decision tree (DT), and K-Nearest Neighbour (KNN) models for prediction were be developed. • The proposed models should be easy to perform, inexpensive, and give numerical and accurate results in real time. These models will predict the life expectancy of patients with high accuracy. Proposed Work
  • 8. Tecniques Involved are:- • EDA :-Exploratory Data Analysis is the process of investigating the dataset to discover patterns, and anomalies (outliers), and form hypotheses based on our understanding of the dataset. • Outlier Detection:- Outliers are observations in a dataset that don't fit in some way. They can skew statistical measures and data distributions andmislead representation of the underlying data and relationships. • Feature Selection:-Feature selection is the process of reducing the number of input variables when developing a predictive model. It involve evaluating the relationship between each input variable and the target variable using statistics and selecting those input variables that have the strongest relationship with the target variable. • RFE:-Recursive Feature Elimination is used for selecting those features (columns) in a training dataset that are more or most relevant in predicting the target variable. Methodology & Implementation
  • 9. • Extra Tree Classifier :-Extremely Randomized Trees Classifier (Extra Trees Classifier) is a type of ensemble learning technique which aggregates the results of multiple de- correlated decision trees collected in a “forest” to output its classification result. • Confusion Matrix :-The confusion matrix is a matrix used to determine the performance of the classification models for a given set of test data. It can only be determined if the true values for test data are known. Cont.
  • 10. • Data was collected from various online medical records and by surveying patients suffering from Hepatitis B with different backgrounds. Data was thoroughly analyzed and cleaned before using. • Data collection included demographics, age, sex, use of steroid, antivirals, fatigue, malaise, anorexia, liver big, liver firm, spleen palpable, spiders, ascites, varices, bilirubin, alk phosphate, sgot, albumin, protime, histology. • To explore the predictive power of individual variables, we first developed a univariate logistic model for each variable. • The Machine Learning algorithms such as Logistic Regression (LR), K Nearest Neighbour (KNN), and Decision Tree were considered as the classification and prediction tools for predicting the life expectancy of Hepatitis B patients. • These models were trained and tested on the best 14 variables selected. • The Logistic Regression showed an accuracy score of 0.72 while KNN and Decision Tree showed similar accuracy score of 0.74. Results & Discussions
  • 11. • Logistic regression model is a classic statistical classification method. It investigates the correlation between binary-dependent variable and -independent variables by estimating probabilities using a logistic function. • Decision tree is a nonparametric supervised learning method used for classification and regression that uses a tree-like graph or model of decision to predict the value of a target variable by learning simple decision rules inferred from the data features. • K-nearest neighbors (KNN) algorithm uses ‘feature similarity’ to predict the values of new datapoints which further means that the new data point will be assigned a value based on how closely it matches the points in the training set. • These models were trained and tested on the best 14 variables selected. Contd.
  • 12. • All of the models had reasonable estimations. • All the three models Logistic Regression, KNN and Decision Tree showed almost similar accuracy scores based on the best features available. • Logistic regression model is a classic statistical classification method. It investigates the correlation between binary-dependent variable and -independent variables by estimating probabilities using a logistic function. It showed an accuracy of 72%. • Decision tree is a nonparametric supervised learning method used for classification and regression that uses a tree-like graph or model of decision to predict the value of a target variable by learning simple decision rules inferred from the data features. It showed an accuracy of 74%. • K-nearest neighbors (KNN) algorithm uses ‘feature similarity’ to predict the values of new datapoints which further means that the new data point will be assigned a value based on how closely it matches the points in the training set. It also showed an accuracy of 74%. Conclusion
  • 13. • Tao Wang [2009]. Model of Life Expectancy of Chronic Hepatitis B Carriers in an Endemic Region. Journal of Epidemiology.[1] • Brent C. Taylor [2009]. Clinical Outcomes in Adults with Chronic Hepatitis B in Association with Patient and Viral Characteristics. Hepatology Communications.[2] • Mamta K. Jain [2009]. Mortality in Patients Coinfected with Hepatitis B Virus and HIV. Clinical Infectious Diseases.[3] • J.Wolfson [2015]. A Naïve Bayes machine learning approach to risk prediction using censored, time-to-event data. US National Library of Medicine National Institutes of Health (NCBI).[4] • Somaya Hashem, Gamal Esmat [2017]. Comparison of Machine Learning Approaches for Prediction of Advanced Liver Fibrosis in Chronic Hepatitis C Patients. IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM.[5] • Yaming Zhang [2018]. Modeling for the prediction of Hepatitis B incidence based on integrated online search indexes. Informatics in Medicine Unlocked.[6] • Xiaolu Tian, Yutian Chong [2019]. Using Machine Learning Algorithms to Predict Hepatitis B Surface Antigen Seroclearance. Hindawi Computational and Mathematical Methods in Medicine.[7] • Hailemichael Desalegn [2019]. Predictors of mortality in patients under treatment for chronic hepatitis B in Ethiopia. BMC Gastroenterology. [8] • Fasiha Kanwal, MD, MSHS; Thomas J. Taylor, PhD [2020]. Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality. JAMA Network Open.[9] • Mingxue Yu, Xiangyong Li [2021]. Development and Validation of a Novel Risk Prediction Model UsingRecursive Feature Elimination Algorithm for Acute- on- Chronic Liver Failure in Chronic Hepatitis B Patients with Severe Acute Exacerbation. Frontiers in Medicine.[10] References