ANALYZING LIVER DISEASE IN
INDIAN PATIENTS PROJECT
• Liver disease is a significant health concern globally, and in
India, it poses a substantial burden on public health.
• In this context, this modeling project aims to analyze a
dataset specific to liver disease in Indian patients,
leveraging advanced machine learning techniques.
• We're diving into a project to understand and tackle liver
disease among people in India.
• The goal? To predict and catch liver problems early, so
doctors can help people better.
• Aspartate Aminotransferase
• Total Proteins
• Albumin
• Albumin and Globulin Ratio
• Total Bilirubin
• Direct Bilirubin
• Alkaline Phosphatase
• Alanine Aminotransferase
OVERVIEW:
• This data set contains 416 liver patient records and
167 non-liver patient records collected from the north
east of Andhra Pradesh, India.
• Based on chemical compounds present in the human body
(bilirubin, albumin, proteins, alkaline phosphatase)
and tests such as SGOT and SGPT, the outcome
indicates whether a person is a liver patient, i.e.,
needs to be diagnosed, or not.
• The roadmap is clear: analyze the data, build a
model, and pave the way for more effective liver
disease management across India. This project isn't
just about data; it's about making a positive impact
on people's lives.
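The record counts above imply an imbalanced target, which is worth keeping in mind when judging any model. A minimal sketch of the arithmetic, using only the two counts stated in the overview (variable names are illustrative):

```python
# Counts taken from the overview: 416 liver patients, 167 non-patients.
liver_patients = 416
non_patients = 167

total = liver_patients + non_patients          # 583 records in all
patient_share = liver_patients / total         # fraction labelled as patients

# Accuracy of a trivial model that always predicts the majority class.
majority_baseline = max(liver_patients, non_patients) / total

print(f"{total} records, {patient_share:.1%} liver patients")
print(f"Always predicting 'patient' scores {majority_baseline:.1%} accuracy")
```

Because a model that always answers "patient" is already right about 71% of the time, accuracy alone can look better than it is on this data.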
• CHECK NULL VALUES:
Dealing with missing or null values is a crucial aspect
of our analysis on liver disease in Indian patients.
I filled the null values with the mean value of the
affected column.
• CONVERTING CATEGORICAL VARIABLE
TO INDICATOR VARIABLE
• i.e., "gender"
• Many algorithms and statistical models require
numerical input. Converting the "gender" variable
into indicator variables transforms it into a format that
can be easily processed by these algorithms.
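The two preprocessing steps above could look roughly like this in pandas. This is a sketch on a tiny made-up table, not the project's actual code: the column names and values are assumptions chosen for illustration.

```python
import pandas as pd

# Toy records standing in for the real data (values and column names are assumed).
df = pd.DataFrame({
    "Age": [65, 62, 58, 72],
    "Gender": ["Female", "Male", "Male", "Female"],
    "Albumin_and_Globulin_Ratio": [0.5, None, 1.5, 1.0],
})

# Step 1: check how many null values each column has.
null_counts = df.isnull().sum()

# Step 2: fill nulls in numeric columns with that column's mean.
numeric_cols = df.select_dtypes(include="number").columns
df[numeric_cols] = df[numeric_cols].fillna(df[numeric_cols].mean())

# Step 3: turn the categorical "Gender" column into a 0/1 indicator column.
# drop_first=True keeps a single column ("Gender_Male") instead of two redundant ones.
df = pd.get_dummies(df, columns=["Gender"], drop_first=True, dtype=int)

print(null_counts)
print(df)
```

Keeping one indicator column rather than two avoids feeding a perfectly redundant pair of features to linear models such as logistic regression.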
MODEL SELECTION
1. LOGISTIC
REGRESSION:
• Logistic regression is a statistical method
for predicting outcomes with two
options, like yes or no. It uses the
logistic (sigmoid) function to estimate the
chance of an event happening
based on some factors. Handy
for making simple predictions!
• REASON:-
• Because the dataset is not massive, logistic
regression can be efficient. It doesn't
require a vast amount of data to perform
well, which can be beneficial when
working with medical data that might be
limited.
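The "formula" behind logistic regression is the sigmoid of a weighted sum of the input factors. A minimal sketch follows; the weights and bias are hypothetical numbers chosen for illustration and are not fitted to the liver data.

```python
import math

def predict_probability(features, weights, bias):
    """Return the estimated probability of the positive class."""
    # Weighted sum of the factors, then squashed into (0, 1) by the sigmoid.
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical coefficients, for illustration only.
weights = [0.8, -0.5]
bias = -0.2

p = predict_probability([1.0, 2.0], weights, bias)
label = int(p >= 0.5)  # usual rule: predict "yes" when the probability is at least 0.5

print(f"probability={p:.3f}, predicted label={label}")
```

In practice the weights are learned from the training records rather than set by hand.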
2. RANDOM FOREST:
• Random Forest is an ensemble algorithm
in which many decision trees, each trained on
a random sample of the data, vote on the
final prediction, leveraging the collective
wisdom of the "forest." It's great for
handling complex data and is often more
accurate than a single decision tree.
• REASON:-
• Random Forest's ability to handle complex
relationships, provide variable importance
insights, and robustly handle various data
challenges makes it a compelling choice for this
dataset.
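A random forest with variable importances could be fitted along these lines with scikit-learn. This is a sketch under assumptions: the data is synthetic (generated to have the same number of rows as the dataset, 583), and the feature count and hyperparameters are illustrative, not the project's settings.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the liver records: 583 rows, 10 made-up features.
X, y = make_classification(
    n_samples=583, n_features=10, n_informative=5, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# 100 decision trees, each trained on a bootstrap sample of the training set.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)

test_accuracy = forest.score(X_test, y_test)

# One importance score per feature; the scores sum to 1.
importances = forest.feature_importances_

print(f"test accuracy: {test_accuracy:.3f}")
for index, score in enumerate(importances):
    print(f"feature {index}: importance {score:.3f}")
```

On the real data, the importance scores would indicate which lab measurements the forest relies on most.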
3. SUPPORT VECTOR
MACHINES (SVM):
An SVM is a powerful algorithm for binary
classification. It's great for situations where
you want a clear boundary between two
groups; in our case, figuring out whether
someone might have a particular health
condition or not.
REASON:-
SVM can perform well even when the dataset is
relatively small. In medical research, where
obtaining extensive data can be challenging,
SVM's ability to handle smaller sample sizes is
beneficial.
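An SVM classifier could be set up as follows with scikit-learn. As before, this is a sketch on synthetic data, and the kernel, `C` value, and scaling step are assumptions rather than the project's configuration. Scaling is included because SVMs are sensitive to features measured on very different ranges.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the liver records: 583 rows, 10 made-up features.
X, y = make_classification(
    n_samples=583, n_features=10, n_informative=5, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Standardize the features, then fit an SVM with an RBF kernel.
svm = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
svm.fit(X_train, y_train)

predictions = svm.predict(X_test)
test_accuracy = svm.score(X_test, y_test)

print(f"test accuracy: {test_accuracy:.3f}")
```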
Predicting Liver Disease in India: A Machine Learning Approach
