Machine Learning on clinical datasets to predict the risk of chronic disease conditions like Type 2 Diabetes mellitus beforehand; as well as predicting outcomes like hospital readmission using EMR RWE data.
Existing model uses structured data to predict the patients of either high risk or low risk.
But for a complex disease, structured data is not a good way to describe the disease.
We propose a new convolutional neural network (CNN)-based multimodal disease risk prediction algorithm using structured and unstructured data from hospital.
In this paper, we mainly focus on the risk prediction of cerebral infarction.
Short presentation for a special lecture on Medicine Graduation Course in Hospital de Clínicas (https://www.hcpa.edu.br/), as part of a one-day special discipline on Machine Learning and Healthcare. The goal was introducing the importance of Deep Learning for Healthcare as well as showing some of the recent impact.
Prospects of Deep Learning in Medical ImagingGodswll Egegwu
A SEMINAR Presentation on the Prospects of Deep Learning in Medical Imaging Presented to the Department of Computer Science, Nasarawa State Polytechnic, Lafia.
BY:
EGEGWU, GODSWILL
08166643792
http://facebook.com/godswill.egegwu
http://egegwugodswill.name.ng
Women Who Code-HSV Event:
'An Introduction to Machine Learning and Genomics'. Dr. Lasseigne will introduce the R programming language and the foundational concepts of machine learning with real-world examples including applications in the field of genomics with an emphasis on complex human disease research.
Brittany Lasseigne, PhD, is a postdoctoral fellow in the lab of Dr. Richard Myers at the HudsonAlpha Institute for Biotechnology and a 2016-2017 Prevent Cancer Foundation Fellow. Dr. Lasseigne received a BS in biological engineering from the James Worth Bagley College of Engineering at Mississippi State University and a PhD in biotechnology science and engineering from The University of Alabama in Huntsville. As a graduate student, she studied the role of epigenetics and copy number variation in cancer, identifying novel diagnostic biomarkers and prognostic signatures associated with kidney cancer. In her current position, Dr. Lasseigne’s research focus is the application of genetics and genomics to complex human diseases. Her recent work includes the identification of gene variants linked to ALS, characterization of gene expression patterns in schizophrenia and bipolar disorder, and development of non-invasive biomarker assays. Dr. Lasseigne is currently focused on integrating genomic data across cancers with functional annotations and patient information to explore novel mechanisms in cancer etiology and progression, identify therapeutic targets, and understand genomic changes associated with patient survival. Based upon those analyses, she is creating tools to share with the scientific community.
Existing model uses structured data to predict the patients of either high risk or low risk.
But for a complex disease, structured data is not a good way to describe the disease.
We propose a new convolutional neural network (CNN)-based multimodal disease risk prediction algorithm using structured and unstructured data from hospital.
In this paper, we mainly focus on the risk prediction of cerebral infarction.
Short presentation for a special lecture on Medicine Graduation Course in Hospital de Clínicas (https://www.hcpa.edu.br/), as part of a one-day special discipline on Machine Learning and Healthcare. The goal was introducing the importance of Deep Learning for Healthcare as well as showing some of the recent impact.
Prospects of Deep Learning in Medical ImagingGodswll Egegwu
A SEMINAR Presentation on the Prospects of Deep Learning in Medical Imaging Presented to the Department of Computer Science, Nasarawa State Polytechnic, Lafia.
BY:
EGEGWU, GODSWILL
08166643792
http://facebook.com/godswill.egegwu
http://egegwugodswill.name.ng
Women Who Code-HSV Event:
'An Introduction to Machine Learning and Genomics'. Dr. Lasseigne will introduce the R programming language and the foundational concepts of machine learning with real-world examples including applications in the field of genomics with an emphasis on complex human disease research.
Brittany Lasseigne, PhD, is a postdoctoral fellow in the lab of Dr. Richard Myers at the HudsonAlpha Institute for Biotechnology and a 2016-2017 Prevent Cancer Foundation Fellow. Dr. Lasseigne received a BS in biological engineering from the James Worth Bagley College of Engineering at Mississippi State University and a PhD in biotechnology science and engineering from The University of Alabama in Huntsville. As a graduate student, she studied the role of epigenetics and copy number variation in cancer, identifying novel diagnostic biomarkers and prognostic signatures associated with kidney cancer. In her current position, Dr. Lasseigne’s research focus is the application of genetics and genomics to complex human diseases. Her recent work includes the identification of gene variants linked to ALS, characterization of gene expression patterns in schizophrenia and bipolar disorder, and development of non-invasive biomarker assays. Dr. Lasseigne is currently focused on integrating genomic data across cancers with functional annotations and patient information to explore novel mechanisms in cancer etiology and progression, identify therapeutic targets, and understand genomic changes associated with patient survival. Based upon those analyses, she is creating tools to share with the scientific community.
Machine Learning for Disease PredictionMustafa Oğuz
A great application field of machine learning is predicting diseases. This presentation introduces what is preventable diseases and deaths. Then examines three diverse papers to explain what has been done in the field and how the technology works. Finishes with future possibilities and enablers of the disease prediction technology.
Webinar on AI in Medical Diagnosis with Emerging TechnologiesBIS Research Inc.
Artificial Intelligence (AI) is a rapidly growing field that has the potential to revolutionize the way medical diagnosis is performed. AI systems use advanced algorithms to analyze large amounts of data, such as medical images, patient histories, and lab results, to identify patterns and make predictions about patient outcomes.
In medical diagnosis, AI can be used in a variety of ways, including medical imaging, disease diagnosis, personalized treatment planning, and early disease detection. By leveraging the power of AI, healthcare providers can make faster and more accurate diagnoses, resulting in better patient outcomes.
Agenda:
The main agenda of this webinar is to understand and explore the following:
• Policy implications related to AI and digital transformation in healthcare.
• Initiatives taken and regulations implemented to address cybersecurity concerns.
• Major factors driving and hindering the growth of AI in medical devices diagnosis.
• Major players in the ecosystem
PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUESIAEME Publication
Diabetes mellitus is a common disease caused by a set of metabolic ailments
where the sugar stages over drawn-out period is very high. It touches diverse organs
of the human body which therefore harm a huge number of the body's system, in
precise the blood strains and nerves. Early prediction in such disease can be exact
and save human life. To achieve the goal, this research work mainly discovers
numerous factors associated to this disease using machine learning techniques.
Machine learning methods provide effectual outcome to extract knowledge by building
predicting models from diagnostic medical datasets together from the diabetic
patients. Quarrying knowledge from such data can be valuable to predict diabetic
patients. In this research, six popular used machine learning techniques, namely
Random Forest (RF), Logistic Regression (LR), Naive Bayes (NB), C4.5 Decision
Tree (DT), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) are
compared in order to get outstanding machine learning techniques to forecast diabetic
mellitus. Our new outcome shows that Support Vector Machine (SVM) achieved
higher accuracy compared to other machine learning techniques.
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
Data mining techniques are used for a variety of applications. In healthcare industry, datamining plays an important
role in predicting diseases. For detecting a disease number of tests should be required from the patient. But using data
mining technique the number of tests can be reduced. This reduced test plays an important role in time and performance.
This report analyses data mining techniques which can be used for predicting different types of diseases. This report reviewed
the research papers which mainly concentrate on predicting various disease
Artificial intelligence enters the medical fieldRuchi Jain
In the medical and health field, artificial intelligence can help reduce the cost of ongoing health operations, and can have an impact on the quality of medical care for patients everywhere. By diagnosing diseases earlier, AI can also improve patient outcomes. No matter how you look at it, artificial intelligence has great potential in healthcare.
Internet of Things (IoT) and Artificial Intelligence (AI) role in Medical and...Hamidreza Bolhasani
Internet of Things (IoT) and Artificial Intelligence (AI) role in Medical and Healthcare Systems
+ History of IoT
+ Internet of Nano Things (IoNT)
+ IoT and IoNT for Medical and Healthcare Systems
+ IoT and Artificial Intelligence (AI)
+ IoT and AI for Health
+ Deep Learning Accelerator
This research paper introduces a novel application for predicting plant diseases in cotton and potato plants using Convolutional Neural Networks (CNNs).
Separate CNN models were trained on labeled datasets of cotton and potato leaves, each associated with their respective diseases. The primary goal is to employ a fusion of two standard CNN systems to detect various diseases in cotton and potato plants.
Given India's heavy reliance on agriculture, this innovation is crucial to address challenges faced by the sector, including technological limitations, limited access to credit and markets, and the impact of climate change.
Cotton and potatoes are significant crops; this research paper are susceptible to various diseases that can impede their growth and result in substantial yield losses.
The conventional disease detection methods involve manual inspection and disease prognosis, which are time consuming and less accurate. The research showcases the effectiveness of the automated plant disease detection system, with two best models achieving impressive accuracies of 97.10% and 96.94% for cotton and potato plants, respectively.
These results offer promising insights for potential applications in crop management, benefiting the agricultural sector and contributing to increased productivity and profitability.
following topics are discussed inside the PPT:
Introduction
Objective
Motivation
Literature Survey
Some Key Features of Disease
Plan of Action
Methodology Adopted
Data Collection
Steps to be Performed
Functional Architecture
Adaptive K-Means Clustering Algorithm for MR Breast Image Segmentation
3D Brain Tumor Segmentation Scheme using K-mean Clustering and Connected Component Labeling Algorithms
Volume Identification and Estimation of MRI Brain Tumor
MRI Breast cancer diagnosis hybrid approach using adaptive Ant-based segmentation and Multilayer Perceptron NN classifier
Data Science Deep Roots in Healthcare IndustryDinesh V
Data Science transforms the healthcare industry with impeccable solutions that can improve patient care through EHRs, medical imaging, drug discovery, predictive medicines and genetics and genomics.
Machine Learning for Disease PredictionMustafa Oğuz
A great application field of machine learning is predicting diseases. This presentation introduces what is preventable diseases and deaths. Then examines three diverse papers to explain what has been done in the field and how the technology works. Finishes with future possibilities and enablers of the disease prediction technology.
Webinar on AI in Medical Diagnosis with Emerging TechnologiesBIS Research Inc.
Artificial Intelligence (AI) is a rapidly growing field that has the potential to revolutionize the way medical diagnosis is performed. AI systems use advanced algorithms to analyze large amounts of data, such as medical images, patient histories, and lab results, to identify patterns and make predictions about patient outcomes.
In medical diagnosis, AI can be used in a variety of ways, including medical imaging, disease diagnosis, personalized treatment planning, and early disease detection. By leveraging the power of AI, healthcare providers can make faster and more accurate diagnoses, resulting in better patient outcomes.
Agenda:
The main agenda of this webinar is to understand and explore the following:
• Policy implications related to AI and digital transformation in healthcare.
• Initiatives taken and regulations implemented to address cybersecurity concerns.
• Major factors driving and hindering the growth of AI in medical devices diagnosis.
• Major players in the ecosystem
PREDICTION OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUESIAEME Publication
Diabetes mellitus is a common disease caused by a set of metabolic ailments
where the sugar stages over drawn-out period is very high. It touches diverse organs
of the human body which therefore harm a huge number of the body's system, in
precise the blood strains and nerves. Early prediction in such disease can be exact
and save human life. To achieve the goal, this research work mainly discovers
numerous factors associated to this disease using machine learning techniques.
Machine learning methods provide effectual outcome to extract knowledge by building
predicting models from diagnostic medical datasets together from the diabetic
patients. Quarrying knowledge from such data can be valuable to predict diabetic
patients. In this research, six popular used machine learning techniques, namely
Random Forest (RF), Logistic Regression (LR), Naive Bayes (NB), C4.5 Decision
Tree (DT), K-Nearest Neighbor (KNN), and Support Vector Machine (SVM) are
compared in order to get outstanding machine learning techniques to forecast diabetic
mellitus. Our new outcome shows that Support Vector Machine (SVM) achieved
higher accuracy compared to other machine learning techniques.
HEALTH PREDICTION ANALYSIS USING DATA MININGAshish Salve
Data mining techniques are used for a variety of applications. In healthcare industry, datamining plays an important
role in predicting diseases. For detecting a disease number of tests should be required from the patient. But using data
mining technique the number of tests can be reduced. This reduced test plays an important role in time and performance.
This report analyses data mining techniques which can be used for predicting different types of diseases. This report reviewed
the research papers which mainly concentrate on predicting various disease
Artificial intelligence enters the medical fieldRuchi Jain
In the medical and health field, artificial intelligence can help reduce the cost of ongoing health operations, and can have an impact on the quality of medical care for patients everywhere. By diagnosing diseases earlier, AI can also improve patient outcomes. No matter how you look at it, artificial intelligence has great potential in healthcare.
Internet of Things (IoT) and Artificial Intelligence (AI) role in Medical and...Hamidreza Bolhasani
Internet of Things (IoT) and Artificial Intelligence (AI) role in Medical and Healthcare Systems
+ History of IoT
+ Internet of Nano Things (IoNT)
+ IoT and IoNT for Medical and Healthcare Systems
+ IoT and Artificial Intelligence (AI)
+ IoT and AI for Health
+ Deep Learning Accelerator
This research paper introduces a novel application for predicting plant diseases in cotton and potato plants using Convolutional Neural Networks (CNNs).
Separate CNN models were trained on labeled datasets of cotton and potato leaves, each associated with their respective diseases. The primary goal is to employ a fusion of two standard CNN systems to detect various diseases in cotton and potato plants.
Given India's heavy reliance on agriculture, this innovation is crucial to address challenges faced by the sector, including technological limitations, limited access to credit and markets, and the impact of climate change.
Cotton and potatoes are significant crops; this research paper are susceptible to various diseases that can impede their growth and result in substantial yield losses.
The conventional disease detection methods involve manual inspection and disease prognosis, which are time consuming and less accurate. The research showcases the effectiveness of the automated plant disease detection system, with two best models achieving impressive accuracies of 97.10% and 96.94% for cotton and potato plants, respectively.
These results offer promising insights for potential applications in crop management, benefiting the agricultural sector and contributing to increased productivity and profitability.
following topics are discussed inside the PPT:
Introduction
Objective
Motivation
Literature Survey
Some Key Features of Disease
Plan of Action
Methodology Adopted
Data Collection
Steps to be Performed
Functional Architecture
Adaptive K-Means Clustering Algorithm for MR Breast Image Segmentation
3D Brain Tumor Segmentation Scheme using K-mean Clustering and Connected Component Labeling Algorithms
Volume Identification and Estimation of MRI Brain Tumor
MRI Breast cancer diagnosis hybrid approach using adaptive Ant-based segmentation and Multilayer Perceptron NN classifier
Data Science Deep Roots in Healthcare IndustryDinesh V
Data Science transforms the healthcare industry with impeccable solutions that can improve patient care through EHRs, medical imaging, drug discovery, predictive medicines and genetics and genomics.
Oscar Rodríguez-El impacto de las ciencias ómicas en la medicina, la nutrició...Fundación Ramón Areces
El 29 de marzo de 2016 celebramos un Simposio Internacional sobre el 'Impacto de las ciencias ómicas en la medicina, nutrición y biotecnología'. Organizado por la Fundación Ramón Areces en colaboración con la Real Academia Nacional de Medicina y BioEuroLatina, abordó cómo un mejor conocimiento del genoma humano está permitiendo notables avances hacia una medicina de precisión.
Detection of myocardial infarction on recent dataset using machine learningIJICTJOURNAL
In developing countries such as India, with a large aging population and limited access to medical facilities, remote and timely diagnosis of myocardial infarction (MI) has the potential to save the life of many. An electrocardiogram is the primary clinical tool utilized in the onset or detection of a previous MI incident. Artificial intelligence has made a great impact on every area of research as well as in medical diagnosis. In medical diagnosis, the hypothesis might be doctors' experience which would be used as input to predict a disease that saves the life of mankind. It is been observed that a properly cleaned and pruned dataset provides far better accuracy than an unclean one with missing values. Selection of suitable techniques for data cleaning alongside proper classification algorithms will cause the event of prediction systems that give enhanced accuracy. In this proposal detection of myocardial infarction using new parameters is proposed with increased accuracy and efficiency of the existing model. Additional parameters are used to predict MI with more accuracy. The proposed model is used to predict an early diagnosis of MI with the help of expertise experiences and data gathered from hospitals.
Genomics, Personalized Medicine and Electronic Medical RecordsLyle Berkowitz, MD
We are now unlocking the secrets of health at a molecular level – which includes not only why some people get diseases, but also how to prevent or cure them. However, as Osler points out, knowing this information is only valuable in the context of making it available for the right patient at the right time.
This presentation provides a basic introduction to genomic or personalized medicine, and discusses how this information can and should be integrated into our electronic medical record systems.
These slides were originally presented at the HIMSS Annual Conference in February of 2007.
Therapeutic management of diseases based on fuzzy logic system- hypertriglyce...TELKOMNIKA JOURNAL
The support systems for assisting clinical decision highly improve the quality and efficiency of the therapeutic and diagnostic treatment in medicine. The proper implementation of such systems can emulate the reasoning of health care professionals in such a way that suggest reasonable decisions on patient treatment. The fuzzy logic system can be considered as one of the efficient techniques for converting a complex decision tree that usually facing the physician into artificial intelligent procedure embedded in a computer program. So many properties in fuzzy logic system that can facilitate the process of medical diagnosis and therapeutic management. In this paper, a system for therapeutic management of hypertriglyceridemia was efficiently realized using a fuzzy logic technique. The obtained results had shown that the proposed fuzzy logic contributes a reliable managing procedure for assisting the physicians and pharmacist in treating the hypertriglyceridemia. Many different hypertriglyceridemia treatment cases showed a perfect matching decision between the standard guidelines and that given by the proposed system.
K-Nearest Neighbours based diagnosis of hyperglycemiaijtsrd
AI or artificial intelligence is the simulation of human intelligence processes by machines, especially computer systems. These processes include learning (the acquisition of information and rules for using the information), reasoning (using the rules to reach approximate or definite conclusions), and self-correction. As a result, Artificial Intelligence is gaining Importance in science and engineering fields. The use of Artificial Intelligence in medical diagnosis too is becoming increasingly common and has been used widely in the diagnosis of cancers, tumors, hepatitis, lung diseases, etc... The main aim of this paper is to build an Artificial Intelligent System that after analysis of certain parameters can predict that whether a person is diabetic or not. Diabetes is the name used to describe a metabolic condition of having higher than normal blood sugar levels. Diabetes is becoming increasingly more common throughout the world, due to increased obesity - which can lead to metabolic syndrome or pre-diabetes leading to higher incidences of type 2 diabetes. Authors have identified 10 parameters that play an important role in diabetes and prepared a rich database of training data which served as the backbone of the prediction algorithm. Keeping in view this training data authors developed a system that uses the artificial neural networks algorithm to serve the purpose. These are capable of predicting new observations (on specific variables) from previous observations (on the same or other variables) after executing a process of so-called learning from existing training data (Haykin 1998).The results indicate that the performance of KNN method when compared with the medical diagnosis system was found to be 91%. This system can be used to assist medical programs especially in geographically remote areas where expert human diagnosis not possible with an advantage of minimal expenses and faster results. Abid Sarwar"K-Nearest Neighbours based diagnosis of hyperglycemia" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-1 , December 2017, URL: http://www.ijtsrd.com/papers/ijtsrd7046.pdf http://www.ijtsrd.com/computer-science/artificial-intelligence/7046/k-nearest-neighbours-based-diagnosis-of-hyperglycemia/abid-sarwar
An AI-based Decision Platform built using unified data model, incorporating systems biology topics for unit analysis using semi-supervised learning models
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
2. DATA ANALYTICS IN HEALTHCARE & LIFE SCIENCES
1. VITAL BUSINESS PROBLEMS:
So many different problems exist and they are of varying degree of complexity:
- What impacts favorable clinical outcomes
- Drivers of adverse events
- Factors impacting cost of care
- Earlier diagnosis of cancers and chronic diseases
Understanding these different business problems is critical for generating
possible solutions
2. POTENTIAL DATA SOURCES:
Huge amounts of data is getting generated nowadays from different sources that
are capable of capturing information :
- Electronic Health Records
- Healthcare claims from Insurance companies
- Pharmacies – claims and medication reviews
- Lab tests and Imaging results
- Population health data – Social Determinants of Health
- Genomics (and later Proteomics and Metabolomics)
- Wearable and other devices
- Other sources (Surveys, Patient Reported Outcomes)
The volume, velocity, variety, and veracity that is getting generated is staggering
– typical Big Data problem.
3. DATA PROCESSING, MANAGEMENT AND ANALYSIS:
Making sense of these varied sources of data and processing them so that they are useful for analysis is a data engineering challenge.
Structured data needs to be cleaned and curated; data from different sources need to be matched to get a complete 360 degree view of the customer.
Semi-structured and unstructured data sources (Physician notes, imaging data) pose challenges to curate and store the information so that it can be retrieved and
analyzed at scale and speed.
Various Big Data technologies have been developed to tackle this problem of storing(HADOOP ecosystem, SPARK) and analyzing semi-structured and unstructured data
(Text mining, NLP, Deep Learning for Image and Video Analytics).
4. SOLUTIONS TO THE PROBLEMS:
At the end of the day, all the analysis should be able to generate actionable insights. Interpretation of the results and their implementation to solve the problem are key.
3. HOW ML/DL CAN AUGMENT THE DECISION MAKING
PROCESS FOR CLINICIANS
PROGNOSIS
•A machine-learning
model can learn the
patterns of health
trajectories of vast
numbers of patients.
This facility can help
physicians to
anticipate future
events at an expert
level, drawing from
information well
beyond the
individual physician’s
practice experience.
For example, how
likely is it that a
patient will be able
to return to work, or
how quickly will the
disease progress?
DIAGNOSIS
•A diagnostic error
will occur in the
care of nearly every
patient in his or her
lifetime, and
receiving the right
diagnosis is critical
to receiving
appropriate care.
This problem is not
limited to rare
conditions. Cardiac
chest pain, TB,
dysentery, and
complications of
childbirth are
commonly not
detected even in
developing
countries
TREATMENT
•In a large health
care system with
tens of thousands of
physicians treating
tens of millions of
patients, there is
variation in when
and why patients
present for care and
how patients with
similar conditions
are treated. Can a
model sort through
these natural
variations to help
physicians identify
when the collective
experience points to
a preferred
treatment pathway?
CLINICALWORKFLOW
•The same machine-
learning techniques
that are used in
many consumer
products can be
used to make
clinicians more
efficient. Machine
learning that drives
search engines can
help expose reqd.
.information in a
patient’s chart for a
clinician without
multiple clicks.
Data entry of forms
and text fields can
be improved with
the use of machine-
learning
techniques.
REMOTEAREAS
•There is no way for
physicians to
individually interact
with all the patients
who may need care.
Can machine learning
extend the reach of
clinicians to provide
expert-level medical
assessment without
involvement? For
example, patients
with new rashes may
be able to obtain a
diagnosis by sending
a picture that they
take on their
smartphones,
thereby averting
unnecessary urgent-
care visits.
REFERENCE: https://www.nejm.org/doi/full/10.1056/NEJMra1814259
4. COMPONENTS OF ELECTRONIC HEALTH RECORDS
EMR
DEMOG &
HISTORY
DRUGS
ALLERGIES
VISITS
ADMISSIONS
DIAGNOSES
LAB
RESULTS
PROCEDURE
ADDITIONAL DATA FACTORS (normally not present)
GENOMICS
SOCIAL DETERMINANTS OF HEALTH
IMAGING DATA – X-RAY/USG/CT/MRI
PATIENT REPORTED OUTCOMES - PRO
STANDARD EMR/EHR DATA COMPONENTS
DEMOGRAHICS – Age, Gender, Race, Language, Religion, Insurance, Location
CLINICAL HISTORY – Habits, Past Dx and Observations
MEDICATIONS – Drug NDC, Quantity, Refills, Route, Rx dates
FOOD AND DRUG ALLERGIES – Allergen, Reaction Desc., Severity, Dates
VISITS TO ER AND OPD – Date/Time, Encounter Type, Provider Info
INPATIENT ADMISSIONS – Date/Time, Source, Discharge Code
PRIMARY DIAGNOSES AND COMORBIDITIES – ICD9/10, SNOMED
PROCEDURES AND SURGERIES – Procedure codes and ICD codes
LABORATORY RESULTS – LOINC, Date/Time, Reference Range, Value, UOM
Standard dictionaries: ICD9/10, SNOMED-CT, NDC, LOINC, NPI
GENOMICS IMAGING SDoH OUTCOMES
5. DIABETES – THE MAGNITUDE OF THE PROBLEM
Diabetes is the world's
eighth biggest killer,
accounting for some 1.5
million deaths each year. A
major new World Health
Organization report has
now revealed that the
number of cases around the
world has nearly
quadrupled to 422 million
in 2014 from 108 million in
1980. The Eastern-
Mediterranean region had
the biggest increase in cases
during that time frame.
Diabetes now affects one in
11 adults with high blood
sugar levels linked to 3.8
million deaths every year.
REFERENCE:
https://www.statista.com/chart/4617/the-
unrelenting-global-march-of-diabetes/
6. WHAT HAPPENS IN DIABETES MELLITUS
• https://youtu.be/qn2dhw0NJxo
Type 1 diabetes (T2DM)
In people with type 1 diabetes, the
body does not make insulin. The
immune system attacks and destroys
the cells in the pancreas that make
insulin. Type 1 diabetes is usually
diagnosed in children and young
adults, although it can appear at any
age. People with type 1 diabetes need
to take insulin every day to stay alive.
Type 2 diabetes (T1DM)
In people having type 2 diabetes, the
body does not make or use insulin
well. It can develop diabetes at any
age, even during childhood. However,
this type of diabetes occurs most often
in middle-aged and older people. Type
2 is the most common type of
diabetes.
COURTESY: NIDDK
https://www.niddk.nih.gov/health-
information/diabetes/overview/what-is-diabetes
IMAGE COURTESY: KHAN ACADEMY
7. HOW MACHINE LEARNING CAN HELP IN DIABETES
Predicting risk of heart failure for
diabetes patients with help from
machine learning
Identification of Type 2 Diabetes
Risk Factors Using Phenotypes
Consisting of Anthropometry and
Triglycerides based on Machine
Learning
Use of a Machine Learning
Algorithm Improves Prediction of
Progression to Diabetes
Predicting Future Glucose
Fluctuations Using Machine
Learning and Wearable Sensor Data
Predicting Diabetes Mellitus With
Machine Learning Techniques
Machine-learning to stratify
diabetic patients using novel
cardiac biomarkers and integrative
genomics
Predicting diabetic retinopathy and
identifying interpretable biomedical
features using machine learning
algorithms
Impact of HbA1c Measurement on
Hospital Readmission Rates:
Analysis of 70,000 Clinical Database
Patient Records
Data-Driven Blood Glucose Pattern
Classification and Anomalies
Detection: Machine-Learning
Applications in Type 1 Diabetes
8. APPROACH FOR DM READMISSION PREDICTIVE MODEL
• DMT2 risk prediction using clinical data and statistical and machine learning
algorithms/models
8
Predictor Variables (total 44 variables)
Demographic
Age
Gender
Ethnicity
Diagnosis
Type of Condition(DM T1/T2) diagnosis
# of comorbidities
Position (primary, secondary, etc.) of
diagnosis
Encounter
IP, OP, AE visits
Medications
Dosage, frequency, route
Lab results
Test names, dates, UOM, value
Normal/abnormal result
Admission
Length of stay
Admission method (elective, non-
elective)
Discharge destination
Procedure
Count of procedures
Cost of procedures
Response Variable
Readmission within 30 days
INPUT MODEL OUTPUT
4 years 1 year
Observation
window
Performance
window
Validation
window
Data split into time windows1
2 Models built using following algorithms (data from
observation and performance windows)
Logistic regression model (LOG)
Decision tree model (DT)
Random forest model (RF)
Model Ensembles
3 In-time validation (within performance window)
48.6%
74.3%
34.9%
29.4%
37.3%
68.7%
38.5%
28.2%
53.5%
76.7%
39.8%
33.7%
GINI AUC KS WORST
DECILE
CAPTURELOG DT RF
4 Out-of-time validation (in validation window)
All three models provided accuracy of
~80% in out-of-time validation scenario
RF model with ~76% AUC indicates reasonably good fit
Significant variables (major
drivers of readmission)
SEVERITY OF DM
# of DM spells in past 1 year
ED LOS in past 1 year
# of procedures undergone
# of OPD visits in past 1 year
# of ED visits in past 1 year
# of IP visits in past 1 year
# of comorbidities
Distance from hospital
DM LOS in past 1 year
Time since last ED visit
Total ED cost in past 1 year
Age of patient
Patient category based on
risk score
HighLow
5
6
9. 9
RISK PREDICTION MODEL: DESIGN, EVALUATION
• Mean/Median
• Regression
• KNN
Missing
imputation
• Feature Imp
• RFE
• WoE and IV
Feature
Selection
• Tree based
(DT, RF, GBT)
• Others (SVM,
NN, NB)
Model
Build
• K-fold cross
validation
• ROC curve
Model
Evaluation
Patient cohorts are created based on ICD 9/10 codes for defined chronic disease (e.g. DMT2) and also on the time of
diagnosis to separate already diagnosed patients from those who will potentially develop the disease.
Prospective
Cohort -
Scoring
Dataset
Feature selection
mechanisms help to
focus on the most
important variables
which the outcome
variable – methods
mentioned above
have been used.
EMR data has many
dimensions and this
also means lot of
values are missing –
imputation methods
help keep most of
the features usable.
The basic task is
classification which
is done by
computing the
probability of
outcome at each
patient level and
then applying
thresholds.
Multiple models
were created and
then validated for
accuracy metrics to
select the best
model. Cross
validation and area
under ROC curve
utilized.
Scoring was done
on the prospective
cohort to group
patients into high
risk, medium risk
and low risk. High
risk group was to be
targeted for
interventions.
10. PRACTICAL USE CASE AND CODE DEMO
USE CASE
DATASET
• Risk Prediction for Diabetes
• Impact of HbA1c Measurement on Hospital Readmission Rates:
Analysis of Clinical Database Patient Records
UCI MACHINE LEARNING REPOSITORY - Description
100000 T2DM patients from 30 hospitals; CERNER HEALTH FACTS
OUTCOME
• How likely is a patient to be diagnosed with DM in near future?
• How likely is a T2DM patient to come back to the hospital, before
30 days post discharge and after 30 days discharge?
METHODS
Multiple ML models generated and compared
Individual Classifiers: DT, LOGREG, SVC
Ensemble Classifiers: RF, GBC
GitHub Link