SlideShare a Scribd company logo
Deep learning in healthcare
Opportunities and challenges
with Electronic Medical Records (EMR) data
Trần Quang Thiện
University of Tsukuba
(Master student)
1
● Tran Quang Thien (1993)
● Has been in Japan since 2012
● Currently a master student at Tsukuba University
● Researchs
○ Feature selection in prediction of infectious disease using large-scale
search log data (Yahoo Japan search log).
○ Applied research with Long-term care insurance data and Electronic
Medical Record (EMR) data.
● Recent interests
○ Bayesian modeling
○ Selective inference
Introduction
Who am I
2
1. Introduction to Electronic Medical Records (EMR) data
2. Some common tasks and example solutions
3. Opportunities and challenges
Introduction
Agenda
3
Introduction
What is EMR data
http://api.sunlab.org/static/media/uxJ/UL5/5b6aef5b241ba60001bec1c1.pdf
Age, sex, race,
incomes ….
- Medicine codes
- Procedure codes
- Vital signs and labs
- Pulse Ox, heart rate,
Temperature, Co2...
- Diagnosis codes
- ICD9, ICD10 ...
- Additional comments
Nowadays, many types of medical data is collecting across multiples institutes
Electronic
Medical
Record
(EMR data)
4
CODE CODE CODE
CODE CODE CODE
CODE CODE CODE
CODE CODE CODE
CODE CODE CODE
5
● From wikipedia:
“Diagnostic coding is the translation of written descriptions of
diseases, illnesses and injuries into codes from a particular
classification”
● ICD-9, ICD-10
○ The most popular and currently the international standard
○ Contains diseases, signs, symptoms, external cause, …
○ 14,000 different codes
○ Has a hierarchical structure
Introduction
OK, so what is diagnosis code
6
Introduction
Structure of ICD-10 code
https://doctors.practo.com/icd-10-codes-important-doctors
/
Gan hóa sợi và xơ gan có mã bệnh là K74
- Gan hóa sợi K74.0
- Gan hóa xơ K74.1
- Gan hóa sợi với gan hóa xơ K74.2
- Xơ gan mật nguyên phát K74.3
- Xơ gan mật thứ phát K74.4
- Xơ gan mật không xác định K74.5
- Xơ gan khác và không xác định K74.6
7
Introduction
A sample of diagnosis code
Risk Prediction on Electronic Health Records with Prior Medical Knowledge, Fenglong Ma et. al
ICD
codes
8
When
EMR data
meets
deep learning
9
Commons tasks
The (typical) overall framework
http://api.sunlab.org/static/media/uxJ/UL5/5b6aef5b241ba60001bec1c1.pdf 10
Concept embedding
Learning a mapping from raw EMR data to useful
representation or medical concept.
● Also called as electronic phenotyping task.
● The result embedding can be used in any other tasks
● Many studies focus on diagnosis/medicine/procedure codes
● Some take inspiration from NLP domain (GloVe, skip-gram v..v)
since these codes can be treated as a bag of codes
● Almost trained in unsupervised setting (no ground-truth label)
Commons tasks
11
GRAM: Graph-based Attention Model for Healthcare Representation Learning, Edward Choi et. al (KDD 2017)
Concept embedding
Commons tasks
12
GRAM: Graph-based Attention Model for Healthcare Representation Learning, Edward Choi et. al (KDD 2017)
Concept embedding
Commons tasks
● An embedding vector of a code is the mixture of the (base) embedding vectors of
○ The current leaf code
○ It’s ancestors
● Advantages:
○ Usable information from high-level nodes, especially useful for rare codes
○ Something like hierarchical bayes model
13
GRAM: Graph-based Attention Model for Healthcare Representation Learning, Edward Choi et. al (KDD 2017)
GRAM GloVe skip-gram
● The embedding result matches with medical ontology
● Similar to the shrinkage of parameters in hierarchical models
Concept embedding
Commons tasks
14
GRAM: Graph-based Attention Model for Healthcare Representation Learning, Edward Choi et. al (KDD 2017)
● The embedding result matches with medical ontology
● And also help increasing prediction performance for post-embedding
tasks
Concept embedding
Commons tasks
15
Detecting whether specific diseases/stage of disease can
be confirmed in the EMR data
● Predicting a target disease using EMR data is not easy
○ Assigning diagnosis is a noisy task due to
different practices of doctors or hospitals
○ Predict diagnosis codes from other data
● Examples:
○ Predicting diagnose code from clinical notes
○ Classification stage of Parkinson’s disease
○ Predict heart failure risk
○ Predict medicine codes ←AI doctor...
Disease/drugs classification
Commons tasks
16
Disease/drugs classification
Commons tasks
1. RNN
2. Reinforcement finetune
3. Beam search
Main techniques
LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity,
Yutao Zhang et. al (KDD 2017) 17
Disease/drugs classification
Commons tasks
Scoring criterion
[2] Complete without unfavorable drug-drug interaction
[1] Partially complete without unfavorable interaction
[0] Address less than 50% diagnoses or including
negative interaction
Evaluation criterion: Jaccard coefficient
● The proposed method works better than rule-based or other competitors
● However, the quality seems not enough to be ready in real application yet
18
LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity,
Yutao Zhang et. al (KDD 2017)
Predicting future clinical events based on past
longitudinal event sequences
● For examples:
○ Prediction of 30-day hospital readmission
○ Predict of heart failure risk
○ Predict of mortality in Intensive Care Unit (ICU)
● Useful in grouping/detection patients with high risk
Sequential prediction of clinical events
Commons tasks
19
Sequential prediction of clinical events
Commons tasks
RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism,
Edward Choi et al. NIPS 2016
- Interpretable model for sequential prediction
- Giving explanation for prediction using attention mechanism
- 2 level of attention
- Importance of each visit
- Importance of each code within a visit
20
Sequential prediction of clinical events
Commons tasks
- The attention is calculated using time-reversed RNN model
- Mimic the clinician that look at the ERM data backward
- Recent visit contributes more
21RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism,
Edward Choi et al. NIPS 2016
Sequential prediction of clinical events
Commons tasks
MLP Logistic regression with a hidden layer
RNN Two layer of RNN, no attention
RNN+ _M RNN with visit-level attention
RNN+ _R Reversed-RNN with visit-level attention
RETAIN Reversed-RNN with 2 levels attentions
● Task: predict whether an heart failure (HF) event occurs
Input: EMR data until the date HF event happens
● The AUC is around 0.87 (High?)
● However, normal RNN is also high ….
● NOTE: y-axis limit starts from 0.76
22RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism,
Edward Choi et al. NIPS 2016
Sequential prediction of clinical events
Commons tasks
● Since the different in performance is not significance
● RETAIN give us the answer for the question WHY
23RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism,
Edward Choi et al. NIPS 2016
Data augmentation:
Creating synthesis data elements or patient records
based on real EMR data
● Saving labeling cost
● Generated synthesis data can be useful for privacy issue
● Most researches based on GANs techniques
● medGAN paper reported that generated data is
indistinguishable to a human doctor expert.
Data augmentation
Commons tasks
24
Data augmentation
Commons tasks
Generating Multi-label Discrete Patient Records using Generative Adversarial Networks
Edward Choi et al. (JMLR 2017)
● Task
Generating synthetic patient records from real data
○ Require 1: (statistically) similar to the real records
○ Require 2: Individual patient information can not be exploited
● Result
Indistinguishable to a human doctor, except a few outliers
25
Opportunities
Challenges
is out there
26
Opportunities
Growing data availability
Adoption of EMR system in Japan
医療実施調査(厚生労働省)
● The data is quite old….
● In 2017, only 41.6% of hospital, clinical in Japan adopted EMR system
● However, the number is about 85% for big hospitals ( >400 beds)
27
Opportunities
Growing data availability
Adoption of EMR system in US
● The data is also old ….
● 87% physicians interviewed uses EMR system
● However, the coverage on hospitals/clinicals is unclear 28
Opportunities
Growing data availability
Thực hiện Nghị quyết 20, đến nay, Bộ Y tế đã xây dựng xong phần mềm hồ sơ
sức khỏe điện tử (EHR) sử dụng nguồn dữ liệu hộ gia đình tham gia BHYT của
Bảo hiểm xã hội Việt Nam để tạo lập mã số định danh (ID). Theo kế hoạch, từ
tháng 1-2019 đến tháng 6-2019, triển khai và hoàn thiện phần mềm hồ sơ sức
khỏe điện tử cho tám tỉnh, thành phố trong mô hình điểm. Từ tháng 7-2019 tổ
chức triển khai nhân rộng trên toàn quốc. Ðến cuối năm 2019 sẽ hình thành
hệ thống hồ sơ sức khỏe điện tử cho mỗi người dân. Khi người dân đến cơ
sở y tế, người thầy thuốc ở bất kỳ đâu trên lãnh thổ Việt Nam, chỉ cần một động
tác nhấn chuột, máy tính sẽ hiện ra đầy đủ thông tin về hiện trạng sức khỏe của
người đó, giúp ích rất nhiều cho chẩn đoán và điều trị.
(Báo nhân dân điện tử, 31/01/2019)
Adoption of EMR system in Vietnam
● NEW newspaper !
● Nationwide EMR system will be completed in 2019 !
● A better, digitized healthcare system is coming !
29
Opportunities
Growing data availability
Adoption of EMR system in Vietnam
Image from a friend doctor in Vietnam…
Medical research scene, Cho Ray hospital, 11/4/2019
30
Opportunities
Promising achieves of deep learning
● Have shown outstanding performance in various tasks
○ Computer vision
○ Speech recognition
○ Natural language processing
○ Reinforcement learning
○ …….
● Can bypass the feature engineering process
○ E.g. ICD-10 code contains more than 70,000 types of codes
How can we express these code in our model ?
● Deep learning structures are well suited with various healthcare tasks
○ CNN
○ RNN, LSTM
○ GANs
○ Attention mechanism
○ …...
31
However,
developing deep
learning models
using EMR data
is a challenging
task.
32
Challenges
● Lack of data
○ The EMR system is not ready for some countries, regions
○ Complicated process to get access to healthcare data
due to the significant sensitivity of the data.
○ Combining EMR data across institutions can be complicated
because of difference in EMR versions between institutions.
● Lack of labels (both in quality and quantity)
○ Annotation requires domain knowledge of trained experts
○ Rare diseases implies less data
○ Even noisy between clinicians or institutions
■ E.g. ICD-10 contains many similar codes
■ Different practices of doctors
Challenges with EMR data
33
Challenges
Challenges with EMR data
● Heterogeneous types of data
○ Numeric values (lab tests)
○ Discrete codes (diagnosis, medicine, procedures)
○ Continuous monitoring data
○ Free-text clinical notes
○ Medical images
How to effectively use these data altogether ?
● Temporality and irregularity
○ Events are irregularly sampled (and also biased)
○ Long-term and short-term dependencies
34
Challenges
Challenges with healthcare domain
● Decision in healthcare is sensitive
○ Require immediate information and decision
○ Sometimes have life or death consequences
○ High standard required
How much of accuracy will be considered enough?
● Model needs interpretability
○ An accurate black-box might not be enough
○ Clinicians often require reasoning behind prediction
● Toward more complex output
○ Question and answer
35
Thank
you!
Contact:
Trần Quang Thiện
thientquang@gmail.com
36
Some public datasets
● MIMIC-III, a freely accessible critical care database.
Johnson AEW, Pollard TJ, Shen L, Lehman L, Feng M, Ghassemi M, Moody B, Szolovits P,
Celi LA, and Mark RG. Scientific Data (2016)
https://mimic.physionet.org/
● EMRBOTS, experiment with artificial large medical datasets without
worrying about privacy.
http://www.emrbots.org/
37
Other inferences
● Deep learning for healthcare: review, opportunities and challenges
Riccardo Miotto, Fei Wang, Shuang Wang, Xiaoqian Jiang and Joel T. Dudley
● Opportunities and challenges in developing deep learning models
using electronic health records data: a systematic review
Cao Xiao, Edward Choi and Jimeng Sun
● Harnessing the Power of Data in Health
Stanford Medicine 2017 Health Trends Report
● The opportunities and challenges of data analytics in health care
Paul B. Ginsburg, Andrés de Loera-Brust, Caitlin Brandt, and Abigail Durak
● AI And Healthcare: A Giant Opportunity
Insights Team, Insights Contributor FORBES INSIGHTS With Intel AI
38

More Related Content

What's hot

Disease Prediction by Machine Learning Over Big Data From Healthcare Communities
Disease Prediction by Machine Learning Over Big Data From Healthcare CommunitiesDisease Prediction by Machine Learning Over Big Data From Healthcare Communities
Disease Prediction by Machine Learning Over Big Data From Healthcare Communities
Khulna University of Engineering & Tecnology
 
ML DL AI DS BD - An Introduction
ML DL AI DS BD - An IntroductionML DL AI DS BD - An Introduction
ML DL AI DS BD - An Introduction
Dony Riyanto
 
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AIIntro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Seth Grimes
 
Explainable AI (XAI) - A Perspective
Explainable AI (XAI) - A Perspective Explainable AI (XAI) - A Perspective
Explainable AI (XAI) - A Perspective
Saurabh Kaushik
 
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
SlideTeam
 
Deep Learning in Bio-Medical Imaging
Deep Learning in Bio-Medical ImagingDeep Learning in Bio-Medical Imaging
Deep Learning in Bio-Medical Imaging
Joonhyung Lee
 
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
SUJIT SHIBAPRASAD MAITY
 
Feature selection
Feature selectionFeature selection
Feature selection
Dong Guo
 
Explainable AI in Industry (WWW 2020 Tutorial)
Explainable AI in Industry (WWW 2020 Tutorial)Explainable AI in Industry (WWW 2020 Tutorial)
Explainable AI in Industry (WWW 2020 Tutorial)
Krishnaram Kenthapadi
 
Machine Learning in Healthcare Diagnostics
Machine Learning in Healthcare DiagnosticsMachine Learning in Healthcare Diagnostics
Machine Learning in Healthcare Diagnostics
Larry Smarr
 
Machine learning
Machine learningMachine learning
Machine learning
Dr Geetha Mohan
 
brain tumor ppt.pptx
brain tumor ppt.pptxbrain tumor ppt.pptx
brain tumor ppt.pptx
AdityaSingh728086
 
Brain Tumor Detection using CNN
Brain Tumor Detection using CNNBrain Tumor Detection using CNN
Brain Tumor Detection using CNN
MohammadRakib8
 
Heart Disease Prediction using Machine Learning Algorithm
Heart Disease Prediction using Machine Learning AlgorithmHeart Disease Prediction using Machine Learning Algorithm
Heart Disease Prediction using Machine Learning Algorithm
ijtsrd
 
Machine Learning vs. Deep Learning
Machine Learning vs. Deep LearningMachine Learning vs. Deep Learning
Machine Learning vs. Deep Learning
Belatrix Software
 
Project on disease prediction
Project on disease predictionProject on disease prediction
Project on disease prediction
KOYELMAJUMDAR1
 
Machine learning ppt.
Machine learning ppt.Machine learning ppt.
Machine learning ppt.
ASHOK KUMAR
 
Deep learning ppt
Deep learning pptDeep learning ppt
Deep learning ppt
BalneSridevi
 
deep learning applications in medical image analysis brain tumor
deep learning applications in medical image analysis brain tumordeep learning applications in medical image analysis brain tumor
deep learning applications in medical image analysis brain tumor
Venkat Projects
 
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptx
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptxCardiovascular Disease Prediction Using Machine Learning Approaches.pptx
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptx
Taminul Islam
 

What's hot (20)

Disease Prediction by Machine Learning Over Big Data From Healthcare Communities
Disease Prediction by Machine Learning Over Big Data From Healthcare CommunitiesDisease Prediction by Machine Learning Over Big Data From Healthcare Communities
Disease Prediction by Machine Learning Over Big Data From Healthcare Communities
 
ML DL AI DS BD - An Introduction
ML DL AI DS BD - An IntroductionML DL AI DS BD - An Introduction
ML DL AI DS BD - An Introduction
 
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AIIntro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
 
Explainable AI (XAI) - A Perspective
Explainable AI (XAI) - A Perspective Explainable AI (XAI) - A Perspective
Explainable AI (XAI) - A Perspective
 
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
 
Deep Learning in Bio-Medical Imaging
Deep Learning in Bio-Medical ImagingDeep Learning in Bio-Medical Imaging
Deep Learning in Bio-Medical Imaging
 
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.Heart Disease Identification Method Using Machine Learnin in E-healthcare.
Heart Disease Identification Method Using Machine Learnin in E-healthcare.
 
Feature selection
Feature selectionFeature selection
Feature selection
 
Explainable AI in Industry (WWW 2020 Tutorial)
Explainable AI in Industry (WWW 2020 Tutorial)Explainable AI in Industry (WWW 2020 Tutorial)
Explainable AI in Industry (WWW 2020 Tutorial)
 
Machine Learning in Healthcare Diagnostics
Machine Learning in Healthcare DiagnosticsMachine Learning in Healthcare Diagnostics
Machine Learning in Healthcare Diagnostics
 
Machine learning
Machine learningMachine learning
Machine learning
 
brain tumor ppt.pptx
brain tumor ppt.pptxbrain tumor ppt.pptx
brain tumor ppt.pptx
 
Brain Tumor Detection using CNN
Brain Tumor Detection using CNNBrain Tumor Detection using CNN
Brain Tumor Detection using CNN
 
Heart Disease Prediction using Machine Learning Algorithm
Heart Disease Prediction using Machine Learning AlgorithmHeart Disease Prediction using Machine Learning Algorithm
Heart Disease Prediction using Machine Learning Algorithm
 
Machine Learning vs. Deep Learning
Machine Learning vs. Deep LearningMachine Learning vs. Deep Learning
Machine Learning vs. Deep Learning
 
Project on disease prediction
Project on disease predictionProject on disease prediction
Project on disease prediction
 
Machine learning ppt.
Machine learning ppt.Machine learning ppt.
Machine learning ppt.
 
Deep learning ppt
Deep learning pptDeep learning ppt
Deep learning ppt
 
deep learning applications in medical image analysis brain tumor
deep learning applications in medical image analysis brain tumordeep learning applications in medical image analysis brain tumor
deep learning applications in medical image analysis brain tumor
 
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptx
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptxCardiovascular Disease Prediction Using Machine Learning Approaches.pptx
Cardiovascular Disease Prediction Using Machine Learning Approaches.pptx
 

Similar to Deep learning in healthcare: Oppotunities and challenges with Electronic Medical Records (EMR) data

Medical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareMedical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in Healthcare
NUS-ISS
 
IRJET - Prediction and Analysis of Multiple Diseases using Machine Learni...
IRJET -  	  Prediction and Analysis of Multiple Diseases using Machine Learni...IRJET -  	  Prediction and Analysis of Multiple Diseases using Machine Learni...
IRJET - Prediction and Analysis of Multiple Diseases using Machine Learni...
IRJET Journal
 
IRJET - E-Health Chain and Anticipation of Future Disease
IRJET - E-Health Chain and Anticipation of Future DiseaseIRJET - E-Health Chain and Anticipation of Future Disease
IRJET - E-Health Chain and Anticipation of Future Disease
IRJET Journal
 
Proposed Model for Chest Disease Prediction using Data Analytics
Proposed Model for Chest Disease Prediction using Data AnalyticsProposed Model for Chest Disease Prediction using Data Analytics
Proposed Model for Chest Disease Prediction using Data Analytics
vivatechijri
 
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
MIS Quarterly
 
Towards online universal quality healthcare through AI
Towards online universal quality healthcare through AITowards online universal quality healthcare through AI
Towards online universal quality healthcare through AI
Xavier Amatriain
 
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUESPREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
cscpconf
 
PERFORMANCE OF DATA MINING TECHNIQUES TO PREDICT IN HEALTHCARE CASE STUDY: CH...
PERFORMANCE OF DATA MINING TECHNIQUES TO PREDICT IN HEALTHCARE CASE STUDY: CH...PERFORMANCE OF DATA MINING TECHNIQUES TO PREDICT IN HEALTHCARE CASE STUDY: CH...
PERFORMANCE OF DATA MINING TECHNIQUES TO PREDICT IN HEALTHCARE CASE STUDY: CH...
ijdms
 
A comprehensive study on disease risk predictions in machine learning
A comprehensive study on disease risk predictions  in machine learning A comprehensive study on disease risk predictions  in machine learning
A comprehensive study on disease risk predictions in machine learning
IJECEIAES
 
cognitive computing for electronic medical record
cognitive computing for electronic medical record cognitive computing for electronic medical record
cognitive computing for electronic medical record
selamu shirtawi
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
Ashish Salve
 
Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017
Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017
Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017
MLconf
 
ML to cure the world
ML to cure the worldML to cure the world
ML to cure the world
Xavier Amatriain
 
Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...
Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...
Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...
IRJET Journal
 
Multi Disease Detection using Deep Learning
Multi Disease Detection using Deep LearningMulti Disease Detection using Deep Learning
Multi Disease Detection using Deep Learning
IRJET Journal
 
Diagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approachDiagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approach
csandit
 
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
cscpconf
 
The Many Lives of Data
The Many Lives of DataThe Many Lives of Data
The Many Lives of Data
ljmcneill33
 
Electronic health records and machine learning
Electronic health records and machine learningElectronic health records and machine learning
Electronic health records and machine learning
Eman Abdelrazik
 

Similar to Deep learning in healthcare: Oppotunities and challenges with Electronic Medical Records (EMR) data (20)

Medical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareMedical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in Healthcare
 
IRJET - Prediction and Analysis of Multiple Diseases using Machine Learni...
IRJET -  	  Prediction and Analysis of Multiple Diseases using Machine Learni...IRJET -  	  Prediction and Analysis of Multiple Diseases using Machine Learni...
IRJET - Prediction and Analysis of Multiple Diseases using Machine Learni...
 
IRJET - E-Health Chain and Anticipation of Future Disease
IRJET - E-Health Chain and Anticipation of Future DiseaseIRJET - E-Health Chain and Anticipation of Future Disease
IRJET - E-Health Chain and Anticipation of Future Disease
 
Proposed Model for Chest Disease Prediction using Data Analytics
Proposed Model for Chest Disease Prediction using Data AnalyticsProposed Model for Chest Disease Prediction using Data Analytics
Proposed Model for Chest Disease Prediction using Data Analytics
 
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
Healthcare Predicitive Analytics for Risk Profiling in Chronic Care: A Bayesi...
 
Towards online universal quality healthcare through AI
Towards online universal quality healthcare through AITowards online universal quality healthcare through AI
Towards online universal quality healthcare through AI
 
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUESPREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
PREDICTIVE ANALYTICS IN HEALTHCARE SYSTEM USING DATA MINING TECHNIQUES
 
PERFORMANCE OF DATA MINING TECHNIQUES TO PREDICT IN HEALTHCARE CASE STUDY: CH...
PERFORMANCE OF DATA MINING TECHNIQUES TO PREDICT IN HEALTHCARE CASE STUDY: CH...PERFORMANCE OF DATA MINING TECHNIQUES TO PREDICT IN HEALTHCARE CASE STUDY: CH...
PERFORMANCE OF DATA MINING TECHNIQUES TO PREDICT IN HEALTHCARE CASE STUDY: CH...
 
A comprehensive study on disease risk predictions in machine learning
A comprehensive study on disease risk predictions  in machine learning A comprehensive study on disease risk predictions  in machine learning
A comprehensive study on disease risk predictions in machine learning
 
cognitive computing for electronic medical record
cognitive computing for electronic medical record cognitive computing for electronic medical record
cognitive computing for electronic medical record
 
HEALTH PREDICTION ANALYSIS USING DATA MINING
HEALTH PREDICTION ANALYSIS USING DATA  MININGHEALTH PREDICTION ANALYSIS USING DATA  MINING
HEALTH PREDICTION ANALYSIS USING DATA MINING
 
Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017
Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017
Xavier Amatriain, Cofounder & CTO, Curai at MLconf SF 2017
 
ML to cure the world
ML to cure the worldML to cure the world
ML to cure the world
 
vaagdevi paper.pdf
vaagdevi paper.pdfvaagdevi paper.pdf
vaagdevi paper.pdf
 
Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...
Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...
Disease detection for multilabel big dataset using MLAM, Naive Bayes, Adaboos...
 
Multi Disease Detection using Deep Learning
Multi Disease Detection using Deep LearningMulti Disease Detection using Deep Learning
Multi Disease Detection using Deep Learning
 
Diagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approachDiagnosis of rheumatoid arthritis using an ensemble learning approach
Diagnosis of rheumatoid arthritis using an ensemble learning approach
 
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
DIAGNOSIS OF RHEUMATOID ARTHRITIS USING AN ENSEMBLE LEARNING APPROACH
 
The Many Lives of Data
The Many Lives of DataThe Many Lives of Data
The Many Lives of Data
 
Electronic health records and machine learning
Electronic health records and machine learningElectronic health records and machine learning
Electronic health records and machine learning
 

More from Thien Q. Tran

LLM Threats: Prompt Injections and Jailbreak Attacks
LLM Threats: Prompt Injections and Jailbreak AttacksLLM Threats: Prompt Injections and Jailbreak Attacks
LLM Threats: Prompt Injections and Jailbreak Attacks
Thien Q. Tran
 
Introduction to FAST-LAMP
Introduction to FAST-LAMPIntroduction to FAST-LAMP
Introduction to FAST-LAMP
Thien Q. Tran
 
Finding statistically significant interactions between continuous features (I...
Finding statistically significant interactions between continuous features (I...Finding statistically significant interactions between continuous features (I...
Finding statistically significant interactions between continuous features (I...
Thien Q. Tran
 
Introduction to TCAV (ICML2018)
Introduction to TCAV (ICML2018)Introduction to TCAV (ICML2018)
Introduction to TCAV (ICML2018)
Thien Q. Tran
 
Set Transfomer: A Framework for Attention-based Permutaion-Invariant Neural N...
Set Transfomer: A Framework for Attention-based Permutaion-Invariant Neural N...Set Transfomer: A Framework for Attention-based Permutaion-Invariant Neural N...
Set Transfomer: A Framework for Attention-based Permutaion-Invariant Neural N...
Thien Q. Tran
 
Hypothesis testing and statistically sound-pattern mining
Hypothesis testing and statistically sound-pattern miningHypothesis testing and statistically sound-pattern mining
Hypothesis testing and statistically sound-pattern mining
Thien Q. Tran
 

More from Thien Q. Tran (6)

LLM Threats: Prompt Injections and Jailbreak Attacks
LLM Threats: Prompt Injections and Jailbreak AttacksLLM Threats: Prompt Injections and Jailbreak Attacks
LLM Threats: Prompt Injections and Jailbreak Attacks
 
Introduction to FAST-LAMP
Introduction to FAST-LAMPIntroduction to FAST-LAMP
Introduction to FAST-LAMP
 
Finding statistically significant interactions between continuous features (I...
Finding statistically significant interactions between continuous features (I...Finding statistically significant interactions between continuous features (I...
Finding statistically significant interactions between continuous features (I...
 
Introduction to TCAV (ICML2018)
Introduction to TCAV (ICML2018)Introduction to TCAV (ICML2018)
Introduction to TCAV (ICML2018)
 
Set Transfomer: A Framework for Attention-based Permutaion-Invariant Neural N...
Set Transfomer: A Framework for Attention-based Permutaion-Invariant Neural N...Set Transfomer: A Framework for Attention-based Permutaion-Invariant Neural N...
Set Transfomer: A Framework for Attention-based Permutaion-Invariant Neural N...
 
Hypothesis testing and statistically sound-pattern mining
Hypothesis testing and statistically sound-pattern miningHypothesis testing and statistically sound-pattern mining
Hypothesis testing and statistically sound-pattern mining
 

Recently uploaded

insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
AADYARAJPANDEY1
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 
plant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptxplant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptx
yusufzako14
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
Areesha Ahmad
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Predicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdfPredicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdf
binhminhvu04
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
Michel Dumontier
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
sachin783648
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
Large scale production of streptomycin.pptx
Large scale production of streptomycin.pptxLarge scale production of streptomycin.pptx
Large scale production of streptomycin.pptx
Cherry
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 

Recently uploaded (20)

insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
plant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptxplant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptx
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Predicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdfPredicting property prices with machine learning algorithms.pdf
Predicting property prices with machine learning algorithms.pdf
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
Large scale production of streptomycin.pptx
Large scale production of streptomycin.pptxLarge scale production of streptomycin.pptx
Large scale production of streptomycin.pptx
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 

Deep learning in healthcare: Oppotunities and challenges with Electronic Medical Records (EMR) data

  • 1. Deep learning in healthcare Opportunities and challenges with Electronic Medical Records (EMR) data Trần Quang Thiện University of Tsukuba (Master student) 1
  • 2. ● Tran Quang Thien (1993) ● Has been in Japan since 2012 ● Currently a master student at Tsukuba University ● Researchs ○ Feature selection in prediction of infectious disease using large-scale search log data (Yahoo Japan search log). ○ Applied research with Long-term care insurance data and Electronic Medical Record (EMR) data. ● Recent interests ○ Bayesian modeling ○ Selective inference Introduction Who am I 2
  • 3. 1. Introduction to Electronic Medical Records (EMR) data 2. Some common tasks and example solutions 3. Opportunities and challenges Introduction Agenda 3
  • 4. Introduction What is EMR data http://api.sunlab.org/static/media/uxJ/UL5/5b6aef5b241ba60001bec1c1.pdf Age, sex, race, incomes …. - Medicine codes - Procedure codes - Vital signs and labs - Pulse Ox, heart rate, Temperature, Co2... - Diagnosis codes - ICD9, ICD10 ... - Additional comments Nowadays, many types of medical data is collecting across multiples institutes Electronic Medical Record (EMR data) 4
  • 5. CODE CODE CODE CODE CODE CODE CODE CODE CODE CODE CODE CODE CODE CODE CODE 5
  • 6. ● From wikipedia: “Diagnostic coding is the translation of written descriptions of diseases, illnesses and injuries into codes from a particular classification” ● ICD-9, ICD-10 ○ The most popular and currently the international standard ○ Contains diseases, signs, symptoms, external cause, … ○ 14,000 different codes ○ Has a hierarchical structure Introduction OK, so what is diagnosis code 6
  • 7. Introduction Structure of ICD-10 code https://doctors.practo.com/icd-10-codes-important-doctors / Gan hóa sợi và xơ gan có mã bệnh là K74 - Gan hóa sợi K74.0 - Gan hóa xơ K74.1 - Gan hóa sợi với gan hóa xơ K74.2 - Xơ gan mật nguyên phát K74.3 - Xơ gan mật thứ phát K74.4 - Xơ gan mật không xác định K74.5 - Xơ gan khác và không xác định K74.6 7
  • 8. Introduction A sample of diagnosis code Risk Prediction on Electronic Health Records with Prior Medical Knowledge, Fenglong Ma et. al ICD codes 8
  • 10. Commons tasks The (typical) overall framework http://api.sunlab.org/static/media/uxJ/UL5/5b6aef5b241ba60001bec1c1.pdf 10
  • 11. Concept embedding Learning a mapping from raw EMR data to useful representation or medical concept. ● Also called as electronic phenotyping task. ● The result embedding can be used in any other tasks ● Many studies focus on diagnosis/medicine/procedure codes ● Some take inspiration from NLP domain (GloVe, skip-gram v..v) since these codes can be treated as a bag of codes ● Almost trained in unsupervised setting (no ground-truth label) Commons tasks 11
  • 12. GRAM: Graph-based Attention Model for Healthcare Representation Learning, Edward Choi et. al (KDD 2017) Concept embedding Commons tasks 12
  • 13. GRAM: Graph-based Attention Model for Healthcare Representation Learning, Edward Choi et. al (KDD 2017) Concept embedding Commons tasks ● An embedding vector of a code is the mixture of the (base) embedding vectors of ○ The current leaf code ○ It’s ancestors ● Advantages: ○ Usable information from high-level nodes, especially useful for rare codes ○ Something like hierarchical bayes model 13
  • 14. GRAM: Graph-based Attention Model for Healthcare Representation Learning, Edward Choi et. al (KDD 2017) GRAM GloVe skip-gram ● The embedding result matches with medical ontology ● Similar to the shrinkage of parameters in hierarchical models Concept embedding Commons tasks 14
  • 15. GRAM: Graph-based Attention Model for Healthcare Representation Learning, Edward Choi et. al (KDD 2017) ● The embedding result matches with medical ontology ● And also help increasing prediction performance for post-embedding tasks Concept embedding Commons tasks 15
  • 16. Detecting whether specific diseases/stage of disease can be confirmed in the EMR data ● Predicting a target disease using EMR data is not easy ○ Assigning diagnosis is a noisy task due to different practices of doctors or hospitals ○ Predict diagnosis codes from other data ● Examples: ○ Predicting diagnose code from clinical notes ○ Classification stage of Parkinson’s disease ○ Predict heart failure risk ○ Predict medicine codes ←AI doctor... Disease/drugs classification Commons tasks 16
  • 17. Disease/drugs classification Commons tasks 1. RNN 2. Reinforcement finetune 3. Beam search Main techniques LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity, Yutao Zhang et. al (KDD 2017) 17
  • 18. Disease/drugs classification Commons tasks Scoring criterion [2] Complete without unfavorable drug-drug interaction [1] Partially complete without unfavorable interaction [0] Address less than 50% diagnoses or including negative interaction Evaluation criterion: Jaccard coefficient ● The proposed method works better than rule-based or other competitors ● However, the quality seems not enough to be ready in real application yet 18 LEAP: Learning to Prescribe Effective and Safe Treatment Combinations for Multimorbidity, Yutao Zhang et. al (KDD 2017)
  • 19. Predicting future clinical events based on past longitudinal event sequences ● For examples: ○ Prediction of 30-day hospital readmission ○ Predict of heart failure risk ○ Predict of mortality in Intensive Care Unit (ICU) ● Useful in grouping/detection patients with high risk Sequential prediction of clinical events Commons tasks 19
  • 20. Sequential prediction of clinical events Commons tasks RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism, Edward Choi et al. NIPS 2016 - Interpretable model for sequential prediction - Giving explanation for prediction using attention mechanism - 2 level of attention - Importance of each visit - Importance of each code within a visit 20
  • 21. Sequential prediction of clinical events Commons tasks - The attention is calculated using time-reversed RNN model - Mimic the clinician that look at the ERM data backward - Recent visit contributes more 21RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism, Edward Choi et al. NIPS 2016
  • 22. Sequential prediction of clinical events Commons tasks MLP Logistic regression with a hidden layer RNN Two layer of RNN, no attention RNN+ _M RNN with visit-level attention RNN+ _R Reversed-RNN with visit-level attention RETAIN Reversed-RNN with 2 levels attentions ● Task: predict whether an heart failure (HF) event occurs Input: EMR data until the date HF event happens ● The AUC is around 0.87 (High?) ● However, normal RNN is also high …. ● NOTE: y-axis limit starts from 0.76 22RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism, Edward Choi et al. NIPS 2016
  • 23. Sequential prediction of clinical events Commons tasks ● Since the different in performance is not significance ● RETAIN give us the answer for the question WHY 23RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism, Edward Choi et al. NIPS 2016
  • 24. Data augmentation: Creating synthesis data elements or patient records based on real EMR data ● Saving labeling cost ● Generated synthesis data can be useful for privacy issue ● Most researches based on GANs techniques ● medGAN paper reported that generated data is indistinguishable to a human doctor expert. Data augmentation Commons tasks 24
  • 25. Data augmentation Commons tasks Generating Multi-label Discrete Patient Records using Generative Adversarial Networks Edward Choi et al. (JMLR 2017) ● Task Generating synthetic patient records from real data ○ Require 1: (statistically) similar to the real records ○ Require 2: Individual patient information can not be exploited ● Result Indistinguishable to a human doctor, except a few outliers 25
  • 27. Opportunities Growing data availability Adoption of EMR system in Japan 医療実施調査(厚生労働省) ● The data is quite old…. ● In 2017, only 41.6% of hospital, clinical in Japan adopted EMR system ● However, the number is about 85% for big hospitals ( >400 beds) 27
  • 28. Opportunities Growing data availability Adoption of EMR system in US ● The data is also old …. ● 87% physicians interviewed uses EMR system ● However, the coverage on hospitals/clinicals is unclear 28
  • 29. Opportunities Growing data availability Thực hiện Nghị quyết 20, đến nay, Bộ Y tế đã xây dựng xong phần mềm hồ sơ sức khỏe điện tử (EHR) sử dụng nguồn dữ liệu hộ gia đình tham gia BHYT của Bảo hiểm xã hội Việt Nam để tạo lập mã số định danh (ID). Theo kế hoạch, từ tháng 1-2019 đến tháng 6-2019, triển khai và hoàn thiện phần mềm hồ sơ sức khỏe điện tử cho tám tỉnh, thành phố trong mô hình điểm. Từ tháng 7-2019 tổ chức triển khai nhân rộng trên toàn quốc. Ðến cuối năm 2019 sẽ hình thành hệ thống hồ sơ sức khỏe điện tử cho mỗi người dân. Khi người dân đến cơ sở y tế, người thầy thuốc ở bất kỳ đâu trên lãnh thổ Việt Nam, chỉ cần một động tác nhấn chuột, máy tính sẽ hiện ra đầy đủ thông tin về hiện trạng sức khỏe của người đó, giúp ích rất nhiều cho chẩn đoán và điều trị. (Báo nhân dân điện tử, 31/01/2019) Adoption of EMR system in Vietnam ● NEW newspaper ! ● Nationwide EMR system will be completed in 2019 ! ● A better, digitized healthcare system is coming ! 29
  • 30. Opportunities Growing data availability Adoption of EMR system in Vietnam Image from a friend doctor in Vietnam… Medical research scene, Cho Ray hospital, 11/4/2019 30
  • 31. Opportunities Promising achieves of deep learning ● Have shown outstanding performance in various tasks ○ Computer vision ○ Speech recognition ○ Natural language processing ○ Reinforcement learning ○ ……. ● Can bypass the feature engineering process ○ E.g. ICD-10 code contains more than 70,000 types of codes How can we express these code in our model ? ● Deep learning structures are well suited with various healthcare tasks ○ CNN ○ RNN, LSTM ○ GANs ○ Attention mechanism ○ …... 31
  • 32. However, developing deep learning models using EMR data is a challenging task. 32
  • 33. Challenges ● Lack of data ○ The EMR system is not ready for some countries, regions ○ Complicated process to get access to healthcare data due to the significant sensitivity of the data. ○ Combining EMR data across institutions can be complicated because of difference in EMR versions between institutions. ● Lack of labels (both in quality and quantity) ○ Annotation requires domain knowledge of trained experts ○ Rare diseases implies less data ○ Even noisy between clinicians or institutions ■ E.g. ICD-10 contains many similar codes ■ Different practices of doctors Challenges with EMR data 33
  • 34. Challenges Challenges with EMR data ● Heterogeneous types of data ○ Numeric values (lab tests) ○ Discrete codes (diagnosis, medicine, procedures) ○ Continuous monitoring data ○ Free-text clinical notes ○ Medical images How to effectively use these data altogether ? ● Temporality and irregularity ○ Events are irregularly sampled (and also biased) ○ Long-term and short-term dependencies 34
  • 35. Challenges Challenges with healthcare domain ● Decision in healthcare is sensitive ○ Require immediate information and decision ○ Sometimes have life or death consequences ○ High standard required How much of accuracy will be considered enough? ● Model needs interpretability ○ An accurate black-box might not be enough ○ Clinicians often require reasoning behind prediction ● Toward more complex output ○ Question and answer 35
  • 37. Some public datasets ● MIMIC-III, a freely accessible critical care database. Johnson AEW, Pollard TJ, Shen L, Lehman L, Feng M, Ghassemi M, Moody B, Szolovits P, Celi LA, and Mark RG. Scientific Data (2016) https://mimic.physionet.org/ ● EMRBOTS, experiment with artificial large medical datasets without worrying about privacy. http://www.emrbots.org/ 37
  • 38. Other inferences ● Deep learning for healthcare: review, opportunities and challenges Riccardo Miotto, Fei Wang, Shuang Wang, Xiaoqian Jiang and Joel T. Dudley ● Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review Cao Xiao, Edward Choi and Jimeng Sun ● Harnessing the Power of Data in Health Stanford Medicine 2017 Health Trends Report ● The opportunities and challenges of data analytics in health care Paul B. Ginsburg, Andrés de Loera-Brust, Caitlin Brandt, and Abigail Durak ● AI And Healthcare: A Giant Opportunity Insights Team, Insights Contributor FORBES INSIGHTS With Intel AI 38