SlideShare a Scribd company logo
1
Clinical Risk Prediction with Temporal Probabilistic
Asymmetric Multi-Task Learning
1School of Computing, 2Graduate School of AI,
Korea Advanced Institute of Science and Technology,
3Aitrics, 4Department of Computer Science, University of Oxford
Tuan Nguyen* 1,4, Hyewon Jeong* 1, Eunho Yang 1,2,3, and Sung Ju Hwang 1,2,3
Clinical Risk Prediction with Multi-Task Learning
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
2
Introduction
Heart Rate (HR)
Respiratory Rate (RR)
Oxygen saturation (SpO2)
Body Temperature (BT)
White Blood Cell Count (WBC)
Body Temperature Elevation
Vital Sign (>37.7 C, 99.9 F)
Diagnostic
Test
Symptoms and Signs
as a result of infection
Positive for Bacteria
/ Fungus / Virus
Task 1 : Fever Task 2 : Infection
Evidence & Proof of infection
One probable
result of infection
Task 3 : Mortality
Mortality
Features Tasks
Task1: Fever
Task2: Infection
Task3: Mortality
Negative Transfer
MTL: clinical setting (MIMIC III-Infection)
Clinical Risk Prediction with Multi-Task Learning
Negative Transfer Problem in Multi-Task Learning
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
3
Introduction
Heart Rate (HR)
Respiratory Rate (RR)
Oxygen saturation (SpO2)
Body Temperature (BT)
White Blood Cell Count (WBC)
Body Temperature Elevation
Vital Sign (>37.7 C, 99.9 F)
Diagnostic
Test
Symptoms and Signs
as a result of infection
Positive for Bacteria
/ Fungus / Virus
Task 1 : Fever Task 2 : Infection
Evidence & Proof of infection
One probable
result of infection
Task 3 : Mortality
Mortality
Features Tasks
Task1: Fever
Task2: Infection
Task3: Mortality
Negative Transfer
MTL: clinical setting (MIMIC III-Infection)
Unreliable Predictor
Clinical Risk Prediction with Multi-Task Learning
Asymmetric Knowledge Transfer Across Timesteps
4
Introduction
𝑓!
𝑓"
𝑓#
…
Fever
𝑖!
𝑖"
𝑖#
Step 1
𝑚!
𝑚"
𝑚#
…
…
Step 2 Step T
Infection
Mortality
낮은
불확실성
높은
불확실성
Body Temperature Elevation
Vital Sign (>37.7 C, 99.9 F)
Diagnostic
Test
Symptoms and Signs
as a result of infection
Positive for Bacteria
/ Fungus / Virus
Task 1 : Fever Task 2 : Infection
Evidence & Proof of infection
One probable
result of infection
Task 3 : Mortality
Mortality
Deep AMTFL
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
MTL: clinical setting (MIMIC III-Infection)
Probabilistic Asymmetric Multi-Task Learning (P-AMTL)
Introduction
Uncertainty-Aware Asymmetric Multi-Task Learning
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
Probabilistic Asymmetric Multi-Task Learning (P-AMTL)
6
0.3
0.4
0.5
0.6
0.7
0
0.02
0.04
0.06
0.08
0.1
0.12
Task 0 Task 1
Knowledge
Transfer
Loss
KT in Loss-based AMTL
Loss
KT
0
0.02
0.04
0.06
0.08
0.1
0.12
0
0.2
0.4
0.6
0.8
Task 0 Task 1
Knowledge
Transfer
Uncertainty
KT in P-AMTL
UC
KT
2000 200
instances 2000 200
instances
-0.02
-0.015
-0.01
-0.005
0
0.005
0.01
0.015
Accuracy
Improvement
over
STL
Loss-based AMTL
TPAMTL
…
…
Step 1 Step 2 Step T
Loss
Loss
Loss-Based AMTL (Lee et al., 2018)
fd
(1) fd
(2) fd
(3)
fj
(1) fj
(2) fj
(3)
…
Low UC
High UC
𝑓!
(#)
UC-aware AMTL
…
Step 1 Step 2 Step T
fd
(1) fd
(2) fd
(3)
fj
(1) fj
(2) fj
(3)
𝑍!, 𝑍" :	High level latent feature
𝑓!, 𝑓" : Multiple features
across timesteps
(𝑍! = 𝑓!
#
, 𝑓!
$
, … , 𝑓!
%
)
(𝑍" = 𝑓"
#
, 𝑓"
$
, … , 𝑓"
%
)
Task J
Task D
Approach
Failure of Loss-based Asymmetric Multi-Task Learning
Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
Multiple Features
Across Timesteps
Failure of Loss-based AMTL
7
Approach
Table 1. Task Performance of MNIST-variation Experiment
(AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Knowledge transfer happens from more reliable to less reliable features. Knowledge transfer happens
inter-task(in order to capture task relatedness) and across-timestep.
Uncertainty Aware Knowledge Transfer: example case
!
Multiple Features
(zj for Task j)
+ Gj
2
αd,j
Gd
1
!
fd
(1)
Multiple Features
(zd for Task d)
αj,d
Gj
1
+
Gj
1 Gj
1
fd
(3)
fd
(1)
fj
(1)
fd
(3)
fd
(1)
fj
(1)
Transform from more reliable to less reliable latent features.
Knowledge transfer from Certain (low UC) task to Uncertain (high UC) task
!!,# = #!,# $!,#, $#, &!,#
$ , &#
$
!#,! = ##,! $#,!, $!, &#,!
$
, &!
$
"!
(#)
= $!
(#)
+ &!(∑ ∑ )%,!
',#
∗ &%
#
'()
*
%+) $%
'
)	∀. ∈ {1,2, … , !}	
* Same also happens for intra-task, inter-timestep knowledge transfer
TP-AMTL: Uncertainty-Aware Knowledge Transfer
Approach
TP-AMTL: Uncertainty-Aware Knowledge Transfer
Knowledge transfer happens from more reliable to less reliable features. Knowledge transfer happens
inter-task(in order to capture task relatedness) and across-timestep.
Uncertainty Aware Knowledge Transfer: example case
𝑇
Multiple Features
(zj for Task j)
+ Gj
2
αd,j
Gd
1
𝑇
fd
(1)
Multiple Features
(zd for Task d)
αj,d
Gj
1
+
Gj
1 Gj
1
fd
(3)
fd
(1)
fj
(1)
fd
(3)
fd
(1)
fj
(1)
Transform from more reliable to less reliable latent features.
Knowledge transfer from Certain (low UC) task to Uncertain (high UC) task
Approach
𝛼!,# = 𝐹!,# 𝑍!,#, 𝑍#, 𝜎!,#
$
, 𝜎#
$
𝛼#,! = 𝐹#,! 𝑍#,!, 𝑍!, 𝜎#,!
$
, 𝜎!
$
𝐶%
(&)
= 𝑓%
(&)
+ 𝐺%(∑!'(
)
∑*+(
&
𝛼!,%
*,&
∗ 𝐺! 𝑓!
*
) ∀𝑡 ∈ {1,2, … , 𝑇}
* Same also happens for intra-task, inter-timestep knowledge transfer
𝑧# ∼ 𝑝% 𝑧# 𝑥, 𝜔
𝑝% 𝑧# 𝑥, 𝜔 ∼ 𝒩(𝑧#; 𝜇#, 𝑑𝑖𝑎𝑔 𝜎#
$
)
Complexity Analysis
10
Approach
Supplementary Table 1. Time Complexity of the Baseline Models
Tasks and Datasets
11
Task 1 : Stay < 3
Length of ICU Stay
Task 2 : Cardiac
Recovering from
Cardiac Surgery
Task 4 : Mortality
Task 3 : Recovery
Recovering from
general surgery
PhysioNet2012
Body Temperature Elevation
Vital Sign (>37.7 C, 99.9 F)
Diagnostic
Test
Symptoms and Signs
as a result of infection
Positive for Bacteria
/ Fungus / Virus
Task 1 : Fever Task 2 : Infection
Evidence & Proof of infection
One probable
result of infection
Task 3 : Mortality
Mortality
MIMIC - III Infection
2,000 data points
Tasks : Fever à Infection à Mortality
Features: 12 Infection related features : including heart rate,
arterial blood pressure, and Glasgow Coma Scale(GCS) etc.
4,000 distinct hospital (ICU) records
Tasks: Stay < 3 / Cardiac / Recovery à Mortality
Features: 31 physiological signs including heart rate,
respiratory rate, temperature, etc.
Experiments
Information on MIMIC - III Respiratory Failure, Heart Failure can be found in the supplementary file
Quantitative Results
12
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Quantitative Results
13
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
1
Quantitative Results
14
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Quantitative Results
15
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Quantitative Results
16
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Quantitative Results
17
STL : Singletask Learning
MTL : Multitask Learning
Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and
Multi-Task Learning(MTL) baselines on both datasets.
Experiments
Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset.
(Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
Source features with low uncertainties transfer knowledge more, while at the target,
features with high uncertainties receive more knowledge transfer.
Qualitative Results: Knowledge Transfer Graph
Normalized amount of knowledge transfer from
multiple sources (task 𝑗 at time 𝑡) to task 𝑑
(normalized over the number of targets)
18
Normalized amount of knowledge transfer to multiple
targets (task 𝑑 at time 𝑡) from task 𝑗
(normalized over the number of sources)
Incoming Transfer to different Targets
Outgoing Transfer from different Sources
𝛼!,#
&,&
+ 𝛼!,#
&,&'(
+ ⋯ + 𝛼!,#
&,)
𝑇 − 𝑡 + 1
− (1)
𝛼!,%
(,&
+ 𝛼!,%
-,&
+ ⋯ + 𝛼!,%
&,&
𝑡
− (2)
Experiments
Qualitative Results: Medical Interpretation
19
Interpretation of the Learned Knowledge Graph
By analyzing selected clinical case studies, we could identify steps where knowledge transferred as we
designed and meaningful medical events occur, which correlates with interactions between selected tasks.
MechVent - Mechanical Ventilation, FiO2 - Fractional inspired Oxygen, SBP - Systolic arterial blood pressure,
DBP - Diastolic arterial blood pressure, HR - Heart Rate, Temp - Body Temperature, Urine - Urine output,
GCS - Glasgow Coma Score, WBC - White Blood Cell Count, Culture - Culture Results.
Experiments
Ablation Study
20
AMTL-Intratask
Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer
AMTL-Samestep
TD-AMTL
Deterministic variant of TP-AMTL
Experiments
TP-AMTL (constrained)
Effectiveness of Future-to-Past Transfer
TP-AMTL (epistemic)
Effectiveness of Uncertainty Types
TP-AMTL (aleatoric)
𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0)
Knowledge Transfer only happens from the later timestep
to earlier ones
Ablation Study
21
AMTL-Intratask
Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer
AMTL-Samestep
TD-AMTL
Deterministic variant of TP-AMTL
Experiments
TP-AMTL (constrained)
Effectiveness of Future-to-Past Transfer
TP-AMTL (epistemic)
Effectiveness of Uncertainty Types
TP-AMTL (aleatoric)
𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0)
Knowledge Transfer only happens from the later timestep
to earlier ones
Ablation Study
22
AMTL-Intratask
Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer
AMTL-Samestep
TD-AMTL
Deterministic variant of TP-AMTL
Experiments
TP-AMTL (constrained)
Effectiveness of Future-to-Past Transfer
TP-AMTL (epistemic)
Effectiveness of Uncertainty Types
TP-AMTL (aleatoric)
𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0)
Knowledge Transfer only happens from the later timestep
to earlier ones
Ablation Study
23
AMTL-Intratask
Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer
AMTL-Samestep
TD-AMTL
Deterministic variant of TP-AMTL
Experiments
TP-AMTL (constrained)
Effectiveness of Future-to-Past Transfer
TP-AMTL (epistemic)
Effectiveness of Uncertainty Types
TP-AMTL (aleatoric)
𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0)
Knowledge Transfer only happens from the later timestep
to earlier ones
• We proposed a novel probabilistic asymmetric multi-task learning framework
that allows asymmetric knowledge transfer between tasks at different timesteps,
based on the uncertainty.
• We use a probabilistic Bayesian formulation for asymmetric knowledge transfer,
where the amount of knowledge transfer depends on the uncertainty at the
feature level.
• We validate our model on clinical risk prediction tasks, on which it achieves
significant improvements over baselines and provides meaningful interpretations,
including temporal relationships between tasks.
Conclusions
24
Thank you
25

More Related Content

Similar to Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning

Introduction to Item Response Theory
Introduction to Item Response TheoryIntroduction to Item Response Theory
Introduction to Item Response Theory
OpenThink Labs
 
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATADETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
IJCSEA Journal
 
Computational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisComputational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysis
Aboul Ella Hassanien
 
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
Servio Fernando Lima Reina
 
Error entropy minimization for brain image registration using hilbert huang t...
Error entropy minimization for brain image registration using hilbert huang t...Error entropy minimization for brain image registration using hilbert huang t...
Error entropy minimization for brain image registration using hilbert huang t...
eSAT Publishing House
 
A mathematical model of movement in virtual reality through thoughts
A mathematical model of movement in virtual reality through thoughts A mathematical model of movement in virtual reality through thoughts
A mathematical model of movement in virtual reality through thoughts
IJECEIAES
 
ITK Tutorial Presentation Slides-950
ITK Tutorial Presentation Slides-950ITK Tutorial Presentation Slides-950
ITK Tutorial Presentation Slides-950
Kitware Kitware
 
Exponential software reliability using SPRT: MLE
Exponential software reliability using SPRT: MLEExponential software reliability using SPRT: MLE
Exponential software reliability using SPRT: MLE
IOSR Journals
 
chapter10
chapter10chapter10
chapter10
butest
 
Robust Immunological Algorithms for High-Dimensional Global Optimization
Robust Immunological Algorithms for High-Dimensional Global OptimizationRobust Immunological Algorithms for High-Dimensional Global Optimization
Robust Immunological Algorithms for High-Dimensional Global Optimization
Mario Pavone
 
Week 1 Lec 1-5 with watermarking.pdf
Week 1 Lec 1-5 with watermarking.pdfWeek 1 Lec 1-5 with watermarking.pdf
Week 1 Lec 1-5 with watermarking.pdf
meghana092
 
Week_1_Lec_1-5_with_watermarking_(1).pdf
Week_1_Lec_1-5_with_watermarking_(1).pdfWeek_1_Lec_1-5_with_watermarking_(1).pdf
Week_1_Lec_1-5_with_watermarking_(1).pdf
PrabhaK22
 
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
IOSR Journals
 
Using Item Response Theory to Improve Assessment
Using Item Response Theory to Improve AssessmentUsing Item Response Theory to Improve Assessment
Using Item Response Theory to Improve Assessment
Nathan Thompson
 
Gaining Confidence in Signalling and Regulatory Networks
Gaining Confidence in Signalling and Regulatory NetworksGaining Confidence in Signalling and Regulatory Networks
Gaining Confidence in Signalling and Regulatory Networks
Michael Stumpf
 
V01 i010401
V01 i010401V01 i010401
V01 i010401
IJARBEST JOURNAL
 
Meru_A_Patil
Meru_A_PatilMeru_A_Patil
Meru_A_Patil
Meru Patil
 
Puce U kentucky_2020
Puce U kentucky_2020Puce U kentucky_2020
Puce U kentucky_2020
Aina Puce
 
Machine learning in Healthcare - WeCloudData
Machine learning in Healthcare - WeCloudDataMachine learning in Healthcare - WeCloudData
Machine learning in Healthcare - WeCloudData
WeCloudData
 
CLASSIFIER SELECTION MODELS FOR INTRUSION DETECTION SYSTEM (IDS)
CLASSIFIER SELECTION MODELS FOR INTRUSION DETECTION SYSTEM (IDS)CLASSIFIER SELECTION MODELS FOR INTRUSION DETECTION SYSTEM (IDS)
CLASSIFIER SELECTION MODELS FOR INTRUSION DETECTION SYSTEM (IDS)
ieijjournal1
 

Similar to Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning (20)

Introduction to Item Response Theory
Introduction to Item Response TheoryIntroduction to Item Response Theory
Introduction to Item Response Theory
 
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATADETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
DETECTION OF RELIABLE SOFTWARE USING SPRT ON TIME DOMAIN DATA
 
Computational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysisComputational model for artificial learning using formal concept analysis
Computational model for artificial learning using formal concept analysis
 
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
 
Error entropy minimization for brain image registration using hilbert huang t...
Error entropy minimization for brain image registration using hilbert huang t...Error entropy minimization for brain image registration using hilbert huang t...
Error entropy minimization for brain image registration using hilbert huang t...
 
A mathematical model of movement in virtual reality through thoughts
A mathematical model of movement in virtual reality through thoughts A mathematical model of movement in virtual reality through thoughts
A mathematical model of movement in virtual reality through thoughts
 
ITK Tutorial Presentation Slides-950
ITK Tutorial Presentation Slides-950ITK Tutorial Presentation Slides-950
ITK Tutorial Presentation Slides-950
 
Exponential software reliability using SPRT: MLE
Exponential software reliability using SPRT: MLEExponential software reliability using SPRT: MLE
Exponential software reliability using SPRT: MLE
 
chapter10
chapter10chapter10
chapter10
 
Robust Immunological Algorithms for High-Dimensional Global Optimization
Robust Immunological Algorithms for High-Dimensional Global OptimizationRobust Immunological Algorithms for High-Dimensional Global Optimization
Robust Immunological Algorithms for High-Dimensional Global Optimization
 
Week 1 Lec 1-5 with watermarking.pdf
Week 1 Lec 1-5 with watermarking.pdfWeek 1 Lec 1-5 with watermarking.pdf
Week 1 Lec 1-5 with watermarking.pdf
 
Week_1_Lec_1-5_with_watermarking_(1).pdf
Week_1_Lec_1-5_with_watermarking_(1).pdfWeek_1_Lec_1-5_with_watermarking_(1).pdf
Week_1_Lec_1-5_with_watermarking_(1).pdf
 
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
Image Processing for Automated Flaw Detection and CMYK model for Color Image ...
 
Using Item Response Theory to Improve Assessment
Using Item Response Theory to Improve AssessmentUsing Item Response Theory to Improve Assessment
Using Item Response Theory to Improve Assessment
 
Gaining Confidence in Signalling and Regulatory Networks
Gaining Confidence in Signalling and Regulatory NetworksGaining Confidence in Signalling and Regulatory Networks
Gaining Confidence in Signalling and Regulatory Networks
 
V01 i010401
V01 i010401V01 i010401
V01 i010401
 
Meru_A_Patil
Meru_A_PatilMeru_A_Patil
Meru_A_Patil
 
Puce U kentucky_2020
Puce U kentucky_2020Puce U kentucky_2020
Puce U kentucky_2020
 
Machine learning in Healthcare - WeCloudData
Machine learning in Healthcare - WeCloudDataMachine learning in Healthcare - WeCloudData
Machine learning in Healthcare - WeCloudData
 
CLASSIFIER SELECTION MODELS FOR INTRUSION DETECTION SYSTEM (IDS)
CLASSIFIER SELECTION MODELS FOR INTRUSION DETECTION SYSTEM (IDS)CLASSIFIER SELECTION MODELS FOR INTRUSION DETECTION SYSTEM (IDS)
CLASSIFIER SELECTION MODELS FOR INTRUSION DETECTION SYSTEM (IDS)
 

More from MLAI2

Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
MLAI2
 
Online Hyperparameter Meta-Learning with Hypergradient Distillation
Online Hyperparameter Meta-Learning with Hypergradient DistillationOnline Hyperparameter Meta-Learning with Hypergradient Distillation
Online Hyperparameter Meta-Learning with Hypergradient Distillation
MLAI2
 
Online Coreset Selection for Rehearsal-based Continual Learning
Online Coreset Selection for Rehearsal-based Continual LearningOnline Coreset Selection for Rehearsal-based Continual Learning
Online Coreset Selection for Rehearsal-based Continual Learning
MLAI2
 
Representational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual LearningRepresentational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual Learning
MLAI2
 
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual LearningSequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
MLAI2
 
Skill-Based Meta-Reinforcement Learning
Skill-Based Meta-Reinforcement LearningSkill-Based Meta-Reinforcement Learning
Skill-Based Meta-Reinforcement Learning
MLAI2
 
Edge Representation Learning with Hypergraphs
Edge Representation Learning with HypergraphsEdge Representation Learning with Hypergraphs
Edge Representation Learning with Hypergraphs
MLAI2
 
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...
MLAI2
 
Mini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
Mini-Batch Consistent Slot Set Encoder For Scalable Set EncodingMini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
Mini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
MLAI2
 
Task Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive LearningTask Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive Learning
MLAI2
 
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
MLAI2
 
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-LearningMeta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
MLAI2
 
Accurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingAccurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset Pooling
MLAI2
 
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
MLAI2
 
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and ArchitecturesMetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MLAI2
 
Adversarial Self-Supervised Contrastive Learning
Adversarial Self-Supervised Contrastive LearningAdversarial Self-Supervised Contrastive Learning
Adversarial Self-Supervised Contrastive Learning
MLAI2
 
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
MLAI2
 
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...
Neural Mask Generator : Learning to Generate Adaptive WordMaskings for Langu...Neural Mask Generator : Learning to Generate Adaptive WordMaskings for Langu...
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...
MLAI2
 
Cost-effective Interactive Attention Learning with Neural Attention Process
Cost-effective Interactive Attention Learning with Neural Attention ProcessCost-effective Interactive Attention Learning with Neural Attention Process
Cost-effective Interactive Attention Learning with Neural Attention Process
MLAI2
 
Adversarial Neural Pruning with Latent Vulnerability Suppression
Adversarial Neural Pruning with Latent Vulnerability SuppressionAdversarial Neural Pruning with Latent Vulnerability Suppression
Adversarial Neural Pruning with Latent Vulnerability Suppression
MLAI2
 

More from MLAI2 (20)

Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
 
Online Hyperparameter Meta-Learning with Hypergradient Distillation
Online Hyperparameter Meta-Learning with Hypergradient DistillationOnline Hyperparameter Meta-Learning with Hypergradient Distillation
Online Hyperparameter Meta-Learning with Hypergradient Distillation
 
Online Coreset Selection for Rehearsal-based Continual Learning
Online Coreset Selection for Rehearsal-based Continual LearningOnline Coreset Selection for Rehearsal-based Continual Learning
Online Coreset Selection for Rehearsal-based Continual Learning
 
Representational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual LearningRepresentational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual Learning
 
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual LearningSequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
 
Skill-Based Meta-Reinforcement Learning
Skill-Based Meta-Reinforcement LearningSkill-Based Meta-Reinforcement Learning
Skill-Based Meta-Reinforcement Learning
 
Edge Representation Learning with Hypergraphs
Edge Representation Learning with HypergraphsEdge Representation Learning with Hypergraphs
Edge Representation Learning with Hypergraphs
 
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Genera...
 
Mini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
Mini-Batch Consistent Slot Set Encoder For Scalable Set EncodingMini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
Mini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
 
Task Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive LearningTask Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive Learning
 
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
 
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-LearningMeta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
 
Accurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingAccurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset Pooling
 
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
 
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and ArchitecturesMetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
 
Adversarial Self-Supervised Contrastive Learning
Adversarial Self-Supervised Contrastive LearningAdversarial Self-Supervised Contrastive Learning
Adversarial Self-Supervised Contrastive Learning
 
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
 
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...
Neural Mask Generator : Learning to Generate Adaptive WordMaskings for Langu...Neural Mask Generator : Learning to Generate Adaptive WordMaskings for Langu...
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...
 
Cost-effective Interactive Attention Learning with Neural Attention Process
Cost-effective Interactive Attention Learning with Neural Attention ProcessCost-effective Interactive Attention Learning with Neural Attention Process
Cost-effective Interactive Attention Learning with Neural Attention Process
 
Adversarial Neural Pruning with Latent Vulnerability Suppression
Adversarial Neural Pruning with Latent Vulnerability SuppressionAdversarial Neural Pruning with Latent Vulnerability Suppression
Adversarial Neural Pruning with Latent Vulnerability Suppression
 

Recently uploaded

GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
fredae14
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
GDSC PJATK
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
HarisZaheer8
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
saastr
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
Dinusha Kumarasiri
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
Postman
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
flufftailshop
 

Recently uploaded (20)

GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Recommendation System using RAG Architecture
Recommendation System using RAG ArchitectureRecommendation System using RAG Architecture
Recommendation System using RAG Architecture
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!Finale of the Year: Apply for Next One!
Finale of the Year: Apply for Next One!
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
WeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation TechniquesWeTestAthens: Postman's AI & Automation Techniques
WeTestAthens: Postman's AI & Automation Techniques
 
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdfNunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
Nunit vs XUnit vs MSTest Differences Between These Unit Testing Frameworks.pdf
 

Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning

  • 1. 1 Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning 1School of Computing, 2Graduate School of AI, Korea Advanced Institute of Science and Technology, 3Aitrics, 4Department of Computer Science, University of Oxford Tuan Nguyen* 1,4, Hyewon Jeong* 1, Eunho Yang 1,2,3, and Sung Ju Hwang 1,2,3
  • 2. Clinical Risk Prediction with Multi-Task Learning Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018. 2 Introduction Heart Rate (HR) Respiratory Rate (RR) Oxygen saturation (SpO2) Body Temperature (BT) White Blood Cell Count (WBC) Body Temperature Elevation Vital Sign (>37.7 C, 99.9 F) Diagnostic Test Symptoms and Signs as a result of infection Positive for Bacteria / Fungus / Virus Task 1 : Fever Task 2 : Infection Evidence & Proof of infection One probable result of infection Task 3 : Mortality Mortality Features Tasks Task1: Fever Task2: Infection Task3: Mortality Negative Transfer MTL: clinical setting (MIMIC III-Infection)
  • 3. Clinical Risk Prediction with Multi-Task Learning Negative Transfer Problem in Multi-Task Learning Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018. 3 Introduction Heart Rate (HR) Respiratory Rate (RR) Oxygen saturation (SpO2) Body Temperature (BT) White Blood Cell Count (WBC) Body Temperature Elevation Vital Sign (>37.7 C, 99.9 F) Diagnostic Test Symptoms and Signs as a result of infection Positive for Bacteria / Fungus / Virus Task 1 : Fever Task 2 : Infection Evidence & Proof of infection One probable result of infection Task 3 : Mortality Mortality Features Tasks Task1: Fever Task2: Infection Task3: Mortality Negative Transfer MTL: clinical setting (MIMIC III-Infection) Unreliable Predictor
  • 4. Clinical Risk Prediction with Multi-Task Learning Asymmetric Knowledge Transfer Across Timesteps 4 Introduction 𝑓! 𝑓" 𝑓# … Fever 𝑖! 𝑖" 𝑖# Step 1 𝑚! 𝑚" 𝑚# … … Step 2 Step T Infection Mortality 낮은 불확실성 높은 불확실성 Body Temperature Elevation Vital Sign (>37.7 C, 99.9 F) Diagnostic Test Symptoms and Signs as a result of infection Positive for Bacteria / Fungus / Virus Task 1 : Fever Task 2 : Infection Evidence & Proof of infection One probable result of infection Task 3 : Mortality Mortality Deep AMTFL Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018. MTL: clinical setting (MIMIC III-Infection)
  • 5. Probabilistic Asymmetric Multi-Task Learning (P-AMTL) Introduction Uncertainty-Aware Asymmetric Multi-Task Learning Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018.
  • 6. Probabilistic Asymmetric Multi-Task Learning (P-AMTL) 6 0.3 0.4 0.5 0.6 0.7 0 0.02 0.04 0.06 0.08 0.1 0.12 Task 0 Task 1 Knowledge Transfer Loss KT in Loss-based AMTL Loss KT 0 0.02 0.04 0.06 0.08 0.1 0.12 0 0.2 0.4 0.6 0.8 Task 0 Task 1 Knowledge Transfer Uncertainty KT in P-AMTL UC KT 2000 200 instances 2000 200 instances -0.02 -0.015 -0.01 -0.005 0 0.005 0.01 0.015 Accuracy Improvement over STL Loss-based AMTL TPAMTL … … Step 1 Step 2 Step T Loss Loss Loss-Based AMTL (Lee et al., 2018) fd (1) fd (2) fd (3) fj (1) fj (2) fj (3) … Low UC High UC 𝑓! (#) UC-aware AMTL … Step 1 Step 2 Step T fd (1) fd (2) fd (3) fj (1) fj (2) fj (3) 𝑍!, 𝑍" : High level latent feature 𝑓!, 𝑓" : Multiple features across timesteps (𝑍! = 𝑓! # , 𝑓! $ , … , 𝑓! % ) (𝑍" = 𝑓" # , 𝑓" $ , … , 𝑓" % ) Task J Task D Approach Failure of Loss-based Asymmetric Multi-Task Learning Hae Beom Lee, Eunho Yang, and Sung Ju Hwang. Deep asymmetric multi-task feature learning. ICML 2018. Multiple Features Across Timesteps
  • 7. Failure of Loss-based AMTL 7 Approach Table 1. Task Performance of MNIST-variation Experiment (AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 8. Knowledge transfer happens from more reliable to less reliable features. Knowledge transfer happens inter-task(in order to capture task relatedness) and across-timestep. Uncertainty Aware Knowledge Transfer: example case ! Multiple Features (zj for Task j) + Gj 2 αd,j Gd 1 ! fd (1) Multiple Features (zd for Task d) αj,d Gj 1 + Gj 1 Gj 1 fd (3) fd (1) fj (1) fd (3) fd (1) fj (1) Transform from more reliable to less reliable latent features. Knowledge transfer from Certain (low UC) task to Uncertain (high UC) task !!,# = #!,# $!,#, $#, &!,# $ , &# $ !#,! = ##,! $#,!, $!, &#,! $ , &! $ "! (#) = $! (#) + &!(∑ ∑ )%,! ',# ∗ &% # '() * %+) $% ' ) ∀. ∈ {1,2, … , !} * Same also happens for intra-task, inter-timestep knowledge transfer TP-AMTL: Uncertainty-Aware Knowledge Transfer Approach
  • 9. TP-AMTL: Uncertainty-Aware Knowledge Transfer Knowledge transfer happens from more reliable to less reliable features. Knowledge transfer happens inter-task(in order to capture task relatedness) and across-timestep. Uncertainty Aware Knowledge Transfer: example case 𝑇 Multiple Features (zj for Task j) + Gj 2 αd,j Gd 1 𝑇 fd (1) Multiple Features (zd for Task d) αj,d Gj 1 + Gj 1 Gj 1 fd (3) fd (1) fj (1) fd (3) fd (1) fj (1) Transform from more reliable to less reliable latent features. Knowledge transfer from Certain (low UC) task to Uncertain (high UC) task Approach 𝛼!,# = 𝐹!,# 𝑍!,#, 𝑍#, 𝜎!,# $ , 𝜎# $ 𝛼#,! = 𝐹#,! 𝑍#,!, 𝑍!, 𝜎#,! $ , 𝜎! $ 𝐶% (&) = 𝑓% (&) + 𝐺%(∑!'( ) ∑*+( & 𝛼!,% *,& ∗ 𝐺! 𝑓! * ) ∀𝑡 ∈ {1,2, … , 𝑇} * Same also happens for intra-task, inter-timestep knowledge transfer 𝑧# ∼ 𝑝% 𝑧# 𝑥, 𝜔 𝑝% 𝑧# 𝑥, 𝜔 ∼ 𝒩(𝑧#; 𝜇#, 𝑑𝑖𝑎𝑔 𝜎# $ )
  • 10. Complexity Analysis 10 Approach Supplementary Table 1. Time Complexity of the Baseline Models
  • 11. Tasks and Datasets 11 Task 1 : Stay < 3 Length of ICU Stay Task 2 : Cardiac Recovering from Cardiac Surgery Task 4 : Mortality Task 3 : Recovery Recovering from general surgery PhysioNet2012 Body Temperature Elevation Vital Sign (>37.7 C, 99.9 F) Diagnostic Test Symptoms and Signs as a result of infection Positive for Bacteria / Fungus / Virus Task 1 : Fever Task 2 : Infection Evidence & Proof of infection One probable result of infection Task 3 : Mortality Mortality MIMIC - III Infection 2,000 data points Tasks : Fever à Infection à Mortality Features: 12 Infection related features : including heart rate, arterial blood pressure, and Glasgow Coma Scale(GCS) etc. 4,000 distinct hospital (ICU) records Tasks: Stay < 3 / Cardiac / Recovery à Mortality Features: 31 physiological signs including heart rate, respiratory rate, temperature, etc. Experiments Information on MIMIC - III Respiratory Failure, Heart Failure can be found in the supplementary file
  • 12. Quantitative Results 12 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 13. Quantitative Results 13 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red) 1
  • 14. Quantitative Results 14 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 15. Quantitative Results 15 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 16. Quantitative Results 16 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 17. Quantitative Results 17 STL : Singletask Learning MTL : Multitask Learning Our model, TP-AMTL obtains significant improvement over all Single-Task Learning and Multi-Task Learning(MTL) baselines on both datasets. Experiments Table 2. Task Performance of the MIMIC-III Infection and PhysioNet Dataset. (Average AUROC over 5 runs. MTL model accuracies lower than those of their STL counterparts are colored in red)
  • 18. Source features with low uncertainties transfer knowledge more, while at the target, features with high uncertainties receive more knowledge transfer. Qualitative Results: Knowledge Transfer Graph Normalized amount of knowledge transfer from multiple sources (task 𝑗 at time 𝑡) to task 𝑑 (normalized over the number of targets) 18 Normalized amount of knowledge transfer to multiple targets (task 𝑑 at time 𝑡) from task 𝑗 (normalized over the number of sources) Incoming Transfer to different Targets Outgoing Transfer from different Sources 𝛼!,# &,& + 𝛼!,# &,&'( + ⋯ + 𝛼!,# &,) 𝑇 − 𝑡 + 1 − (1) 𝛼!,% (,& + 𝛼!,% -,& + ⋯ + 𝛼!,% &,& 𝑡 − (2) Experiments
  • 19. Qualitative Results: Medical Interpretation 19 Interpretation of the Learned Knowledge Graph By analyzing selected clinical case studies, we could identify steps where knowledge transferred as we designed and meaningful medical events occur, which correlates with interactions between selected tasks. MechVent - Mechanical Ventilation, FiO2 - Fractional inspired Oxygen, SBP - Systolic arterial blood pressure, DBP - Diastolic arterial blood pressure, HR - Heart Rate, Temp - Body Temperature, Urine - Urine output, GCS - Glasgow Coma Score, WBC - White Blood Cell Count, Culture - Culture Results. Experiments
  • 20. Ablation Study 20 AMTL-Intratask Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer AMTL-Samestep TD-AMTL Deterministic variant of TP-AMTL Experiments TP-AMTL (constrained) Effectiveness of Future-to-Past Transfer TP-AMTL (epistemic) Effectiveness of Uncertainty Types TP-AMTL (aleatoric) 𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0) Knowledge Transfer only happens from the later timestep to earlier ones
  • 21. Ablation Study 21 AMTL-Intratask Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer AMTL-Samestep TD-AMTL Deterministic variant of TP-AMTL Experiments TP-AMTL (constrained) Effectiveness of Future-to-Past Transfer TP-AMTL (epistemic) Effectiveness of Uncertainty Types TP-AMTL (aleatoric) 𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0) Knowledge Transfer only happens from the later timestep to earlier ones
  • 22. Ablation Study 22 AMTL-Intratask Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer AMTL-Samestep TD-AMTL Deterministic variant of TP-AMTL Experiments TP-AMTL (constrained) Effectiveness of Future-to-Past Transfer TP-AMTL (epistemic) Effectiveness of Uncertainty Types TP-AMTL (aleatoric) 𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0) Knowledge Transfer only happens from the later timestep to earlier ones
  • 23. Ablation Study 23 AMTL-Intratask Effectiveness of Inter-Task and Inter-Timestep Knowledge Transfer AMTL-Samestep TD-AMTL Deterministic variant of TP-AMTL Experiments TP-AMTL (constrained) Effectiveness of Future-to-Past Transfer TP-AMTL (epistemic) Effectiveness of Uncertainty Types TP-AMTL (aleatoric) 𝑝. 𝑧% 𝑥, 𝜔 ∼ 𝒩(𝑧%; 𝜇%, 0) Knowledge Transfer only happens from the later timestep to earlier ones
  • 24. • We proposed a novel probabilistic asymmetric multi-task learning framework that allows asymmetric knowledge transfer between tasks at different timesteps, based on the uncertainty. • We use a probabilistic Bayesian formulation for asymmetric knowledge transfer, where the amount of knowledge transfer depends on the uncertainty at the feature level. • We validate our model on clinical risk prediction tasks, on which it achieves significant improvements over baselines and provides meaningful interpretations, including temporal relationships between tasks. Conclusions 24