Introduction Results Description Conclusions
HISA Big Data 2014 – April 3rd 2014 ( #BD14 )
Enhancing Diagnostics for Invasive Aspergillosis using
Machine Learning
Simone Romano
simone.romano@unimelb.edu.au
@ialuronico
James Bailey1
Lawrence Cavedon1,2,3
Orla Morrissey4,5
Monica Slavin6,7
Karin Verspoor1,2
1The University of Melbourne, Dept. of Computing and Information Systems
2NICTA (National ICT Aust.) VRL
3School of Computer Science and IT, RMIT University
4Alfred Health 5Monash University
6Peter MacCallum Cancer Centre 7Melbourne Health
Simone Romano The University of Melbourne
Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning
Introduction
Invasive Aspergillosis
Challenging Big Data Task
Results
Diagnostic Model
Description
Machine Learning for Diagnosis
Diagnosis of Invasive Aspergillosis
Conclusions
Summary
Future Work
Invasive Aspergillosis
Invasive Aspergillosis (IA)
Serious fungal infection and major cause of
mortality in patients undergoing allogeneic
stem cell transplantation or chemotherapy
for acute leukaemia.
Figure: Pulmonary IA.
Source: http://en.wikipedia.org/wiki/Aspergillosis
Facts
34–43% mortality rate;
culture methods have low sensitivity: only 40–50% of IA cases are identified;
each IA patient incurs +7 days of hospital stay and +$30,957 in costs.
Diagnosis and Treatment
Cases are classified as ProvenIA, ProbableIA, or PossibleIA.
Current criteria for diagnosing IA are:
1. microbiology, risk factors, and CT scan findings;
2. improved biomarkers such as Aspergillus PCR and Galactomannan
(GM), tested twice a week.
positive biopsy OR (positive CT scan AND single positive PCR/GM)
⇒ ProvenIA
≥ 2 consecutive positive PCR/GM in a 2-week time frame
⇒ ProbableIA
Problem
A single positive biomarker test might be a false positive
⇒ Unnecessary harmful treatment.
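The two labelling rules above can be read as a small decision procedure. The sketch below is only an illustration: the function name `classify_ia` and the input layout are hypothetical, and "consecutive" is assumed to mean adjacent tests in date order. The slide gives no rule for PossibleIA, so the sketch returns `None` when neither rule fires.

```python
from datetime import date, timedelta

def classify_ia(biopsy_positive, ct_positive, biomarker_tests):
    """Apply the two rules from the slide.

    biomarker_tests: list of (date, is_positive) PCR/GM results (hypothetical layout).
    Returns "ProvenIA", "ProbableIA", or None when neither rule fires.
    """
    tests = sorted(biomarker_tests)
    any_positive = any(positive for _, positive in tests)

    # Rule 1: positive biopsy OR (positive CT scan AND a single positive PCR/GM).
    if biopsy_positive or (ct_positive and any_positive):
        return "ProvenIA"

    # Rule 2: at least two consecutive positive PCR/GM results within 2 weeks.
    # Assumption: "consecutive" means adjacent tests in date order.
    for (day_a, pos_a), (day_b, pos_b) in zip(tests, tests[1:]):
        if pos_a and pos_b and (day_b - day_a) <= timedelta(days=14):
            return "ProbableIA"

    return None  # the slide states no rule for PossibleIA
```

A lone positive PCR/GM result without a positive CT scan or biopsy triggers neither rule, which is exactly the ambiguous case the rest of the talk addresses.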
Challenging Big Data Task
Big Data task
In a randomised controlled trial comparing two different strategies for
diagnosing IA, a large amount of data was collected from 240 patients
between Sept. 2005 and Nov. 2009 at six Australian centres.
Objective: leverage this data to produce more accurate predictions of
IA with machine learning techniques.
Are we really dealing with Big Data?
All patients were tracked for 26 weeks, providing rich longitudinal data on
daily and weekly tests for each patient.
240 × 26 × 7 = 43,680 records.
Bedside interpretation is a challenging task!
Diagnostic Model
Model
Our training set is a collection of 358 single positive biomarker tests that
precede the earliest label of IA.
Figure: timeline starting when transplant/chemotherapy begins and covering the 1st to 5th months, marking the positive biomarkers and the infection.
Just 29 of the positive biomarkers were associated with a Proven IA or
Probable IA label within a week (329 false positives);
Built a model that outputs the probability of infection within a week;
Validated with a patient-level cross-validation framework.
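Patient-level cross-validation means that all tests from one patient fall on the same side of every train/test split, so the model is never evaluated on a patient it has already seen. The slides do not say how the folds were built. The sketch below is one simple way to do it, with a hypothetical function name and input layout.

```python
from collections import defaultdict

def patient_level_folds(patient_ids, n_folds=5):
    """Yield (train_idx, test_idx) so that no patient appears on both sides.

    patient_ids[i] is the patient that test i belongs to (hypothetical layout).
    """
    by_patient = defaultdict(list)
    for index, patient in enumerate(patient_ids):
        by_patient[patient].append(index)

    # Deal patients round-robin into folds, patients with most tests first,
    # to keep fold sizes roughly similar.
    folds = [[] for _ in range(n_folds)]
    ordered = sorted(by_patient, key=lambda p: (-len(by_patient[p]), str(p)))
    for position, patient in enumerate(ordered):
        folds[position % n_folds].extend(by_patient[patient])

    for held_out in range(n_folds):
        test_idx = sorted(folds[held_out])
        train_idx = sorted(
            i for k, fold in enumerate(folds) if k != held_out for i in fold
        )
        yield train_idx, test_idx
```

Splitting by test instead of by patient would let near-identical measurements from one patient leak between training and evaluation, which tends to make results look better than they are.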
Figure: ROC curve of the model, TPR against 1 − TNR, both axes from 0.0 to 1.0. AUC = 0.63.
The AUC is not too good, but the model is good at classifying negatives!
Result
Setting a low threshold on the model output probability to achieve a high
NPV (100%), we were able to identify 95 tests (26.5%) that do not
lead to an IA infection within a week (TNR = 28.9%).
⇒ Doctors can avoid starting treatment in 26.5% of cases!
avoid over-treatment;
reduce drug toxicity;
reduce antifungal drug costs
(e.g. Amphotericin B: $8,260 per patient per week).
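The reported figures follow from the counts given in the talk: 358 positive tests, 29 of them followed by infection, and 95 ruled out with no infection missed. The sketch below recomputes them and shows one way a rule-out threshold could be picked. The function names are hypothetical, and the threshold rule (the lowest score among true infections) is an assumption about how "a low threshold" might be chosen, not the authors' stated procedure.

```python
def rule_out_threshold(scores, labels):
    """Largest threshold t such that every test scoring below t is a true negative.

    scores: model probabilities of infection within a week; labels: 1 = infection, 0 = none.
    """
    return min(s for s, y in zip(scores, labels) if y == 1)

def npv_and_tnr(scores, labels, threshold):
    """Tests with score < threshold are predicted negative."""
    tn = sum(1 for s, y in zip(scores, labels) if s < threshold and y == 0)
    fn = sum(1 for s, y in zip(scores, labels) if s < threshold and y == 1)
    negatives = sum(1 for y in labels if y == 0)
    npv = tn / (tn + fn) if (tn + fn) else float("nan")
    tnr = tn / negatives
    return npv, tnr

# Counts reported in the talk: 358 tests, 29 infections, 95 tests ruled out, none missed.
tn, fn, total, infections = 95, 0, 358, 29
print(f"NPV = {tn / (tn + fn):.1%}")             # NPV = 100.0%
print(f"TNR = {tn / (total - infections):.1%}")  # TNR = 28.9%
print(f"ruled out = {tn / total:.1%}")           # ruled out = 26.5%
```

A threshold chosen this way gives NPV = 100% on the data it was chosen from by construction. Whether it holds on unseen patients is what the patient-level cross-validation is meant to check.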
Machine Learning for Diagnosis
Classification Models
Logistic regression;
Decision trees;
Random forest
Figure: random forest. The training set is resampled (five resamples shown), a random tree is grown on each resample, and the trees are combined by voting.
Random forest was chosen because:
it can work with heterogeneous features
(categorical/continuous);
it can work with many features.
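The resample, grow, and vote scheme behind a random forest can be sketched in a few lines. This is a deliberately simplified illustration, not the model used in the talk: each "tree" here is a one-split stump on a single randomly chosen feature, labels are assumed to be 0/1, and all names are hypothetical.

```python
import random
from collections import Counter

def fit_stump(rows, labels, rng):
    """Fit a one-split tree on one randomly chosen feature (stand-in for a random tree)."""
    feature = rng.randrange(len(rows[0]))
    best = None
    for threshold in sorted({row[feature] for row in rows}):
        left = [y for row, y in zip(rows, labels) if row[feature] <= threshold]
        right = [y for row, y in zip(rows, labels) if row[feature] > threshold]
        if not left or not right:
            continue
        left_label = Counter(left).most_common(1)[0][0]
        right_label = Counter(right).most_common(1)[0][0]
        errors = sum(y != left_label for y in left) + sum(y != right_label for y in right)
        if best is None or errors < best[0]:
            best = (errors, threshold, left_label, right_label)
    if best is None:  # feature is constant in this resample: predict the majority class
        majority = Counter(labels).most_common(1)[0][0]
        return (feature, float("inf"), majority, majority)
    _, threshold, left_label, right_label = best
    return (feature, threshold, left_label, right_label)

def fit_forest(rows, labels, n_trees=25, seed=0):
    """Grow each stump on a bootstrap resample of the training set."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]  # sample with replacement
        forest.append(fit_stump([rows[i] for i in idx], [labels[i] for i in idx], rng))
    return forest

def predict_proba(forest, row):
    """Fraction of trees voting for class 1."""
    votes = sum((left if row[f] <= t else right) for f, t, left, right in forest)
    return votes / len(forest)
```

The vote fraction returned by `predict_proba` plays the role of the model output probability on which a threshold can later be set.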
Diagnosis of Invasive Aspergillosis
Features to use
Known at baseline: gender, age, BMI, smoking status, etc.
Daily tested: neutrophil count, body temperature, amount of
administered steroids, haemoglobin, platelets, white cell count, urea,
creatinine, ALT, AST, GGT, bilirubin, LDH, etc.
Very heterogeneous features!!!
Heterogeneous Features
Features constant along the treatment: Age, Gender, etc.
Features that varied over time: neutrophil count, temperature,
corticosteroid doses, etc.
When we have a positive biomarker test, we can use information from the
recent past to predict IA. We define the recent past as the values in the
3-week window prior to a single positive test result.
Figure: temperature (axis from 36.5 to 38.5) against date (May to Jul), with the window highlighted.
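Selecting the recent past for one positive test amounts to a date filter. In the sketch below, the function name and the `(date, value)` layout are hypothetical, and the window is assumed to cover the 21 days strictly before the test day. The slide does not say whether the test day itself is included.

```python
from datetime import date, timedelta

def recent_past(measurements, positive_test_day, window_days=21):
    """Return the (day, value) pairs in the window before a positive test.

    measurements: iterable of (date, value) pairs (hypothetical layout).
    Assumption: the window covers the 21 days strictly before the test day.
    """
    start = positive_test_day - timedelta(days=window_days)
    return sorted(
        (day, value)
        for day, value in measurements
        if start <= day < positive_test_day
    )
```

Each positive biomarker test gets its own window, so the same daily measurement can contribute to more than one training example for a patient.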
Features that varied over time
Duration features: we count the number of days the value of each
parameter lay within a particular range. For example, we divide the
temperature measurements into the intervals [36,37],
(37,38], (38,39], (39,40], and greater than 40 (>40) degrees
Celsius and count the number of days the temperature fell in
each interval;
Trajectories: we select two days in the 3-week window preceding a
positive test and compute the mean value, the standard
deviation, and the relative difference between those values. We
do this for all possible intervals in the window.
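Both feature families can be computed from the list of daily values in one window. The sketch below uses hypothetical names and makes several assumptions that the slide leaves open: temperatures below 36 are not counted in any bin, the mean and standard deviation are taken over all values between the two selected days, the standard deviation is the population one, and the relative difference is (last − first) / first.

```python
from itertools import combinations
from statistics import mean, pstdev

TEMPERATURE_BINS = [  # (label, lower bound, upper bound); upper bound is inclusive
    ("[36,37]", 36.0, 37.0),
    ("(37,38]", 37.0, 38.0),
    ("(38,39]", 38.0, 39.0),
    ("(39,40]", 39.0, 40.0),
    (">40", 40.0, float("inf")),
]

def duration_features(daily_values, bins=TEMPERATURE_BINS):
    """Count the number of days the value fell in each interval."""
    counts = {label: 0 for label, _, _ in bins}
    for value in daily_values:
        for position, (label, low, high) in enumerate(bins):
            # Only the first bin is closed on the left, as in [36,37].
            above_low = value >= low if position == 0 else value > low
            if above_low and value <= high:
                counts[label] += 1
                break
    return counts

def trajectory_features(daily_values):
    """Summarise every interval (start day, end day) in the window."""
    features = {}
    for start, end in combinations(range(len(daily_values)), 2):
        segment = daily_values[start:end + 1]
        first, last = daily_values[start], daily_values[end]
        features[(start, end)] = {
            "mean": mean(segment),
            "std": pstdev(segment),
            # Undefined when the first value is zero (e.g. a zero cell count).
            "relative_difference": (last - first) / first if first else float("nan"),
        }
    return features
```

For a window of n days this produces n(n − 1)/2 intervals per parameter, which is one reason a learner that copes with many features is a natural fit.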
Summary
Target: enhance biomarker-based diagnostics for Invasive
Aspergillosis;
Method: random forest on heterogeneous features, including
duration features and trajectory features;
Validation: patient-level cross-validation;
Results: setting a low threshold on the output probability gives NPV =
100% and TNR = 28.9%. Safe avoidance of antifungal
therapy in 26.5% of cases. Savings of around $8K per patient
per week.
Future Work
make the model more accurate in predicting when a positive test is
associated with an immediate infection to trigger the antifungal
treatment earlier in time;
search for alternative diagnoses when the outcomes are equally
probable according to the model;
make the model output more interpretable to clinical practitioners,
e.g. by identifying the trajectories in the data which generate a low
or high probability of IA.
Thank you.
Questions?
Simone Romano The University of Melbourne
Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning

More Related Content

Similar to Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning

J.Gras HPV mRNA Eurogin 2010
J.Gras HPV mRNA Eurogin 2010J.Gras HPV mRNA Eurogin 2010
J.Gras HPV mRNA Eurogin 2010jgras
 
Exploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease InformaticsExploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease InformaticsNigel Collier
 
Role of the Laboratory in Antimicrobial Resistance Data
Role of the Laboratory in Antimicrobial Resistance DataRole of the Laboratory in Antimicrobial Resistance Data
Role of the Laboratory in Antimicrobial Resistance DataAnuj Sharma
 
Bioinformatics in dermato-oncology
Bioinformatics in dermato-oncologyBioinformatics in dermato-oncology
Bioinformatics in dermato-oncologyJoaquin Dopazo
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsIncedo
 
Sk microfluidics and lab on-a-chip-ch6
Sk microfluidics and lab on-a-chip-ch6Sk microfluidics and lab on-a-chip-ch6
Sk microfluidics and lab on-a-chip-ch6stanislas547
 
CSCI 6505 Machine Learning Project
CSCI 6505 Machine Learning ProjectCSCI 6505 Machine Learning Project
CSCI 6505 Machine Learning Projectbutest
 
[Startup Nations Summit 2014] Competition - Greece
[Startup Nations Summit 2014] Competition - Greece[Startup Nations Summit 2014] Competition - Greece
[Startup Nations Summit 2014] Competition - GreeceStartupNations
 
Bayesian modelling for COVID-19 seroprevalence studies
Bayesian modelling for COVID-19 seroprevalence studiesBayesian modelling for COVID-19 seroprevalence studies
Bayesian modelling for COVID-19 seroprevalence studiesChristian Perone
 
2018_june_ast_m39_update.pdf
2018_june_ast_m39_update.pdf2018_june_ast_m39_update.pdf
2018_june_ast_m39_update.pdfShahriarHabib4
 
IRJET- Survey Paper on Oral Cancer Detection using Machine Learning
IRJET-  	  Survey Paper on Oral Cancer Detection using Machine LearningIRJET-  	  Survey Paper on Oral Cancer Detection using Machine Learning
IRJET- Survey Paper on Oral Cancer Detection using Machine LearningIRJET Journal
 
Lab-on-a-Chip for cancer diagnostics and monitoring
Lab-on-a-Chip for cancer diagnostics and monitoringLab-on-a-Chip for cancer diagnostics and monitoring
Lab-on-a-Chip for cancer diagnostics and monitoringstanislas547
 
UAB Pulmonary board review study design and statistical principles
UAB Pulmonary board review study  design and statistical principles UAB Pulmonary board review study  design and statistical principles
UAB Pulmonary board review study design and statistical principles Terry Shaneyfelt
 
Laboratory diagnosis of (hiv)
Laboratory diagnosis of  (hiv)Laboratory diagnosis of  (hiv)
Laboratory diagnosis of (hiv)abdulrahman amer
 
Cancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision SystemCancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision SystemQuahog Life Sciences
 
Improving Prediction Accuracy Results by Using Q-Statistic Algorithm in High ...
Improving Prediction Accuracy Results by Using Q-Statistic Algorithm in High ...Improving Prediction Accuracy Results by Using Q-Statistic Algorithm in High ...
Improving Prediction Accuracy Results by Using Q-Statistic Algorithm in High ...rahulmonikasharma
 

Similar to Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning (20)

J.Gras HPV mRNA Eurogin 2010
J.Gras HPV mRNA Eurogin 2010J.Gras HPV mRNA Eurogin 2010
J.Gras HPV mRNA Eurogin 2010
 
Exploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease InformaticsExploiting NLP for Digital Disease Informatics
Exploiting NLP for Digital Disease Informatics
 
Role of the Laboratory in Antimicrobial Resistance Data
Role of the Laboratory in Antimicrobial Resistance DataRole of the Laboratory in Antimicrobial Resistance Data
Role of the Laboratory in Antimicrobial Resistance Data
 
Bioinformatics in dermato-oncology
Bioinformatics in dermato-oncologyBioinformatics in dermato-oncology
Bioinformatics in dermato-oncology
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
The Lachman Test
The Lachman TestThe Lachman Test
The Lachman Test
 
Sk microfluidics and lab on-a-chip-ch6
Sk microfluidics and lab on-a-chip-ch6Sk microfluidics and lab on-a-chip-ch6
Sk microfluidics and lab on-a-chip-ch6
 
CSCI 6505 Machine Learning Project
CSCI 6505 Machine Learning ProjectCSCI 6505 Machine Learning Project
CSCI 6505 Machine Learning Project
 
Sequencing and Beyond?
Sequencing and Beyond?Sequencing and Beyond?
Sequencing and Beyond?
 
[Startup Nations Summit 2014] Competition - Greece
[Startup Nations Summit 2014] Competition - Greece[Startup Nations Summit 2014] Competition - Greece
[Startup Nations Summit 2014] Competition - Greece
 
Bayesian modelling for COVID-19 seroprevalence studies
Bayesian modelling for COVID-19 seroprevalence studiesBayesian modelling for COVID-19 seroprevalence studies
Bayesian modelling for COVID-19 seroprevalence studies
 
2018_june_ast_m39_update.pdf
2018_june_ast_m39_update.pdf2018_june_ast_m39_update.pdf
2018_june_ast_m39_update.pdf
 
IRJET- Survey Paper on Oral Cancer Detection using Machine Learning
IRJET-  	  Survey Paper on Oral Cancer Detection using Machine LearningIRJET-  	  Survey Paper on Oral Cancer Detection using Machine Learning
IRJET- Survey Paper on Oral Cancer Detection using Machine Learning
 
Lab-on-a-Chip for cancer diagnostics and monitoring
Lab-on-a-Chip for cancer diagnostics and monitoringLab-on-a-Chip for cancer diagnostics and monitoring
Lab-on-a-Chip for cancer diagnostics and monitoring
 
UAB Pulmonary board review study design and statistical principles
UAB Pulmonary board review study  design and statistical principles UAB Pulmonary board review study  design and statistical principles
UAB Pulmonary board review study design and statistical principles
 
Laboratory diagnosis of (hiv)
Laboratory diagnosis of  (hiv)Laboratory diagnosis of  (hiv)
Laboratory diagnosis of (hiv)
 
K.3 Vineis
K.3 VineisK.3 Vineis
K.3 Vineis
 
Cancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision SystemCancer Care using Quahog Health Decision System
Cancer Care using Quahog Health Decision System
 
Use cases
Use casesUse cases
Use cases
 
Improving Prediction Accuracy Results by Using Q-Statistic Algorithm in High ...
Improving Prediction Accuracy Results by Using Q-Statistic Algorithm in High ...Improving Prediction Accuracy Results by Using Q-Statistic Algorithm in High ...
Improving Prediction Accuracy Results by Using Q-Statistic Algorithm in High ...
 

More from Simone Romano

Startups and you 2021
Startups and you 2021Startups and you 2021
Startups and you 2021Simone Romano
 
Measuring Dependency via Intrinsic Dimensionality (ICPR 2016)
Measuring Dependency via Intrinsic Dimensionality (ICPR 2016)Measuring Dependency via Intrinsic Dimensionality (ICPR 2016)
Measuring Dependency via Intrinsic Dimensionality (ICPR 2016)Simone Romano
 
A Framework to Adjust Dependency Measure Estimates for Chance
A Framework to Adjust Dependency Measure Estimates for Chance      A Framework to Adjust Dependency Measure Estimates for Chance
A Framework to Adjust Dependency Measure Estimates for Chance Simone Romano
 
PhD Completion Seminar
PhD Completion Seminar PhD Completion Seminar
PhD Completion Seminar Simone Romano
 
My Entry to the Sportsbet/CIKM competition
My Entry to the Sportsbet/CIKM competitionMy Entry to the Sportsbet/CIKM competition
My Entry to the Sportsbet/CIKM competitionSimone Romano
 

More from Simone Romano (6)

Startups and you 2021
Startups and you 2021Startups and you 2021
Startups and you 2021
 
Startups and You
Startups and YouStartups and You
Startups and You
 
Measuring Dependency via Intrinsic Dimensionality (ICPR 2016)
Measuring Dependency via Intrinsic Dimensionality (ICPR 2016)Measuring Dependency via Intrinsic Dimensionality (ICPR 2016)
Measuring Dependency via Intrinsic Dimensionality (ICPR 2016)
 
A Framework to Adjust Dependency Measure Estimates for Chance
A Framework to Adjust Dependency Measure Estimates for Chance      A Framework to Adjust Dependency Measure Estimates for Chance
A Framework to Adjust Dependency Measure Estimates for Chance
 
PhD Completion Seminar
PhD Completion Seminar PhD Completion Seminar
PhD Completion Seminar
 
My Entry to the Sportsbet/CIKM competition
My Entry to the Sportsbet/CIKM competitionMy Entry to the Sportsbet/CIKM competition
My Entry to the Sportsbet/CIKM competition
 

Recently uploaded

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Joonhun Lee
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flyPRADYUMMAURYA1
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 

Recently uploaded (20)

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 

Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning

  • 1. Introduction Results Description Conclusions HISA Big Data 2014 – April 3rd 2014 ( #BD14 ) Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning Simone Romano simone.romano@unimelb.edu.au @ialuronico James Bailey1 Lawrence Cavedon1,2,3 Orla Morrissey4,5 Monica slavin6,7 Karin Verspoor1,2 1The University of Melbourne, Dept. of Computing and Information Systems 2NICTA (National ICT Aust.) VRL 3School of Computer Science and IT, RMIT University 4Alfred Health 5Monash University 6Peter MacCallum Cancer Centre 7Melbourne Health Simone Romano The University of Melbourne Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning
  • 2. Introduction Results Description Conclusions Introduction Invasive Aspergillosis Challenging Big Data Task Results Diagnostic Model Description Machine Learning for Diagnosis Diagnosis of Invasive Aspergillosis Conclusions Summary Future Work Simone Romano The University of Melbourne Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning
  • 3. Introduction Results Description Conclusions Invasive Aspergillosis Invasive Aspergillosis (IA) Serious fungal infection and major cause of mortality in patients undergoing allogeneic stem cell transplantation or chemotherapy for acute leukaemia. Figure : Pulmonary IA. http://en.wikipedia.org/wiki/Aspergillosis Facts 34–43% mortality rate; culture methods low sensitivity, only 40–50% IA cases identified; IA patient results in +7 days of hospital stay and +$30,957. Simone Romano The University of Melbourne Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning
  • 4. Introduction Results Description Conclusions Invasive Aspergillosis Diagnosis and Treatment Cases are classified with ProvenIA/ProbableIA/PossibleIA. Current criteria for diagnosing IA are: 1. microbiology, risk factors, and CT scan findings; 2. Improved biomarkers such as Aspergillus PCR and Galactomannan (GM) tested twice a week. positive biopsy OR (positive CT scan AND single positive PCR/GM) ⇒ ProvenIA ≥ 2 consecutive positive PCR/GM in 2 week time frame ⇒ ProbableIA Problem One single positive biomarker might be a False Positive ⇒ Unnecessary harmful treatment. Simone Romano The University of Melbourne Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning
  • 5. Introduction Results Description Conclusions Challenging Big Data Task Big Data task In a randomised controlled trial comparing the two different strategies for diagnosis IA, large amount of data was collected from 240 patients between Sept. 2005 and Nov. 2009 at six Australian Centres. Objective: Leverage such data to produce more accurate prediction of IA with Machine Learning techniques. Are we really dealing with Big Data? Simone Romano The University of Melbourne Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning
  • 6. Introduction Results Description Conclusions Challenging Big Data Task Big Data task In a randomised controlled trial comparing the two different strategies for diagnosis IA, large amount of data was collected from 240 patients between Sept. 2005 and Nov. 2009 at six Australian Centres. Objective: Leverage such data to produce more accurate prediction of IA with Machine Learning techniques. Are we really dealing with Big Data? All patients tracked for 26 weeks providing rich longitudinal data on daily and weekly tests for each patient. 240 × 26 × 7 = 45,680 records. Bed-side interpretation is a challenging task! Simone Romano The University of Melbourne Enhancing Diagnostics for Invasive Aspergillosis using Machine Learning
  • 8. Diagnostic Model. Our training set is a collection of 358 single positive biomarker tests that precede the earliest label of IA. Figure: timeline from the start of transplant/chemotherapy over months 1 to 5, marking positive biomarkers and infection. Just 29 of the positive biomarkers were associated with a Proven IA or Probable IA label within a week (329 false positives). We built a model that outputs the probability of infection within a week, and validated it with a patient-level cross-validation framework.
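Patient-level cross-validation means that all tests from one patient fall on the same side of each train/test split, so the model is never evaluated on a patient it has already seen. A minimal sketch in plain Python follows; the function name `patient_level_folds`, the fold count, and the seed are assumptions, since the slides do not give these details.

```python
import random

def patient_level_folds(patient_ids, n_folds=5, seed=0):
    """Yield (train_idx, test_idx) so that no patient is split across folds.

    patient_ids[i] is the patient who produced record i
    (one record per single positive biomarker test).
    """
    patients = sorted(set(patient_ids))
    random.Random(seed).shuffle(patients)
    # Assign whole patients, not individual records, to folds.
    fold_of = {p: k % n_folds for k, p in enumerate(patients)}
    for fold in range(n_folds):
        test_idx = [i for i, p in enumerate(patient_ids) if fold_of[p] == fold]
        train_idx = [i for i, p in enumerate(patient_ids) if fold_of[p] != fold]
        yield train_idx, test_idx
```

Splitting by record instead would let several correlated tests from one patient leak between training and test sets and inflate the reported performance.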
  • 10. Diagnostic Model. Figure: ROC curve (x-axis: 1 − TNR, y-axis: TPR, both from 0.0 to 1.0), AUC = 0.63. The AUC is not very good.
  • 11. Diagnostic Model. Figure: ROC curve (x-axis: 1 − TNR, y-axis: TPR, both from 0.0 to 1.0), AUC = 0.63. But the model is good at classifying negatives!
  • 12. Result. By setting a low threshold on the model output probability to achieve a high NPV (100%), we were able to identify 95 tests (26.5%) that do not lead to an IA infection within a week (TNR = 28.9%). ⇒ Doctors can avoid starting treatment in 26.5% of cases! Benefits: avoid over-treatment; reduce drug toxicity; reduce antifungal drug costs (e.g. Amphotericin B: $8,260 per patient per week).
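The reported percentages follow directly from the counts on the slides (358 tests, 29 true positives, 95 tests ruled out). The sketch below reproduces that arithmetic and shows one simple way to pick a threshold that gives 100% NPV on a labelled set. The helper names are hypothetical, and the talk chose its threshold under cross-validation, which this sketch does not reproduce.

```python
def high_npv_threshold(scores, labels):
    """Largest threshold that leaves every true infection at or above it.

    Tests with score < threshold are called negative; on this data none of
    them is a true positive, so NPV is 100% by construction.
    """
    return min(s for s, y in zip(scores, labels) if y == 1)

def npv_and_tnr(scores, labels, threshold):
    """NPV = TN / (TN + FN); TNR = TN / (all true negatives)."""
    tn = sum(1 for s, y in zip(scores, labels) if s < threshold and y == 0)
    fn = sum(1 for s, y in zip(scores, labels) if s < threshold and y == 1)
    n_neg = sum(1 for y in labels if y == 0)
    npv = tn / (tn + fn) if (tn + fn) else float("nan")
    return npv, tn / n_neg

# Counts reported on the slides.
total, positives, ruled_out = 358, 29, 95
negatives = total - positives  # 329 false-positive tests
print(f"ruled out: {ruled_out / total:.1%} of all tests")
print(f"TNR:       {ruled_out / negatives:.1%} of the false positives")
```

The two percentages differ only in their denominator: 26.5% is relative to all 358 positive tests, while the TNR of 28.9% is relative to the 329 that were false positives.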
  • 14. Machine Learning for Diagnosis. Classification models considered: logistic regression; decision trees; random forest. Figure: random forest diagram (training set → repeated resampling → one random tree per resample → voting). Random forest was chosen because: it can work with heterogeneous features (categorical/continuous); it can work with many features.
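The resampling and voting steps in the diagram can be illustrated with a toy ensemble. Note what this is and is not: it bags one-split trees (decision stumps) on numeric features, each trained on a bootstrap resample with one randomly chosen feature, and combines them by majority vote. It is not a full random forest and not the model used in the talk, and all names are hypothetical.

```python
import random
from collections import Counter

def bootstrap(rows, rng):
    """Resample the training set with replacement (same size as the original)."""
    return [rng.choice(rows) for _ in rows]

def fit_stump(rows, rng):
    """Fit a one-split tree on a single randomly chosen feature."""
    j = rng.randrange(len(rows[0][0]))
    best = None
    for x, _ in rows:
        t = x[j]
        left = [y for xi, y in rows if xi[j] <= t]
        right = [y for xi, y in rows if xi[j] > t]
        if not right:
            continue  # this threshold does not split the data
        l_lab = Counter(left).most_common(1)[0][0]
        r_lab = Counter(right).most_common(1)[0][0]
        errors = sum(y != l_lab for y in left) + sum(y != r_lab for y in right)
        if best is None or errors < best[0]:
            best = (errors, j, t, l_lab, r_lab)
    if best is None:  # feature is constant in this resample: predict the majority
        maj = Counter(y for _, y in rows).most_common(1)[0][0]
        return (j, float("inf"), maj, maj)
    return best[1:]

def predict_stump(stump, x):
    j, t, left_label, right_label = stump
    return left_label if x[j] <= t else right_label

def fit_forest(rows, n_trees=25, seed=0):
    """rows: list of (feature_list, label). One stump per bootstrap resample."""
    rng = random.Random(seed)
    return [fit_stump(bootstrap(rows, rng), rng) for _ in range(n_trees)]

def predict_forest(forest, x):
    """Majority vote over the individual trees."""
    votes = Counter(predict_stump(s, x) for s in forest)
    return votes.most_common(1)[0][0]
```

Each tree sees a different resample and a different feature, so individual trees are weak but their errors are only partly correlated, which is what makes the vote useful.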
  • 16. Diagnosis of Invasive Aspergillosis: features to use. Known at baseline: gender, age, BMI, smoking status, etc. Tested daily: neutrophil count, body temperature, amount of administered steroids, haemoglobin, platelets, white cell count, urea, creatinine, ALT, AST, GGT, bilirubin, LDH, etc. Very heterogeneous features!
  • 17. Heterogeneous Features. Features constant throughout the treatment: age, gender, etc. Features that varied over time: neutrophil count, temperature, corticosteroid doses, etc. When we have a positive biomarker test, we can use information from the recent past to predict IA. We define the recent past as the values in the 3-week window prior to a single positive test result. Figure: temperature over time (x-axis: date, May to Jul; y-axis: temperature, 36.5 to 38.5), with the window marked.
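Extracting the 3-week window for one time-varying measurement is a simple date filter. The sketch below assumes the measurements are stored as a date-to-value mapping and that the day of the positive test itself is excluded; both are assumptions, as is the function name `recent_window`.

```python
from datetime import date, timedelta

def recent_window(measurements, test_date, days=21):
    """Return the values recorded in the `days` before `test_date`, in date order.

    measurements: dict mapping date -> value (e.g. daily temperature).
    The test day itself is excluded, which is an assumption.
    """
    start = test_date - timedelta(days=days)
    return [v for d, v in sorted(measurements.items()) if start <= d < test_date]
```

For example, for a positive test on 10 June the window covers 20 May through 9 June.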
  • 18. Features that varied over time. Duration features: we count the number of days on which the value of each parameter lay within a particular range. For example, we divide the temperature measurements into the intervals [36,37], (37,38], (38,39], (39,40], and above 40 (>40) degrees Celsius, and count the number of days on which the temperature fell in each interval. Trajectories: we select two days in the 3-week window preceding a positive test and compute the mean value, the standard deviation, and the relative difference between those values. We do this for all possible intervals in the window.
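Both feature families can be sketched in a few lines. The duration part follows the temperature intervals given on the slide. The trajectory part is one possible reading of the slide, stated here as an assumption: for every pair of days (i, j) in the window, take the mean and population standard deviation of the values from day i to day j, plus the relative difference between the two endpoint values. Function names are hypothetical.

```python
from itertools import combinations
from statistics import mean, pstdev

TEMP_BINS = ["[36,37]", "(37,38]", "(38,39]", "(39,40]", ">40"]

def duration_features(temps):
    """Count the days on which the temperature fell in each interval."""
    counts = dict.fromkeys(TEMP_BINS, 0)
    for t in temps:
        if 36 <= t <= 37:
            counts["[36,37]"] += 1
        elif 37 < t <= 38:
            counts["(37,38]"] += 1
        elif 38 < t <= 39:
            counts["(38,39]"] += 1
        elif 39 < t <= 40:
            counts["(39,40]"] += 1
        elif t > 40:
            counts[">40"] += 1
        # Readings below 36 are not covered by the slide's intervals; ignored here.
    return counts

def trajectory_features(values):
    """Map each day pair (i, j), i < j, to (mean, std, relative difference)."""
    feats = {}
    for i, j in combinations(range(len(values)), 2):
        segment = values[i:j + 1]
        start, end = values[i], values[j]
        rel = (end - start) / start if start != 0 else None
        feats[(i, j)] = (mean(segment), pstdev(segment), rel)
    return feats
```

Under this reading, a full 21-day window yields 21 × 20 / 2 = 210 intervals per measurement, which helps explain why a model that copes with many features was preferred.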
  • 20. Summary. Target: enhance biomarker-based diagnostics for Invasive Aspergillosis. Method: random forest on heterogeneous features, with engineered duration features and trajectory features. Validation: patient-level cross-validation. Results: setting a low threshold on the output probability gives NPV = 100% and TNR = 28.9%, allowing safe avoidance of antifungal therapy in 26.5% of cases, with savings of around $8K per patient per week.
  • 21. Future Work. Make the model more accurate in predicting when a positive test is associated with an immediate infection, so that antifungal treatment can be triggered earlier; search for alternative diagnoses when the outcomes are equally probable according to the model; make the model output more interpretable to clinical practitioners, e.g. by identifying the trajectories in the data that generate a low or high probability of IA.
  • 22. Thank you. Questions?