SlideShare a Scribd company logo
1 of 55
Leveraging Text
Classification Strategies
for Clinical and Public
Health Applications
Karin M. Verspoor
@karinv
karin.verspoor@unimelb.edu.au
The University of Melbourne
Melbourne, Victoria, Australia
January 2016, Qatar Computing Research Institute
(clinical) Data everywhere
• Electronic health records
– Patient demographics and biometrics
– Laboratory test results
– Clinical notes
• Radiology and pathology
– Images: X-ray, MRI and PET Scans
– (Synoptic) Reports
• Databases
– Health Service reporting
– National Prescribing Service
– Registry data, Births and Deaths
– Medicare/insurance claim data etc…..
Don’t forget unstructured data!
• About 80% of clinical information is in textual form
– ED triage notes
– Clinical progress notes
– Radiology and Pathology reports
– GP and specialist letters
– Discharge summaries
• Published Literature
– Clinical Trials
– Molecular-level studies
• and … social media text!
How is text used in medicine?
• Direct analysis of clinical records
– Information retrieval for clinical trials
– Syndromic surveillance
– Hospital Services Research
– Clinical Decision Support
– Pharmacovigilance
• Literature mining
– Evidence-based medicine
– Systematic Reviews
Evidence from EHRs
Mining electronic health records: towards better research applications and clinical care
Peter B. Jensen, Lars J. Jensen & Søren Brunak, Nature Reviews Genetics 13, 395-405 (2012)
doi:10.1038/nrg3208
Pharmacovigilance from EHRs
Mining of clinical records to identify adverse drug events
Estimated >90% of adverse events do not appear in coded data
6
LePendu et al. (2013) “Pharmacovigilance Using Clinical Notes” Clinical Pharmacology &
Therapeutics 93(6), 547–555; doi: 10.1038/clpt.2013.47
… from social media
Pacific Symposium on Biocomputing Shared Task on
Social Media Mining
Classification of tweets: mention an Adverse Drug Reaction?
ADR classified
@NAME Q makes me hungry.
Olanzapine made me want to
eat my own arm!
Non-ADR classified
I couldnt be a chef without
nicotine and caffeine
Outline
Problem setting
Approach and Results
EHR disease classification
Kocbek et al (2015) Evaluating classification power of linked admission data
sources with text mining; Proceedings of the Scientific Stream at Big Data in Health
Analytics 2015 (BigData 2015).
ICD classification of EHR data
• We address the task of detecting clinical records
in a large record system corresponding to a given
diagnosis of interest, based on text analysis
• We focus on lung cancer records for a pilot study
• We developed a system that classifies each
admission as positive or negative for lung cancer
• Not as simple as looking for “lung cancer” or
synonyms in the EHRs!
Kocbek et al (2015) HISA Big Data conference.
http://ceur-ws.org/Vol-1468/bd2015_kocbek.pdf
Alfred REASON platform
Kocbek et al. Big Data 2015, Sydney
• 15+ years of data from.
• 171,000+ updates each day.
• 62.4 million updates per annum.
Radiology question
50yo complaining of left shoulder
pain. Tender generally. Difficulty
abducting the shoulder past 45
degrees. Home on HITH
tomorrow - either inpatient or
outpatient please
Task
Radiology report
Mobile Chest performed on 02-JUN-2012
at 08:27 AM: The nasogastric tube has
its tip in the stomach. The tracheostomy is
seen at T2 level. ….
Pathology report
Urine Culture
Acc No: 12-183-0731Source: Urine
------------ URINE MICROSCOPY (PHASE
CONTRAST) ------------- Leucocytes
x10^6/L (Ref <10).... <10
Erythrocytes x10^6/L (Ref <10).. <10.......
Additional data
Age: 50
Date of admission: Jun/12
Gender: F
Country: …
Admission
ICD-10 code
Data Characteristics
• Extracted data for 2 financial years, 2012-2014:
– 150,521 admissions,
– 40,800 radiology reports with associated question,
– 20,872 pathology reports,
– 121,700 additional data entries (demographics, hospital
admission info).
• Admissions are associated to ICD-10 codes:
– Used as ground truth
– ICD-10 code C34.*; positive cases for lung cancer
– 496 such positive admissions
– an additional 496 non-lung cancer submissions
randomly subsampled as negatives
Outline
Problem setting
Approach and Results
EHR disease classification
Research Question
• Most previous TM applications use a single
textual data source from the EHR despite a
diversity of potential data
• What is the impact of using more than one textual
data source for the EHR classification task?
– Considering different text sources;
– and including patient (structured) meta-data?
Methods
Radiology
reports
Machine learning
algorithm (SVM)
Textual and
other features
Biomedical
knowledge
sources Language
processing
Classification
Model
Additional
data
Pathology
reports
Radiology
questions
REASON sources
Text Processing
• Medical terminology recognition and normalisation
using MetaMap
• NegEx to detect negation and negation scope
The nasogastric tube has its tip in the stomach.
Meta Candidates (Total=9; Excluded=0; Pruned=0; Remaining=9)
1000 C0085678:Nasogastric tube [Medical Device]
1000 C0812428:Nasogastric tube (Nasogastric tube procedures) [Therapeutic Procedure]
861 C0175730:Tube (biomedical tube device) [Medical Device]
861 C0694637:Nasogastric (Nasogastric Route of Drug Administration) [Functional Concept]
861 C1547937:Tube NOS (Specimen Source Codes - Tube) [Intellectual Product]
861 C1561954:tube [Conceptual Entity]
861 C1704730:TUBE (Packaging Tube) [Medical Device]
861 C1704731:Tube (Tube Device Component) [Medical Device]
861 C3282907:Nasogastric [Body Location or Region]
Meta Mapping (1000):
1000 C0085678:Nasogastric tube [Medical Device]
Features
Texts
• bag of (MetaMap) phrases
– separate feature for Positive/Negative context
– experimented with keeping phrases separated
according to source, or merging across sources
Patient meta-data
• demographic data (gender, age, ethnic origin, country,
language, marital status, religion, and death date)
• hospital-related admission data (hospital code,
admission date and time, discharge date and time, length
of stay, reason for admission, admission unit, discharge
unit, admission type, source, destination and criteria)
Experimental setting
• Heavily skewed data: undersampling of negatives
• 10-fold cross validation
• Support Vector Machine (Weka)
Results: Lung Cancer
0.873
0.901
0.870
0.885
0.900
0.915
0.930
1 2 3 4
radiology reports + 1 data source
(F-Score)
radiology question pathology report additional data
0.873
0.901
0.917
0.870
0.885
0.900
0.915
0.930
1 2 3 4
radiology reports + 2 additional data sources
(F-score)
radiology question pathology reports additional data
Results: Lung Cancer
0.873
0.901
0.917
0.930
0.870
0.885
0.900
0.915
0.930
1 2 3 4
F-Score using 4 data sources
radiology question pathology reports additional data
Results: Lung Cancer
Discussion
• More data sources lead to better performance
• The classifier with the highest performance was
built using features from all four data sources
• Merging sources into aggregate features better
• Not all improvements are significant:
– Radiology question and metadata add clear value
– Pathology reports does not
• Not all admissions had a pathology report associated with
them.
Case study 1: Conclusions
• We built a text mining system for detecting lung
cancer admissions using machine learning
methods.
• Our results show more effective systems can
generally be built by including multiple linked data
sources.
• Work in progress:
– Other diseases
– Imbalanced datasets
– Feature engineering
and selection
0.893
0.820
0.830
0.840
0.850
0.860
0.870
0.880
0.890
0.900
0.910
0.920
1 2 3 4
Breast cancer
Outline
DOD with Twitter
Emotion classification
DOD signal 1: Tweet emotion shift
DOD signal 2: Tweet lexical shift
Disease Outbreak Detection
Ofoghi et al (2016) Towards early discovery of salient health threats: A social media
emotion classification technique; Pacific Symposium on Biocomputing.
25
Twitter for Outbreak Detection
Assumptions
• People tweet about diseases in the context of
emerging outbreaks
• Twitter can provide an “early warning” of an
outbreak
“Tweets started to rise in Nigeria 3-7 days prior
to the official announcement of the first probable
Ebola case. The topics discussed in tweets include
risk factors, prevention education, disease
trends, and compassion.”
Amer J Infection Control (2015)
“Early warning” tweets
Ebola on Twitter
28
Twitter for Outbreak Detection
Strategy
• Trends: counting of (hashtag, term) frequencies
• Coupled with geographic origin of tweets
• Sentiment or content analyis
Challenges
• High volume of (mostly irrelevant) tweets
• Hashtags alone may not be adequate
• A mention of a disease does not necessarily
indicate an active case
Many reasons to mention Ebola
DOD with Twitter | Previous Work
31
Is there a local emergent threat?
Can we use shifts in
emotional and lexical content of tweets
to detect a disease outbreak?
A sliding window model
Ebola event/background data
Dataset Date (±7) pre-corpus post-corpus
#tweets |vocab| #tweets |vocab|
ebola-event-1 29-Dec-14 73 204 337 906
ebola-event-2 31-Jan-15 165 700 90 417
ebola-background 16-12-14 429 1453 340 1208
Outline
DOD with Twitter
Emotion classification
DOD signal 1: Tweet emotion shift
DOD signal 2: Tweet lexical shift
Emotion classes
• ECs: Ekman’s six basic emotions plus …
– News-related
– Criticism
– Sarcasm
https://www.behance.net/gallery/6-Basic-Emotions/930168
Sarcastic
atsign atsign think I got Ebola
there two minutes ago
News-related
atsign Another 4 American
Ebola workers flown back to
USA for monitoring..
Emotion classifier data
• Data: collection
– Twitter API
– Second half of March 2015
– Total of 12,101 tweets
– Contained “ebola” or “#ebola”
– 4,405 tweets remained after some filtering…
– Amazon’s Mechanical Turk was used to label tweets
Lexicon-Based Classification
• Created an emotion vocabulary
– Profile of Mood States (POMS)
– FrameNet
– Existing “feelings list”
– Wikipedia
• Vector space model
– Binary vector per emotion
– Binary vector per tweet
– Cosine Similarity emotion vs tweet
1
2
3 anxious
4
5
6
7 affronted
8
9
497
498
499 :-|
.
.
.
https://bitbucket.org/readbiomed/socialsurveillance
39
Outline
DOD with Twitter
Emotion classification
DOD signal 1: Tweet emotion shift
DOD signal 2: Tweet lexical shift
Emotion class distribution
Classes Dataset p-value
6 emotions ebola-event-1 0.004*
ebola-event-2 0.002*
ebola-backgr. 0.259
6 emotions + 3 add’l ebola-event-1 0.009*
ebola-event-2 0.007*
ebola-backgr. 0.079
paired t-test, pre- and post-event windows; * Statistically significant at 5% level
Jensen-Shannon divergence
Class ebola-
event-1
ebola-
event-2
ebola-bg.
Sarcasm 0.0227 0.0032 0.1365
News-rel. 0.0226 0.0001 0.0074
Anger 0.0572 0.0382 0.0169
Criticism 0.0180 0.0056 0.0060
Surprise 0.1161 0.0220 0.0023
Fear 0.0768 0.0813 0.0913
Happiness 0.0444 0.0415 0.0064
Disgust 0.0604 0.0025 0.0044
Sadness 0.0023 0.0322 0.0060
AVERAGE 0.0467 0.0252 0.0308
Big differences
compared with
background,
in both e1 and e2
Outline
DOD with Twitter
Emotion classification
DOD signal 1: Tweet emotion shift
DOD signal 2: Tweet lexical shift
Lexical shift analysis
Within-corpus analysis:
Cross-corpus analysis:
Term freq changes: Event 1
Term freq changes: Background
Case study 2: Conclusions
• We introduced an Ebola tweet-based emotion
classifier.
• There are statistically significant differences in the
distribution of emotion classes and lexical items in
tweets preceding and following a salient emergent
health threat.
• This effect does not occur in a neutral background
collection.
Proposal:
• Disease outbreak detection can be supported with
monitoring of tweets using a sliding window model
that tests for such distributional changes
Conclusions
• There are myriad problems in the clinical context
where unstructured data can be leveraged to
good effect
• Text classification is one tool that can be drawn
on to make use of this unstructured data
• Heterogeneous data integration is also important
• Challenges exist in
– Terminology
– Skewed data
– Missing data
Acknowledgements
• Amazon Mechanical Turkers
• James McCaw, Melbourne School of Population and
Global Health
Bahador Ofoghi Lawrence CavedonSimon Kocbek
Thank you!
ML-Based Classification
• MALLET Naïve Bayes
• Features
– bag of words[+lem,-lem]
– Lexicon-based similarity
– emotion vocabulary
– emoticons
– punctuation
– (Stanford) sentiment
KL-Divergence, full vocabulary
Emotion-level distribution
KL-divergence (pre- vs. post-event, post- vs. pre-)
P(x) and Q(x) represent probability of positive and negative emotion classes
in the respective corpora
Top lexically distinct items
Log Likelihood analysis

More Related Content

What's hot

Information Technology: The Third Pillar of Medical Education
Information Technology: The Third Pillar of Medical EducationInformation Technology: The Third Pillar of Medical Education
Information Technology: The Third Pillar of Medical EducationBen Williams
 
Secondary Data Analysis
Secondary Data AnalysisSecondary Data Analysis
Secondary Data AnalysisREY DECASTRO
 
ePCR Research Paper for Western Journal for EM
ePCR Research Paper for Western Journal for EMePCR Research Paper for Western Journal for EM
ePCR Research Paper for Western Journal for EMlarry_johnson
 
HIV Tracking System in Forsyth County, NC
HIV Tracking System in Forsyth County, NCHIV Tracking System in Forsyth County, NC
HIV Tracking System in Forsyth County, NCwebbmother
 
Estimating the Statistical Significance of Classifiers used in the Predictio...
Estimating the Statistical Significance of Classifiers used in the  Predictio...Estimating the Statistical Significance of Classifiers used in the  Predictio...
Estimating the Statistical Significance of Classifiers used in the Predictio...IOSR Journals
 
Systematic Review Of Observational Studies By Yusuf Abdu Misau
Systematic Review Of Observational Studies By Yusuf Abdu MisauSystematic Review Of Observational Studies By Yusuf Abdu Misau
Systematic Review Of Observational Studies By Yusuf Abdu MisauYusuf Misau
 
Turning Big Data into Precision Medicine
Turning Big Data into Precision MedicineTurning Big Data into Precision Medicine
Turning Big Data into Precision MedicineMatthieu Schapranow
 
NER Public Health Digital Library Project
NER Public Health Digital Library ProjectNER Public Health Digital Library Project
NER Public Health Digital Library ProjectElaine Martin
 
Global Dementia Legacy Event: Dr Neil Buckholtz
Global Dementia Legacy Event: Dr Neil Buckholtz Global Dementia Legacy Event: Dr Neil Buckholtz
Global Dementia Legacy Event: Dr Neil Buckholtz Department of Health
 
The reality of moving towards precision medicine
The reality of moving towards precision medicineThe reality of moving towards precision medicine
The reality of moving towards precision medicineElia Stupka
 
Use of Electronic Devices for National License Examination Self-Study among R...
Use of Electronic Devices for National License Examination Self-Study among R...Use of Electronic Devices for National License Examination Self-Study among R...
Use of Electronic Devices for National License Examination Self-Study among R...Nawanan Theera-Ampornpunt
 
Recruit(1)
Recruit(1)Recruit(1)
Recruit(1)wangjiaz
 
Towards Digitally Enabled Genomic Medicine: the Patient of The Future
Towards Digitally Enabled Genomic Medicine: the Patient of The FutureTowards Digitally Enabled Genomic Medicine: the Patient of The Future
Towards Digitally Enabled Genomic Medicine: the Patient of The FutureLarry Smarr
 
Biosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di Tada
Biosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di TadaBiosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di Tada
Biosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di TadaTaha Kass-Hout, MD, MS
 
IRJET - Chronic or Acute Disease with Doctor Specialist using Data Mining
IRJET -  	  Chronic or Acute Disease with Doctor Specialist using Data MiningIRJET -  	  Chronic or Acute Disease with Doctor Specialist using Data Mining
IRJET - Chronic or Acute Disease with Doctor Specialist using Data MiningIRJET Journal
 
How Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineHow Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineMatthieu Schapranow
 
Simplifying semantics for biomedical applications
Simplifying semantics for biomedical applicationsSimplifying semantics for biomedical applications
Simplifying semantics for biomedical applicationsSemantic Web San Diego
 

What's hot (20)

Information Technology: The Third Pillar of Medical Education
Information Technology: The Third Pillar of Medical EducationInformation Technology: The Third Pillar of Medical Education
Information Technology: The Third Pillar of Medical Education
 
Secondary Data Analysis
Secondary Data AnalysisSecondary Data Analysis
Secondary Data Analysis
 
ePCR Research Paper for Western Journal for EM
ePCR Research Paper for Western Journal for EMePCR Research Paper for Western Journal for EM
ePCR Research Paper for Western Journal for EM
 
HIV Tracking System in Forsyth County, NC
HIV Tracking System in Forsyth County, NCHIV Tracking System in Forsyth County, NC
HIV Tracking System in Forsyth County, NC
 
Estimating the Statistical Significance of Classifiers used in the Predictio...
Estimating the Statistical Significance of Classifiers used in the  Predictio...Estimating the Statistical Significance of Classifiers used in the  Predictio...
Estimating the Statistical Significance of Classifiers used in the Predictio...
 
Systematic Review Of Observational Studies By Yusuf Abdu Misau
Systematic Review Of Observational Studies By Yusuf Abdu MisauSystematic Review Of Observational Studies By Yusuf Abdu Misau
Systematic Review Of Observational Studies By Yusuf Abdu Misau
 
Turning Big Data into Precision Medicine
Turning Big Data into Precision MedicineTurning Big Data into Precision Medicine
Turning Big Data into Precision Medicine
 
NER Public Health Digital Library Project
NER Public Health Digital Library ProjectNER Public Health Digital Library Project
NER Public Health Digital Library Project
 
Global Dementia Legacy Event: Dr Neil Buckholtz
Global Dementia Legacy Event: Dr Neil Buckholtz Global Dementia Legacy Event: Dr Neil Buckholtz
Global Dementia Legacy Event: Dr Neil Buckholtz
 
The reality of moving towards precision medicine
The reality of moving towards precision medicineThe reality of moving towards precision medicine
The reality of moving towards precision medicine
 
NETWORK OF DISEASES AND ITS ENDOWMENT TOWARDS DISEASE
NETWORK OF DISEASES AND ITS ENDOWMENT TOWARDS DISEASE NETWORK OF DISEASES AND ITS ENDOWMENT TOWARDS DISEASE
NETWORK OF DISEASES AND ITS ENDOWMENT TOWARDS DISEASE
 
Use of Electronic Devices for National License Examination Self-Study among R...
Use of Electronic Devices for National License Examination Self-Study among R...Use of Electronic Devices for National License Examination Self-Study among R...
Use of Electronic Devices for National License Examination Self-Study among R...
 
Recruit(1)
Recruit(1)Recruit(1)
Recruit(1)
 
Towards Digitally Enabled Genomic Medicine: the Patient of The Future
Towards Digitally Enabled Genomic Medicine: the Patient of The FutureTowards Digitally Enabled Genomic Medicine: the Patient of The Future
Towards Digitally Enabled Genomic Medicine: the Patient of The Future
 
Time trends & patterns, TB
Time trends & patterns, TBTime trends & patterns, TB
Time trends & patterns, TB
 
Health IT Project
Health IT Project Health IT Project
Health IT Project
 
Biosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di Tada
Biosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di TadaBiosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di Tada
Biosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di Tada
 
IRJET - Chronic or Acute Disease with Doctor Specialist using Data Mining
IRJET -  	  Chronic or Acute Disease with Doctor Specialist using Data MiningIRJET -  	  Chronic or Acute Disease with Doctor Specialist using Data Mining
IRJET - Chronic or Acute Disease with Doctor Specialist using Data Mining
 
How Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision MedicineHow Real-time Analysis turns Big Medical Data into Precision Medicine
How Real-time Analysis turns Big Medical Data into Precision Medicine
 
Simplifying semantics for biomedical applications
Simplifying semantics for biomedical applicationsSimplifying semantics for biomedical applications
Simplifying semantics for biomedical applications
 

Viewers also liked

111102 wildevuur-ehealth-evers
111102 wildevuur-ehealth-evers111102 wildevuur-ehealth-evers
111102 wildevuur-ehealth-everslikewildfire
 
Clinical-Community Relationships as a Pathway To Improve Health: Tools for Re...
Clinical-Community Relationships as a Pathway To Improve Health: Tools for Re...Clinical-Community Relationships as a Pathway To Improve Health: Tools for Re...
Clinical-Community Relationships as a Pathway To Improve Health: Tools for Re...AHRQ Health Care Innovations Exchange
 
Ar ramtha assessment
Ar ramtha assessmentAr ramtha assessment
Ar ramtha assessmentHani Abu-Dieh
 
Ecological models of health behavior
Ecological models of health behaviorEcological models of health behavior
Ecological models of health behaviorVibha Amblihalli
 
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...Ilya Kuzovkin
 
Principles of community health nursing
Principles of community health nursingPrinciples of community health nursing
Principles of community health nursingmary jacob
 
WhosOn live Chat - Analytics, Interface Design &amp; CRM Intergration
WhosOn live Chat - Analytics, Interface Design &amp; CRM IntergrationWhosOn live Chat - Analytics, Interface Design &amp; CRM Intergration
WhosOn live Chat - Analytics, Interface Design &amp; CRM Intergrationianrowley
 
Introduction about nicotine
Introduction about nicotineIntroduction about nicotine
Introduction about nicotineIMT ProHunt
 
Models Of Health Behaviors By Yusuf Abdu Misau
Models Of Health Behaviors By Yusuf Abdu MisauModels Of Health Behaviors By Yusuf Abdu Misau
Models Of Health Behaviors By Yusuf Abdu MisauYusuf Misau
 
Unsupervised Learning with Apache Spark
Unsupervised Learning with Apache SparkUnsupervised Learning with Apache Spark
Unsupervised Learning with Apache SparkDB Tsai
 
(1) introduction to community health nursing
(1) introduction to community  health nursing(1) introduction to community  health nursing
(1) introduction to community health nursingDr. Nazar Jaf
 
Health education lecture 1
Health education lecture 1Health education lecture 1
Health education lecture 1cchaudoin87
 
Health system models-an overview
Health system models-an overviewHealth system models-an overview
Health system models-an overviewAhmed-Refat Refat
 
Deep Learning: Theory, History, State of the Art & Practical Tools
Deep Learning: Theory, History, State of the Art & Practical ToolsDeep Learning: Theory, History, State of the Art & Practical Tools
Deep Learning: Theory, History, State of the Art & Practical ToolsIlya Kuzovkin
 
Epidemiology and community health
Epidemiology and community healthEpidemiology and community health
Epidemiology and community healthSaleh Ahmed
 
Supervised and unsupervised learning
Supervised and unsupervised learningSupervised and unsupervised learning
Supervised and unsupervised learningParas Kohli
 
Automating Web Analytics
Automating Web AnalyticsAutomating Web Analytics
Automating Web AnalyticsAnand Bagmar
 

Viewers also liked (20)

111102 wildevuur-ehealth-evers
111102 wildevuur-ehealth-evers111102 wildevuur-ehealth-evers
111102 wildevuur-ehealth-evers
 
Clinical-Community Relationships as a Pathway To Improve Health: Tools for Re...
Clinical-Community Relationships as a Pathway To Improve Health: Tools for Re...Clinical-Community Relationships as a Pathway To Improve Health: Tools for Re...
Clinical-Community Relationships as a Pathway To Improve Health: Tools for Re...
 
Ar ramtha assessment
Ar ramtha assessmentAr ramtha assessment
Ar ramtha assessment
 
Ecological models of health behavior
Ecological models of health behaviorEcological models of health behavior
Ecological models of health behavior
 
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
Article overview: Unsupervised Learning of Visual Structure Using Predictive ...
 
Principles of community health nursing
Principles of community health nursingPrinciples of community health nursing
Principles of community health nursing
 
WhosOn live Chat - Analytics, Interface Design &amp; CRM Intergration
WhosOn live Chat - Analytics, Interface Design &amp; CRM IntergrationWhosOn live Chat - Analytics, Interface Design &amp; CRM Intergration
WhosOn live Chat - Analytics, Interface Design &amp; CRM Intergration
 
The new discovery about Ebola virus
The new discovery about Ebola virusThe new discovery about Ebola virus
The new discovery about Ebola virus
 
Introduction about nicotine
Introduction about nicotineIntroduction about nicotine
Introduction about nicotine
 
Words of Wisdom
Words of WisdomWords of Wisdom
Words of Wisdom
 
Models Of Health Behaviors By Yusuf Abdu Misau
Models Of Health Behaviors By Yusuf Abdu MisauModels Of Health Behaviors By Yusuf Abdu Misau
Models Of Health Behaviors By Yusuf Abdu Misau
 
Unsupervised Learning with Apache Spark
Unsupervised Learning with Apache SparkUnsupervised Learning with Apache Spark
Unsupervised Learning with Apache Spark
 
(1) introduction to community health nursing
(1) introduction to community  health nursing(1) introduction to community  health nursing
(1) introduction to community health nursing
 
Health education lecture 1
Health education lecture 1Health education lecture 1
Health education lecture 1
 
Health system models-an overview
Health system models-an overviewHealth system models-an overview
Health system models-an overview
 
Deep Learning: Theory, History, State of the Art & Practical Tools
Deep Learning: Theory, History, State of the Art & Practical ToolsDeep Learning: Theory, History, State of the Art & Practical Tools
Deep Learning: Theory, History, State of the Art & Practical Tools
 
Epidemiology and community health
Epidemiology and community healthEpidemiology and community health
Epidemiology and community health
 
Supervised and unsupervised learning
Supervised and unsupervised learningSupervised and unsupervised learning
Supervised and unsupervised learning
 
Automating Web Analytics
Automating Web AnalyticsAutomating Web Analytics
Automating Web Analytics
 
Health education
Health educationHealth education
Health education
 

Similar to Leveraging Text Classification Strategies for Clinical and Public Health Applications

Using real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questionsUsing real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questionsKarin Verspoor
 
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019Warren Kibbe
 
2010 06 07 - LOINC Introduction
2010 06 07 - LOINC Introduction2010 06 07 - LOINC Introduction
2010 06 07 - LOINC Introductiondvreeman
 
UCSF Informatics Day 2014 - Ida Sim, "Informatics Technologies: From a Data-C...
UCSF Informatics Day 2014 - Ida Sim, "Informatics Technologies: From a Data-C...UCSF Informatics Day 2014 - Ida Sim, "Informatics Technologies: From a Data-C...
UCSF Informatics Day 2014 - Ida Sim, "Informatics Technologies: From a Data-C...CTSI at UCSF
 
Brendan Delany – Chair in Medical Informatics and Decision Making, Imperial...
  Brendan Delany – Chair in Medical Informatics and Decision Making, Imperial...  Brendan Delany – Chair in Medical Informatics and Decision Making, Imperial...
Brendan Delany – Chair in Medical Informatics and Decision Making, Imperial...HIMSS UK
 
Methods for Observational Comparative Effectiveness Research on Healthcare De...
Methods for Observational Comparative Effectiveness Research on Healthcare De...Methods for Observational Comparative Effectiveness Research on Healthcare De...
Methods for Observational Comparative Effectiveness Research on Healthcare De...Marion Sills
 
Clinical Genomics and Medicine
Clinical Genomics and MedicineClinical Genomics and Medicine
Clinical Genomics and MedicineWarren Kibbe
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinarPistoia Alliance
 
The Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesThe Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesPhilip Payne
 
Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...lucenerevolution
 
Towards a National Learning Health System - Aziz Sheikh
Towards a National Learning Health System - Aziz SheikhTowards a National Learning Health System - Aziz Sheikh
Towards a National Learning Health System - Aziz SheikhNIHR CLAHRC West Midlands
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Remedy Informatics
 
The Future of Personalized Medicine
The Future of Personalized MedicineThe Future of Personalized Medicine
The Future of Personalized MedicineEdgewater
 
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...CTSI at UCSF
 
eHealth Practice in Europe: where do we stand?
eHealth Practice in Europe: where do we stand?eHealth Practice in Europe: where do we stand?
eHealth Practice in Europe: where do we stand?chronaki
 
LIMS in Modern Molecular Pathology by Dr. Perry Maxwell
LIMS in Modern Molecular Pathology by Dr. Perry MaxwellLIMS in Modern Molecular Pathology by Dr. Perry Maxwell
LIMS in Modern Molecular Pathology by Dr. Perry MaxwellCirdan
 
Automated Abstracting - NCRA San Antonio 2015
Automated Abstracting - NCRA San Antonio 2015Automated Abstracting - NCRA San Antonio 2015
Automated Abstracting - NCRA San Antonio 2015Victor Brunka
 

Similar to Leveraging Text Classification Strategies for Clinical and Public Health Applications (20)

Watson – from Jeopardy to healthcare
Watson – from Jeopardy to healthcareWatson – from Jeopardy to healthcare
Watson – from Jeopardy to healthcare
 
Using real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questionsUsing real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questions
 
Bioterrorism Talk.ppt
Bioterrorism Talk.pptBioterrorism Talk.ppt
Bioterrorism Talk.ppt
 
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
Data in precision oncology SAMSI Precision Medicine Meeting mar 2019
 
PMED: APPM Workshop: Data & Analytics in Precision Oncology- Warren Kibbe, Ma...
PMED: APPM Workshop: Data & Analytics in Precision Oncology- Warren Kibbe, Ma...PMED: APPM Workshop: Data & Analytics in Precision Oncology- Warren Kibbe, Ma...
PMED: APPM Workshop: Data & Analytics in Precision Oncology- Warren Kibbe, Ma...
 
2010 06 07 - LOINC Introduction
2010 06 07 - LOINC Introduction2010 06 07 - LOINC Introduction
2010 06 07 - LOINC Introduction
 
UCSF Informatics Day 2014 - Ida Sim, "Informatics Technologies: From a Data-C...
UCSF Informatics Day 2014 - Ida Sim, "Informatics Technologies: From a Data-C...UCSF Informatics Day 2014 - Ida Sim, "Informatics Technologies: From a Data-C...
UCSF Informatics Day 2014 - Ida Sim, "Informatics Technologies: From a Data-C...
 
Brendan Delany – Chair in Medical Informatics and Decision Making, Imperial...
  Brendan Delany – Chair in Medical Informatics and Decision Making, Imperial...  Brendan Delany – Chair in Medical Informatics and Decision Making, Imperial...
Brendan Delany – Chair in Medical Informatics and Decision Making, Imperial...
 
Methods for Observational Comparative Effectiveness Research on Healthcare De...
Methods for Observational Comparative Effectiveness Research on Healthcare De...Methods for Observational Comparative Effectiveness Research on Healthcare De...
Methods for Observational Comparative Effectiveness Research on Healthcare De...
 
Clinical Genomics and Medicine
Clinical Genomics and MedicineClinical Genomics and Medicine
Clinical Genomics and Medicine
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinar
 
The Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesThe Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across Scales
 
Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...Next generation electronic medical records and search a test implementation i...
Next generation electronic medical records and search a test implementation i...
 
Towards a National Learning Health System - Aziz Sheikh
Towards a National Learning Health System - Aziz SheikhTowards a National Learning Health System - Aziz Sheikh
Towards a National Learning Health System - Aziz Sheikh
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
 
The Future of Personalized Medicine
The Future of Personalized MedicineThe Future of Personalized Medicine
The Future of Personalized Medicine
 
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
UCSF Informatics Day 2014 - Sorena Nadaf, "Translational Informatics OnCore C...
 
eHealth Practice in Europe: where do we stand?
eHealth Practice in Europe: where do we stand?eHealth Practice in Europe: where do we stand?
eHealth Practice in Europe: where do we stand?
 
LIMS in Modern Molecular Pathology by Dr. Perry Maxwell
LIMS in Modern Molecular Pathology by Dr. Perry MaxwellLIMS in Modern Molecular Pathology by Dr. Perry Maxwell
LIMS in Modern Molecular Pathology by Dr. Perry Maxwell
 
Automated Abstracting - NCRA San Antonio 2015
Automated Abstracting - NCRA San Antonio 2015Automated Abstracting - NCRA San Antonio 2015
Automated Abstracting - NCRA San Antonio 2015
 

More from Karin Verspoor

Robogals 10th Anniversary Gala Keynote, Karin Verspoor
Robogals 10th Anniversary Gala Keynote, Karin VerspoorRobogals 10th Anniversary Gala Keynote, Karin Verspoor
Robogals 10th Anniversary Gala Keynote, Karin VerspoorKarin Verspoor
 
Doctor Digital will see you now
Doctor Digital will see you nowDoctor Digital will see you now
Doctor Digital will see you nowKarin Verspoor
 
Using text mining to inform genetic variant interpretation
Using text mining to inform genetic variant interpretationUsing text mining to inform genetic variant interpretation
Using text mining to inform genetic variant interpretationKarin Verspoor
 
Function and Phenotype Prediction through Data and Knowledge Fusion
Function and Phenotype Prediction through Data and Knowledge FusionFunction and Phenotype Prediction through Data and Knowledge Fusion
Function and Phenotype Prediction through Data and Knowledge FusionKarin Verspoor
 
Syndromic Surveillance from Emergency Department Triage Notes
Syndromic Surveillance from Emergency Department Triage NotesSyndromic Surveillance from Emergency Department Triage Notes
Syndromic Surveillance from Emergency Department Triage NotesKarin Verspoor
 
Topic modeling of Emergency Department Triage notes for characterising pain-r...
Topic modeling of Emergency Department Triage notes for characterising pain-r...Topic modeling of Emergency Department Triage notes for characterising pain-r...
Topic modeling of Emergency Department Triage notes for characterising pain-r...Karin Verspoor
 
Medical Information Retrieval Workshop Keynote (MedIR@SIGIR2014)
Medical Information Retrieval Workshop Keynote (MedIR@SIGIR2014)Medical Information Retrieval Workshop Keynote (MedIR@SIGIR2014)
Medical Information Retrieval Workshop Keynote (MedIR@SIGIR2014)Karin Verspoor
 

More from Karin Verspoor (7)

Robogals 10th Anniversary Gala Keynote, Karin Verspoor
Robogals 10th Anniversary Gala Keynote, Karin VerspoorRobogals 10th Anniversary Gala Keynote, Karin Verspoor
Robogals 10th Anniversary Gala Keynote, Karin Verspoor
 
Doctor Digital will see you now
Doctor Digital will see you nowDoctor Digital will see you now
Doctor Digital will see you now
 
Using text mining to inform genetic variant interpretation
Using text mining to inform genetic variant interpretationUsing text mining to inform genetic variant interpretation
Using text mining to inform genetic variant interpretation
 
Function and Phenotype Prediction through Data and Knowledge Fusion
Function and Phenotype Prediction through Data and Knowledge FusionFunction and Phenotype Prediction through Data and Knowledge Fusion
Function and Phenotype Prediction through Data and Knowledge Fusion
 
Syndromic Surveillance from Emergency Department Triage Notes
Syndromic Surveillance from Emergency Department Triage NotesSyndromic Surveillance from Emergency Department Triage Notes
Syndromic Surveillance from Emergency Department Triage Notes
 
Topic modeling of Emergency Department Triage notes for characterising pain-r...
Topic modeling of Emergency Department Triage notes for characterising pain-r...Topic modeling of Emergency Department Triage notes for characterising pain-r...
Topic modeling of Emergency Department Triage notes for characterising pain-r...
 
Medical Information Retrieval Workshop Keynote (MedIR@SIGIR2014)
Medical Information Retrieval Workshop Keynote (MedIR@SIGIR2014)Medical Information Retrieval Workshop Keynote (MedIR@SIGIR2014)
Medical Information Retrieval Workshop Keynote (MedIR@SIGIR2014)
 

Recently uploaded

Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...narwatsonia7
 
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service JaipurHigh Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipurparulsinha
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowRiya Pathan
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photosnarwatsonia7
 
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowKolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowNehru place Escorts
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Availablenarwatsonia7
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...CALL GIRLS
 
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment BookingCall Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment Bookingnarwatsonia7
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safenarwatsonia7
 
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls ServiceCall Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Servicesonalikaur4
 
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.MiadAlsulami
 
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...Miss joya
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Modelssonalikaur4
 
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment BookingHousewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Bookingnarwatsonia7
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Serviceparulsinha
 
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near MeHi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Menarwatsonia7
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...Garima Khatri
 
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...narwatsonia7
 

Recently uploaded (20)

Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
Call Girls Frazer Town Just Call 7001305949 Top Class Call Girl Service Avail...
 
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCREscort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
Escort Service Call Girls In Sarita Vihar,, 99530°56974 Delhi NCR
 
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service JaipurHigh Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
High Profile Call Girls Jaipur Vani 8445551418 Independent Escort Service Jaipur
 
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Servicesauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
sauth delhi call girls in Bhajanpura 🔝 9953056974 🔝 escort Service
 
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowSonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Sonagachi Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
 
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original PhotosCall Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
Call Girl Service Bidadi - For 7001305949 Cheap & Best with original Photos
 
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call NowKolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
Kolkata Call Girls Services 9907093804 @24x7 High Class Babes Here Call Now
 
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service AvailableCall Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
Call Girls Jp Nagar Just Call 7001305949 Top Class Call Girl Service Available
 
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
Call Girls Service Surat Samaira ❤️🍑 8250192130 👄 Independent Escort Service ...
 
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment BookingCall Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
Call Girl Koramangala | 7001305949 At Low Cost Cash Payment Booking
 
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% SafeBangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
Bangalore Call Girls Marathahalli 📞 9907093804 High Profile Service 100% Safe
 
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls ServiceCall Girls Thane Just Call 9910780858 Get High Class Call Girls Service
Call Girls Thane Just Call 9910780858 Get High Class Call Girls Service
 
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
Artifacts in Nuclear Medicine with Identifying and resolving artifacts.
 
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
Russian Call Girls in Pune Riya 9907093804 Short 1500 Night 6000 Best call gi...
 
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking ModelsMumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
Mumbai Call Girls Service 9910780858 Real Russian Girls Looking Models
 
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment BookingHousewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
Housewife Call Girls Hoskote | 7001305949 At Low Cost Cash Payment Booking
 
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort ServiceCall Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
Call Girls Service In Shyam Nagar Whatsapp 8445551418 Independent Escort Service
 
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near MeHi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
Hi,Fi Call Girl In Mysore Road - 7001305949 | 24x7 Service Available Near Me
 
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
VIP Mumbai Call Girls Hiranandani Gardens Just Call 9920874524 with A/C Room ...
 
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
Call Girls Electronic City Just Call 7001305949 Top Class Call Girl Service A...
 

Leveraging Text Classification Strategies for Clinical and Public Health Applications

  • 1. Leveraging Text Classification Strategies for Clinical and Public Health Applications Karin M. Verspoor @karinv karin.verspoor@unimelb.edu.au The University of Melbourne Melbourne, Victoria, Australia January 2016, Qatar Computing Research Institute
  • 2. (clinical) Data everywhere • Electronic health records – Patient demographics and biometrics – Laboratory test results – Clinical notes • Radiology and pathology – Images: X-ray, MRI and PET Scans – (Synoptic) Reports • Databases – Health Service reporting – National Prescribing Service – Registry data, Births and Deaths – Medicare/insurance claim data etc…..
  • 3. Don’t forget unstructured data! • About 80% of clinical information is in textual form – ED triage notes – Clinical progress notes – Radiology and Pathology reports – GP and specialist letters – Discharge summaries • Published Literature – Clinical Trials – Molecular-level studies • and … social media text!
  • 4. How is text used in medicine? • Direct analysis of clinical records – Information retrieval for clinical trials – Syndromic surveillance – Hospital Services Research – Clinical Decision Support – Pharmacovigilance • Literature mining – Evidence-based medicine – Systematic Reviews
  • 5. Evidence from EHRs Mining electronic health records: towards better research applications and clinical care Peter B. Jensen, Lars J. Jensen & Søren Brunak, Nature Reviews Genetics 13, 395-405 (2012) doi:10.1038/nrg3208
  • 6. Pharmacovigilance from EHRs Mining of clinical records to identify adverse drug events Estimated >90% of adverse events do not appear in coded data 6 LePendu et al. (2013) “Pharmacovigilance Using Clinical Notes” Clinical Pharmacology & Therapeutics 93(6), 547–555; doi: 10.1038/clpt.2013.47
  • 7. … from social media Pacific Symposium on Biocomputing Shared Task on Social Media Mining Classification of tweets: mention an Adverse Drug Reaction? ADR classified @NAME Q makes me hungry. Olanzapine made me want to eat my own arm! Non-ADR classified I couldnt be a chef without nicotine and caffeine
  • 8. Outline Problem setting Approach and Results EHR disease classification Kocbek et al (2015) Evaluating classification power of linked admission data sources with text mining; Proceedings of the Scientific Stream at Big Data in Health Analytics 2015 (BigData 2015).
  • 9. ICD classification of EHR data • We address the task of detecting clinical records in a large record system corresponding to a given diagnosis of interest, based on text analysis • We focus on lung cancer records for a pilot study • We developed a system that classifies each admission as positive or negative for lung cancer • Not as simple as looking for “lung cancer” or synonyms in the EHRs! Kocbek et al (2015) HISA Big Data conference. http://ceur-ws.org/Vol-1468/bd2015_kocbek.pdf
  • 10. Alfred REASON platform Kocbek et al. Big Data 2015, Sydney • 15+ years of data from. • 171,000+ updates each day. • 62.4 million updates per annum.
  • 11. Radiology question 50yo complaining of left shoulder pain. Tender generally. Difficulty abducting the shoulder past 45 degrees. Home on HITH tomorrow - either inpatient or outpatient please Task Radiology report Mobile Chest performed on 02-JUN-2012 at 08:27 AM: The nasogastric tube has its tip in the stomach. The tracheostomy is seen at T2 level. …. Pathology report Urine Culture Acc No: 12-183-0731Source: Urine ------------ URINE MICROSCOPY (PHASE CONTRAST) ------------- Leucocytes x10^6/L (Ref <10).... <10 Erythrocytes x10^6/L (Ref <10).. <10....... Additional data Age: 50 Date of admission: Jun/12 Gender: F Country: … Admission ICD-10 code
  • 12. Data Characteristics • Extracted data for 2 financial years, 2012-2014: – 150,521 admissions, – 40,800 radiology reports with associated question, – 20,872 pathology reports, – 121,700 additional data entries (demographics, hospital admission info). • Admissions are associated to ICD-10 codes: – Used as ground truth – ICD-10 code C34.*; positive cases for lung cancer – 496 such positive admissions – an additional 496 non-lung cancer submissions randomly subsampled as negatives
  • 13. Outline Problem setting Approach and Results EHR disease classification
  • 14. Research Question • Most previous TM applications use a single textual data source from the EHR despite a diversity of potential data • What is the impact of using more than one textual data source for the EHR classification task? – Considering different text sources; – and including patient (structured) meta-data?
  • 15. Methods Radiology reports Machine learning algorithm (SVM) Textual and other features Biomedical knowledge sources Language processing Classification Model Additional data Pathology reports Radiology questions REASON sources
  • 16. Text Processing • Medical terminology recognition and normalisation using MetaMap • NegEx to detect negation and negation scope The nasogastric tube has its tip in the stomach. Meta Candidates (Total=9; Excluded=0; Pruned=0; Remaining=9) 1000 C0085678:Nasogastric tube [Medical Device] 1000 C0812428:Nasogastric tube (Nasogastric tube procedures) [Therapeutic Procedure] 861 C0175730:Tube (biomedical tube device) [Medical Device] 861 C0694637:Nasogastric (Nasogastric Route of Drug Administration) [Functional Concept] 861 C1547937:Tube NOS (Specimen Source Codes - Tube) [Intellectual Product] 861 C1561954:tube [Conceptual Entity] 861 C1704730:TUBE (Packaging Tube) [Medical Device] 861 C1704731:Tube (Tube Device Component) [Medical Device] 861 C3282907:Nasogastric [Body Location or Region] Meta Mapping (1000): 1000 C0085678:Nasogastric tube [Medical Device]
  • 17. Features Texts • bag of (MetaMap) phrases – separate feature for Positive/Negative context – experimented with keeping phrases separated according to source, or merging across sources Patient meta-data • demographic data (gender, age, ethnic origin, country, language, marital status, religion, and death date) • hospital-related admission data (hospital code, admission date and time, discharge date and time, length of stay, reason for admission, admission unit, discharge unit, admission type, source, destination and criteria)
  • 18. Experimental setting • Heavily skewed data: undersampling of negatives • 10-fold cross validation • Support Vector Machine (Weka)
  • 19. Results: Lung Cancer 0.873 0.901 0.870 0.885 0.900 0.915 0.930 1 2 3 4 radiology reports + 1 data source (F-Score) radiology question pathology report additional data
  • 20. 0.873 0.901 0.917 0.870 0.885 0.900 0.915 0.930 1 2 3 4 radiology reports + 2 additional data sources (F-score) radiology question pathology reports additional data Results: Lung Cancer
  • 21. 0.873 0.901 0.917 0.930 0.870 0.885 0.900 0.915 0.930 1 2 3 4 F-Score using 4 data sources radiology question pathology reports additional data Results: Lung Cancer
  • 22. Discussion • More data sources lead to better performance • The classifier with the highest performance was built using features from all four data sources • Merging sources into aggregate features better • Not all improvements are significant: – Radiology question and metadata add clear value – Pathology reports does not • Not all admissions had a pathology report associated with them.
  • 23. Case study 1: Conclusions • We built a text mining system for detecting lung cancer admissions using machine learning methods. • Our results show more effective systems can generally be built by including multiple linked data sources. • Work in progress: – Other diseases – Imbalanced datasets – Feature engineering and selection 0.893 0.820 0.830 0.840 0.850 0.860 0.870 0.880 0.890 0.900 0.910 0.920 1 2 3 4 Breast cancer
  • 24. Outline DOD with Twitter Emotion classification DOD signal 1: Tweet emotion shift DOD signal 2: Tweet lexical shift Disease Outbreak Detection Ofoghi et al (2016) Towards early discovery of salient health threats: A social media emotion classification technique; Pacific Symposium on Biocomputing.
  • 25. 25
  • 26. Twitter for Outbreak Detection Assumptions • People tweet about diseases in the context of emerging outbreaks • Twitter can provide an “early warning” of an outbreak “Tweets started to rise in Nigeria 3-7 days prior to the official announcement of the first probable Ebola case. The topics discussed in tweets include risk factors, prevention education, disease trends, and compassion.” Amer J Infection Control (2015)
  • 29. Twitter for Outbreak Detection Strategy • Trends: counting of (hashtag, term) frequencies • Coupled with geographic origin of tweets • Sentiment or content analyis Challenges • High volume of (mostly irrelevant) tweets • Hashtags alone may not be adequate • A mention of a disease does not necessarily indicate an active case
  • 30. Many reasons to mention Ebola
  • 31. DOD with Twitter | Previous Work 31
  • 32. Is there a local emergent threat? Can we use shifts in emotional and lexical content of tweets to detect a disease outbreak?
  • 34. Ebola event/background data Dataset Date (±7) pre-corpus post-corpus #tweets |vocab| #tweets |vocab| ebola-event-1 29-Dec-14 73 204 337 906 ebola-event-2 31-Jan-15 165 700 90 417 ebola-background 16-12-14 429 1453 340 1208
  • 35. Outline DOD with Twitter Emotion classification DOD signal 1: Tweet emotion shift DOD signal 2: Tweet lexical shift
  • 36. Emotion classes • ECs: Ekman’s six basic emotions plus … – News-related – Criticism – Sarcasm https://www.behance.net/gallery/6-Basic-Emotions/930168 Sarcastic atsign atsign think I got Ebola there two minutes ago News-related atsign Another 4 American Ebola workers flown back to USA for monitoring..
  • 37. Emotion classifier data • Data: collection – Twitter API – Second half of March 2015 – Total of 12,101 tweets – Contained “ebola” or “#ebola” – 4,405 tweets remained after some filtering… – Amazon’s Mechanical Turk was used to label tweets
  • 38. Lexicon-Based Classification • Created an emotion vocabulary – Profile of Mood States (POMS) – FrameNet – Existing “feelings list” – Wikipedia • Vector space model – Binary vector per emotion – Binary vector per tweet – Cosine Similarity emotion vs tweet 1 2 3 anxious 4 5 6 7 affronted 8 9 497 498 499 :-| . . . https://bitbucket.org/readbiomed/socialsurveillance
  • 39. 39
  • 40. Outline DOD with Twitter Emotion classification DOD signal 1: Tweet emotion shift DOD signal 2: Tweet lexical shift
  • 41. Emotion class distribution Classes Dataset p-value 6 emotions ebola-event-1 0.004* ebola-event-2 0.002* ebola-backgr. 0.259 6 emotions + 3 add’l ebola-event-1 0.009* ebola-event-2 0.007* ebola-backgr. 0.079 paired t-test, pre- and post-event windows; * Statistically significant at 5% level
  • 42. Jensen-Shannon divergence Class ebola- event-1 ebola- event-2 ebola-bg. Sarcasm 0.0227 0.0032 0.1365 News-rel. 0.0226 0.0001 0.0074 Anger 0.0572 0.0382 0.0169 Criticism 0.0180 0.0056 0.0060 Surprise 0.1161 0.0220 0.0023 Fear 0.0768 0.0813 0.0913 Happiness 0.0444 0.0415 0.0064 Disgust 0.0604 0.0025 0.0044 Sadness 0.0023 0.0322 0.0060 AVERAGE 0.0467 0.0252 0.0308 Big differences compared with background, in both e1 and e2
  • 43. Outline DOD with Twitter Emotion classification DOD signal 1: Tweet emotion shift DOD signal 2: Tweet lexical shift
  • 44. Lexical shift analysis Within-corpus analysis: Cross-corpus analysis:
  • 46. Term freq changes: Background
  • 47. Case study 2: Conclusions • We introduced an Ebola tweet-based emotion classifier. • There are statistically significant differences in the distribution of emotion classes and lexical items in tweets preceding and following a salient emergent health threat. • This effect does not occur in a neutral background collection. Proposal: • Disease outbreak detection can be supported with monitoring of tweets using a sliding window model that tests for such distributional changes
  • 48. Conclusions • There are myriad problems in the clinical context where unstructured data can be leveraged to good effect • Text classification is one tool that can be drawn on to make use of this unstructured data • Heterogeneous data integration is also important • Challenges exist in – Terminology – Skewed data – Missing data
  • 49. Acknowledgements • Amazon Mechanical Turkers • James McCaw, Melbourne School of Population and Global Health Bahador Ofoghi Lawrence CavedonSimon Kocbek
  • 51. ML-Based Classification • MALLET Naïve Bayes • Features – bag of words[+lem,-lem] – Lexicon-based similarity – emotion vocabulary – emoticons – punctuation – (Stanford) sentiment
  • 53. Emotion-level distribution KL-divergence (pre- vs. post-event, post- vs. pre-) P(x) and Q(x) represent probability of positive and negative emotion classes in the respective corpora

Editor's Notes

  1. Another way we are making sense of data is by uncovering data that is buried in text. Where can text be found? Read bullets How do we uncover hidden information. Explain the researchers’ background required to do text mining.
  2. --Alerting: Method used to detect drug-adverse effects earlier than official detection (by profiling back in time) --Look at drug-drug interactions as well as single drug-ADE associations
  3. Initiative by Alfred Health Informatics Department. Technologies, tools and large-scale data sources to support: REsearch, AnalysiS, and OperatioNs Integrates data sources from multiple hospital departments. Historical patient data linked by unique Unit Record number.
  4. Work in progress: - this method does not always work (E.g., breast cancer – other approaches are needed).