SlideShare a Scribd company logo
1 of 29
1
Implicit Entity Recognition in Clinical Documents
Sujan Perera1, Pablo Mendes2, Amit Sheth1, Krishnaprasad
Thirunarayan1, Adarsh Alex1, Christopher Heid3, Greg Mott3
1Kno.e.sis Center, Wright State University, 2IBM Research, San Jose,
3Boonshoft School of Medicine, Wight State University
“Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac
catheterization because of a positive exercise tolerance test. Recently, he
started to have left shoulder twinges and tingling in his hands. A stress test
done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes,
stopped due to fatigue. However, Mr. Smith is comfortably breathing in room
air. He also showed accumulation of fluid in his extremities. He does not have
any chest pain.”
Example
2Implicit Entity Recognition in Clinical Documents
“Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac
catheterization because of a positive exercise tolerance test. Recently, he
started to have left shoulder twinges and tingling in his hands. A stress test
done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes,
stopped due to fatigue. However, Mr. Smith is comfortably breathing in room
air. He also showed accumulation of fluid in his extremities. He does not have
any chest pain.”
Named Entity Recognition
Person Person
Example
3Implicit Entity Recognition in Clinical Documents
“Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac
catheterization because of a positive exercise tolerance test. Recently, he
started to have left shoulder twinges and tingling in his hands. A stress test
done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes,
stopped due to fatigue. However, Mr. Smith is comfortably breathing in room
air. He also showed accumulation of fluid in his extremities. He does not have
any chest pain.”
Named Entity Recognition
Entity Linking
Person Person C0018795
C0015672
C0008031
Example
4Implicit Entity Recognition in Clinical Documents
“Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac
catheterization because of a positive exercise tolerance test. Recently, he
started to have left shoulder twinges and tingling in his hands. A stress test
done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes,
stopped due to fatigue. However, Mr. Smith is comfortably breathing in room
air. He also showed accumulation of fluid in his extremities. He does not have
any chest pain.”
Named Entity Recognition
Entity Linking
Co-reference Resolution
Person Person C0018795
C0015672
C0008031
Example
5Implicit Entity Recognition in Clinical Documents
“Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac
catheterization because of a positive exercise tolerance test. Recently, he
started to have left shoulder twinges and tingling in his hands. A stress test
done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes,
stopped due to fatigue. However, Mr. Smith is comfortably breathing in room
air. He also showed accumulation of fluid in his extremities. He does not have
any chest pain.”
Named Entity Recognition
Entity Linking
Co-reference Resolution
Negation Detection
Person Person C0018795
C0015672
C0008031
Example
6Implicit Entity Recognition in Clinical Documents
“Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac
catheterization because of a positive exercise tolerance test. Recently, he
started to have left shoulder twinges and tingling in his hands. A stress test
done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes,
stopped due to fatigue. However, Mr. Smith is comfortably breathing in room
air. He also showed accumulation of fluid in his extremities. He does not have
any chest pain.”
Example
7Implicit Entity Recognition in Clinical Documents
“Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac
catheterization because of a positive exercise tolerance test. Recently, he
started to have left shoulder twinges and tingling in his hands. A stress test
done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes,
stopped due to fatigue. However, Mr. Smith is comfortably breathing in room
air. He also showed accumulation of fluid in his extremities. He does not have
any chest pain.”
Example
Shortness of Breath (NEG)
Edema
Implicit Entity Recognition
8Implicit Entity Recognition in Clinical Documents
Sentence Entity
“Rounded calcific density in right upper quadrant likely representing
a gallstone within the neck of the gallbladder.”
“His tip of the appendix is inflamed.”
“The respirations were unlabored and there were no use of
accessory muscles.”
“She was walking outside on her driveway and suddenly fell
unconcious, with no prodrome, or symptoms preceding the event.”
“This is important to prevent shortness of breath and lower
extremity swelling from fluid accumulation.”
More Examples
Cholecystitis
Appendicitis
Shortness of
breath (NEG)
Syncope
Edema
9Implicit Entity Recognition in Clinical Documents
Implicit Entity Recognition
Implicit Entity Recognition (IER) is the task of
determining whether a sentence has a reference
to an entity, even though it does not mention
that entity by its name.
10Implicit Entity Recognition in Clinical Documents
Automation of clinical documents
• New healthcare policies : automation required
• State-of-art approaches focus on explicit mentions.
• The overall understanding about the patients record needs:
• Explicit/implicit facts
• Domain knowledge
• Some conditions are frequently mentioned implicitly.
• 40% of shortness of breath mentions.
• 35% of edema mentions.
CAC CDI
Readmission
Prediction
Assisting
Professionals
11Implicit Entity Recognition in Clinical Documents
What is involved in solving the problem?
“At the time of discharge she was breathing comfortably with a respiratory
rate of 12 to 15 breaths per minute.”
“Rounded calcific density in right upper quadrant likely representing a
gallstone within the neck of the gallbladder.”
• Language understanding.
Term ‘comfortable’ is the antonym of ‘uncomfortable’
• Domain knowledge.
‘gallstones blocking the tube leading out of your gallbladder
cause cholecystitis’
Shortness of breath (NEG)
Cholecystitis (POS)
12Implicit Entity Recognition in Clinical Documents
Our Solution
Candidate Sentence
Selection
Candidate Sentence
Pruning
Similarity Calculation
Annotations
Entity Representative Term Selection
Entity Model Creation
Implicit Entity Recognition in Clinical Documents
ERT Selection
• The knowledge base consists of definitions of the entities.
• Entity Representative Terms may indicate the implicit mentions of
the entities.
• The representative power of a term for an entity is calculated by its
TF-IDF value.
rt is the representative power of the term t for the entity e, freq(t,Qe) is the
frequency of the term t in the definitions of e, E is the total number of
entities, Et is the number of entities defined using term t.
breathing
fluid
gallstone
Shortness of breath
edema
cholecystitis
Implicit Entity Recognition in Clinical Documents
Entity Model
• Entity Indicator.
• Entity Indicator consists of the terms that describe features of
the entity in the definition.
• E.g., ‘A disorder characterized by an uncomfortable sensation of
difficulty breathing’ – {uncomfortable, sensation, difficulty,
breathing}.
• Entity Model – collection of entity indicators
Entity Model
Entity Indicator1
Entity Indicator3
Entity Indicator2
16Implicit Entity Recognition in Clinical Documents
Candidate Sentence Selection & Pruning
• Candidate sentences – sentences with ERT.
• Candidate sentences are pruned to remove the noise.
• Selected nouns, verbs, adjectives and adverbs within the fixed
window size from the ERT of the sentence.
“His propofol was increased and he was allowed to wake up a second time
later on the evening of surgery and was ultimately weaned from mechanical
ventilation and successfully extubated at about 09:30 that evening.”
{weaned, mechanical, ventilation, successfully, extubated}
pruning
17Implicit Entity Recognition in Clinical Documents
Similarity Calculation
• The similarity between entity model and pruned candidate sentence
is calculated to annotate the sentence.
• The syntactic diversity of the words and the negated mentions need
special attention.
• Multiple similarity measures are used.
t1 and t2 are the words, M is set of similarity measures. M = {WUP, LCH, LIN,
JCN, Word2Vec, Levenshtein}
Implicit Entity Recognition in Clinical Documents
Similarity Calculation
• The similarity between entity model and the pruned sentence is
calculated by weighting the maximum similarity of each word in the
entity model by its representative power.
e – entity indicator
s – pruned sentence
α(te, s) – determines if term t in e
is antonym of any term in s.
Implicit Entity Recognition in Clinical Documents
Similarity Calculation
• The similarity between entity model and the pruned sentence is
calculated by weighting the maximum similarity of each word in the
entity model by its representative power.
e – entity indicator
s – pruned sentence
α(te, s) – determines if term t in e
is antonym of any term in s.
f(te, s) – calculates the similarity
of term in e with the terms in
sentence.
Implicit Entity Recognition in Clinical Documents
Similarity Calculation
• The similarity between entity model and the pruned sentence is
calculated by weighting the maximum similarity of each word in the
entity model by its representative power.
e – entity indicator
s – pruned sentence
α(te, s) – determines if term t in e
is antonym of any term in s.
f(te, s) – calculates the similarity
of term in e with the terms in
sentence.
sim(e, s) – measures the similarity
between entity indicator and the
pruned sentence.
Implicit Entity Recognition in Clinical Documents
Dataset
• Used the dataset used by SemEval-2014 task 7.
• 857 sentences selected for 8 entities.
• The entities are selected based on the frequency of their
appearance and feedback from domain experts.
• Annotated by three domain experts.
• Annotation agreement 0.58.
Implicit Entity Recognition in Clinical Documents
Dataset
Entity Positive
Assertions
Negative
Assertions
None
Shortness of Breath 93 94 29
Edema 115 35 81
Syncope 96 92 24
Cholecystitis 78 36 4
Gastrointestinal Gas 18 14 5
Colitis 12 11 0
Cellulitis 8 2 0
Fasciitis 7 3 0
Implicit Entity Recognition in Clinical Documents
Evaluation
• Baselines
• MCS algorithm (Mihalcea 2006)
• SVM (trained on n-grams)
• Evaluation metrics
• Positive Precision and recall
• Negative Precision and recall
• 70% training and 30% testing
• Threshold selection for our algorithm and MCS
• Thresholds were selected based on the annotation performance
in the training dataset
Implicit Entity Recognition in Clinical Documents
Annotation Performance
Method PP PR PF1 NP NR NF1
Our 0.66 0.87 0.75 0.73 0.73 0.73
MCS 0.50 0.93 0.65 0.31 0.76 0.44
SVM 0.73 0.82 0.77 0.66 0.67 0.67
• Our algorithm outperforms baselines in negative category.
• SVM is able to leverage the supervision to beat our algorithm in
positive category.
Implicit Entity Recognition in Clinical Documents
Annotation Performance
Method PP PR PF1 NP NR NF1
SVM 0.73 0.82 0.77 0.66 0.67 0.67
SVM+MCS 0.73 0.82 0.77 0.66 0.66 0.66
SVM+Our 0.77 0.85 0.81 0.72 0.75 0.73
• The similarity value of our algorithm as a feature to the SVM.
• This proves our similarity value can be used as an effective feature
with a supervised approach.
Implicit Entity Recognition in Clinical Documents
Annotation Performance with
varying training dataset size
Positive Assertions Negative Assertions
Implicit Entity Recognition in Clinical Documents
Limitations
• The approach misses the implicit mentions of entities with no ERT.
• Implicit mentions of shortness of breath without the term
‘breathing’
• “The patient had low oxygen saturation”
• “The patient was gasping for air”
• “Patient was air hunger”
• 113 instances vs 8990 instances
Implicit Entity Recognition in Clinical Documents
Conclusion
• Introduced the problem of implicit entity recognition in clinical
documents.
• Developed a unsupervised approach and showed that it
outperforms supervised approach.
• Proved that supervised approach can use our similarity value as a
feature to reduce labeling cost and to improve the performance.
Thank You
Sujan Perera, Pablo Mendes, Amit Sheth, Krishnaprasad Thirunarayan, Adarsh Alex, Christopher
Heid, Greg Mott, 'Implicit Entity Recognition in Clinical Documents', In proceedings of The Fourth
Joint Conference on Lexical and Computational Semantics (*SEM), 2015, PDF
http://knoesis.org/researchers/sujan/
Implicit Entity Recognition in Clinical Documents

More Related Content

Viewers also liked

Semantic, Cognitive and Perceptual Computing -Using semantics and statistics ...
Semantic, Cognitive and Perceptual Computing -Using semantics and statistics ...Semantic, Cognitive and Perceptual Computing -Using semantics and statistics ...
Semantic, Cognitive and Perceptual Computing -Using semantics and statistics ...Artificial Intelligence Institute at UofSC
 
Semantic, Cognitive and Perceptual Computing -Perceptual computing from the f...
Semantic, Cognitive and Perceptual Computing -Perceptual computing from the f...Semantic, Cognitive and Perceptual Computing -Perceptual computing from the f...
Semantic, Cognitive and Perceptual Computing -Perceptual computing from the f...Artificial Intelligence Institute at UofSC
 
Stream Reasoning: mastering the velocity and variety dimensions of Big Data...
Stream Reasoning: mastering the velocity and variety dimensions of Big Data...Stream Reasoning: mastering the velocity and variety dimensions of Big Data...
Stream Reasoning: mastering the velocity and variety dimensions of Big Data...Artificial Intelligence Institute at UofSC
 
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...Artificial Intelligence Institute at UofSC
 
Semantic, Cognitive and Perceptual Computing -Cognitive computing in autonomo...
Semantic, Cognitive and Perceptual Computing -Cognitive computing in autonomo...Semantic, Cognitive and Perceptual Computing -Cognitive computing in autonomo...
Semantic, Cognitive and Perceptual Computing -Cognitive computing in autonomo...Artificial Intelligence Institute at UofSC
 
Trending: Social media analysis to monitor cannabis and synthetic cannabino...
Trending: Social media analysis to monitor cannabis and synthetic cannabino...Trending: Social media analysis to monitor cannabis and synthetic cannabino...
Trending: Social media analysis to monitor cannabis and synthetic cannabino...Artificial Intelligence Institute at UofSC
 
Semantic, Cognitive and Perceptual Computing -Keynote artificial intelligence...
Semantic, Cognitive and Perceptual Computing -Keynote artificial intelligence...Semantic, Cognitive and Perceptual Computing -Keynote artificial intelligence...
Semantic, Cognitive and Perceptual Computing -Keynote artificial intelligence...Artificial Intelligence Institute at UofSC
 

Viewers also liked (18)

Semantic, Cognitive and Perceptual Computing -Using semantics and statistics ...
Semantic, Cognitive and Perceptual Computing -Using semantics and statistics ...Semantic, Cognitive and Perceptual Computing -Using semantics and statistics ...
Semantic, Cognitive and Perceptual Computing -Using semantics and statistics ...
 
Exploring Synthetic Cannabinoid Effects Using Web Forum Data
Exploring Synthetic Cannabinoid Effects Using Web Forum Data Exploring Synthetic Cannabinoid Effects Using Web Forum Data
Exploring Synthetic Cannabinoid Effects Using Web Forum Data
 
RDF Streams and Continuous SPARQL (C-SPARQL)
RDF Streams and Continuous SPARQL (C-SPARQL)RDF Streams and Continuous SPARQL (C-SPARQL)
RDF Streams and Continuous SPARQL (C-SPARQL)
 
Semantic, Cognitive and Perceptual Computing -Perceptual computing from the f...
Semantic, Cognitive and Perceptual Computing -Perceptual computing from the f...Semantic, Cognitive and Perceptual Computing -Perceptual computing from the f...
Semantic, Cognitive and Perceptual Computing -Perceptual computing from the f...
 
Stream Reasoning: mastering the velocity and variety dimensions of Big Data...
Stream Reasoning: mastering the velocity and variety dimensions of Big Data...Stream Reasoning: mastering the velocity and variety dimensions of Big Data...
Stream Reasoning: mastering the velocity and variety dimensions of Big Data...
 
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
 
Semantic, Cognitive and Perceptual Computing -Cognitive computing in autonomo...
Semantic, Cognitive and Perceptual Computing -Cognitive computing in autonomo...Semantic, Cognitive and Perceptual Computing -Cognitive computing in autonomo...
Semantic, Cognitive and Perceptual Computing -Cognitive computing in autonomo...
 
Semantic, Cognitive and Perceptual Computing -Human mental representation
Semantic, Cognitive and Perceptual Computing -Human mental representationSemantic, Cognitive and Perceptual Computing -Human mental representation
Semantic, Cognitive and Perceptual Computing -Human mental representation
 
Semantic perception tkp(1)
Semantic perception tkp(1)Semantic perception tkp(1)
Semantic perception tkp(1)
 
Finding Street Gang Members on Twitter
Finding Street Gang Members on TwitterFinding Street Gang Members on Twitter
Finding Street Gang Members on Twitter
 
Trending: Social media analysis to monitor cannabis and synthetic cannabino...
Trending: Social media analysis to monitor cannabis and synthetic cannabino...Trending: Social media analysis to monitor cannabis and synthetic cannabino...
Trending: Social media analysis to monitor cannabis and synthetic cannabino...
 
Integrating Sensor and Social Data for Understanding City Events
Integrating Sensor and Social Data for Understanding City EventsIntegrating Sensor and Social Data for Understanding City Events
Integrating Sensor and Social Data for Understanding City Events
 
Semantic, Cognitive and Perceptual Computing -Keynote artificial intelligence...
Semantic, Cognitive and Perceptual Computing -Keynote artificial intelligence...Semantic, Cognitive and Perceptual Computing -Keynote artificial intelligence...
Semantic, Cognitive and Perceptual Computing -Keynote artificial intelligence...
 
Finding Street Gang Members on Twitter
Finding Street Gang Members on TwitterFinding Street Gang Members on Twitter
Finding Street Gang Members on Twitter
 
Feedbackdriven radiologyreportretrieval ichi2015-v2
Feedbackdriven radiologyreportretrieval ichi2015-v2Feedbackdriven radiologyreportretrieval ichi2015-v2
Feedbackdriven radiologyreportretrieval ichi2015-v2
 
Understanding City Traffic Dynamics Utilizing Sensor and Textual Observations
Understanding City Traffic Dynamics Utilizing Sensor and Textual ObservationsUnderstanding City Traffic Dynamics Utilizing Sensor and Textual Observations
Understanding City Traffic Dynamics Utilizing Sensor and Textual Observations
 
Context Aware Harassment Detection in Social Media [Overview]
Context Aware Harassment Detection in Social Media [Overview]Context Aware Harassment Detection in Social Media [Overview]
Context Aware Harassment Detection in Social Media [Overview]
 
Contrast Pattern Aided Regression and Classification
Contrast Pattern Aided Regression and ClassificationContrast Pattern Aided Regression and Classification
Contrast Pattern Aided Regression and Classification
 

Similar to Implicit Entity Recognition in Clinical Documents

Knowledge-driven Implicit Information Extraction
Knowledge-driven Implicit Information ExtractionKnowledge-driven Implicit Information Extraction
Knowledge-driven Implicit Information ExtractionSujan Perera
 
EpisodicFocused   SOAP Note Exemplar (pls use this template).docx
EpisodicFocused   SOAP Note Exemplar (pls use this template).docxEpisodicFocused   SOAP Note Exemplar (pls use this template).docx
EpisodicFocused   SOAP Note Exemplar (pls use this template).docxrusselldayna
 
Smart Data in Health – How we will exploit personal, clinical, and social “Bi...
Smart Data in Health – How we will exploit personal, clinical, and social “Bi...Smart Data in Health – How we will exploit personal, clinical, and social “Bi...
Smart Data in Health – How we will exploit personal, clinical, and social “Bi...Amit Sheth
 
Beth Israel President's Reception
Beth Israel President's ReceptionBeth Israel President's Reception
Beth Israel President's ReceptionNeil Theise
 
Evidence Based Medicine
Evidence Based MedicineEvidence Based Medicine
Evidence Based MedicineCSN Vittal
 
ABB_ArestyPoster_041715_kws
ABB_ArestyPoster_041715_kwsABB_ArestyPoster_041715_kws
ABB_ArestyPoster_041715_kwsNicholas Addo
 
Drs. Olson’s and Jackson’s CMC Pediatric X-Ray Mastery: December Cases
Drs. Olson’s and Jackson’s CMC Pediatric X-Ray Mastery: December CasesDrs. Olson’s and Jackson’s CMC Pediatric X-Ray Mastery: December Cases
Drs. Olson’s and Jackson’s CMC Pediatric X-Ray Mastery: December CasesSean M. Fox
 
GEMC- Hanging and Strangulation, Asphyxiation & AeA "The Choking Game"- Resid...
GEMC- Hanging and Strangulation, Asphyxiation & AeA "The Choking Game"- Resid...GEMC- Hanging and Strangulation, Asphyxiation & AeA "The Choking Game"- Resid...
GEMC- Hanging and Strangulation, Asphyxiation & AeA "The Choking Game"- Resid...Open.Michigan
 
Survey of human anatomy and physiology Chapter 1 to 4
Survey of human anatomy and physiology Chapter 1 to 4Survey of human anatomy and physiology Chapter 1 to 4
Survey of human anatomy and physiology Chapter 1 to 4cmahon57
 
Human Persons, Pragmatism, & Death
Human Persons, Pragmatism, & DeathHuman Persons, Pragmatism, & Death
Human Persons, Pragmatism, & DeathBill Reichart
 
Health Assessment ppt Jitendra bokha.pptx
Health Assessment ppt Jitendra bokha.pptxHealth Assessment ppt Jitendra bokha.pptx
Health Assessment ppt Jitendra bokha.pptxJitendra Bokha
 
Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each...
Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each...Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each...
Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each...Larry Smarr
 

Similar to Implicit Entity Recognition in Clinical Documents (18)

Knowledge-driven Implicit Information Extraction
Knowledge-driven Implicit Information ExtractionKnowledge-driven Implicit Information Extraction
Knowledge-driven Implicit Information Extraction
 
Knowledge-driven Implicit Information Extraction
Knowledge-driven Implicit Information ExtractionKnowledge-driven Implicit Information Extraction
Knowledge-driven Implicit Information Extraction
 
Patterns: the language of the subluxation
Patterns: the language of the subluxationPatterns: the language of the subluxation
Patterns: the language of the subluxation
 
Tariq faridi 45 minute talk
Tariq faridi 45 minute talkTariq faridi 45 minute talk
Tariq faridi 45 minute talk
 
EpisodicFocused   SOAP Note Exemplar (pls use this template).docx
EpisodicFocused   SOAP Note Exemplar (pls use this template).docxEpisodicFocused   SOAP Note Exemplar (pls use this template).docx
EpisodicFocused   SOAP Note Exemplar (pls use this template).docx
 
Smart Data in Health – How we will exploit personal, clinical, and social “Bi...
Smart Data in Health – How we will exploit personal, clinical, and social “Bi...Smart Data in Health – How we will exploit personal, clinical, and social “Bi...
Smart Data in Health – How we will exploit personal, clinical, and social “Bi...
 
Tariq faridi 30 minute talk
Tariq faridi 30 minute talkTariq faridi 30 minute talk
Tariq faridi 30 minute talk
 
Beth Israel President's Reception
Beth Israel President's ReceptionBeth Israel President's Reception
Beth Israel President's Reception
 
Evidence Based Medicine
Evidence Based MedicineEvidence Based Medicine
Evidence Based Medicine
 
ABB_ArestyPoster_041715_kws
ABB_ArestyPoster_041715_kwsABB_ArestyPoster_041715_kws
ABB_ArestyPoster_041715_kws
 
Drs. Olson’s and Jackson’s CMC Pediatric X-Ray Mastery: December Cases
Drs. Olson’s and Jackson’s CMC Pediatric X-Ray Mastery: December CasesDrs. Olson’s and Jackson’s CMC Pediatric X-Ray Mastery: December Cases
Drs. Olson’s and Jackson’s CMC Pediatric X-Ray Mastery: December Cases
 
GEMC- Hanging and Strangulation, Asphyxiation & AeA "The Choking Game"- Resid...
GEMC- Hanging and Strangulation, Asphyxiation & AeA "The Choking Game"- Resid...GEMC- Hanging and Strangulation, Asphyxiation & AeA "The Choking Game"- Resid...
GEMC- Hanging and Strangulation, Asphyxiation & AeA "The Choking Game"- Resid...
 
The Expository Essay
The Expository EssayThe Expository Essay
The Expository Essay
 
Survey of human anatomy and physiology Chapter 1 to 4
Survey of human anatomy and physiology Chapter 1 to 4Survey of human anatomy and physiology Chapter 1 to 4
Survey of human anatomy and physiology Chapter 1 to 4
 
Human Persons, Pragmatism, & Death
Human Persons, Pragmatism, & DeathHuman Persons, Pragmatism, & Death
Human Persons, Pragmatism, & Death
 
Polygraph
PolygraphPolygraph
Polygraph
 
Health Assessment ppt Jitendra bokha.pptx
Health Assessment ppt Jitendra bokha.pptxHealth Assessment ppt Jitendra bokha.pptx
Health Assessment ppt Jitendra bokha.pptx
 
Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each...
Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each...Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each...
Using Data Analytics to Discover the 100 Trillion Bacteria Living Within Each...
 

Recently uploaded

CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...asadnawaz62
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 

Recently uploaded (20)

CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 

Implicit Entity Recognition in Clinical Documents

  • 1. 1 Implicit Entity Recognition in Clinical Documents Sujan Perera1, Pablo Mendes2, Amit Sheth1, Krishnaprasad Thirunarayan1, Adarsh Alex1, Christopher Heid3, Greg Mott3 1Kno.e.sis Center, Wright State University, 2IBM Research, San Jose, 3Boonshoft School of Medicine, Wight State University
  • 2. “Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac catheterization because of a positive exercise tolerance test. Recently, he started to have left shoulder twinges and tingling in his hands. A stress test done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes, stopped due to fatigue. However, Mr. Smith is comfortably breathing in room air. He also showed accumulation of fluid in his extremities. He does not have any chest pain.” Example 2Implicit Entity Recognition in Clinical Documents
  • 3. “Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac catheterization because of a positive exercise tolerance test. Recently, he started to have left shoulder twinges and tingling in his hands. A stress test done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes, stopped due to fatigue. However, Mr. Smith is comfortably breathing in room air. He also showed accumulation of fluid in his extremities. He does not have any chest pain.” Named Entity Recognition Person Person Example 3Implicit Entity Recognition in Clinical Documents
  • 4. “Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac catheterization because of a positive exercise tolerance test. Recently, he started to have left shoulder twinges and tingling in his hands. A stress test done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes, stopped due to fatigue. However, Mr. Smith is comfortably breathing in room air. He also showed accumulation of fluid in his extremities. He does not have any chest pain.” Named Entity Recognition Entity Linking Person Person C0018795 C0015672 C0008031 Example 4Implicit Entity Recognition in Clinical Documents
  • 5. “Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac catheterization because of a positive exercise tolerance test. Recently, he started to have left shoulder twinges and tingling in his hands. A stress test done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes, stopped due to fatigue. However, Mr. Smith is comfortably breathing in room air. He also showed accumulation of fluid in his extremities. He does not have any chest pain.” Named Entity Recognition Entity Linking Co-reference Resolution Person Person C0018795 C0015672 C0008031 Example 5Implicit Entity Recognition in Clinical Documents
  • 6. “Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac catheterization because of a positive exercise tolerance test. Recently, he started to have left shoulder twinges and tingling in his hands. A stress test done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes, stopped due to fatigue. However, Mr. Smith is comfortably breathing in room air. He also showed accumulation of fluid in his extremities. He does not have any chest pain.” Named Entity Recognition Entity Linking Co-reference Resolution Negation Detection Person Person C0018795 C0015672 C0008031 Example 6Implicit Entity Recognition in Clinical Documents
  • 7. “Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac catheterization because of a positive exercise tolerance test. Recently, he started to have left shoulder twinges and tingling in his hands. A stress test done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes, stopped due to fatigue. However, Mr. Smith is comfortably breathing in room air. He also showed accumulation of fluid in his extremities. He does not have any chest pain.” Example 7Implicit Entity Recognition in Clinical Documents
  • 8. “Bob Smith is a 61-year-old man referred by Dr. Davis for outpatient cardiac catheterization because of a positive exercise tolerance test. Recently, he started to have left shoulder twinges and tingling in his hands. A stress test done on 2013-06-02 revealed that the patient exercised for 6 1/2 minutes, stopped due to fatigue. However, Mr. Smith is comfortably breathing in room air. He also showed accumulation of fluid in his extremities. He does not have any chest pain.” Example Shortness of Breath (NEG) Edema Implicit Entity Recognition 8Implicit Entity Recognition in Clinical Documents
  • 9. Sentence Entity “Rounded calcific density in right upper quadrant likely representing a gallstone within the neck of the gallbladder.” “His tip of the appendix is inflamed.” “The respirations were unlabored and there were no use of accessory muscles.” “She was walking outside on her driveway and suddenly fell unconcious, with no prodrome, or symptoms preceding the event.” “This is important to prevent shortness of breath and lower extremity swelling from fluid accumulation.” More Examples Cholecystitis Appendicitis Shortness of breath (NEG) Syncope Edema 9Implicit Entity Recognition in Clinical Documents
  • 10. Implicit Entity Recognition Implicit Entity Recognition (IER) is the task of determining whether a sentence has a reference to an entity, even though it does not mention that entity by its name. 10Implicit Entity Recognition in Clinical Documents
  • 11. Automation of clinical documents • New healthcare policies : automation required • State-of-art approaches focus on explicit mentions. • The overall understanding about the patients record needs: • Explicit/implicit facts • Domain knowledge • Some conditions are frequently mentioned implicitly. • 40% of shortness of breath mentions. • 35% of edema mentions. CAC CDI Readmission Prediction Assisting Professionals 11Implicit Entity Recognition in Clinical Documents
  • 12. What is involved in solving the problem? “At the time of discharge she was breathing comfortably with a respiratory rate of 12 to 15 breaths per minute.” “Rounded calcific density in right upper quadrant likely representing a gallstone within the neck of the gallbladder.” • Language understanding. Term ‘comfortable’ is the antonym of ‘uncomfortable’ • Domain knowledge. ‘gallstones blocking the tube leading out of your gallbladder cause cholecystitis’ Shortness of breath (NEG) Cholecystitis (POS) 12Implicit Entity Recognition in Clinical Documents
  • 13. Our Solution Candidate Sentence Selection Candidate Sentence Pruning Similarity Calculation Annotations Entity Representative Term Selection Entity Model Creation Implicit Entity Recognition in Clinical Documents
  • 14. ERT Selection • The knowledge base consists of definitions of the entities. • Entity Representative Terms may indicate the implicit mentions of the entities. • The representative power of a term for an entity is calculated by its TF-IDF value. rt is the representative power of the term t for the entity e, freq(t,Qe) is the frequency of the term t in the definitions of e, E is the total number of entities, Et is the number of entities defined using term t. breathing fluid gallstone Shortness of breath edema cholecystitis Implicit Entity Recognition in Clinical Documents
  • 15. Entity Model • Entity Indicator. • Entity Indicator consists of the terms that describe features of the entity in the definition. • E.g., ‘A disorder characterized by an uncomfortable sensation of difficulty breathing’ – {uncomfortable, sensation, difficulty, breathing}. • Entity Model – collection of entity indicators Entity Model Entity Indicator1 Entity Indicator3 Entity Indicator2 16Implicit Entity Recognition in Clinical Documents
  • 16. Candidate Sentence Selection & Pruning • Candidate sentences – sentences with ERT. • Candidate sentences are pruned to remove the noise. • Selected nouns, verbs, adjectives and adverbs within the fixed window size from the ERT of the sentence. “His propofol was increased and he was allowed to wake up a second time later on the evening of surgery and was ultimately weaned from mechanical ventilation and successfully extubated at about 09:30 that evening.” {weaned, mechanical, ventilation, successfully, extubated} pruning 17Implicit Entity Recognition in Clinical Documents
  • 17. Similarity Calculation • The similarity between entity model and pruned candidate sentence is calculated to annotate the sentence. • The syntactic diversity of the words and the negated mentions need special attention. • Multiple similarity measures are used. t1 and t2 are the words, M is set of similarity measures. M = {WUP, LCH, LIN, JCN, Word2Vec, Levenshtein} Implicit Entity Recognition in Clinical Documents
  • 18. Similarity Calculation • The similarity between entity model and the pruned sentence is calculated by weighting the maximum similarity of each word in the entity model by its representative power. e – entity indicator s – pruned sentence α(te, s) – determines if term t in e is antonym of any term in s. Implicit Entity Recognition in Clinical Documents
  • 19. Similarity Calculation • The similarity between entity model and the pruned sentence is calculated by weighting the maximum similarity of each word in the entity model by its representative power. e – entity indicator s – pruned sentence α(te, s) – determines if term t in e is antonym of any term in s. f(te, s) – calculates the similarity of term in e with the terms in sentence. Implicit Entity Recognition in Clinical Documents
  • 20. Similarity Calculation • The similarity between entity model and the pruned sentence is calculated by weighting the maximum similarity of each word in the entity model by its representative power. e – entity indicator s – pruned sentence α(te, s) – determines if term t in e is antonym of any term in s. f(te, s) – calculates the similarity of term in e with the terms in sentence. sim(e, s) – measures the similarity between entity indicator and the pruned sentence. Implicit Entity Recognition in Clinical Documents
  • 21. Dataset • Used the dataset used by SemEval-2014 task 7. • 857 sentences selected for 8 entities. • The entities are selected based on the frequency of their appearance and feedback from domain experts. • Annotated by three domain experts. • Annotation agreement 0.58. Implicit Entity Recognition in Clinical Documents
  • 22. Dataset Entity Positive Assertions Negative Assertions None Shortness of Breath 93 94 29 Edema 115 35 81 Syncope 96 92 24 Cholecystitis 78 36 4 Gastrointestinal Gas 18 14 5 Colitis 12 11 0 Cellulitis 8 2 0 Fasciitis 7 3 0 Implicit Entity Recognition in Clinical Documents
  • 23. Evaluation • Baselines • MCS algorithm (Mihalcea 2006) • SVM (trained on n-grams) • Evaluation metrics • Positive Precision and recall • Negative Precision and recall • 70% training and 30% testing • Threshold selection for our algorithm and MCS • Thresholds were selected based on the annotation performance in the training dataset Implicit Entity Recognition in Clinical Documents
  • 24. Annotation Performance Method PP PR PF1 NP NR NF1 Our 0.66 0.87 0.75 0.73 0.73 0.73 MCS 0.50 0.93 0.65 0.31 0.76 0.44 SVM 0.73 0.82 0.77 0.66 0.67 0.67 • Our algorithm outperforms baselines in negative category. • SVM is able to leverage the supervision to beat our algorithm in positive category. Implicit Entity Recognition in Clinical Documents
  • 25. Annotation Performance Method PP PR PF1 NP NR NF1 SVM 0.73 0.82 0.77 0.66 0.67 0.67 SVM+MCS 0.73 0.82 0.77 0.66 0.66 0.66 SVM+Our 0.77 0.85 0.81 0.72 0.75 0.73 • The similarity value of our algorithm as a feature to the SVM. • This proves our similarity value can be used as an effective feature with a supervised approach. Implicit Entity Recognition in Clinical Documents
  • 26. Annotation Performance with varying training dataset size Positive Assertions Negative Assertions Implicit Entity Recognition in Clinical Documents
  • 27. Limitations • The approach misses the implicit mentions of entities with no ERT. • Implicit mentions of shortness of breath without the term ‘breathing’ • “The patient had low oxygen saturation” • “The patient was gasping for air” • “Patient was air hunger” • 113 instances vs 8990 instances Implicit Entity Recognition in Clinical Documents
  • 28. Conclusion • Introduced the problem of implicit entity recognition in clinical documents. • Developed a unsupervised approach and showed that it outperforms supervised approach. • Proved that supervised approach can use our similarity value as a feature to reduce labeling cost and to improve the performance.
  • 29. Thank You Sujan Perera, Pablo Mendes, Amit Sheth, Krishnaprasad Thirunarayan, Adarsh Alex, Christopher Heid, Greg Mott, 'Implicit Entity Recognition in Clinical Documents', In proceedings of The Fourth Joint Conference on Lexical and Computational Semantics (*SEM), 2015, PDF http://knoesis.org/researchers/sujan/ Implicit Entity Recognition in Clinical Documents

Editor's Notes

  1. Give the example of appendix and inflammation