NegBio: a high-performance tool for negation and uncertainty detection in radiology reports

Yifan Peng1, Xiaosong Wang2, Le Lu2, Mohammadhadi Bagheri2,
Ronald Summers2, Zhiyong Lu1
1 National Center for Biotechnology Information, NLM, NIH
2 Clinical Center, NIH
Twitter: #AMIA2018
NegBio: a high-performance tool for
negation and uncertainty detection in
radiology reports
Oral Presentations – Imaging, S41

• The availability of well-labeled data is the key for large scale machine learning, e.g. deep
learning
• Hospitals have accumulated a large number of raw radiology images and reports
• Conventional ways for collecting image labels are NOT applicable
• the security and privacy issues
• requires comprehension of domain-specific medical knowledge
All Start with Data
Large scale natural image datasets
Large scale
Medical Image dataset
2AMIA 2018 | amia.org

Overview
Mining image labels via NLP for multi-label pathology classification
One of ImageNet pre-trained models
GoogLeNet ResNetVggNetAlexNet
Weights from
predication layer
Pooling Layer
NE
recognizer
(MetaMap)
Negative/
Equivocal
detection
Labels
Image
data
conv1/7x7_s2
conv1/rule_7x7
inception_5b/
output
data
conv1
res5c
res5c_relu
data
conv1_1
relu1_1
conv5_3
relu5_3
data
conv1
conv1
conv5
relu5
MAX LSE AVE
Transition
Layer

A Sample Entry
Image Report Label
Findings: pa and lateral views of the
chest demonstrate significantly
improved bilateral lower lung field
interstitial markings compatible with
linear atelectasis. unchanged right
9th rib fracture peripherally.
unchanged ossification left
coracoacromial ligament. the cardiac
and mediastinal contours are stable.
Impression: improved bilateral lower
lung field linear atelectasis.
Atelectasis

14 Common Thorax Diseases
• Atelectasis
• Cardiomegaly
• Consolidation
• Edema
• Effusion
• Emphysema
• Fibrosis
• Hernia
• Infiltration
• Mass
• Nodule
• Pleural Thickening
• Pneumonia
• Pneumothorax

Challenges
Negative and equivocal findings may indicate the absence of findings
mentioned within the radiology report
Findings: right internal jugular catheter remains in place. Large metastatic lung mass
in the lateral left upper lobe is again noted. No infiltrate or effusion. Extensive
surgical clips again noted left axilla.
Impression: no significant change.
Reason for exam (entered by ordering clinician into cris): bilateral pneumonia no
change in the tracheostomy tube or right internal jugular venous catheter. Unchanged
bilateral alveolar infiltrates, fluid in the right minor fissure, lucency at the right
costophrenic angle suggesting pneumonia. Overall, no significant change

Related Work
Chapman W, et al. A simple algorithm for identifying negated findings and diseases in
discharge summaries. Journal of Biomedical Informatics. 2001;34:301-310.
Harkema H, et al. ConText: an algorithm for determining negation, experiencer, and
temporal status from clinical reports. Journal of biomedical informatics. 2009;42:839-851.
Mutalik P, et al. Use of general-purpose negation detection to augment concept indexing
of medical documents: a quantitative study using the UMLS. Journal of the American
Medical Informatics Association. 2001;8:598-609.
Sohn S, Wu S, Chute C. Dependency parser-based negation detection in clinical
narratives. In AMIA Summits on Translational Science proceedings AMIA Summit on
Translational Science. 2012;2012:1-8.
Mehrabi S, et al. DEEPEN: A negation detection system for clinical text incorporating
dependency relation into NegEx. Journal of Biomedical Informatics. 2015;54:213-219.

Related Work
Ogren P, et al. Constructing evaluation corpora for automated clinical named entity
recognition. In Proceedings of the Sixth International Conference on Language
Resources and Evaluation (LREC'08). 2008;28-30.
Uzuner South B, et al. 2010 i2b2/VA challenge on concepts, assertions, and relations in
clinical text. Journal of the American Medical Informatics Association. 2011;18:552-556.
Suominen H, et al. Overview of the ShARe/CLEF eHealth evaluation lab 2013. In
International Conference of the Cross-Language Evaluation Forum for European
Languages. 2013;212-231.
Albright D, et al. Towards comprehensive syntactic and semantic annotations of the
clinical narrative. Journal of the American Medical Informatics Association. 2013;20:922-
930.
etc..

Our overall method
1. MetaMap (Aronson et al. 2010) was used to map every mention of keywords
in a report to a unique concept ID in the Systematized Nomenclature of
Medicine Clinical Terms (SNOMED-CT)
2. Remove negative and equivocal findings within the radiology report
1. Tokenize
2. Parse
3. Apply rules
9
NE recognizer
(MetaMap)
Tokenize
(NLTK)
Apply rules
Dependency parse
(Bllip/Stanford)
Labels
AMIA 2018 | amia.org

Utilize the universal dependency graph to define patterns
• a directed graph
• vertices are words or phrases labeled with information such as part-of-
speech and the lemma
• edges represent typed dependencies from the governor to its dependent
and are labeled with dependency type
10
Negation and Uncertainty detection

Sample rules
11
• Defined rules on the dependency graphs by utilizing the dependency label
and direction information

Experiments
• Experiments on corpora with positive findings annotated
• OpenI: 3,851 reports, 1,354 findings
• Chest X-ray: 900 reports, 2131 findings
• Experiments on corpora with negative findings annotated
• BioScope: 977 reports, 466 negative scopes
• PK: 116 reports, 491 negative phrases

Results
13
OpenI ChestX-ray
P R F P R F
MetaMap+NegEx 77.2 84.6 80.7 82.8 95.5 88.7
MetaMap+NegBio 89.8 85.0 87.3 94.4 94.4 94.4
BioScope PK
P R F P R F
NegEx 70.6 98.7 82.3 95.1 91.2 93.1
NegBio 96.1 95.7 95.9 98.4 88.6 93.3

NIH Chest X-ray Dataset
One of the largest publicly available chest x-ray datasets to scientific
community
• 112,120 frontal-view X-ray images
• 30,805 unique patients
https://nihcc.app.box.com/v/ChestXray-NIHCC

NegBio is an open source tool
15
https://github.com/ncbi-nlp/NegBio

Overview
Mining image labels via NLP for multi-label pathology classification
One of ImageNet pre-trained models
GoogLeNet ResNetVggNetAlexNet
Weights from
predication layer
Pooling Layer
NE
recognizer
(MetaMap)
Negative/
Equivocal
detection
Labels
Image
data
conv1/7x7_s2
conv1/rule_7x7
inception_5b/
output
data
conv1
res5c
res5c_relu
data
conv1_1
relu1_1
conv5_3
relu5_3
data
conv1
conv1
conv5
relu5
MAX LSE AVE
Transition
Layer

Multi-label Classification and Localization
Wang X, Peng Y, Lu L, Bagheri M, Lu Z, Summers
R. ChestX-ray8: Hospital-scale Chest X-ray database
and benchmarks on weakly-supervised classification
and localization of common thorax diseases. IEEE
Conference on Computer Vision and Pattern
Recognition (CVPR). 2017, 2097-2106.
Wang X*, Peng Y*, Lu L, Lu Z, Summers
R. TieNet: Text-Image Embedding Network for
Common Thorax Disease Classification and
Reporting in Chest X-rays. IEEE Conference on
Computer Vision and Pattern Recognition
(CVPR). 2018.

Conclusion and Future work
• We propose an algorithm, NegBio, to determine negative and uncertain
findings in radiology reports.
• We evaluated NegBio on three publicly available corpora and a newly
constructed corpus. We showed that NegBio achieved a significant
improvement on all datasets over the state of the art.
• We made NegBio an open source tool.
Future work
• To explore NegBio’s applicability in clinical texts beyond radiology reports.

Acknowledgment
This work was supported by the Intramural Research Programs of the National
Institutes of Health, National Library of Medicine and Clinical Center.
The authors of NegEx and MetaMap for making their software tools publicly
available.
Drs. Dina Demner-Fushman and Willie J Rogers for the helpful discussion.

Thank you!
yifan.peng@nih.gov
https://github.com/ncbi-nlp/NegBio

NegBio: a high-performance tool for negation and uncertainty detection in radiology reports

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to NegBio: a high-performance tool for negation and uncertainty detection in radiology reports

Similar to NegBio: a high-performance tool for negation and uncertainty detection in radiology reports (20)

Recently uploaded

Recently uploaded (20)

NegBio: a high-performance tool for negation and uncertainty detection in radiology reports

Editor's Notes