SlideShare a Scribd company logo
1 of 45
Medical Question Answering: Dealing with the complexity
and specificity of consumer health questions and visual questions
November 12, 2019 - Seattle, WA, USA
Allen Institute for Artificial Intelligence (AI2
Dr. Asma Ben Abacha
Disclaimer
The views and opinions expressed do not necessarily state or
reflect those of the U.S. Government, and they may not be used
for advertising or product endorsement purposes.
● PhD in Computer Science, Paris-Sud
University, France.
● Master Degree, Paris 13 University, France.
● Software engineering degree, ENSI, Tunisia.
● Postdoctoral researcher, LIST, Luxembourg.
● Lecturer at the University of Lorraine, France.
● PhD Student at LIMSI-CNRS, France.
Topic: Medical question answering.
Short Bio
● Currently, a staff scientist at the NIH/NLM.
U.S. National Institutes of
Health (NIH)
• www.nih.gov
• 27 institutes & centers
U.S. National Library of Medicine (NLM) www.nlm.nih.gov
 NLM receives hundreds of requests per day including many
consumer health questions.
 Manual answers returned in 4 days from consumer services.
 Goal: Automatic system to assist users in their search.
Why Consumer Health Question Answering?
Challenges
in Consumer Health
Question Answering
Example:
“obese daughter. i'm a mother of 5 children. My son
has leukemia who is the youngest. My daughter is 12yrs old and
is obese. i dont know where to turn but i need some professional help.
my daughter has always suffered with weight issues. while i deal
with my 11yr old son at md anderson im so scared of loosing my
daughter. Shes already been suicidal and her grades are so bad.
please i need help. “
7
1st Challenge: Automatic Question Analysis & Understanding
2nd Challenge: Retrieval of relevant & reliable answers
Short Query: “How to treat obesity in children”
2nd Challenge: Retrieval of relevant & reliable answers
A good answer in Page 3 using a short query (“How to treat obesity in children”)
11
1) Recognizing Question Entailment
• “A Question-Entailment Approach to
Question Answering”.
Ben Abacha & Demner-Fushman. BMC
Bioinformatics 2019.
Answering
Complex
Questions
2) Question Summarization
• “On the Summarization of Consumer
Health Questions”.
Ben Abacha & Demner-Fushman. ACL 2019.
◼ Consumer Health Question
(1’)
Relation?
How to define and recognize entailment between questions?
Recognizing Question Entailment (RQE)
13
 Answering new questions by retrieving entailed
questions with existing answers.
MedQuAD: Medical QA Dataset
 Created from trusted resources.
 Contains 47k question-answer pairs extracted from 12 NIH websites
(e.g. MedlinePlus, NCI, GARD, NIDDK).
 Covers 16 question types about Diseases (e.g. Treatment, Susceptibility),
20 types about Drugs (e.g. Usage, Interaction), and an additional type
(Information) for all possible question foci (e.g. Procedure, Exam).
14
https://github.com/abachaa/MedQuAD
RQE Models
Observations:
Logistic Regression outperformed neural networks when trained
with traditional word embeddings such as Glove and Word2Vec.
 Relying on recent language models such as Bert for pre-training led
to a better performance (e.g. results of MEDIQA-RQE 2019). 15
Ben Abacha & Demner-Fushman. BMC Bioinformatics 2019
16
1) Recognizing Entailment/Inference in the Medical Domain
2) Entailment/Inference for Question Answering (QA)
3) New Shared Task at ACL-BioNLP:
Need of dedicated gold-standard datasets
and a common evaluation platform.
 2006:
 2016:
Methods for Using Textual Entailment in Open-Domain Question
Answering. Sanda Harabagiu & Andrew Hickl
Recognizing Question Entailment for Medical Question Answering.
Asma Ben Abacha & Dina Demner-Fushman
MEDIQA @ ACL-BioNLP 2019
17
 MEDIQA 2019: 3 tasks (NLI, RQE & QA)
“Overview of the MEDIQA 2019 Shared Task on Textual
Inference, Question Entailment and Question Answering”.
Ben Abacha, Shivade & Demner-Fushman.
ACL-BioNLP 2019.
 MEDIQA 2019: 72 teams, 20 papers, and interesting approaches & results.
CHiQA
19
https://chiqa.nlm.nih.gov
CHiQA
20
https://chiqa.nlm.nih.gov
Consumer Health Information and Question Answering: Helping consumers find answers to their
health-related information needs. Demner-Fushman, D., Mrabet, Y., and Ben Abacha, A. JAMIA 2019.
CHiQA
21
https://chiqa.nlm.nih.gov
Metrics LiveQA-Med Dataset
IR RQE COMB LiveQA-
Med Best
Average Score 1.183 0.827 1.308 0.637
MAP@10 0.405 0.311 0.445 --
MRR@10 0.438 0.333 0.516 --
o Benchmark from the Medical Question
Answering Competition @ LiveQA 2017
(Ben Abacha et al., TREC 2017).
22
1) Recognizing Question Entailment
• “A Question-Entailment Approach to
Question Answering”.
Ben Abacha & Demner-Fushman. BMC
Bioinformatics 2019.
Answering
Complex
Questions
2) Question Summarization
• “On the Summarization of Consumer
Health Questions”.
Ben Abacha & Demner-Fushman. ACL 2019.
23
 Tackling the complexity of consumer questions by automatic summarization.
Question Summarization
24
“On the Role of Question Summarization and Information
Source Restriction in Consumer Health Question Answering”.
Ben Abacha & Demner-Fushman. AMIA 2019 Informatics Summit.
Measures Original
Questions
Question
Summaries
AvgScore (0-3) 0.711 1.125
Succ@2+ 0.442 0.663
Succ@3+ 0.192 0.317
Succ@4+ 0.077 0.144
Prec@2+ 0.46 0.663
Prec@3+ 0.2 0.317
Prec@4+ 0.08 0.144
QA performance w(o)/ Summarization
o Benchmark from the Medical Question
Answering Competition @ LiveQA 2017
(Ben Abacha et al., TREC 2017).
Automatic Summarization: Pointer-generator network
25
See et al., ACL 2017
Data Creation & Augmentation
26
Method Type Example
#1 MeQSum Dataset Consumer
Health
Question
I suffered a massive stroke on [DATE] with paralysis on my left side of
my body, I'm home and conduct searches on the internet to find help
with recovery, and always this product called neuroaid appears
claiming to restore function. to my knowledge it isn't approved by the
FDA, but it sounds so promising. do you know anything about it and id
there anything approved by our FDA, that does help?
Summary What are treatments for stroke paralysis, including neuroaid?
#2 Augmentation
with Clinical Data
Clinical
Question
55-year-old woman. This lady has epigastric pain and gallbladder
symptoms. How do you assess her gallbladder function when you
don't see stones on the ultrasound? Can a nonfunctioning gallbladder
cause symptoms or do you only get symptoms if you have stones?
Summary Can a nonfunctioning gallbladder cause symptoms or do you only get
symptoms if you have stones?
• #3 Augmentation
with Semantic
Selection
Medical
Question
Is it healthy to ingest 500 mg of vitamin c a day? Should I be taking
more or less?
Summary How much vitamin C should I take a day?
Ben Abacha & Demner-Fushman. ACL 2019
27
Method Training
Set
ROUGE-1 ROUGE-2 ROUGE-L
Seq2seq
Attentional
Model
#1 24.80 13.84 24.27
#2 28.97 18.34 28.74
#3 27.62 15.70 27.11
Pointer
Generator
(PG)
#1 35.80 20.19 34.79
#2 42.77 25.00 40.97
#3 44.16 27.64 42.78
PG +
Coverage
#1 39.57 23.05 38.45
#2 40.00 24.13 38.56
#3 41.76 24.80 40.50
• #1 MeQSum Dataset
• #2 Augmentation with
Clinical Data
• #3 Augmentation with
Semantic Selection
Question Summarization Results
Ben Abacha & Demner-Fushman. ACL 2019
Data augmentation
from question datasets
improves the overall
performance.
Question Summarization Results
28
Question
Ben Abacha & Demner-Fushman. ACL 2019
The best performance of
44.16% is comparable to the
state-of-the-art results in open-
domain using 8K training
pairs (2.5% of the size of the
CNN-DailyMail dataset).
Summing up
 Safe and efficient search alternatives are needed to protect non-expert users from potential
misinformation found online.
 Medical QA systems should follow strict guidelines regarding the goal and background knowledge
and resources to protect the consumers from misleading or harmful information.
Our experiments showed that:
 relying on the retrieval of entailed questions is a viable strategy to answer consumer health questions
 limiting the number of answer sources could enhance the performance of QA systems
 summarizing the questions leads to a substantial improvement
30
Answering
Medication
Questions
3) Questions about Medications
• “Bridging the Gap between Consumers’
Medication Questions and Trusted Answers”.
Ben Abacha et al. MEDINFO 2019.
Plan
1. New gold standard corpus for
Question Answering about
medications.
1. Deep learning experiments on
question type identification (CNN),
focus recognition (Bi-LSTM-CRF),
and question answering (CHiQA).
2. Research insights from the dataset
creation process and experiments.
31
Dataset
◼ The gold standard contains 674 consumer questions submitted to
with associated answers and annotations.
43%
19%
38% DailyMed
MedlinePlus
Other Websites
Question
Type # Example
Information 112 what type of drug is amphetamine?
Dose 70 what is a daily amount of prednisolone
eye drops to take?
Usage 61 how to self inject enoxaparin sodium?
Side Effects 60 does benazepril aggravate hepatitis?
Indication 55 why is pyridostigmine prescribed?
Interaction 51 can i drink cataflam when i drink medrol?
32
Discussion (1/3)
Difficulty and ambiguity of medication questions:
◼ linguistic ambiguity due to misspellings, or wrong grammar leading to
unclear meaning or multiple interpretations.
◼ the use of too many words or general terms to express a medical term
(circumlocution).
◼ the requested medical information/knowledge was not formalized or written
online (knowledge barrier).
◼ underspecified questions lacking essential information.
33
Discussion (2/3)
Complexity of Manual Answer Retrieval:
◼ Conditional answers including answers that depend on the manufacturer,
on the disease, or on patient information.
◼ Distributed answers that can be formed only by combining different text
snippets from different answers and/or sources.
◼ Other answers could not be found without expert inference based on
background knowledge.
34
Discussion (3/3)
Required Solutions for the automation of Medication QA:
◼ Medical text translation, simplification, and inference
are often needed to find relevant answers and/or make the
answers readable for non-expert users.
◼ More data and resources are needed to cover information
about drug interactions and usage guidelines (e.g.
scientific literature, clinical sources).
◼ Conditional answers require different solutions such as
providing a list of answers or interacting with the user in a
dialogue-based approach.
35
36
4) Visual Question Answering
• “VQA-Med: Overview of the Medical Visual
Question Answering Task at ImageCLEF
2019”. Ben Abacha et al. CLEF 2019.
Answering
Visual
Questions
37
• Visual Question Answering (VQA) poses a challenging problem
involving Natural Language Processing and Computer Vision.
• Systems capable of understanding radiology images and answering
related questions could support clinical education, clinical decision
making, and patient education.
Visual Question Answering in the Medical Domain
38
VQA-Med @ ImageCLEF 2019
Four categories of questions:
Modality, Plane, Abnormality & Organ system
• What is most alarming about this ct scan?
• Is this a t1 weighted, t2 weighted, or flair image?
• What is the organ principally shown in this x-ray?
• In which plane is the mri displayed?
Examples:
◼ Training, validation, and test sets created automatically:
◼ The training set includes 3,200 radiology images and 12,792 question-answer pairs.
◼ The answers of the test set have been validated by two medical doctors.
VQA-Med 2019 Data
Results (Accuracy)
• The best team achieved 0.624
overall accuracy and 0.644 BLEU
score.
• Methods: transfer learning, multi-
task learning, ensemble methods,
and hybrid approaches combining
classification models and answer
generation methods.
VQA-Med @ ImageCLEF 2020
41
Thank you!
42
 Contact: asma.benabacha@nih.gov
 Datasets: github.com/abachaa
 CHiQA: chiqa.nlm.nih.gov
Examples of consumer health questions
ID Question (Q) / Answer (A)
1a (Q) “what does prednisone do to the body?”
1b (Q) “what would a normal dose be for valacyclovir?”
1c (Q) “how much gravol to kill you?”
1d (Q) “what time should take memantine?”
2a (Q) “what color is phenytoin?”
(A) [depends on the manufacturer]
 PINK
 WHITE (/Light Lavender)
 ORANGE
2b (Q) “why would my urine test be negative for benzodiazepines when i take Ativan?”
(A) “Common limitations exist for screening benzodiazepines when using traditional immunoassay (IA) tests. IA testing
for benzodiazepines often targets nordiazepam and oxazepam (…)” AND “Some commonly prescribed drugs have
limited cross-reactivity (…)”
2c (Q) “is it alright touse fluticasone when using oxygen?”
(A) “Pharmacological therapy can influence morbidity and mortality in severe chronic obstructive pulmonary disease
(COPD). Long-term domiciliary oxygen therapy (LTOT) improves survival in COPD with chronic hypoxaemia. Oral
steroid medication has been associated with improved survival in men and increased mortality in women, while inhaled
steroid medication has been associated with a reduction in the exacerbation rate (…).”
2d (Q) “how long does vicodin stay in breast milk?”
(A) “Following a 10 mg oral dose of hydrocodone administered to five adult male subjects, the mean peak concentration
was 23.6 ± 5.2 ng/mL. Maximum serum levels were achieved at 1.3 ± 0.3 hours and the half-life was determined to be
3.8 ± 0.3 hours.”
44
Question Summarization Results
45
Question Summarization
Ben Abacha & Demner-Fushman. ACL 2019

More Related Content

What's hot

Contemporary Memo, Merging technologies in the workforce
Contemporary Memo, Merging technologies in the workforceContemporary Memo, Merging technologies in the workforce
Contemporary Memo, Merging technologies in the workforceWarren Sprecher
 
How will the Clinicians, Patients and Consumers of the Future ensure appropri...
How will the Clinicians, Patients and Consumers of the Future ensure appropri...How will the Clinicians, Patients and Consumers of the Future ensure appropri...
How will the Clinicians, Patients and Consumers of the Future ensure appropri...SharpBrains
 
2019.10.30 DayOne Expert Event patient centricity
2019.10.30 DayOne Expert Event patient centricity2019.10.30 DayOne Expert Event patient centricity
2019.10.30 DayOne Expert Event patient centricityDayOne
 
Rock Health Demo Day Notes
Rock Health Demo Day NotesRock Health Demo Day Notes
Rock Health Demo Day NotesLindsay Meyer
 
Taking cognitive/ emotional assessments and therapies to scale
Taking cognitive/ emotional assessments and therapies to scaleTaking cognitive/ emotional assessments and therapies to scale
Taking cognitive/ emotional assessments and therapies to scaleSharpBrains
 
Applying deep learning to medical data
Applying deep learning to medical dataApplying deep learning to medical data
Applying deep learning to medical dataHyun-seok Min
 
Clinical Trial Accrual Challenges: Is Social Media Here to Help? (A. Denicoff)
Clinical Trial Accrual Challenges: Is Social Media Here to Help? (A. Denicoff)Clinical Trial Accrual Challenges: Is Social Media Here to Help? (A. Denicoff)
Clinical Trial Accrual Challenges: Is Social Media Here to Help? (A. Denicoff)Esmeralda Casas-Silva, Ph.D.
 
(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imagingKyuhwan Jung
 
Predictive analytics for personalized healthcare
Predictive analytics for personalized healthcarePredictive analytics for personalized healthcare
Predictive analytics for personalized healthcareJohn Cai
 
The RSNA Image Share Network: Initial 12 Month Results from the UCSF Pilot Site
The RSNA Image Share Network: Initial 12 Month Results from the UCSF Pilot SiteThe RSNA Image Share Network: Initial 12 Month Results from the UCSF Pilot Site
The RSNA Image Share Network: Initial 12 Month Results from the UCSF Pilot SiteTrimed Media Group
 
Outsmarting Smart Technology to Reclaim our Health and Focus
Outsmarting Smart Technology to Reclaim our Health and FocusOutsmarting Smart Technology to Reclaim our Health and Focus
Outsmarting Smart Technology to Reclaim our Health and FocusSharpBrains
 
HEC 2016 Panel: Putting User-Generated Data in Action: Improving Interpretabi...
HEC 2016 Panel: Putting User-Generated Data in Action: Improving Interpretabi...HEC 2016 Panel: Putting User-Generated Data in Action: Improving Interpretabi...
HEC 2016 Panel: Putting User-Generated Data in Action: Improving Interpretabi...Pei-Yun Sabrina Hsueh
 
Deep learning for episodic interventional data
Deep learning for episodic interventional dataDeep learning for episodic interventional data
Deep learning for episodic interventional dataDeakin University
 
Sleep quality prediction from wearable data using deep learning
Sleep quality prediction from wearable data using deep learningSleep quality prediction from wearable data using deep learning
Sleep quality prediction from wearable data using deep learningLuis Fernandez Luque
 
Medical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareMedical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareNUS-ISS
 
한국에서 혁신적인 디지털 헬스케어 스타트업이 탄생하려면
한국에서 혁신적인 디지털 헬스케어 스타트업이 탄생하려면한국에서 혁신적인 디지털 헬스케어 스타트업이 탄생하려면
한국에서 혁신적인 디지털 헬스케어 스타트업이 탄생하려면Yoon Sup Choi
 
New Normal, New Future - Free Download E book
New Normal, New Future - Free Download E bookNew Normal, New Future - Free Download E book
New Normal, New Future - Free Download E bookkevin brown
 
[KNAPS] 포스트 코로나 시대, 제약 산업과 디지털 헬스케어
[KNAPS] 포스트 코로나 시대, 제약 산업과 디지털 헬스케어[KNAPS] 포스트 코로나 시대, 제약 산업과 디지털 헬스케어
[KNAPS] 포스트 코로나 시대, 제약 산업과 디지털 헬스케어Yoon Sup Choi
 
What do 7.5 billion human brains need to thrive in the Digital Age, and what ...
What do 7.5 billion human brains need to thrive in the Digital Age, and what ...What do 7.5 billion human brains need to thrive in the Digital Age, and what ...
What do 7.5 billion human brains need to thrive in the Digital Age, and what ...SharpBrains
 

What's hot (20)

Contemporary Memo, Merging technologies in the workforce
Contemporary Memo, Merging technologies in the workforceContemporary Memo, Merging technologies in the workforce
Contemporary Memo, Merging technologies in the workforce
 
How will the Clinicians, Patients and Consumers of the Future ensure appropri...
How will the Clinicians, Patients and Consumers of the Future ensure appropri...How will the Clinicians, Patients and Consumers of the Future ensure appropri...
How will the Clinicians, Patients and Consumers of the Future ensure appropri...
 
2019.10.30 DayOne Expert Event patient centricity
2019.10.30 DayOne Expert Event patient centricity2019.10.30 DayOne Expert Event patient centricity
2019.10.30 DayOne Expert Event patient centricity
 
Rock Health Demo Day Notes
Rock Health Demo Day NotesRock Health Demo Day Notes
Rock Health Demo Day Notes
 
Taking cognitive/ emotional assessments and therapies to scale
Taking cognitive/ emotional assessments and therapies to scaleTaking cognitive/ emotional assessments and therapies to scale
Taking cognitive/ emotional assessments and therapies to scale
 
Applying deep learning to medical data
Applying deep learning to medical dataApplying deep learning to medical data
Applying deep learning to medical data
 
Clinical Trial Accrual Challenges: Is Social Media Here to Help? (A. Denicoff)
Clinical Trial Accrual Challenges: Is Social Media Here to Help? (A. Denicoff)Clinical Trial Accrual Challenges: Is Social Media Here to Help? (A. Denicoff)
Clinical Trial Accrual Challenges: Is Social Media Here to Help? (A. Denicoff)
 
(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging
 
Predictive analytics for personalized healthcare
Predictive analytics for personalized healthcarePredictive analytics for personalized healthcare
Predictive analytics for personalized healthcare
 
The RSNA Image Share Network: Initial 12 Month Results from the UCSF Pilot Site
The RSNA Image Share Network: Initial 12 Month Results from the UCSF Pilot SiteThe RSNA Image Share Network: Initial 12 Month Results from the UCSF Pilot Site
The RSNA Image Share Network: Initial 12 Month Results from the UCSF Pilot Site
 
Outsmarting Smart Technology to Reclaim our Health and Focus
Outsmarting Smart Technology to Reclaim our Health and FocusOutsmarting Smart Technology to Reclaim our Health and Focus
Outsmarting Smart Technology to Reclaim our Health and Focus
 
HEC 2016 Panel: Putting User-Generated Data in Action: Improving Interpretabi...
HEC 2016 Panel: Putting User-Generated Data in Action: Improving Interpretabi...HEC 2016 Panel: Putting User-Generated Data in Action: Improving Interpretabi...
HEC 2016 Panel: Putting User-Generated Data in Action: Improving Interpretabi...
 
Deep learning for episodic interventional data
Deep learning for episodic interventional dataDeep learning for episodic interventional data
Deep learning for episodic interventional data
 
Sleep quality prediction from wearable data using deep learning
Sleep quality prediction from wearable data using deep learningSleep quality prediction from wearable data using deep learning
Sleep quality prediction from wearable data using deep learning
 
Medical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareMedical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in Healthcare
 
Research Poster
Research PosterResearch Poster
Research Poster
 
한국에서 혁신적인 디지털 헬스케어 스타트업이 탄생하려면
한국에서 혁신적인 디지털 헬스케어 스타트업이 탄생하려면한국에서 혁신적인 디지털 헬스케어 스타트업이 탄생하려면
한국에서 혁신적인 디지털 헬스케어 스타트업이 탄생하려면
 
New Normal, New Future - Free Download E book
New Normal, New Future - Free Download E bookNew Normal, New Future - Free Download E book
New Normal, New Future - Free Download E book
 
[KNAPS] 포스트 코로나 시대, 제약 산업과 디지털 헬스케어
[KNAPS] 포스트 코로나 시대, 제약 산업과 디지털 헬스케어[KNAPS] 포스트 코로나 시대, 제약 산업과 디지털 헬스케어
[KNAPS] 포스트 코로나 시대, 제약 산업과 디지털 헬스케어
 
What do 7.5 billion human brains need to thrive in the Digital Age, and what ...
What do 7.5 billion human brains need to thrive in the Digital Age, and what ...What do 7.5 billion human brains need to thrive in the Digital Age, and what ...
What do 7.5 billion human brains need to thrive in the Digital Age, and what ...
 

Similar to Consumer Health Questions: Summarizing and Answering Complex Medical Queries

An Integrated Electronic Lifestyle and Mental Health Patient Self-Assessment ...
An Integrated Electronic Lifestyle and Mental Health Patient Self-Assessment ...An Integrated Electronic Lifestyle and Mental Health Patient Self-Assessment ...
An Integrated Electronic Lifestyle and Mental Health Patient Self-Assessment ...Health Informatics New Zealand
 
FINAL DOCUMENTATION
FINAL DOCUMENTATIONFINAL DOCUMENTATION
FINAL DOCUMENTATIONAdityaIshwar
 
Ebm talk-general-mar99-ppt95
Ebm talk-general-mar99-ppt95Ebm talk-general-mar99-ppt95
Ebm talk-general-mar99-ppt95rubenroa
 
Lecture 5B
Lecture 5BLecture 5B
Lecture 5BCMDLMS
 
Healthcare & digital marketing - Today & Future
Healthcare &  digital marketing - Today & FutureHealthcare &  digital marketing - Today & Future
Healthcare & digital marketing - Today & FutureDeepali Thakur
 
System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...
System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...
System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...IRJET Journal
 
How to Use Data to Improve Patient Safety: Part 2
How to Use Data to Improve Patient Safety: Part 2How to Use Data to Improve Patient Safety: Part 2
How to Use Data to Improve Patient Safety: Part 2Health Catalyst
 
Hospital Information Systems (August 18, 2015)
Hospital Information Systems (August 18, 2015)Hospital Information Systems (August 18, 2015)
Hospital Information Systems (August 18, 2015)Nawanan Theera-Ampornpunt
 
Becoming Better Advocates for Your Health
Becoming Better Advocates for Your HealthBecoming Better Advocates for Your Health
Becoming Better Advocates for Your HealthBest Doctors
 
Healthier Longevity: Personalizing the Aging Process Through Connected Health...
Healthier Longevity: Personalizing the Aging Process Through Connected Health...Healthier Longevity: Personalizing the Aging Process Through Connected Health...
Healthier Longevity: Personalizing the Aging Process Through Connected Health...Personal Connected Health Alliance
 
Big Data Means Big Potential Challenges for Nurse Execs Response.pdf
Big Data Means Big Potential Challenges for Nurse Execs Response.pdfBig Data Means Big Potential Challenges for Nurse Execs Response.pdf
Big Data Means Big Potential Challenges for Nurse Execs Response.pdfbkbk37
 
NHSFPX 4000 Capella University Eliminating Medical Errors Bibliography.docx
NHSFPX 4000 Capella University Eliminating Medical Errors Bibliography.docxNHSFPX 4000 Capella University Eliminating Medical Errors Bibliography.docx
NHSFPX 4000 Capella University Eliminating Medical Errors Bibliography.docxwrite5
 
New Data on Clinician Learning: What does it mean for your CME Programs
New Data on Clinician Learning:  What does it mean for your CME ProgramsNew Data on Clinician Learning:  What does it mean for your CME Programs
New Data on Clinician Learning: What does it mean for your CME ProgramsDLC-Solutions.com and EthosCE.com
 

Similar to Consumer Health Questions: Summarizing and Answering Complex Medical Queries (20)

IBM Health and Social Programs Summit: IBM Commitment & investment in health ...
IBM Health and Social Programs Summit: IBM Commitment & investment in health ...IBM Health and Social Programs Summit: IBM Commitment & investment in health ...
IBM Health and Social Programs Summit: IBM Commitment & investment in health ...
 
Health IT for Executives
Health IT for ExecutivesHealth IT for Executives
Health IT for Executives
 
Health IT for Executives (March 28, 2017)
Health IT for Executives (March 28, 2017)Health IT for Executives (March 28, 2017)
Health IT for Executives (March 28, 2017)
 
EBM_2016.pdf
EBM_2016.pdfEBM_2016.pdf
EBM_2016.pdf
 
An Integrated Electronic Lifestyle and Mental Health Patient Self-Assessment ...
An Integrated Electronic Lifestyle and Mental Health Patient Self-Assessment ...An Integrated Electronic Lifestyle and Mental Health Patient Self-Assessment ...
An Integrated Electronic Lifestyle and Mental Health Patient Self-Assessment ...
 
FINAL DOCUMENTATION
FINAL DOCUMENTATIONFINAL DOCUMENTATION
FINAL DOCUMENTATION
 
Ebm talk-general-mar99-ppt95
Ebm talk-general-mar99-ppt95Ebm talk-general-mar99-ppt95
Ebm talk-general-mar99-ppt95
 
Lecture 5B
Lecture 5BLecture 5B
Lecture 5B
 
Healthcare & digital marketing - Today & Future
Healthcare &  digital marketing - Today & FutureHealthcare &  digital marketing - Today & Future
Healthcare & digital marketing - Today & Future
 
Clinical Informatics
Clinical Informatics Clinical Informatics
Clinical Informatics
 
System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...
System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...
System for Recommending Drugs Based on Machine Learning Sentiment Analysis of...
 
Umap22_MainConference.pdf
Umap22_MainConference.pdfUmap22_MainConference.pdf
Umap22_MainConference.pdf
 
How to Use Data to Improve Patient Safety: Part 2
How to Use Data to Improve Patient Safety: Part 2How to Use Data to Improve Patient Safety: Part 2
How to Use Data to Improve Patient Safety: Part 2
 
Hospital Information Systems (August 18, 2015)
Hospital Information Systems (August 18, 2015)Hospital Information Systems (August 18, 2015)
Hospital Information Systems (August 18, 2015)
 
Becoming Better Advocates for Your Health
Becoming Better Advocates for Your HealthBecoming Better Advocates for Your Health
Becoming Better Advocates for Your Health
 
Healthier Longevity: Personalizing the Aging Process Through Connected Health...
Healthier Longevity: Personalizing the Aging Process Through Connected Health...Healthier Longevity: Personalizing the Aging Process Through Connected Health...
Healthier Longevity: Personalizing the Aging Process Through Connected Health...
 
Big Data Means Big Potential Challenges for Nurse Execs Response.pdf
Big Data Means Big Potential Challenges for Nurse Execs Response.pdfBig Data Means Big Potential Challenges for Nurse Execs Response.pdf
Big Data Means Big Potential Challenges for Nurse Execs Response.pdf
 
NHSFPX 4000 Capella University Eliminating Medical Errors Bibliography.docx
NHSFPX 4000 Capella University Eliminating Medical Errors Bibliography.docxNHSFPX 4000 Capella University Eliminating Medical Errors Bibliography.docx
NHSFPX 4000 Capella University Eliminating Medical Errors Bibliography.docx
 
New Data on Clinician Learning: What does it mean for your CME Programs
New Data on Clinician Learning:  What does it mean for your CME ProgramsNew Data on Clinician Learning:  What does it mean for your CME Programs
New Data on Clinician Learning: What does it mean for your CME Programs
 
Chronic illness: Wellness Solutions Personalized with Genomics & Biometrics
Chronic illness: Wellness Solutions  Personalized with Genomics & BiometricsChronic illness: Wellness Solutions  Personalized with Genomics & Biometrics
Chronic illness: Wellness Solutions Personalized with Genomics & Biometrics
 

Recently uploaded

Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 

Recently uploaded (20)

Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 

Consumer Health Questions: Summarizing and Answering Complex Medical Queries

  • 1. Medical Question Answering: Dealing with the complexity and specificity of consumer health questions and visual questions November 12, 2019 - Seattle, WA, USA Allen Institute for Artificial Intelligence (AI2 Dr. Asma Ben Abacha
  • 2. Disclaimer The views and opinions expressed do not necessarily state or reflect those of the U.S. Government, and they may not be used for advertising or product endorsement purposes.
  • 3. ● PhD in Computer Science, Paris-Sud University, France. ● Master Degree, Paris 13 University, France. ● Software engineering degree, ENSI, Tunisia. ● Postdoctoral researcher, LIST, Luxembourg. ● Lecturer at the University of Lorraine, France. ● PhD Student at LIMSI-CNRS, France. Topic: Medical question answering. Short Bio
  • 4. ● Currently, a staff scientist at the NIH/NLM. U.S. National Institutes of Health (NIH) • www.nih.gov • 27 institutes & centers U.S. National Library of Medicine (NLM) www.nlm.nih.gov
  • 5.  NLM receives hundreds of requests per day including many consumer health questions.  Manual answers returned in 4 days from consumer services.  Goal: Automatic system to assist users in their search. Why Consumer Health Question Answering?
  • 7. Example: “obese daughter. i'm a mother of 5 children. My son has leukemia who is the youngest. My daughter is 12yrs old and is obese. i dont know where to turn but i need some professional help. my daughter has always suffered with weight issues. while i deal with my 11yr old son at md anderson im so scared of loosing my daughter. Shes already been suicidal and her grades are so bad. please i need help. “ 7 1st Challenge: Automatic Question Analysis & Understanding
  • 8. 2nd Challenge: Retrieval of relevant & reliable answers
  • 9. Short Query: “How to treat obesity in children” 2nd Challenge: Retrieval of relevant & reliable answers
  • 10. A good answer in Page 3 using a short query (“How to treat obesity in children”)
  • 11. 11 1) Recognizing Question Entailment • “A Question-Entailment Approach to Question Answering”. Ben Abacha & Demner-Fushman. BMC Bioinformatics 2019. Answering Complex Questions 2) Question Summarization • “On the Summarization of Consumer Health Questions”. Ben Abacha & Demner-Fushman. ACL 2019.
  • 12. ◼ Consumer Health Question (1’) Relation? How to define and recognize entailment between questions?
  • 13. Recognizing Question Entailment (RQE) 13  Answering new questions by retrieving entailed questions with existing answers.
  • 14. MedQuAD: Medical QA Dataset  Created from trusted resources.  Contains 47k question-answer pairs extracted from 12 NIH websites (e.g. MedlinePlus, NCI, GARD, NIDDK).  Covers 16 question types about Diseases (e.g. Treatment, Susceptibility), 20 types about Drugs (e.g. Usage, Interaction), and an additional type (Information) for all possible question foci (e.g. Procedure, Exam). 14 https://github.com/abachaa/MedQuAD
  • 15. RQE Models Observations: Logistic Regression outperformed neural networks when trained with traditional word embeddings such as Glove and Word2Vec.  Relying on recent language models such as Bert for pre-training led to a better performance (e.g. results of MEDIQA-RQE 2019). 15 Ben Abacha & Demner-Fushman. BMC Bioinformatics 2019
  • 16. 16 1) Recognizing Entailment/Inference in the Medical Domain 2) Entailment/Inference for Question Answering (QA) 3) New Shared Task at ACL-BioNLP: Need of dedicated gold-standard datasets and a common evaluation platform.  2006:  2016: Methods for Using Textual Entailment in Open-Domain Question Answering. Sanda Harabagiu & Andrew Hickl Recognizing Question Entailment for Medical Question Answering. Asma Ben Abacha & Dina Demner-Fushman MEDIQA @ ACL-BioNLP 2019
  • 17. 17  MEDIQA 2019: 3 tasks (NLI, RQE & QA) “Overview of the MEDIQA 2019 Shared Task on Textual Inference, Question Entailment and Question Answering”. Ben Abacha, Shivade & Demner-Fushman. ACL-BioNLP 2019.
  • 18.  MEDIQA 2019: 72 teams, 20 papers, and interesting approaches & results.
  • 20. CHiQA 20 https://chiqa.nlm.nih.gov Consumer Health Information and Question Answering: Helping consumers find answers to their health-related information needs. Demner-Fushman, D., Mrabet, Y., and Ben Abacha, A. JAMIA 2019.
  • 21. CHiQA 21 https://chiqa.nlm.nih.gov Metrics LiveQA-Med Dataset IR RQE COMB LiveQA- Med Best Average Score 1.183 0.827 1.308 0.637 MAP@10 0.405 0.311 0.445 -- MRR@10 0.438 0.333 0.516 -- o Benchmark from the Medical Question Answering Competition @ LiveQA 2017 (Ben Abacha et al., TREC 2017).
  • 22. 22 1) Recognizing Question Entailment • “A Question-Entailment Approach to Question Answering”. Ben Abacha & Demner-Fushman. BMC Bioinformatics 2019. Answering Complex Questions 2) Question Summarization • “On the Summarization of Consumer Health Questions”. Ben Abacha & Demner-Fushman. ACL 2019.
  • 23. 23  Tackling the complexity of consumer questions by automatic summarization. Question Summarization
  • 24. 24 “On the Role of Question Summarization and Information Source Restriction in Consumer Health Question Answering”. Ben Abacha & Demner-Fushman. AMIA 2019 Informatics Summit. Measures Original Questions Question Summaries AvgScore (0-3) 0.711 1.125 Succ@2+ 0.442 0.663 Succ@3+ 0.192 0.317 Succ@4+ 0.077 0.144 Prec@2+ 0.46 0.663 Prec@3+ 0.2 0.317 Prec@4+ 0.08 0.144 QA performance w(o)/ Summarization o Benchmark from the Medical Question Answering Competition @ LiveQA 2017 (Ben Abacha et al., TREC 2017).
  • 25. Automatic Summarization: Pointer-generator network 25 See et al., ACL 2017
  • 26. Data Creation & Augmentation 26 Method Type Example #1 MeQSum Dataset Consumer Health Question I suffered a massive stroke on [DATE] with paralysis on my left side of my body, I'm home and conduct searches on the internet to find help with recovery, and always this product called neuroaid appears claiming to restore function. to my knowledge it isn't approved by the FDA, but it sounds so promising. do you know anything about it and id there anything approved by our FDA, that does help? Summary What are treatments for stroke paralysis, including neuroaid? #2 Augmentation with Clinical Data Clinical Question 55-year-old woman. This lady has epigastric pain and gallbladder symptoms. How do you assess her gallbladder function when you don't see stones on the ultrasound? Can a nonfunctioning gallbladder cause symptoms or do you only get symptoms if you have stones? Summary Can a nonfunctioning gallbladder cause symptoms or do you only get symptoms if you have stones? • #3 Augmentation with Semantic Selection Medical Question Is it healthy to ingest 500 mg of vitamin c a day? Should I be taking more or less? Summary How much vitamin C should I take a day? Ben Abacha & Demner-Fushman. ACL 2019
  • 27. 27 Method Training Set ROUGE-1 ROUGE-2 ROUGE-L Seq2seq Attentional Model #1 24.80 13.84 24.27 #2 28.97 18.34 28.74 #3 27.62 15.70 27.11 Pointer Generator (PG) #1 35.80 20.19 34.79 #2 42.77 25.00 40.97 #3 44.16 27.64 42.78 PG + Coverage #1 39.57 23.05 38.45 #2 40.00 24.13 38.56 #3 41.76 24.80 40.50 • #1 MeQSum Dataset • #2 Augmentation with Clinical Data • #3 Augmentation with Semantic Selection Question Summarization Results Ben Abacha & Demner-Fushman. ACL 2019 Data augmentation from question datasets improves the overall performance.
  • 28. Question Summarization Results 28 Question Ben Abacha & Demner-Fushman. ACL 2019 The best performance of 44.16% is comparable to the state-of-the-art results in open- domain using 8K training pairs (2.5% of the size of the CNN-DailyMail dataset).
  • 29. Summing up  Safe and efficient search alternatives are needed to protect non-expert users from potential misinformation found online.  Medical QA systems should follow strict guidelines regarding the goal and background knowledge and resources to protect the consumers from misleading or harmful information. Our experiments showed that:  relying on the retrieval of entailed questions is a viable strategy to answer consumer health questions  limiting the number of answer sources could enhance the performance of QA systems  summarizing the questions leads to a substantial improvement
  • 30. 30 Answering Medication Questions 3) Questions about Medications • “Bridging the Gap between Consumers’ Medication Questions and Trusted Answers”. Ben Abacha et al. MEDINFO 2019.
  • 31. Plan 1. New gold standard corpus for Question Answering about medications. 1. Deep learning experiments on question type identification (CNN), focus recognition (Bi-LSTM-CRF), and question answering (CHiQA). 2. Research insights from the dataset creation process and experiments. 31
  • 32. Dataset ◼ The gold standard contains 674 consumer questions submitted to with associated answers and annotations. 43% 19% 38% DailyMed MedlinePlus Other Websites Question Type # Example Information 112 what type of drug is amphetamine? Dose 70 what is a daily amount of prednisolone eye drops to take? Usage 61 how to self inject enoxaparin sodium? Side Effects 60 does benazepril aggravate hepatitis? Indication 55 why is pyridostigmine prescribed? Interaction 51 can i drink cataflam when i drink medrol? 32
  • 33. Discussion (1/3) Difficulty and ambiguity of medication questions: ◼ linguistic ambiguity due to misspellings, or wrong grammar leading to unclear meaning or multiple interpretations. ◼ the use of too many words or general terms to express a medical term (circumlocution). ◼ the requested medical information/knowledge was not formalized or written online (knowledge barrier). ◼ underspecified questions lacking essential information. 33
  • 34. Discussion (2/3) Complexity of Manual Answer Retrieval: ◼ Conditional answers including answers that depend on the manufacturer, on the disease, or on patient information. ◼ Distributed answers that can be formed only by combining different text snippets from different answers and/or sources. ◼ Other answers could not be found without expert inference based on background knowledge. 34
  • 35. Discussion (3/3) Required Solutions for the automation of Medication QA: ◼ Medical text translation, simplification, and inference are often needed to find relevant answers and/or make the answers readable for non-expert users. ◼ More data and resources are needed to cover information about drug interactions and usage guidelines (e.g. scientific literature, clinical sources). ◼ Conditional answers require different solutions such as providing a list of answers or interacting with the user in a dialogue-based approach. 35
  • 36. 36 4) Visual Question Answering • “VQA-Med: Overview of the Medical Visual Question Answering Task at ImageCLEF 2019”. Ben Abacha et al. CLEF 2019. Answering Visual Questions
  • 37. 37 • Visual Question Answering (VQA) poses a challenging problem involving Natural Language Processing and Computer Vision. • Systems capable of understanding radiology images and answering related questions could support clinical education, clinical decision making, and patient education. Visual Question Answering in the Medical Domain
  • 38. 38 VQA-Med @ ImageCLEF 2019 Four categories of questions: Modality, Plane, Abnormality & Organ system • What is most alarming about this ct scan? • Is this a t1 weighted, t2 weighted, or flair image? • What is the organ principally shown in this x-ray? • In which plane is the mri displayed? Examples:
  • 39. ◼ Training, validation, and test sets created automatically: ◼ The training set includes 3,200 radiology images and 12,792 question-answer pairs. ◼ The answers of the test set have been validated by two medical doctors. VQA-Med 2019 Data
  • 40. Results (Accuracy) • The best team achieved 0.624 overall accuracy and 0.644 BLEU score. • Methods: transfer learning, multi- task learning, ensemble methods, and hybrid approaches combining classification models and answer generation methods.
  • 42. Thank you! 42  Contact: asma.benabacha@nih.gov  Datasets: github.com/abachaa  CHiQA: chiqa.nlm.nih.gov
  • 43. Examples of consumer health questions
  • 44. ID Question (Q) / Answer (A) 1a (Q) “what does prednisone do to the body?” 1b (Q) “what would a normal dose be for valacyclovir?” 1c (Q) “how much gravol to kill you?” 1d (Q) “what time should take memantine?” 2a (Q) “what color is phenytoin?” (A) [depends on the manufacturer]  PINK  WHITE (/Light Lavender)  ORANGE 2b (Q) “why would my urine test be negative for benzodiazepines when i take Ativan?” (A) “Common limitations exist for screening benzodiazepines when using traditional immunoassay (IA) tests. IA testing for benzodiazepines often targets nordiazepam and oxazepam (…)” AND “Some commonly prescribed drugs have limited cross-reactivity (…)” 2c (Q) “is it alright touse fluticasone when using oxygen?” (A) “Pharmacological therapy can influence morbidity and mortality in severe chronic obstructive pulmonary disease (COPD). Long-term domiciliary oxygen therapy (LTOT) improves survival in COPD with chronic hypoxaemia. Oral steroid medication has been associated with improved survival in men and increased mortality in women, while inhaled steroid medication has been associated with a reduction in the exacerbation rate (…).” 2d (Q) “how long does vicodin stay in breast milk?” (A) “Following a 10 mg oral dose of hydrocodone administered to five adult male subjects, the mean peak concentration was 23.6 ± 5.2 ng/mL. Maximum serum levels were achieved at 1.3 ± 0.3 hours and the half-life was determined to be 3.8 ± 0.3 hours.” 44
  • 45. Question Summarization Results 45 Question Summarization Ben Abacha & Demner-Fushman. ACL 2019