Invited to present the work under development by the Institute for Systems and Robotics (ISR-Lisboa) and the Interactive Technologies Institute (ITI), I gave a one-hour presentation and discussion on 27 October 2022. The work was presented remotely to the Department of Radiology (Neuroimaging and Neurointervention) at Stanford University in California, where I introduced our team, project, and work to the research group of Prof. Greg Zaharchuk. The presentation proposes and discusses how personalizing and customizing the AI outputs can positively affect the clinical workflow, and how these strategies promote unbiased behavior among clinicians while improving that workflow.
3. Team
João Fernandes
HCI MSc
Francisco M. Calisto
HCI PhD
Carlos Santiago
ML Researcher
Nuno Nunes
HCI Professor
Jacinto Nascimento
ML Professor
Clara Aleluia
Radiologist
Margarida Morais
ML MSc
João Maria Abrantes
Radiologist
5. 9.6 million
cancer deaths in 2018
Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R.L., Torre, L.A. and Jemal, A., 2018. Global cancer statistics 2018: GLOBOCAN estimates of
incidence and mortality worldwide for 36 cancers in 185 countries. CA: a cancer journal for clinicians, 68(6), pp.394-424.
6. ~10%
… yielding false-negative results.
Smith, Robert A., Kimberly S. Andrews, Durado Brooks, Stacey A. Fedewa, Deana Manassaram‐Baptiste, Debbie Saslow, Otis W. Brawley, and
Richard C. Wender. Cancer screening in the United States, 2018: a review of current American Cancer Society guidelines and current issues
in cancer screening. CA: A Cancer Journal for Clinicians 68, no. 4 (2018): 297-316.
7. ~40%
… yielding false-positive results.
Smith, Robert A., Kimberly S. Andrews, Durado Brooks, Stacey A. Fedewa, Deana Manassaram‐Baptiste, Debbie Saslow, Otis W. Brawley, and
Richard C. Wender. Cancer screening in the United States, 2018: a review of current American Cancer Society guidelines and current issues
in cancer screening. CA: A Cancer Journal for Clinicians 68, no. 4 (2018): 297-316.
10. MULTIMODALITY WORKFLOW
Magnetic Resonance Imaging (MRI) > UltraSound (US) > MammoGraphy (MG) > Lesions
Calisto, F.M., Nunes, N. and Nascimento, J.C., 2020, September. “BreastScreening: On the Use of Multi-Modality in Medical Imaging
Diagnosis”. In Proceedings of the International Conference on Advanced Visual Interfaces (pp. 1-5).
11. BREAST SEVERITY
BI-RADS Meaning
0 Incomplete; needs more information (additional exams or waiting for prior exams)
1 Negative
2 Benign
3 Probably Benign
4 Suspicious
5 Highly suggestive of malignancy
6 Known biopsy-proven malignancy
Schaekermann, M., Beaton, G., Habib, M., Lim, A., Larson, K. and Law, E., 2019, May. “Capturing Expert Arguments from Medical Adjudication
Discussions in a Machine-readable Format”. In Companion Proceedings of The 2019 World Wide Web Conference (pp. 1131-1137).
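The BI-RADS categories above can be thought of as a simple lookup table. As a minimal, hypothetical sketch (the names and the follow-up rule are illustrative, not part of our system), the scale maps onto code like this:

```python
# Hypothetical mapping of BI-RADS categories to their meanings,
# mirroring the table above.
BIRADS_MEANING = {
    0: "Incomplete; needs more information",
    1: "Negative",
    2: "Benign",
    3: "Probably benign",
    4: "Suspicious",
    5: "Highly suggestive of malignancy",
    6: "Known biopsy-proven malignancy",
}

def is_actionable(birads: int) -> bool:
    """Illustrative rule: categories 4-6 typically trigger
    biopsy or treatment follow-up (assumption for this sketch)."""
    return birads >= 4

print(is_actionable(3))  # → False
print(is_actionable(5))  # → True
```

The point of the sketch is that the scale is ordinal but not linear: the jump from 3 to 4 changes the clinical action, which is exactly where expert disagreement matters most.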
13. PROBLEM
Schaekermann, M., 2020. “Human-AI Interaction in the Presence of Ambiguity: From Deliberation-based Labeling to Ambiguity-aware AI”.
14. MEDICAL IMAGE ASSESSMENT
Prior work in behavioral sciences for medical relation extraction substantiates the disagreement relations between inter-variability and intra-variability.
Dumitrache, A., Aroyo, L. and Welty, C., 2018. “Crowdsourcing Ground Truth for Medical Relation Extraction”. ACM Transactions on Interactive
Intelligent Systems (TiiS), 8(2), pp.1-20.
15. EXPERT DISAGREEMENT
Disagreement relations are addressed as a function of three phenomena:
1. Differences among clinical professionals, such as each professional's medical background, institution, and bias;
2. Heterogeneous characteristics of the dataset to be analyzed, such as noisy
and heterogeneous modalities;
3. Nature of the diagnostic guidelines, such as the subjective and ambiguous
classification of the BI-RADS.
Schaekermann, M., Beaton, G., Habib, M., Lim, A., Larson, K. and Law, E., 2019. “Understanding expert disagreement in medical data analysis
through structured adjudication”. Proceedings of the ACM on Human-Computer Interaction, 3(CSCW), pp.1-23.
16. PROBLEM
In medical imaging, doctors need to trust that AI is being used safely, and for their benefit, during decision-making.
17. MEDICAL EXPERIENCE
Interns Juniors Middles Seniors
Calisto, Francisco Maria, Nuno Nunes, and Jacinto C. Nascimento. "Modeling adoption of intelligent agents in medical imaging." International
Journal of Human-Computer Studies 168 (2022): 102922.
18. AGENTS
Strategies, such as adapting the agent's communication, promote unbiased behavior for each category of medical experience, improving medical performance, AI perception, and user experience.
19. MEDICAL ASSISTANCE
Radiologist fatigue levels and performance are related to environmental factors, such as the number of False-Positives and False-Negatives.
20. HUMAN-AI DELIBERATION
AI suggestion: BI-RADS = 5, with 99.94% of accuracy. Clinician: BI-RADS = 4? Final result: BI-RADS = 5.
Round 1: the clinician interprets the image alone.
Round 2: the clinician interprets AI suggestions.
Round 3: the clinician controls the final result.
Calisto, F. M., Santiago, C., Nunes, N., & Nascimento, J. C. (2021). Introduction of human-centric AI assistant to aid radiologists for multimodal
breast image classification. International Journal of Human-Computer Studies, 150, 102607.
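The three deliberation rounds above form a simple protocol: an independent human reading, an AI suggestion, and a human-controlled final decision. The following is a minimal, hypothetical sketch of that loop; all names (`Assessment`, `deliberate`, the 0.99 confidence threshold) are assumptions for illustration, not part of the published system:

```python
from dataclasses import dataclass

@dataclass
class Assessment:
    birads: int          # BI-RADS category, 0-6
    confidence: float    # confidence in [0, 1]

def deliberate(clinician_read, ai_suggest, clinician_decide):
    """Hypothetical three-round human-AI deliberation loop.

    Round 1: the clinician interprets the image alone.
    Round 2: the AI proposes a category with a confidence score.
    Round 3: the clinician sees both and controls the final result.
    """
    round1 = clinician_read()                 # e.g. BI-RADS = 4?
    round2 = ai_suggest()                     # e.g. BI-RADS = 5 at 99.94%
    return clinician_decide(round1, round2)   # clinician keeps control

# Toy usage: the clinician accepts the AI suggestion only when its
# confidence is very high; otherwise their own reading stands.
final = deliberate(
    clinician_read=lambda: Assessment(4, 0.70),
    ai_suggest=lambda: Assessment(5, 0.9994),
    clinician_decide=lambda r1, r2: r2 if r2.confidence > 0.99 else r1,
)
print(final.birads)  # → 5
```

The design point is that the AI never overwrites the human: Round 3 always returns whatever the clinician's decision function chooses.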
21. 52 clinicians
… from nine public and private medical institutions in
Portugal.
USER STUDIES
22. 491 patients
from a multimodality dataset of medical images.
Calisto, F. M., Santiago, C., Nunes, N., & Nascimento, J. C. (2022). BreastScreening-AI: Evaluating Medical Intelligent Agents for Human-AI
Interactions. Artificial Intelligence in Medicine, 127, 102285.
DATASET
24. 98%
… of clinicians do understand what the system is thinking.
USER EXPECTATIONS
Calisto, F. M., Santiago, C., Nunes, N., & Nascimento, J. C. (2022). BreastScreening-AI: Evaluating Medical Intelligent Agents for Human-AI
Interactions. Artificial Intelligence in Medicine, 127, 102285.
25. 93%
… trust in the system's capability.
USER EXPECTATIONS
Calisto, F. M., Santiago, C., Nunes, N., & Nascimento, J. C. (2022). BreastScreening-AI: Evaluating Medical Intelligent Agents for Human-AI
Interactions. Artificial Intelligence in Medicine, 127, 102285.
26. INTER-VARIABILITY vs INTRA-VARIABILITY
Calisto, F. M., Santiago, C., Nunes, N., & Nascimento, J. C. (2022). BreastScreening-AI: Evaluating Medical Intelligent Agents for Human-AI
Interactions. Artificial Intelligence in Medicine, 127, 102285.
27. CLINICAL IMPACT: Clinician-AI vs Clinician-Only
Calisto, F. M., Santiago, C., Nunes, N., & Nascimento, J. C. (2022). BreastScreening-AI: Evaluating Medical Intelligent Agents for Human-AI
Interactions. Artificial Intelligence in Medicine, 127, 102285.
29. AMBIGUITY-AWARE (Future Work)
Schaekermann, M., Beaton, G., Sanoubari, E., Lim, A., Larson, K. and Law, E., 2020, April. “Ambiguity-aware AI Assistants for Medical Data
Analysis”. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (pp. 1-14).
» What is the potential benefit of communicating ambiguity in AI outputs?
» How can ambiguity-aware AI be implemented for breast cancer diagnosis?
» How can techniques be combined to develop AI assistants capable of recognizing and explaining ambiguous cases?
30. CONCLUSION
In this research work:
1. We identified how the different suggestion levels (i.e., merely suggesting vs. imposing the AI recommendations) impact the radiologists' decision-making process;
2. We are developing a system that enables clinicians to accept or reject the AI breast analysis, with adaptive communication depending on the level of medical professional experience, such as novice or expert clinicians;
3. Likewise, we will propose a series of recommendations for a human-
centered approach around personalizable and customizable AI in breast
cancer diagnosis;
32. “
Into whatsoever houses I enter, I will enter to
help the sick, and I will abstain from all
intentional wrong-doing and harm, especially
from abusing the bodies of man or woman,
bond or free.
- Hippocratic Oath
Hello, my name is Francisco Calisto, and I am presenting our work in progress, titled “Personalizing and Customizing AI Explanations for Clinicians”, applied to breast cancer.
… I am a researcher from Portugal and currently a visiting scholar at the Human-Computer Interaction Institute at CMU.
Working alongside a great team, from Human-Computer Interaction researchers to Machine Learning researchers and Radiologists…
… we are trying to overcome some of the challenges of breast cancer.
Every year, cancer causes almost 10 million deaths worldwide, and breast cancer is among the leading causes of cancer death in women. These women can be our grandmothers, wives, or even our daughters.
Screening is not perfect: radiologists yield about 10% false-negative results among women suspected of having breast cancer, but…
… they also yield about 40% false-positive results, which can lead women to unnecessary biopsies.
The cancer burden can be reduced through early detection of cancer, as well as management of patients who develop cancer.
For early detection, breast cancer is usually diagnosed with several medical imaging modalities.
Across this multimodality workflow, clinicians follow an iterative loop to inspect lesions.
From here, clinicians use BI-RADS to classify lesion severity. However, severity classification is not trivial, and consensus is not always achieved.
AI can help on this…
… but the “black box” nature of AI introduces a large element of opacity into decision-making. This problem can be mitigated by introducing eXplainable AI (XAI) techniques.
Prior work in behavioral sciences for medical relation extraction substantiates the disagreement relations between inter-variability and intra-variability.
Disagreement relations are addressed as a function of three phenomena: (1) differences among clinical professionals, such as the medical background and bias; (2) heterogeneous characteristics of the dataset, such as noisy and heterogeneous modalities; and (3) the nature of the diagnostic guidelines, with subjective and ambiguous classifications of the BI-RADS. In fact, clinical experts often rely on complex viewing technology to inspect medical data.
In medical imaging, doctors need to trust that AI is being used safely, and for their benefit, during decision-making. Intrinsically, humans feel the need to understand how decisions are made.
Specifically, decision-making in the radiology reading room is performed by professionals with different levels of medical experience and different clinical profile characteristics.
We expect that strategies such as adapting the agent's communication, by personalizing and customizing the AI explanations for each clinician, will promote unbiased behavior in each category of professional experience, reducing the rates of false positives and false negatives during diagnosis…
… which will in turn influence clinicians' burnout rates.
Since AI models are developed and measured through a pipeline with several characteristics, we studied the deliberation process of clinicians during diagnosis, with and without the introduction of an intelligent agent, so that we can understand clinicians' levels of satisfaction and acceptance, as well as their tolerance of the model's accuracy.
Currently, our user studies involve 52 clinicians from nine public and private medical institutions in Portugal.
We are training our AI models with 491 patients from a multimodality dataset of medical images, including mammography, ultrasound, and MRI.
Our demo hour…
While using our agent, 98% of the 52 clinicians reported that they understand what the AI system is thinking...
… and 93% trust in the system's capability.
We also divided our participant results into groups of inter-variability and intra-variability. Not only was inter-variability reduced between groups of patients, but intra-variability was also reduced within the groups of interns, juniors, middles, and seniors…
… and we could also improve the final clinician performance thanks to the clinician-AI diagnostic workflow.
With these studies, we could understand the behavioral characteristics of each medical group while diagnosing the different groups of patients. We now have information about to whom and when the intelligent agent should provide a suggestion, and how an explanation will influence interpretability and decision-making.
Indeed, as humans, radiologists are exposed to fatigue, and their performance is related to environmental factors such as the number of working hours. As a future direction, we will study and implement a system that presents the hardest patient cases at the beginning of the clinician's shift, while the most trivial cases are diagnosed at the end. For that, we will need to study and implement an ambiguity-aware AI system that classifies all cases in our dataset as more or less ambiguous to diagnose. This idea will transform the way radiologists screen and diagnose breast cancer.
To conclude, we identified how different suggestion levels impact the radiologists' decision-making process. We are developing a system that enables clinicians to accept or reject the AI breast analysis by adapting the communication, through personalization and customization, to each level of medical professional experience. And we propose a series of recommendations and future directions for the development of intelligent agents in breast cancer diagnosis.
During this presentation, about 5 women died from breast cancer, while another 10 underwent unnecessary biopsies in Portugal, making our NHS lose 50 thousand euros in just 10 minutes…
I humbly ask for the help of one of the most noble professions, healthcare professionals, so that we can cross this path together!