Phenotyping is essential in medical research, as it provides a better
understanding of healthcare problems owing to the fact that clinical phenotypes
identify subsets of patients with common characteristics. Subgroup discovery appears
to be a promising machine learning approach because it provides a framework
with which to search for interesting subgroups according to the relations
between the individual characteristics and a target value. Each single pattern extracted
by SD algorithms is human-readable. However, its complexity (the number
of attributes involved) and the high number of subgroups obtained make the
overall model difficult to understand. In this work, we propose a method with
which to explain SD, designed for the clinical context. We have employed a
two-step process in order to obtain SD model-agnostic explanations based on
a decision tree surrogate model. The complexity involved in evaluating explainable
methods led us to adopt a multiple strategy. We first show how explanations
are built, and test a selection of state-of-the-art SD algorithms and gold-standard
datasets. We then illustrate the suitability of the method in a clinical use case for
an antimicrobial resistance problem. Finally, we study the utility of the method by
surveying a small group in order to validate it from a human-centric perspective
Unit-IV; Professional Sales Representative (PSR).pptx
Simple explanations to summarise Subgroup Discovery outcomes: a case of study concerning patient phenotyping
1. Enrique Valero-Leal 1, M. Campos2,3, J. M. Juarez2
1 Technical University of Madrid
2 AIKE research group (INTICO), University of Murcia
3 IMIB-ARRIXACA Murcian Biomedical Research Institute
Simple explanations to summarise
Subgroup Discovery outcomes: a case of
study concerning patient phenotyping
Sept 19 2022 X-KDD workshop, Grenoble
Funded by Spanish Ministry of Science, Innovation and Universities
under the CONFAINCE project (Ref:PID2021-122194OB-I00 ), and by
the European Fund for Regional Development (EFRD, FEDER).
2. 2
Simple explanations to summarise Subgroup Discovery: patient phenotyping
FULL PAPER DOWNLOADABLE AT:
https://kdd.isti.cnr.it/xkdd2022/papers/XKDD_2022_paper_9989.pdf
Enrique Valero-Leal, M. Campos, J. M. Juarez. Simple
explanations to summarise Subgroup Discovery:
patient phenotyping. Proceedings of the
International Workshop on
eXplainable Knowledge Discovery in Data Mining
XKDD 2022. Lecture Notes in Computer Science.
Springer Series. 2022
These slides summarise the conference
paper presented at XKDD 2022 workshop @ECML-PKDD:
This research was funded by
under the CONFAINCE project
(Ref:PID2021-122194OB-I00 )
3. • OUR RESEARCH GOAL
Generate trustworthy medical hypotheses for
patient phenotyping.
Subgroup Discovery algorithms approach
Medical-friendly explanations
SubgroupExplainer
3
Simple explanations to summarise Subgroup Discovery: patient phenotyping
4. • OUTLINE:
1. Clinical problem & research goal
2. Subgroup discovery
3. Contribution: SubgroupExplainer
4. Experiments
5. Conclusions
4
✓
Simple explanations to summarise Subgroup Discovery: patient phenotyping
5. • SUBGROUP DISCOVERY
Clustering != Subgroup discovery
5
Simple explanations to summarise Subgroup Discovery: patient phenotyping
Picture from: S. Ventura and J. M. Luna (2018). Supervised Descriptive Pattern Mining. Springer books.
13. • EXPERIMENTS
1. Computational properties and scalability
2. Clinical reproducible use case
3. Human subjective study
13
Simple explanations to summarise Subgroup Discovery: patient phenotyping
14. • EXPERIMENTS: Computational properties and scalability
14
Simple explanations to summarise Subgroup Discovery: patient phenotyping
Ssg: subgroups
S: all selectors from Ssg
Su : unique selectors from S
card: mean cardinality |S|/|Ssg|
T: number vertex of tree
purity: proportion correctly class
15. • EXPERIMENTS
– Clinical reproducible use case
MIMIC III dataset (60,000/1280 admissions)
15
Simple explanations to summarise Subgroup Discovery: patient phenotyping
CN2-SD CART+WRAcc
16. • EXPERIMENTS
– Human subjective study
18 participants surveyed
ML and unfamiliar AI
Task oriented: SD & Trees.
Subjective opinion
16
Simple explanations to summarise Subgroup Discovery: patient phenotyping
18. • CONCLUSIONS
– Subgroup explainer:
SD problem pioneer
SD-agnostic, global, tree-like surrogate explanations.
Designed for phenotyping problems.
– Compactness: distil myriad of subgroups
– Comparative method: multiple SD outcomes
– Secondary use: surrogated model.
18
Simple explanations to summarise Subgroup Discovery: patient phenotyping
19. Simple explanations to summarise Subgroup Discovery
outcomes: a case of study concerning patient phenotyping
Contact:
Jose M. Juarez
jmjuarez@um.es
Simple explanations to summarise Subgroup Discovery: patient phenotyping
Subgroup Explainer:
Phenotyping method
Compact, comparative, secondary use
20. 20
Simple explanations to summarise Subgroup Discovery: patient phenotyping
FULL PAPER DOWNLOADABLE AT:
https://kdd.isti.cnr.it/xkdd2022/papers/XKDD_2022_paper_9989.pdf
Enrique Valero-Leal, M. Campos, J. M. Juarez. Simple
explanations to summarise Subgroup Discovery:
patient phenotyping. Proceedings of the
International Workshop on
eXplainable Knowledge Discovery in Data Mining
XKDD 2022. Lecture Notes in Computer Science.
Springer Series. 2022
These slides summarise the conference
paper presented at XKDD 2022 workshop @ECML-PKDD: