Enrique Valero-Leal¹, M. Campos²,³, J. M. Juarez²
¹ Technical University of Madrid
² AIKE research group (INTICO), University of Murcia
³ IMIB-ARRIXACA Murcian Biomedical Research Institute
Simple explanations to summarise
Subgroup Discovery outcomes: a case of
study concerning patient phenotyping
Sept 19 2022 X-KDD workshop, Grenoble
Funded by Spanish Ministry of Science, Innovation and Universities
under the CONFAINCE project (Ref:PID2021-122194OB-I00 ), and by
the European Fund for Regional Development (EFRD, FEDER).
Simple explanations to summarise Subgroup Discovery: patient phenotyping
These slides summarise the conference paper presented at the XKDD 2022 workshop @ECML-PKDD:
Enrique Valero-Leal, M. Campos, J. M. Juarez. Simple explanations to summarise Subgroup Discovery: patient phenotyping. Proceedings of the International Workshop on eXplainable Knowledge Discovery in Data Mining, XKDD 2022. Lecture Notes in Computer Science. Springer Series. 2022
FULL PAPER DOWNLOADABLE AT:
https://kdd.isti.cnr.it/xkdd2022/papers/XKDD_2022_paper_9989.pdf
This research was funded under the CONFAINCE project (Ref: PID2021-122194OB-I00).
• OUR RESEARCH GOAL
Generate trustworthy medical hypotheses for
patient phenotyping.
Subgroup Discovery algorithms approach
Medical-friendly explanations
SubgroupExplainer
• OUTLINE:
1. Clinical problem & research goal
2. Subgroup discovery
3. Contribution: SubgroupExplainer
4. Experiments
5. Conclusions
• SUBGROUP DISCOVERY
Clustering != Subgroup discovery
Picture from: S. Ventura and J. M. Luna (2018). Supervised Descriptive Pattern Mining. Springer books.
• SUBGROUP DISCOVERY: DEFINITIONS
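Dataset: $D = (I, A)$, with attributes $A = \{a_1, a_2, \ldots, a_m\}$
Selector: $s_{sc} : I \to \text{Boolean}$, where $s_{sc}(i) = T \iff sc$ is fulfilled by $i$
Pattern: $P = \{s_1, s_2, \ldots, s_k\}$, with $P$ in conjunctive form
Subgroup: $SG = (P, s_{\text{target}})$, with extent $SG^{\circ} = \{\, i \in I \mid s_{sc}(i) = T,\ \forall s_{sc} \in P \,\}$
Example: SG = IF (age>35, culture=Enteroc.Faecium) THEN suscept=Resistant
Quality function: $qf : (P, D) \to \mathbb{R}$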
SD algorithms:
Frequent pattern mining: SD-MAP, Dp-Subgroup, BSD, etc.
Beam search: SD, CN2-SD, SD4TS (heuristics)
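The definitions above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the `Selector` class, the `extent` and `wracc` helpers, and the eight toy records are all assumptions made for the example. Weighted relative accuracy (WRAcc) is used as the quality function because it is a common choice in SD; the slide itself only states that qf maps a pattern and a dataset to a real number.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

Instance = Dict[str, Any]


@dataclass(frozen=True)
class Selector:
    """Boolean test on one attribute (hypothetical helper, not from the paper)."""
    attribute: str
    test: Callable[[Any], bool]

    def __call__(self, instance: Instance) -> bool:
        return self.test(instance[self.attribute])


def extent(pattern: List[Selector], data: List[Instance]) -> List[Instance]:
    """Instances for which every selector of the conjunctive pattern holds."""
    return [i for i in data if all(s(i) for s in pattern)]


def wracc(pattern: List[Selector], target: Selector, data: List[Instance]) -> float:
    """WRAcc = coverage * (target rate inside the subgroup - overall target rate)."""
    covered = extent(pattern, data)
    if not covered:
        return 0.0
    p_sg = sum(target(i) for i in covered) / len(covered)
    p_all = sum(target(i) for i in data) / len(data)
    return len(covered) / len(data) * (p_sg - p_all)


# Toy records invented for illustration only.
DATA = [
    {"age": 40, "culture": "Enteroc.Faecium", "suscept": "Resistant"},
    {"age": 50, "culture": "Enteroc.Faecium", "suscept": "Resistant"},
    {"age": 60, "culture": "Enteroc.Faecium", "suscept": "Resistant"},
    {"age": 36, "culture": "Enteroc.Faecium", "suscept": "Susceptible"},
    {"age": 30, "culture": "Enteroc.Faecium", "suscept": "Resistant"},
    {"age": 45, "culture": "E.Coli", "suscept": "Susceptible"},
    {"age": 70, "culture": "E.Coli", "suscept": "Susceptible"},
    {"age": 20, "culture": "E.Coli", "suscept": "Susceptible"},
]

# SG = IF (age>35, culture=Enteroc.Faecium) THEN suscept=Resistant
PATTERN = [
    Selector("age", lambda v: v > 35),
    Selector("culture", lambda v: v == "Enteroc.Faecium"),
]
TARGET = Selector("suscept", lambda v: v == "Resistant")

print(len(extent(PATTERN, DATA)))    # 4
print(wracc(PATTERN, TARGET, DATA))  # 0.125
```

Here the subgroup covers half of the toy records and three quarters of them are resistant, against one half overall, so WRAcc is 0.5 × (0.75 − 0.5) = 0.125.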
• CONTRIBUTION
– Overcome difficulties explaining SD to clinicians
– Simple explanations to increase trust in SD
• CONTRIBUTION
– SubgroupExplainer
• XAI characteristics
– SD model-agnostic
– Global explanations
– Surrogate model
– Tree-like explanations
• CONTRIBUTION
– SubgroupExplainer
• Tree-like explanations, why?
• CONTRIBUTION
– SubgroupExplainer: simple explanations
[Diagram: SubgroupExplainer pipeline]
DATASET (#attributes: 15, #instances: 1049; target attribute: AT15)
  AT1 | AT2 | ... | AT15
  1   | 26  | ... | A
  ... | ... | ... | B
  64  | 12  | ... | A
BlackBox SD Algorithm, producing SUBGROUPS SG#1, SG#2, ..., SG#19, SG#20:
  SG#1: AT1>2, AT3=6 => AT15=A
  SG#2: AT1=4, AT2=5, AT4=21 => AT15=B
  SG#3: AT1<11, AT2=5 => AT15=A
  SG#4: AT2>40, AT3=5, AT4=21 => AT15=C
  ...
  SG#20: AT2=3, AT3<7, AT4<22 => AT15=A
Step 1: DB labelling, producing the Labelled Dataset
  A1 … A14 | L (SUBGROUPS)
  1 … ...  | #17, #2, #5
  ...      | #20
  64 ...   | #11, #7, #2
Step 2: SURROGATE Explainer building, producing the SD EXPLANATION
  (tree with nodes labelled #20; #5,2; #11; #19; #9,2; #7,14; #17; #7; #10; #3; #2,7)
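The two steps in the diagram can be sketched as follows. This is an illustrative reading of the slide, not the authors' code: the four toy rows and the small Gini-based tree builder (`build_tree`) are assumptions, the two subgroups are taken from the slide's list (reading "A3" as AT3), and the slide does not specify which tree learner SubgroupExplainer uses. Step 1 labels every row with the subgroups that cover it; Step 2 fits a tree that predicts that label from the attributes, so the tree acts as a surrogate for the SD output.

```python
from collections import Counter

OPS = {">": lambda a, b: a > b, "<": lambda a, b: a < b, "=": lambda a, b: a == b}

# Two subgroups from the slide (reading "A3" as AT3); the rows are invented.
SUBGROUPS = {
    "#1": [("AT1", ">", 2), ("AT3", "=", 6)],
    "#3": [("AT1", "<", 11), ("AT2", "=", 5)],
}
ROWS = [
    {"AT1": 1, "AT2": 5, "AT3": 6},
    {"AT1": 4, "AT2": 5, "AT3": 6},
    {"AT1": 20, "AT2": 9, "AT3": 6},
    {"AT1": 30, "AT2": 9, "AT3": 2},
]


def label_rows(rows, subgroups):
    """Step 1: label each row with the tuple of subgroups whose pattern it satisfies."""
    return [
        tuple(name for name, sels in subgroups.items()
              if all(OPS[op](row[a], v) for a, op, v in sels))
        for row in rows
    ]


def gini(labels):
    """Gini impurity of a non-empty list of labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def build_tree(rows, labels, depth=0, max_depth=3):
    """Step 2: greedy binary tree on 'attr <= threshold' splits (toy surrogate)."""
    best = None
    if depth < max_depth and gini(labels) > 0:
        for attr in rows[0]:
            # Dropping the largest value keeps both sides of every split non-empty.
            for thr in sorted({r[attr] for r in rows})[:-1]:
                left = [l for r, l in zip(rows, labels) if r[attr] <= thr]
                right = [l for r, l in zip(rows, labels) if r[attr] > thr]
                score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
                if best is None or score < best[0]:
                    best = (score, attr, thr)
    if best is None:
        return Counter(labels).most_common(1)[0][0]  # leaf: majority label
    _, attr, thr = best
    go_left = [r[attr] <= thr for r in rows]
    return {
        "attr": attr, "thr": thr,
        "left": build_tree([r for r, g in zip(rows, go_left) if g],
                           [l for l, g in zip(labels, go_left) if g], depth + 1, max_depth),
        "right": build_tree([r for r, g in zip(rows, go_left) if not g],
                            [l for l, g in zip(labels, go_left) if not g], depth + 1, max_depth),
    }


def predict(tree, row):
    """Follow the splits until a leaf (a tuple of subgroup names) is reached."""
    while isinstance(tree, dict):
        tree = tree["left"] if row[tree["attr"]] <= tree["thr"] else tree["right"]
    return tree


LABELS = label_rows(ROWS, SUBGROUPS)
TREE = build_tree(ROWS, LABELS)
PURITY = sum(predict(TREE, r) == l for r, l in zip(ROWS, LABELS)) / len(ROWS)
print(LABELS)  # [('#3',), ('#1', '#3'), ('#1',), ()]
print(PURITY)  # 1.0
```

Using the whole set of covering subgroups as the class label is one possible design choice; it keeps overlapping subgroups (such as "#1" and "#3" on the second row) visible in the leaves, in the spirit of the multi-subgroup node labels shown in the diagram.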
PROPOSAL ADVANTAGES:
• EXPERIMENTS
1. Computational properties and scalability
2. Clinical reproducible use case
3. Human subjective study
• EXPERIMENTS: Computational properties and scalability
Ssg: set of subgroups
S: all selectors from Ssg
Su: unique selectors from S
card: mean cardinality, |S|/|Ssg|
T: number of vertices in the tree
purity: proportion of correctly classified instances
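As a worked example of these quantities, the sketch below computes |Ssg|, |S|, |Su| and card for the five subgroups spelled out on the pipeline slide (SG#1 to SG#4 and SG#20). The selector strings are transcribed by hand, reading the slide's "A3" as AT3, and the variable names are assumptions; T and purity are left out because they require a fitted tree.

```python
# Selectors of the five subgroups shown on the pipeline slide (hand-transcribed).
S_SG = {
    "SG#1": ["AT1>2", "AT3=6"],
    "SG#2": ["AT1=4", "AT2=5", "AT4=21"],
    "SG#3": ["AT1<11", "AT2=5"],
    "SG#4": ["AT2>40", "AT3=5", "AT4=21"],
    "SG#20": ["AT2=3", "AT3<7", "AT4<22"],
}

S = [sel for selectors in S_SG.values() for sel in selectors]  # all selectors
S_U = set(S)                                                   # unique selectors
CARD = len(S) / len(S_SG)                                      # mean cardinality |S|/|Ssg|

print(len(S_SG), len(S), len(S_U), CARD)  # 5 13 11 2.6
```

Two selectors (AT2=5 and AT4=21) are shared between subgroups, which is why |Su| is smaller than |S|.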
• EXPERIMENTS
– Clinical reproducible use case
MIMIC III dataset (60,000/1280 admissions)
CN2-SD CART+WRAcc
• EXPERIMENTS
– Human subjective study
18 participants surveyed
Participants with an ML background and participants unfamiliar with AI
Task-oriented questions: SD & trees.
Subjective opinion
• CONCLUSIONS
– SubgroupExplainer:
A pioneering explainer for the SD problem.
SD-agnostic, global, tree-like surrogate explanations.
Designed for phenotyping problems.
– Compactness: distils a myriad of subgroups
– Comparative method: compares multiple SD outcomes
– Secondary use: surrogate model.
Contact:
Jose M. Juarez
jmjuarez@um.es
SubgroupExplainer:
Phenotyping method
Compact, comparative, secondary use