SlideShare a Scribd company logo
1 of 19
Download to read offline
Data-driven knowledge discovery from Brain
Imaging of Alzheimer’s disease patients
Prof. Giuseppe Di Fatta
Director of the MSc Computing for Data Science
Computer Science and Artificial Intelligence Institute
Faculty of Engineering
Free University of Bozen-Bolzano, Italy
Giuseppe.DiFatta@unibz.it
The 22nd South Tyrol Free Software Conference (SFSCON)
NOI Techpark, Bolzano, Italy
10 November 2023
Prof. Giuseppe Di Fatta 2
Outline
• Motivations
• Dementia and Alzheimer’s Disease (AD)
• MRI data-driven solutions for Alzheimer’s disease
classification
– Data and data processing
– Machine Learning models
• outliers detection, feature selection, regression and classification
– Explainability/Interpretability
• Conclusions
Prof. Giuseppe Di Fatta
Dementia and Alzheimer’s Disease
• Mental disorders are one of the five most costly conditions for
medical spending.
• Dementia is the most expensive condition for medical and social cost
and, still, it receives only a fraction of the research funds w.r.t. other
health conditions.
• Alzheimer’s Disease (AD): 60-80% of dementia cases
▪ Several national and international campaigns aimed at raising
awareness and attention (e.g., www.worldalzmonth.org)
▪ Currently 40+ million people suffer from AD: 130-150 million by 2050,
one of the most significant global social economic health crises.
▪ The exact cause of AD is unknown.
▪ Importance of early and accurate diagnosis: quality of life can be
significantly improved.
3
Prof. Giuseppe Di Fatta
Alzheimer’s Disease (AD)
4
Prof. Giuseppe Di Fatta
Data Workflow
5
data
gathering,
selection
Feature
generation
(stats on RoIs)
Image
Segmentation
Machine
Learning
Pre-processing
• A single MRI scan file typically takes 30-40 MB.
• After pre-processing the files generated for a single image take 350-500 MB.
• Pre-procesing of a single image takes ~8h ➔ several months or even years
• A data parallel approach used to speed up computation ➔ a few months
Brain MRIs from
3170 subjects
a never
ending effort
Prof. Giuseppe Di Fatta
Data Workflow
6
data
gathering,
selection
Feature
generation
(stats on RoIs)
Image
Segmentation
Machine
Learning
Machine Learning tasks
• Outlier Detection
• Feature selection
• Training predictive models
• Regression for age estimation
• Classification: binary and multiclass predictive models
• Testing the models: robust performance estimation
• Explainability and Interpretability
Brain MRIs from
3170 subjects
Various ML tasks
Prof. Giuseppe Di Fatta
KNIME for
7
▪ KNIME is an open-source Data Science and Machine
Learning Platform
▪ codeless Machine Learning and Data Science
▪ open-source and free (General Public License)
▪ open and extendible platform
a KNIME data workflow
(an acyclic graph of computations)
Prof. Giuseppe Di Fatta
Various Approaches since 2014
Binary (HC vs AD) and multiclass classification problems
• Input features:
– No selection: all available (~400) numerical measurements from brain ROIs
– Domain-specific feature selection: 16 hippocampal subfields
– Automatic feature selection
• Filter/Wrapper/Embedded methods
• Classification methods I – focus on predictive accuracy:
1. Binary Region Classifier
2. Probability Weight-based Classification
3. Bayesian Classifier
4. Support Vector Machine
5. Deep Neural Networks
• Classification methods II – predictive accuracy and descriptive power:
– brain age estimation
8
85.8 84.7 84.9
Prof. Giuseppe Di Fatta
Predictive Task: Regression
❑ Input data: structural MRI scans (T1 weighting) of human
brains
– Healthy Control (HC) only or in combination with
– Alzheimer’s Disease (AD) (or a different condition)
❑ Regression task:
– Estimate the subject’s age from a structural MRI of the brain
• e.g., Brain Age Gap Estimation (BrainAGE)
– Challenges: data manipulation, accuracy, curse of dimensionality,
confidence of models
9
Prof. Giuseppe Di Fatta
Brain Age Gap Estimation
• Brain Age Gap Estimation (BrainAGE)
– BrainAGE quantifies the difference between the actual anagraphic age
of a subject and their age as predicted by ML models applied to
neuroimaging data
– It is considered a biomarker of brain health.
– Typical method: age model of healthy subjects
• Voxel-Based Morphometry (VBM) approach: PCA applied to 3,700
voxels to reduce the dimensionality of the input space
• Relevance Vector Regression (RVR) and Support Vector
Regression (SVR)
10
Prof. Giuseppe Di Fatta
The “Apparent” Brain Age
• The Apparent Brain Age (ABA) for a specific neurodegenerative condition
– goal-conditioned brain age estimation based on region-wise segmentation
– inductive bias based on feature selection in order to improve the classification
accuracy of the estimated brain age
Objectives: simplest model with high accuracy, interpretability and specificity
✓ the design of a data workflow with a combination of only linear models to
preserve the original input space semantics
✓ the definition of a novel feature score to measure the specific contribution of each
selected morphological region to the classification prediction
✓ a rigorous and extensive experimental comparative analysis to validate the
method: comparable or superior accuracy than SOTA ML methods
✓ demonstrate the interpretability of the classification method
11
▪ A. Varzandian, M.A.S. Razo, M.R. Sanders, A. Atmakuru, G. Di Fatta,
“Classification-biased apparent brain age for the prediction of Alzheimer's
disease”, Frontiers in Neuroscience (581), 2021.
Prof. Giuseppe Di Fatta
ABA, Classification and Interpretability
1. Biased Forward Feature Selection (BFFS) as coarse-grained method:
cross-validation on a correlation-based filter
2. Linear regression is applied to the selected features: LASSO regression
with fine-grained feature selection method
3. Logistic regression based only on ABA and actual age
4. Interpretability with a measure of feature importance/score
12
➔Feature Score:
where {fi} are the k ROI features, ai and cj are model parameters
with
Prof. Giuseppe Di Fatta
Feature Selection (step 1)
• results based on 1901 subjects (AD and CN)
13
Prof. Giuseppe Di Fatta
ABA Prediction and Classification (2 and 3)
14
M5
M6
Prof. Giuseppe Di Fatta
Classification Accuracy (3)
• results based on 1901 subjects (AD and CN)
15
Baseline
SVM SVM DN
+data +data +data
+BFFS
+data
ABA & LogReg Baseline
SVM SVM DN
+data +data +data
+BFFS
+data
ABA & LogReg
Prof. Giuseppe Di Fatta
Feature Score (4): True Positive
16
Prof. Giuseppe Di Fatta
Feature Score (4): True Negative
17
Prof. Giuseppe Di Fatta 18
Conclusions
▪ With same pre-processing and features most classification methods are
similar in terms of accuracy. However, black-box approaches do lack
descriptive capabilities, especially in such high-dimensional spaces.
▪ A new candidate biomarker, the Apparent Brain Age, helps at improving
predictive accuracy and can be useful on itself for its descriptive power.
▪ The new feature score helps to link age gaps to specific ROIs for a
powerful combination of
➢ state-of-the-art accuracy
➢ interpretability
➢ specificity
➢ Ongoing effort is directed to experimental analysis of multiclass problems
and specificity.
➢ Data from other neurodegenerative diseases to extend the work further.
Prof. Giuseppe Di Fatta
Relevant Publications
▪ A. Varzandian, M.A.S. Razo, M.R. Sanders, A. Atmakuru, G. Di Fatta, “Classification-biased
apparent brain age for the prediction of Alzheimer's disease”, Frontiers in Neuroscience
(581), 2021.
▪ E.E. Bron et al., "Standardized evaluation of algorithms for computer-aided diagnosis of
dementia based on structural MRI: the CADDementia challenge", NeuroImage, an Elsevier Journal
of Brain Function, 1 May 2015, 111:562-79.
▪ A. Spedding, G. Di Fatta and J.D. Saddy, "An LDA and Probability-based Classifier for the
Diagnosis of Alzheimer's Disease from Structural MRI", Workshop on High Performance
Bioinformatics and Biomedicine (HiBB), the IEEE International Conference on Bioinformatics
and Biomedicine (BIBM), Nov. 9-12, 2015, Washington D.C., pp. 1404 - 1411.
▪ A. Spedding, G. Di Fatta and M. Cannataro, "A Genetic Algorithm for the Selection of
Structural MRI Features for Classification of Mild Cognitive Impairment and Alzheimer's
Disease", WS on Machine Learning in decision making for biomedical applications, IEEE Int.l
Conf. on Bioinformatics and Biomedicine, Nov. 9-12, 2015, Washington D.C., pp. 1566-1571.
▪ A. Sarica, G. Di Fatta, G. Smith, M. Cannataro, D. Saddy, "Advanced Feature Selection in
Multinominal Dementia Classification from Structural MRI Data", in the Proc. of the Workshop
on Computer-Aided Diagnosis of Dementia based on structural MRI data (CADDementia), Conf. on
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2014, Boston, MA, USA
▪ A. Sarica, G. Di Fatta, M. Cannataro, "K-Surfer: A KNIME extension for the management and
analysis of human brain MRI FreeSurfer/FSL Data", Int.l Conf. on Brain Informatics and
Health, 11-14 August 2014, Warsaw, Poland, Springer LNCS V.8609, 2014, pp. 481-492.
▪ M. Berthold, N. Cebron, F. Dill, G. Di Fatta, T. Gabriel, F. Georg, T. Meinl, P. Ohl, C.
Sieb, B. Wiswedel, "KNIME: the Konstanz Information Miner", Proc. 4th Annual Industrial
Simulation Conference (ISC), Palermo, Italy, June 5-7, 2006, pp.58-61.
19

More Related Content

Similar to SFSCON23 - Giuseppe Di Fatta - Data-driven knowledge discovery from Brain Imaging of Alzheimer’s disease patients

Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016
Nicola Amoroso
 
AI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelAI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth Israel
Levi Shapiro
 
Brain Tumor Detection using Neural Network
Brain Tumor Detection using Neural NetworkBrain Tumor Detection using Neural Network
Brain Tumor Detection using Neural Network
ijtsrd
 

Similar to SFSCON23 - Giuseppe Di Fatta - Data-driven knowledge discovery from Brain Imaging of Alzheimer’s disease patients (20)

An Improved Self Organizing Feature Map Classifier for Multimodal Biometric R...
An Improved Self Organizing Feature Map Classifier for Multimodal Biometric R...An Improved Self Organizing Feature Map Classifier for Multimodal Biometric R...
An Improved Self Organizing Feature Map Classifier for Multimodal Biometric R...
 
Lec.10 Dr Ahmed Elngar
Lec.10 Dr Ahmed ElngarLec.10 Dr Ahmed Elngar
Lec.10 Dr Ahmed Elngar
 
IRJET - Deep Multiple Instance Learning for Automatic Detection of Diabetic R...
IRJET - Deep Multiple Instance Learning for Automatic Detection of Diabetic R...IRJET - Deep Multiple Instance Learning for Automatic Detection of Diabetic R...
IRJET - Deep Multiple Instance Learning for Automatic Detection of Diabetic R...
 
Age and Gender Classification using Convolutional Neural Network
Age and Gender Classification using Convolutional Neural NetworkAge and Gender Classification using Convolutional Neural Network
Age and Gender Classification using Convolutional Neural Network
 
Prediction Analysis in Clinical and Basic Neuroscience
Prediction Analysis in Clinical and Basic NeurosciencePrediction Analysis in Clinical and Basic Neuroscience
Prediction Analysis in Clinical and Basic Neuroscience
 
Anits dip
Anits dipAnits dip
Anits dip
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 
Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016Curriculum_Amoroso_EN_28_07_2016
Curriculum_Amoroso_EN_28_07_2016
 
Frankie Rybicki slide set for Deep Learning in Radiology / Medicine
Frankie Rybicki slide set for Deep Learning in Radiology / MedicineFrankie Rybicki slide set for Deep Learning in Radiology / Medicine
Frankie Rybicki slide set for Deep Learning in Radiology / Medicine
 
(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging(2017/06)Practical points of deep learning for medical imaging
(2017/06)Practical points of deep learning for medical imaging
 
Pistoia Alliance debates AI in life science
Pistoia Alliance debates AI in life sciencePistoia Alliance debates AI in life science
Pistoia Alliance debates AI in life science
 
BRAINREGION.pptx
BRAINREGION.pptxBRAINREGION.pptx
BRAINREGION.pptx
 
AI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth IsraelAI at GSK_Kim Branson_mHealth Israel
AI at GSK_Kim Branson_mHealth Israel
 
Abstract_537fbfbb4b9a70c487a6fdc8bca78d5a-2.pdf
Abstract_537fbfbb4b9a70c487a6fdc8bca78d5a-2.pdfAbstract_537fbfbb4b9a70c487a6fdc8bca78d5a-2.pdf
Abstract_537fbfbb4b9a70c487a6fdc8bca78d5a-2.pdf
 
Brain Tumor Detection using Neural Network
Brain Tumor Detection using Neural NetworkBrain Tumor Detection using Neural Network
Brain Tumor Detection using Neural Network
 
The Science Behind the Data_ A Deep Dive into Data Science.pdf
The Science Behind the Data_ A Deep Dive into Data Science.pdfThe Science Behind the Data_ A Deep Dive into Data Science.pdf
The Science Behind the Data_ A Deep Dive into Data Science.pdf
 
ARTIFICIAL NEURAL NETWORKING in Dentistry
ARTIFICIAL NEURAL NETWORKING in DentistryARTIFICIAL NEURAL NETWORKING in Dentistry
ARTIFICIAL NEURAL NETWORKING in Dentistry
 
IJSRED-V2I5P44
IJSRED-V2I5P44IJSRED-V2I5P44
IJSRED-V2I5P44
 
Classification of Headache Disorder Using Random Forest Algorithm.pptx
Classification of Headache Disorder Using Random Forest Algorithm.pptxClassification of Headache Disorder Using Random Forest Algorithm.pptx
Classification of Headache Disorder Using Random Forest Algorithm.pptx
 
[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric
[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric
[DigiHealth 22] Artificial intelligence in medicine - Kristijan Saric
 

More from South Tyrol Free Software Conference

SFSCON23 - Rufai Omowunmi Balogun - SMODEX – a Python package for understandi...
SFSCON23 - Rufai Omowunmi Balogun - SMODEX – a Python package for understandi...SFSCON23 - Rufai Omowunmi Balogun - SMODEX – a Python package for understandi...
SFSCON23 - Rufai Omowunmi Balogun - SMODEX – a Python package for understandi...
South Tyrol Free Software Conference
 
SFSCON23 - Marianna d'Atri Enrico Zanardo - How can Blockchain technologies i...
SFSCON23 - Marianna d'Atri Enrico Zanardo - How can Blockchain technologies i...SFSCON23 - Marianna d'Atri Enrico Zanardo - How can Blockchain technologies i...
SFSCON23 - Marianna d'Atri Enrico Zanardo - How can Blockchain technologies i...
South Tyrol Free Software Conference
 
SFSCON23 - Giovanni Giannotta - Intelligent Decision Support System for trace...
SFSCON23 - Giovanni Giannotta - Intelligent Decision Support System for trace...SFSCON23 - Giovanni Giannotta - Intelligent Decision Support System for trace...
SFSCON23 - Giovanni Giannotta - Intelligent Decision Support System for trace...
South Tyrol Free Software Conference
 
SFSCON23 - Elena Maines - Embracing CI/CD workflows for building ETL pipelines
SFSCON23 - Elena Maines - Embracing CI/CD workflows for building ETL pipelinesSFSCON23 - Elena Maines - Embracing CI/CD workflows for building ETL pipelines
SFSCON23 - Elena Maines - Embracing CI/CD workflows for building ETL pipelines
South Tyrol Free Software Conference
 
SFSCON23 - Johannes Näder Linus Sehn - Let’s monitor implementation of Free S...
SFSCON23 - Johannes Näder Linus Sehn - Let’s monitor implementation of Free S...SFSCON23 - Johannes Näder Linus Sehn - Let’s monitor implementation of Free S...
SFSCON23 - Johannes Näder Linus Sehn - Let’s monitor implementation of Free S...
South Tyrol Free Software Conference
 
SFSCON23 - Edoardo Scepi - The Brand-New Version of IGis Maps
SFSCON23 - Edoardo Scepi - The Brand-New Version of IGis MapsSFSCON23 - Edoardo Scepi - The Brand-New Version of IGis Maps
SFSCON23 - Edoardo Scepi - The Brand-New Version of IGis Maps
South Tyrol Free Software Conference
 

More from South Tyrol Free Software Conference (20)

SFSCON23 - Rufai Omowunmi Balogun - SMODEX – a Python package for understandi...
SFSCON23 - Rufai Omowunmi Balogun - SMODEX – a Python package for understandi...SFSCON23 - Rufai Omowunmi Balogun - SMODEX – a Python package for understandi...
SFSCON23 - Rufai Omowunmi Balogun - SMODEX – a Python package for understandi...
 
SFSCON23 - Roberto Innocenti - From the design to reality is here the Communi...
SFSCON23 - Roberto Innocenti - From the design to reality is here the Communi...SFSCON23 - Roberto Innocenti - From the design to reality is here the Communi...
SFSCON23 - Roberto Innocenti - From the design to reality is here the Communi...
 
SFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data Hub
SFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data HubSFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data Hub
SFSCON23 - Martin Rabanser - Real-time aeroplane tracking and the Open Data Hub
 
SFSCON23 - Marianna d'Atri Enrico Zanardo - How can Blockchain technologies i...
SFSCON23 - Marianna d'Atri Enrico Zanardo - How can Blockchain technologies i...SFSCON23 - Marianna d'Atri Enrico Zanardo - How can Blockchain technologies i...
SFSCON23 - Marianna d'Atri Enrico Zanardo - How can Blockchain technologies i...
 
SFSCON23 - Lucas Lasota - The Future of Connectivity, Open Internet and Human...
SFSCON23 - Lucas Lasota - The Future of Connectivity, Open Internet and Human...SFSCON23 - Lucas Lasota - The Future of Connectivity, Open Internet and Human...
SFSCON23 - Lucas Lasota - The Future of Connectivity, Open Internet and Human...
 
SFSCON23 - Giovanni Giannotta - Intelligent Decision Support System for trace...
SFSCON23 - Giovanni Giannotta - Intelligent Decision Support System for trace...SFSCON23 - Giovanni Giannotta - Intelligent Decision Support System for trace...
SFSCON23 - Giovanni Giannotta - Intelligent Decision Support System for trace...
 
SFSCON23 - Elena Maines - Embracing CI/CD workflows for building ETL pipelines
SFSCON23 - Elena Maines - Embracing CI/CD workflows for building ETL pipelinesSFSCON23 - Elena Maines - Embracing CI/CD workflows for building ETL pipelines
SFSCON23 - Elena Maines - Embracing CI/CD workflows for building ETL pipelines
 
SFSCON23 - Christian Busse - Free Software and Open Science
SFSCON23 - Christian Busse - Free Software and Open ScienceSFSCON23 - Christian Busse - Free Software and Open Science
SFSCON23 - Christian Busse - Free Software and Open Science
 
SFSCON23 - Charles H. Schulz - Why open digital infrastructure matters
SFSCON23 - Charles H. Schulz - Why open digital infrastructure mattersSFSCON23 - Charles H. Schulz - Why open digital infrastructure matters
SFSCON23 - Charles H. Schulz - Why open digital infrastructure matters
 
SFSCON23 - Andrea Vianello - Achieving FAIRness with EDP-portal
SFSCON23 - Andrea Vianello - Achieving FAIRness with EDP-portalSFSCON23 - Andrea Vianello - Achieving FAIRness with EDP-portal
SFSCON23 - Andrea Vianello - Achieving FAIRness with EDP-portal
 
SFSCON23 - Thomas Aichner - How IoT and AI are revolutionizing Mass Customiza...
SFSCON23 - Thomas Aichner - How IoT and AI are revolutionizing Mass Customiza...SFSCON23 - Thomas Aichner - How IoT and AI are revolutionizing Mass Customiza...
SFSCON23 - Thomas Aichner - How IoT and AI are revolutionizing Mass Customiza...
 
SFSCON23 - Stefan Mutschlechner - Smart Werke Meran
SFSCON23 - Stefan Mutschlechner - Smart Werke MeranSFSCON23 - Stefan Mutschlechner - Smart Werke Meran
SFSCON23 - Stefan Mutschlechner - Smart Werke Meran
 
SFSCON23 - Mirko Boehm - European regulators cast their eyes on maturing OSS ...
SFSCON23 - Mirko Boehm - European regulators cast their eyes on maturing OSS ...SFSCON23 - Mirko Boehm - European regulators cast their eyes on maturing OSS ...
SFSCON23 - Mirko Boehm - European regulators cast their eyes on maturing OSS ...
 
SFSCON23 - Marco Pavanelli - Monitoring the fleet of Sasa with free software
SFSCON23 - Marco Pavanelli - Monitoring the fleet of Sasa with free softwareSFSCON23 - Marco Pavanelli - Monitoring the fleet of Sasa with free software
SFSCON23 - Marco Pavanelli - Monitoring the fleet of Sasa with free software
 
SFSCON23 - Marco Cortella - KNOWAGE and AICS for 2030 agenda SDG goals monito...
SFSCON23 - Marco Cortella - KNOWAGE and AICS for 2030 agenda SDG goals monito...SFSCON23 - Marco Cortella - KNOWAGE and AICS for 2030 agenda SDG goals monito...
SFSCON23 - Marco Cortella - KNOWAGE and AICS for 2030 agenda SDG goals monito...
 
SFSCON23 - Lina Ceballos - Interoperable Europe Act - A real game changer
SFSCON23 - Lina Ceballos - Interoperable Europe Act - A real game changerSFSCON23 - Lina Ceballos - Interoperable Europe Act - A real game changer
SFSCON23 - Lina Ceballos - Interoperable Europe Act - A real game changer
 
SFSCON23 - Johannes Näder Linus Sehn - Let’s monitor implementation of Free S...
SFSCON23 - Johannes Näder Linus Sehn - Let’s monitor implementation of Free S...SFSCON23 - Johannes Näder Linus Sehn - Let’s monitor implementation of Free S...
SFSCON23 - Johannes Näder Linus Sehn - Let’s monitor implementation of Free S...
 
SFSCON23 - Gabriel Ku Wei Bin - Why Do We Need A Next Generation Internet
SFSCON23 - Gabriel Ku Wei Bin - Why Do We Need A Next Generation InternetSFSCON23 - Gabriel Ku Wei Bin - Why Do We Need A Next Generation Internet
SFSCON23 - Gabriel Ku Wei Bin - Why Do We Need A Next Generation Internet
 
SFSCON23 - Edoardo Scepi - The Brand-New Version of IGis Maps
SFSCON23 - Edoardo Scepi - The Brand-New Version of IGis MapsSFSCON23 - Edoardo Scepi - The Brand-New Version of IGis Maps
SFSCON23 - Edoardo Scepi - The Brand-New Version of IGis Maps
 
SFSCON23 - Davide Vernassa - Empowering Insights Unveiling the latest innova...
SFSCON23 - Davide Vernassa - Empowering Insights  Unveiling the latest innova...SFSCON23 - Davide Vernassa - Empowering Insights  Unveiling the latest innova...
SFSCON23 - Davide Vernassa - Empowering Insights Unveiling the latest innova...
 

Recently uploaded

Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Recently uploaded (20)

ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
WSO2 Micro Integrator for Enterprise Integration in a Decentralized, Microser...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
الأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهلهالأمن السيبراني - ما لا يسع للمستخدم جهله
الأمن السيبراني - ما لا يسع للمستخدم جهله
 

SFSCON23 - Giuseppe Di Fatta - Data-driven knowledge discovery from Brain Imaging of Alzheimer’s disease patients

  • 1. Data-driven knowledge discovery from Brain Imaging of Alzheimer’s disease patients Prof. Giuseppe Di Fatta Director of the MSc Computing for Data Science Computer Science and Artificial Intelligence Institute Faculty of Engineering Free University of Bozen-Bolzano, Italy Giuseppe.DiFatta@unibz.it The 22nd South Tyrol Free Software Conference (SFSCON) NOI Techpark, Bolzano, Italy 10 November 2023
  • 2. Prof. Giuseppe Di Fatta 2 Outline • Motivations • Dementia and Alzheimer’s Disease (AD) • MRI data-driven solutions for Alzheimer’s disease classification – Data and data processing – Machine Learning models • outliers detection, feature selection, regression and classification – Explainability/Interpretability • Conclusions
  • 3. Prof. Giuseppe Di Fatta Dementia and Alzheimer’s Disease • Mental disorders are one of the five most costly conditions for medical spending. • Dementia is the most expensive condition for medical and social cost and, still, it receives only a fraction of the research funds w.r.t. other health conditions. • Alzheimer’s Disease (AD): 60-80% of dementia cases ▪ Several national and international campaigns aimed at raising awareness and attention (e.g., www.worldalzmonth.org) ▪ Currently 40+ million people suffer from AD: 130-150 million by 2050, one of the most significant global social economic health crises. ▪ The exact cause of AD is unknown. ▪ Importance of early and accurate diagnosis: quality of life can be significantly improved. 3
  • 4. Prof. Giuseppe Di Fatta Alzheimer’s Disease (AD) 4
  • 5. Prof. Giuseppe Di Fatta Data Workflow 5 data gathering, selection Feature generation (stats on RoIs) Image Segmentation Machine Learning Pre-processing • A single MRI scan file typically takes 30-40 MB. • After pre-processing the files generated for a single image take 350-500 MB. • Pre-procesing of a single image takes ~8h ➔ several months or even years • A data parallel approach used to speed up computation ➔ a few months Brain MRIs from 3170 subjects a never ending effort
  • 6. Prof. Giuseppe Di Fatta Data Workflow 6 data gathering, selection Feature generation (stats on RoIs) Image Segmentation Machine Learning Machine Learning tasks • Outlier Detection • Feature selection • Training predictive models • Regression for age estimation • Classification: binary and multiclass predictive models • Testing the models: robust performance estimation • Explainability and Interpretability Brain MRIs from 3170 subjects Various ML tasks
  • 7. Prof. Giuseppe Di Fatta KNIME for 7 ▪ KNIME is an open-source Data Science and Machine Learning Platform ▪ codeless Machine Learning and Data Science ▪ open-source and free (General Public License) ▪ open and extendible platform a KNIME data workflow (an acyclic graph of computations)
  • 8. Prof. Giuseppe Di Fatta Various Approaches since 2014 Binary (HC vs AD) and multiclass classification problems • Input features: – No selection: all available (~400) numerical measurements from brain ROIs – Domain-specific feature selection: 16 hippocampal subfields – Automatic feature selection • Filter/Wrapper/Embedded methods • Classification methods I – focus on predictive accuracy: 1. Binary Region Classifier 2. Probability Weight-based Classification 3. Bayesian Classifier 4. Support Vector Machine 5. Deep Neural Networks • Classification methods II – predictive accuracy and descriptive power: – brain age estimation 8 85.8 84.7 84.9
  • 9. Prof. Giuseppe Di Fatta Predictive Task: Regression ❑ Input data: structural MRI scans (T1 weighting) of human brains – Healthy Control (HC) only or in combination with – Alzheimer’s Disease (AD) (or a different condition) ❑ Regression task: – Estimate the subject’s age from a structural MRI of the brain • e.g., Brain Age Gap Estimation (BrainAGE) – Challenges: data manipulation, accuracy, curse of dimensionality, confidence of models 9
  • 10. Prof. Giuseppe Di Fatta Brain Age Gap Estimation • Brain Age Gap Estimation (BrainAGE) – BrainAGE quantifies the difference between the actual anagraphic age of a subject and their age as predicted by ML models applied to neuroimaging data – It is considered a biomarker of brain health. – Typical method: age model of healthy subjects • Voxel-Based Morphometry (VBM) approach: PCA applied to 3,700 voxels to reduce the dimensionality of the input space • Relevance Vector Regression (RVR) and Support Vector Regression (SVR) 10
  • 11. Prof. Giuseppe Di Fatta The “Apparent” Brain Age • The Apparent Brain Age (ABA) for a specific neurodegenerative condition – goal-conditioned brain age estimation based on region-wise segmentation – inductive bias based on feature selection in order to improve the classification accuracy of the estimated brain age Objectives: simplest model with high accuracy, interpretability and specificity ✓ the design of a data workflow with a combination of only linear models to preserve the original input space semantics ✓ the definition of a novel feature score to measure the specific contribution of each selected morphological region to the classification prediction ✓ a rigorous and extensive experimental comparative analysis to validate the method: comparable or superior accuracy than SOTA ML methods ✓ demonstrate the interpretability of the classification method 11 ▪ A. Varzandian, M.A.S. Razo, M.R. Sanders, A. Atmakuru, G. Di Fatta, “Classification-biased apparent brain age for the prediction of Alzheimer's disease”, Frontiers in Neuroscience (581), 2021.
  • 12. Prof. Giuseppe Di Fatta ABA, Classification and Interpretability 1. Biased Forward Feature Selection (BFFS) as coarse-grained method: cross-validation on a correlation-based filter 2. Linear regression is applied to the selected features: LASSO regression with fine-grained feature selection method 3. Logistic regression based only on ABA and actual age 4. Interpretability with a measure of feature importance/score 12 ➔Feature Score: where {fi} are the k ROI features, ai and cj are model parameters with
  • 13. Prof. Giuseppe Di Fatta Feature Selection (step 1) • results based on 1901 subjects (AD and CN) 13
  • 14. Prof. Giuseppe Di Fatta ABA Prediction and Classification (2 and 3) 14 M5 M6
  • 15. Prof. Giuseppe Di Fatta Classification Accuracy (3) • results based on 1901 subjects (AD and CN) 15 Baseline SVM SVM DN +data +data +data +BFFS +data ABA & LogReg Baseline SVM SVM DN +data +data +data +BFFS +data ABA & LogReg
  • 16. Prof. Giuseppe Di Fatta Feature Score (4): True Positive 16
  • 17. Prof. Giuseppe Di Fatta Feature Score (4): True Negative 17
  • 18. Prof. Giuseppe Di Fatta 18 Conclusions ▪ With same pre-processing and features most classification methods are similar in terms of accuracy. However, black-box approaches do lack descriptive capabilities, especially in such high-dimensional spaces. ▪ A new candidate biomarker, the Apparent Brain Age, helps at improving predictive accuracy and can be useful on itself for its descriptive power. ▪ The new feature score helps to link age gaps to specific ROIs for a powerful combination of ➢ state-of-the-art accuracy ➢ interpretability ➢ specificity ➢ Ongoing effort is directed to experimental analysis of multiclass problems and specificity. ➢ Data from other neurodegenerative diseases to extend the work further.
  • 19. Prof. Giuseppe Di Fatta Relevant Publications ▪ A. Varzandian, M.A.S. Razo, M.R. Sanders, A. Atmakuru, G. Di Fatta, “Classification-biased apparent brain age for the prediction of Alzheimer's disease”, Frontiers in Neuroscience (581), 2021. ▪ E.E. Bron et al., "Standardized evaluation of algorithms for computer-aided diagnosis of dementia based on structural MRI: the CADDementia challenge", NeuroImage, an Elsevier Journal of Brain Function, 1 May 2015, 111:562-79. ▪ A. Spedding, G. Di Fatta and J.D. Saddy, "An LDA and Probability-based Classifier for the Diagnosis of Alzheimer's Disease from Structural MRI", Workshop on High Performance Bioinformatics and Biomedicine (HiBB), the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Nov. 9-12, 2015, Washington D.C., pp. 1404 - 1411. ▪ A. Spedding, G. Di Fatta and M. Cannataro, "A Genetic Algorithm for the Selection of Structural MRI Features for Classification of Mild Cognitive Impairment and Alzheimer's Disease", WS on Machine Learning in decision making for biomedical applications, IEEE Int.l Conf. on Bioinformatics and Biomedicine, Nov. 9-12, 2015, Washington D.C., pp. 1566-1571. ▪ A. Sarica, G. Di Fatta, G. Smith, M. Cannataro, D. Saddy, "Advanced Feature Selection in Multinominal Dementia Classification from Structural MRI Data", in the Proc. of the Workshop on Computer-Aided Diagnosis of Dementia based on structural MRI data (CADDementia), Conf. on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2014, Boston, MA, USA ▪ A. Sarica, G. Di Fatta, M. Cannataro, "K-Surfer: A KNIME extension for the management and analysis of human brain MRI FreeSurfer/FSL Data", Int.l Conf. on Brain Informatics and Health, 11-14 August 2014, Warsaw, Poland, Springer LNCS V.8609, 2014, pp. 481-492. ▪ M. Berthold, N. Cebron, F. Dill, G. Di Fatta, T. Gabriel, F. Georg, T. Meinl, P. Ohl, C. Sieb, B. Wiswedel, "KNIME: the Konstanz Information Miner", Proc. 4th Annual Industrial Simulation Conference (ISC), Palermo, Italy, June 5-7, 2006, pp.58-61. 19