SlideShare a Scribd company logo
1 of 22
Classification of FDG-PET* Brain Data
* Fluorodeoxyglucose positron emission tomography
Deborah Mudali 1,* Michael Biehl 1
Klaus L. Leenders 2 Jos B.T.M. Roerdink 1,3
1 Johann Bernoulli Institute for Mathematics
and Computer Science, University of Groningen, NL
2 Department of Neurology
University Medical Center Groningen, NL
3 Neuroimaging Center
University Medical Center Groningen, NL
* Mbarara University of Science & Technology, Uganda
WSOM, Houston, 2016 2
overview
Example application
Classification of Parkinsonian Syndromes
based on FDG-PET brain data
Combination: PCA + GMLVQ
comparison with DT, SVM
Conclusion and Outlook
Prototype-based classification
Learning Vector Quantization
Generalized Matrix Relevance Learning (GMLVQ)
WSOM, Houston, 2016
∙ identification of prototype vectors from labeled example data
∙ (dis)-similarity based classification (e.g. Euclidean distance)
Learning Vector Quantization
N-dimensional data, feature vectors
• initialize prototype vectors
for different classes
competitive learning: Winner-Takes-All LVQ1 [Kohonen, 1990, 1997]
• identify the winner
(closest prototype)
• present a single example
• move the winner
- closer towards the data (same class)
- away from the data (different class)
feature space
WSOM, Houston, 2016
prototype based classifier
- represent data by one or
several prototypes per class
- classify a query according to the
label of the nearest prototype
(or alternative voting schemes)
- local decision boundaries according
to (e.g.) Euclidean distances
+ robustness to outliers, low storage needs and computational effort
- model selection: number of prototypes per class, etc.
feature space
?
? appropriate distance / (dis-) similarity measure
+ parameterization in feature space, interpretability
WSOM, Houston, 2016 5
fixed distance measures:
- choice based on prior knowledge or preprocessing
- determine prototypes from example data
by means of (iterative) learning schemes
e.g. heuristic LVQ1, cost function based Generalized LVQ
relevance learning, adaptive distances:
- employ parameterized distance measure
- update parameters in one training process with prototypes
- optimize adaptive, data driven dissimilarity
example: Matrix Relevance LVQ
Learning Vector Quantization
WSOM, Houston, 2016
Relevance Matrix LVQ
generalized quadratic distance in LVQ:
variants: global/local matrices (piecewise quadratic boundaries)
diagonal relevances (single feature weights)
rectangular (low-dim. representation)
[Schneider et al., 2009]
relevance matrix:
quantifies importance of features and pairs of features
summarizes relevance of feature j
( for equally scaled features )
training: optimize prototypes and Λ w.r.t. classification of examples
WSOM, Houston, 2016
cost function based training
one example: Generalized LVQ [Sato & Yamada, 1995]
sigmoidal (linear for small arguments), e.g.
E approximates number of misclassifications
linear
E favors large margin separation of classes, e.g.
two winning prototypes:
minimize
small , large
E favors class-typical prototypes
WSOM, Houston, 2016
cost function based LVQ
There is nothing objective about objective functions
James McClelland
WSOM, Houston, 2016 9
FDG-PET (Fluorodeoxyglucose positron emission tomography, 3d-images)
condition
Glucoseuptake
n=18 HC
Healhy controls
n= 20 PD
Parkinson’s Disease
n=21 MSA
Multiple System Atrophy
n=17 PSP
Progressive Supranuclear
Palsy
classification of FDG-PET data
[http://glimpsproject.com]
WSOM, Houston, 2016 10
work flow
subjects 1….P
voxels1….N(N≈200000)
SubjectResidualProfileSRP
log-transformed
high-intensityvoxels
GroupInvariant
Subprofile(GIS)
subjectsocres1….P
subjects 1….P
Scaled Subprofile Model PCA
based on a given group of subjects
SSMPCA
data and pre-processing:
D. Mudali, L.K. Teune, R. J. Renken, K. L. Leenders,
J. B. T. M. Roerdink. Computational and Mathematical
Methods in Medicine. March 2015, Art.ID 136921, 10p.
and refs. therein
WSOM, Houston, 2016 11
work flow
subjects 1….P
voxels1….N(N≈200000)
SubjectResidualProfileSRP
log-transformed
high-intensityvoxels
GroupInvariant
Subprofile(GIS)
subjectsocres1….P
subjects 1….P
Scaled Subprofile Model PCA
based on a given group of subjects
applied to
novel subject
test
labels
(condition)
GMLVQ classifier
prototypes and distance
?
SSMPCA
WSOM, Houston, 2016 12
Healthy controls vs. Parkinson’s Disease
38 leave-one-out validation runs
averaged…
prototypes relevance matrix
ROC of leave-one-out
prediction
example: HC vs. PD
(w/o z-score transform.)
WSOM, Houston, 2016 13
Healthy controls vs. Progressive Supranuclear Palsy
35 leave-one-out validation runs,
averaged…
prototypes relevance matrix
example: HC vs. PSP
ROC of leave-one-out
prediction
(w/o z-score transform.)
WSOM, Houston, 2016 14
GMLVQ
NPC
accuracies
Note:
maximum margin perceptron - aka SVM with linear kernel - (Matlab svmtrain)
achieves performance similar to GMLVQ
performance comparison
Decision tree
(C4.5)
using all PC
Mudali et al.
2015
WSOM, Houston, 2016 15
four classes: HC / PD / MSA / PSP
leave-one-out confusion matrix for the four-class problem
GM
lin.
77.8 %
65.0 %
64.7 %
76.2 %
class acc.
66.7 %
60.0 %
52.9 %
89.0 %
class acc.(1 vs 1)
WSOM, Houston, 2016 16
HC / PD / MSA / PSP
HC
PSP
PD
MSA
GMLVQ
visualization of training
data set in terms of the
leading eigenvectors of Λ
WSOM, Houston, 2016 17
diseases only: PD / MSA / PSP
leave-one-out confusion matrix for the three-class problem
lin.(1 vs 1)
WSOM, Houston, 2016 18
diseases only: PD / MSA / PSP
MSA
PD
PSP
GMLVQ
visualization of training
data set in terms of the
leading eigenvectors of Λ
WSOM, Houston, 2016 19
discussion / conclusion
- detection and discrimination of Parkinsonian syndromes:
GMLVQ classifier and SVM clearly outperform decision trees
decision trees
- serious limitations:
small data set
leave-one-out validation
over-fitting
- accuracy is not enough:
can we obtain better insight into the classifiers ?
WSOM, Houston, 2016 20
outlook/work in progress
- optimization of the number of PCs used as features
shown to improve decision tree performance
potential improvement for other classifiers
- larger data sets
- understanding relevances in voxel-space
relevant PC hint at discriminative between-patient variability
PCA:
recent example:
diagnosis of rheumatoid arthritis based on cytokine expression
[L. Yeo et al., Ann. of the Rheumatic Diseases, 2015]
WSOM, Houston, 2016 21
http://matlabserver.cs.rug.nl/gmlvqweb/web/
Matlab code:
Relevance and Matrix adaptation in Learning Vector
Quantization (GRLVQ, GMLVQ and LiRaM LVQ):
http://www.cs.rug.nl/~biehl/
links
Pre- and re-prints etc.:
A no-nonsense beginners’ tool for GMLVQ:
http://www.cs.rug.nl/~biehl/gmlvq
WSOM, Houston, 2016
Questions ?

More Related Content

What's hot

Usage of Semantic Web Technologies (Web 3.0) Aiming to Facilitate the Utilisa...
Usage of Semantic Web Technologies (Web 3.0) Aiming to Facilitate the Utilisa...Usage of Semantic Web Technologies (Web 3.0) Aiming to Facilitate the Utilisa...
Usage of Semantic Web Technologies (Web 3.0) Aiming to Facilitate the Utilisa...
Gunther Eysenbach
 
Data clustering a review
Data clustering a reviewData clustering a review
Data clustering a review
unyil96
 
ppt slides
ppt slidesppt slides
ppt slides
butest
 
Iaetsd an enhanced feature selection for
Iaetsd an enhanced feature selection forIaetsd an enhanced feature selection for
Iaetsd an enhanced feature selection for
Iaetsd Iaetsd
 

What's hot (20)

Multi-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data MiningMulti-Cluster Based Approach for skewed Data in Data Mining
Multi-Cluster Based Approach for skewed Data in Data Mining
 
Classification of Breast Cancer Diseases using Data Mining Techniques
Classification of Breast Cancer Diseases using Data Mining TechniquesClassification of Breast Cancer Diseases using Data Mining Techniques
Classification of Breast Cancer Diseases using Data Mining Techniques
 
Brain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector MachineBrain Tumor Classification using Support Vector Machine
Brain Tumor Classification using Support Vector Machine
 
Oversampling technique in student performance classification from engineering...
Oversampling technique in student performance classification from engineering...Oversampling technique in student performance classification from engineering...
Oversampling technique in student performance classification from engineering...
 
Analysis On Classification Techniques In Mammographic Mass Data Set
Analysis On Classification Techniques In Mammographic Mass Data SetAnalysis On Classification Techniques In Mammographic Mass Data Set
Analysis On Classification Techniques In Mammographic Mass Data Set
 
Second subjective assignment
Second  subjective assignmentSecond  subjective assignment
Second subjective assignment
 
BAYESIAN ENSEMBLE CLASSIFIER (VIDEO 3/4)
BAYESIAN ENSEMBLE CLASSIFIER (VIDEO 3/4)BAYESIAN ENSEMBLE CLASSIFIER (VIDEO 3/4)
BAYESIAN ENSEMBLE CLASSIFIER (VIDEO 3/4)
 
Usage of Semantic Web Technologies (Web 3.0) Aiming to Facilitate the Utilisa...
Usage of Semantic Web Technologies (Web 3.0) Aiming to Facilitate the Utilisa...Usage of Semantic Web Technologies (Web 3.0) Aiming to Facilitate the Utilisa...
Usage of Semantic Web Technologies (Web 3.0) Aiming to Facilitate the Utilisa...
 
EXTRACTION OF SEQUENTIAL RULES (VIDEO 4/4)
EXTRACTION OF SEQUENTIAL RULES (VIDEO 4/4)EXTRACTION OF SEQUENTIAL RULES (VIDEO 4/4)
EXTRACTION OF SEQUENTIAL RULES (VIDEO 4/4)
 
Data clustering a review
Data clustering a reviewData clustering a review
Data clustering a review
 
Edbt2014 talk
Edbt2014 talkEdbt2014 talk
Edbt2014 talk
 
Short Story Submission on Meta Learning
Short Story Submission on Meta LearningShort Story Submission on Meta Learning
Short Story Submission on Meta Learning
 
Effect of Feature Selection on Gene Expression Datasets Classification Accura...
Effect of Feature Selection on Gene Expression Datasets Classification Accura...Effect of Feature Selection on Gene Expression Datasets Classification Accura...
Effect of Feature Selection on Gene Expression Datasets Classification Accura...
 
Breast Cancer
Breast CancerBreast Cancer
Breast Cancer
 
Deep Neural Networks in Text Classification using Active Learning
Deep Neural Networks in Text Classification using Active LearningDeep Neural Networks in Text Classification using Active Learning
Deep Neural Networks in Text Classification using Active Learning
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
IRJET- Survey of Feature Selection based on Ant Colony
IRJET- Survey of Feature Selection based on Ant ColonyIRJET- Survey of Feature Selection based on Ant Colony
IRJET- Survey of Feature Selection based on Ant Colony
 
ppt slides
ppt slidesppt slides
ppt slides
 
Drug discovery presentation
Drug discovery presentationDrug discovery presentation
Drug discovery presentation
 
Iaetsd an enhanced feature selection for
Iaetsd an enhanced feature selection forIaetsd an enhanced feature selection for
Iaetsd an enhanced feature selection for
 

Similar to 2016: Classification of FDG-PET Brain Data

Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning
Shenghui Wang
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
chenhm
 

Similar to 2016: Classification of FDG-PET Brain Data (20)

Biehl hanze-2021
Biehl hanze-2021Biehl hanze-2021
Biehl hanze-2021
 
January 2020: Prototype-based systems in machine learning
January 2020: Prototype-based systems in machine learning  January 2020: Prototype-based systems in machine learning
January 2020: Prototype-based systems in machine learning
 
Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]Wasserstein 1031 thesis [Chung il kim]
Wasserstein 1031 thesis [Chung il kim]
 
PyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear ModelsPyData Miami 2019, Quantum Generalized Linear Models
PyData Miami 2019, Quantum Generalized Linear Models
 
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSISFUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
FUNCTION OF RIVAL SIMILARITY IN A COGNITIVE DATA ANALYSIS
 
A parsimonious SVM model selection criterion for classification of real-world ...
A parsimonious SVM model selection criterion for classification of real-world ...A parsimonious SVM model selection criterion for classification of real-world ...
A parsimonious SVM model selection criterion for classification of real-world ...
 
Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning Similarity Features, and their Role in Concept Alignment Learning
Similarity Features, and their Role in Concept Alignment Learning
 
Prototype-based models in machine learning
Prototype-based models in machine learningPrototype-based models in machine learning
Prototype-based models in machine learning
 
Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...Online learning in estimation of distribution algorithms for dynamic environm...
Online learning in estimation of distribution algorithms for dynamic environm...
 
Text categorization
Text categorizationText categorization
Text categorization
 
Abrigo and love_2015_
Abrigo and love_2015_Abrigo and love_2015_
Abrigo and love_2015_
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
 
mix2.pdf
mix2.pdfmix2.pdf
mix2.pdf
 
MLHEP Lectures - day 1, basic track
MLHEP Lectures - day 1, basic trackMLHEP Lectures - day 1, basic track
MLHEP Lectures - day 1, basic track
 
Reweighting and Boosting to uniforimty in HEP
Reweighting and Boosting to uniforimty in HEPReweighting and Boosting to uniforimty in HEP
Reweighting and Boosting to uniforimty in HEP
 
A Fast Multi-objective Evolutionary Approach for Designing Large-Scale Optica...
A Fast Multi-objective Evolutionary Approach for Designing Large-Scale Optica...A Fast Multi-objective Evolutionary Approach for Designing Large-Scale Optica...
A Fast Multi-objective Evolutionary Approach for Designing Large-Scale Optica...
 
The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)The Gaussian Process Latent Variable Model (GPLVM)
The Gaussian Process Latent Variable Model (GPLVM)
 
Aj Copulas V4
Aj Copulas V4Aj Copulas V4
Aj Copulas V4
 
Dycops2019
Dycops2019 Dycops2019
Dycops2019
 
[ppt]
[ppt][ppt]
[ppt]
 

More from University of Groningen

More from University of Groningen (16)

Interpretable machine learning in endocrinology, M. Biehl, APPIS 2024
Interpretable machine learning in endocrinology, M. Biehl, APPIS 2024Interpretable machine learning in endocrinology, M. Biehl, APPIS 2024
Interpretable machine learning in endocrinology, M. Biehl, APPIS 2024
 
ESE-Eyes-2023.pdf
ESE-Eyes-2023.pdfESE-Eyes-2023.pdf
ESE-Eyes-2023.pdf
 
APPIS-FDGPET.pdf
APPIS-FDGPET.pdfAPPIS-FDGPET.pdf
APPIS-FDGPET.pdf
 
stat-phys-appis-reduced.pdf
stat-phys-appis-reduced.pdfstat-phys-appis-reduced.pdf
stat-phys-appis-reduced.pdf
 
prototypes-AMALEA.pdf
prototypes-AMALEA.pdfprototypes-AMALEA.pdf
prototypes-AMALEA.pdf
 
stat-phys-AMALEA.pdf
stat-phys-AMALEA.pdfstat-phys-AMALEA.pdf
stat-phys-AMALEA.pdf
 
Evidence for tissue and stage-specific composition of the ribosome: machine l...
Evidence for tissue and stage-specific composition of the ribosome: machine l...Evidence for tissue and stage-specific composition of the ribosome: machine l...
Evidence for tissue and stage-specific composition of the ribosome: machine l...
 
The statistical physics of learning revisted: Phase transitions in layered ne...
The statistical physics of learning revisted: Phase transitions in layered ne...The statistical physics of learning revisted: Phase transitions in layered ne...
The statistical physics of learning revisted: Phase transitions in layered ne...
 
2020: Prototype-based classifiers and relevance learning: medical application...
2020: Prototype-based classifiers and relevance learning: medical application...2020: Prototype-based classifiers and relevance learning: medical application...
2020: Prototype-based classifiers and relevance learning: medical application...
 
2020: Phase transitions in layered neural networks: ReLU vs. sigmoidal activa...
2020: Phase transitions in layered neural networks: ReLU vs. sigmoidal activa...2020: Phase transitions in layered neural networks: ReLU vs. sigmoidal activa...
2020: Phase transitions in layered neural networks: ReLU vs. sigmoidal activa...
 
2020: So you thought the ribosome was constant and conserved ...
2020: So you thought the ribosome was constant and conserved ... 2020: So you thought the ribosome was constant and conserved ...
2020: So you thought the ribosome was constant and conserved ...
 
The statistical physics of learning - revisited
The statistical physics of learning - revisitedThe statistical physics of learning - revisited
The statistical physics of learning - revisited
 
2013: Sometimes you can trust a rat - The sbv improver species translation ch...
2013: Sometimes you can trust a rat - The sbv improver species translation ch...2013: Sometimes you can trust a rat - The sbv improver species translation ch...
2013: Sometimes you can trust a rat - The sbv improver species translation ch...
 
2016: Predicting Recurrence in Clear Cell Renal Cell Carcinoma
2016: Predicting Recurrence in Clear Cell Renal Cell Carcinoma2016: Predicting Recurrence in Clear Cell Renal Cell Carcinoma
2016: Predicting Recurrence in Clear Cell Renal Cell Carcinoma
 
2017: Prototype-based models in unsupervised and supervised machine learning
2017: Prototype-based models in unsupervised and supervised machine learning2017: Prototype-based models in unsupervised and supervised machine learning
2017: Prototype-based models in unsupervised and supervised machine learning
 
June 2017: Biomedical applications of prototype-based classifiers and relevan...
June 2017: Biomedical applications of prototype-based classifiers and relevan...June 2017: Biomedical applications of prototype-based classifiers and relevan...
June 2017: Biomedical applications of prototype-based classifiers and relevan...
 

Recently uploaded

Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
Cherry
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
Cherry
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptx
Cherry
 
Pteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecyclePteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecycle
Cherry
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
Cherry
 
Lipids: types, structure and important functions.
Lipids: types, structure and important functions.Lipids: types, structure and important functions.
Lipids: types, structure and important functions.
Cherry
 
Major groups of bacteria: Spirochetes, Chlamydia, Rickettsia, nanobes, mycopl...
Major groups of bacteria: Spirochetes, Chlamydia, Rickettsia, nanobes, mycopl...Major groups of bacteria: Spirochetes, Chlamydia, Rickettsia, nanobes, mycopl...
Major groups of bacteria: Spirochetes, Chlamydia, Rickettsia, nanobes, mycopl...
Cherry
 

Recently uploaded (20)

Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Plasmid: types, structure and functions.
Plasmid: types, structure and functions.Plasmid: types, structure and functions.
Plasmid: types, structure and functions.
 
Energy is the beat of life irrespective of the domains. ATP- the energy curre...
Energy is the beat of life irrespective of the domains. ATP- the energy curre...Energy is the beat of life irrespective of the domains. ATP- the energy curre...
Energy is the beat of life irrespective of the domains. ATP- the energy curre...
 
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
Genome Projects : Human, Rice,Wheat,E coli and Arabidopsis.
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
Terpineol and it's characterization pptx
Terpineol and it's characterization pptxTerpineol and it's characterization pptx
Terpineol and it's characterization pptx
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptx
 
Adaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte CarloAdaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte Carlo
 
Concept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdfConcept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdf
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
CONTRIBUTION OF PANCHANAN MAHESHWARI.pptx
CONTRIBUTION OF PANCHANAN MAHESHWARI.pptxCONTRIBUTION OF PANCHANAN MAHESHWARI.pptx
CONTRIBUTION OF PANCHANAN MAHESHWARI.pptx
 
Information science research with large language models: between science and ...
Information science research with large language models: between science and ...Information science research with large language models: between science and ...
Information science research with large language models: between science and ...
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of Asepsis
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Pteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecyclePteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecycle
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
Lipids: types, structure and important functions.
Lipids: types, structure and important functions.Lipids: types, structure and important functions.
Lipids: types, structure and important functions.
 
Major groups of bacteria: Spirochetes, Chlamydia, Rickettsia, nanobes, mycopl...
Major groups of bacteria: Spirochetes, Chlamydia, Rickettsia, nanobes, mycopl...Major groups of bacteria: Spirochetes, Chlamydia, Rickettsia, nanobes, mycopl...
Major groups of bacteria: Spirochetes, Chlamydia, Rickettsia, nanobes, mycopl...
 

2016: Classification of FDG-PET Brain Data

  • 1. Classification of FDG-PET* Brain Data * Fluorodeoxyglucose positron emission tomography Deborah Mudali 1,* Michael Biehl 1 Klaus L. Leenders 2 Jos B.T.M. Roerdink 1,3 1 Johann Bernoulli Institute for Mathematics and Computer Science, University of Groningen, NL 2 Department of Neurology University Medical Center Groningen, NL 3 Neuroimaging Center University Medical Center Groningen, NL * Mbarara University of Science & Technology, Uganda
  • 2. WSOM, Houston, 2016 2 overview Example application Classification of Parkinsonian Syndromes based on FDG-PET brain data Combination: PCA + GMLVQ comparison with DT, SVM Conclusion and Outlook Prototype-based classification Learning Vector Quantization Generalized Matrix Relevance Learning (GMLVQ)
  • 3. WSOM, Houston, 2016 ∙ identification of prototype vectors from labeled example data ∙ (dis)-similarity based classification (e.g. Euclidean distance) Learning Vector Quantization N-dimensional data, feature vectors • initialize prototype vectors for different classes competitive learning: Winner-Takes-All LVQ1 [Kohonen, 1990, 1997] • identify the winner (closest prototype) • present a single example • move the winner - closer towards the data (same class) - away from the data (different class) feature space
  • 4. WSOM, Houston, 2016 prototype based classifier - represent data by one or several prototypes per class - classify a query according to the label of the nearest prototype (or alternative voting schemes) - local decision boundaries according to (e.g.) Euclidean distances + robustness to outliers, low storage needs and computational effort - model selection: number of prototypes per class, etc. feature space ? ? appropriate distance / (dis-) similarity measure + parameterization in feature space, interpretability
  • 5. WSOM, Houston, 2016 5 fixed distance measures: - choice based on prior knowledge or preprocessing - determine prototypes from example data by means of (iterative) learning schemes e.g. heuristic LVQ1, cost function based Generalized LVQ relevance learning, adaptive distances: - employ parameterized distance measure - update parameters in one training process with prototypes - optimize adaptive, data driven dissimilarity example: Matrix Relevance LVQ Learning Vector Quantization
  • 6. WSOM, Houston, 2016 Relevance Matrix LVQ generalized quadratic distance in LVQ: variants: global/local matrices (piecewise quadratic boundaries) diagonal relevances (single feature weights) rectangular (low-dim. representation) [Schneider et al., 2009] relevance matrix: quantifies importance of features and pairs of features summarizes relevance of feature j ( for equally scaled features ) training: optimize prototypes and Λ w.r.t. classification of examples
  • 7. WSOM, Houston, 2016 cost function based training one example: Generalized LVQ [Sato & Yamada, 1995] sigmoidal (linear for small arguments), e.g. E approximates number of misclassifications linear E favors large margin separation of classes, e.g. two winning prototypes: minimize small , large E favors class-typical prototypes
  • 8. WSOM, Houston, 2016 cost function based LVQ There is nothing objective about objective functions James McClelland
  • 9. WSOM, Houston, 2016 9 FDG-PET (Fluorodeoxyglucose positron emission tomography, 3d-images) condition Glucoseuptake n=18 HC Healhy controls n= 20 PD Parkinson’s Disease n=21 MSA Multiple System Atrophy n=17 PSP Progressive Supranuclear Palsy classification of FDG-PET data [http://glimpsproject.com]
  • 10. WSOM, Houston, 2016 10 work flow subjects 1….P voxels1….N(N≈200000) SubjectResidualProfileSRP log-transformed high-intensityvoxels GroupInvariant Subprofile(GIS) subjectsocres1….P subjects 1….P Scaled Subprofile Model PCA based on a given group of subjects SSMPCA data and pre-processing: D. Mudali, L.K. Teune, R. J. Renken, K. L. Leenders, J. B. T. M. Roerdink. Computational and Mathematical Methods in Medicine. March 2015, Art.ID 136921, 10p. and refs. therein
  • 11. WSOM, Houston, 2016 11 work flow subjects 1….P voxels1….N(N≈200000) SubjectResidualProfileSRP log-transformed high-intensityvoxels GroupInvariant Subprofile(GIS) subjectsocres1….P subjects 1….P Scaled Subprofile Model PCA based on a given group of subjects applied to novel subject test labels (condition) GMLVQ classifier prototypes and distance ? SSMPCA
  • 12. WSOM, Houston, 2016 12 Healthy controls vs. Parkinson’s Disease 38 leave-one-out validation runs averaged… prototypes relevance matrix ROC of leave-one-out prediction example: HC vs. PD (w/o z-score transform.)
  • 13. WSOM, Houston, 2016 13 Healthy controls vs. Progressive Supranuclear Palsy 35 leave-one-out validation runs, averaged… prototypes relevance matrix example: HC vs. PSP ROC of leave-one-out prediction (w/o z-score transform.)
  • 14. WSOM, Houston, 2016 14 GMLVQ NPC accuracies Note: maximum margin perceptron - aka SVM with linear kernel - (Matlab svmtrain) achieves performance similar to GMLVQ performance comparison Decision tree (C4.5) using all PC Mudali et al. 2015
  • 15. WSOM, Houston, 2016 15 four classes: HC / PD / MSA / PSP leave-one-out confusion matrix for the four-class problem GM lin. 77.8 % 65.0 % 64.7 % 76.2 % class acc. 66.7 % 60.0 % 52.9 % 89.0 % class acc.(1 vs 1)
  • 16. WSOM, Houston, 2016 16 HC / PD / MSA / PSP HC PSP PD MSA GMLVQ visualization of training data set in terms of the leading eigenvectors of Λ
  • 17. WSOM, Houston, 2016 17 diseases only: PD / MSA / PSP leave-one-out confusion matrix for the three-class problem lin.(1 vs 1)
  • 18. WSOM, Houston, 2016 18 diseases only: PD / MSA / PSP MSA PD PSP GMLVQ visualization of training data set in terms of the leading eigenvectors of Λ
  • 19. WSOM, Houston, 2016 19 discussion / conclusion - detection and discrimination of Parkinsonian syndromes: GMLVQ classifier and SVM clearly outperform decision trees decision trees - serious limitations: small data set leave-one-out validation over-fitting - accuracy is not enough: can we obtain better insight into the classifiers ?
  • 20. WSOM, Houston, 2016 20 outlook/work in progress - optimization of the number of PCs used as features shown to improve decision tree performance potential improvement for other classifiers - larger data sets - understanding relevances in voxel-space relevant PC hint at discriminative between-patient variability PCA: recent example: diagnosis of rheumatoid arthritis based on cytokine expression [L. Yeo et al., Ann. of the Rheumatic Diseases, 2015]
  • 21. WSOM, Houston, 2016 21 http://matlabserver.cs.rug.nl/gmlvqweb/web/ Matlab code: Relevance and Matrix adaptation in Learning Vector Quantization (GRLVQ, GMLVQ and LiRaM LVQ): http://www.cs.rug.nl/~biehl/ links Pre- and re-prints etc.: A no-nonsense beginners’ tool for GMLVQ: http://www.cs.rug.nl/~biehl/gmlvq