The current state of prediction in neuroimaging

The current state of
prediction in neuroimaging
Saige Rutherford
@being_saige
www.beingsaige.com

Road Map
• Quick review of typical ML workflow + my favorite predictive
model
• Which traits and behaviors can we predict from brain
images?
• What do various successful predictive models have in
common?
• What does a “successful” predictive model look like?
• How does big data fit in, is there hope for smaller datasets?
• Where is there room to improve brain-behavior predictive
models?

Favorite predictive model: Brain Basis Set
Basis Set = Chosen # of top components from PCA decomposition of subjects x features matrix
aka principle component regression

Phenotype BBS CPM
General Executive 0.44 0.42
Processing Speed 0.39 0.23
Penn Progressive
Matrices
0.30 0.32
ASR Externalizing 0.24 0.03
ASR Internalizing 0.20 0.04
ASR Attention 0.21 0.00
NEO-Openness 0.18 0.11
NEO-
Conscientiousness
0.19 0.15
NEO-Extroversion 0.13 0.04
NEO-Agreeableness 0.19 0.10
NEO-Neuroticism 0.00 0.05Number of Components Used to Predict
MeanCorrelationbetweenPredicted&
ObservedPhenotype
Sripada et al. Scientific Reports (2019)
100 held out unrelated subjects10-fold Cross Validation

Successful Predictive Modeling
Test your prediction model in “the wild”
Sripada et. al Molecular Psychiatry (2019)
Ex. controlling for confounds (motion, demographics,
medication), different cross validation splits.
This shows more believable and realistic results!
Rozycki et. al Schizophrenia Bulletin (2017)

Impact of region-definition method on
prediction accuracy
Impact of connectivity
parameterization on prediction
accuracy
Impact of classifier choice on
prediction accuracy
https://www.sciencedirect.com/science/article/pii/S1053811919301594Dadi et al. Neuroimage (2019)

https://www.sciencedirect.com/science/article/pii/S1053811919301594Dadi et al. Neuroimage (2019)

What not to do
don’t be this guy
1. Be a research troll
Research Troll: Someone who is overly protective of their
data, unwilling to share data and well-documented code.
2. No out of sample test set or cross validation

What not to do
Make bold claims about one model/method being the best…
You know what they say when you assume…
You’re probably wrong, and someone will publicly prove this to
you in a Twitter thread

Big Datasets are taking over…
Where does my “small” data fit in?

Big Datasets are taking over…
Where does my “small” data fit in?
Big data can be act as a “discovery” data
set.
Use HCP, ABCD, or UKBiobank to find a
brain basis set then get expression
scores of these components in your
dataset.
Use pretrained models from big data,
treat your dataset as a true out of sample
test set.
Externalizing
Internalizing
Attention
Model
Externalizing*
Multi-Task Learning, Transfer Learning

Contributing to Big Data
Federated Learning: allows us to train models on distributed datasets that you cannot
directly access.
https://blog.openmined.org/federated-learning-differential-privacy-and-encrypted-computation-for-medical-imaging/
https://arxiv.org/pdf/1610.05492.pdf
https://ai.googleblog.com/2017/04/federated-learning-collaborative.html
Federated Learning tutorial using brain age prediction model coming soon

How can we improve prediction?
Put in the (hard) work to prepare your data properly…
Tangential point about preprocessing fMRI data
Haak, Marquand, Beckman, Neuroimage 2017

Lots of papers pointing to this same idea…
Don’t use a fixed atlas!
https://cdn.elifesciences.org/articles/44890/elife-44890-v2.pdf
https://cdn.elifesciences.org/articles/32992/elife-32992-v1.pdf
https://www.ncbi.nlm.nih.gov/pubmed/25598050
https://www.sciencedirect.com/science/article/pii/S1053811917305463
https://www.biorxiv.org/content/10.1101/431833v2https://www.ncbi.nlm.nih.gov/pubmed/29878084

Most of machine learning is about good data hygiene.
UNDERSTAND YOUR DATA!
https://twitter.com/justmarkham/status/1155840938356432896
pip install pandas_profiling
import pandas_profiling
df.profile_report()

Patient or
healthy
control?
Think deeply before you turn a continuous
trait into a categorical trait.
Dimensional neuroimaging: our ability to
place a brain scan into a succinct, yet highly
comprehensive and informative reference
system, dimensions of which will reflect
patterns associated with normal or pathologic
brain structure or function.

Bias in neuroimaging data…we need to do better at acknowledging it.
Big Data != Population Data
Does ML reveal the true nature of relationships, unconstrained by any bias or human influence?
The answer is an unequivocal No.

https://www.cs.mcgill.ca/~jpineau/ReproducibilityChecklist.pdf

Take Home Messages
There is not one perfect prediction framework to rule them all.
Machine Learning No Free Lunch theorem: no machine learning
method is better than the others, on average, over a broad family of
problems.
Embrace and collaborate with big data.
Big data: multi-task learning, share your models
Small data: transfer learning, use pre-trained models
Focus on transparency and reproducibility

Learning Resources
This is a research process, not a final offering.
OHBM 2019 talks on ML:
https://www.pathlms.com/ohbm/courses/12238/sections/15843/video_presentations/138295
Gael Varoquaux talks:
https://www.slideshare.net/GaelVaroquaux/functionalconnectome-biomarkers-to-meet-clinical-needs
https://www.slideshare.net/GaelVaroquaux/machine-learning-on-non-curated-data-154905090
Machine learning in neuroimaging: Progress and challenges. Neuroimage. 2019 August 15.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6499712/pdf/nihms-1025732.pdf
Learn a new Pandas trick everyday: https://www.dataschool.io/python-pandas-tips-and-tricks/

Thank you!
All who have supported/inspired me on my learning journey.
Mike Angstadt, Chandra Sripada, Jenna Wiens, Daniel Kessler, Aman
Taxali, Bennet Fauber, Marlena Duda, GirlsWhoCode Organization, Ivy Tso,
Soo-Eun Chang, Steve Taylor, the entire University of Michigan community!
@being_saige
www.beingsaige.com

The current state of prediction in neuroimaging

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to The current state of prediction in neuroimaging

Similar to The current state of prediction in neuroimaging (20)

More from SaigeRutherford

More from SaigeRutherford (7)

Recently uploaded

Recently uploaded (20)

The current state of prediction in neuroimaging

Editor's Notes