SlideShare a Scribd company logo
1 of 44
A MACHINE LEARNING STUDY OF INTRINSIC FUNCTIONAL
CONNECTIVITY IN AUTISM SPECTRUM DISORDERS
_______________
A Thesis
Presented to the
Faculty of
San Diego State University
_______________
In Partial Fulfillment
of the Requirements for the Degree
Master of Science
in
Computational Science
_______________
by
Colleen Pam Chen
Fall 2014
SAN DIEGO STATE UNIVERSITY
The Undersigned Faculty Committee Approves the
Thesis of Colleen Pam Chen:
A Machine Learning Study of Intrinsic Functional Connectivity in Autism
Spectrum Disorders
_____________________________________________
Ralph-Axel Mueller, Chair
Department of Psychology
_____________________________________________
Mark Pflieger
Computational Science Research Center
_____________________________________________
Barbara Bailey
Department of Mathematics and Statistics
______________________________
Approval Date
iii
Copyright © 2014
by
Colleen Pam Chen
All Rights Reserved
iv
DEDICATION
Dedicated to those who believed in me...
v
Where there’s a will, there’s a way
– a wise man
vi
ABSTRACT OF THE THESIS
A Machine Learning Study of Intrinsic Functional Connectivity in
Autism Spectrum Disorders
by
Colleen Pam Chen
Master of Science in Computational Science
San Diego State University, 2014
Despite consensus on the neurological nature of autism spectrum disorders (ASD),
brain biomarkers remain unknown and diagnosis continues to be based on behavioral criteria.
Growing evidence suggests that brain abnormalities in ASD occur at the level of
interconnected networks; however, previous attempts using functional connectivity data for
diagnostic classification have reached only moderate accuracy. We selected 252 low-motion
resting-state functional MRI (rs-fMRI) scans from the Autism Brain Imaging Data Exchange
(ABIDE) including typically developing (TD) and ASD participants (n=126 each), matched
for age, non-verbal IQ, and head motion. A matrix of functional connectivities between 220
functionally defined regions of interest was used for diagnostic classification, implementing
several machine learning tools. While support vector machines in combination with particle
swarm optimization and recursive feature elimination performed modestly (with accuracies
for validation data sets <70%), diagnostic classification reached high accuracy of 91% with
random forest (RF), a nonparametric ensemble learning method. Among the 100 most
informative features (connectivities), for which this peak accuracy was achieved,
participation of somatosensory, default mode, visual, and subcortical regions stood out.
Whereas these findings were in part not unexpected, given previous findings of default mode
abnormalities and atypical visual functioning in ASD, the prominent role of somatosensory
regions was remarkable. The finding of peak accuracy for 100 interregional functional
connectivities further suggests that brain biomarkers of ASD may be regionally complex and
distributed, rather than localized.
vii
TABLE OF CONTENTS
PAGE
ABSTRACT..............................................................................................................................vi
LIST OF TABLES...................................................................................................................viii
LIST OF FIGURES ..................................................................................................................ix
ACKNOWLEDGEMENTS......................................................................................................xi
CHAPTER
1 INTRODUCTION .........................................................................................................1
2 METHODS AND MATERIALS...................................................................................3
2.1 Data Preprocessing.............................................................................................3
2.2 Motion................................................................................................................4
2.3 ROIs and Connectivity Matrix...........................................................................5
2.4 Machine Learning Algorithms...........................................................................5
2.4.1 Particle Swarm Optimization (PSO-SVM)...............................................5
2.4.2 Recursive Feature Elimination (RFE-SVM).............................................8
2.4.3 Random Forest (RF) .................................................................................9
3 RESULTS ....................................................................................................................15
3.1 Classification Accuracy of the Three Algorithms ...........................................15
3.2 Classification Characterization of Informative Features from RF...................15
4 DISCUSSION ..............................................................................................................21
4.1 Regions and Networks .....................................................................................21
4.2 Issues, Caveats, and Perspectives ....................................................................22
REFERENCES ........................................................................................................................29
viii
LIST OF TABLES
PAGE
Table 2.1. Participant Information.............................................................................................3
Table 2.2. Anatomical Labels Used in Connectogram ..............................................................6
Table 4.1. ABIDE Demographic Information .........................................................................24
Table 4.2. ABIDE Cohort Matching (p-Values from Independent Two-Sample t-
Tests)............................................................................................................................27
ix
LIST OF FIGURES
PAGE
Figure 2.1. For each classification tree (RF’s single unit), data are bootstrapped into
in-bag and out-of-bag (for testing of the model). The tree starts with the in-bag
data at the root node, with a random subset of features, and begins the process
of “splitting.” The splitting criteria at each node are determined by a type of
greedy algorithm that recursively partitions the data into daughter nodes, until
reaching a terminal node. (Greedy recursive partitioning algorithms make the
partitioning choice that performs best at each split, with the goal of globally
optimal classification). The out-of-bag data are then sent down the tree as a
test of model fit. The misclassification rate here is called “OOB error.” This is
the internal validation process that prevents overfitting. Once all 10,000 trees
in the forest have finished this process, the forest takes the majority vote from
the trees, and obtains variable importance measures and OOB error. .........................10
Figure 2.2. OOB error as a function of number of trees in the forest. Initially, the error
fluctuates greatly, then eventually stabilizes with a large enough forest with
10,000 trees. These are OOB errors using all 24,090 features prior to feature
selection. The number of trees in the forest is an important parameter to fine
tune as to ensure stability of the algorithm. .................................................................12
Figure 2.3. Feature selection using Mean Decrease Accuracy (MDA). In RF, vriable
importance is determined based on mean decrease in accuracy. This is done
for each tree by randomly permuting a variable in the OOB data and recording
the change in accuracy. For the ensemble of trees, the permuted predictions
are aggregated and compared against the unpermuted predictions to determine
the importance of the permuted variable by the magnitude of the decrease in
accuracy of that variable. We selected the top features using this MDA plot,
where we look for the plateau at which no further decrease in accuracy is
achieved. Here, 100 features was selected using the MDA measures. ........................13
Figure 2.4. Fine tuning the mtry parameter for RF to select the optimal number of
features to split at each node. Using the top 100 features, we fine tuned RF
again on the parameter mtry, the number of features used at each split, to 5. .............14
Figure 3.1. Connectogram showing informative connections. (A) Top 10 informative
connections. (B) Top 100 informative connections. The labels of the
connectograms can be found in Table 2, in the order in which they appear in
the figure. .....................................................................................................................16
Figure 3.2. Informative features selected by RF. (A) Pie chart showing the number of
informative ROIs per functional network. (B) Normalized number of
informative ROIs per network (ratio of the number of times network ROIs
participate in an informative connection divided by the total number of ROIs
x
in given network). This number can exceed 1 because a given ROI may
participate in several informative connections. (C) Heatmap of informative
connections by functional networks. (D) Number of informative ROIs per
anatomical parcellation. (E) Normalized number of informative ROIs per
anatomical parcel (ratio of the number of times anatomical ROIs participate in
an informative connection divided by the total number of ROIs in given
parcel). (F) Heatmap of informative connections by anatomical parcel......................17
xi
ACKNOWLEDGEMENTS
I would like to thank everyone in the Brain Development Imaging Lab at San Diego
State University. Dr. Ralph-Axel Mller was an excellent mentor, who gave me the freedom
and creativity space to develop my ideas but also the guidance to keep me on track. I would
also like to thank Christopher Keown, who was my mentor the first year in the lab. He is one
of the most driven and hardworking people I know, and He inspired me to work even harder.
For the thesis project specifically, I would like to thank Omar Maximo, Afrooz Jehedi, and
Chris Keown for their help on quality controlling the data, as well as Aarti Nair for collecting
the data and Adriana DiMartino for organizing the ABIDE data consortium. Finally, I would
like to thank my committee members for taking the time to help me through this process.
1
CHAPTER 1
INTRODUCTION
Autism spectrum disorder (ASD) is a highly heterogeneous disorder, diagnosed on
the basis of behavioral criteria. From the neurobiological perspective, ‘ASD’ can be
considered an umbrella term that may encompass multiple distinct neurodevelopmental
etiologies (Greschwind & Levitt, 2007). Since any given cohort is thus likely composed of
ill-understood subtypes (whose brain features may vary subtly or even dramatically), it is not
surprising that brain markers with perfect sensitivity and specificity remain unavailable.
Nonetheless, given the specificity of diagnostic criteria (American Psychiatric Association,
2013), the hope that some (potentially complex) patterns of brain features may be unique to
the disorder is not unreasonable and worthy of pursuit.
Issues of heterogeneity and cohort effects can be partially addressed through the use
of large samples, as provided by the recent Autism Brain Imaging Data Exchange (ABIDE)
(Di Martino, Yan, et al., 2013), which incorporates over 1,100 resting state functional MRI
(rs-fMRI) datasets from 17 sites. The use of these data for examining functional connectivity
matrices for large numbers of ROIs across the entire brain is further promising, as there is
growing consensus about ASD being characterized by aberrant connectivity in numerous
functional brain networks (Schipul, Keller, & Just, 2011; Vissers, Cohen, & Geurts, 2012;
Wass, 2011). However, the functional connectivity literature in ASD is complex and often
inconsistent (Müller et al., 2011; Nair et al., 2014), and data-driven machine learning (ML)
techniques provide valuable exploratory tools for uncovering potentially unexpected patterns
of aberrant connectivity that may be unique to the disorder.
A few previous ASD studies have used intrinsic functional connectivity MRI (fcMRI)
(Van Dijk et al., 2010) for diagnostic classification, i.e., for determining whether a dataset is
from an ASD or typically developing participant solely based on functional connectivity.
Anderson and colleagues (2011), using a large fcMRI connectivity matrix, reached an overall
diagnostic classification accuracy of 79%, which was however lower in a separate small
replication sample. Uddin, Supekar, Lynch, et al. (2013) used a logistic regression classifier
2
for 10 rs-fMRI based features identified by ICA, which corresponded to previously described
functional networks. The classifier achieved accuracies about 60-70% for all but one
component identified as salience network, for which accuracy reached 77%. Imperfect
accuracy in these studies may be attributed to moderate sample sizes (N≤80). However, in a
recent classification study using the much larger ABIDE dataset, Nielsen et al. (2013)
reported an overall accuracy of only 60%, suggesting that the approach selected, a leave-one-
out classifier using a general linear model, may not be sufficiently powerful.
In the present study, we implemented several multivariate learning methods,
including random forest (RF), which is an ensemble learning method that operates by
constructing many individual decision trees, known in the literature as classification and
regression trees (CART). Each decision tree in the forest makes a classification based on a
bootstrap sample of the data and a random subset of the input features. The forest as a whole
makes a prediction based on the majority vote of the trees. One desirable feature of the
random forest algorithm is the bootstrapping of the sample to have a built-in training and
validation mechanism, generating an unbiased out-of-bag error that measures the predictive
power of the forest. Features were intrinsic functional connectivities (Van Dijk et al., 2010)
between a standard set of regions of interest using only highest quality (low motion) data sets
from ABIDE.
3
CHAPTER 2
METHODS AND MATERIALS
Data were selected from the Autism Brain Imaging Data Exchange (ABIDE,
http://fcon_1000.projects.nitrc.org/indi/abide/; Di Martino, Yan, et al., 2013), a collection of
over 1100 resting-state scans from 17 different sites. In view of the sensitivity of intrinsic
fcMRI analyses to motion artifacts and noise (as described below), we prioritized data quality
over sample size. We excluded any datasets exhibiting artifacts, signal dropout, suboptimal
registration or standardization, or excessive motion (see details below). Sites acquiring fewer
than 150 time points were further excluded. Based on these criteria, we selected a subsample
of 252 participants with low head motion. Groups were matched on age and motion to yield a
final sample of 126 TD and 126 ASD, ranging in age from six to thirty-six years (see
Table 2.1).
Table 2.1. Participant Information
Full Sample ASD
M ± SD
[range]
TD
M ± SD
[range]
p-value
(2-sample t-
test)
N (Female) 126 (18) 126 (31)
Age (years) 17.311 ± 6.00
[8.2 - 35.7]
17.116 ± 5.700
[6.5 – 34]
0.800
Motion (mm) .057 ± .020
[.018-.108]
.058 ± .020
[.020-.125]
0.923
Non-verbal IQ 106.86 ± 17.0
[37-149]
106.28 ± 12.8
[67-155]
0.800
2.1 DATA PREPROCESSING
Data were processed using the Analysis of Functional NeuroImages software
(afni.nimh.nih.gov; Cox, 1996) and FSL 5.0 (www.fmrib.ox.ac.uk/fsl; Smith et al., 2004).
Functional images were slice-time corrected, motion corrected to align to the middle time
4
point, field-map corrected and aligned to the anatomical images using FLIRT with six
degrees of freedom. FSL’s nonlinear registration tool (FNIRT) was used to standardize
images to the MNI152 standard image (3mm isotropic) using sinc interpolation. The outputs
were blurred to a global full-width-at-half-maximum of 6mm. Given recent concerns that
traditional filtering approaches can cause rippling of motion confounds to neighboring time
points (Carp, 2013), we used a second-order band-pass Butterworth filter [Power, Barnes,
Snyder, Schlaggar, & Petersen, 2013; Satterthwaite et al., 2013) to isolate low-frequency
BOLD fluctuations (.008 < f < .08 Hz) (Cordes et al., 2001).
Regression of 17 nuisance variables was performed to improve data quality
(Satterthwaite et al., 2013). Nuisance regressors included six rigid-body motion parameters
derived from motion correction and their derivatives. White matter and ventricular masks
were created at the participant level using FSL’s image segmentation (Zhang, Brady, &
Smith, 2001) and trimmed to avoid partial-volume effects. An average time series was
extracted from each mask and was removed using regression, along with its corresponding
derivative. Whole-brain global signal was also included as a regressor to mitigate cross-site
variability. All nuisance regressors were band-pass filtered using the second-order
Butterworth filter (.008 < f < .08 Hz) (Power et al., 2013; Satterthwaite et al., 2013).
2.2 MOTION
Motion was quantified as the Euclidean distance between consecutive time points
(based on detected six rigid-body motion parameters). For any instance greater than .25mm,
considered excessive motion, the time point as well as the preceding and following time
points were censored, or “scrubbed” (Power, Barnes, Snyder, Schlaggar, & Petersen, 2012).
If two censored time points occurred within ten time points of each other, all time points
between them were also censored. Datasets with fewer than 90% of time points or less than
150 total time points remaining after censoring were excluded from the analysis. Runs were
then truncated at the point where 150 usable time points were reached. Motion over the
truncated run was summarized for each participant as the average Euclidean distance moved
between time points (including areas that were censored) and was well matched between
groups (p = 0.92).
5
2.3 ROIS AND CONNECTIVITY MATRIX
We used 220 ROIs (10mm spheres) adopted from a meta-analysis of functional
imaging studies by Power et al. (2011) excluding 44 of their 264 ROIs because of missing
signal in >2 participants. Mean time courses were extracted from each ROI and a 220 x 220
connectivity matrix of Fisher-transformed Pearson correlation coefficients was created for
each subject. We then concatenated each subject’s functional connectivities to construct a
group level data matrix. For each ROI pair, we regressed out age and site as covariates from
data matrix. For interpretation of findings, ROIs were further sorted according to two
classification schemes. One was functional, using ROI assignments into networks by Power
et al. (2011). The other was anatomical, using each ROI’s center MNI coordinate and a
surface based atlas (Fischl et al., 2004) included in FreeSurfer. These anatomical labels were
also used to generate connectograms (Irimia, Chambers, Torgerson, & Horn, 2012) for
complementary visualization of informative connectivities (see Table 2.2 for anatomical
labels used in the connectogram).
2.4 MACHINE LEARNING ALGORITHMS
Three ML algorithms were implemented in this study to perform a binary
classification (ASD vs. TD) using rs-fMRI data: (i) support vector machines (SVM) in
combination with particle swarm optimization (PSO) for feature selection (PSO-SVM); (ii)
support vector machines with recursive feature elimination (RFE-SVM) for feature ranking;
and (iii) a nonparametric ensemble learning method called random forest (RF).
2.4.1 Particle Swarm Optimization (PSO-SVM)
We first used PSO in combination with a base classifier, a linear support vector
machine. PSO is a biologically inspired, stochastic optimization algorithm that models the
behavior of swarming particles. The PSO algorithm was utilized as a feature selection tool to
obtain a compact and discriminative feature subset for improved accuracy and robustness of
the subsequent classifiers.
Particle swarm is a biologically inspired, stochastic optimization algorithm that
models the behavior of swarming particles; it was first developed by Kennedy and Eberhart
(1995) The algorithm seeks to explore the search space by a population of individuals or
6
Table 2.2. Anatomical Labels Usedin Connectogram
Frontal lobe TrFPoG/S
FMarG/S
MFS
LOrS
SbOrS
OrS
RG
InfFGOrp
MFG
OrG
InfFGTrip
InfFS
MedOrS
SupFG
SupFS
InfFGOpp
InfPrCS
SupPrCS
PrCG
SbCG/S
CS
Transverse frontopolar gyri and sulci
Fronto-marginal gyrus (of Wernicke) and sulcus
Middle frontal sulcus
Lateral orbital sulcus
Suborbital sulcus
Orbital sulci
Straight gyrus (gyrus rectus)
Orbital part of the inferior frontal gyrus
Middle frontal gyrus
Orbital gyri
Triangular part of the inferior frontal gyrus
Inferior frontal sulcus
Medial orbital sulcus (olfactory sulcus)
Superior frontal gyrus
Superior frontal sulcus
Opercular part of the inferior frontal gyrus
Inferior part of the precentral sulcus
Superior part of the precentral sulcus
Precentral gyrus
Subcentral gyrus (central operculum) and sulci
Central sulcus (Rolando’s fissure)
Insular lobe ALSHorp
ACirInS
ALSVerp
ShoInG
SupCirInS
LoInG/CInS
InfCirInS
PosLS
Horizontal ramus of the anterior lateral sulcus
Anterior circular sulcus the insula
Vertical ramus of the anterior lateral sulcus
Short insular gyri
Superior segment of circular sulcus of the insula
Long insular gyrus and central insular sulcus
Inferior segment of circular sulcus of the insula
Posterior ramus of lateral sulcus
Limbic lobe ACgG/S
MACgG/S
SbCaG
PerCaS
MPosCgG/S
CgSMarp
PosDCgG
PosVCgG
Anterior part of the cingulate gyrus and sulcus
Middle-anterior part of the cingulate gyrus/sulcus
Subcallosal area, subcallosal gyrus
Pericallosal sulcus (S of corpus callosum)
Middle-posterior part of cingulate gyrus/sulcus
Marginal branch of the cingulate sulcus
Posterior-dorsal part of the cingulate gyrus
Posterior-ventral part of the cingulate gyrus
Temporal lobe TPo
PoPl
SupTGLp
HG
ATrCoS
TrTS
MTG
TPl
InfTG
InfTS
SupTS
Temporal pole
Polar plane of the superior temporal gyrus
Lateral aspect of the superior temporal gyrus
Heschl’s gyrus (anterior transverse temporal gyrus)
Anterior transverse collateral sulcus
Transverse temporal sulcus
Middle temporal gyrus
Temporal plane of the superior temporal gyrus
Inferior temporal gyrus
Inferior temporal sulcus
Superior temporal sulcus
(table continues)
7
Table 2.2. (continued)
Parietal lobe PosCG
SuMarG
PaCL/S
PosCS
JS
SbPS
IntPS/TrPS
SupPL
PrCun
AngG
POcS
Postcentral gyrus
Supramarginal gyrus
Paracentral lobule and sulcus
Postcentral sulcus
Sulcus intermedius primus (of Jensen)
Subparietal sulcus
Intraparietal sulcus and transverse parietal sulci
Superior parietal lobule
Precuneus
Angular gyrus
Parietal-occipital sulcus
Occipital lobe PaHipG
CoS/LinS
LOcTS
FuG
CcS
LinG
AOcS
InfOcG/S
SupOcS/TrOcS
PosTrCoS
Cun
MOcG
MOcS/LuS
SupOcG
OcPo
Parahippocampal gyrus
Medial occipito-temporal sulcus and lingual sulcus
Lateral occipito-temporal sulcus
Fusiform gyrus
Calcarine sulcus
Lingual gyrus
Anterior occipital sulcus
Interior occipital gyrus/sulcus
Superior occipital sulcus/transverse occipital sulcus
Posterior transverse collateral sulcus
Cuneus
Middle occipital gyrus
Middle occipital sulcus and lunatus sulcus
Superior occipital gyrus
Occipital pole
Subcortical Pu
Pal
CaN
NAcc
Amg
Tha
Hip
Putamen
Pallidum
Caudate nucleus
Nucleus accumbens
Amygdala
Thalamus
Hippocampus
Cerebellum CeB Cerebellum
Brain stem BSt Brain stem
particles. Each particle represents a potential solution with a velocity, which is dynamically
adjusted according to its own experience and that of its neighbors. The population of
particles is updated based on each particle’s previous best performance and the best particle
in the population. PSO combines local search with global search for balancing the
exploration and exploitation, thus it is useful for searching high-dimensional problem spaces.
Its implementation here was based on Wang, Yang, Teng, Xia, and Jensen (2007), where a
modified binary PSO algorithm is proposed for feature selection. The feature space was made
8
up of the predefined brain regions (as described in the main text) that were then constructed
as a correlation matrix. The features were selected by PSO based on highest fitness scores, or
cost function, then utilized for binary classification by a linear SVM. The SVM seeks to
minimize the upper bound of the generalization error based on the structural risk
minimization principal that is known to have high generalization performance (Vapnik,
1995). Choosing the optimal input feature subset influences the performance of the SVM.
Feature selection is an important issue in building classification systems, which refers to
choosing subset of attributes from the set of original attributes. The key purpose is to identify
significant features, eliminate the irrelevant or dispensable features and build a good learning
model. Using a new learning scheme based on swarm intelligence, PSO has been found to be
a promising technique for real world engineering and optimization problems due to its strong
global search capability. PSO takes less time for each function evaluation and is easy to
implement. In this study, a discrete binary PSO algorithm was used as a feature selection
vehicle to identify the most discriminant or informative features. The main objective of this
study was to exploit the maximum generalization capability of SVM and apply it to the
autism neuroimaging data to distinguish clinical from control populations. The PSO
algorithm was utilized as a feature selection tool to obtain a compact and discriminative
feature subset, which improved the accuracy and robustness of the subsequent classifiers.
2.4.2 Recursive Feature Elimination (RFE-SVM)
In a second approach, we used recursive feature elimination, a pruning technique that
eliminates original input features by using feature ranking coefficients as classifier weights,
and retains a minimum subset of features that yield best classification performance (Guyon,
Weston, Barnhill, & Vapnik, 2002). The output obtained from this algorithm is a list of all
features ranked in the order of most informative to least. We used RFE to reduce the feature
space dimensionality and select the top informative features.
We implemented an alternate approach for space dimensionality reduction using
SVM methods based on Recursive Feature Elimination (RFE). A known problem in
classification and machine learning is to reduce the dimensionality of the feature space to
overcome the risk of overfitting. Since our data had high dimensionality in the feature space
and comparatively small number of training patterns, we faced the risk of overfitting where
9
the decision function that separates the training data does not perform well on the test data. In
this study, we implemented a pruning technique, proposed by Guyon, that eliminates some of
the original input features and retain a minimum subset of features that yield best
classification performance (Guyon et al., 2002). The proposed feature-ranking technique
yields a fixed number of top ranked features, which may be selected for further analysis or
design a classifier. Using feature ranking coefficients as classifier weights, the inputs that are
weighted by the largest value influence most the classification decision. This multivariate
classifier is optimized during training to handle multiple variables, or features,
simultaneously. RFE is an iterative method that trains the classifier (by optimizing the
weights), computes the ranking criterion for all features, then removes the feature with
smallest ranking criterion. This iterative procedure is an instance of backward feature
elimination. SVM RFE is an application of RFE using the weight magnitude as ranking
criterion. The output obtained from such algorithm is a list of all features ranked in the order
of most informative to least. We then explored the optimal number of (top ranked) features
that maximized classification accuracy on the training data set. We applied the subset of top
ranked features to the independent test set in order to obtain the error rate and the predictive
power of the classifier.
2.4.3 Random Forest (RF)
In RF the basic unit is a classification tree, and the ensemble of trees, or forest, is
used to classify subjects using features (e.g., connectivity measures from fMRI, as in the
present study; see Figure 2.1). For final prediction, each tree gives a classification by
"voting" for a binary class label (e.g., diagnostic status). The forest chooses the classification
having the most votes (aggregated over all the trees in the forest). In random forests, there is
no need for cross-validation or a separate test set to obtain an unbiased estimate of the test set
error, which is instead estimated internally, during the run (www.stat.berkeley.edu/~breiman/
RandomForests). Each tree in the forest is constructed using a different bootstrap sample
from the original data. Since the bootstrap draws samples with replacement from the original
dataset, a probabilistic argument can show that each bootstrap sample will contain on average
about two-thirds of the original observations which results in about one-third of the subjects
being left out (i.e., not used in the construction of a particular tree). These Out-of-Bag
10
Figure 2.1. For each classification tree (RF’s single unit), data are
bootstrapped into in-bag and out-of-bag (for testing of the model).
The tree starts with the in-bag data at the root node, with a
random subset of features, and begins the process of “splitting.”
The splitting criteria at each node are determined by a type of
greedy algorithm that recursively partitions the data into daughter
nodes, until reaching a terminal node. (Greedy recursive
partitioning algorithms make the partitioning choice that
performs best at each split, with the goal of globally optimal
classification). The out-of-bag data are then sent down the tree as
a test of model fit. The misclassification rate here is called “OOB
error.” This is the internal validation process that prevents
overfitting. Once all 10,000 trees in the forest have finished this
process, the forest takes the majority vote from the trees, and
obtains variable importance measures and OOB error.
11
subjects are used to attain a running unbiased estimate of the classification error as trees are
added to the forest; they are also used for estimates of variable importance. The test set
classification, or OOB error estimate, has proven to be unbiased (Breiman, 2001). Using the
OOB subjects to determine error and variable importance, RF minimizes overfitting. RF also
provides variable importance measures for feature selection. The OOB data are used to
determine variable importance, based on mean decrease in accuracy. This is done for each
tree by randomly permuting a variable in the OOB data and recording the change in
accuracy. For the ensemble of trees, the permuted predictions are aggregated and compared
against the unpermuted predictions to determine the importance of the permuted variable by
the magnitude of the decrease in accuracy of that variable. When the number of variables is
very large (in the order of 2 x 10^4), RF can be run once with all the variables, then again
using only the most important variables selected from the first run.
In the present study, we used the R package randomForest (Liaw & Wiener, 2002) for
classification analysis. As importance scores, proximity measures, and error rates vary
because of the random components in RF (out of bag sampling, node-level permutation
testing), we created a forest with a sufficient number of trees to ensure stability. To optimize
the most important RF parameters, we fine-tuned the number of trees per forest to 10,000 as
the plotted error rate was observed to stabilize (Figure 2.2), and set the number of variables
randomly selected per node to 150, after tuning for this parameter. The RF algorithm ranked
all features based on variable importance given by the mean decrease in accuracy. After the
initial run with all features, we used the variable importance measures to select for 100 top
informative features (Figure 2.3). Using the 100 features, we obtained the OOBerror to
assess how well the model predicted new data (see Figure 2.4 for fine tuning of the mtry
parameter once the top 100 features have been selected).
12
Figure 2.2. OOB error as a function of number of trees in the
forest. Initially, the error fluctuates greatly, then eventually
stabilizes with a large enough forest with 10,000 trees. These are
OOB errors using all 24,090 features prior to feature selection.
The number of trees in the forest is an important parameter to
fine tune as to ensure stability of the algorithm.
13
Figure 2.3. Feature selectionusing MeanDecrease Accuracy
(MDA). In RF, vriable importance is determined based on
mean decrease in accuracy. This is done for each tree by
randomly permuting a variable in the OOB data and
recording the change in accuracy. For the ensemble of trees,
the permuted predictions are aggregated and compared
against the unpermuted predictions to determine the
importance of the permuted variable by the magnitude of the
decrease in accuracy of that variable. We selectedthe top
features using this MDA plot, where we look for the plateau at
which no further decrease in accuracy is achieved. Here, 100
features was selectedusing the MDA measures.
14
Figure 2.4. Fine tuning the mtry
parameter for RF to select the optimal
number of features to split at each node.
Using the top 100 features, we fine
tuned RF again on the parameter mtry,
the number of features used at each
split, to 5.
15
CHAPTER 3
RESULTS
3.1 CLASSIFICATION ACCURACYOF THE THREE
ALGORITHMS
The PSO-SVM achieved an accuracy of 81% on the training, and of 58% on the
validation dataset, with 44% sensitivity and 72% specificity. Accuracy for RFE-SVM was
100% on the training dataset and 66% on the validation dataset, with 60% sensitivity and
72% specificity. RF classified ASD with only 58% accuracy without feature selection.
However, using the top 100 features with highest variable importance (as defined above), RF
achieved 90.8% accuracy (OOB error rate 9.2%), with sensitivity at 89% and specificity at
93%. In fact, using only 10 most informative features, RF still attained an accuracy of 75%,
with 75% sensitivity and 75% specificity. Note that the RF methodology does not have an
external validation set, but the bootstrap sample of the data for each tree acts as an internal
validation dataset (see Discussion).
3.2 CLASSIFICATION CHARACTERIZATION OF
INFORMATIVE FEATURES FROM RF
Given the overall better performance from RF (compared to the other ML
techniques), we proceeded to examine the most informative features selected by RF. When
selecting only the 10 top features, several regions were noted to participate in two or more
informative connections: left anterior cingulate gyrus, postcentral gyrus bilaterally, right
precuneus, and left calcarine sulcus (see Figure 3.1A). The pattern for the top 100 features,
which yielded the peak accuracy of 91%, involved connections between 124 ROIs (see
Figure 3.1B). The left paracentral lobule and the right postcentral gyrus participated in ≥ 6
informative connections, which was ≥2SD above the mean participation among the 124
informative ROIs.
We further examined these top 100 features by functional networks, as defined in
Power et al. (2011). We first determined the number of times ROIs from a specific network
participated in an informative connection (Figure 3.2A). The distribution differed
16
Figure 3.1. Connectogram showing informative connections.
(A) Top 10 informative connections. (B) Top 100 informative
connections. The labels of the connectograms can be found in
Table 2, in the order in which they appear in the figure.
17
Figure 3.2. Informative features selectedby RF. (A) Pie chart showing the number of
informative ROIs per functional network. (B) Normalized number of informative ROIs
per network (ratio of the number of times network ROIs participate in an informative
connection divided by the total number of ROIs in given network). This number can
exceed1 because a given ROI may participate in several informative connections. (C)
Heatmap of informative connections by functional networks. (D) Number of
informative ROIs per anatomical parcellation. (E) Normalized number of informative
ROIs per anatomical parcel (ratio of the number of times anatomical ROIs participate
in an informative connection divided by the total number of ROIs in given parcel). (F)
Heatmap of informative connections by anatomical parcel. Abbreviated labels: DMN,
default mode network; SMH, somatosensory and motor [hand]; VIS, visual; SAL,
salience; SUB, subcortical; SMM, somatosensory and motor [mouth]; FPTC, frontal
parietal task control; AUD, audio; UN, unknown; VA, ventral attention; COTC,
cingulo opercular task control; DA, dorsal attention; MR, memory retrieval; CEB,
cerebellum.
18
19
significantly from the overall distribution of ROIs across different networks in Power et al.
(2011) (X2
= 33.348, p = 0.001). Default mode, somatosensory/motor (hand region), and
visual networks ranked at the top, accounting for half of all informative connections. We
further examined normalized network importance (i.e., the number of times network ROIs
participated in an informative connection divided by the total number of ROIs in the given
network; Figure 3.2B). This ratio highlighted the importance of somatosensory/motor
networks (both mouth and hand regions), ahead of subcortical, memory retrieval, and visual
networks. Within the two top networks, somatosensory cortex (postcentral gyrus)
participated in 23 of the top 100 connections, whereas primary motor cortex (precentral
gyrus) participated in only 9. Finally, we depicted the 100 top connections in a heat map
matrix (Figure 3.2C), which highlighted the importance of connections between DMN and
visual ROIs (14 informative features). Informative features for the somatosensory/motor
hand regions were dominated by connections with subcortical ROIs.
Examining the 100 top features by broad anatomical subdivisions (Fischl et al.,
2004), the distribution did not differ significantly from expected (X2
= 5.225, p = 0.98). We
found 60 participations by parietal lobes bilaterally, followed by 45 for occipital lobes
bilaterally. Right frontal lobe also had heightened density of informative connections (with
21 participations; Figure 3.2D). When normalizing participations by the number of ROIs
within each subdivision (analogous to the procedure described above), bilateral parietal lobes
again stood out, followed by right temporal lobe (Figure 3.2E). On the heat map matrix
(Figure 3.2F), the highest concentration of most informative connections was found to
involve parietal lobes bilaterally, with six informative connections each within left and right
parietal lobes, and an additional six between right parietal and left occipital lobe and five
between left parietal and right frontal lobe.
We examined the 100 top features by several other criteria. They were almost evenly
divided with respect to the direction of mean FC differences (45% over-, 55%
underconnected in the ASD group) and type of connection (49% inter-, 51% intra-
hemispheric). 95% of the top connections were between networks, 5% within networks;
87% were mid- to long-distance (>40mm Euclidian distance), 11% were short-distance
(<40mm). However, these percentages of Euclidian distance (X2
= 0.228, p-value = 0.63) and
20
of network connection (X2
= 2.24, p-value = 0.13) did not differ significantly from those
expected based on the totality of 24090 features examined in entire connectivity matrix.
21
CHAPTER 4
DISCUSSION
In this study, we used intrinsic functional connectivities between a set of functionally
defined ROIs (Power et al., 2011) for machine learning diagnostic classification of ASD.
While accuracy remained overall modest for PSO-SVM and RFE-SVM approaches, it was
high (91%) for Random Forest. The RF algorithm is well-suited for classification of fcMRI
data because it (i) is applicable when there are more features than observations, (ii) performs
embedded feature selection and is relatively insensitive to any large number of irrelevant
features, (iii) incorporates interactions between features, (iv) is based on the theory of
ensemble learning that allows the algorithm to learn accurately both simple and complex
classification functions, and (v) is applicable for both binary and multi-category classification
tasks (Breiman, 2001).
4.1 REGIONS AND NETWORKS
We examined in two steps which single regions and functional networks were most
‘informative’, contributing most heavily to diagnostic classification. First, focusing on only
the ten top features, for which an accuracy of 75% was achieved, we found that limbic (left
anterior cingulate), somatosensory (postcentral gyri bilaterally), visual (calcarine sulcus), and
default mode (right precuneus) regions stood out (Figure 3.1A). This was largely supported
by subsequent examination of the 100 top features, for which the peak classification accuracy
of 91% was reached. Half of all ROIs involved in these connections belonged to three (out of
14) networks, which were default mode, somatosensory/motor (hand) and visual
(Figure 3.2A).
The informative role of the DMN in diagnostic classification was not surprising,
given extensive evidence of DMN anomalies in ASD. These include fMRI findings
indicating failure to enter the default mode in ASD (Kennedy, Redcay, & Courchesne, 2006;
Murdaugh et al., 2012), as well as numerous fcMRI reports of atypical connectivity of the
DMN (Assaf et al., 2010; Di Martino, Zuo, et al., 2013; Keown, Shih, Nair, Peterson, &
22
Müller, 2013; Monk et al., 2009; Redcay et al., 2013; Uddin, Supekar, & Menon, 2013; von
dem Hagen, Stoyanova, Baron-Cohen, & Calder, 2012; Washington et al., 2013; Zielinski et
al., 2012). Functionally, the DMN is considered to relate to self-referential cognition,
including domains of known impairment in ASD, such as theory of mind and affective
decision making (Andrews-Hanna, Reidler, Sepulcre, Poulin, & Buckner, 2010).
The visual system appears to be overall relatively spared in ASD (Simmons et al.,
2009), and atypically enhanced participation of visual cortex has been observed across many
fMRI (Samson, Mottron, Soulieres, & Zeffiro, 2012) and fcMRI studies (Shen et al., 2012),
with the added recent finding of atypically increased local functional connectivity in ASD
being associated with symptom severity (Keown et al., 2013). The relative importance of
visual regions and occipital lobes probably reflects atypical function of the visual system in
ASD (Dakin & Frith, 2005) rather than sparing.
The role of somatosensory regions was particularly prominent when considering a
normalized ratio of informative ROIs (Figure 3.2B). Although only a total of 26
somatosensory/motor ROIs (hand and mouth regions combined) were included in the
analysis, these participated in 44 out of the 100 most informative connections. This effect
was distinctly driven by primary somatosensory cortex in the postcentral gyrus (rather than
motor cortex). Although somatosensory anomalies have been described in a number of ASD
studies (Puts, Wodka, Tommerdahl, Mostofsky, & Edden, 2014; Tomchek & Dunn, 2007;
Tommerdahl, Tannan, Cascio, Baranek, & Whitsel, 2007), the highly prominent role of
somatosensory regions in diagnostic classification was remarkable. The imaging literature on
somatosensory cortex in ASD is currently limited. Cascio and colleagues (2012) found
atypical activation patterns in adults with ASD for pleasant and unpleasant tactile stimuli in
PCC and insula, which could reflect altered functional connectivity with somatosensory
cortex (not tested in the cited study). Atypical postcentral responses to somatosensory stimuli
have also been observed by one evoked potential study in young children with ASD
(Miyazaki et al., 2007).
4.2 ISSUES, CAVEATS, AND PERSPECTIVES
Diagnostic classification accuracy achieved using Random Forest machine learning
was distinctly above accuracies reported from previous studies using rs-fcMRI (Anderson et
23
al., 2011), including a recent publication (Nielsen et al., 2013) using the same consortium
database as in the present study. It is important to note that RF implements an ‘out-of-bag’
validation process (randomly selected subsamples), contrary to the other machine learning
tools (PSO-SVM, RFE-SVM) used by us, which required external validation data sets. While
this latter procedure may appear to provide best protection from overfitting to the
idiosyncracies of any given training dataset, it rests on the assumption that training and
validation samples can be optimally matched. Such optimal matching may be difficult, if
many variables (e.g., age, sex, motion, site, eye status, scanner, imaging protocol) are
potentially relevant (cf. Table 4.1 for demographic information by site; Table 4.2 for ABIDE
cohort matching). However, even if matching on those variables were possible, a more
intractable problem would remain. There is general consensus that ASD is an umbrella term
for several – maybe many – underlying biological subtypes, which are currently unknown.
Any substantial difference between training and validation datasets in the composition of
these subtypes, which cannot be detected and cannot therefore be avoided, is likely to result
in poor prediction in a validation dataset. In ASD machine learning classification, accuracy
will tend to be overestimated in training datasets (even when the common leave-one-out
procedure is applied), but underestimated in external validation datasets because of the
problems described above. The use of an out-of-bag validation dataset in RF presents a
reasonable solution to these problems, as the procedure itself ensures matching of training
and validation datasets.
Our findings suggest that intrinsic functional connectivity may identify complex
biomarkers of ASD, which is not unexpected given the extensive literature on functional
connectivity anomalies in this disorder (e.g., Vissers et al., 2012). Aside from the use of RF
machine learning and its virtues described above, the main difference between our study and
the relatively unsuccessful classification study by Nielsen et al. (2013) was our careful
selection of high-quality (low-motion) data sets, resulting in the inclusion of only 252 (out of
a total of 1112) participants, equally divided between two tightly motion-matched diagnostic
groups. Our procedure was informed by the recent flurry of studies suggesting that even
small amounts of head motion well below 1mm affect interregional fMRI signal correlations
(Carp, 2013; Jo et al., 2013; Power et al., 2012; Power et al., 2014; Satterthwaite et al., 2013;
Van Dijk, Sabuncu, & Buckner, 2012).
24
Table 4.1. ABIDE Demographic Information
Sites Sample Size Age RMSD NVIQ Gender ADOS_total Handedness
ASD TD ASD
mean (SD)
[range]
TD ASD
mean (SD)
[range]
TD ASD
mean (SD)
[range]
TD Male Female Range
[range]
NA Left Right
Full Sample
NYU (N=70) 36 34 15.6 (6) 16.96 (6.17) .02 (.02) .02 (.02) 108 (15.1) 103 (14.9) 54 16 [5-22] 0 NA NA
[8.5-30] [6.4-30.8] [.03-.09] [.03-.09] [72-149] [67-137]
UM_1 (N=29) 15 14 14.7 (2.1) 15.5 (2.8) .06 (.02) .05 (.02) 112 (19.6) 102.2 (9.4) 17 12 NA 15 5 21
[11.5-18.6] [11-19] [.02-.09] [.02-.08] [81-148] [82-113]
USM (N=29) 19 20 21.9 (6.4) 6.4 (7.2) .05 (.02) .06 (.02) 105.7 (13) 111 (12.5) 29 0 [6-21] 0 NA NA
[11.3-35.7] [13.8-34] [.02-.09] [.03-.09] [75-133] [90-129]
SDSU (N=25) 10 15 14.6 (2.3) 13.4 (2.3) .03 (.01) .04 (.02) 112 (16) 111 (11) 21 4 [4-17] 0 6 19
[9.6-17.7] [8-16.8] [.02-.05] [.02-.07] [86-114] [92-137]
Leuven_1 (N=17) 9 8 22.2 (4.9) 23.75 (2.9) .061 (.02) .065 (.02) 103 (16.4) 114 (18.9) 17 0 NA 9 1 16
[19-32] [21-29] [.04-.09] [.04-.09] [74-120] [93-155]
Leuven_2 (N=17) 8 9 13.8 (1.1) 14.3 (1.4) .07 (.01) .08 (.02) 97.5 (11.8) 101.4 (6.5) 11 6 NA 8 4 13
[12.2-15.3] [12.3-16.6] [.05-.1] [.05-.1] [83-117] [89-109]
PITT (N=14) 8 6 21.2 (8.4) 20.2 (4.1) .07 (.02) .09 (.02) 11.7 (14.2) 115 (5.3) 12 2 [7-19] 1 1 13
[12.6-35.2] [12.8-24.6] [.04-.1] [.07-.1] [93-128] [108-121]
TRINITY (N=12) 4 8 17.2 (3.4) 17.9 (4) .06 (.010) .07 (.01) 121 (7.4) 111 (6.2) 12 0 [7-15] 0 0 12
[13.6-20.8] [12.7-24.8] [.05-.07] [.06-.07] [114-131] [103-121]
YALE (N=12) 6 6 14.6 (1.9) 13.2 (3.4) .06 (.02) .05 (.01) 85 (29) 97 (11.2) 8 4 NA 5 1 11
[11.8-17.2] [8.7-16.7] [.03-.09] [.03-.06] [37-120] [84-115]
KKIN (N=8) 3 5 9.9 (1.51) 9.9 (1.7) .01 (.004) .01 (.02) NA NA 6 2 [10-16] 0 1 7
[8.2-11.1] [8.5-12.8] [.06-.07] [.05-.1]
OLIN (N=8) 4 4 18.3 (4.03) 19.5 (2.4) .09 (.01) .09 (.3) NA NA 6 2 [7-19] 4 0 8
[15-24] [16-21] [.07-.1] [.06-.1]
(table continues)
25
Table 4.1. (continued)
Sites Sample Size Age RMSD NVIQ Gender ADOS_total Handedness
ASD TD ASD
mean (SD)
[range]
TD ASD
mean (SD)
[range]
TD ASD
mean (SD)
[range]
TD Male Female Range
[range]
NA Left Right
UM_2 (N=6) 1 5 17.4 (NA) 18.4 (5.9) .08 (NA) .05 (.01) 80 (NA) 102 (6.8) 6 0 NA 1 0 6
[17.4-17.4] [14.5-28.9] [.08-.08] [.03-.06] [80-80] [92-109]
CMU (N=5) 3 2 26.7 (4.5) 26 (.07) .06 (.02) .08 (.01) 113.7 (10.8) 109.5 (0.7) 4 1 [8-16] 0 0 5
[22-31] [21-31] [.04-.07] [.07-.08] [106-126] [109-110]
Training
NYU (N=60) 31 29 15.96 (6.22) 16.96 (6.50) .05 (.02) 0.05 (0.02) 108.03 (15.80) 102.93 (15.17) 46 14 [6-22] 29 NA NA
[8.51 - 29.98] [6.47 - 30.78] [.03 -.09] [0.03 - 0.09] [72.00 - 149.00] [67.00 - 137.00]
UM_1 (N=25) 14 11 14.64 (2.11) 15.28 (2.85) 0.06 (0.02) 0.05 (0.02) 111.43 (19.97) 105.11 (6.72) 15 10 NA 14 5 17
[11.50 - 18.60] [11.00 - 19.20] [0.02 - 0.09] [0.02 - 0.08] [81.00 - 148.00] [92.00 - 113.00]
Leuven_1 (N=17) 9 8 22.22 (4.87) 23.75 (2.92) 0.06 (0.02) 0.07 (0.02) 103.00 (16.42) 113.88] (18.88) 17 0 NA 9 1 16
[19.00 - 32.00] [21.00 - 29.00] [0.04 - 0.09] [0.04 - 0.09] [74.00 - 120.00] [93.00 - 155.00
SDSU (N=17) 9 8 15.19 (1.65) 13.43 (3.35) 0.03 (0.01) 0.04 (0.01) 112.67 (16.77) 110.25 (13.61) 17 0 [4-17] 0 5 12
[12.88 - 17.67] [8.02 - 16.59] [0.02 - 0.05] [0.02 - 0.07] [86.00 - 140.00] [92.00 - 137.00]
USM (N=16) 8 8 23.67 (7.35) 21.60 (6.84) 0.05 (0.02) 0.06 (0.02) 101.75 (6.82) 111.38 (13.22) 16 0 [9-17] 0 NA NA
[15.85 - 35.71] [13.81 - 34.01] [0.02 - 0.07] [0.03 - 0.09] [87.00 - 106.00] [90.00 - 129.00]
Leuven_2 (N=14) 6 8 13.53 (1.23) 14.59 (1.32) 0.07 (0.01) 0.07 (0.01) 96.17 (9.54) 102 (6.70) 8 6 NA 6 3 11
[12.20 - 15.30] [12.30 - 16.60] [0.05 - 0.08] [0.05 - 0.10] [83.00 - 106.00] [89.00 - 109.00]
PITT (N=11) 6 5 23.03 (9.07) 20.01 (4.56) 0.08 (0.02) 0.09 (0.01) 114.67 (14.90) 114.20 (5.07) 10 1 [7-19] 1 1 10
[12.62 - 35.20] [12.83 - 24.60] [0.06 - 0.11] [0.07 - 0.11] [93.00 - 128.00] [108.00- 119.00]
YALE (N=9) 4 5 14.96 (2.39) 12.53 (3.30) 0.07 (0.02) 0.04 (0.01) 84.75 (37.03) 98.20 (12.32) 8 1 NA 3 1 8
[11.83 - 17.17] [8.67 - 16.66] [0.04 - 0.09] [0.03 - 0.06] [37.00 - 120.00] [84.00 - 115.00]
KKI (N=8) 3 5 9.88 (1.51) 9.91 (1.69) 0.07 (0.00) 0.07 (0.02) NA NA 6 2 [10-16] 0 1 7
[8.20 - 11.14] [8.46 - 12.76] [0.06 - 0.07] [0.05 - 0.10]
OLIN (N=7) 3 4 18.3 (4.93) 19.50 (2.38) 0.09 (0.01) 0.09 (0.03) NA NA 5 2 [9-19] 0 0 7
3[15.00- 24.00] [16.00 - 21.00] [0.08 - 0.10] [0.06 - 0.13]
(table continues)
26
Table 4.1. (continued)
Sites Sample Size Age RMSD NVIQ Gender ADOS_total Handedness
ASD TD ASD
mean (SD)
[range]
TD ASD
mean (SD)
[range]
TD ASD
mean (SD)
[range]
TD Male Female Range
[range]
NA Left Right
TRINITY (N=7) 3 4 16.46 (3.79) 19.48 (4.53) 0.06 (0.01) 0.06 (0.01) 123.00 (7.21) 112.00 (6.38) 7 0 [7-15] 0 0 7
[13.58 - 20.75] [13.75 - 24.83] [0.05 - 0.07] [0.06 - 0.08]
[117.00 -
131.00] [107.00- 121.00]
CMU (N=5) 3 2 26.67 (4.51) 26.00 (7.07) 0.06 (0.02) 0.08 (0.01) 113.67 (10.79) 109.50 (0.71) 4 1 [8-16] 0 0 5
[22.00 - 31.00] [21.00 - 31.00] [0.04 - 0.08] [0.07 - 0.08]
[106.00 -
126.00] [109.00- 110.00]
UM_2 (N=4) 1 3 17.40 (0.00) 15.23 (0.67) 0.08 (0.00) 0.04 (0.02) 80.00 (0.00) 99.33 (7.02) 4 0 NA 1 0 4
[17.40 - 17.40] [14.50 - 15.80] [0.08 - 0.08] [0.03 - 0.06] [80.00 - 80.00] [92.00 - 106.00]
Validation
USM (N=13) 11 2 20.7 (5.7) 28.2 (5.6) .05 (.02) .06 (.005) 105.9 (17.9) 109.5 (11.3) 12 1 [6-21] 1 NA NA
[11.4-32.8] [22-34] [.03-.09] [.05-.07] [75-133] [100-119]
NYU (N=10) 5 5 13.1 (3.99) 16.4 (4.3) .07 (.02) .05 (.01) 38.8 (11.5) 102.8 (14.7) 2 8 [5-13] 0 NA NA
[8.9-17.9] [9.8-20.0] [.04-.09] [.03-.06] [92-119] [83-117]
SDSU (N=8) 1 7 9.64 13.5 (2.9) 0.025 .04 (.018) 112 111.7 (8.3) 4 4 11 0 1 7
[9.6-16.8] [.02-.07] [103-124]
TRINITY (N=5) 1 4 19.4 16.3 (3.2) 0.057 .066 (.0087) 114 109.8 (6.7) 5 0 13 0 5
[12.7-20.1] [.05-.076] [103-116]
UM_1 (N=4) 1 3 16.1 16.2 (2.7) 0.048 .05 (.01) 125 178 (9.9) 2 2 NA 1 0 4
[13.1-18.2] [.03-.06] [82-96]
Leuven_2 (N=3) 2 1 14.5 (.29) 12.4 .08 (.02) 0.11 101.5 (21.9) 97 3 0 NAN 3 1 2
[14.3-14.7] [.07-.09] [86-117]
PITT (N=3) 2 1 15.6 (1.9) 21.22 .05 (.03) 0.11 103 (9.9) 121 2 1 [12-16] 0 0 3
[14.2-16.9] [.03-.07] [96-110]
YALE (N=3) 2 1 14 (0.53) 16.66 .049 (.02) 0.057 85 (1.4) 93 0 3 NA 2 0 3
[13.66-14.4] [.03-.06] [84-86]
UM_2 (N=2) 0 2 NA 23.2 (7.9) NA .06 (.003) NA 107.5 (2.2) 2 0 NA NA 0 2
[17.6-28.8] [.058-.06] [106-109]
OLIN (N=1) 1 0 18 NA 0.07 NA NA NA 1 0 7 0 0 1
27
Table 4.2. ABIDE Cohort Matching (p-Values from Independent Two-Sample t-Tests)
Full Sample Training Validation Training <-> Validation Training <-> Validation
N= 252 N=200 N=52
ASD <-> TD ASD <-> TD ASD <-> TD ASD <-> ASD TD <-> TD
Motion 0.923 0.922 0.984 0.985 0.984
Age 0.793 0.780 0.984 0.840 0.962
NVIQ 0.771 0.753 0.924 0.934 0.986
ADOS 0.824
Although we modeled effects of site, the use of multisite data with the numerous
factors of variability it implies remains a challenge that needs to be accepted as high-quality
rs-fMRI data for comparable sample sizes from a single site are not currently available.
Inclusion of 126 ASD participants may provide relative protection from cohort effects that
are probably at work in many small-sample imaging studies. However, the need for high
levels of compliance in fMRI studies, even when there is no explicit task, inevitably results
in inclusionary biases because low-functioning people with ASD are unable to participate.
The methodologically indispensable step of selecting low-motion data probably added to this
issue, given that higher-functioning participants were likely to move less during scanning.
Indeed, our diagnostic groups were matched for nonverbal IQ, and relatively few ASD
participants with intellectual disability and IQs below 70 were included. The pattern of
informative connections observed in our study may therefore not apply to the lower-
functioning end of the autism spectrum.
A high diagnostic classification accuracy was reached for 100 most informative
features. The pattern of these connections was highly complex, involving numerous brain
regions. While some networks (somatosensory, default mode, visual) stood out, many others
contributed to the classification. This may be disappointing to anyone harboring the hope that
the neurobiology of ASD might ultimately be pinned down to simple and localizable causes.
However, the literature on functional and anatomical brain anomalies in ASD is fully
consistent with the concept of a disorder with highly distributed, rather than localized,
patterns (Akshoomoff, Pierce, & Courchesne, 2002; Müller, 2007; Philip et al., 2012; Vissers
et al., 2012; Wass, 2011). The complex pattern (cf. Figure 3.1B) should, however, not be
viewed as definitive for the ASD population as a whole. Instead, it reflects a distinctive
28
connectivity pattern for a large cohort of children and young adults with ASD. Sample size,
careful data denoising, and refinement of machine learning approaches in the present study
thus present a distinct advance in the search for functional connectivity biomarkers of ASD,
but given the challenges described above, much further work remains to be done. Although
fcMRI is generally accepted as a powerful technique for identifying network abnormalities,
the addition of complementary techniques (e.g., diffusion weighted MRI for assays of
anatomical connectivity; magnetoencephalography for detection of neuronal coherence in
high-frequency bands) will likely enhance diagnostic classification and the detection of
complex biomarkers (see Zhou, Yu, & Duong, 2014 for a recent multimodal attempt with
moderate success). However, it is also important to recognize that machine learning
diagnostic classification can only perform at the level permitted by the diagnostic procedure
itself, which is fundamentally imperfect. It cannot be expected that a disorder recognized as
neurobiological in nature would be detected by means of purely behavioral and observational
criteria currently used for diagnosis. There can thus be no perfect fit between brain markers
and diagnostic labels. The contribution of machine learning to the discovery of biomarkers,
which can ultimately replace behavioral criteria, is therefore important.
29
REFERENCES
Akshoomoff, N., Pierce, K., & Courchesne, E. (2002). The neurobiological basis of autism
from a developmental perspective. Development and Psychopathology, 14, 613-634.
American Psychiatric Association. (2013). Diagnostic and statistical manual of mental
disorders - V. Washington, DC: American Psychiatric Association.
Anderson, J. S., Nielsen, J. A., Froehlich, A. L., DuBray, M. B., Druzgal, T. J., Cariello, A.
N., . . . Lainhart, J. E. (2011). Functional connectivity magnetic resonance imaging
classification of autism. Brain, 134, 3742-3754.
Andrews-Hanna, J. R., Reidler, J. S., Sepulcre, J., Poulin, R., & Buckner, R. L. (2010).
Functional-anatomic fractionation of the brain's default network. Neuron, 65, 550-
562.
Assaf, M., Jagannathan, K., Calhoun, V. D., Miller, L., Stevens, M. C., Sahl, R., . . .
Pearlson, G. D. (2010). Abnormal functional connectivity of default mode sub-
networks in autism spectrum disorder patients. Neuroimage, 53, 247-256.
Breiman, L. (2001). Random forests. Machine Learning, 45, 5-32.
Carp, J. (2013). Optimizing the order of operations for movement scrubbing: Comment on
Power et al. NeuroImage, 76, 436-438.
Cascio, C. J., Moana-Filho, E. J., Guest, S., Nebel, M. B., Weisner, J., Baranek, G. T., &
Essick, G. K. (2012). Perceptual and neural response to affective tactile texture
stimulation in adults with autism spectrum disorders. Autism Research: Official
Journal of the International Society for Autism Research, 5, 231-244.
Cordes, D., Haughton, V. M., Arfanakis, K., Carew, J. D., Turski, P. A., Moritz, C. H., . . .
Meyerand, M. E. (2001). Frequencies contributing to functional connectivity in the
cerebral cortex in "resting-state" data. American Journal of Neuroradiology, 22,
1326-1333.
Cox, R. W. (1996). AFNI: Software for analysis and visualization of functional magnetic
resonance neuroimages. Computers and Biomedical Research, 29, 162-173.
Dakin, S., & Frith, U. (2005). Vagaries of visual perception in autism. Neuron, 48, 497-507.
Di Martino, A., Yan, C. G., Li, Q., Denio, E., Castellanos, F. X., Alaerts, K., . . . Milham, M.
P. (2013). The autism brain imaging data exchange: Towards a large-scale evaluation
of the intrinsic brain architecture in autism. Molecular Psychiatry, 19(6), 659-667.
Di Martino, A., Zuo, X. N., Kelly, C., Grzadzinski, R., Mennes, M., Schvarcz, A., . . .
Milham, M. P. (2013). Shared and distinct intrinsic functional network centrality in
autism and attention-deficit/hyperactivity disorder. Biological Psychiatry, 74(8), 623-
632.
30
Fischl, B., van der Kouwe, A., Destrieux, C., Halgren, E., Segonne, F., Salat, D. H., . . . Dale,
A. M. (2004). Automatically parcellating the human cerebral cortex. Cerebral Cortex,
14, 11-22.
Greschwind, D. H., & Levitt, P. (2007). Autism spectrum disorders: Developmental
disconnection syndromes. Current Opinion in Neurobiology, 17, 103-111.
Guyon, V. V., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene selection for cancer
classification using support vector machines. Machine Learning, 46, 389-422.
Irimia, A., Chambers, M. C., Torgerson, C. M., & Horn, J. D. (2012). Circular representation
of human cortical networks for subject and population-level connectomic
visualization. NeuroImage, 60, 1340-1351.
Jo, H. J., Gotts, S. J., Reynolds, R. C., Bandettini, P. A., Martin, A., Cox, R. W., & Saad, Z.
S. (2013). Effective preprocessing procedures virtually eliminate distance-Dependent
motion artifacts in resting state FMRI. Journal of Applied Mathematics, 2013,
935154.
Kennedy, J., & Eberhart, R. (1995). Particle swarm optimization. Proceedings of the IEEE
International Conference on Neural Network, 4, 1942-1948.
Kennedy, D. P., Redcay, E., & Courchesne, E. (2006). Failing to deactivate: Resting
functional abnormalities in autism. Proceedings of the National Academy of Science
USA, 103, 8275-8280.
Keown, C. L., Shih, P., Nair, A., Peterson, N., & Müller, R.-A. (2013). Local functional
overconnectivity in posterior brain regions is associated with symptom severity in
autism spectrum disorders. Cell Reports, 5, 567-572.
Liaw, A., & Wiener, M. (2002). Classification and regression by random forest. R News, 2,
18-22.
Miyazaki, M., Fujii, E., Saijo, T., Mori, K., Hashimoto, T., Kagami, S., & Kuroda, Y. (2007).
Short-latency somatosensory evoked potentials in infantile autism: Evidence of
hyperactivity in the right primary somatosensory area. Developmental Medicine and
Child Neurology, 49, 13-17.
Monk, C. S., Peltier, S. J., Wiggins, J. L., Weng, S. J., Carrasco, M., Risi, S., & Lord, C.
(2009). Abnormalities of intrinsic functional connectivity in autism spectrum
disorders. Neuroimage, 47, 764-772.
Müller, R.-A. (2007). The study of autism as a distributed disorder. Mental Retardation and
Developmental Disabilities Research Review, 13, 85-95.
Müller, R.-A., Shih, P., Keehn, B., Deyoe, J. R., Leyden, K. M., & Shukla, D. K. (2011).
Underconnected, but how? A survey of functional connectivity MRI studies in autism
spectrum disorders. Cerebral Cortex, 21, 2233-2243.
Murdaugh, D. L., Shinkareva, S. V., Deshpande, H. R., Wang, J., Pennick, M. R., & Kana, R.
K. (2012). Differential deactivation during mentalizing and classification of autism
based on default mode network connectivity. PloS One, 7, e50064.
31
Nair, A., Keown, C. L., Datko, M. C., Shih, P., Keehn, B. M., Müller, R.-A. (2014). Impact
of methodological variables on functional connectivity findings in Autism Spectrum
Disorders. Human Brain Mapping, 35(8), 4035-4048.
Nielsen, J. A., Zielinski, B. A., Fletcher, P. T., Alexander, A. L., Lange, N., Bigler, E. D., . . .
Anderson, J. S. (2013). Multisite functional connectivity MRI classification of
autism: ABIDE results. Frontiers in Human Neuroscience, 7, 599.
Philip, R. C., Dauvermann, M. R., Whalley, H. C., Baynham, K., Lawrie, S. M., & Stanfield,
A. C. (2012). A systematic review and meta-analysis of the fMRI investigation of
autism spectrum disorders. Neuroscience and Biobehavioral Review, 36, 901-942.
Power, J. D., Cohen, A. L., Nelson, S. M., Wig, G. S., Barnes, K. A., Church, J.A., . . .
Petersen, S. E. (2011). Functional network organization of the human brain. Neuron,
72, 665-678.
Power, J. D., Barnes, K. A., Snyder, A. Z., Schlaggar, B. L., Petersen, S. E. (2012). Spurious
but systematic correlations in functional connectivity MRI networks arise from
subject motion. Neuroimage, 59, 2142-2154.
Power JD, Barnes KA, Snyder AZ, Schlaggar BL, Petersen SE (2013): Steps toward
optimizing motion artifact removal in functional connectivity MRI; a reply to Carp.
NeuroImage 76:439-441.
Power, J. D., Mitra, A., Laumann, T. O., Snyder, A. Z., Schlaggar, B. L., & Petersen, S. E.
(2014): Methods to detect, characterize, and remove motion artifact in resting state
fMRI. Neuroimage 84:320-341.
Puts, N. A., Wodka, E. L., Tommerdahl, M., Mostofsky, S. H., & Edden, R. A. (2014).
Impaired tactile processing in children with autism spectrum disorder. Journal of
Neurophysiology, 111, 1803-1811.
Redcay, E., Moran, J. M., Mavros, P. L., Tager-Flusberg, H., Gabrieli, J. D., & Whitfield-
Gabrieli, S. (2013). Intrinsic functional network organization in high-functioning
adolescents with autism spectrum disorder. Frontiers in Human Neuroscience, 7, 573.
Samson, F., Mottron, L., Soulieres, I., & Zeffiro, T. A. (2012). Enhanced visual functioning
in autism: An ALE meta-analysis. Human Brain Mapping, 33,1553-1581.
Satterthwaite, T. D., Elliott, M. A., Gerraty, R. T., Ruparel, K., Loughead, J., Calkins, M. E.,
. . . Wolf, D. H. (2013). An improved framework for confound regression and
filtering for control of motion artifact in the preprocessing of resting-state functional
connectivity data. Neuroimage, 64, 240-256.
Schipul, S. E., Keller, T. A., & Just, M. A. (2011). Inter-regional brain communication and
its disturbance in autism. Frontiers in Systems Neuroscience, 5, 10.
Shen, M. D., Shih, P., Ottl, B., Keehn, B., Leyden, K. M., Gaffrey, M. S., & Muller, R. A.
(2012). Atypical lexicosemantic function of extrastriate cortex in autism spectrum
disorder: Evidence from functional and effective connectivity. Neuroimage, 62, 1780-
1791.
Simmons, D. R., Robertson, A. E., McKay, L. S., Toal, E., McAleer, P., & Pollick, F. E.
(2009). Vision in autism spectrum disorders. Vision Research, 49, 2705-2739.
32
Smith, S. M., Jenkinson, M., Woolrich, M. W., Beckmann, C. F., Behrens, T. E., Johansen-
Berg, H., . . . Matthews, P. M. (2004). Advances in functional and structural MR
image analysis and implementation as FSL. Neuroimage, 23(Suppl 1), S208-219.
Tomchek, S. D., & Dunn, W. (2007). Sensory processing in children with and without
autism: A comparative study using the short sensory profile. American Journal of
Occupational Therapy, 61, 190-200.
Tommerdahl, M., Tannan, V., Cascio, C. J., Baranek, G. T., Whitsel, B. L. (2007).
Vibrotactile adaptation fails to enhance spatial localization in adults with autism.
Brain Research, 1154, 116-123.
Uddin, L. Q., Supekar, K., Lynch, C. J., Khouzam, A., Phillips, J., Feinstein, C., . . . Menon,
V. (2013). Salience network-based classification and prediction of symptom severity
in children with autism. Journal of the American Medical Association on Psychiatry,
70(8), 869-879.
Uddin, L. Q., Supekar, K., & Menon, V. (2013). Reconceptualizing functional brain
connectivity in autism from a developmental perspective. Frontiers in Human
Neuroscience, 7, 458.
Van Dijk, K. R., Hedden, T., Venkataraman, A., Evans, K. C., Lazar, S. W., & Buckner, R.
L. (2010). Intrinsic functional connectivity as a tool for human connectomics: Theory,
properties, and optimization. Journal of Neurophysiology, 103, 297-321.
Van Dijk, K. R., Sabuncu, M. R., & Buckner, R. L. (2012). The influence of head motion on
intrinsic functional connectivity MRI. Neuroimage, 59, 431-438.
Vapnik, V. M. (1995). The nature of statistical learning theory. New York, NY: Springer.
Vissers, M. E., Cohen, M. X., & Geurts, H. M. (2012). Brain connectivity and high
functioning autism: A promising path of research that needs refined models,
methodological convergence, and stronger behavioral links. Neuroscience and
Biobehavioral Review, 36, 604-625.
von dem Hagen, E. A., Stoyanova, R. S., Baron-Cohen, S.,& Calder, A. J. (2012). Reduced
functional connectivity within and between 'social' resting state networks in autism
spectrum conditions. Social Ccognitive and Affective Neuroscience, 8(6), 694-701.
Wang, X., Yang, J., Teng, X., Xia, W., & Jensen, R. (2007). Feature selection based on
rough sets and particle swarm optimization. Pattern Recognition Letters, 28, 459-471
Washington, S. D., Gordon, E. M., Brar, J., Warburton, S., Sawyer, A. T., Wolfe, A., . . .
Vanmeter, J. W. (2013). Dysmaturation of the default mode network in autism.
Human Brain Mapping, 35(4), 1284-1296.
Wass, S. (2011). Distortions and disconnections: Disrupted brain connectivity in autism.
Brain and Cognition, 75, 18-28.
Zhang, Y., Brady, M., & Smith, S. (2001). Segmentation of brain MR images through a
hidden Markov random field model and the expectation-maximization algorithm.
IEEE Transactions on Medical Imaging, 20, 45-57.
33
Zhou, Y., Yu, F., & Duong, T. (2014). Multiparametric MRI characterization and prediction
in autism spectrum disorder using graph theory and machine learning. PloS One, 9,
e90405.
Zielinski, B. A., Anderson, J. S., Froehlich, A. L., Prigge, M. B., Nielsen, J. A., Cooperrider,
J. R., . . . Lainhart, J. E. (2012). scMRI reveals large-scale brain network
abnormalities in autism. PloS One, 7, e49172.

More Related Content

Similar to Chen_Colleen_Thesis_2014

Deep Learning for Health Informatics
Deep Learning for Health InformaticsDeep Learning for Health Informatics
Deep Learning for Health InformaticsJason J Pulikkottil
 
SeniorThesisFinal_Biswas
SeniorThesisFinal_BiswasSeniorThesisFinal_Biswas
SeniorThesisFinal_BiswasAditya Biswas
 
Microsoft Professional Capstone: Data Science
Microsoft Professional Capstone: Data ScienceMicrosoft Professional Capstone: Data Science
Microsoft Professional Capstone: Data ScienceMashfiq Shahriar
 
Neural Networks on Steroids
Neural Networks on SteroidsNeural Networks on Steroids
Neural Networks on SteroidsAdam Blevins
 
Evolutionary optimization of bio-inspired controllers for modular soft robots...
Evolutionary optimization of bio-inspired controllers for modular soft robots...Evolutionary optimization of bio-inspired controllers for modular soft robots...
Evolutionary optimization of bio-inspired controllers for modular soft robots...GiorgiaNadizar
 
IRJET- Brain Tumor Detection using Deep Learning
IRJET- Brain Tumor Detection using Deep LearningIRJET- Brain Tumor Detection using Deep Learning
IRJET- Brain Tumor Detection using Deep LearningIRJET Journal
 
A PERFORMANCE EVALUATION OF A PARALLEL BIOLOGICAL NETWORK MICROCIRCUIT IN NEURON
A PERFORMANCE EVALUATION OF A PARALLEL BIOLOGICAL NETWORK MICROCIRCUIT IN NEURONA PERFORMANCE EVALUATION OF A PARALLEL BIOLOGICAL NETWORK MICROCIRCUIT IN NEURON
A PERFORMANCE EVALUATION OF A PARALLEL BIOLOGICAL NETWORK MICROCIRCUIT IN NEURONijdpsjournal
 
Automatic generation of power system network diagram(Mimic diagram) from a CI...
Automatic generation of power system network diagram(Mimic diagram) from a CI...Automatic generation of power system network diagram(Mimic diagram) from a CI...
Automatic generation of power system network diagram(Mimic diagram) from a CI...Nikhil Valiveti
 
Communication and Control in Electric Power Systems_ Applications of Parallel...
Communication and Control in Electric Power Systems_ Applications of Parallel...Communication and Control in Electric Power Systems_ Applications of Parallel...
Communication and Control in Electric Power Systems_ Applications of Parallel...alimeziane3
 
Face recognition vendor test 2002 supplemental report
Face recognition vendor test 2002   supplemental reportFace recognition vendor test 2002   supplemental report
Face recognition vendor test 2002 supplemental reportSungkwan Park
 
Enterprise Data Center Networking (with citations)
Enterprise Data Center Networking (with citations)Enterprise Data Center Networking (with citations)
Enterprise Data Center Networking (with citations)Jonathan Williams
 
KurtPortelliMastersDissertation
KurtPortelliMastersDissertationKurtPortelliMastersDissertation
KurtPortelliMastersDissertationKurt Portelli
 
A Survey on Stroke Prediction
A Survey on Stroke PredictionA Survey on Stroke Prediction
A Survey on Stroke PredictionMohammadRakib8
 
A survey on heart stroke prediction
A survey on heart stroke predictionA survey on heart stroke prediction
A survey on heart stroke predictiondrubosaha
 
Emona tims-analog-communication-part1 2
Emona tims-analog-communication-part1 2Emona tims-analog-communication-part1 2
Emona tims-analog-communication-part1 2Subhajit Sahu
 

Similar to Chen_Colleen_Thesis_2014 (20)

Deep Learning for Health Informatics
Deep Learning for Health InformaticsDeep Learning for Health Informatics
Deep Learning for Health Informatics
 
Sensfusion
SensfusionSensfusion
Sensfusion
 
SeniorThesisFinal_Biswas
SeniorThesisFinal_BiswasSeniorThesisFinal_Biswas
SeniorThesisFinal_Biswas
 
Microsoft Professional Capstone: Data Science
Microsoft Professional Capstone: Data ScienceMicrosoft Professional Capstone: Data Science
Microsoft Professional Capstone: Data Science
 
Neural Networks on Steroids
Neural Networks on SteroidsNeural Networks on Steroids
Neural Networks on Steroids
 
Evolutionary optimization of bio-inspired controllers for modular soft robots...
Evolutionary optimization of bio-inspired controllers for modular soft robots...Evolutionary optimization of bio-inspired controllers for modular soft robots...
Evolutionary optimization of bio-inspired controllers for modular soft robots...
 
IRJET- Brain Tumor Detection using Deep Learning
IRJET- Brain Tumor Detection using Deep LearningIRJET- Brain Tumor Detection using Deep Learning
IRJET- Brain Tumor Detection using Deep Learning
 
A PERFORMANCE EVALUATION OF A PARALLEL BIOLOGICAL NETWORK MICROCIRCUIT IN NEURON
A PERFORMANCE EVALUATION OF A PARALLEL BIOLOGICAL NETWORK MICROCIRCUIT IN NEURONA PERFORMANCE EVALUATION OF A PARALLEL BIOLOGICAL NETWORK MICROCIRCUIT IN NEURON
A PERFORMANCE EVALUATION OF A PARALLEL BIOLOGICAL NETWORK MICROCIRCUIT IN NEURON
 
Automatic generation of power system network diagram(Mimic diagram) from a CI...
Automatic generation of power system network diagram(Mimic diagram) from a CI...Automatic generation of power system network diagram(Mimic diagram) from a CI...
Automatic generation of power system network diagram(Mimic diagram) from a CI...
 
Communication and Control in Electric Power Systems_ Applications of Parallel...
Communication and Control in Electric Power Systems_ Applications of Parallel...Communication and Control in Electric Power Systems_ Applications of Parallel...
Communication and Control in Electric Power Systems_ Applications of Parallel...
 
Face recognition vendor test 2002 supplemental report
Face recognition vendor test 2002   supplemental reportFace recognition vendor test 2002   supplemental report
Face recognition vendor test 2002 supplemental report
 
Enterprise Data Center Networking (with citations)
Enterprise Data Center Networking (with citations)Enterprise Data Center Networking (with citations)
Enterprise Data Center Networking (with citations)
 
KurtPortelliMastersDissertation
KurtPortelliMastersDissertationKurtPortelliMastersDissertation
KurtPortelliMastersDissertation
 
esnq_control
esnq_controlesnq_control
esnq_control
 
main
mainmain
main
 
A Survey on Stroke Prediction
A Survey on Stroke PredictionA Survey on Stroke Prediction
A Survey on Stroke Prediction
 
A survey on heart stroke prediction
A survey on heart stroke predictionA survey on heart stroke prediction
A survey on heart stroke prediction
 
Sarda_uta_2502M_12076
Sarda_uta_2502M_12076Sarda_uta_2502M_12076
Sarda_uta_2502M_12076
 
dissertation
dissertationdissertation
dissertation
 
Emona tims-analog-communication-part1 2
Emona tims-analog-communication-part1 2Emona tims-analog-communication-part1 2
Emona tims-analog-communication-part1 2
 

Chen_Colleen_Thesis_2014

  • 1. A MACHINE LEARNING STUDY OF INTRINSIC FUNCTIONAL CONNECTIVITY IN AUTISM SPECTRUM DISORDERS _______________ A Thesis Presented to the Faculty of San Diego State University _______________ In Partial Fulfillment of the Requirements for the Degree Master of Science in Computational Science _______________ by Colleen Pam Chen Fall 2014
  • 2. SAN DIEGO STATE UNIVERSITY The Undersigned Faculty Committee Approves the Thesis of Colleen Pam Chen: A Machine Learning Study of Intrinsic Functional Connectivity in Autism Spectrum Disorders _____________________________________________ Ralph-Axel Mueller, Chair Department of Psychology _____________________________________________ Mark Pflieger Computational Science Research Center _____________________________________________ Barbara Bailey Department of Mathematics and Statistics ______________________________ Approval Date
  • 3. iii Copyright © 2014 by Colleen Pam Chen All Rights Reserved
  • 4. iv DEDICATION Dedicated to those who believed in me...
  • 5. v Where there’s a will, there’s a way – a wise man
  • 6. vi ABSTRACT OF THE THESIS A Machine Learning Study of Intrinsic Functional Connectivity in Autism Spectrum Disorders by Colleen Pam Chen Master of Science in Computational Science San Diego State University, 2014 Despite consensus on the neurological nature of autism spectrum disorders (ASD), brain biomarkers remain unknown and diagnosis continues to be based on behavioral criteria. Growing evidence suggests that brain abnormalities in ASD occur at the level of interconnected networks; however, previous attempts using functional connectivity data for diagnostic classification have reached only moderate accuracy. We selected 252 low-motion resting-state functional MRI (rs-fMRI) scans from the Autism Brain Imaging Data Exchange (ABIDE) including typically developing (TD) and ASD participants (n=126 each), matched for age, non-verbal IQ, and head motion. A matrix of functional connectivities between 220 functionally defined regions of interest was used for diagnostic classification, implementing several machine learning tools. While support vector machines in combination with particle swarm optimization and recursive feature elimination performed modestly (with accuracies for validation data sets <70%), diagnostic classification reached high accuracy of 91% with random forest (RF), a nonparametric ensemble learning method. Among the 100 most informative features (connectivities), for which this peak accuracy was achieved, participation of somatosensory, default mode, visual, and subcortical regions stood out. Whereas these findings were in part not unexpected, given previous findings of default mode abnormalities and atypical visual functioning in ASD, the prominent role of somatosensory regions was remarkable. The finding of peak accuracy for 100 interregional functional connectivities further suggests that brain biomarkers of ASD may be regionally complex and distributed, rather than localized.
  • 7. vii TABLE OF CONTENTS PAGE ABSTRACT..............................................................................................................................vi LIST OF TABLES...................................................................................................................viii LIST OF FIGURES ..................................................................................................................ix ACKNOWLEDGEMENTS......................................................................................................xi CHAPTER 1 INTRODUCTION .........................................................................................................1 2 METHODS AND MATERIALS...................................................................................3 2.1 Data Preprocessing.............................................................................................3 2.2 Motion................................................................................................................4 2.3 ROIs and Connectivity Matrix...........................................................................5 2.4 Machine Learning Algorithms...........................................................................5 2.4.1 Particle Swarm Optimization (PSO-SVM)...............................................5 2.4.2 Recursive Feature Elimination (RFE-SVM).............................................8 2.4.3 Random Forest (RF) .................................................................................9 3 RESULTS ....................................................................................................................15 3.1 Classification Accuracy of the Three Algorithms ...........................................15 3.2 Classification Characterization of Informative Features from RF...................15 4 DISCUSSION ..............................................................................................................21 4.1 Regions and Networks .....................................................................................21 4.2 Issues, Caveats, and Perspectives ....................................................................22 REFERENCES ........................................................................................................................29
  • 8. viii LIST OF TABLES PAGE Table 2.1. Participant Information.............................................................................................3 Table 2.2. Anatomical Labels Used in Connectogram ..............................................................6 Table 4.1. ABIDE Demographic Information .........................................................................24 Table 4.2. ABIDE Cohort Matching (p-Values from Independent Two-Sample t- Tests)............................................................................................................................27
  • 9. ix LIST OF FIGURES PAGE Figure 2.1. For each classification tree (RF’s single unit), data are bootstrapped into in-bag and out-of-bag (for testing of the model). The tree starts with the in-bag data at the root node, with a random subset of features, and begins the process of “splitting.” The splitting criteria at each node are determined by a type of greedy algorithm that recursively partitions the data into daughter nodes, until reaching a terminal node. (Greedy recursive partitioning algorithms make the partitioning choice that performs best at each split, with the goal of globally optimal classification). The out-of-bag data are then sent down the tree as a test of model fit. The misclassification rate here is called “OOB error.” This is the internal validation process that prevents overfitting. Once all 10,000 trees in the forest have finished this process, the forest takes the majority vote from the trees, and obtains variable importance measures and OOB error. .........................10 Figure 2.2. OOB error as a function of number of trees in the forest. Initially, the error fluctuates greatly, then eventually stabilizes with a large enough forest with 10,000 trees. These are OOB errors using all 24,090 features prior to feature selection. The number of trees in the forest is an important parameter to fine tune as to ensure stability of the algorithm. .................................................................12 Figure 2.3. Feature selection using Mean Decrease Accuracy (MDA). In RF, vriable importance is determined based on mean decrease in accuracy. This is done for each tree by randomly permuting a variable in the OOB data and recording the change in accuracy. For the ensemble of trees, the permuted predictions are aggregated and compared against the unpermuted predictions to determine the importance of the permuted variable by the magnitude of the decrease in accuracy of that variable. We selected the top features using this MDA plot, where we look for the plateau at which no further decrease in accuracy is achieved. Here, 100 features was selected using the MDA measures. ........................13 Figure 2.4. Fine tuning the mtry parameter for RF to select the optimal number of features to split at each node. Using the top 100 features, we fine tuned RF again on the parameter mtry, the number of features used at each split, to 5. .............14 Figure 3.1. Connectogram showing informative connections. (A) Top 10 informative connections. (B) Top 100 informative connections. The labels of the connectograms can be found in Table 2, in the order in which they appear in the figure. .....................................................................................................................16 Figure 3.2. Informative features selected by RF. (A) Pie chart showing the number of informative ROIs per functional network. (B) Normalized number of informative ROIs per network (ratio of the number of times network ROIs participate in an informative connection divided by the total number of ROIs
  • 10. x in given network). This number can exceed 1 because a given ROI may participate in several informative connections. (C) Heatmap of informative connections by functional networks. (D) Number of informative ROIs per anatomical parcellation. (E) Normalized number of informative ROIs per anatomical parcel (ratio of the number of times anatomical ROIs participate in an informative connection divided by the total number of ROIs in given parcel). (F) Heatmap of informative connections by anatomical parcel......................17
  • 11. xi ACKNOWLEDGEMENTS I would like to thank everyone in the Brain Development Imaging Lab at San Diego State University. Dr. Ralph-Axel Mller was an excellent mentor, who gave me the freedom and creativity space to develop my ideas but also the guidance to keep me on track. I would also like to thank Christopher Keown, who was my mentor the first year in the lab. He is one of the most driven and hardworking people I know, and He inspired me to work even harder. For the thesis project specifically, I would like to thank Omar Maximo, Afrooz Jehedi, and Chris Keown for their help on quality controlling the data, as well as Aarti Nair for collecting the data and Adriana DiMartino for organizing the ABIDE data consortium. Finally, I would like to thank my committee members for taking the time to help me through this process.
  • 12. 1 CHAPTER 1 INTRODUCTION Autism spectrum disorder (ASD) is a highly heterogeneous disorder, diagnosed on the basis of behavioral criteria. From the neurobiological perspective, ‘ASD’ can be considered an umbrella term that may encompass multiple distinct neurodevelopmental etiologies (Greschwind & Levitt, 2007). Since any given cohort is thus likely composed of ill-understood subtypes (whose brain features may vary subtly or even dramatically), it is not surprising that brain markers with perfect sensitivity and specificity remain unavailable. Nonetheless, given the specificity of diagnostic criteria (American Psychiatric Association, 2013), the hope that some (potentially complex) patterns of brain features may be unique to the disorder is not unreasonable and worthy of pursuit. Issues of heterogeneity and cohort effects can be partially addressed through the use of large samples, as provided by the recent Autism Brain Imaging Data Exchange (ABIDE) (Di Martino, Yan, et al., 2013), which incorporates over 1,100 resting state functional MRI (rs-fMRI) datasets from 17 sites. The use of these data for examining functional connectivity matrices for large numbers of ROIs across the entire brain is further promising, as there is growing consensus about ASD being characterized by aberrant connectivity in numerous functional brain networks (Schipul, Keller, & Just, 2011; Vissers, Cohen, & Geurts, 2012; Wass, 2011). However, the functional connectivity literature in ASD is complex and often inconsistent (Müller et al., 2011; Nair et al., 2014), and data-driven machine learning (ML) techniques provide valuable exploratory tools for uncovering potentially unexpected patterns of aberrant connectivity that may be unique to the disorder. A few previous ASD studies have used intrinsic functional connectivity MRI (fcMRI) (Van Dijk et al., 2010) for diagnostic classification, i.e., for determining whether a dataset is from an ASD or typically developing participant solely based on functional connectivity. Anderson and colleagues (2011), using a large fcMRI connectivity matrix, reached an overall diagnostic classification accuracy of 79%, which was however lower in a separate small replication sample. Uddin, Supekar, Lynch, et al. (2013) used a logistic regression classifier
  • 13. 2 for 10 rs-fMRI based features identified by ICA, which corresponded to previously described functional networks. The classifier achieved accuracies about 60-70% for all but one component identified as salience network, for which accuracy reached 77%. Imperfect accuracy in these studies may be attributed to moderate sample sizes (N≤80). However, in a recent classification study using the much larger ABIDE dataset, Nielsen et al. (2013) reported an overall accuracy of only 60%, suggesting that the approach selected, a leave-one- out classifier using a general linear model, may not be sufficiently powerful. In the present study, we implemented several multivariate learning methods, including random forest (RF), which is an ensemble learning method that operates by constructing many individual decision trees, known in the literature as classification and regression trees (CART). Each decision tree in the forest makes a classification based on a bootstrap sample of the data and a random subset of the input features. The forest as a whole makes a prediction based on the majority vote of the trees. One desirable feature of the random forest algorithm is the bootstrapping of the sample to have a built-in training and validation mechanism, generating an unbiased out-of-bag error that measures the predictive power of the forest. Features were intrinsic functional connectivities (Van Dijk et al., 2010) between a standard set of regions of interest using only highest quality (low motion) data sets from ABIDE.
  • 14. 3 CHAPTER 2 METHODS AND MATERIALS Data were selected from the Autism Brain Imaging Data Exchange (ABIDE, http://fcon_1000.projects.nitrc.org/indi/abide/; Di Martino, Yan, et al., 2013), a collection of over 1100 resting-state scans from 17 different sites. In view of the sensitivity of intrinsic fcMRI analyses to motion artifacts and noise (as described below), we prioritized data quality over sample size. We excluded any datasets exhibiting artifacts, signal dropout, suboptimal registration or standardization, or excessive motion (see details below). Sites acquiring fewer than 150 time points were further excluded. Based on these criteria, we selected a subsample of 252 participants with low head motion. Groups were matched on age and motion to yield a final sample of 126 TD and 126 ASD, ranging in age from six to thirty-six years (see Table 2.1). Table 2.1. Participant Information Full Sample ASD M ± SD [range] TD M ± SD [range] p-value (2-sample t- test) N (Female) 126 (18) 126 (31) Age (years) 17.311 ± 6.00 [8.2 - 35.7] 17.116 ± 5.700 [6.5 – 34] 0.800 Motion (mm) .057 ± .020 [.018-.108] .058 ± .020 [.020-.125] 0.923 Non-verbal IQ 106.86 ± 17.0 [37-149] 106.28 ± 12.8 [67-155] 0.800 2.1 DATA PREPROCESSING Data were processed using the Analysis of Functional NeuroImages software (afni.nimh.nih.gov; Cox, 1996) and FSL 5.0 (www.fmrib.ox.ac.uk/fsl; Smith et al., 2004). Functional images were slice-time corrected, motion corrected to align to the middle time
  • 15. 4 point, field-map corrected and aligned to the anatomical images using FLIRT with six degrees of freedom. FSL’s nonlinear registration tool (FNIRT) was used to standardize images to the MNI152 standard image (3mm isotropic) using sinc interpolation. The outputs were blurred to a global full-width-at-half-maximum of 6mm. Given recent concerns that traditional filtering approaches can cause rippling of motion confounds to neighboring time points (Carp, 2013), we used a second-order band-pass Butterworth filter [Power, Barnes, Snyder, Schlaggar, & Petersen, 2013; Satterthwaite et al., 2013) to isolate low-frequency BOLD fluctuations (.008 < f < .08 Hz) (Cordes et al., 2001). Regression of 17 nuisance variables was performed to improve data quality (Satterthwaite et al., 2013). Nuisance regressors included six rigid-body motion parameters derived from motion correction and their derivatives. White matter and ventricular masks were created at the participant level using FSL’s image segmentation (Zhang, Brady, & Smith, 2001) and trimmed to avoid partial-volume effects. An average time series was extracted from each mask and was removed using regression, along with its corresponding derivative. Whole-brain global signal was also included as a regressor to mitigate cross-site variability. All nuisance regressors were band-pass filtered using the second-order Butterworth filter (.008 < f < .08 Hz) (Power et al., 2013; Satterthwaite et al., 2013). 2.2 MOTION Motion was quantified as the Euclidean distance between consecutive time points (based on detected six rigid-body motion parameters). For any instance greater than .25mm, considered excessive motion, the time point as well as the preceding and following time points were censored, or “scrubbed” (Power, Barnes, Snyder, Schlaggar, & Petersen, 2012). If two censored time points occurred within ten time points of each other, all time points between them were also censored. Datasets with fewer than 90% of time points or less than 150 total time points remaining after censoring were excluded from the analysis. Runs were then truncated at the point where 150 usable time points were reached. Motion over the truncated run was summarized for each participant as the average Euclidean distance moved between time points (including areas that were censored) and was well matched between groups (p = 0.92).
  • 16. 5 2.3 ROIS AND CONNECTIVITY MATRIX We used 220 ROIs (10mm spheres) adopted from a meta-analysis of functional imaging studies by Power et al. (2011) excluding 44 of their 264 ROIs because of missing signal in >2 participants. Mean time courses were extracted from each ROI and a 220 x 220 connectivity matrix of Fisher-transformed Pearson correlation coefficients was created for each subject. We then concatenated each subject’s functional connectivities to construct a group level data matrix. For each ROI pair, we regressed out age and site as covariates from data matrix. For interpretation of findings, ROIs were further sorted according to two classification schemes. One was functional, using ROI assignments into networks by Power et al. (2011). The other was anatomical, using each ROI’s center MNI coordinate and a surface based atlas (Fischl et al., 2004) included in FreeSurfer. These anatomical labels were also used to generate connectograms (Irimia, Chambers, Torgerson, & Horn, 2012) for complementary visualization of informative connectivities (see Table 2.2 for anatomical labels used in the connectogram). 2.4 MACHINE LEARNING ALGORITHMS Three ML algorithms were implemented in this study to perform a binary classification (ASD vs. TD) using rs-fMRI data: (i) support vector machines (SVM) in combination with particle swarm optimization (PSO) for feature selection (PSO-SVM); (ii) support vector machines with recursive feature elimination (RFE-SVM) for feature ranking; and (iii) a nonparametric ensemble learning method called random forest (RF). 2.4.1 Particle Swarm Optimization (PSO-SVM) We first used PSO in combination with a base classifier, a linear support vector machine. PSO is a biologically inspired, stochastic optimization algorithm that models the behavior of swarming particles. The PSO algorithm was utilized as a feature selection tool to obtain a compact and discriminative feature subset for improved accuracy and robustness of the subsequent classifiers. Particle swarm is a biologically inspired, stochastic optimization algorithm that models the behavior of swarming particles; it was first developed by Kennedy and Eberhart (1995) The algorithm seeks to explore the search space by a population of individuals or
  • 17. 6 Table 2.2. Anatomical Labels Usedin Connectogram Frontal lobe TrFPoG/S FMarG/S MFS LOrS SbOrS OrS RG InfFGOrp MFG OrG InfFGTrip InfFS MedOrS SupFG SupFS InfFGOpp InfPrCS SupPrCS PrCG SbCG/S CS Transverse frontopolar gyri and sulci Fronto-marginal gyrus (of Wernicke) and sulcus Middle frontal sulcus Lateral orbital sulcus Suborbital sulcus Orbital sulci Straight gyrus (gyrus rectus) Orbital part of the inferior frontal gyrus Middle frontal gyrus Orbital gyri Triangular part of the inferior frontal gyrus Inferior frontal sulcus Medial orbital sulcus (olfactory sulcus) Superior frontal gyrus Superior frontal sulcus Opercular part of the inferior frontal gyrus Inferior part of the precentral sulcus Superior part of the precentral sulcus Precentral gyrus Subcentral gyrus (central operculum) and sulci Central sulcus (Rolando’s fissure) Insular lobe ALSHorp ACirInS ALSVerp ShoInG SupCirInS LoInG/CInS InfCirInS PosLS Horizontal ramus of the anterior lateral sulcus Anterior circular sulcus the insula Vertical ramus of the anterior lateral sulcus Short insular gyri Superior segment of circular sulcus of the insula Long insular gyrus and central insular sulcus Inferior segment of circular sulcus of the insula Posterior ramus of lateral sulcus Limbic lobe ACgG/S MACgG/S SbCaG PerCaS MPosCgG/S CgSMarp PosDCgG PosVCgG Anterior part of the cingulate gyrus and sulcus Middle-anterior part of the cingulate gyrus/sulcus Subcallosal area, subcallosal gyrus Pericallosal sulcus (S of corpus callosum) Middle-posterior part of cingulate gyrus/sulcus Marginal branch of the cingulate sulcus Posterior-dorsal part of the cingulate gyrus Posterior-ventral part of the cingulate gyrus Temporal lobe TPo PoPl SupTGLp HG ATrCoS TrTS MTG TPl InfTG InfTS SupTS Temporal pole Polar plane of the superior temporal gyrus Lateral aspect of the superior temporal gyrus Heschl’s gyrus (anterior transverse temporal gyrus) Anterior transverse collateral sulcus Transverse temporal sulcus Middle temporal gyrus Temporal plane of the superior temporal gyrus Inferior temporal gyrus Inferior temporal sulcus Superior temporal sulcus (table continues)
  • 18. 7 Table 2.2. (continued) Parietal lobe PosCG SuMarG PaCL/S PosCS JS SbPS IntPS/TrPS SupPL PrCun AngG POcS Postcentral gyrus Supramarginal gyrus Paracentral lobule and sulcus Postcentral sulcus Sulcus intermedius primus (of Jensen) Subparietal sulcus Intraparietal sulcus and transverse parietal sulci Superior parietal lobule Precuneus Angular gyrus Parietal-occipital sulcus Occipital lobe PaHipG CoS/LinS LOcTS FuG CcS LinG AOcS InfOcG/S SupOcS/TrOcS PosTrCoS Cun MOcG MOcS/LuS SupOcG OcPo Parahippocampal gyrus Medial occipito-temporal sulcus and lingual sulcus Lateral occipito-temporal sulcus Fusiform gyrus Calcarine sulcus Lingual gyrus Anterior occipital sulcus Interior occipital gyrus/sulcus Superior occipital sulcus/transverse occipital sulcus Posterior transverse collateral sulcus Cuneus Middle occipital gyrus Middle occipital sulcus and lunatus sulcus Superior occipital gyrus Occipital pole Subcortical Pu Pal CaN NAcc Amg Tha Hip Putamen Pallidum Caudate nucleus Nucleus accumbens Amygdala Thalamus Hippocampus Cerebellum CeB Cerebellum Brain stem BSt Brain stem particles. Each particle represents a potential solution with a velocity, which is dynamically adjusted according to its own experience and that of its neighbors. The population of particles is updated based on each particle’s previous best performance and the best particle in the population. PSO combines local search with global search for balancing the exploration and exploitation, thus it is useful for searching high-dimensional problem spaces. Its implementation here was based on Wang, Yang, Teng, Xia, and Jensen (2007), where a modified binary PSO algorithm is proposed for feature selection. The feature space was made
  • 19. 8 up of the predefined brain regions (as described in the main text) that were then constructed as a correlation matrix. The features were selected by PSO based on highest fitness scores, or cost function, then utilized for binary classification by a linear SVM. The SVM seeks to minimize the upper bound of the generalization error based on the structural risk minimization principal that is known to have high generalization performance (Vapnik, 1995). Choosing the optimal input feature subset influences the performance of the SVM. Feature selection is an important issue in building classification systems, which refers to choosing subset of attributes from the set of original attributes. The key purpose is to identify significant features, eliminate the irrelevant or dispensable features and build a good learning model. Using a new learning scheme based on swarm intelligence, PSO has been found to be a promising technique for real world engineering and optimization problems due to its strong global search capability. PSO takes less time for each function evaluation and is easy to implement. In this study, a discrete binary PSO algorithm was used as a feature selection vehicle to identify the most discriminant or informative features. The main objective of this study was to exploit the maximum generalization capability of SVM and apply it to the autism neuroimaging data to distinguish clinical from control populations. The PSO algorithm was utilized as a feature selection tool to obtain a compact and discriminative feature subset, which improved the accuracy and robustness of the subsequent classifiers. 2.4.2 Recursive Feature Elimination (RFE-SVM) In a second approach, we used recursive feature elimination, a pruning technique that eliminates original input features by using feature ranking coefficients as classifier weights, and retains a minimum subset of features that yield best classification performance (Guyon, Weston, Barnhill, & Vapnik, 2002). The output obtained from this algorithm is a list of all features ranked in the order of most informative to least. We used RFE to reduce the feature space dimensionality and select the top informative features. We implemented an alternate approach for space dimensionality reduction using SVM methods based on Recursive Feature Elimination (RFE). A known problem in classification and machine learning is to reduce the dimensionality of the feature space to overcome the risk of overfitting. Since our data had high dimensionality in the feature space and comparatively small number of training patterns, we faced the risk of overfitting where
  • 20. 9 the decision function that separates the training data does not perform well on the test data. In this study, we implemented a pruning technique, proposed by Guyon, that eliminates some of the original input features and retain a minimum subset of features that yield best classification performance (Guyon et al., 2002). The proposed feature-ranking technique yields a fixed number of top ranked features, which may be selected for further analysis or design a classifier. Using feature ranking coefficients as classifier weights, the inputs that are weighted by the largest value influence most the classification decision. This multivariate classifier is optimized during training to handle multiple variables, or features, simultaneously. RFE is an iterative method that trains the classifier (by optimizing the weights), computes the ranking criterion for all features, then removes the feature with smallest ranking criterion. This iterative procedure is an instance of backward feature elimination. SVM RFE is an application of RFE using the weight magnitude as ranking criterion. The output obtained from such algorithm is a list of all features ranked in the order of most informative to least. We then explored the optimal number of (top ranked) features that maximized classification accuracy on the training data set. We applied the subset of top ranked features to the independent test set in order to obtain the error rate and the predictive power of the classifier. 2.4.3 Random Forest (RF) In RF the basic unit is a classification tree, and the ensemble of trees, or forest, is used to classify subjects using features (e.g., connectivity measures from fMRI, as in the present study; see Figure 2.1). For final prediction, each tree gives a classification by "voting" for a binary class label (e.g., diagnostic status). The forest chooses the classification having the most votes (aggregated over all the trees in the forest). In random forests, there is no need for cross-validation or a separate test set to obtain an unbiased estimate of the test set error, which is instead estimated internally, during the run (www.stat.berkeley.edu/~breiman/ RandomForests). Each tree in the forest is constructed using a different bootstrap sample from the original data. Since the bootstrap draws samples with replacement from the original dataset, a probabilistic argument can show that each bootstrap sample will contain on average about two-thirds of the original observations which results in about one-third of the subjects being left out (i.e., not used in the construction of a particular tree). These Out-of-Bag
  • 21. 10 Figure 2.1. For each classification tree (RF’s single unit), data are bootstrapped into in-bag and out-of-bag (for testing of the model). The tree starts with the in-bag data at the root node, with a random subset of features, and begins the process of “splitting.” The splitting criteria at each node are determined by a type of greedy algorithm that recursively partitions the data into daughter nodes, until reaching a terminal node. (Greedy recursive partitioning algorithms make the partitioning choice that performs best at each split, with the goal of globally optimal classification). The out-of-bag data are then sent down the tree as a test of model fit. The misclassification rate here is called “OOB error.” This is the internal validation process that prevents overfitting. Once all 10,000 trees in the forest have finished this process, the forest takes the majority vote from the trees, and obtains variable importance measures and OOB error.
  • 22. 11 subjects are used to attain a running unbiased estimate of the classification error as trees are added to the forest; they are also used for estimates of variable importance. The test set classification, or OOB error estimate, has proven to be unbiased (Breiman, 2001). Using the OOB subjects to determine error and variable importance, RF minimizes overfitting. RF also provides variable importance measures for feature selection. The OOB data are used to determine variable importance, based on mean decrease in accuracy. This is done for each tree by randomly permuting a variable in the OOB data and recording the change in accuracy. For the ensemble of trees, the permuted predictions are aggregated and compared against the unpermuted predictions to determine the importance of the permuted variable by the magnitude of the decrease in accuracy of that variable. When the number of variables is very large (in the order of 2 x 10^4), RF can be run once with all the variables, then again using only the most important variables selected from the first run. In the present study, we used the R package randomForest (Liaw & Wiener, 2002) for classification analysis. As importance scores, proximity measures, and error rates vary because of the random components in RF (out of bag sampling, node-level permutation testing), we created a forest with a sufficient number of trees to ensure stability. To optimize the most important RF parameters, we fine-tuned the number of trees per forest to 10,000 as the plotted error rate was observed to stabilize (Figure 2.2), and set the number of variables randomly selected per node to 150, after tuning for this parameter. The RF algorithm ranked all features based on variable importance given by the mean decrease in accuracy. After the initial run with all features, we used the variable importance measures to select for 100 top informative features (Figure 2.3). Using the 100 features, we obtained the OOBerror to assess how well the model predicted new data (see Figure 2.4 for fine tuning of the mtry parameter once the top 100 features have been selected).
  • 23. 12 Figure 2.2. OOB error as a function of number of trees in the forest. Initially, the error fluctuates greatly, then eventually stabilizes with a large enough forest with 10,000 trees. These are OOB errors using all 24,090 features prior to feature selection. The number of trees in the forest is an important parameter to fine tune as to ensure stability of the algorithm.
  • 24. 13 Figure 2.3. Feature selectionusing MeanDecrease Accuracy (MDA). In RF, vriable importance is determined based on mean decrease in accuracy. This is done for each tree by randomly permuting a variable in the OOB data and recording the change in accuracy. For the ensemble of trees, the permuted predictions are aggregated and compared against the unpermuted predictions to determine the importance of the permuted variable by the magnitude of the decrease in accuracy of that variable. We selectedthe top features using this MDA plot, where we look for the plateau at which no further decrease in accuracy is achieved. Here, 100 features was selectedusing the MDA measures.
  • 25. 14 Figure 2.4. Fine tuning the mtry parameter for RF to select the optimal number of features to split at each node. Using the top 100 features, we fine tuned RF again on the parameter mtry, the number of features used at each split, to 5.
  • 26. 15 CHAPTER 3 RESULTS 3.1 CLASSIFICATION ACCURACYOF THE THREE ALGORITHMS The PSO-SVM achieved an accuracy of 81% on the training, and of 58% on the validation dataset, with 44% sensitivity and 72% specificity. Accuracy for RFE-SVM was 100% on the training dataset and 66% on the validation dataset, with 60% sensitivity and 72% specificity. RF classified ASD with only 58% accuracy without feature selection. However, using the top 100 features with highest variable importance (as defined above), RF achieved 90.8% accuracy (OOB error rate 9.2%), with sensitivity at 89% and specificity at 93%. In fact, using only 10 most informative features, RF still attained an accuracy of 75%, with 75% sensitivity and 75% specificity. Note that the RF methodology does not have an external validation set, but the bootstrap sample of the data for each tree acts as an internal validation dataset (see Discussion). 3.2 CLASSIFICATION CHARACTERIZATION OF INFORMATIVE FEATURES FROM RF Given the overall better performance from RF (compared to the other ML techniques), we proceeded to examine the most informative features selected by RF. When selecting only the 10 top features, several regions were noted to participate in two or more informative connections: left anterior cingulate gyrus, postcentral gyrus bilaterally, right precuneus, and left calcarine sulcus (see Figure 3.1A). The pattern for the top 100 features, which yielded the peak accuracy of 91%, involved connections between 124 ROIs (see Figure 3.1B). The left paracentral lobule and the right postcentral gyrus participated in ≥ 6 informative connections, which was ≥2SD above the mean participation among the 124 informative ROIs. We further examined these top 100 features by functional networks, as defined in Power et al. (2011). We first determined the number of times ROIs from a specific network participated in an informative connection (Figure 3.2A). The distribution differed
  • 27. 16 Figure 3.1. Connectogram showing informative connections. (A) Top 10 informative connections. (B) Top 100 informative connections. The labels of the connectograms can be found in Table 2, in the order in which they appear in the figure.
  • 28. 17 Figure 3.2. Informative features selectedby RF. (A) Pie chart showing the number of informative ROIs per functional network. (B) Normalized number of informative ROIs per network (ratio of the number of times network ROIs participate in an informative connection divided by the total number of ROIs in given network). This number can exceed1 because a given ROI may participate in several informative connections. (C) Heatmap of informative connections by functional networks. (D) Number of informative ROIs per anatomical parcellation. (E) Normalized number of informative ROIs per anatomical parcel (ratio of the number of times anatomical ROIs participate in an informative connection divided by the total number of ROIs in given parcel). (F) Heatmap of informative connections by anatomical parcel. Abbreviated labels: DMN, default mode network; SMH, somatosensory and motor [hand]; VIS, visual; SAL, salience; SUB, subcortical; SMM, somatosensory and motor [mouth]; FPTC, frontal parietal task control; AUD, audio; UN, unknown; VA, ventral attention; COTC, cingulo opercular task control; DA, dorsal attention; MR, memory retrieval; CEB, cerebellum.
  • 29. 18
  • 30. 19 significantly from the overall distribution of ROIs across different networks in Power et al. (2011) (X2 = 33.348, p = 0.001). Default mode, somatosensory/motor (hand region), and visual networks ranked at the top, accounting for half of all informative connections. We further examined normalized network importance (i.e., the number of times network ROIs participated in an informative connection divided by the total number of ROIs in the given network; Figure 3.2B). This ratio highlighted the importance of somatosensory/motor networks (both mouth and hand regions), ahead of subcortical, memory retrieval, and visual networks. Within the two top networks, somatosensory cortex (postcentral gyrus) participated in 23 of the top 100 connections, whereas primary motor cortex (precentral gyrus) participated in only 9. Finally, we depicted the 100 top connections in a heat map matrix (Figure 3.2C), which highlighted the importance of connections between DMN and visual ROIs (14 informative features). Informative features for the somatosensory/motor hand regions were dominated by connections with subcortical ROIs. Examining the 100 top features by broad anatomical subdivisions (Fischl et al., 2004), the distribution did not differ significantly from expected (X2 = 5.225, p = 0.98). We found 60 participations by parietal lobes bilaterally, followed by 45 for occipital lobes bilaterally. Right frontal lobe also had heightened density of informative connections (with 21 participations; Figure 3.2D). When normalizing participations by the number of ROIs within each subdivision (analogous to the procedure described above), bilateral parietal lobes again stood out, followed by right temporal lobe (Figure 3.2E). On the heat map matrix (Figure 3.2F), the highest concentration of most informative connections was found to involve parietal lobes bilaterally, with six informative connections each within left and right parietal lobes, and an additional six between right parietal and left occipital lobe and five between left parietal and right frontal lobe. We examined the 100 top features by several other criteria. They were almost evenly divided with respect to the direction of mean FC differences (45% over-, 55% underconnected in the ASD group) and type of connection (49% inter-, 51% intra- hemispheric). 95% of the top connections were between networks, 5% within networks; 87% were mid- to long-distance (>40mm Euclidian distance), 11% were short-distance (<40mm). However, these percentages of Euclidian distance (X2 = 0.228, p-value = 0.63) and
  • 31. 20 of network connection (X2 = 2.24, p-value = 0.13) did not differ significantly from those expected based on the totality of 24090 features examined in entire connectivity matrix.
  • 32. 21 CHAPTER 4 DISCUSSION In this study, we used intrinsic functional connectivities between a set of functionally defined ROIs (Power et al., 2011) for machine learning diagnostic classification of ASD. While accuracy remained overall modest for PSO-SVM and RFE-SVM approaches, it was high (91%) for Random Forest. The RF algorithm is well-suited for classification of fcMRI data because it (i) is applicable when there are more features than observations, (ii) performs embedded feature selection and is relatively insensitive to any large number of irrelevant features, (iii) incorporates interactions between features, (iv) is based on the theory of ensemble learning that allows the algorithm to learn accurately both simple and complex classification functions, and (v) is applicable for both binary and multi-category classification tasks (Breiman, 2001). 4.1 REGIONS AND NETWORKS We examined in two steps which single regions and functional networks were most ‘informative’, contributing most heavily to diagnostic classification. First, focusing on only the ten top features, for which an accuracy of 75% was achieved, we found that limbic (left anterior cingulate), somatosensory (postcentral gyri bilaterally), visual (calcarine sulcus), and default mode (right precuneus) regions stood out (Figure 3.1A). This was largely supported by subsequent examination of the 100 top features, for which the peak classification accuracy of 91% was reached. Half of all ROIs involved in these connections belonged to three (out of 14) networks, which were default mode, somatosensory/motor (hand) and visual (Figure 3.2A). The informative role of the DMN in diagnostic classification was not surprising, given extensive evidence of DMN anomalies in ASD. These include fMRI findings indicating failure to enter the default mode in ASD (Kennedy, Redcay, & Courchesne, 2006; Murdaugh et al., 2012), as well as numerous fcMRI reports of atypical connectivity of the DMN (Assaf et al., 2010; Di Martino, Zuo, et al., 2013; Keown, Shih, Nair, Peterson, &
  • 33. 22 Müller, 2013; Monk et al., 2009; Redcay et al., 2013; Uddin, Supekar, & Menon, 2013; von dem Hagen, Stoyanova, Baron-Cohen, & Calder, 2012; Washington et al., 2013; Zielinski et al., 2012). Functionally, the DMN is considered to relate to self-referential cognition, including domains of known impairment in ASD, such as theory of mind and affective decision making (Andrews-Hanna, Reidler, Sepulcre, Poulin, & Buckner, 2010). The visual system appears to be overall relatively spared in ASD (Simmons et al., 2009), and atypically enhanced participation of visual cortex has been observed across many fMRI (Samson, Mottron, Soulieres, & Zeffiro, 2012) and fcMRI studies (Shen et al., 2012), with the added recent finding of atypically increased local functional connectivity in ASD being associated with symptom severity (Keown et al., 2013). The relative importance of visual regions and occipital lobes probably reflects atypical function of the visual system in ASD (Dakin & Frith, 2005) rather than sparing. The role of somatosensory regions was particularly prominent when considering a normalized ratio of informative ROIs (Figure 3.2B). Although only a total of 26 somatosensory/motor ROIs (hand and mouth regions combined) were included in the analysis, these participated in 44 out of the 100 most informative connections. This effect was distinctly driven by primary somatosensory cortex in the postcentral gyrus (rather than motor cortex). Although somatosensory anomalies have been described in a number of ASD studies (Puts, Wodka, Tommerdahl, Mostofsky, & Edden, 2014; Tomchek & Dunn, 2007; Tommerdahl, Tannan, Cascio, Baranek, & Whitsel, 2007), the highly prominent role of somatosensory regions in diagnostic classification was remarkable. The imaging literature on somatosensory cortex in ASD is currently limited. Cascio and colleagues (2012) found atypical activation patterns in adults with ASD for pleasant and unpleasant tactile stimuli in PCC and insula, which could reflect altered functional connectivity with somatosensory cortex (not tested in the cited study). Atypical postcentral responses to somatosensory stimuli have also been observed by one evoked potential study in young children with ASD (Miyazaki et al., 2007). 4.2 ISSUES, CAVEATS, AND PERSPECTIVES Diagnostic classification accuracy achieved using Random Forest machine learning was distinctly above accuracies reported from previous studies using rs-fcMRI (Anderson et
  • 34. 23 al., 2011), including a recent publication (Nielsen et al., 2013) using the same consortium database as in the present study. It is important to note that RF implements an ‘out-of-bag’ validation process (randomly selected subsamples), contrary to the other machine learning tools (PSO-SVM, RFE-SVM) used by us, which required external validation data sets. While this latter procedure may appear to provide best protection from overfitting to the idiosyncracies of any given training dataset, it rests on the assumption that training and validation samples can be optimally matched. Such optimal matching may be difficult, if many variables (e.g., age, sex, motion, site, eye status, scanner, imaging protocol) are potentially relevant (cf. Table 4.1 for demographic information by site; Table 4.2 for ABIDE cohort matching). However, even if matching on those variables were possible, a more intractable problem would remain. There is general consensus that ASD is an umbrella term for several – maybe many – underlying biological subtypes, which are currently unknown. Any substantial difference between training and validation datasets in the composition of these subtypes, which cannot be detected and cannot therefore be avoided, is likely to result in poor prediction in a validation dataset. In ASD machine learning classification, accuracy will tend to be overestimated in training datasets (even when the common leave-one-out procedure is applied), but underestimated in external validation datasets because of the problems described above. The use of an out-of-bag validation dataset in RF presents a reasonable solution to these problems, as the procedure itself ensures matching of training and validation datasets. Our findings suggest that intrinsic functional connectivity may identify complex biomarkers of ASD, which is not unexpected given the extensive literature on functional connectivity anomalies in this disorder (e.g., Vissers et al., 2012). Aside from the use of RF machine learning and its virtues described above, the main difference between our study and the relatively unsuccessful classification study by Nielsen et al. (2013) was our careful selection of high-quality (low-motion) data sets, resulting in the inclusion of only 252 (out of a total of 1112) participants, equally divided between two tightly motion-matched diagnostic groups. Our procedure was informed by the recent flurry of studies suggesting that even small amounts of head motion well below 1mm affect interregional fMRI signal correlations (Carp, 2013; Jo et al., 2013; Power et al., 2012; Power et al., 2014; Satterthwaite et al., 2013; Van Dijk, Sabuncu, & Buckner, 2012).
  • 35. 24 Table 4.1. ABIDE Demographic Information Sites Sample Size Age RMSD NVIQ Gender ADOS_total Handedness ASD TD ASD mean (SD) [range] TD ASD mean (SD) [range] TD ASD mean (SD) [range] TD Male Female Range [range] NA Left Right Full Sample NYU (N=70) 36 34 15.6 (6) 16.96 (6.17) .02 (.02) .02 (.02) 108 (15.1) 103 (14.9) 54 16 [5-22] 0 NA NA [8.5-30] [6.4-30.8] [.03-.09] [.03-.09] [72-149] [67-137] UM_1 (N=29) 15 14 14.7 (2.1) 15.5 (2.8) .06 (.02) .05 (.02) 112 (19.6) 102.2 (9.4) 17 12 NA 15 5 21 [11.5-18.6] [11-19] [.02-.09] [.02-.08] [81-148] [82-113] USM (N=29) 19 20 21.9 (6.4) 6.4 (7.2) .05 (.02) .06 (.02) 105.7 (13) 111 (12.5) 29 0 [6-21] 0 NA NA [11.3-35.7] [13.8-34] [.02-.09] [.03-.09] [75-133] [90-129] SDSU (N=25) 10 15 14.6 (2.3) 13.4 (2.3) .03 (.01) .04 (.02) 112 (16) 111 (11) 21 4 [4-17] 0 6 19 [9.6-17.7] [8-16.8] [.02-.05] [.02-.07] [86-114] [92-137] Leuven_1 (N=17) 9 8 22.2 (4.9) 23.75 (2.9) .061 (.02) .065 (.02) 103 (16.4) 114 (18.9) 17 0 NA 9 1 16 [19-32] [21-29] [.04-.09] [.04-.09] [74-120] [93-155] Leuven_2 (N=17) 8 9 13.8 (1.1) 14.3 (1.4) .07 (.01) .08 (.02) 97.5 (11.8) 101.4 (6.5) 11 6 NA 8 4 13 [12.2-15.3] [12.3-16.6] [.05-.1] [.05-.1] [83-117] [89-109] PITT (N=14) 8 6 21.2 (8.4) 20.2 (4.1) .07 (.02) .09 (.02) 11.7 (14.2) 115 (5.3) 12 2 [7-19] 1 1 13 [12.6-35.2] [12.8-24.6] [.04-.1] [.07-.1] [93-128] [108-121] TRINITY (N=12) 4 8 17.2 (3.4) 17.9 (4) .06 (.010) .07 (.01) 121 (7.4) 111 (6.2) 12 0 [7-15] 0 0 12 [13.6-20.8] [12.7-24.8] [.05-.07] [.06-.07] [114-131] [103-121] YALE (N=12) 6 6 14.6 (1.9) 13.2 (3.4) .06 (.02) .05 (.01) 85 (29) 97 (11.2) 8 4 NA 5 1 11 [11.8-17.2] [8.7-16.7] [.03-.09] [.03-.06] [37-120] [84-115] KKIN (N=8) 3 5 9.9 (1.51) 9.9 (1.7) .01 (.004) .01 (.02) NA NA 6 2 [10-16] 0 1 7 [8.2-11.1] [8.5-12.8] [.06-.07] [.05-.1] OLIN (N=8) 4 4 18.3 (4.03) 19.5 (2.4) .09 (.01) .09 (.3) NA NA 6 2 [7-19] 4 0 8 [15-24] [16-21] [.07-.1] [.06-.1] (table continues)
  • 36. 25 Table 4.1. (continued) Sites Sample Size Age RMSD NVIQ Gender ADOS_total Handedness ASD TD ASD mean (SD) [range] TD ASD mean (SD) [range] TD ASD mean (SD) [range] TD Male Female Range [range] NA Left Right UM_2 (N=6) 1 5 17.4 (NA) 18.4 (5.9) .08 (NA) .05 (.01) 80 (NA) 102 (6.8) 6 0 NA 1 0 6 [17.4-17.4] [14.5-28.9] [.08-.08] [.03-.06] [80-80] [92-109] CMU (N=5) 3 2 26.7 (4.5) 26 (.07) .06 (.02) .08 (.01) 113.7 (10.8) 109.5 (0.7) 4 1 [8-16] 0 0 5 [22-31] [21-31] [.04-.07] [.07-.08] [106-126] [109-110] Training NYU (N=60) 31 29 15.96 (6.22) 16.96 (6.50) .05 (.02) 0.05 (0.02) 108.03 (15.80) 102.93 (15.17) 46 14 [6-22] 29 NA NA [8.51 - 29.98] [6.47 - 30.78] [.03 -.09] [0.03 - 0.09] [72.00 - 149.00] [67.00 - 137.00] UM_1 (N=25) 14 11 14.64 (2.11) 15.28 (2.85) 0.06 (0.02) 0.05 (0.02) 111.43 (19.97) 105.11 (6.72) 15 10 NA 14 5 17 [11.50 - 18.60] [11.00 - 19.20] [0.02 - 0.09] [0.02 - 0.08] [81.00 - 148.00] [92.00 - 113.00] Leuven_1 (N=17) 9 8 22.22 (4.87) 23.75 (2.92) 0.06 (0.02) 0.07 (0.02) 103.00 (16.42) 113.88] (18.88) 17 0 NA 9 1 16 [19.00 - 32.00] [21.00 - 29.00] [0.04 - 0.09] [0.04 - 0.09] [74.00 - 120.00] [93.00 - 155.00 SDSU (N=17) 9 8 15.19 (1.65) 13.43 (3.35) 0.03 (0.01) 0.04 (0.01) 112.67 (16.77) 110.25 (13.61) 17 0 [4-17] 0 5 12 [12.88 - 17.67] [8.02 - 16.59] [0.02 - 0.05] [0.02 - 0.07] [86.00 - 140.00] [92.00 - 137.00] USM (N=16) 8 8 23.67 (7.35) 21.60 (6.84) 0.05 (0.02) 0.06 (0.02) 101.75 (6.82) 111.38 (13.22) 16 0 [9-17] 0 NA NA [15.85 - 35.71] [13.81 - 34.01] [0.02 - 0.07] [0.03 - 0.09] [87.00 - 106.00] [90.00 - 129.00] Leuven_2 (N=14) 6 8 13.53 (1.23) 14.59 (1.32) 0.07 (0.01) 0.07 (0.01) 96.17 (9.54) 102 (6.70) 8 6 NA 6 3 11 [12.20 - 15.30] [12.30 - 16.60] [0.05 - 0.08] [0.05 - 0.10] [83.00 - 106.00] [89.00 - 109.00] PITT (N=11) 6 5 23.03 (9.07) 20.01 (4.56) 0.08 (0.02) 0.09 (0.01) 114.67 (14.90) 114.20 (5.07) 10 1 [7-19] 1 1 10 [12.62 - 35.20] [12.83 - 24.60] [0.06 - 0.11] [0.07 - 0.11] [93.00 - 128.00] [108.00- 119.00] YALE (N=9) 4 5 14.96 (2.39) 12.53 (3.30) 0.07 (0.02) 0.04 (0.01) 84.75 (37.03) 98.20 (12.32) 8 1 NA 3 1 8 [11.83 - 17.17] [8.67 - 16.66] [0.04 - 0.09] [0.03 - 0.06] [37.00 - 120.00] [84.00 - 115.00] KKI (N=8) 3 5 9.88 (1.51) 9.91 (1.69) 0.07 (0.00) 0.07 (0.02) NA NA 6 2 [10-16] 0 1 7 [8.20 - 11.14] [8.46 - 12.76] [0.06 - 0.07] [0.05 - 0.10] OLIN (N=7) 3 4 18.3 (4.93) 19.50 (2.38) 0.09 (0.01) 0.09 (0.03) NA NA 5 2 [9-19] 0 0 7 3[15.00- 24.00] [16.00 - 21.00] [0.08 - 0.10] [0.06 - 0.13] (table continues)
  • 37. 26 Table 4.1. (continued) Sites Sample Size Age RMSD NVIQ Gender ADOS_total Handedness ASD TD ASD mean (SD) [range] TD ASD mean (SD) [range] TD ASD mean (SD) [range] TD Male Female Range [range] NA Left Right TRINITY (N=7) 3 4 16.46 (3.79) 19.48 (4.53) 0.06 (0.01) 0.06 (0.01) 123.00 (7.21) 112.00 (6.38) 7 0 [7-15] 0 0 7 [13.58 - 20.75] [13.75 - 24.83] [0.05 - 0.07] [0.06 - 0.08] [117.00 - 131.00] [107.00- 121.00] CMU (N=5) 3 2 26.67 (4.51) 26.00 (7.07) 0.06 (0.02) 0.08 (0.01) 113.67 (10.79) 109.50 (0.71) 4 1 [8-16] 0 0 5 [22.00 - 31.00] [21.00 - 31.00] [0.04 - 0.08] [0.07 - 0.08] [106.00 - 126.00] [109.00- 110.00] UM_2 (N=4) 1 3 17.40 (0.00) 15.23 (0.67) 0.08 (0.00) 0.04 (0.02) 80.00 (0.00) 99.33 (7.02) 4 0 NA 1 0 4 [17.40 - 17.40] [14.50 - 15.80] [0.08 - 0.08] [0.03 - 0.06] [80.00 - 80.00] [92.00 - 106.00] Validation USM (N=13) 11 2 20.7 (5.7) 28.2 (5.6) .05 (.02) .06 (.005) 105.9 (17.9) 109.5 (11.3) 12 1 [6-21] 1 NA NA [11.4-32.8] [22-34] [.03-.09] [.05-.07] [75-133] [100-119] NYU (N=10) 5 5 13.1 (3.99) 16.4 (4.3) .07 (.02) .05 (.01) 38.8 (11.5) 102.8 (14.7) 2 8 [5-13] 0 NA NA [8.9-17.9] [9.8-20.0] [.04-.09] [.03-.06] [92-119] [83-117] SDSU (N=8) 1 7 9.64 13.5 (2.9) 0.025 .04 (.018) 112 111.7 (8.3) 4 4 11 0 1 7 [9.6-16.8] [.02-.07] [103-124] TRINITY (N=5) 1 4 19.4 16.3 (3.2) 0.057 .066 (.0087) 114 109.8 (6.7) 5 0 13 0 5 [12.7-20.1] [.05-.076] [103-116] UM_1 (N=4) 1 3 16.1 16.2 (2.7) 0.048 .05 (.01) 125 178 (9.9) 2 2 NA 1 0 4 [13.1-18.2] [.03-.06] [82-96] Leuven_2 (N=3) 2 1 14.5 (.29) 12.4 .08 (.02) 0.11 101.5 (21.9) 97 3 0 NAN 3 1 2 [14.3-14.7] [.07-.09] [86-117] PITT (N=3) 2 1 15.6 (1.9) 21.22 .05 (.03) 0.11 103 (9.9) 121 2 1 [12-16] 0 0 3 [14.2-16.9] [.03-.07] [96-110] YALE (N=3) 2 1 14 (0.53) 16.66 .049 (.02) 0.057 85 (1.4) 93 0 3 NA 2 0 3 [13.66-14.4] [.03-.06] [84-86] UM_2 (N=2) 0 2 NA 23.2 (7.9) NA .06 (.003) NA 107.5 (2.2) 2 0 NA NA 0 2 [17.6-28.8] [.058-.06] [106-109] OLIN (N=1) 1 0 18 NA 0.07 NA NA NA 1 0 7 0 0 1
  • 38. 27 Table 4.2. ABIDE Cohort Matching (p-Values from Independent Two-Sample t-Tests) Full Sample Training Validation Training <-> Validation Training <-> Validation N= 252 N=200 N=52 ASD <-> TD ASD <-> TD ASD <-> TD ASD <-> ASD TD <-> TD Motion 0.923 0.922 0.984 0.985 0.984 Age 0.793 0.780 0.984 0.840 0.962 NVIQ 0.771 0.753 0.924 0.934 0.986 ADOS 0.824 Although we modeled effects of site, the use of multisite data with the numerous factors of variability it implies remains a challenge that needs to be accepted as high-quality rs-fMRI data for comparable sample sizes from a single site are not currently available. Inclusion of 126 ASD participants may provide relative protection from cohort effects that are probably at work in many small-sample imaging studies. However, the need for high levels of compliance in fMRI studies, even when there is no explicit task, inevitably results in inclusionary biases because low-functioning people with ASD are unable to participate. The methodologically indispensable step of selecting low-motion data probably added to this issue, given that higher-functioning participants were likely to move less during scanning. Indeed, our diagnostic groups were matched for nonverbal IQ, and relatively few ASD participants with intellectual disability and IQs below 70 were included. The pattern of informative connections observed in our study may therefore not apply to the lower- functioning end of the autism spectrum. A high diagnostic classification accuracy was reached for 100 most informative features. The pattern of these connections was highly complex, involving numerous brain regions. While some networks (somatosensory, default mode, visual) stood out, many others contributed to the classification. This may be disappointing to anyone harboring the hope that the neurobiology of ASD might ultimately be pinned down to simple and localizable causes. However, the literature on functional and anatomical brain anomalies in ASD is fully consistent with the concept of a disorder with highly distributed, rather than localized, patterns (Akshoomoff, Pierce, & Courchesne, 2002; Müller, 2007; Philip et al., 2012; Vissers et al., 2012; Wass, 2011). The complex pattern (cf. Figure 3.1B) should, however, not be viewed as definitive for the ASD population as a whole. Instead, it reflects a distinctive
  • 39. 28 connectivity pattern for a large cohort of children and young adults with ASD. Sample size, careful data denoising, and refinement of machine learning approaches in the present study thus present a distinct advance in the search for functional connectivity biomarkers of ASD, but given the challenges described above, much further work remains to be done. Although fcMRI is generally accepted as a powerful technique for identifying network abnormalities, the addition of complementary techniques (e.g., diffusion weighted MRI for assays of anatomical connectivity; magnetoencephalography for detection of neuronal coherence in high-frequency bands) will likely enhance diagnostic classification and the detection of complex biomarkers (see Zhou, Yu, & Duong, 2014 for a recent multimodal attempt with moderate success). However, it is also important to recognize that machine learning diagnostic classification can only perform at the level permitted by the diagnostic procedure itself, which is fundamentally imperfect. It cannot be expected that a disorder recognized as neurobiological in nature would be detected by means of purely behavioral and observational criteria currently used for diagnosis. There can thus be no perfect fit between brain markers and diagnostic labels. The contribution of machine learning to the discovery of biomarkers, which can ultimately replace behavioral criteria, is therefore important.
  • 40. 29 REFERENCES Akshoomoff, N., Pierce, K., & Courchesne, E. (2002). The neurobiological basis of autism from a developmental perspective. Development and Psychopathology, 14, 613-634. American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders - V. Washington, DC: American Psychiatric Association. Anderson, J. S., Nielsen, J. A., Froehlich, A. L., DuBray, M. B., Druzgal, T. J., Cariello, A. N., . . . Lainhart, J. E. (2011). Functional connectivity magnetic resonance imaging classification of autism. Brain, 134, 3742-3754. Andrews-Hanna, J. R., Reidler, J. S., Sepulcre, J., Poulin, R., & Buckner, R. L. (2010). Functional-anatomic fractionation of the brain's default network. Neuron, 65, 550- 562. Assaf, M., Jagannathan, K., Calhoun, V. D., Miller, L., Stevens, M. C., Sahl, R., . . . Pearlson, G. D. (2010). Abnormal functional connectivity of default mode sub- networks in autism spectrum disorder patients. Neuroimage, 53, 247-256. Breiman, L. (2001). Random forests. Machine Learning, 45, 5-32. Carp, J. (2013). Optimizing the order of operations for movement scrubbing: Comment on Power et al. NeuroImage, 76, 436-438. Cascio, C. J., Moana-Filho, E. J., Guest, S., Nebel, M. B., Weisner, J., Baranek, G. T., & Essick, G. K. (2012). Perceptual and neural response to affective tactile texture stimulation in adults with autism spectrum disorders. Autism Research: Official Journal of the International Society for Autism Research, 5, 231-244. Cordes, D., Haughton, V. M., Arfanakis, K., Carew, J. D., Turski, P. A., Moritz, C. H., . . . Meyerand, M. E. (2001). Frequencies contributing to functional connectivity in the cerebral cortex in "resting-state" data. American Journal of Neuroradiology, 22, 1326-1333. Cox, R. W. (1996). AFNI: Software for analysis and visualization of functional magnetic resonance neuroimages. Computers and Biomedical Research, 29, 162-173. Dakin, S., & Frith, U. (2005). Vagaries of visual perception in autism. Neuron, 48, 497-507. Di Martino, A., Yan, C. G., Li, Q., Denio, E., Castellanos, F. X., Alaerts, K., . . . Milham, M. P. (2013). The autism brain imaging data exchange: Towards a large-scale evaluation of the intrinsic brain architecture in autism. Molecular Psychiatry, 19(6), 659-667. Di Martino, A., Zuo, X. N., Kelly, C., Grzadzinski, R., Mennes, M., Schvarcz, A., . . . Milham, M. P. (2013). Shared and distinct intrinsic functional network centrality in autism and attention-deficit/hyperactivity disorder. Biological Psychiatry, 74(8), 623- 632.
  • 41. 30 Fischl, B., van der Kouwe, A., Destrieux, C., Halgren, E., Segonne, F., Salat, D. H., . . . Dale, A. M. (2004). Automatically parcellating the human cerebral cortex. Cerebral Cortex, 14, 11-22. Greschwind, D. H., & Levitt, P. (2007). Autism spectrum disorders: Developmental disconnection syndromes. Current Opinion in Neurobiology, 17, 103-111. Guyon, V. V., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Machine Learning, 46, 389-422. Irimia, A., Chambers, M. C., Torgerson, C. M., & Horn, J. D. (2012). Circular representation of human cortical networks for subject and population-level connectomic visualization. NeuroImage, 60, 1340-1351. Jo, H. J., Gotts, S. J., Reynolds, R. C., Bandettini, P. A., Martin, A., Cox, R. W., & Saad, Z. S. (2013). Effective preprocessing procedures virtually eliminate distance-Dependent motion artifacts in resting state FMRI. Journal of Applied Mathematics, 2013, 935154. Kennedy, J., & Eberhart, R. (1995). Particle swarm optimization. Proceedings of the IEEE International Conference on Neural Network, 4, 1942-1948. Kennedy, D. P., Redcay, E., & Courchesne, E. (2006). Failing to deactivate: Resting functional abnormalities in autism. Proceedings of the National Academy of Science USA, 103, 8275-8280. Keown, C. L., Shih, P., Nair, A., Peterson, N., & Müller, R.-A. (2013). Local functional overconnectivity in posterior brain regions is associated with symptom severity in autism spectrum disorders. Cell Reports, 5, 567-572. Liaw, A., & Wiener, M. (2002). Classification and regression by random forest. R News, 2, 18-22. Miyazaki, M., Fujii, E., Saijo, T., Mori, K., Hashimoto, T., Kagami, S., & Kuroda, Y. (2007). Short-latency somatosensory evoked potentials in infantile autism: Evidence of hyperactivity in the right primary somatosensory area. Developmental Medicine and Child Neurology, 49, 13-17. Monk, C. S., Peltier, S. J., Wiggins, J. L., Weng, S. J., Carrasco, M., Risi, S., & Lord, C. (2009). Abnormalities of intrinsic functional connectivity in autism spectrum disorders. Neuroimage, 47, 764-772. Müller, R.-A. (2007). The study of autism as a distributed disorder. Mental Retardation and Developmental Disabilities Research Review, 13, 85-95. Müller, R.-A., Shih, P., Keehn, B., Deyoe, J. R., Leyden, K. M., & Shukla, D. K. (2011). Underconnected, but how? A survey of functional connectivity MRI studies in autism spectrum disorders. Cerebral Cortex, 21, 2233-2243. Murdaugh, D. L., Shinkareva, S. V., Deshpande, H. R., Wang, J., Pennick, M. R., & Kana, R. K. (2012). Differential deactivation during mentalizing and classification of autism based on default mode network connectivity. PloS One, 7, e50064.
  • 42. 31 Nair, A., Keown, C. L., Datko, M. C., Shih, P., Keehn, B. M., Müller, R.-A. (2014). Impact of methodological variables on functional connectivity findings in Autism Spectrum Disorders. Human Brain Mapping, 35(8), 4035-4048. Nielsen, J. A., Zielinski, B. A., Fletcher, P. T., Alexander, A. L., Lange, N., Bigler, E. D., . . . Anderson, J. S. (2013). Multisite functional connectivity MRI classification of autism: ABIDE results. Frontiers in Human Neuroscience, 7, 599. Philip, R. C., Dauvermann, M. R., Whalley, H. C., Baynham, K., Lawrie, S. M., & Stanfield, A. C. (2012). A systematic review and meta-analysis of the fMRI investigation of autism spectrum disorders. Neuroscience and Biobehavioral Review, 36, 901-942. Power, J. D., Cohen, A. L., Nelson, S. M., Wig, G. S., Barnes, K. A., Church, J.A., . . . Petersen, S. E. (2011). Functional network organization of the human brain. Neuron, 72, 665-678. Power, J. D., Barnes, K. A., Snyder, A. Z., Schlaggar, B. L., Petersen, S. E. (2012). Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. Neuroimage, 59, 2142-2154. Power JD, Barnes KA, Snyder AZ, Schlaggar BL, Petersen SE (2013): Steps toward optimizing motion artifact removal in functional connectivity MRI; a reply to Carp. NeuroImage 76:439-441. Power, J. D., Mitra, A., Laumann, T. O., Snyder, A. Z., Schlaggar, B. L., & Petersen, S. E. (2014): Methods to detect, characterize, and remove motion artifact in resting state fMRI. Neuroimage 84:320-341. Puts, N. A., Wodka, E. L., Tommerdahl, M., Mostofsky, S. H., & Edden, R. A. (2014). Impaired tactile processing in children with autism spectrum disorder. Journal of Neurophysiology, 111, 1803-1811. Redcay, E., Moran, J. M., Mavros, P. L., Tager-Flusberg, H., Gabrieli, J. D., & Whitfield- Gabrieli, S. (2013). Intrinsic functional network organization in high-functioning adolescents with autism spectrum disorder. Frontiers in Human Neuroscience, 7, 573. Samson, F., Mottron, L., Soulieres, I., & Zeffiro, T. A. (2012). Enhanced visual functioning in autism: An ALE meta-analysis. Human Brain Mapping, 33,1553-1581. Satterthwaite, T. D., Elliott, M. A., Gerraty, R. T., Ruparel, K., Loughead, J., Calkins, M. E., . . . Wolf, D. H. (2013). An improved framework for confound regression and filtering for control of motion artifact in the preprocessing of resting-state functional connectivity data. Neuroimage, 64, 240-256. Schipul, S. E., Keller, T. A., & Just, M. A. (2011). Inter-regional brain communication and its disturbance in autism. Frontiers in Systems Neuroscience, 5, 10. Shen, M. D., Shih, P., Ottl, B., Keehn, B., Leyden, K. M., Gaffrey, M. S., & Muller, R. A. (2012). Atypical lexicosemantic function of extrastriate cortex in autism spectrum disorder: Evidence from functional and effective connectivity. Neuroimage, 62, 1780- 1791. Simmons, D. R., Robertson, A. E., McKay, L. S., Toal, E., McAleer, P., & Pollick, F. E. (2009). Vision in autism spectrum disorders. Vision Research, 49, 2705-2739.
  • 43. 32 Smith, S. M., Jenkinson, M., Woolrich, M. W., Beckmann, C. F., Behrens, T. E., Johansen- Berg, H., . . . Matthews, P. M. (2004). Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage, 23(Suppl 1), S208-219. Tomchek, S. D., & Dunn, W. (2007). Sensory processing in children with and without autism: A comparative study using the short sensory profile. American Journal of Occupational Therapy, 61, 190-200. Tommerdahl, M., Tannan, V., Cascio, C. J., Baranek, G. T., Whitsel, B. L. (2007). Vibrotactile adaptation fails to enhance spatial localization in adults with autism. Brain Research, 1154, 116-123. Uddin, L. Q., Supekar, K., Lynch, C. J., Khouzam, A., Phillips, J., Feinstein, C., . . . Menon, V. (2013). Salience network-based classification and prediction of symptom severity in children with autism. Journal of the American Medical Association on Psychiatry, 70(8), 869-879. Uddin, L. Q., Supekar, K., & Menon, V. (2013). Reconceptualizing functional brain connectivity in autism from a developmental perspective. Frontiers in Human Neuroscience, 7, 458. Van Dijk, K. R., Hedden, T., Venkataraman, A., Evans, K. C., Lazar, S. W., & Buckner, R. L. (2010). Intrinsic functional connectivity as a tool for human connectomics: Theory, properties, and optimization. Journal of Neurophysiology, 103, 297-321. Van Dijk, K. R., Sabuncu, M. R., & Buckner, R. L. (2012). The influence of head motion on intrinsic functional connectivity MRI. Neuroimage, 59, 431-438. Vapnik, V. M. (1995). The nature of statistical learning theory. New York, NY: Springer. Vissers, M. E., Cohen, M. X., & Geurts, H. M. (2012). Brain connectivity and high functioning autism: A promising path of research that needs refined models, methodological convergence, and stronger behavioral links. Neuroscience and Biobehavioral Review, 36, 604-625. von dem Hagen, E. A., Stoyanova, R. S., Baron-Cohen, S.,& Calder, A. J. (2012). Reduced functional connectivity within and between 'social' resting state networks in autism spectrum conditions. Social Ccognitive and Affective Neuroscience, 8(6), 694-701. Wang, X., Yang, J., Teng, X., Xia, W., & Jensen, R. (2007). Feature selection based on rough sets and particle swarm optimization. Pattern Recognition Letters, 28, 459-471 Washington, S. D., Gordon, E. M., Brar, J., Warburton, S., Sawyer, A. T., Wolfe, A., . . . Vanmeter, J. W. (2013). Dysmaturation of the default mode network in autism. Human Brain Mapping, 35(4), 1284-1296. Wass, S. (2011). Distortions and disconnections: Disrupted brain connectivity in autism. Brain and Cognition, 75, 18-28. Zhang, Y., Brady, M., & Smith, S. (2001). Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. IEEE Transactions on Medical Imaging, 20, 45-57.
  • 44. 33 Zhou, Y., Yu, F., & Duong, T. (2014). Multiparametric MRI characterization and prediction in autism spectrum disorder using graph theory and machine learning. PloS One, 9, e90405. Zielinski, B. A., Anderson, J. S., Froehlich, A. L., Prigge, M. B., Nielsen, J. A., Cooperrider, J. R., . . . Lainhart, J. E. (2012). scMRI reveals large-scale brain network abnormalities in autism. PloS One, 7, e49172.