[PDF] Learning and comparing functional connectomes across subjects

Abstract

Functional connectomes capture brain interactions via synchronized fluctuations in the functional magnetic resonance imaging signal. If measured during rest, they map the intrinsic functional architecture of the brain. With task-driven experiments they represent integration mechanisms between specialized brain areas. Analyzing their variability across subjects and conditions can reveal markers of brain pathologies and mechanisms underlying cognition. Methods of estimating functional connectomes from the imaging signal have undergone rapid developments and the literature is full of diverse strategies for comparing them. This review aims to clarify links across functional-connectivity methods as well as to expose different steps to perform a group study of functional connectomes.

Full PDF

aa r X i v : . [ q - b i o . N C ] A p r Learning and comparing functional connectomes across subjects

Ga¨el Varoquaux a,b,c, ∗ , R. Cameron Craddock d,e a Parietal project-team, INRIA Saclay-ˆıle de France b INSERM, U992 c CEA/Neurospin bˆat 145, 91191 Gif-Sur-Yvette d Child Mind Institute, New York, New York e Nathan Kline Institute for Psychiatric Research, Orangeburg, New York

Abstract

Functional connectomes capture brain interactions via synchronized ﬂuctuations in the functional magnetic resonanceimaging signal. If measured during rest, they map the intrinsic functional architecture of the brain. With task-drivenexperiments they represent integration mechanisms between specialized brain areas. Analyzing their variability acrosssubjects and conditions can reveal markers of brain pathologies and mechanisms underlying cognition. Methods ofestimating functional connectomes from the imaging signal have undergone rapid developments and the literature is fullof diverse strategies for comparing them. This review aims to clarify links across functional-connectivity methods as wellas to expose diﬀerent steps to perform a group study of functional connectomes.

Keywords:

Functional connectivity, connectome, group study, eﬀective connectivity, fMRI, resting-state

1. Introduction

Functional connectivity reveals the synchronization ofdistant neural systems via correlations in neurophysiolog-ical measures of brain activity [14, 37]. Given that high-level function emerges from the interaction of specializedunits [110], functional connectivity is an essential part ofthe description of brain function, that complements thelocalizationist picture emerging from the systematic map-ping of regions recruited in tasks [101]. However, whilethere exists a well-deﬁned standard analysis frameworkfor activation mapping that enables statistically-controlledcomparisons across subjects [39], group-level analysis offunctional connectivity still face many open methodolog-ical challenges. Deriving a picture of a single subject’sfunctional connectivity is by itself not straightforward, asthe brain comprises a myriad of interacting subsystemsand its connectivity must be decomposed into simpliﬁedand synthetic representations. An important view of brainconnectivity is that of distributed functional networks de-picted by their spatial maps [31]. Another no less impor-tant and complementary view is that of connections link-ing localized functional modules depicted as a graph [17].This representation of brain connectivity is often calledthe functional connectome [102] and is the focus of intenseworldwide research eﬀorts as it holds promises of new in-sights in cognition and pathologies [13, 30, 45].The purpose of this paper is to review methodologicalprogress in the estimation of functional connectomes from ∗ Corresponding author blood oxygenation level dependent (BOLD) based func-tional magnetic resonance imaging (fMRI) data and theircomparisons across individuals. It does not attempt to beexhaustive, as the ﬁeld is wide and moving rapidly, but de-tails speciﬁc tools and guidelines that, in the experience ofthe authors, lead to controlled and powerful inter-subjectcomparisons. The paper is focused on functional connec-tomes in contrast to structural connectomes, as the infer-ence of functional connectivity requires important statisti-cal modeling considerations that are vastly diﬀerent fromthe complications involved with estimating structural con-nectivity. While the notion of functional connectomics isoften associated with the study of resting state [13], themethods presented in this paper are also relevant for task-based studies. On the other hand, the paper has a focus onfMRI; although the core concepts presented can be appliedto magnetoencephalography (MEG) or electroencephalog-raphy (EEG) [103], additional speciﬁc problems such assource reconstruction must be considered [93].“Functional connectivity” is deﬁned as a measure ofsynchronization in brain signals [35]. More generally, itis interesting as a window on underlying synchrony onneural processes [63]. By “functional connectome”, herewe speciﬁcally denote a graph representing functional in-teractions in the brain, where the term “graph” is takenin its mathematical sense: a set of nodes connected to-gether by edges . Graph nodes (brain regions) correspondto spatially-contiguous and functionally-coherent patchesof gray matter and edges describe long-range synchroniza-tions between nodes that are putatively subtended by largeﬁber pathways [68]. A graph can be weighted or not,and is completely equivalent to its adjacency matrix , a

Preprint submitted to Elsevier April 16, 2013 ymmetric matrix tabulating the connection weights be-tween each pair of nodes. Functional-connectivity graphsare used to represent evoked activity, as in task-responsestudies [72], as well as ongoing activity, present in the ab-sence of speciﬁc tasks or in the background during taskand often studied in so-called resting state experiments[83]. Another important notion that arises from the studyof distributed modes of brain function is that of specializedfunctional networks [31]. With our deﬁnition of the func-tional connectome, functional networks are not directlybuilding blocks of the connectome but appear as a conse-quence of the graphical structure [116, 117].The paper is organized as follows. First we discuss es-timation of functional connectomes. This part, akin to aﬁrst-level analysis in standard activation mapping method-ology, is not in itself a group-level operation, but it is acritical step for inter-subject comparison. In a followingsection, we discuss several strategies for comparing con-nectomes across subjects. Finally we discuss the links be-tween the representation of brain connectivity as graphsof functional connectivity and more complex models, suchas eﬀective-connectivity models.

2. Estimating functional connectomes

Here we discuss the inference of connectomes fromfunctional brain imaging data. We start with preprocess-ing considerations, followed by the choice of nodes i.e. re-gions, signal extraction, and the estimation of graphs.

In addition to standard preprocessing performedfor task-based analysis (slice-timing correction, realign-ment, spatial normalization, and possibly smoothing),connectivity-based analysis require additional denoising toseparate intrinsic activity from confounding signals. Thisprocess involves regressing time series capturing sourcesof structured noise from the fMRI data. Physiologicalnoise due to cardiac and respiration are two importantnoise signals [11, 12, 53, 67] that are diﬃcult to controlfor and as a result are not commonly regressed out. In-stead the mean signal from white matter (WM) and cere-brospinal ﬂuid (CSF) are used as surrogates to measurethese sources of noise as well as other scanner induced sig-nal ﬂuctuations [31, 67]. More complex models account forspatial variation in noise by incorporating voxel-speciﬁc re-gressors of neighboring WM (ANATICOR [55]) or the topcomponents from a principal components analysis of high-variance signals (CompCor [7]). Head motion induced sig-nal ﬂuctuations are accounted for by incorporating move-ment parameters [31, 41, 67]. The global mean time series In neuroimaging, the term network is sometimes used to denotea graph of brain function. To disambiguate the notion of segregatedspatial mode [31] from that of connectivity graphs, we will purposelyrestrict its usage in this paper. has been proposed as an additional noise regressor thatappears to improve the spatial speciﬁcity of connectivityresults [31, 32]. This practice has become controversialsince the global signal regression introduces negative corre-lations [19, 77, 90]. Removing these sources of nuisance inaddition to linear trends results in more contrasted corre-lation matrices that improve the delineation of functionalstructures (ﬁg. 2).Filtering to remove high frequencies is often performed,based on the initial observation that ﬂuctuations impli-cated in resting-state functional connectivity are predom-inately slower than 0.1 Hz [14, 23]. While high-pass andlow-pass ﬁltering decrease the impact of some confounds,recent studies have shown that connectivity is presentacross the full spectrum of observed frequencies [99, 113].Regressing out a good choice of confound signals is morespeciﬁc than frequency ﬁltering, and in our experiencegives more contrasted correlation matrices . In addition,the recent developments of very rapid acquisition proto-cols prevent aliasing of the physiological noise with theneural signal and give access to more speciﬁc noise con-founds than traditional low-TR sequences [16].It is important to keep in mind that the proposed cor-rection strategies are approximate and not deﬁnitive tech-niques. This has become particularly apparent for headmotion with reports that micromovements on the scale of ≤ . The choice of regions of interests (ROIs) that deﬁnethe nodes of the graphs can be very important both inthe estimation of connectomes and for group comparison[119]. Unsurprisingly, simulations have shown that ex-tracting signal from ROIs that did not match functionalunits would lead to erronous graph estimation [100]. Dif-ferent strategies to deﬁne suitable ROIs coexist. Whiledense parcellation approaches cover a large fraction of thebrain [1, 8, 25, 116, 119], this coverage can be traded oﬀ tofocus on some speciﬁc regions, in favor of increased func-tional speciﬁcity and thus better diﬀerentiation across net-works [28, 46, 114]. In addition, while ROIs are most oftendeﬁned as a hard selection of voxels, it is also possible touse a soft deﬁnition, attributing weights as with proba-bilistic atlases, or spatial maps of functional networks ex-tracted from techniques such as independent componentanalysis (ICA) [57, 99].

Regions from atlases.

Atlases can be used to deﬁne full-brain parcellations. Popular choices are the AutomaticAnatomic Labeling (AAL) atlas [111], which beneﬁts from Note that naive use of ﬁltering can induce spurious correlations[26].

2n SPM toolbox, or the ubiquitous Talaraich-Tournoux at-las [107]. However, these atlases suﬀer from major short-comings; namely i) they were deﬁned on a single subjectand thus do not reﬂect inter-subject variability, and ii) they focus on labeling large anatomical structures and donot match functional layout –for instance only two re-gions describe the medial part of the frontal lobe in theAAL atlas. Multi-subject probabilistic altases such asthe Harvard-Oxford atlas distributed with FSL [98] or thesulci-based structural atlas used in [116] mitigate the ﬁrstproblem, and the high number of regions deﬁned usingsulci also somewhat circumvent the second problem (seeﬁg. 1). Deﬁning regions from the literature.

Regions can be de-ﬁned from previous studies, informally or with system-atic meta-analysis. This strategy is used to deﬁne themain resting-state networks, such as the default mode net-work, but may also be useful to study connectivity in task-speciﬁc networks [14, 28, 47, 86]. The common practice isto place balls of a given radius, 5 or 10 mm, centered at thecoordinates of interest. Given that functional networks aretightly interleaved in some parts of the cortex, such as theparietal lobe, care must be taken not to deﬁne too manyregions that would overlap and lead to mixing of the signal.

FMRI-based function deﬁnition.

Deﬁning regions directlyfrom the fMRI signal brings many beneﬁts. First, it cancapture subject-speciﬁc functional information. Second,it adapts to the signal at hand and its limitations, suchas image distortions or vascular and movement artifactsthat are isolated in ICA-like approaches. Lastly, incorpo-rating functional information into regional deﬁnition willresult in more homogenous regions that better representconnectivity present at the voxel level than anatomically-deﬁned atlases such as AAL or Harvard-Oxford [25]. Thesimplest approach to deﬁne task-speciﬁc regions is to useactivation maps derived from standard GLM-based anal-ysis in a task-driven study (see for instance [81]). Re-gions are extracted by thresholding the maps, or usingballs around the activation peaks. For resting-state stud-ies, unsupervised multivariate analysis techniques are nec-essary. Clustering approaches extract full-brain parcella-tions [9, 25, 109, 121], and have been shown to segmentwell-known functional structures from rest data. Alterna-tively, decomposition methods, such as ICA [6], can unmixlinear combinations of multiple eﬀects and separate outpartially-overlapping spatial maps that capture functionalnetworks or confounding eﬀects, as for instance with thepresence of vascular structure in functional networks. Athigh model order, ICA maps deﬁne a functional parcel-lation [57]. Extracting regions from these maps requiresadditional eﬀort as they can display fragmented spatialfeatures and structured background noise, but incorporat-ing sparsity and spatial constraints in the decompositiontechniques leads to contrasted maps that outline many dif-ferent structures [117] (see ﬁg. 1).

Figure 1: Diﬀerent full-brain parcellations: the AAL atlas [111], theHarvard-Oxford atlas, the sulci atlas used in [116], regions extractedby Ncuts [25], the resting-state networks extracted in [97] by ICA,and in [115] by sparse dictionary learning.

Optimal number of regions.

Deﬁning an optimal numberof regions to use for whole-brain connectivity analysisbears careful consideration. On one hand we desire a suf-ﬁciently large number of regions to guarantee that theyare functionally homogeneous regions and adequately rep-resent the connectivity information present in the data.On the other hand too many regions will render statisticalinference challenging, result in an explosion in computa-tional complexity, and interfere with the interpretability ofobserved connections. For functional parcellation, cross-validation methods can be employed to estimate an opti-mal number of regions based on homogeneity, the abilityto reproduce connectivity information present at the voxelscale, and the ability to obtain the same parcellations fromindependent data [15, 25]. In general these metrics do notresult in an obvious peak at a “best” number of regions,but instead oﬀer a range over which the number of regionscan be chosen based on the needs of the analysis at hand.Finally, it is important to keep in mind that there is nouniversally better parcellation and associated number ofregions. From a practical standpoint, these choices willdepend on the task at hand, and more fundamentally, agood description of brain function should cover multiplescales. Given that it is not clear that an optimal parcel-lation can be identiﬁed from the sample size of a typicalstudy, randomized parcellation, as used in structural con-nectomes [124] or activation mapping [118], may also beconsidered.

The concept of functional connectivity has been calledelusive [51]: it has many mathematical instantiations al-3 o r s P CC L D M NRD M N M ed D M N F r on t D M NR P o s t T e m p RD L P F CR P a r R F r on t po l L P a r L D L P F C L F r on t po l L I PS R I PS L A n tI PS R A n tI PS M o t o r L A ud R A udL S T S R S T S L I n s R I n s C i ng VA CCD A CCR A I n s B a s a l B r o c a R P a r s O p S up F r on t S L T P J R T P J C e r ebLL O CR L O C V i s S t r i a t e O cc po s t D o r s P CC L D M NR D M N M ed D M N F r on t D M NR P o s t T e m p R D L P F CR P a r R F r on t po l L P a r L D L P F C L F r on t po l L I PS R I PS L A n t I PS R A n t I PS M o t o r L A ud R A udL S T S R S T S L I n s R I n s C i ng V A

CCD A CCR A I n s B a s a l B r o c a R P a r s O p S up F r on t S L T P J R T P J C e r ebL L O CR L O C V i s S t r i a t e O cc po s t Without regressing outCompCor, WM and CSF

Regressing outglobal signal

Figure 2: Correlation matrices of rest time-series extracted from the39 main regions of the Varoquaux 2011 [115] parcellation with diﬀer-ent choices of confound regressors –

Left : regressing out CompCorsignals, as well as white matter and CSF average signals and move-ment parameters. The insert shows the connections restricted to afew major nodes. –

Upper right : regressing out only movement pa-rameters. –

Lower right : regressing out movement parameters andglobal signal mean. No frequency ﬁltering was applied here. Whenno confounding brain signals are regressed, all regions are heavily cor-related. Regressing out common signal, in the form of well-identiﬁedconfounds or a global mean, teases out the structure. though in essence they all strive to extract simple statisticsfrom functional imaging in order to characterize synchronyand communication between large ensembles of neurons.Here we choose to focus on second order statistics thatcan be related to Gaussian models, the simplest of whichbeing the correlation matrix of the signals of the diﬀerentROIs.

Signal extraction.

Given a set of graph nodes, the nextstep is to extract a representative time series for each node.To study intrinsic activity, e.g. with rest data, signal ex-traction can be achieved by either averaging the fMRI timeseries across the voxels in a region, or by taking the ﬁrsteigenvariate from a principle components analysis of thetime series [40]. Comparisons of these methods has shownthat the eigenvariate method is more sensitive to functioninhomogeneity [25] and exhibits worse test-retest reliabil-ity than averaging time series [128]. In addition, improvedspeciﬁcity to BOLD signal can be enforced by using onlysignal in voxels near gray-matter tissues. For this pur-pose, we suggest summarizing the signal in an ROI bya mean of the diﬀerent voxels weighted by the subject-speciﬁc gray matter probabilistic segmentation, as outputby e.g.

SPM’s segmentation tool [4] or FSL’s FAST pro-gram [126].Studying connectivity from evoked activity with task-driven studies requires disambiguating task-speciﬁc con-nectivity eﬀects from intrinsic connectivity mediated byshared neuromodulatory/task inputs, anatomical path-ways, etc . In this regard, it can be beneﬁcial to run aGLM-based ﬁrst-level analysis, enforcing speciﬁcity of themeasure extracted to the task. With slow event-related D o r s P CC L D M NRD M N M ed D M N F r on t D M NR P o s t T e m p RD L P F CR P a r R F r on t po l L P a r L D L P F C L F r on t po l L I PS R I PS L A n tI PS R A n tI PS M o t o r L A ud R A udL S T S R S T S L I n s R I n s C i ng VA CCD A CCR A I n s B a s a l B r o c a R P a r s O p S up F r on t S L T P J R T P J C e r ebLL O CR L O C V i s S t r i a t e O cc po s t D o r s P CC L D M NR D M N M ed D M N F r on t D M NR P o s t T e m p R D L P F CR P a r R F r on t po l L P a r L D L P F C L F r on t po l L I PS R I PS L A n t I PS R A n t I PS M o t o r L A ud R A udL S T S R S T S L I n s R I n s C i ng V A

CCD A CCR A I n s B a s a l B r o c a R P a r s O p S up F r on t S L T P J R T P J C e r ebL L O CR L O C V i s S t r i a t e O cc po s t Without sparsity

GraphLasso estimate

Figure 3: Diﬀerent inverse-covariance matrices estimates corre-sponding to ﬁg. 2 –

Left : group-sparse estimate using the ℓ es-timator [116]. The insert shows the connections restricted to a fewmajor nodes. – Upper right : non-sparse estimate: inverse of thesample correlation matrix. –

Lower right : sparse estimate usingthe Graph Lasso [34]. designs, task-speciﬁc functional connectivity can be cap-tured in trial-to-trial ﬂuctuations in the BOLD response,estimated using a GLM analysis with one regressor pertrial [47, 75, 86]. This approach, known as beta-series re-gression, has been adapted for rapid event-related designs,using multiple GLMs to optimize deconvolution of eachtrial [76].

Correlation and partial correlations.

Given ROIs deﬁningthe nodes of the functional-connectome graph, one needsto estimate the corresponding edges connecting them.Functional connectivity between the ROIs can be mea-sured by computing the correlation matrix of the extractedsignals. An important and often neglected point is that thesample correlation matrix, i.e. the correlation matrix ob-tained by plugging the observed signal in the correlationmatrix formula, is not the population correlation matrix, i.e. the correlation matrix of the data-generating process.If the number of measurements was inﬁnite, the two wouldcoincide, however if this number is not large compared tothe number of connections (that scales as the square of thenumber of ROIs), the sample correlation matrix is a poorestimate of the underlying population correlation matrix.In other words, the sample correlation matrix captures alot of sampling noise, intrinsic randomness that arises inthe estimation of correlations from short time series. Con-clusions drawn from the sample correlation matrix can eas-ily reﬂect this estimation error. Varoquaux et al. [116] andSmith et al. [100] have shown respectively on rest fMRI andon realistic simulations that a good choice of correlationmatrix estimator could recover the connectivity structure,where the sample correlation matrix would fail. In general,the choice of a better estimate depends on the settings andthe end goals [114, 117], however the Ledoit-Wolf shrink-age estimate [62] is a simple, computationally-eﬃcient, and4 ithout regressing outCompCor, WM and CSF

Regressing outglobal signal

Figure 4: Inverse-covariance matrices for diﬀerent choice of con-found regressors –

Left : regressing out only movement parameters–

Right : removal of the global mean, instead of the white matter,CSF, and CompCor time courses. parameter-less alternative that performs uniformly betterthan the sample correlation matrix [116, 117] and shouldalways be preferred.For the problem of recovering the functional-connectivity structure , i.e. ﬁnding which region is con-nected to which, sparse inverse covariance estimators havebeen found to be eﬃcient [89, 100, 116]. The intuition forrelying on inverse covariance rather than correlation stemsfrom that fact that standard correlation (marginal correla-tion) between two variables a and b also capture the eﬀectsof other variables: strong correlation of a and b with a thirdvariable c will induce a correlation between a and b . Onthe opposite, the inverse covariance matrix (also called precision matrix ) captures partial correlations, removingthe eﬀect of other variables [71]. In the small sample limit,this removal is challenging from the statistical standpoint.This is why an assumption of sparsity, i.e. that only fewvariables need to be considered at a time, is importantto estimate a good inverse covariance. Various estimationstrategies exist for sparse inverse covariance, and have animpact on the resulting networks [116, 117]. The GraphLasso ( ℓ -penalized maximum-likelihood estimator) [34] isin general a good approach for structure recovery. In groupstudies, the ℓ estimator [50, 116] is useful to impose acommon sparsity structure across diﬀerent subjects andachieve better recovery of this common structure. Simplyput, these approaches are necessary because estimationnoise creates a background structure (see ﬁg. 3); however,unlike in a univariate situation, the parameters are not in-dependent, and the spurious background connections de-grade the estimation of the actual connections. The sparseestimators make a compromise between imposing simplermodels, i.e. with less connections, and providing a goodﬁt to the data. This compromise is set via a regulariza-tion parameter which controls the sparsity of the estimate.A good procedure to choose this parameter is via cross- Covariance and correlation matrices diﬀer simply by the factthat a covariance matrix captures the amplitude of a signal, via itsvariance, while a correlation matrix is computed on standardized(zero mean, unit variance) signals. validation [116].

Network structure extracted.

The correlation matrices andinverse-covariance matrices that we extract contain a lot ofinformation on the functional structure of the brain. First,the correlation matrix (ﬁg. 2) shows blocks of synchronizedregions that can be interpreted as large-scale functionalnetworks, such as the default mode network. Note that thesplit in networks is not straightforward. Diﬀerent orderingof the nodes will reveal diﬀerent networks. Indeed, becauseof the presence of hubs and interleaved networks, the pic-ture in terms of segregated networks is not suﬃcient to ex-plain full-brain connectivity [117]. Connectivity matrices,correlation matrices and inverse-covariance matrices, canbe represented as graphs: nodes connected by weightededges (inserts on ﬁg. 2 and ﬁg. 3). The inverse-covariancematrix, which captures partial correlations, appears thenas extracting a backbone or core of the graph. While suchstructure has been used as a way to summarize anatomi-cal brain connectivity graphs [49], here it has a clear-cutmeaning with regard to the BOLD signal: it gives the con-ditional independence structure between regions [117]. Inother words, regions a and b are not connected if the sig-nal that they have in common can be explained by a thirdregion c . In this light, the choice of nuisance regressorsto remove confounding common signal is less critical withpartial correlations than with correlations. Indeed, whilewith correlation matrices regressing out the global meanhas a drastic eﬀect (ﬁg. 2 upper right and lower right), oninverse covariance it only changes the resulting matricesvery slightly (ﬁg. 4).There have been debates on whether to regress out cer-tain signals, such as the global mean, as it induces negativecorrelations [19, 32, 77], and these may seem surprising:one network appears as having opposite ﬂuctuations toanother. However, correlation between two signals onlytakes its meaning with the deﬁnition of a baseline. A sim-ple picture to explain anti-correlations between two regionsis the presence of a third region, mediating the interac-tions. Using this third region as a baseline would amountto estimate partial correlations in the whole system. Us-ing inverse-covariance matrices or partial correlations tounderstand brain connectivity makes the interpretationin terms of interactions between brain regions easier andmore robust to the choice of confounds.

3. Comparing connectivity

We now turn to the problem of comparing functionalconnectivity across subjects or across conditions.

First, we focus on detecting where the connectivity ma-trices estimated in the previous section diﬀer.5 ass-univariate approaches.

The most natural approachis to apply a linear model to each coeﬃcient of the con-nectivity matrices [47, 64]. This approach is similar to thesecond-level analysis used in mass-univariate brain map-ping, and gives rise to many of the well-known techniquesused in such a context, such as the deﬁnition of a second-level design, with possibly the inclusion of confounding ef-fects, and statistical tests (T tests or F tests) on contrastvectors. Importantly, in order to work with Gaussian-distributed variables, it is necessary to apply a Fisher Ztransform to the correlations. Note that in these set-tings, the Ledoit-Wolf estimator [62] is often a good choiceto estimate the correlation matrix, as it is parameter-freeand gives good estimation performance without imposingany restrictions on the data. For hypothesis testing, cor-recting for multiple comparison can severely limit statis-tical power, as the number of tests performed scales asthe square of the number of regions used. Controlling forthe false discovery rate (FDR) mitigates this problem. Al-ternatively, as the assumptions underlying the Benjamini-Hochberg procedure [10] for the FDR can easily be broken,non-parametric permutations-based tests give reliable ap-proaches. In particular, the max-T procedure [42, 79] isinteresting to avoid the drastic Bonferroni correction whencontrolling for multiple comparison in family-wise errorrate. Accounting for distributed variability.

A speciﬁc challengeof connectivity analysis is that the connectivity strengthbetween diﬀerent regions tends to covary. For instance,with resting-state data, functional networks comprisingmany nodes can appear as more or less connected acrosssubjects (see for instance ﬁg. 5, showing variability in acontrol population at rest). In other words, non-speciﬁcvariability is distributed across the connectivity graph,and it is structured by the graph itself. This obser-vation brings the natural question of whether second-level analysis should be performed on correlation matri-ces, inverse-covariance matrices, or another parametriza-tion that would disentangle eﬀects and give unstructured(white) residuals. While inverse-covariance matrices showless distributed ﬂuctuations than correlation matrices,they capture a lot of background noise, as partial corre-lations are intrinsically harder to estimate. Preliminarywork [114] suggests performing statistical tests on residu-als of a parametrization intermediate between correlationmatrix and inverse covariance matrix, as it can decoupleeﬀects and noise.Taking a diﬀerent stance on distributed variability, the“network-based statistics” approach [122] draws from thehypothesis that if, in a second-level analysis, an eﬀect isdetected on a connection that lies in a network of stronglyconnected nodes, a large sub-network is likely to carry an See http://en.wikipedia.org/wiki/Fisher_transformation or[3] section 4.2.3 for mathematical arguments. -11 a . Correlation matrices -44 b . Z score on diﬀerence -11 c . Inverse-covariance matrices -44 d . Z score on inverse-covariance -44 e . Z score on residuals [114] Figure 5: Inter subject variability. Note that this is variability occur-ring in a healthy population at rest, in other words it is non speciﬁcvariability – a : single-subject correlation matrices for diﬀerent sub-jects – b : Corresponding Z-score (eﬀect / standard deviation) of thediﬀerence between a subject and the remaining others – c : single-subject inverse-covariance matrices – d : Corresponding Z-score forthe inverse-covariance matrices – e : Corresponding Z-score for thesubject residuals, as deﬁned in [114]. eﬀect. Thus, they adapt cluster-level inference to connec-tivity analysis, in order to mitigate the curse of multiplecomparisons. Both the multiple comparison issue and the network-level distributed variability are a plague to edge-level com-parison of connectomes. A possible strategy to circumventthese diﬃculties is to perform comparisons and statisticaltesting at the level of the network, rather than the indi-vidual connection.

Network integration.

Marrelec et al. [69] introduce theuse of entropy and mutual information as a measure ofnetwork-level functional integration . Gaussian entropycan be seen as a simple metric to generalize correlation orvariance to multiple nodes (see [3] § § a , b and c . Their correlationstructure is captured by three correlation coeﬃcients: ρ ab , ρ bc and ρ ac . Summarizing these by their mean, as mightseem natural, discards the relationship between the sig-nals, while using the integration metric, deﬁned as theGaussian entropy, tells us how much two signals can be See [116] for simpliﬁed formulas for network integration and mu-tual information. ntegration: Integration:

Figure 6: Two diﬀerent correlation matrices with the same averagecorrelation, but with very diﬀerent integration values. Indeed, thematrix on the left was chosen to represent three signals a , b and c asdiﬀerent from each other as possible, given ρ ab + ρ bc + ρ ac = .

35; itthus has a small integration value. On the opposite, for the matrixon the right, signal b can almost be fully recovered by combiningsignals a and c ; the matrix thus has a large integration value. combined to form the third (see ﬁg. 6). Cross-entropy –ormutual information– [69] measures the amount of cross-talk between two systems in a similar way as Gaussianentropy is used to measure the integration of a brain sys-tem. The functional-connectivity structure, or its repre-sentation in the form of a correlation matrix, can thus becharacterized via the integration and cross-talk of someof its sub-systems. This approach gives a simpliﬁed rep-resentation with a small number of metrics that can becompared across subjects. Graph-topological metrics.

Functional connectivity graphshave been found to display speciﬁc topological prop-erties that are characteristic of small-world networks[1, 17, 91, 103]. These networks display excellent transportproperties: although they have a relatively small numberof connections, any two regions of the brain are well con-nected. Another interesting consequence of their speciﬁctopology is the resilience it gives the system to attacks suchas resulting from brain lesions [1]. This overall structureof functional-connectivity graphs can be summarized bya few metrics, such as the average path length betweenany two nodes, the local clustering coeﬃcients, or thenode degree centrality [87]. Given that pathologies with-out a localized focus, such as schizophrenia, are thoughtto have a global impact on brain connectivity [5, 65], thegraph-topological metrics are promising markers to per-form inter-subject comparison. Such an approach is ap-pealing as it is not subject to multiple comparison issues.However, it has been criticized as giving a fairly unspe-ciﬁc characterization of the brain and being fragile to noise[54]. Another caveat is that these properties are not spe-ciﬁc to brain function: correlation matrices display small-world properties such as local clustering by construction.Indeed, if two nodes are strongly correlated to a third,they are highly likely to be correlated to each other [123].This observation highlights the need for well deﬁned null-hypothesis [88, 123], but also for controlled recovery ofbrain functional connectivity going beyond empirical cor-relation matrices, as discussed in the previous section. In the neuroscience world, these descriptions are grouped underthe terms of “graph-theoretical approaches”, however graph theoryis an entire division of mathematics and computer science that isconcerned with much more than topology of random graphs.

Predictive modeling is concerned with learning (or ﬁt-ting) a model that is capable of predicting informationfrom unseen data [80]. In the context of connectomes, pre-dictive modeling can extract connectivity-based biomark-ers of disease diagnosis, prognosis, or other phenotypicoutcomes [24, 27]. The accuracy of a predictive modelprovides a measure of the amount of information presentin the connectome about the phenotypic measure beingevaluated [58, 59]. When combined with reproducibility,prediction accuracy provides a metric for evaluating ex-perimental trade-oﬀs for data acquisition, preprocessing,and analysis [60, 106]. Multivariate predictive models areattractive in connectomics because they are sensitive todependencies between features and avoid the need to cor-rect for multiple comparisons since the signiﬁcance of anentire pattern is evaluated using a single statistical test.Additionally, modern predictive modeling techniques drawfrom the statistical learning literature, which speciﬁcallyaddresses high dimensional datasets with few observations.Predictive modeling has been successfully applied to iden-tify connectome-based biomarkers of Alzheimer’s disease[104], depression [24, 125], schizophrenia [18, 94], autism[2], ADHD [127], aging [27], as well as to classify mentaloperations [85, 95]. The growing interest for applying pre-dictive modeling to connectivity analysis was highlightedby the ADHD200 Global Competition, in which the objectwas to identify a connectivity-based biomarker of ADHD[108]. Recent work has illustrated the utility of predictivemodeling for deriving connectivity models at the individ-ual level [22].Technically, predictive modeling is a supervised ma-chine learning problem where a target to be predicted– e.g. age, disease state, cognitive state– is available foreach observation of the data. In the context of comparingconnectomes, features used in the predictive model corre-spond to bivariate measures of connectivity [27, 85, 95],or any of the previously discussed graph summary metrics[18, 29]. The quality of a predictive model is determined byits prediction accuracy (or generalization ability) which ismeasured using one or more iterations of cross-validation.Cross-validation iteratively subdivides available data intoa subset used for training the classiﬁer and a dataset forevaluating classiﬁer performance [80]. The signiﬁcance ofachieved prediction accuracy can be assessed using permu-tation tests [44]. Predictive modeling approaches typicallyrequire the speciﬁcation of several parameters, which maybe chosen based on domain speciﬁc knowledge or require-ments [21], determined using an analytical approach [20],or optimized using a second-level cross validation proce-dure [33]. Several strategies exist for performing cross-validation and thecommonly used approach of using only a single observation for testing(leave-one-out cross-validation) results in highly variable estimatesof prediction accuracy [33]. Alternative approaches such as (5 or 10)-fold cross-validation, or 0 . + bootstrap should be preferred [33].

4. Beyond correlation, eﬀective connectivity?

All the approaches that we have presented in this re-view are based on second-order statistics of the signal,in other words correlation analysis. Traditionally, theseare deﬁned as functional connectivity , deﬁned as “tempo-ral correlations between remote neurophysiological events”[35], and opposed to eﬀective connectivity , i.e. “the in-ﬂuence one neural system exerts over another” [35]. Toconclude this review, we would like to bridge the gap be-tween these concepts, which in our eyes should be seen asa continuum rather than an opposition (this opinion is alsoexpressed in [73]).A ﬁrst step to move from purely descriptive statisticsto interaction models with functional connectivity analysisis to consider a correlation matrix as a Gaussian graphi-cal model, i.e. a well-deﬁned probabilistic model that de- scribes observed correlations in terms of an independencestructure and conditional relations [61, 117]. In such set-tings, the inverse covariance graph or the partial correla-tions are a measure of inﬂuence from one node to another,albeit undirected. Inferring directionality in a Gaussianmodel is impossible. Linear structural equation models(SEMs) [74] rely on a similar model that consists in speci-fying a candidate directed graphical structure. This struc-ture constraints the covariance matrix of the signals andcan thus be tested on observed data. In fact some forms ofSEMs are known as “covariance structure models”. Thereis thus a strong formal link between correlation analysis inthe framework of graphical models and SEMs: the formeris undirected but fully exploratory, as it does not requirethe speciﬁcation of candidate structure, while the latter isdirected but conﬁrmatory. This link has been exploited tospecify candidate structures for SEMs using partial cor-relations [70]. More complex models, such as dynamicalcausal models (DCMs) [38] or Granger causality [43] re-quire additional hypotheses such as non-linear couplingsor time lags.Most importantly, more complex models can only beused to model interactions between a small number ofnodes. This is not only due to a computational diﬃculty,but also to fundamental roadblocks in statistics: the com-plexity of the model must match the richness of the data.While injecting prior information can help model estima-tion, the more informative this prior is, the more fragilethe inference becomes. The ongoing debate on the impactof hemodynamic lag on Granger-causality inference [96]is an example of such fragility. Note that although mostof the theory underpinning correlation analysis (Gaussiangraphical models) is based on a Gaussian assumption, thecore results are robust to violations of this assumption [84].It is tempting to favor more neurobiologically-inspiredmodels that give descriptions close to our knowledge of thebrain basic mechanisms, however, as George Box famouslysaid, “all models are wrong; some models are useful”. De-pending on the question and the data at hand, a trade-oﬀshould be chosen between complex models based on a bio-physical description, and simple phenomenological modelssuch as correlation matrices. In particular to model inter-actions between a large number of regions, as in full-brainanalysis, and learn a large connectome , simple models areto be preferred. For more hypothesis-driven studies, suchas the analysis of the mechanisms underlying a speciﬁctask, more complex models can be preferred, if rich datais available. Automatic choice of model is a diﬃcult prob-lem, however, cross-validation (as used in [25, 105, 116])is a useful tool. The central principle of cross-validation isto test a model on diﬀerent data than the data used to ﬁtthe model. Models too complex for the data available willﬁt noise in the data, and thus generalize poorly. The mainbeneﬁt of cross-validation is that it is a non-parametricmethod which does not rely strongly on modeling assump-8ions .

5. Conclusion

Horwitz el al. [52] claimed almost 20 years ago that“the crucial concept needed for network analysis is covari-ance”. In our eyes, this still holds today. Estimation func-tional connectomes relies largely on ﬁtting covariance mod-els. Their comparison requires understanding how thesecovariances vary and ﬁnding metrics to capture this vari-ability. The additional secret ingredient may be using con-founds regressors in all statistical steps. A good choice ofa small number of relevant regions facilitates connectomecomparison. However, such a choice cannot yet be fullyfactored out via methods and must rely on neuroscientiﬁcexpertise.Methodological challenges to functional-connectome-based group studies arise from the dimensionality andthe variability of the connectome. With the currenttools, inter-subject comparison of connectomes compris-ing many nodes is limited by the diﬃculty of estimatinghigh-dimensional covariance matrices and the loss of sta-tistical power due to multiple comparisons. Better algo-rithms integrating powerful a priori information are re-quired to push the limits of covariance estimation. Bettercharacterization of inter-subject variability of connectomes[56] will help choosing parameterizations and invariants toavoid testing each edge for a diﬀerence, as this strategyinevitably leads to a needle in a haystack problem.Reviewing methodological options to learn and com-pare connectomes highlights that there is currently nounique solution, but a spectrum of related methods andanalytical strategies. More empirical results are requiredto guide the choices. However this diversity is probablyunavoidable: a diﬀuse disease like schizophrenia will notlead to the same connectome modiﬁcations as a focal le-sion. In statistical learning, “no free lunch” theorems [120]tell us that no strategy can perform uniformly better in allsituations. In practice, the key to a successful analysis isto understand well the assumptions and interpretation ofeach option, in order to match the method to the question.Similarly, the idealized notion of an unique functional con-nectome to describe connections in brain function is prob-ably an utopia, and various connectomes should be con-sidered in diﬀerent settings, such as the study of varying This is to be contrasted to Bayesian model comparison, whichwill give well-controlled results only if the true generative model is inthe list of models compared. [36] argues that, based on the Neyman-Pearson lemma, cross-validation is less powerful than likelihood ratiotests using the full dataset. However, it is important to keep in mindthat these approaches only test for self-consistence, as the Neyman-Pearson lemma is established under the hypothesis that the modelused to deﬁne the test is indeed the data-generating process [78],while in practice it is often the case that this model gives poor ﬁts tothe data [66]. Applying test procedures on diﬀerent data than thatused to ﬁt the model, as in cross-validation, is much more resilientto modeling errors. phenotypic conditions, or that of on-going activity versusactivity related to speciﬁc tasks.

Acknowledgments

GV acknowledges funding from the NiConnect grantand the Dynamic Diaschisis project DEQ20100318254from

Fondation pour la Recherche M´edicale , as well asmany insightful discussions with Andreas Kleinschmidt onon-going activity and Bertrand Thirion on statistical dataprocessing. RCC would like to acknowledge support bya NARSAD Young Investigator Grant from the Brain &Behavior Research Foundation. The authors would liketo thank the anonymous reviewers for their suggestions,which improved the manuscript.

ReferencesReferences [1] S. Achard, R. Salvador, B. Whitcher, J. Suckling, E. Bullmore,A resilient, low-frequency, small-world human brain functionalnetwork with highly connected association cortical hubs, JNeurosci 26 (2006) 63.[2] J. Anderson, J. Nielsen, A. Froehlich, M. DuBray, T. Druz-gal, A. Cariello, J. Cooperrider, B. Zielinski, C. Ravichandran,P. Fletcher, et al., Functional connectivity magnetic resonanceimaging classiﬁcation of autism, Brain 134 (2011) 3739.[3] T. Anderson, An introduction to multivariate statistical anal-ysis, Wiley New York, 1958.[4] J. Ashburner, K. Friston, Uniﬁed segmentation, Neuroimage26 (2005) 839.[5] D. Bassett, E. Bullmore, B. Verchinski, V. Mattay, D. Wein-berger, A. Meyer-Lindenberg, Hierarchical organization of hu-man cortical networks in health and schizophrenia, J Neurosci28 (2008) 9239.[6] C.F. Beckmann, S.M. Smith, Probabilistic independent com-ponent analysis for functional magnetic resonance imaging,Trans Med Im 23 (2004) 137.[7] Y. Behzadi, K. Restom, J. Liau, T. Liu, A component basednoise correction method (compcor) for bold and perfusionbased fMRI, Neuroimage 37 (2007) 90.[8] P. Bellec, V. Perlbarg, S. Jbabdi, M. Pelegrini-Issac, J.L. An-ton, J. Doyon, H. Benali, Identiﬁcation of large-scale networksin the brain using fMRI., Neuroimage 29 (2006) 1231.[9] P. Bellec, P. Rosa-Neto, O. Lyttelton, H. Benali, A. Evans,Multi-level bootstrap analysis of stable clusters in resting-statefMRI, NeuroImage 51 (2010) 1126.[10] Y. Benjamini, Y. Hochberg, Controlling the false discoveryrate: a practical and powerful approach to multiple testing, JRoy Stat Soc B (1995) 289.[11] R.M. Birn, J.B. Diamond, M. Smith, P. Bandettini, Separat-ing respiratory-variation-related ﬂuctuations from neuronal-activity-related ﬂuctuations in fMRI, NeuroImage 31 (2006)1536.[12] R.M. Birn, M. Smith, T.B. Jones, P. Bandettini, The respira-tion response function: the temporal dynamics of fMRI signalﬂuctuations related to changes in respiration, NeuroImage 40(2008) 644.[13] B. Biswal, M. Mennes, X. Zuo, S. Gohel, C. Kelly, S. Smith,C. Beckmann, et al., Toward discovery science of human brainfunction, Proc Ntl Acad Sci 107 (2010) 4734.[14] B. Biswal, F. Zerrin Yetkin, V. Haughton, J. Hyde, Functionalconnectivity in the motor cortex of resting human brain usingecho-planar MRI, Magn Reson Med 34 (1995) 53719.[15] T. Blumensath, T.E.J. Behrens, S.M. Smith, Resting-statefMRI single subject cortical parcellation based on region grow-ing, MICCAI (2012) 188.

16] R. Boyacio˘glu, M. Barth, Generalized iNverse imaging (GIN):Ultrafast fMRI with physiological noise correction, Mag ResMed (2012) epub ahead of print.[17] E. Bullmore, O. Sporns, Complex brain networks: graph theo-retical analysis of structural and functional systems, Nat RevNeurosci 10 (2009) 186.[18] G. Cecchi, I. Rish, B. Thyreau, B. Thirion, M. Plaze,M. Paillere-Martinot, C. Martelli, J. Martinot, J. Poline, Dis-criminative network models of schizophrenia, in: Advances inNeural Information Processing Systems, 2009.[19] C. Chang, G. Glover, Eﬀects of model-based physiologicalnoise correction on default mode network anti-correlations andcorrelations, Neuroimage 47 (2009) 1448.[20] V. Cherkassky, Y. Ma, Practical selection of SVM parame-ters and noise estimation for svm regression, Neural Netw. 17(2004) 113.[21] V. Cherkassky, F. Mulier, Learning from Data: Concepts, The-ory, and Methods, John Wiley & Sons, 1998.[22] C. Chu, D.A. Handwerker, P.A. Bandettini, J. Ashburner,Measuring the consistency of global functional connectivityusing kernel regression methods, in: Proceedings of the 2011IEEE International Workshop on Pattern Recognition in Neu-roImaging, p. 41.[23] D. Cordes, V. Haughton, K. Arfanakis, J. Carew, P. Turski,C. Moritz, M. Quigley, M. Meyerand, Frequencies contributingto functional connectivity in the cerebral cortex in “resting-state” data, Am J Neuroradio 22 (2001) 1326.[24] R. Craddock, P. Holtzheimer III, X. Hu, H. Mayberg, Dis-ease state prediction from resting state functional connectivity,Magnetic resonance in Medicine 62 (2009) 1619.[25] R. Craddock, G. James, P. Holtzheimer III, X. Hu, H. May-berg, A whole brain fMRI atlas generated via spatially con-strained spectral clustering, Hum Brain Mapp 33 (2012) 1914.[26] C. Davey, D. Grayden, G. Egan, L. Johnston, Filtering inducescorrelation in fMRI resting state data, NeuroImage 64 (2013)728.[27] N. Dosenbach, B. Nardos, A. Cohen, D. Fair, J. Power,J. Church, S. Nelson, G. Wig, A. Vogel, C. Lessov-Schlaggar,et al., Prediction of individual brain maturity using fmri, Sci-ence 329 (2010) 1358.[28] N.U. Dosenbach, K.M. Visscher, E.D. Palmer, F.M. Miezin,K.K. Wenger, H.C. Kang, E.D. Burgund, A.L. Grimes, B.L.Schlaggar, S.E. Petersen, A core system for the implementationof task sets, Neuron 50 (2006) 799.[29] M. Ekman, J. Derrfuss, M. Tittgemeyer, C. Fiebach, Predict-ing errors from reconﬁguration patterns in human brain net-works, P Natl Acad Sci Usa (2012) epub ahead of print.[30] M. Fox, M. Greicius, Clinical applications of resting state func-tional connectivity, Frontiers in systems neuroscience 4 (2010).[31] M. Fox, A. Snyder, J. Vincent, M. Corbetta, D. Van Essen,M. Raichle, The human brain is intrinsically organized intodynamic, anticorrelated functional networks, Proc Ntl AcadSci 102 (2005) 9673.[32] M. Fox, D. Zhang, A. Snyder, M. Raichle, The global sig-nal and observed anticorrelated resting state brain networks,J Neurophysio 101 (2009) 3270.[33] J. Friedman, T. Hastie, R. Tibshirani, The elements of statis-tical learning, Springer Series in Statistics, 2001.[34] J. Friedman, T. Hastie, R. Tibshirani, Sparse inverse covari-ance estimation with the graphical lasso, Biostatistics 9 (2008)432.[35] K.J. Friston, Functional and eﬀective connectivity in neu-roimaging: a synthesis, Hum Brain Mapp 2 (1994) 56.[36] K.J. Friston, Ten ironic rules for non-statistical reviewers, Neu-roImage 61 (2012) 1300.[37] K.J. Friston, C.D. Frith, P.F. Liddle, R.S.J. Frackowiak,Functional connectivity: the principal-component analysis oflarge (PET) data sets, Journal of cerebral blood ﬂow andmetabolism 13 (1993) 5.[38] K.J. Friston, L. Harrison, W. Penny, Dynamic causal mod-elling., Neuroimage 19 (2003) 1273. [39] K.J. Friston, A.P. Holmes, K.J. Worsley, J.B. Poline, C. Frith,R.S.J. Frackowiak, Statistical parametric maps in functionalimaging: A general linear approach, Hum Brain Mapp (1995)189.[40] K.J. Friston, P. Rotshtein, J.J. Geng, P. Sterzer, R.N. Henson,A critique of functional localisers, Neuroimage 30 (2006) 1077.[41] K.J. Friston, S. Williams, R. Howard, R.S. Frackowiak,R. Turner, Movement-related eﬀects in fMRI time-series, Mag-netic resonance in medicine 35 (1996) 346.[42] Y. Ge, S. Dudoit, T. Speed, Resampling-based multiple testingfor microarray data analysis, Test 12 (2003) 1.[43] R. Goebel, A. Roebroeck, D. Kim, E. Formisano, Investigatingdirected cortical interactions in time-resolved fmri data usingvector autoregressive modeling and granger causality mapping,Magnetic resonance imaging 21 (2003) 1251.[44] P. Golland, B. Fischl, Permutation tests for classiﬁcation: To-wards statistical signiﬁcance in image-based studies., in: IPMI,p. 330.[45] M. Greicius, Resting-state functional connectivity in neuropsy-chiatric disorders, Current opinion in neurology 21 (2008) 424.[46] M. Greicius, B. Krasnow, A. Reiss, V. Menon, Functional con-nectivity in the resting brain: a network analysis of the defaultmode hypothesis, Proc Ntl Acad Sci 100 (2003) 253.[47] M.L. Grillon, C. Oppenheim, G. Varoquaux, F. Charbonneau,A. Devauchelle, M. Krebs, F. Bayle, B. Thirion, C. Huron,Hyperfrontality and hypoconnectivity during refreshing inschizophrenia, Psychiatry Research: Neuroimaging (2012).[48] I. Guyon, A. Elisseeﬀ, An introduction to variable and featureselection, J. Mach. Learn. Res. 3 (2003) 1157.[49] P. Hagmann, L. Cammoun, X. Gigandet, R. Meuli, C.J. Honey,V.J. Wedeen, O. Sporns, Mapping the structural core of humancerebral cortex, PLoS Biol 6 (2008) e159.[50] J. Honorio, D. Samaras, Simultaneous and group-sparse multi-task learning of gaussian graphical models, arXiv:1207.4255(2012).[51] B. Horwitz, The elusive concept of brain connectivity, Neu-roImage 19 (2003) 466.[52] B. Horwitz, A. McIntosh, J. Haxby, C. Grady, Network analy-sis of brain cognitive function using metabolic and blood ﬂowdata, Behavioural brain research 66 (1995) 187.[53] X. Hu, T.H. Le, T. Parrish, P. Erhard, Retrospective estima-tion and correction of physiological ﬂuctuation in functionalMRI, Magn Reson Med 34 (1995) 201.[54] A. Ioannides, Dynamic functional connectivity, Current opin-ion in neurobiology 17 (2007) 161.[55] H.J. Jo, Z.S. Saad, W.K. Simmons, L.a. Milbury, R.W. Cox,Mapping sources of correlation in resting state FMRI, withartifact detection and removal, NeuroImage 52 (2010) 571.[56] C. Kelly, B. Biswal, R. Craddock, F. Castellanos, M. Mil-ham, Characterizing variation in the functional connectome:promise and pitfalls, Trends in cognitive sciences 16 (2012)181.[57] V. Kiviniemi, T. Starck, J. Remes, X. Long, J. Nikkinen,M. Haapea, J. Veijola, et al., Functional segmentation of thebrain cortex using high model order group PICA., Hum BrainMap 30 (2009) 3865.[58] U. Kjems, L. Hansen, J. Anderson, S. Frutiger, S. Muley,J. Sidtis, D. Rottenberg, S. Strother, The quantitative eval-uation of functional neuroimaging experiments: Mutual infor-mation learning curves, NeuroImage 15 (2002) 772.[59] N. Kriegeskorte, R. Goebel, P. Bandettini, Information-basedfunctional brain mapping, Proc Ntl Acad Sci 103 (2006) 3863.[60] S. LaConte, J. Anderson, S. Muley, J. Ashe, S. Frutiger,K. Rehm, L. Hansen, E. Yacoub, X. Hu, D. Rottenberg, Theevaluation of preprocessing choices in single-subject bold fMRIusing npairs performance metrics, NeuroImage 18 (2003) 10.[61] S. Lauritzen, Graphical models, Oxford University Press, USA,1996.[62] O. Ledoit, M. Wolf, A well-conditioned estimator for large-dimensional covariance matrices, J. Multivar. Anal. 88 (2004)365.

63] L. Lee, L.M. Harrison, A. Mechelli, et al., A report of the func-tional connectivity workshop, dusseldorf 2002, Neuroimage 19(2003) 457.[64] C. Lewis, A. Baldassarre, G. Committeri, G. Romani, M. Cor-betta, Learning sculpts the spontaneous activity of the restinghuman brain, Proc Ntl Acad Sci 106 (2009) 17558.[65] Y. Liu, M. Liang, Y. Zhou, Y. He, Y. Hao, M. Song, C. Yu,H. Liu, Z. Liu, T. Jiang, Disrupted small-world networks inschizophrenia, Brain 131 (2008) 945.[66] G. Lohmann, K. Erfurth, K. M¨uller, R. Turner, Critical com-ments on dynamic causal modelling, Neuroimage 59 (2012)2322.[67] T.E. Lund, K.H. Madsen, K. Sidaros, W.L. Luo, T.E. Nichols,Non-white noise in fMRI: does modelling have an impact?,NeuroImage 29 (2006) 54.[68] G. Marrelec, P. Bellec, H. Benali, Exploring large-scale brainnetworks in functional MRI, J Physio Paris 100 (2006) 171.[69] G. Marrelec, P. Bellec, A. Krainik, H. Duﬀau, M. P´el´egrini-Issac, S. Leh´ericy, H. Benali, J. Doyon, Regions, systems, andthe brain: hierarchical measures of functional integration infMRI, Medical Image Analysis 12 (2008) 484.[70] G. Marrelec, H. Horwitz, J. Kim, M. P´el´egrini-Issac, H. Benali,J. Doyon, Using partial correlation to enhance structural equa-tion modeling of functional MRI data., Magn Reson Imaging25 (2007) 1181.[71] G. Marrelec, A. Krainik, H. Duﬀau, M. P´el´egrini-Issac,S. Leh´ericy, J. Doyon, H. Benali, Partial correlation for func-tional brain interactivity investigation in functional MRI, Neu-roimage 32 (2006) 228.[72] A. McIntosh, Towards a network theory of cognition, NeuralNetworks 13 (2000) 861.[73] A. McIntosh, Moving between functional and eﬀective connec-tivity, in: O. Sporns (Ed.), Analysis and Function of Large-Scale Brain Networks, Society for Neuroscience, 2010, p. 15.[74] A. McIntosh, F. Gonzalez-Lima, Structural equation model-ing and its application to network analysis in functional brainimaging, Hum Brain Map 2 (1994) 2.[75] M. Mennes, C. Kelly, X.N. Zuo, A. Di Martino, B.B. Biswal,F.X. Castellanos, M.P. Milham, Inter-individual diﬀerencesin resting-state functional connectivity predict task-inducedBOLD activity, Neuroimage 50 (2010) 1690.[76] J. Mumford, B. Turner, F. Ashby, R. Poldrack, Deconvolvingbold activation in event-related designs for multivoxel patternclassiﬁcation analyses, NeuroImage 59 (2012) 2636.[77] K. Murphy, R. Birn, D. Handwerker, J. T.B., P. Bandettini,The impact of global signal regression on resting state corre-lations: are anti-correlated networks introduced?, NeuroImage44 (2009) 893.[78] J. Neyman, E. Pearson, On the problem of the most eﬃcienttests of statistical hypotheses, Philosophical Transactions ofthe Royal Society of London. Series A 231 (1933) 289.[79] T. Nichols, A. Holmes, Nonparametric permutation tests forfunctional neuroimaging: a primer with examples, Hum brainmap 15 (2001) 1.[80] F. Pereira, T. Mitchell, M. Botvinick, Machine learning clas-siﬁers and fmri: a tutorial overview, Neuroimage 45 (2009)S199.[81] R. Poldrack, J. Mumford, T. Nichols, Handbook of functionalMRI data analysis, Cambridge University Press, 2011.[82] J. Power, K. Barnes, A. Snyder, B. Schlaggar, S. Petersen,Spurious but systematic correlations in functional connectivitymri networks arise from subject motion, Neuroimage 59 (2011)2142.[83] M. Raichle, Two views of brain function, Trends in cognitivesciences 14 (2010) 180.[84] P. Ravikumar, M. Wainwright, G. Raskutti, B. Yu, High-dimensional covariance estimation by minimizing 1-penalizedlog-determinant divergence, Elec J Stat 5 (2011) 935.[85] J. Richiardi, H. Eryilmaz, S. Schwartz, P. Vuilleumier, D. VanDe Ville, Decoding brain states from fMRI connectivity graphs,NeuroImage 56 (2011) 616. [86] J. Rissman, A. Gazzaley, M. D’Esposito, Measuring functionalconnectivity during distinct stages of a cognitive task, Neu-roimage 23 (2004) 752.[87] M. Rubinov, O. Sporns, Complex network measures of brainconnectivity: uses and interpretations, Neuroimage 52 (2010)1059.[88] M. Rubinov, O. Sporns, Weight-conserving characterizationof complex functional brain networks, Neuroimage 56 (2011)2068.[89] S. Ryali, T. Chen, K. Supekar, V. Menon, Estimation of func-tional connectivity in fMRI data using stability selection-basedsparse partial correlation with elastic net penalty, Neuroimage59 (2012) 3852.[90] Z.S. Saad, S.J. Gotts, K. Murphy, G. Chen, H.J. Jo, A. Mar-tin, R.W. Cox, Trouble at rest: how correlation patterns andgroup diﬀerences become distorted after global signal regres-sion, Brain Connect 2 (2012) 25–32.[91] R. Salvador, J. Suckling, M. Coleman, Neurophysiological ar-chitecture of functional magnetic resonance images of humanbrain, Cerebral Cortex 15 (2005) 1332.[92] T. Satterthwaite, M. Elliott, R. Gerraty, K. Ruparel, J. Loug-head, M. Calkins, S. Eickhoﬀ, et al., An improved frameworkfor confound regression and ﬁltering for control of motion ar-tifact in the preprocessing of resting-state functional connec-tivity data, NeuroImage 64 (2013).[93] J. Schoﬀelen, J. Gross, Source connectivity analysis with MEGand EEG, Hum brain map 30 (2009) 1857.[94] H. Shen, L. Wang, Y. Liu, D. Hu, Discriminative analysisof resting-state functional connectivity patterns of schizophre-nia using low dimensional embedding of fMRI, Neuroimage 49(2010) 3110.[95] W. Shirer, S. Ryali, E. Rykhlevskaia, V. Menon, M. Greicius,Decoding subject-driven cognitive states with whole-brain con-nectivity patterns, Cerebral Cortex 22 (2012) 158.[96] S. Smith, P. Bandettini, K. Miller, T. Behrens, K. Friston,O. David, T. Liu, M. Woolrich, T. Nichols, The danger ofsystematic bias in group-level FMRI-lag-based causality esti-mation, Neuroimage 59 (2012) 1228.[97] S. Smith, P. Fox, K. Miller, D. Glahn, P. Fox, C. Mackay, et al.,Correspondence of the brain’s functional architecture duringactivation and rest, Proc Natl Acad Sci 106 (2009) 13040.[98] S. Smith, M. Jenkinson, M. Woolrich, C. Beckmann,T. Behrens, H. Johansen-Berg, P. Bannister, M.D. Luca,I. Drobnjak, D. Flitney, R. Niazy, J. Saunders, J. Vickers,Y. Zhang, N.D. Stefano, J. Brady, P. Matthews, Advances infunctional and structural MR image analysis and implementa-tion as FSL, NeuroImage 23 (2004) 208.[99] S. Smith, K. Miller, S. Moeller, J. Xu, E. Auerbach, M. Wool-rich, C. Beckmann, M. Jenkinson, J. Andersson, M. Glasser,et al., Temporally-independent functional modes of sponta-neous brain activity, Proc Ntl Acad Sci 109 (2012) 3131.[100] S. Smith, K. Miller, G. Salimi-Khorshidi, M. Webster, C. Beck-mann, T. Nichols, J. Ramsey, M. Woolrich, Network modellingmethods for fMRI, Neuroimage 54 (2011) 875.[101] O. Sporns, D. Chialvo, M. Kaiser, C. Hilgetag, Organization,development and function of complex brain networks, Trendsin Cognitive Sciences 8 (2004) 418.[102] O. Sporns, G. Tononi, R. Kotter, The human connectome: astructural description of the human brain, PLoS Comput Biol1 (2005) e42.[103] C. Stam, Functional connectivity patterns of human mag-netoencephalographic recordings: a “small-world” network?,Neuroscience letters 355 (2004) 25.[104] C. Stonnington, C. Chu, S. Kl¨oppel, C. Jack Jr, J. Ashburner,R. Frackowiak, et al., Predicting clinical scores from magneticresonance scans in alzheimer’s disease, Neuroimage 51 (2010)1405.[105] S. Strother, Evaluating fMRI preprocessing pipelines, Engi-neering in Medicine and Biology Magazine, IEEE 25 (2006)27.[106] S.C. Strother, J. Anderson, L.K. Hansen, U. Kjems, R. Kus- ra, J. Sidtis, S. Frutiger, S. Muley, S. LaConte, D. Rottenberg,The quantitative evaluation of functional neuroimaging exper-iments: the NPAIRS data analysis framework, Neuroimage 15(2002) 747.[107] J. Talairach, P. Tournoux, Co-planar Stereotaxic Atlas of theHuman Brain: 3-dimensional Proportional System, ThiemeClassics, Thieme Medical Pub, 1988.[108] The ADHD-200 Consortium, The ADHD-200 consortium: Amodel to advance the translational potential of neuroimagingin clinical neuroscience, Front Syst Neurosci 6 (2012) 62.[109] B. Thirion, G. Flandin, P. Pinel, A. Roche, P. Ciuciu, J.B.Poline, Dealing with the shortcomings of spatial normalization:Multi-subject parcellation of fMRI datasets, Hum brain map27 (2006) 678.[110] G. Tononi, O. Sporns, G. Edelman, Reentry and the problemof integrating multiple cortical areas: simulation of dynamicintegration in the visual system, Cerebral Cortex 2 (1992) 310.[111] N. Tzourio-Mazoyer, B. Landeau, D. Papathanassiou, F. Criv-ello, O. Etard, N. Delcroix, B. Mazoyer, M. Joliot, Automatedanatomical labeling of activations in SPM using a macroscopicanatomical parcellation of the MNI MRI single-subject brain.,Neuroimage 15 (2002) 273.[112] K. Van Dijk, M. Sabuncu, R. Buckner, The inﬂuence of headmotion on intrinsic functional connectivity MRI, Neuroimage59 (2012) 431.[113] E. Van Oort, D. Norris, S. Smith, C. Beck-mann, Resting state networks are character-ized by high frequency BOLD ﬂuctuations, https://ww4.aievolution.com/hbm1201/index.cfm?do=abs.viewAbs&abs=6235 ,2012.[114] G. Varoquaux, F. Baronnet, A. Kleinschmidt, P. Fillard,B. Thirion, Detection of brain functional-connectivity diﬀer-ence in post-stroke patients using group-level covariance mod-eling, in: MICCAI, 2010.[115] G. Varoquaux, A. Gramfort, F. Pedregosa, V. Michel,B. Thirion, Multi-subject dictionary learning to segment anatlas of brain spontaneous activity, in: Inf Proc Med Imag, p.562.[116] G. Varoquaux, A. Gramfort, J.B. Poline, B. Thirion, Braincovariance selection: better individual functional connectivitymodels using population prior, in: NIPS, 2010.[117] G. Varoquaux, A. Gramfort, J.B. Poline, B. Thirion, Markovmodels for fMRI correlation structure: is brain functionalconnectivity small world, or decomposable into networks?, JPhysio Paris 106 (2012) 212.[118] G. Varoquaux, A. Gramfort, B. Thirion, Small-sample brainmapping: sparse recovery on spatially correlated designs withrandomization and clustering, ICML (2006).[119] J. Wang, L. Wang, Y. Zang, H. Yang, H. Tang, Parcellation-dependent small-world brain functional networks: A resting-state fMRI study, Hum Brain Mapp 30 (2009) 1511.[120] D. Wolpert, The lack of a priori distinctions between learningalgorithms, Neural Computation 8 (1996) 1341.[121] B. Yeo, F. Krienen, J. Sepulcre, M. Sabuncu, et al., The or-ganization of the human cerebral cortex estimated by intrinsicfunctional connectivity, J Neurophysio 106 (2011) 1125.[122] A. Zalesky, A. Fornito, E. Bullmore, Network-based statis-tic: Identifying diﬀerences in brain networks, NeuroImage 53(2010) 1197.[123] A. Zalesky, A. Fornito, E. Bullmore, On the use of correlationas a measure of network connectivity, NeuroImage 60 (2012)2096.[124] A. Zalesky, A. Fornito, I.H. Harding, L. Cocchi, M. Y¨ucel,C. Pantelis, E.T. Bullmore, Whole-brain anatomical networks:does the choice of nodes matter?, Neuroimage 50 (2010) 970.[125] L.L. Zeng, H. Shen, L. Liu, L. Wang, B. Li, P. Fang, Z. Zhou,Y. Li, D. Hu, Identifying major depression using whole-brainfunctional connectivity: a multivariate pattern analysis, Brain135 (2012) 1498.[126] Y. Zhang, M. Brady, S. Smith, Segmentation of brain MRimages through a hidden Markov random ﬁeld model and the expectation-maximization algorithm, Trans Med Imag 20(2001) 45.[127] C. Zhu, Y. Zang, Q. Cao, C. Yan, Y. He, T. Jiang, M. Sui,Y. Wang, Fisher discriminative analysis of resting-state brainfunction for attention-deﬁcit/hyperactivity disorder, Neuroim-age 40 (2008) 110.[128] X.N. Zuo, Mean or SVD? A test-retest reliability perspectiveon seed timeseries generation in RSFC., Technical Report, In-stitute for Pediatric Neuroscience at NYU Child Study Center,New York University School of Medicine, NY, USA, 2010.,2012.[114] G. Varoquaux, F. Baronnet, A. Kleinschmidt, P. Fillard,B. Thirion, Detection of brain functional-connectivity diﬀer-ence in post-stroke patients using group-level covariance mod-eling, in: MICCAI, 2010.[115] G. Varoquaux, A. Gramfort, F. Pedregosa, V. Michel,B. Thirion, Multi-subject dictionary learning to segment anatlas of brain spontaneous activity, in: Inf Proc Med Imag, p.562.[116] G. Varoquaux, A. Gramfort, J.B. Poline, B. Thirion, Braincovariance selection: better individual functional connectivitymodels using population prior, in: NIPS, 2010.[117] G. Varoquaux, A. Gramfort, J.B. Poline, B. Thirion, Markovmodels for fMRI correlation structure: is brain functionalconnectivity small world, or decomposable into networks?, JPhysio Paris 106 (2012) 212.[118] G. Varoquaux, A. Gramfort, B. Thirion, Small-sample brainmapping: sparse recovery on spatially correlated designs withrandomization and clustering, ICML (2006).[119] J. Wang, L. Wang, Y. Zang, H. Yang, H. Tang, Parcellation-dependent small-world brain functional networks: A resting-state fMRI study, Hum Brain Mapp 30 (2009) 1511.[120] D. Wolpert, The lack of a priori distinctions between learningalgorithms, Neural Computation 8 (1996) 1341.[121] B. Yeo, F. Krienen, J. Sepulcre, M. Sabuncu, et al., The or-ganization of the human cerebral cortex estimated by intrinsicfunctional connectivity, J Neurophysio 106 (2011) 1125.[122] A. Zalesky, A. Fornito, E. Bullmore, Network-based statis-tic: Identifying diﬀerences in brain networks, NeuroImage 53(2010) 1197.[123] A. Zalesky, A. Fornito, E. Bullmore, On the use of correlationas a measure of network connectivity, NeuroImage 60 (2012)2096.[124] A. Zalesky, A. Fornito, I.H. Harding, L. Cocchi, M. Y¨ucel,C. Pantelis, E.T. Bullmore, Whole-brain anatomical networks:does the choice of nodes matter?, Neuroimage 50 (2010) 970.[125] L.L. Zeng, H. Shen, L. Liu, L. Wang, B. Li, P. Fang, Z. Zhou,Y. Li, D. Hu, Identifying major depression using whole-brainfunctional connectivity: a multivariate pattern analysis, Brain135 (2012) 1498.[126] Y. Zhang, M. Brady, S. Smith, Segmentation of brain MRimages through a hidden Markov random ﬁeld model and the expectation-maximization algorithm, Trans Med Imag 20(2001) 45.[127] C. Zhu, Y. Zang, Q. Cao, C. Yan, Y. He, T. Jiang, M. Sui,Y. Wang, Fisher discriminative analysis of resting-state brainfunction for attention-deﬁcit/hyperactivity disorder, Neuroim-age 40 (2008) 110.[128] X.N. Zuo, Mean or SVD? A test-retest reliability perspectiveon seed timeseries generation in RSFC., Technical Report, In-stitute for Pediatric Neuroscience at NYU Child Study Center,New York University School of Medicine, NY, USA, 2010.