GEFF: Graph Embedding for Functional Fingerprinting
Kausar Abbas, Enrico Amico, Diana Otero Svaldi, Uttara Tipnis, Duy Anh Duong-Tran, Mintao Liu, Meenusree Rajapandian, Jaroslaw Harezlak, Beau M. Ances, Joaquín Goñi
Purdue Institute for Integrative Neuroscience, Purdue University, West Lafayette, IN, USA; School of Industrial Engineering, Purdue University, West Lafayette, IN, USA; Indiana University School of Medicine, Indiana University, Indianapolis, IN, USA; Indiana Alzheimer Disease Center, Indiana University, Indianapolis, IN, USA; Department of Epidemiology and Biostatistics, Indiana University, IN, USA; Washington University School of Medicine, Washington University, St Louis, MO, USA; Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN, USA
ABSTRACT
It has been well established that Functional Connectomes (FCs), as estimated from functional MRI (fMRI) data, have an individual fingerprint that can be used to identify an individual from a population (subject-identification). Although the identification rate is high when using resting-state FCs, other tasks show moderate to low values. Furthermore, the identification rate is task-dependent, and is low when distinct cognitive states, as captured by different fMRI tasks, are compared. Here we propose an embedding framework, GEFF (Graph Embedding for Functional Fingerprinting), based on group-level decomposition of FCs into eigenvectors. GEFF creates an eigenspace representation of a group of subjects using one or more task FCs (Learning Stage). In the Identification Stage, we compare new instances of FCs from the Learning subjects within this eigenspace (validation dataset). The validation dataset contains FCs either from the same tasks as the Learning dataset or from the remaining tasks that were not included in Learning. Assessment of validation FCs within the eigenspace results in significantly increased subject-identification rates for all fMRI tasks tested and a potentially task-independent fingerprinting process. It is noteworthy that combining resting-state with one fMRI task for the GEFF Learning Stage covers most of the cognitive space for subject identification. Thus, while designing an experiment, one could choose a task fMRI to ask a specific question and combine it with resting-state fMRI to extract maximum subject differentiability using GEFF. In addition to subject-identification, GEFF was also used for identification of cognitive states, i.e. to identify the task associated with a given FC, regardless of whether the subject was already in the Learning dataset or not (subject-independent task-identification). Finally, we show that eigenvectors from the Learning Stage can be characterized as task-dominant, subject-dominant or neither, using two-way ANOVA of their corresponding loadings, providing deeper insight into the extent of variance in functional connectivity across individuals and cognitive states.

INTRODUCTION
To date, most studies using fMRI rely on group-level analysis where data is averaged over subjects within groups, potentially ignoring any intra-group individual variability. However, improved acquisition parameters and the increased availability of large datasets with open data policies have generated opportunities for the development of subject-level biomarkers from fMRI, thus opening the possibility of personalized medicine for neurological and psychiatric disorders. As clinically useful subject-level biomarkers must have high inter-subject differentiability, also known as subject fingerprint, recent efforts have gone into capturing and improving individual variability in biomarkers based on functional connectivity in fMRI data. Subject- and task-specific signatures have also been found using whole-brain effective connectivity and dynamic functional connectivity. Whole-brain functional connectivity patterns are showing increasing promise as subject-level biomarkers that can be estimated from fMRI data. These patterns can be summarized in the form of a full symmetric correlation matrix denominated the Functional Connectome (FC). The development of the FC has given birth to the field of brain functional connectomics, which has been extensively used to study brain connectivity across a wide range of brain disorders. Recently, it has been shown that FCs have a recurrent and reproducible individual fingerprint that can be used to identify an individual from a population of FCs. We refer to this process as subject-identification (SI). Using data from the Human Connectome Project (HCP), individual fingerprints have been shown to exist in all eight different tasks (resting-state (RS); emotion (EM); gambling (GAM); language (LAN); motor (MOT); relational (REL); social (SOC) and working memory (WM)), but, apart from resting-state, the SI accuracy was moderate to low.
Following the discovery of a fingerprint in FCs, Amico and Goñi introduced the “Identifiability Framework (𝐼𝑓)”, which improved the SI accuracy for all eight tasks from the HCP dataset. Using group-level Principal Component Analysis (PCA) decomposition of FCs, the framework works as a denoising procedure that uncovers latent fingerprints: noisy principal components are identified (and removed) by maximizing differential identifiability (the similarity of an individual’s FC across two sessions, relative to its similarity to the rest of the population). This denoising based on maximizing differential identifiability not only improves SI accuracy, but also the capacity to predict fluid intelligence from FCs. This framework has been tested to improve individual fingerprints for different scanning lengths, across scanners, with and without global signal regression, and across network properties. An extension of this framework has also been used to assess disease progression. Although promising, the existing frameworks used for subject-identification are not task-independent, meaning that an FC from one task cannot be used to identify an individual from a population of FCs from another task, even with moderate accuracy rates. Even though the differential identifiability framework improves the SI accuracy for each individual task, it does not make the SI process any more task-independent. This could be the result of the differential identifiability framework trying to make FCs within tasks as similar as possible, thus potentially removing components which could help with identification across tasks.
In addition to a subject fingerprint, functional connectivity patterns, and in turn FCs, have also been shown to vary depending on the cognitive state of an individual (i.e. task-fingerprinting). Thus, task-identification (TI), the ability to identify the task associated with a given FC from a population of reference FCs that includes a collection of tasks, has also become a key goal in the field of brain connectomics. Task-identification frameworks have been proposed by Xie et al., Pallarés et al. and, more recently, Wang et al., using dynamic functional connectivity, effective connectivity, and deep learning, respectively. Although useful, these frameworks present some challenges. While effective connectivity showed improved identification performance with respect to functional connectivity, it requires not only functional connectivity but also structural connectivity and a mathematical model of cortical dynamics with its corresponding parameters. Dynamic functional connectivity (dFC) suffers from a subjective and data-dependent choice of window length. Deep learning frameworks, although effective in some cases, are black boxes and difficult to generalize to new datasets. In contrast, static functional connectivity is easier to compute and is widely used in the network neuroscience community. Existing TI frameworks are either subject-dependent or can only perform task-fingerprinting at the group level, after removing the subject-specific fingerprints (specific independent components) from the data. Thus, the field still lacks a framework that can perform task-identification on functional connectivity while preserving the individual-level variability necessary for personalized medicine. Both subject and task identification can be thought of as object recognition problems. Eigenspace embedding is a common technique used in object recognition, detection, and tracking due to its simplicity and effectiveness.
Essentially, high-dimensional training images are used to create a low-dimensional eigenspace. Then, both training images and target objects are projected into this low-dimensional eigenspace, and distances are computed between target and training images to detect and/or track certain objects. A number of techniques based on this basic principle have been developed to detect and recognize human faces, recognize 3D objects and estimate their pose, and identify partially occluded objects and estimate their pose. In short, it is a low-cost (in terms of memory space and processing time) and computationally efficient image recognition method. In this study, we propose a framework based on eigenspace embedding for functional connectome fingerprinting (GEFF). Instead of images, whole-brain functional connectomes (FCs) are embedded into a low-dimensional eigenspace and classified based on subjects or tasks. Our aim is to achieve four major goals: (i) increase the SI accuracy, (ii) make the SI process potentially task-independent, (iii) perform the TI process with high accuracy, and (iv) make the TI process subject-independent, while preserving individual-level variability in FCs. In essence, we introduce a fingerprinting framework that, given an FC for a particular individual performing a particular task, is able to identify the subject and/or task with high accuracy.

METHODS
Dataset
The fMRI dataset used in this study is from the publicly available Human Connectome Project (HCP). Per HCP protocol, written informed consent was obtained from all subjects by the HCP Consortium. Full description of the acquisition protocol and processing steps is given below.
HCP: Functional Data
We assessed the 100 unrelated subjects (54 females, 46 males, mean age = 29.1 ± 3.7 years) from the HCP 900-subject data release. This subset of subjects was chosen from the overall dataset to ensure that no two subjects are family relatives. The criterion to exclude family relatives was crucial to avoid confounding effects in our analyses due to family-structure co-variables. The resting-state fMRI scans were acquired on two different days, with two sessions each, with two different acquisitions (left to right or LR, and right to left or RL). The seven fMRI tasks were: emotion, gambling, language, motor, relational, social, and working memory. The gambling, motor and working memory tasks were acquired on the first day, and the emotion, language, relational and social tasks were acquired on the second day. The HCP scanning protocol was approved by the local Institutional Review Board at Washington University in St. Louis. For resting-state fMRI, only the two sessions from day one were used in this study. Full details on the HCP dataset have been published previously.

Brain Atlas
A multi-modal parcellation of the human cerebral cortex, with 180 brain regions in each hemisphere (360 total), was used in this work. For completeness, 14 subcortical regions were added, as provided by the HCP release (filename Atlas_ROI2.nii.gz). To do so, this file was converted from NIFTI to CIFTI format using the HCP workbench software.

HCP Preprocessing: Functional Data
The data processed using the ‘minimal’ preprocessing pipeline from the HCP was employed in this work. This pipeline included artifact removal, motion correction, and registration to standard space. Full details on this pipeline can be found in earlier publications. The main steps were: spatial (minimal) preprocessing, in both volumetric and grayordinate space (i.e. where brain regions are mapped onto the native mesh cortical surface); slice-timing correction; minimal high-pass temporal filtering (using the -bptf option in FSL’s fslmaths tool; full width at half maximum), applied to both volumetric and grayordinate forms, effectively removing linear trends in the data (no low-pass filtering was applied in this ‘minimal’ HCP pipeline); MELODIC ICA applied to volumetric data; and the use of FIX to identify and remove artifact components. Artifact- and motion-related time courses (i.e. the six rigid-body parameter time series, their backwards-looking first differences, and the squares of all 12 resulting regressors) were regressed out of both volumetric and grayordinate data. We added the following steps to the ‘minimal’ HCP processing pipeline. For resting-state fMRI data: (i) we regressed out the global gray-matter signal from the voxel time courses, (ii) we applied a band-pass first-order Butterworth filter in forward and reverse directions (0.001 Hz to 0.08 Hz; MATLAB functions butter and filtfilt), and (iii) the voxel time courses were z-scored and then averaged per brain region, excluding any outlier time points that were more than 3 standard deviations from the mean (workbench software, command -cifti-parcellate). For task fMRI data, we applied the same steps, but a more liberal frequency range was adopted for the band-pass filter (0.001 Hz to 0.25 Hz), since the connection between different tasks and optimal frequency ranges is still unclear.
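Step (ii) above can be sketched in Python with SciPy as an analogue of the MATLAB butter/filtfilt calls. This is a minimal sketch, not the paper's code: the function name is ours, and the time-series dimensions below are synthetic (374 regions reflects the 360 cortical + 14 subcortical parcellation; the sampling rate assumes the HCP TR of 0.72 s).

```python
import numpy as np
from scipy.signal import butter, filtfilt

def bandpass_timecourses(ts, fs, low=0.001, high=0.08, order=1):
    """Zero-phase first-order Butterworth band-pass, applied forward and
    reverse (filtfilt), independently per region (column of ts)."""
    nyq = fs / 2.0
    b, a = butter(order, [low / nyq, high / nyq], btype="bandpass")
    return filtfilt(b, a, ts, axis=0)

fs = 1.0 / 0.72                          # HCP TR = 0.72 s -> ~1.39 Hz sampling
ts = np.random.randn(1200, 374)          # synthetic time points x regions
filtered = bandpass_timecourses(ts, fs)  # resting-state range 0.001-0.08 Hz
# for task fMRI, the paper uses the wider range: high=0.25
```

For task fMRI one would pass `high=0.25`, matching the more liberal band described above.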
Estimating Individual Functional Connectomes

Pearson correlation between the time courses of all possible brain-region pairs (MATLAB command corr) results in a symmetric correlation matrix for each fMRI session of each subject. In this paper we refer to this object as the Functional Connectome (FC). Each task has two sessions: one with left-to-right (LR) and the other with right-to-left (RL) phase-encoding. To avoid any session bias, for each task separately, FCs were chosen randomly from the LR and RL sessions such that we had an equal number of FCs from the two sessions. Finally, the resulting individual FCs were ordered according to the seven so-called ‘Yeo’ Functional Networks (FNs), as proposed by Yeo and colleagues. For completeness, an eighth FN comprising the 14 HCP subcortical regions was added (as analogously done in recent papers). This reordering was done for visualization purposes only, so that any visualization of FCs or FC-related objects would be somewhat visually interpretable.

Mathematical Notations

In this section, we establish a few mathematical notations that are used throughout the paper. A scalar is an italicized letter, e.g. a. A vector is denoted by a bold italicized letter, e.g. 𝒂, and is a column vector by default unless otherwise specified. A matrix is denoted by a capitalized italicized bold letter, e.g. 𝑨. For any given vector 𝒂, the average of its entries is denoted by ⟨𝒂⟩, while its norm or magnitude is denoted by ‖𝒂‖. If 𝑟 ∈ [𝑞], it means that 𝑟 takes integer values from 1 up to 𝑞, where 𝑞 is a positive integer. Finally, if each of the N samples of a set S has a class label from the set of class labels [𝑞] = {1, 2, …, 𝑞}, then the label of the i-th sample is denoted by s_i ∈ [𝑞], and the vector of all N labels is an element of [𝑞]^N.

GEFF: A framework for Graph Embedding for Functional Fingerprinting
The GEFF framework consists of two stages: Learning and Identification. In the Learning stage, we compute an eigenspace representation of each learning FC using group-level Principal Component Analysis (PCA) decomposition. In the Identification stage, we first compute average representations (centroids) of each underlying class in the learning dataset. Then, using the eigenvectors computed in the learning stage, we project each validation FC into the eigenspace and identify it by matching it with one of the class centroids (Figure 1). It should be noted that GEFF is somewhat similar in its setup to the “Identifiability Framework (𝐼𝑓)” proposed by Amico and Goñi, but there are key differences. First, there is no reconstruction in GEFF, and all the processing takes place in the eigenspace. In addition, as opposed to the 𝐼𝑓, GEFF does not require two runs (test/retest FCs) of the same subject in its setup. The two stages of GEFF are described in detail below.

Learning Stage: Eigenspace Embedding
An FC is an 𝑚 × 𝑚 symmetric correlation matrix (𝑚 is the number of brain regions in the parcellation), and hence can be vectorized into an 𝑀 = 𝑚(𝑚 − 1)/2 dimensional vector by taking the upper triangular part of the matrix (excluding the main diagonal). Analogously to Amico and Goñi, we vectorized all the learning FCs and organized them into a matrix 𝑿 = [x_1, x_2, …, x_N], where x_i is an 𝑀-dimensional vectorized learning FC (𝑖 ∈ [𝑁]), and 𝑁 is the number of learning FCs. To construct an eigenspace, we create a PCA decomposition of the input matrix 𝑿 (MATLAB command pca) to extract the eigenvectors and the representations (projections) of the x_i vectors in(to) the eigenspace. Analytically, eigenvectors are obtained by solving the following equation:

X̄ X̄^T u_i = λ_i u_i

where X̄ = [x_1 − ⟨x_1⟩, x_2 − ⟨x_2⟩, …, x_N − ⟨x_N⟩], and u_i represents an 𝑀-dimensional eigenvector of the X̄ X̄^T covariance matrix, with corresponding eigenvalue λ_i. The eigenvectors 𝑼 = [u_1, …, u_N] are arranged in descending order of their eigenvalues, which is equivalent to descending order of their explained variance. For any value of 𝑘 ≤ 𝑁, the 𝑀-dimensional vectorized FC x_i can be projected into the eigenspace using the following equation:

y_ik = [u_1, …, u_k]^T x̄_i

where x̄_i = x_i − ⟨x_i⟩, and y_ik is the 𝑘-dimensional representation of x_i in the eigenspace. Using this procedure, we obtained 𝑘-dimensional representations for all learning FCs, for 𝑘 = 1, 2, …, 𝑁.

Figure 1: GEFF, the identification framework. GEFF consists of two stages: Learning and Identification. During the Learning Stage, all learning FCs are vectorized, organized together (a) and then projected into the eigenspace using PCA (b). During the Identification Stage, we compute average representations (centroids) of each underlying class in the learning dataset (c). Then each validation FC is projected into the eigenspace using eigenvectors from the Learning Stage (d) and is identified by matching its projection with one of the class centroids (c).

Identification Stage: Nearest Centroid Classifier
The identification process is essentially a multi-class classification problem where the objective is to assign an FC in the validation data to one of the classes in the learning data. In this work, we used nearest centroid classification, with the idea that an average representation of a class (subject or task) would be more robust and generalizable than the individual samples of that class.
For a given value of 𝑘 ∈ [𝑁], we had class-labeled learning samples, i.e. {(y_1k, z_1), …, (y_Nk, z_N)}, where y_ik is the 𝑘-dimensional eigenspace representation of the 𝑖-th learning FC (𝑖 ∈ [𝑁]), and z_i ∈ [𝑍] is the corresponding class label. Using these samples, we computed per-class centroids:

c_lk = (1 / |C_l|) Σ_{i ∈ C_l} y_ik

where c_lk is the 𝑘-dimensional centroid of class 𝑙 ∈ [𝑍], C_l is the set of indices of samples belonging to class 𝑙, and |C_l| is the number of samples in class 𝑙. For the SI and TI processes, classes correspond to the subjects and the tasks included in the learning dataset, and these centroids are average representations of the subjects and tasks in the eigenspace, respectively.
For a given validation FC, we first vectorized it into an 𝑀-dimensional vector 𝒘. We then obtained a 𝑘-dimensional vector g_k by projecting 𝒘 into the eigenspace constructed in the learning stage using the following equation:

g_k = [u_1, …, u_k]^T w̄

where w̄ = 𝒘 − ⟨𝒘⟩. To provide an alternative and perhaps more intuitive perspective, one may also think of this process as a multi-linear regression:

w̄ = U_k β + ε

where w̄ is the dependent variable (the validation FC), U_k = [u_1, …, u_k] contains the independent variables (the eigenvectors), β = g_k is the 𝑘-dimensional vector of estimated coefficients, and ε is the residual noise. A validation FC (g_k) was identified as belonging to the class 𝑙* that minimized the distance between g_k and the class centroid c_lk:

𝑙* = argmin_{𝑙} dist(g_k, c_lk)

where c_lk is the centroid for class 𝑙 ∈ [𝑍], and ‘dist’ is the distance function used to compute the distance between the input g_k and the class centroids. In our case, we used the cosine distance, given by the following equation:

dist(𝒙, 𝒚) = 1 − cos θ = 1 − ⟨𝒙, 𝒚⟩ / (‖𝒙‖ ‖𝒚‖)

where ⟨𝒙, 𝒚⟩ is the dot product of vectors 𝒙 and 𝒚. For high-dimensional data, a natural choice to measure the closeness of two vectors is, empirically, the angle between them, or the cosine of that angle. The framework was also tested with correlation distance and Euclidean distance, and similar results were found (results not shown). We repeated the identification process for all the validation FCs, and the identification rate was defined as
Identification Rate = (Number of correctly labeled validation FCs) / (Total number of validation FCs)
Using this generic definition of accuracy, we can also compute SI or TI rates for subsets of the validation dataset, for all possible values of the eigenspace dimensionality 𝑘 = 1, 2, …, 𝑁. The composition of the learning and validation datasets depends on how FCs are split, and is described in more detail below.
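Putting the Learning and Identification stages together, the pipeline can be sketched end-to-end in NumPy (the paper used MATLAB's pca). This is a minimal sketch under our own assumptions: all function names are ours, and the toy data below (3 "subjects", 20 regions, small noise) is synthetic and only meant to exercise the machinery.

```python
import numpy as np

def vectorize_fc(fc):
    """Upper-triangular part (excluding the diagonal) of a symmetric FC."""
    return fc[np.triu_indices(fc.shape[0], k=1)]

def learn_eigenspace(X):
    """X: M x N matrix of vectorized learning FCs (one per column).
    Left singular vectors of the centered matrix are the eigenvectors of
    X̄ X̄^T, sorted by descending eigenvalue (= descending variance)."""
    Xbar = X - X.mean(axis=0)                 # center each FC on its own mean
    U, s, _ = np.linalg.svd(Xbar, full_matrices=False)
    return U

def project(U, x, k):
    """k-dimensional eigenspace representation y_ik = [u_1..u_k]^T (x - <x>)."""
    return U[:, :k].T @ (x - x.mean())

def nearest_centroid_label(g, centroids):
    """Cosine-distance nearest-centroid classification over a label->centroid dict."""
    def cos_dist(a, b):
        return 1.0 - (a @ b) / (np.linalg.norm(a) * np.linalg.norm(b))
    return min(centroids, key=lambda l: cos_dist(g, centroids[l]))

# --- toy example: 3 "subjects", 2 learning FCs each, m = 20 regions ---
rng = np.random.default_rng(0)
m, k = 20, 4
subj_proto = [rng.standard_normal((m, m)) for _ in range(3)]
def toy_fc(p):                                # symmetric noisy copy of a prototype
    a = p + 0.1 * rng.standard_normal((m, m))
    return (a + a.T) / 2

X = np.column_stack([vectorize_fc(toy_fc(subj_proto[s]))
                     for s in range(3) for _ in range(2)])   # M x 6
labels = [0, 0, 1, 1, 2, 2]
U = learn_eigenspace(X)
Y = {i: project(U, X[:, i], k) for i in range(X.shape[1])}
centroids = {l: np.mean([Y[i] for i in range(6) if labels[i] == l], axis=0)
             for l in range(3)}

# identify a held-out FC from subject 1; should map back to subject 1
g = project(U, vectorize_fc(toy_fc(subj_proto[1])), k)
predicted = nearest_centroid_label(g, centroids)
```

Using the thin SVD of the centered matrix avoids forming the very large M × M covariance matrix X̄ X̄^T explicitly while yielding the same eigenvectors.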
Subject Identification (SI) Process
For each of the 100 unrelated subjects, we had eight different fMRI tasks (including resting-state), as described above. For each task, we had two runs, here referred to as Test and Retest. For resting-state, we had four runs in total, two runs per session, but we only used the two runs from session 1 to balance the dataset with the task fMRIs. For simplicity, we will refer to resting-state as a task, unless clearly stated otherwise. For SI statistics, we must consider the dependence between subjects in the sample. For instance, if two subjects A and B are very close to each other, B might be misclassified as A. But, if A were not in the learning dataset, it is possible that B would have been classified correctly. A convenient procedure to assess variability in the identification process is to use random cross-validation resampling, with each resample comprising random draws without replacement from the group of subjects. Within every cross-validation run, we randomly picked 80% of the 100 subjects (100 × 0.8 = 80) as our learning subjects from the Test session. (It must be emphasized that, for any task, FCs from the two runs (left-to-right vs right-to-left phase-encoding, or LR vs RL) of the original HCP dataset were randomly assigned to either Test or Retest. That is why choosing FCs from only Test is essentially choosing FCs randomly from the two available LR and RL sessions.) For every subject, we picked 𝑇 ∈ [7] task FCs, which resulted in 𝑁 = 80 × 𝑇 FCs in the learning dataset, i.e.

𝑿 = [x_1,1, x_2,1, …, x_T,1, …, x_1,80, x_2,80, …, x_T,80]

where x_i,j is the vectorized FC for the 𝑗-th subject and the 𝑖-th task. Then, as described above, we apply PCA to 𝑿 in order to create an eigenspace and compute 𝑘-dimensional eigenspace representations of all the learning FCs for a given value of 𝑘 (𝑘 is the eigenspace dimensionality, i.e. the number of eigenvectors chosen for the projection, in descending order of eigenvalues or, equivalently, explained variance), i.e.

Y_k = [y_1,1, y_2,1, …, y_T,1, …, y_1,80, y_2,80, …, y_T,80]

where Y_k is the matrix of 𝑘-dimensional projections of all the learning FCs. For the SI process, ‘subjects’ are the classes, i.e.
𝑍 = 80. So, in the Identification Stage, one centroid is computed per subject, i.e. C_subj,k = {c_1k, c_2k, …, c_80,k}, where C_subj,k is the matrix of all subject-centroids in the 𝑘-dimensional eigenspace. These centroids reflect an average representation of subjects across tasks, which was then utilized in the identification process. In each cross-validation resample, the validation dataset comprised new FCs (additional runs of the learning tasks, or external tasks) of the same subjects employed in the learning dataset. FCs in the validation dataset always included all tasks for all learning subjects; hence, overall it always comprised 80 × 8 = 640 FCs. The validation dataset was subdivided into two categories:

1) Within-Learning-Tasks: new FCs that belonged to the tasks that were included in the learning dataset.
2) Across-Tasks: new FCs that belonged to the tasks that were not included in the learning dataset.

All the validation FCs were projected into the eigenspace and were labelled by identifying the nearest ‘subject centroid’, as described in detail above. The SI process was performed for: 1) Within-Learning-Tasks and Across-Tasks, separately; 2) 100 random cross-validation resamples; 3) all values of the eigenspace dimensionality 𝑘 = 1, 2, …, 𝑁; and 4) different numbers of learning tasks, 𝑇 = 1, 2, …, 7. For a given value of 𝑇, the process was repeated for all possible combinations of tasks in the learning dataset. For instance, if 𝑇 = 2, there are (8 choose 2) = 28 possible ways to pick two tasks out of eight, so the process was repeated for all 28 combinations.

Task Identification (TI) Process
As in the SI process, we must consider the dependence between subjects in the sample, although here the consideration is slightly different. Two subjects A and B from the same task, when averaged, could create a ‘better’ average representation of the task than, say, subjects B and C. Here the word ‘better’ means a representation that is more generalizable to the rest of the sample and hence would perform better in the identification stage. As in the SI process, variability in the identification process was assessed using random cross-validation resampling, with each resample comprising random draws of subjects without replacement. Additionally, we should consider the number of subjects per task in the learning dataset, because intuitively a larger sample of subjects per task could create a ‘better’ average representation of the task than a smaller one. So, we explored the TI process over a range of numbers of subjects per task in the learning dataset. Within each cross-validation run, 𝑛 subjects from the Test session were chosen randomly per task. So, the total number of FCs in the learning dataset is 𝑁 = 8𝑛, since there are 8 tasks in total, i.e.

𝑿 = [x_1,1, x_2,1, …, x_n,1, …, x_1,8, x_2,8, …, x_n,8]

where x_i,j is the vectorized FC for the 𝑗-th task and the 𝑖-th subject. Then, just as in the SI process, an eigenspace was created using PCA, and all the learning FCs were projected into the eigenspace for a given value of 𝑘, i.e.

Y_k = [y_1,1, y_2,1, …, y_n,1, …, y_1,8, y_2,8, …, y_n,8]

where Y_k is the set of 𝑘-dimensional projections of all the learning FCs. For the TI process, the classes are the different ‘tasks’, instead of ‘subjects’, i.e. 𝑍 = 8. So, in the Identification Stage, one centroid was computed per task, i.e. C_task,k = {c_1k, c_2k, …, c_8k}, where C_task,k is the set of all task-centroids in the 𝑘-dimensional eigenspace. These centroids reflect an average representation of tasks across subjects, which was then utilized in the identification process. The validation dataset comprised the FCs from the Retest session for all the subjects and all the tasks (100 × 8 = 800). The validation dataset was subdivided into two categories:

1) Within-Learning-Subjects: new FCs that belonged to the same subjects that were included in the learning dataset, and
2) Different-Subjects: new FCs that belonged to all the other subjects that were not included in the learning dataset.

All the validation FCs were projected into the eigenspace and were labelled by identifying the nearest ‘task’ centroid. The TI process was performed for: 1) Within-Learning-Subjects and Different-Subjects, separately; 2) 100 cross-validation resamples; 3) all values of the eigenspace dimensionality 𝑘 = 1, 2, …, 𝑁; and 4) different numbers of subjects per task, 𝑛 ∈ {2, 3, …, 20, 30, 40, …, 80}.

Null Model Evaluation for the framework
For both the SI and the TI processes, a null model was evaluated by randomly permuting the class labels of the learning dataset and repeating the identification process.
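This permutation null model can be sketched as follows. The sketch is ours, not the paper's code: `classify_rate` stands for a hypothetical callback that reruns the identification stage with a given label assignment and returns the resulting identification rate, and the toy "classifier" in the usage example is purely illustrative.

```python
import random

def null_model_rates(learning_labels, classify_rate, n_perm=100, seed=0):
    """Null model: randomly permute the class labels of the learning dataset
    and repeat the identification process for each permuted labeling."""
    rng = random.Random(seed)
    rates = []
    for _ in range(n_perm):
        permuted = learning_labels[:]      # copy the labels...
        rng.shuffle(permuted)              # ...then permute them in place
        rates.append(classify_rate(permuted))
    return rates

# toy usage: a 'classifier' whose rate is the fraction of labels left in place
true_labels = [0, 0, 1, 1, 2, 2, 3, 3]
rate_fn = lambda labs: sum(a == b for a, b in zip(labs, true_labels)) / len(labs)
rates = null_model_rates(true_labels, rate_fn)
```

The distribution of `rates` gives the chance-level baseline against which the SI and TI rates can be compared.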
Comparative Analysis: SI and TI using original FCs
As a comparative analysis, the SI and TI processes were also performed using the original FCs (Orig FCs). The learning and validation datasets were created in the same way, and the process was repeated for the same values of the different parameters. Instead of averaging the eigenspace representations, Orig FCs were averaged across tasks and subjects for the SI and the TI processes, respectively. The second major difference was in the way the FCs in the validation dataset were compared to the learning dataset. First, the averaged representations of subjects or tasks (for SI and TI, respectively) were vectorized and organized into a matrix, i.e. C_subj = [c_1, c_2, …, c_80] (for SI) and C_task = [c_1, c_2, …, c_8] (for TI), where c_i is an averaged FC. All the FCs in the validation dataset were also vectorized. A given vectorized validation FC, 𝒚, was identified as belonging to the class 𝑙* that maximized the similarity between the input 𝒚 and the averaged FC for the class, c_l:

𝑙* = argmax_{𝑙} 𝑑(𝒚, c_l)

where 𝑙 ∈ [𝑍], the set of all class labels, c_l is the averaged FC for the 𝑙-th class, and

𝑑(𝒚, 𝒄) = Σ_j (y_j − ⟨𝒚⟩)(c_j − ⟨𝒄⟩) / ( √(Σ_j (y_j − ⟨𝒚⟩)²) · √(Σ_j (c_j − ⟨𝒄⟩)²) )

is the Pearson correlation coefficient between 𝒚 and 𝒄. A direct comparison between the traditional identification process (for instance, Finn et al. or Venkatesh et al.) and GEFF is only possible when we use only one FC per subject in the learning stage.
For two or more FCs per subject in the learning dataset, we used averaged-across-tasks FCs instead, as described in detail above. This was necessary to keep the comparative analysis with Orig FCs consistent with GEFF, while also allowing qualitative comparisons with the previous literature.

Characterization of Eigenvectors in Terms of Their Subject- and Task-fingerprint
We did a post-hoc analysis to characterize each eigenvector separately in terms of its subject- and/or task-fingerprint. The idea was to see whether eigenvectors, taken separately, indeed hold a subject- and/or task-fingerprint, and whether there are different regimes of eigenvectors based on subject- and task-specificity. For this process, FCs for all the subjects and all the tasks from the Test and the Retest sessions (1600 FCs) were vectorized and then organized into a matrix $\bm{X}$:

$$\bm{X} = [\bm{x}_1, \bm{x}_2, \ldots, \bm{x}_N]$$

where $\bm{x}_i$ is an $M$-dimensional vectorized FC ($i \in [N]$), and $N = 2 \times 100 \times 8 = 1600$ is the total number of FCs. To construct an eigenspace, we input $\bm{X}$ to PCA (MATLAB command pca) to extract the eigenvectors and the representations (projections) of the $\bm{x}_i$ vectors in(to) the eigenspace (much in the same way as we did in the Learning Stage for GEFF, Figure 1a-b):

$$\bm{U} = [\bm{u}_1, \bm{u}_2, \ldots, \bm{u}_N], \qquad \bm{Y} = [\bm{y}_{1N}, \bm{y}_{2N}, \ldots, \bm{y}_{NN}]$$

where $\bm{u}_i$ is an $M$-dimensional eigenvector and $\bm{y}_{iN}$ is the $N$-dimensional projection of the $M$-dimensional vector $\bm{x}_i$ into the $N$-dimensional eigenspace ($i \in [N]$). The matrix $\bm{Y}$ can be expanded as:

$$\bm{Y} = \begin{bmatrix} y_{11N} & y_{21N} & \cdots & y_{N1N} \\ y_{12N} & y_{22N} & \cdots & y_{N2N} \\ \vdots & \vdots & \ddots & \vdots \\ y_{1NN} & y_{2NN} & \cdots & y_{NNN} \end{bmatrix}$$

where each column is an $N$-dimensional projection ($\bm{y}_{iN}$) in the $N$-dimensional eigenspace. These projections can be thought of as coordinates in an $N$-dimensional eigenspace, spanned by the $N$ eigenvectors. Hence, the $i$-th row contains the weights or loadings of all the projections corresponding to the $i$-th eigenvector. Since each column corresponds to an FC that belongs to a specific task and subject, the weights corresponding to each eigenvector can also be grouped by tasks or subjects. We characterized each eigenvector individually, in terms of its subject- and/or task-fingerprint, using a two-way ANOVA on the corresponding weights, where the group effects were 'task' and 'subject'.
This analysis was repeated for all eigenvectors, and the corresponding p-values and effect sizes (F-statistics) were computed. The p-values were corrected for multiple comparisons using Bonferroni correction across the 1600 ANOVAs performed. An eigenvector was declared task- and/or subject-dominant if the corresponding p-value was significant (p < 0.01, Bonferroni corrected) and subsequently based on the magnitude of the corresponding F-statistic.
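A minimal sketch of this characterization step, using NumPy/SciPy rather than the authors' code, and assuming one weight per subject-task cell (a simplification of the actual design, which has multiple sessions per cell). The toy grid below has a deliberately strong task effect:

```python
import numpy as np
from scipy import stats

def two_way_anova(W):
    """Two-way ANOVA without replication on an S x T grid of eigenvector
    weights (rows = subjects, columns = tasks). Returns
    (F_task, p_task, F_subj, p_subj)."""
    S, T = W.shape
    gm = W.mean()
    ss_task = S * ((W.mean(axis=0) - gm) ** 2).sum()   # between-task SS
    ss_subj = T * ((W.mean(axis=1) - gm) ** 2).sum()   # between-subject SS
    ss_err = ((W - gm) ** 2).sum() - ss_task - ss_subj # residual SS
    df_t, df_s, df_e = T - 1, S - 1, (S - 1) * (T - 1)
    F_task = (ss_task / df_t) / (ss_err / df_e)
    F_subj = (ss_subj / df_s) / (ss_err / df_e)
    return (F_task, stats.f.sf(F_task, df_t, df_e),
            F_subj, stats.f.sf(F_subj, df_s, df_e))

# toy weights: 20 subjects x 8 tasks, with large per-task offsets
rng = np.random.default_rng(2)
W = rng.standard_normal((20, 8)) + 5.0 * np.arange(8)
F_task, p_task, F_subj, p_subj = two_way_anova(W)
# this grid should come out clearly task-dominant
```

Running the F-tests per eigenvector and then Bonferroni-correcting the p-values across all eigenvectors would mirror the procedure described above.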
3. RESULTS
In this study, we proposed the Graph Embedding for Functional Fingerprinting (GEFF) framework. GEFF was employed to perform subject- and task-identification (SI and TI, respectively) using the 100 unrelated subjects from the HCP 900 subject data release. GEFF consisted of two stages: 1) Learning and 2) Identification. In the
Learning stage , we computed an eigenspace representation of each FC in the learning dataset using group-level PCA decomposition. In the
Identification stage, we computed average representations (centroids) of each underlying class (subjects or tasks) in the learning dataset. Then, using the eigenvectors computed in the Learning Stage, we projected each validation FC into the eigenspace and identified it by matching its projection with one of the class centroids (Figure 1).
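The identification stage can be sketched as a projection followed by nearest-centroid matching. In this hypothetical Python/NumPy version, Euclidean distance is an illustrative choice of matching criterion, not necessarily the paper's exact metric:

```python
import numpy as np

def geff_identify(y_new, U, centroids, x_mean):
    """Project a validation FC into the learned eigenspace and return the
    index of the nearest class centroid (Euclidean distance assumed).

    y_new     : (M,) vectorized validation FC
    U         : (M, K) learning-stage eigenvectors
    centroids : (K, Z) columns are class-averaged eigenspace representations
    x_mean    : (M,) mean FC from the learning stage (for centering)
    """
    proj = U.T @ (y_new - x_mean)                      # (K,) coordinates
    d = np.linalg.norm(centroids - proj[:, None], axis=0)
    return int(np.argmin(d))

# toy check: an FC synthesized exactly at centroid 2 is identified as 2
rng = np.random.default_rng(4)
U, _ = np.linalg.qr(rng.standard_normal((10, 4)))      # orthonormal basis
centroids = rng.standard_normal((4, 3))
y = U @ centroids[:, 2]
print(geff_identify(y, U, centroids, np.zeros(10)))    # → 2
```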
Both the SI and TI processes were repeated using original FCs (Orig FCs), where average representations of the underlying classes (subjects or tasks) were computed by averaging the corresponding FCs. The class of each validation FC was identified by matching it (using correlation; see the corresponding Methods section for details) with one of these averaged FCs.
The SI process was performed using different numbers of task FCs per subject in the learning dataset, which we labeled as LS(i), i = 1, 2, …, 7. To assess the robustness of the results and enable statistical comparisons between the two frameworks (Orig FCs and GEFF), SI rates were computed for 100 random cross-validation resamples. For each cross-validation resample, 80% of the subjects (for each learning task) were randomly chosen without replacement from the Test session to create the learning dataset. SI rates were then computed for new FCs of the same subjects when 1) the FCs belonged to the same tasks as the learning tasks (Within-Learning-Tasks) and 2) the FCs belonged to tasks different from the learning tasks (Across-Tasks). Whenever possible, we show the SI rates separately for the cases where resting-state was part of the learning dataset (RS+) and the cases where it was not (RS‒). Even though this choice is somewhat intuitive, considering that resting-state fMRI is by design different from task fMRI, we provide a more practical reason when we discuss the SI results with two task FCs per subject in the learning dataset, i.e., LS(2). We should also highlight that the variation around the mean behavior (whether across cross-validation resamples or learning-task permutations) was so small (in most cases) that it was hidden behind the solid mean lines.
At the maximum eigenspace dimensionality, GEFF improved SI rates over Orig FCs for each task and for both Within-Learning-Task and Across-Tasks scenarios (Figure 2a, 2d). Within-Learning-Task SI rates exceeded 90% for GEFF using the resting-state, gambling, language, relational, and social tasks. For resting-state, the SI rate was exactly 100% across all the cross-validation resamples (Figure 2a). Even for the emotion, motor, and working memory tasks, where the SI rates were lower than 90%, they were still significantly higher than their Orig FCs counterparts (e.g., an improvement of around 30% for the motor task) (Figure 2a). Even though Across-Tasks SI rates were considerably lower (with the highest for the relational and working memory tasks: ~60%), they were significantly higher than SI rates using Orig FCs (Figure 2d). SI rates increased monotonically with increasing eigenspace dimensionality (Figure 2b-c, 2e-f). Interestingly, Within-Learning-Task SI rates for resting-state saturate at 100% using only 75% (60⁄80) of the maximum eigenspace dimensionality (Figure 2b). For Within-Learning-Task SI rates, GEFF required less than half of the maximum eigenspace dimensionality to cross the Orig FCs SI rates (Figure 2b-c). On the other hand, GEFF Across-Tasks SI rates required more than half but less than 75% of the maximum eigenspace dimensionality to cross the Orig FCs SI rates (Figure 2e-f).
At the maximum eigenspace dimensionality, Within-Learning-Tasks SI rates for GEFF were close to perfect for all permutations across learning tasks (Figure 3a). Interestingly, SI rates for validation FCs from resting-state were considerably lower when resting-state was not part of the learning dataset (Figure 3b). On the other hand, if we included resting-state in the learning dataset, along with one other task, we saw that SI rates for all the validation tasks were very high, whether those tasks were part of the learning dataset (Figure 3a) or not (Figure 3b). The combination of resting-state with the motor task in the learning dataset seemed to be an exception, as it resulted in lower SI rates for the relational and social tasks (70−80%). This special behavior of resting-state compelled us to separate the cases where resting-state was part of the learning dataset from the cases where it was not. Just as observed with one learning task, SI rates increased monotonically with increasing dimensionality (Figure 3c-f). Within-Learning-Tasks SI rates saturated, using only 75% (120⁄160) of the maximum eigenspace dimensionality, when resting-state was included in the learning dataset (RS+; Figure 3c), and also saturated when resting-state was not included (RS─; Figure 3d). When resting-state was not included in the learning dataset, average Across-Tasks SI rates were lower (RS─; Figure 3f) and increased when resting-state was included (RS+; Figure 3e). It should be noted that Across-Tasks SI rates never reached a saturation point (Figure 3e, 3f). Also, for both Within-Learning-Tasks and Across-Tasks, GEFF required less than half of the maximum eigenspace dimensionality to cross the Orig FC SI rates (Figure 3c-f). We should also highlight that without RS in the learning dataset, six or more tasks are required to reach Across-Task SI rates similar to those obtained with RS and one other task in the learning dataset (Figure S1; bottom row). We explore this in more detail in the next subsection. At this point, we have shown that using GEFF improved SI rates for all tasks individually (LS(1); Figure 2) and that we achieved close to perfect SI rates using only two tasks in the learning dataset (LS(2)) when the learning and validation FCs come from the same tasks (Figure 3a). In addition, the SI process can be made potentially task-independent using only two tasks in the learning dataset, if one of the tasks is resting-state, although the corresponding rates can still be improved (Figure 3b, 3d). For this purpose, we considered Across-Tasks SI rates using more than two tasks in the learning dataset. With resting-state included in the learning dataset (RS+), we reached progressively higher Across-Tasks SI rates with three and with four learning tasks. Beyond that, the improvement in SI rates was marginal (Figure S1; top row). Interestingly, when resting-state was not included (RS‒), SI rates do increase with an increasing number of learning tasks, but reach a lower maximum (Figure 4; bottom row) than RS+ achieves with only four learning tasks.
With an increasing number of tasks in the learning dataset, the percentage of the maximum eigenspace dimensionality required to cross the Orig FC SI rates, and to achieve saturation, decreased (Figure S1). Finally, Across-Tasks SI rates increased for Orig FCs with increasing learning tasks (just like GEFF) when resting-state is included in the learning tasks (RS+) (Figure S1; top row) but decreased when resting-state is not included (RS‒) (Figure 4; bottom row).
Figure 2: Subject-Identification (SI) rates with only one task in the Learning dataset (LS (1) ) for Orig FCs and GEFF . SI rates for each Learning task at the maximum eigenspace dimensionality (i.e. 80) when the validation dataset contains new FCs from the same task as the learning stage dataset i.e.
Within-Learning-Task (a) and when the validation dataset is made up of new FCs from tasks not included in the learning stage dataset i.e.
Across-Tasks (d). The bars show the mean and the error bars show the standard error of mean (SEM) across the cross-validation resamples. (b) and (e) show the SI rate curves with increasing eigenspace dimensionality for
Within-Learning-Tasks and Across-Tasks when only FCs from resting-state are chosen as the learning dataset (RS+). On the other hand, (c) and (f) show similar curves for
Within-Learning-Tasks and Across-Tasks when resting-state is not included in the learning dataset (RS─). The solid lines show the mean SI rate across learning tasks and the shaded regions show the SEM.
GEFF improved the subject identification rates over Orig FCs across the board: 1) whether the validation FCs belong to the same tasks as the learning tasks or not (Within-Learning-Tasks or Across-Tasks) and 2) whether the learning tasks include resting-state or not (RS+ or RS‒) (Figure 4). We also show that a qualitatively optimal point for GEFF with respect to subject identification accuracy is when we have two learning tasks and one of those is resting-state (white asterisk, Figure 4). In addition, we show that an average individual representation, whether it was created using Orig FCs or with GEFF, resulted in a much better individual fingerprint (Figure 4; Within-Learning-Tasks) and became more generalizable to external tasks (Figure 4; Across-Tasks). Finally, the SI rates for the null model under any condition are very low and almost identical to the chance level of identification (Figure 2-3).
Figure 3:
Subject Identification (SI) rates with two tasks in the Learning stage dataset (LS (2) ) for Orig FCs and GEFF.
SI rates for each permutation of two tasks in the Learning dataset at the maximum eigenspace dimensionality (i.e. 160) when the validation dataset contains new FCs from the same two tasks as the Learning dataset i.e.
Within-Learning-Tasks (a) and when the validation dataset is made up of new FCs from tasks not included in the Learning dataset i.e.
Across-Tasks (b). SI rate curves with increasing eigenspace dimensionality, when one of two tasks in the Learning dataset is resting-state (RS+) are shown in (c) and (e), respectively, for
Within-Learning-Tasks and
Across-Tasks . On the other hand, (d) and (f) show similar curves for
Within-Learning-Tasks and
Across-Tasks when resting-state is not included (RS─) in the Learning dataset. The solid lines show the mean SI rate across all Learning tasks permutations and the shaded regions show the standard error of mean (SEM). Two black rectangles in each row of (b) correspond to the two tasks that were used in the Learning stage for that particular case.
The first step in the TI process was to see how the TI rates change with the number of subjects per task in the learning dataset. This process was repeated for a wide range of numbers of subjects per task (n = [2:1:20, 30:10:80], in MATLAB colon notation). To assess the robustness of the results and for statistical comparisons between the two frameworks (Orig FCs and GEFF), TI rates were computed for 100 cross-validation resamples. Within each cross-validation resample, n subjects (for all tasks and resting-state) were chosen at random from the Test session to create the learning dataset. TI rates were then computed for new FCs from the same tasks when 1) the FCs belonged to the same subjects as the ones included in the learning dataset (Within-Learning-Subjects) and 2) the FCs belonged to all the other subjects that were not included in the learning dataset (Different-Subjects).

Figure 4: A summary of Subject Identification (SI) results for Orig FCs (left) and GEFF (right).
For GEFF, the SI rates correspond to the maximum eigenspace dimensionality for a given number of tasks in the learning dataset (LS (i) ). White asterisk marks a qualitatively optimal setting for GEFF, where both Within-Learning-Tasks and Across-Tasks SI rates are very high while minimizing the learning tasks to 2.
We observed that at 20 subjects per task in the learning dataset, the TI rates reach a plateau for both Within-Learning-Subjects and Different-Subjects, although there was a marginal increase with GEFF with an increasing number of subjects per task (Figure 5). TI rates using Orig FCs saturated at a level that was always lower than the corresponding TI rates for GEFF, which saturated around 99%. It should be highlighted that TI rates for GEFF were computed at the maximum eigenspace dimensionality for each value of n. Also, the standard error of the mean across cross-validation resamples was so low that it is hidden behind the mean lines (Figure 5). After establishing that TI rates reach a saturation point after 20 subjects per task, we assessed the TI rates at n = 20 in more detail. We observed that the TI rates for GEFF cross the Orig FC TI rates with just 27.5% (44⁄160) and 30% (48⁄160) of the maximum eigenspace dimensionality for Within-Learning-Subjects (Figure S2) and for Different-Subjects (Figure S3), respectively. We noticed that the TI rates for GEFF saturated after 50% (80⁄160) of the maximum eigenspace dimensionality for both Within-Learning-Subjects and Different-Subjects (Figures S2 and S3). Another important observation was that the TI rate rises sharply with the first three eigenvectors and then steadily increases with increasing dimensionality (Figures S2 and S3). This observation highlights the importance of the first few eigenvectors in the TI process, which will be discussed again in the next section (Characterization of Eigenvectors). Finally, the confusion matrices shown in Figures S2 and S3 highlight that when the TI rates improve with increasing eigenspace dimensionality, they do so for all the tasks. This also shows that certain tasks (e.g., emotion, gambling, relational) are harder to identify than others (e.g., resting-state, social).
We should also highlight that the TI rates for the null model are very low and almost equal to the chance levels of identification rates (Figure 5; Figure S2-S3).
Figure 5: Task Identification (TI) rate curves with increasing number of subjects per task in the Learning Stage dataset for Orig FCs and GEFF.
TI rates shown were computed at the maximum eigenspace dimensionality. Left panel shows the TI rate curves when validation dataset contains new FCs from the same subjects as the ones included in the Learning dataset i.e.
Within-Learning-Subjects . Right panel, on the other hand, shows the TI rate curves when validation dataset is made up of new FCs from subjects not included in the Learning dataset i.e.
Different Subjects . Solid lines with dots show the mean TI rates across cross-validation resamples, while the shaded areas around the mean show the standard error of the mean (SEM). Note that the SEM is so small that it’s hidden behind the solid mean lines.
Using all 1600 FCs from the Test and Retest sessions, we computed the eigenvectors and their corresponding weights using group-level PCA (see Methods). To ascertain the task- and subject-specificity of a given eigenvector, a two-way ANOVA was applied to its corresponding weights using 'task' and 'subject' as the two group effects. This process was repeated for all eigenvectors, and the corresponding p-values were Bonferroni corrected. We observed that the eigenvectors can be divided into three regimes: 1) Task-Dominant, 2) Subject-Dominant, and 3) Neither (Figure 6). The Task-Dominant regime consists of the first 10−20 eigenvectors, which explain 80−90% of the variance in the data. Then, we observed a second wave of eigenvectors, which constitute the Subject-Dominant regime. This regime lasts until around 300 eigenvectors and is followed by the Neither regime, which is neither task- nor subject-specific. It should be noted here that there are no hard boundaries between these regimes. A task-dominant eigenvector can have subject-specificity (e.g., the first 10 eigenvectors) and vice versa. However, it is noteworthy that ordering the eigenvectors by their explained variance separated them into task- and subject-dominant regimes, instead of task- and subject-specificity being spuriously distributed across the range of eigenvectors.
Figure 6: Characterization of individual eigenvectors.
Top panel with black dots shows the explained variance of each eigenvector individually. The middle and lower panels show the group effects (F-statistic) for the task and subject groups, computed using a two-way ANOVA on each eigenvector's weights. The black dots with orange boundaries highlight eigenvectors with significant group effects (p < 0.01; Bonferroni corrected across the 1600 eigenvectors), while the gray dots show the non-significant ones.

4. DISCUSSION
In this paper, we proposed an embedding framework for FC fingerprinting called GEFF: Graph Embedding for Functional Fingerprinting. We employed this framework to perform Subject- and Task-Identification (SI and TI, respectively) using functional connectomes. Compared with existing frameworks, not only did GEFF considerably improve the SI and TI rates, it also made the SI and TI processes, respectively, task- and subject-independent. GEFF proved to be a highly accurate and potentially universal FC fingerprinting framework, which allowed us to robustly estimate individual fingerprints and decode cognitive states from FCs. We also showed that resting-state combined with one other task covers the entire cognitive space in terms of individual fingerprinting. We also characterized the learning stage eigenvectors, and found that they can be delineated into task- and subject-dominant regimes by simply arranging them in descending order of their explained variance.

An average individual representation, whether it was created using original FCs in the connectivity domain or with GEFF in the eigenspace, resulted in a much better individual fingerprint, especially when the FCs being identified belonged to the same tasks that were used to create the average representations. With only one exception, by adding more tasks to create the average representations, the individual fingerprint became more accurate and generalizable to external tasks. This result aligns with the previous work of Gao et al., where they show that combining multiple FCs improves predictive estimates of phenotypic measures. For original FCs (Orig FCs), there was one exception to this trend. It occurred when resting-state was not included to create the average representations and the FCs being identified belonged to tasks different from the ones used to create the average. The fingerprint became worse as more and more FCs were used to create the average.
This is partially explained by the fact that some of the validation FCs that we were trying to identify in these cases were resting-state FCs. As we explore in more detail later, it is hard to identify resting-state FCs when the average representations are created using only non-resting-state tasks. The more tasks we used to create the average representations, the fewer tasks (including resting-state) were left for identification. This resulted in a higher percentage of resting-state FCs in the validation data, which in turn caused a decrease in the fingerprinting accuracy. We did a post-hoc analysis to investigate this further. When resting-state is removed from the validation FCs, the fingerprinting accuracy increases with an increasing number of tasks participating in the average representations (Figure S4). However, this argument did not hold when 5 or more tasks were used for the average representations, as the identification rates slightly decreased for Across-Tasks. In other words, the individual fingerprint became less generalizable to external tasks with 5 or more tasks in the Learning dataset. We should also emphasize that this behavior was only observed for original FCs, and only when resting-state was excluded while computing the average representations for individuals. With GEFF, the individual fingerprint always became more accurate and generalizable to external tasks when more tasks (with and without resting-state) were used for the average individual representations. When performing individual fingerprinting for Within-Task FCs, GEFF exhibited near perfect performance. Using GEFF, the FC fingerprint was universally improved across tasks, with a perfect fingerprint (100% accuracy) for resting-state.
In addition, when more than one task was used in GEFF to create an average representation of individuals, individual fingerprinting was nearly perfect for any combination of tasks, as long as the new FCs that were being compared with the average representations belonged to the same tasks as the ones used to create the average representation. This widely outperformed the canonical method of performing FC fingerprinting using the correlation between FCs belonging to the same task. Using this canonical approach, only resting-state FCs had a reasonable fingerprint, while all the other tasks performed poorly (especially emotion and motor). Although canonical fingerprints did improve when we created the average individual representations with original FCs (except for Across-Tasks), GEFF always outperformed them by a margin (10−20%) for all possible numbers and combinations of tasks. Finn et al. reported a mean identification accuracy for resting-state, using Pearson correlation, that is higher than the one we obtained (Figure 2a; RS). Recently, Venkatesh et al. obtained a high identification rate with RS when using correlation as a similarity metric. Given that the HCP data has four runs for RS, two on each day, Finn et al. averaged the two FCs from the same day into a single FC. Altogether, this suggests that averaging across several runs of the same task produces a more representative FC, which results in higher fingerprinting accuracies. In this work, however, we only used the two runs from one of the two days and kept the two runs separate. The identification accuracy still increased to a perfect 100% with GEFF. Finally, a geometry-aware approach for comparing FCs, within a single task, was recently proposed. This method outperformed the canonical methods of using a correlation-based similarity metric across all tasks. It is noteworthy that GEFF outperforms this approach as well across all tasks (e.g., a marked improvement for the emotion, gambling, and relational tasks for Subject Identification). This work provides strong evidence to suggest that GEFF makes individual fingerprinting task-independent. When we used two or more tasks (one of those being resting-state) to create an average individual representation, we found that GEFF was able to correctly identify a validation FC (≥ 90%) even when it belonged to a task not included to construct the average individual representation. Assuming the task-independent nature of GEFF, specific FCs with embeddings that fall far away from the average representation of a given subject might indicate suboptimal quality of their estimation. This also suggests that perhaps we are all hardwired in a similar way and that there are only subtle differences in terms of functional reconfiguration when performing any cognitive task.
Therefore, perhaps it is not the task but the individual wiring of the person that explains the maximal inter-subject variability. It must be emphasized that GEFF was potentially task-independent only when one of the tasks used to create the average individual representation was resting-state. When resting-state is not used to create the average, the fingerprinting accuracy drops considerably. When resting-state is part of the average, adding more and more tasks into the average brings the fingerprinting accuracy close to perfection. On the other hand, when resting-state is excluded from the average, we found that even though the fingerprinting accuracy increases with an increasing number of tasks in the average, it reaches a plateau. An average representation created exclusively from non-resting-state tasks is not entirely generalizable to identifying resting-state FCs, as mentioned before in section 4.1.
This suggests that resting-state connectivity captures a fingerprint of an individual which is somewhat orthogonal to other tasks, as described in a little more detail below.
When we used two tasks to create the average individual representations (centroids) in GEFF, and one of the two was resting-state, we found that the resultant average representation had a strong individual fingerprint within the same tasks and was also highly generalizable to the external tasks. There were eight tasks implemented and acquired in the HCP dataset, all of them targeting different cognitive capacities as well as neural circuits, and hence providing a fair representation of an individual's cognitive space. Considering the breadth and variety of tasks assessed, our results suggest that one resting-state and one non-resting-state task would potentially be enough to fingerprint an individual anywhere in the cognitive space, i.e., when GEFF is used for these or potentially any other set of fMRI tasks. As mentioned in the previous section (4.3), an average individual representation created exclusively using non-resting-state tasks does not fully generalize to resting-state. But if we use one resting-state and one non-resting-state task, the resultant individual fingerprint is potentially universal across the whole cognitive space. All of this suggests that resting-state and all other tasks form two orthogonal axes of a cognitive space in terms of fingerprinting. This fits well with the idea of an "intrinsic architecture" and a "task-general architecture" proposed by Cole et al. Even though we observed high fingerprinting accuracy by combining any non-resting-state task with resting-state, certain tasks performed better than others. For instance, the motor task performed the worst, while the relational task performed the best when combined with resting-state. This is in agreement with previous results that show that different tasks seem to possess different levels of individual fingerprint, and that the individual differentiability obtained by combining multiple tasks depends very much on the tasks themselves.
Based on these observations, we suggest that when designing an experiment that relies on individual differentiability, the experimenter should acquire one resting-state and one non-resting-state to cover as much individual cognitive space as possible. One could tailor the non-resting-state task to ask a desired question but then combine it with resting-state to extract maximal individual fingerprint.
We observed that GEFF is unaffected by increasing sample size (see Figure S5). While the fingerprinting accuracy generally worsened with original FCs as the sample size increased, with GEFF it did not. This was true for all the different scenarios that we studied. In this work, we only go as high as 80 subjects, but if we extrapolate the observed trends to larger datasets, we expect that the individual fingerprinting accuracy for original FCs, especially with external tasks, would go down considerably. At the same time, we do not see any evidence for this trend in GEFF, where the fingerprinting accuracy is stable across different sample sizes. Interestingly, when resting-state is excluded from the learning stage and identification is performed for external tasks, the accuracy goes up with increasing data size. We do not have a clear explanation for this phenomenon at this point. A larger dataset might help us dig a little deeper into this sample-size behavior. Using original FCs or GEFF, we show that the task identification accuracy levels off around 20 subjects per task to create the average task representation. With merely 20 samples per task, we can create a task representation that is highly accurate and highly generalizable to external subjects. GEFF still outperformed Orig FCs for any number of subjects per task, although the performance gap for task identification was not as pronounced as the gap for individual identification. Note that when assessing TI for more than 20 subjects per task, GEFF TI continued to rise, reaching around 99% with 80 subjects per task, while Orig FCs TI did not exhibit improvement.
With only 20 subjects per task to create the average task representation, GEFF was able to identify all eight tasks with a high average accuracy, comparable to the accuracy achieved by a deep learning framework. All the tasks had high identification accuracies, except emotion. Even for external subjects, the average accuracy remained high. We should highlight, though, that in Wang et al. the sample size is much larger (N = 1034) than in this study (N = 100). We would emphasize again that GEFF was tested here with only eight tasks, but we show that this framework has the potential to be universal in decoding a large number of cognitive states simultaneously. Thus, GEFF could be employed to track a dynamically changing mental state with high accuracy in a relatively straightforward manner. In addition, using dynamic FC, we could also use GEFF to create a dynamic eigenspace profile of a subject performing different tasks.
By characterizing eigenvectors based on their task- and/or subject-specificity, we were able to show that they can be delineated into task- and subject-dominant regimes simply by ordering them in descending order of explained variance. We observed that the first eigenvectors, which explained around of the variance in the data, were highly task-dominant, while there was a second wave of eigenvectors, from 10 to 300, that were subject-dominant. Interestingly, most of the eigenvectors were neither task- nor subject-specific.
We should emphasize here that this organization of eigenvectors into specificity regimes was not intuitive to us beforehand. Task- and subject-specificity could easily have been scattered across the spectrum, or there could have been no task- or subject-dominant eigenvectors at all. The fact that simply ordering eigenvectors in descending order of explained variance delineates them into task- and subject-dominant regimes is an interesting phenomenon.
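A hedged sketch of this kind of characterization: given eigenspace loadings organized by task and subject, compare each eigenvector's variance across task means with its variance across subject means. The synthetic data below plants a task-dominant band and a subject-dominant band by construction; the sizes and the dominance threshold are arbitrary assumptions, not the paper's criterion:

```python
import numpy as np

rng = np.random.default_rng(1)
n_tasks, n_subjects, k = 8, 20, 50  # assumed sizes

# Synthetic loadings: eigenvectors 0-4 carry a planted task signal,
# 10-19 a planted subject signal, the rest are unstructured noise.
scores = rng.normal(size=(n_tasks, n_subjects, k))
scores[:, :, :5] += 4.0 * rng.normal(size=(n_tasks, 1, 5))         # task-dominant
scores[:, :, 10:20] += 4.0 * rng.normal(size=(1, n_subjects, 10))  # subject-dominant

# Specificity per eigenvector: variance across task means vs. subject means.
task_var = scores.mean(axis=1).var(axis=0)   # (k,) variance across 8 task means
subj_var = scores.mean(axis=0).var(axis=0)   # (k,) variance across 20 subject means

# Arbitrary 4x dominance threshold for labeling each eigenvector.
labels = np.where(task_var > 4 * subj_var, "task",
                  np.where(subj_var > 4 * task_var, "subject", "neither"))
print(labels[:12])
```

In this construction the planted bands come out labeled "task" and "subject" respectively, while most unstructured eigenvectors fall into "neither", mirroring the qualitative picture described above.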
We aim to reproduce these findings with a larger sample size to estimate the effects of an increasing number of subjects and tasks on the robustness and task-independence of GEFF. We could also use GEFF with dynamic FCs to create dynamic eigenspace profiles of individuals, to see whether those profiles provide additional information about an individual and how that individual reconfigures with changing mental states within a task. We also need to test this framework with different parcellation sizes. GEFF could be used to track disease progression over time and lead to more personalized medicine. In addition, GEFF could be applied to effective connectivity data in much the same way as to functional connectivity data.
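One practical consequence of building the eigenspace from a group-level decomposition is that it cannot be updated incrementally: projecting a single new FC is cheap, but making it part of the dataset means redoing the decomposition and re-embedding everything. A minimal numpy sketch (synthetic data; all shapes and the dimensionality k are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(100, 300))  # existing flattened FCs (assumed sizes)

def build_eigenspace(data, k=20):
    """Group-level PCA (via SVD): returns the mean and top-k eigenvectors."""
    mean = data.mean(axis=0)
    _, _, vt = np.linalg.svd(data - mean, full_matrices=False)
    return mean, vt[:k]

mean, basis = build_eigenspace(X)

# A brand-new FC can only be *projected* into the existing eigenspace ...
new_fc = rng.normal(size=300)
z = (new_fc - mean) @ basis.T

# ... but adding it to the dataset changes both the mean and the
# eigenvectors, so the decomposition is recomputed and every FC
# (old and new) must be re-embedded.
X2 = np.vstack([X, new_fc])
mean2, basis2 = build_eigenspace(X2)
embed_all = (X2 - mean2) @ basis2.T
print(embed_all.shape)
```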
GEFF was tested with a relatively modest sample size of 100 subjects, and we would like to test this framework with larger datasets. In addition, we only used one parcellation, and it has been shown that parcellation size has an effect on the FC fingerprint. New FCs cannot be added dynamically to the dataset with GEFF, as it requires a group-level decomposition to create an eigenspace; every time a new FC is added to the dataset, a reconstruction of the eigenspace and a subsequent updated projection of all the data is needed. Also, since whole FCs are embedded as points in a high-dimensional eigenspace, we cannot discern the contribution of individual brain regions to the identification accuracy. The cognitive space of the subjects was explored through the eight available fMRI tasks in the HCP dataset; datasets with even more fMRI tasks will allow better exploration of subject and task fingerprints within the GEFF framework.
CONCLUSION
In summary, we propose a graph embedding framework, i.e. GEFF, that is extremely accurate in comparing functional connectomes. We demonstrate this by showing very high subject- and task-identification accuracies using the HCP 100 unrelated subjects dataset. We also show that GEFF is potentially task-independent for subject-identification and subject-independent for task-identification. In other words, the average representation created by GEFF for a subject or task is highly generalizable to external data. In addition, we show that eigenvectors can be characterized as task- or subject-dominant, which provides a deeper insight into the extent of variance of functional connectivity across individuals and cognitive states. GEFF is a robust and potentially universal identification framework that can serve as a potential benchmark for FC fingerprinting and as an exploratory tool to track the cognitive dynamics in an individual.
ACKNOWLEDGEMENTS
Data were provided [in part] by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.
FUNDING INFORMATION
Authors acknowledge financial support from NIH R01EB022574 (JG), NIH R01MH108467 (JG and JH), Indiana Alcohol Research Center P60AA07611 (JG), and Purdue Discovery Park Data Science Award "Fingerprints of the Human Brain: A Data Science Perspective" (JG).
REFERENCES
1. Fornito A, Zalesky A, Breakspear M. The connectomics of brain disorders.
Nat Rev Neurosci . 2015;16(3):159-172. doi:10.1038/nrn3901 2. Castellanos FX, Di Martino A, Craddock RC, Mehta AD, Milham MP. Clinical applications of the functional connectome.
Neuroimage . 2013. doi:10.1016/j.neuroimage.2013.04.083 3. Crossley NA, Mechelli A, Scott J, et al. The hubs of the human connectome are generally implicated in the anatomy of brain disorders.
Brain . 2014. doi:10.1093/brain/awu132 4. Seitzman BA, Gratton C, Laumann TO, et al. Trait-like variants in human functional brain networks.
Proc Natl Acad Sci U S A . 2019;116(45):22851-22861. doi:10.1073/pnas.1902932116 5. Van Essen DC, Ugurbil K, Auerbach E, et al. The Human Connectome Project: A data acquisition perspective.
Neuroimage . 2012;62(4):2222-2231. doi:10.1016/j.neuroimage.2012.02.018 6. Van Essen DC, Smith SM, Barch DM, Behrens TEJ, Yacoub E, Ugurbil K. The WU-Minn Human Connectome Project: An overview.
Neuroimage . 2013. doi:10.1016/j.neuroimage.2013.05.041 7. Amunts K, Ebell C, Muller J, Telefont M, Knoll A, Lippert T. The Human Brain Project: Creating a European Research Infrastructure to Decode the Human Brain.
Neuron . 2016. doi:10.1016/j.neuron.2016.10.046 8. Allen NE, Sudlow C, Peakman T, Collins R. UK biobank data: Come and get it.
Sci Transl Med . 2014. doi:10.1126/scitranslmed.3008601 9. Miller KL, Alfaro-Almagro F, Bangerter NK, et al. Multimodal population brain imaging in the UK Biobank prospective epidemiological study.
Nat Neurosci . 2016. doi:10.1038/nn.4393 10. Okano H, Miyawak A, Kasai K. Brain/MINDS: Brain-mapping project in Japan.
Philos Trans R Soc B Biol Sci . 2015. doi:10.1098/rstb.2014.0310 11. Poo M ming, Du J lin, Ip NY, Xiong ZQ, Xu B, Tan T. China Brain Project: Basic Neuroscience, Brain Diseases, and Brain-Inspired Computing.
Neuron . 2016. doi:10.1016/j.neuron.2016.10.050 12. Satterthwaite TD, Xia CH, Bassett DS. Personalized Neuroscience: Common and Individual-Specific Features in Functional Brain Networks.
Neuron . 2018;98(2):243-245. doi:10.1016/j.neuron.2018.04.007 13. Mars RB, Passingham RE, Jbabdi S. Connectivity Fingerprints: From Areal Descriptions to Abstract Spaces.
Trends Cogn Sci . 2018. doi:10.1016/j.tics.2018.08.009 14. Gratton C, Laumann TO, Nielsen AN, et al. Functional Brain Networks Are Dominated by Stable Group and Individual Factors, Not Cognitive or Daily Variation.
Neuron . 2018. doi:10.1016/j.neuron.2018.03.035 15. Pallarés V, Insabato A, Sanjuán A, et al. Extracting orthogonal subject- and condition-specific signatures from fMRI data using whole-brain effective connectivity.
Neuroimage . 2018;178:238-254. doi:10.1016/j.neuroimage.2018.04.070 16. Xie H, Calhoun VD, Gonzalez-Castillo J, et al. Whole-brain connectivity dynamics reflect both task-specific and individual-specific modulation: A multitask study.
Neuroimage . 2018;180:495-504. doi:10.1016/j.neuroimage.2017.05.050 17. Amico E, Dzemidzic M, Oberlin BG, et al. The disengaging brain: Dynamic transitions from cognitive engagement and alcoholism risk.
Neuroimage . 2020;209:116515. doi:10.1016/J.NEUROIMAGE.2020.116515 18. Fornito A, Bullmore ET. Connectomics: A new paradigm for understanding brain disease.
Eur Neuropsychopharmacol . 2015. doi:10.1016/j.euroneuro.2014.02.011 19. van den Heuvel MP, Sporns O. A cross-disorder connectome landscape of brain dysconnectivity.
Nat Rev Neurosci . 2019. doi:10.1038/s41583-019-0177-6 20. Venkatesh M, Jaja J, Pessoa L. Comparing functional connectivity matrices: A geometry-aware approach applied to participant identification.
Neuroimage . November 2019:116398. doi:10.1016/J.NEUROIMAGE.2019.116398 21. Finn ES, Shen X, Scheinost D, et al. Functional connectome fingerprinting: Identifying individuals using patterns of brain connectivity.
Nat Neurosci . 2015. doi:10.1038/nn.4135 22. Amico E, Goñi J. The quest for identifiability in human functional connectomes.
Sci Rep . 2018. doi:10.1038/s41598-018-25089-1 23. Bari S, Amico E, Vike N, Talavage TM, Goñi J. Uncovering multi-site identifiability based on resting-state functional connectomes.
Neuroimage . 2019. doi:10.1016/j.neuroimage.2019.06.045 24. Rajapandian M, Amico E, Abbas K, Ventresca M, Goñi J. Uncovering differential identifiability in network properties of human brain functional connectomes. arXiv Prepr. arXiv:1911.10193. 2019. 25. Svaldi DO, Goñi J, Bharthur Sanjay A, et al. Towards subject and diagnostic identifiability in the Alzheimer's disease spectrum based on functional connectomes. In:
Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) . ; 2018. doi:10.1007/978-3-030-00689-1_8 26. Varona P, Rabinovich MI. Hierarchical dynamics of informational patterns and decision-making.
Proc R Soc B Biol Sci . 2016. doi:10.1098/rspb.2016.0475 27. Varoquaux G, Schwartz Y, Poldrack RA, et al. Atlases of cognition with large-scale human brain mapping.
PLoS Comput Biol . 2018. doi:10.1371/journal.pcbi.1006565 28. Shirer WR, Ryali S, Rykhlevskaia E, Menon V, Greicius MD. Decoding subject-driven cognitive states with whole-brain connectivity patterns.
Cereb Cortex . 2012. doi:10.1093/cercor/bhr099 29. Greene AS, Gao S, Scheinost D, Constable RT. Task-induced brain state manipulation improves prediction of individual traits.
Nat Commun . 2018;9(1). doi:10.1038/s41467-018-04920-3 30. Krienen FM, Thomas Yeo BT, Buckner RL. Reconfigurable task-dependent functional coupling modes cluster around a core functional architecture.
Philos Trans R Soc B Biol Sci . 2014;369(1653). doi:10.1098/rstb.2013.0526 31. Salehi M, Greene AS, Karbasi A, Shen X, Scheinost D, Constable RT. There is no single functional atlas even for a single individual: Functional parcel definitions change with task.
Neuroimage . 2019. doi:10.1016/J.NEUROIMAGE.2019.116366 32. Greene AS, Gao S, Noble S, Scheinost D, Constable RT. How tasks change whole-brain functional organization to reveal brain-phenotype relationships.
NEURON-D-19-01606 . 2019. 33. Wang X, Liang X, Jiang Z, et al. Decoding and mapping task states of the human brain via deep learning.
Hum Brain Mapp . 2019;n/a(n/a). doi:10.1002/hbm.24891 34. Hutchison RM, Womelsdorf T, Allen EA, et al. Dynamic functional connectivity: Promise, issues, and interpretations.
Neuroimage . 2013;80:360-378. doi:10.1016/j.neuroimage.2013.05.079 35. Jakubovitz D, Giryes R, Rodrigues MRD. Generalization Error in Deep Learning. In: Boche H, Caire G, Calderbank R, Kutyniok G, Mathar R, Petersen P, eds.
Compressed Sensing and Its Applications: Third International MATHEON Conference 2017 . Cham: Springer International Publishing; 2019:153-193. doi:10.1007/978-3-319-73074-5_5 36. Takahashi T, Murase H. Eigenspace Methods. In: Ikeuchi K, ed. Computer Vision: A Reference Guide . Boston, MA: Springer US; 2014:235-239. doi:10.1007/978-0-387-31439-6_711 37. Sirovich L, Kirby M. Low-dimensional procedure for the characterization of human faces.
J Opt Soc Am A . 1987. doi:10.1364/josaa.4.000519 38. Turk M, Pentland A. Eigenfaces for recognition.
J Cogn Neurosci . 1991. doi:10.1162/jocn.1991.3.1.71 39. Murase H, Nayar SK. Visual learning and recognition of 3-d objects from appearance.
Int J Comput Vis . 1995. doi:10.1007/BF01421486 40. Ohba K, Ikeuchi K. Detectability, uniqueness, and reliability of eigen windows for stable verification of partially occluded objects.
IEEE Trans Pattern Anal Mach Intell . 1997. doi:10.1109/34.615453 41. Glasser MF, Sotiropoulos SN, Wilson JA, et al. The minimal preprocessing pipelines for the Human Connectome Project.
Neuroimage . 2013. doi:10.1016/j.neuroimage.2013.04.127 42. Smith SM, Beckmann CF, Andersson J, et al. Resting-state fMRI in the Human Connectome Project.
Neuroimage . 2013. doi:10.1016/j.neuroimage.2013.05.039 43. Glasser MF, Coalson TS, Robinson EC, et al. A multi-modal parcellation of human cerebral cortex.
Nature . 2016. doi:10.1038/nature18933 44. Marcus DS, Harwell J, Olsen T, et al. Informatics and data mining tools and strategies for the human connectome project.
Front Neuroinform . 2011. doi:10.3389/fninf.2011.00004 45. Jenkinson M, Beckmann CF, Behrens TEJ, Woolrich MW, Smith SM. FSL.
Neuroimage . 2012;62(2). doi:10.1016/j.neuroimage.2011.09.015 46. Salimi-Khorshidi G, Douaud G, Beckmann CF, Glasser MF, Griffanti L, Smith SM. Automatic denoising of functional MRI data: Combining independent component analysis and hierarchical fusion of classifiers.
Neuroimage . 2014. doi:10.1016/j.neuroimage.2013.11.046 47. Power JD, Mitra A, Laumann TO, Snyder AZ, Schlaggar BL, Petersen SE. Methods to detect, characterize, and remove motion artifact in resting state fMRI.
Neuroimage . 2014. doi:10.1016/j.neuroimage.2013.08.048 48. Cole MW, Bassett DS, Power JD, Braver TS, Petersen SE. Intrinsic and task-evoked network architectures of the human brain.
Neuron . 2014. doi:10.1016/j.neuron.2014.05.014 49. Thomas Yeo BT, Krienen FM, Sepulcre J, et al. The organization of the human cerebral cortex estimated by intrinsic functional connectivity.
J Neurophysiol . 2011. doi:10.1152/jn.00338.2011 50. Amico E, Marinazzo D, Di Perri C, et al. Mapping the functional connectome traits of levels of consciousness.
Neuroimage . 2017. doi:10.1016/j.neuroimage.2017.01.020 51. Hotelling H. Analysis of a complex of statistical variables into principal components.
J Educ Psychol . 1933. doi:10.1037/h0071325 52. Pearson K. LIII. On lines and planes of closest fit to systems of points in space .
London, Edinburgh, Dublin Philos Mag J Sci . 1901. doi:10.1080/14786440109462720 53. Koch I.
Analysis of Multivariate and High-Dimensional Data .; 2012. doi:10.1017/CBO9781139025805 54. Efron B, Tibshirani RJ.
An Introduction to the Bootstrap .; 1993. doi:10.1007/978-1-4899-4541-9 55. Galton F. Regression Towards Mediocrity in Hereditary Stature.
J Anthropol Inst Gt Britain Irel . 1886. doi:10.2307/2841583 56. Bravais A.
Analyse Mathématique Sur Les Probabilités Des Erreurs de Situation d’un Point . Impr. Royale; 1844. 57. Gao S, Greene AS, Constable RT, Scheinost D. Combining multiple connectomes improves predictive modeling of phenotypic measures.
Neuroimage . 2019. doi:10.1016/j.neuroimage.2019.116038 58. Duong-Tran D, Amico E, Corominas-Murtra B, et al. A morphospace framework to assess configural breadth based on brain functional networks. 2019.
Figure S1: Subject Identification (SI) rates for Across-Tasks using more than 2 learning tasks: LS(i), i = 3, …, 7. Top row shows the SI curves when one of the learning tasks is resting-state (RS+), while the bottom row shows the curves when resting-state is excluded from the learning tasks (RS−). Columns are tagged by the number of tasks (i) used in the learning stage to create the eigenspace. The solid lines show the mean SI rate across all learning-task permutations and the shaded regions show the standard error of the mean (SEM).
Figure S2: Task Identification (TI) rates with 20 subjects per task in the Learning stage dataset using Orig FCs and GEFF for
Validation Stage 1 . The top-left panel shows the TI rate curve with increasing eigenspace dimensionality when the validation dataset contains new FCs from the same subjects as the Learning dataset i.e.
Validation Stage 1 . The solid lines show the mean TI rate across bootstrap resamples and the shaded regions show the standard error of the mean (SEM). The top-right panel is the confusion matrix describing the performance of the TI process using Orig FCs. The main diagonal shows the fraction of validation FCs that were identified correctly for each task. Off-diagonal elements show the fraction of FCs that were mislabeled as other tasks. Confusion matrices in the bottom panels correspond to GEFF with eigenspace dimensionality of 3, 44 and 80, as shown above the matrices and highlighted on the TI curve in the top-left panel.
Figure S3: Task Identification (TI) rates with 20 subjects per task in the Learning stage dataset using Orig FCs and GEFF for
Validation Stage 2 . The top-left panel shows the TI rate curve with increasing eigenspace dimensionality when the validation dataset contains new FCs from the subjects not included in the Learning dataset i.e.
Validation Stage 2 . The solid lines show the mean TI rate across bootstrap resamples and the shaded regions show the standard error of the mean (SEM). The top-right panel is the confusion matrix describing the performance of the TI process using Orig FCs. The main diagonal shows the fraction of validation FCs that were identified correctly for each task. Off-diagonal elements show the fraction of FCs that were mislabeled as other tasks. Confusion matrices in the bottom panels correspond to GEFF with eigenspace dimensionality of 3, 48 and 80, as shown above the matrices and highlighted on the TI curve in the top-left panel.
Figure S4: Subject Identification rates for Across-Tasks when resting-state is excluded from both Learning and Validation datasets, for Orig FCs and GEFF.