Briones, Tapia-Rivas, D’Souza, & Estevez, 2021). A com-
bination of fMRI and genomics data from a single loca-
tion was utilized to perform canonical correlation analy-
sis using two fully linked, sparse autoencoders followed by
SVM (G. Li et al., 2020). There were also several stud-
ies that combined fMRI and structural MRI; the fMRI tasks
included resting state, letter-n back task, auditory oddball,
and audiovisual stimuli tasks that elicited negative and neu-
tral emotion (Dakka et al., 2017; Kim, Calhoun, Shim, &
Lee, 2016; Oh et al., 2019; Salvador et al., 2019; Yan et al.,
2019). These studies used a variety of strategies to differen-
tiate between classes, including 3D activation maps, func-
tional connectomes, ICA decomposition, and independent
polygenic risk scores along with modelling techniques such
as artificial neural networks and decision tree classifiers.
Our research was inspired by a publication in which the
authors attempted to discover consistent patterns and con-
nections between brain areas and also categorized partici-
pants using a variety of machine learning models (Gheirat-
mand et al., 2017). However, their study was focused on
a single site, and we desired to incorporate data from nu-
merous locations, which is where (Zeng et al., 2018) came
in handy. The writers of this article combined private and
publicly available datasets. That, we believe, is the only ef-
fort in this area that utilized such a vast sample space for
model training. We were unable to locate many open fMRI
datasets, particularly those that included both T1 weighted
and resting-state fMRI. Finally, we chose two locations,
COBRE and UCLA.
3 Materials and Methods
3.1 Participants
The dataset is made up of 120 schizophrenia patients and
206 healthy people who were gathered from two different
imaging resources. Refer to Table 1. The first subgroup
was developed at the University of California, Los Ange-
les (UCLA: 50 schizophrenics and 122 healthy controls;
patients were allowed to use stable medications) (Bilder
et al., 2016). The second group comes from the Center
for Biomedical Research Excellence (COBRE) (COBRE:
69 Schizophrenia Strict Spectrum patients, 11 Schizoaffec-
tive Spectrum patients, and 84 healthy controls; all of the
patients were on antipsychotic medications) (Aine et al.,
2017).
All individuals were screened and rejected if they had
a history of neurological illness, mental retardation, seri-
ous head trauma resulting in a loss of consciousness lasting
more than 5 minutes, or a history of substance addiction
or dependency within the previous 12 months. The Struc-
tured Clinical Interview for DSM-IV Disorders was utilized
to obtain diagnostic information (SCID).
3.2 Image Acquisition
On a Siemens Erlangen 3.0 Tesla Trim Trio scanner, MR
images from both locations were collected. There were,
however, modest changes in the methods used to capture
the photos. The COBRE dataset was collected in complete
k-space EPI sequences in a single shot. The time repetitions
were set to 2000 milliseconds, and the echo time was set to
29 milliseconds. With a field of vision of 192 mm and a
thickness of 4 mm, there is no gap between the 32 slices. A
resting state run was performed on each participant, yield-
ing 150-time steps. An asymmetrical spin-echo echo-planar
sequence was used to collect the UCLA dataset. The time
repetitions were set to 2000 milliseconds, and the echo time
was set to 30 milliseconds. With a field of vision of 192
mm and a thickness of 4 mm, there is no gap between the
34 slices. A resting state run was performed on each partic-
ipant, yielding 152-time steps.
3.3 Data Preprocessing
We performed an extensive 7-step pipeline to preprocess
the resting-state fMRI data. It has been recorded by re-
search that enhanced preprocessing of the fMRI data im-
proves the performance of modelling (Churchill, Spring,
Afshin-Pour, Dong, & Strother, 2015). First, we apply
slice-time correction to each voxel’s time series using SPM
known as Hanning-Windowed Sinc Interpolation (HWSI).
Afterwards, we subtract 7 timesteps from the beginning of
each subject to account for magnetic saturation. Using FSL-
MCFLIRT (Jenkinson, Bannister, Brady, & Smith, 2002),
we were able to correct the fMRI for motion artefacts in-
duced by low-frequency drifts, which might have a detri-
mental influence on the time course decomposition of the
data. We eliminated the skull and neck voxels from the
structural T1 weighted image using the FSL-BET (Smith,
2002) corresponding to each fMRI time series from the
structural T1 weighted image. As a result, we coregistered
the structural image with the functional image, and the pro-
cess was completed. Following that, we matched the regis-
tered brains to the Montreal Neurological Institute standard
3mm brain template (MNI152) (Fonov et al., 2011). Us-
ing a Gaussian kernel with a full-width half-maximum of
4mm, spatial smoothing was performed to a time series of
data (FWHM). The final step was to downscale the images
to the integer format in order to decrease the amount of disc
space consumed.
3.4 Functional Connectivity Measure
In order to extract the functional connectomes, we used
two brain atlases for two different purposes. One was to
use it for classification purposes whereas the other one is
2