Introduction

Late-life depression (LLD) is a prevalent mental disorder among older adults, with incidence ranging from 0.2 to 14.1 per 100 person-years (Büchtemann et al., 2012). LLD is associated with rapid cognitive decline (Chodosh et al., 2007), progression to dementia (Richard et al., 2013), suicide (Conwell et al., 2002), and social burden (Zivin et al., 2013). The development of artificial intelligence (AI) has brought an evolutionary change to mental health (Lovejoy, 2019). With accumulating data from electronic health records, brain imaging, wearable devices, and even social media content, AI can predict or classify a variety of mental illnesses (Graham et al., 2019). The results from AI may validate and complement our knowledge in psychiatry.

Machine learning is a subdiscipline of AI in which algorithms learn associations from data and produce predictions on new data (i.e., learning from experience) (Jordan & Mitchell, 2015). By using a wider range of statistical models without a priori assumptions, machine learning has been applied in research on LLD. For example, using clinical and structural neuroimaging data, alternating decision trees can predict diagnosis and treatment response in LLD (Patel et al., 2015). Moreover, support-vector clustering can discriminate LLD patients with cognitive impairment from those without such impairment based on abnormalities in three proteins (Apo AI, IL-12, and stem cell factor) (Diniz et al., 2015). These studies suggest that machine learning can handle complex behavioral and biological data in LLD.

The implementation of machine learning in neuroimaging analysis is appropriate because algorithms can manage datasets with more variables than observations (Janssen et al., 2018). Traditional resting-state functional connectivity, which measures the temporal correlation between two anatomically separate brain regions at rest, has been widely used in research on LLD (Tadayonnejad & Ajilore, 2014). However, such static analysis ignores the dynamic and nonlinear information in a typical 5–10-minute resting-state scan (Xie et al., 2008). Entropy analysis (Richman & Moorman, 2000), a way to measure randomness and predictability in data, has been adopted to capture the dynamics and complexity of the temporal fluctuation in the resting brain (Wang et al., 2018). However, uncorrelated random signals (i.e., white noise) have high entropy values but are not complex at all, limiting the use of traditional entropy analysis. Therefore, multiscale sample entropy (MSE) analysis was developed, in which the complexity of the system is analyzed across different scales of the time series (Jiang et al., 2011). MSE analysis can better capture physiological changes in biological systems, with applications ranging from cardiac arrhythmia to the aging process (Costa et al., 2005). Using MSE analysis of resting-state fMRI, we have recently shown entropy differences in certain brain regions in LLD, particularly an adaptive increase of entropy in the left frontoparietal network (Lin et al., 2019). Complementary to connectivity analysis, entropy analysis can provide additional insights into how the complexity of the brain signal changes (McDonough & Nashiro, 2014). Given that mental disorders have been hypothesized to be associated with complexity changes in the brain (Takahashi, 2013; Yang & Tsai, 2013), the combination of entropy analysis and machine learning in resting-state fMRI could provide innovative features in LLD.

Extending our previous work, the current study adopted cross-sample entropy (CSE) analysis, which measures the similarity between two time series (i.e., in this case, the relation of the complexity between two brain regions of interest [ROIs]). Moreover, we constructed a three-dimensional (3D) CSE volume for each subject (Wang et al., 2014). To handle the high computational demand, convolutional neural networks (CNNs) were used. Unlike traditional machine learning, deep learning obtains features in the data by automatic learning through sets of neural networks. The CNN, initially inspired by the primate visual system, is a type of neural network that excels particularly in 3D image recognition and classification (Koppe et al., 2021; Segato et al., 2020). To date, CNNs have been applied to resting-state fMRI data to diagnose Alzheimer’s disease (Duc et al., 2020), attention deficit hyperactivity disorder (Ariyarathne et al., 2020), and schizophrenia (Qureshi et al., 2019). Thus, this study aimed to investigate which brain regions could predict LLD diagnosis and depression severity most accurately using a 3D-CNN with CSE volumes of the resting-state fMRI signal as input. We expect that our results will further the use of machine learning in fMRI analysis, helping to understand the brain’s functional architecture in LLD.

Methods

Participants

Patients recruited from the psychiatric outpatient service were at least 60 years old and had experienced at least one major depressive episode, based on DSM-5 diagnosis, after the age of 60 (regardless of the age of first depression onset, thus including both early-onset and late-onset depression). Depression severity was measured using the 17-item Hamilton depression rating scale (HAM-D) (Hamilton, 1967). We recruited patients who were still in an active depressive phase by including only those with a HAM-D score above 7 (i.e., not remitted). For ethical reasons, patients were allowed to keep their psychotropics during the study, with the dosage maintained for at least 2 months before the MRI scan; the medication load for each participant was calculated according to the modified Antidepressant Treatment History Form (ATHF) (Sackeim, 2001). Except for anxiety disorder, no other major Axis I psychiatric disorder was diagnosed. All participants had no history of significant head trauma, stroke, major neurocognitive decline, Parkinson’s disease, thyroid dysfunction, or other major neurological disorders and had a minimum score of 24 on the Mini-Mental State Examination (Folstein et al., 1983). Cognitively normal older adults without any major Axis I psychiatric diagnosis were recruited through advertisements in community centers. The study protocols were approved by the institutional review board of the Chang Gung Medical Foundation (IRB No. 201202970B0C601). All study participants provided written informed consent. Participants were merged from two consecutive projects to gain adequate power for model building (Lin et al., 2021; Lin et al., 2019).

Data acquisition

We collected MRI data using an eight-channel head coil on a 3T MRI scanner (Discovery MR750, GE Healthcare, Milwaukee, WI). Participants were asked to keep their eyes closed, not to think of anything, and not to fall asleep during the scan. The resting-state functional MRI data were collected using a T2*-weighted gradient-echo echo-planar imaging sequence with the following parameters: repetition time (TR), 2,000 ms; echo time (TE), 30 ms; flip angle, 90°; number of slices, 36; in-plane matrix size, 64 × 64; and slice thickness, 4 mm. A total of 180 dynamic volumes were acquired for each subject. T1-weighted structural images were acquired with TR, 8 ms; TE, 3 ms; flip angle, 12°; FOV, 250 × 250 mm²; voxel size, 0.98 × 0.98 × 1 mm³; and 160 slices.

Image preprocessing

The preprocessing included the following steps: slice-timing correction; motion correction by realigning images to the first volume and removing volumes exceeding 2 mm of axial displacement or 2° of rotation; normalization and deformation to the Montreal Neurological Institute template; and reslicing to 2 × 2 × 2 mm isotropic voxels. These procedures were implemented using SPM12 (Statistical Parametric Mapping, http://www.fil.ion.ucl.ac.uk/spm/). Further preprocessing steps used the REST toolbox (http://restfmri.net/forum/REST_V1.8). At each voxel, the time series was detrended and bandpass-filtered (0.01–0.08 Hz). The time courses of covariates (white matter, cerebrospinal fluid, and six head-motion parameters) were extracted and regressed out as nuisance regressors to eliminate potential physiological artifacts. Finally, the gray matter of the brain was divided into 90 ROIs according to the Automated Anatomical Labeling (AAL) atlas (Tzourio-Mazoyer et al., 2002). The time series of all voxels within an ROI were averaged. The brain networks were visualized using the BrainNet Viewer toolbox (http://www.nitrc.org/projects/bnv/) (Xia et al., 2013).
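For readers working in Python rather than SPM/REST, the ROI time-series extraction can be approximated with Nilearn as in the sketch below; the atlas fetcher, filter settings, and file names are illustrative assumptions and do not reproduce the MATLAB-based pipeline exactly.

```python
from nilearn import datasets
from nilearn.maskers import NiftiLabelsMasker
import pandas as pd

# Fetch the AAL atlas; its first 90 labels correspond to the cerebral ROIs.
aal = datasets.fetch_atlas_aal()

# Detrend, band-pass filter (0.01-0.08 Hz, TR = 2 s), and average voxels per ROI.
masker = NiftiLabelsMasker(labels_img=aal.maps, detrend=True,
                           low_pass=0.08, high_pass=0.01, t_r=2.0,
                           standardize=True)

# "func_mni.nii.gz" (normalized fMRI) and "confounds.tsv" (WM, CSF, and six
# motion parameters) are hypothetical file names.
confounds = pd.read_csv("confounds.tsv", sep="\t").values
roi_ts = masker.fit_transform("func_mni.nii.gz", confounds=confounds)

roi_ts = roi_ts[:, :90]   # keep the 90 cerebral AAL ROIs (time x ROI)
```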

Cross-sample entropy

CSE is a measure to assess the degree of asynchrony or dissimilarity between two data series. It remains relatively consistent under conditions where cross-approximate entropy does not, and it exhibits no direction dependence (Gómez et al., 2009; Richman & Moorman, 2000). With this measure, the functional connectivity between brain regions and the complexity of their data series can be simultaneously revealed (Zhang et al., 2007). Given two normalized data series \(\mathbf{x}=[x_1, x_2, \ldots, x_L]\) and \(\mathbf{y}=[y_1, y_2, \ldots, y_L]\), their CSE is defined as

$$\text{CSE}(m,r,L) = -\ln \frac{p^{m+1}}{p^{m}},$$
(1)

where \(p^{l}=\sum_{i=1}^{L-l} p_{i}^{l}/(L-l)\), with l (m or m + 1 in (1), and m = 2 in this study) being the length of the two sub-vectors \(\mathbf{y}_l(j)=[y_j, y_{j+1}, \ldots, y_{j+l-1}]\) and \(\mathbf{x}_l(i)=[x_i, x_{i+1}, \ldots, x_{i+l-1}]\) from the data series y and x, respectively. Here, \(p_{i}^{l}=n_{i}^{l}/(L-l)\) is the probability and \(n_{i}^{l}\) the number of l-point sub-vectors \(\mathbf{y}_l(j)\) in the data series y that match the l-point sub-vector \(\mathbf{x}_l(i)\) in the data series x. The indices i and j vary from 1 to L − l. A match with tolerance r is defined as \(d[\mathbf{x}_l(i), \mathbf{y}_l(j)]<r\) with

$$d[\mathbf{x}_l(i), \mathbf{y}_l(j)] = \max\left\{ |x_{i+k} - y_{j+k}| : 0 \leqslant k \leqslant l-1 \right\},$$
(2)

which is the maximum difference between the components of \(\mathbf{x}_l(i)\) and \(\mathbf{y}_l(j)\). The tolerance parameter r determines whether two sub-vectors are considered to match and was set to 0.6 in this study.
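As a concrete illustration of Eqs. (1) and (2), the following minimal Python sketch computes the CSE between two series with the parameters used in this study (m = 2, r = 0.6); it is a direct, unoptimized translation of the definitions, not the code used for the reported analysis.

```python
import numpy as np

def cross_sample_entropy(x, y, m=2, r=0.6):
    """Cross-sample entropy between two data series, following Eqs. (1)-(2)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # z-normalize both series so that r is expressed in units of SD
    x = (x - x.mean()) / x.std()
    y = (y - y.mean()) / y.std()
    L = len(x)

    def match_prob(l):
        # All l-point sub-vectors of x and y (indices 1 .. L - l in the text)
        xs = np.array([x[i:i + l] for i in range(L - l)])
        ys = np.array([y[j:j + l] for j in range(L - l)])
        # Chebyshev distance between every pair of sub-vectors (Eq. 2)
        d = np.max(np.abs(xs[:, None, :] - ys[None, :, :]), axis=-1)
        # p_i^l = n_i^l / (L - l); p^l is the mean of p_i^l over i
        return np.mean((d < r).sum(axis=1) / (L - l))

    return -np.log(match_prob(m + 1) / match_prob(m))
```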

The CSE matrix in Fig. 1b is a 90 × 90 square matrix containing CSE values between the AAL ROIs. The calculation of these CSE values starts from pairing the data time series of the 90 ROIs and is obtained using (1). Note that the CSE matrix is symmetric, with its main diagonal containing the sample entropy of each ROI (i.e., the CSE of each ROI with itself). Furthermore, the i-th column of the CSE matrix contains the CSE values of the 90 ROIs with respect to the i-th ROI. Assigning each entry of the i-th column of the CSE matrix to all the voxels of the corresponding AAL ROI, we can construct a 3D CSE volume that uses the i-th ROI as the “seed.” The resulting 3D CSE volume has a size of 91 × 109 × 91 and serves as the input to the proposed network models. Having a large enough dataset is always essential for the performance of a deep learning model. Thus, a data augmentation method is incorporated before calculating the CSE values. More specifically, each data time series of length N (180 in this study) is partitioned into three segments of length N/2 that overlap by N/4, so that the first and third segments are the first and last halves of the original time series, and the second segment comprises the last half of the first segment and the first half of the third. By doing so, we triple the data, and thus the 3D CSE volumes, for each subject. Finally, since the CSE matrix encodes only temporal information about brain activity, combining it with the 3D brain structure provides spatiotemporal details on brain interaction. We also center and scale each entry of a CSE matrix before constructing its 3D CSE volumes, using the mean and standard deviation of the normal subjects’ CSE matrices.
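The half-overlapping segmentation and the mapping of a CSE-matrix column onto a seed-ROI volume can be sketched as follows, reusing cross_sample_entropy from the previous snippet; the AAL label image is assumed to be resampled to the 2-mm MNI grid (91 × 109 × 91), and the centering and scaling against the normal group is omitted for brevity.

```python
import numpy as np

def split_segments(roi_ts):
    """Triple the data: three half-length segments overlapping by N/4
    (first half, middle half, last half of each ROI-by-time array)."""
    n = roi_ts.shape[1]                    # N = 180 volumes in this study
    half, quarter = n // 2, n // 4
    return [roi_ts[:, :half],
            roi_ts[:, quarter:quarter + half],
            roi_ts[:, half:]]

def cse_matrix(roi_ts, m=2, r=0.6):
    """Symmetric 90 x 90 CSE matrix; the diagonal is each ROI's sample entropy."""
    n_roi = roi_ts.shape[0]
    mat = np.zeros((n_roi, n_roi))
    for i in range(n_roi):
        for j in range(i, n_roi):
            mat[i, j] = mat[j, i] = cross_sample_entropy(roi_ts[i], roi_ts[j], m, r)
    return mat

def cse_volume(cse_column, aal_label_img, roi_label_values):
    """Broadcast one column of the CSE matrix (one seed ROI) onto the voxels
    of each AAL ROI in a 91 x 109 x 91 label image."""
    vol = np.zeros(aal_label_img.shape, dtype=float)
    for value, label in zip(cse_column, roi_label_values):
        vol[aal_label_img == label] = value
    return vol
```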

Fig. 1

(a) The proposed scheme for depression diagnosis and symptom severity prediction. (b) The construction of the 3D CSE volume by mapping the CSE values between the seed ROI and the remaining ROIs in the brain

Convolutional neural networks

The classification model for depression diagnosis comprises one average-pooling, three convolution, two max-pooling, two batch normalization, one flatten, and four fully connected layers. The architecture starts with the average-pooling layer. The first two of the three convolution layers follow, each succeeded by a max-pooling and a batch normalization layer. There is no consensus on the ordering of pooling and batch normalization layers, and one may try different arrangements for “smooth” training (Ioffe & Szegedy, 2015); we noted that they may be used interchangeably for various classification and regression purposes (Hasani & Khotanlou, 2019; Herent et al., 2018; Khvostikov et al., 2018). The third convolution layer is then connected, and its output is fed to the four successive fully connected layers. The input to the network is a data block of size 91 × 109 × 91 (i.e., the CSE volume of a specific seed ROI). The convolution layers have 32, 64, and 128 filters of sizes 3 × 3 × 3, 3 × 3 × 3, and 2 × 2 × 2, respectively. The filter sizes of the average-pooling and the two max-pooling layers are all 2 × 2 × 2. The number of neurons is 500 in each of the first three fully connected layers and 2 in the last one. The two severity level classification networks share the same architecture. Although we defined HAM-D scores of > 13, 11–13, and 8–10 as the severe, moderate, and mild levels, respectively, we trained the model for each level with HAM-D scores in the ranges of > 11, 9–15, and 8–12, respectively. The overlapping HAM-D ranges offer fault tolerance for the networks, which is crucial for providing consistent HAM-D scale predictions.
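A minimal Keras sketch of this classification architecture is shown below; padding, weight initialization, and other hyperparameters not stated above are assumptions rather than the authors’ exact settings.

```python
from tensorflow.keras import layers, models

def build_ddn_classifier(input_shape=(91, 109, 91, 1)):
    """Sketch of the diagnosis (classification) network described above."""
    return models.Sequential([
        layers.AveragePooling3D(pool_size=2, input_shape=input_shape),
        layers.Conv3D(32, kernel_size=3, activation="relu"),
        layers.MaxPooling3D(pool_size=2),
        layers.BatchNormalization(),
        layers.Conv3D(64, kernel_size=3, activation="relu"),
        layers.MaxPooling3D(pool_size=2),
        layers.BatchNormalization(),
        layers.Conv3D(128, kernel_size=2, activation="relu"),
        layers.Flatten(),
        layers.Dense(500, activation="relu"),
        layers.Dense(500, activation="relu"),
        layers.Dense(500, activation="relu"),
        layers.Dense(2, activation="softmax"),   # depressed vs. non-depressed
    ])
```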

The proposed regression network model for HAM-D scale prediction resembles the above classification network, except that it comprises seven fully connected layers in series instead of four and that the batch normalization layers are applied before the max-pooling layers. The convolution layers have 32, 64, and 128 filters, all of size 3 × 3 × 3. The filter sizes of the average-pooling and the two max-pooling layers are 2 × 2 × 2 and 3 × 3 × 3, respectively. The number of neurons is 1028 in each of the first three fully connected layers, 512 in each of the subsequent three layers, and 1 in the last layer. Note that the regression networks conditioned on the CSE volumes constructed from different seed ROIs share the same architecture.
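For comparison, the regression variant (batch normalization before max-pooling, 3 × 3 × 3 max-pooling, and seven fully connected layers ending in a single linear unit) might be sketched as follows, again with unspecified details filled in as assumptions.

```python
from tensorflow.keras import layers, models

def build_dspn_regressor(input_shape=(91, 109, 91, 1)):
    """Sketch of the HAM-D regression network; one model per severity level."""
    return models.Sequential([
        layers.AveragePooling3D(pool_size=2, input_shape=input_shape),
        layers.Conv3D(32, kernel_size=3, activation="relu"),
        layers.BatchNormalization(),
        layers.MaxPooling3D(pool_size=3),
        layers.Conv3D(64, kernel_size=3, activation="relu"),
        layers.BatchNormalization(),
        layers.MaxPooling3D(pool_size=3),
        layers.Conv3D(128, kernel_size=3, activation="relu"),
        layers.Flatten(),
        *[layers.Dense(1028, activation="relu") for _ in range(3)],
        *[layers.Dense(512, activation="relu") for _ in range(3)],
        layers.Dense(1, activation="linear"),    # predicted HAM-D scale
    ])
```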

All the above-mentioned layers are followed by the ReLU activation function, except for the last fully connected layer, where the softmax function is used in the classification network and the linear function in the regression network. The architectures of the network models for classification and regression are illustrated in Supplementary Fig. 2.

Model training

We retained most participants for the training dataset to ensure enough data covering each HAM-D scale for model training, resulting in 51, 7, and 13 participants in the training, validation, and testing sets for depression diagnosis, and 29, 10, and 10 participants in the corresponding sets for severity scale prediction. Data were split so that each set contained approximately the same proportion of observations from each participant group and each HAM-D scale. We used the categorical cross-entropy loss for classification and the mean squared error for regression during model training; both were optimized using the adaptive moment estimation (Adam) optimizer with 200 epochs and a batch size of 32. The same learning rate was used across all the classification models in the Depression Diagnosis Network (DDN) and the first severity classification network in the Depression Severity Prediction Network (DSPN); for the remaining classification network and the regression networks, the learning rate was 1 × 10⁻⁴. Training was stopped if the loss did not improve within 60 epochs for the classification models and 20 epochs for the regression models. Early stopping was utilized to avoid overfitting (Orr & Müller, 2003). The validation dataset provides a basis to evaluate the model trained on the training dataset: the prediction error on the training dataset keeps decreasing, whereas the error on the validation set first decreases and then increases. The early stopping point occurs when the validation error is lowest; the network weights at this point provide the best generalization, and training is stopped, since a rising validation error is a sign of overfitting to the training dataset. The models were trained in TensorFlow 1.13.1 with the CUDA 9.2 Toolkit and cuDNN v7.6.0 on an ASUS ESC8000 G4 server with an Intel Xeon CPU, a GeForce RTX 2080 Ti GPU, and 192 GB RAM.
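The optimizer, losses, and early-stopping behavior described above can be expressed in a few lines of modern Keras, as in the hedged sketch below; the dummy arrays, the TensorFlow 2.x syntax (rather than the TensorFlow 1.13 actually used), and the classifier’s learning rate are placeholders, with 1 × 10⁻⁴ stated in the text only for the regression and remaining classification networks.

```python
import numpy as np
import tensorflow as tf

# Placeholder data; real inputs are 3D CSE volumes and diagnosis labels.
x_train = np.random.rand(4, 91, 109, 91, 1).astype("float32")
y_train = tf.keras.utils.to_categorical(np.random.randint(0, 2, 4), 2)
x_val, y_val = x_train.copy(), y_train.copy()

model = build_ddn_classifier()   # from the earlier sketch
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
              loss="categorical_crossentropy", metrics=["accuracy"])

# Early stopping: halt if the validation loss does not improve for 60 epochs
# (20 epochs for the regression models) and keep the best weights.
stopper = tf.keras.callbacks.EarlyStopping(monitor="val_loss", patience=60,
                                           restore_best_weights=True)
model.fit(x_train, y_train, validation_data=(x_val, y_val),
          epochs=200, batch_size=32, callbacks=[stopper])
```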

Proposed scheme

The flow of the proposed scheme for depression diagnosis and severity scale prediction is depicted in Fig. 1a (a detailed system diagram is provided in Supplementary Fig. 1). In this scheme, the DDN is used to determine whether the subject under test is depressed. For those determined to be depressed, the HAM-D scores are predicted through the DSPN. At the core of these two networks are CNNs serving different purposes, with their inputs being the three-dimensional (3D) CSE volumes (Chen et al., 2020; Richman & Moorman, 2000; Zhang et al., 2007). For a given seed ROI, its 3D CSE volume is constructed by mapping the CSE values between the data time series of this seed ROI and those of the remaining ROIs in the brain (Chen et al., 2020), as shown in Fig. 1b.

Drawing on ensemble learning strategies (Sagi & Rokach, 2018), the DDN integrates results across different seed ROI selections to determine whether the subject under test is depressed more accurately than any constituent ROI could alone. Thus, without a priori selection of a particular ROI, we hypothesized that the DDN can benefit from integrating different ROI selections. The final decision made by the DDN is obtained by polling the results from the classification models trained on the 3D CSE volumes calculated with different seeds, providing a comprehensive investigation of the different 3D CSE volumes in depression diagnosis.

In the DSPN, not all CSE volumes of different seed ROIs are used for severity scale prediction. Three datasets are involved in a deep learning application. The training dataset contains the sample of data used to fit or train the network model. The validation dataset provides the sample of data used to obtain an unbiased estimate of the generalization error of the model acquired from the training set while tuning model hyperparameters, which helps avoid overfitting. These two datasets are used during model training. Lastly, the test dataset holds the sample of data used to provide an unbiased evaluation of the final model fit on the training and validation datasets; each sample in this dataset mimics a future sample encountered in practice. With this in mind, to select the CSE volumes for HAM-D scale prediction, we retained only those whose resulting classification models in the DDN achieved 85% accuracy during model validation. This means that no information from the CSE volumes used to evaluate the performance of the final network model was used in this selection. Moreover, the ROIs (49 in this study) used as seeds for computing the CSE volumes that led to better diagnostic accuracy may yield insights crucial to discriminating depressed patients from normal subjects and will be further explored. The accuracy of a node here means the percentage of correct classifications in disease diagnosis or symptom prediction.

To predict the HAM-D scale of a depressed subject, we adopt a two-stage strategy in the DSPN. A depressed subject is first classified as having a severe, moderate, or mild severity level; here, severe is defined as a HAM-D score > 13, moderate as between 11 and 13, and mild as ≤ 10. Two severity level classification networks are used to achieve this task. The first network, composed of m (again 49 in this study) independent weak classifiers, categorizes depressed subjects into high or low severity levels. Those at the low level are further divided into moderate or mild levels using the second severity classification network, which comprises only one classifier. Each of the m CSE volumes of the subject is fed into this network to predict the severity level, and all the predicted levels are then polled to decide the most likely severity level for that subject. Finally, we predict the HAM-D scale of the depressed subject using the regression network for the severity level to which the subject belongs, enabling precise HAM-D scale prediction. This network consists of only one regression model per severity level, and the predicted HAM-D scales obtained by feeding the different CSE volumes to the regression model are averaged and rounded to the nearest integer to obtain the final HAM-D scale for that subject.
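The two-stage DSPN decision flow, majority polling for the severity level followed by averaging of the per-volume regression outputs, can be summarized schematically as below; the model objects, vote thresholds, and class orderings are placeholders inferred from the description rather than the authors’ implementation.

```python
import numpy as np

def predict_hamd(cse_volumes, stage1_models, stage2_model, regressors):
    """Schematic two-stage DSPN inference for one depressed subject.

    cse_volumes  : list of m seed-ROI CSE volumes (each 91 x 109 x 91 x 1).
    stage1_models: m weak classifiers voting high vs. low severity (class 1 = high).
    stage2_model : one classifier splitting low severity into moderate vs. mild.
    regressors   : dict with one regression model per severity level.
    All models are assumed to expose a Keras-style predict() method.
    """
    # Stage 1: poll the m weak classifiers (majority vote, an assumption).
    votes = [int(np.argmax(mdl.predict(vol[None, ...])))
             for mdl, vol in zip(stage1_models, cse_volumes)]
    if np.mean(votes) >= 0.5:
        level = "severe"
    else:
        # Stage 2: low-severity subjects are split into moderate vs. mild.
        votes2 = [int(np.argmax(stage2_model.predict(vol[None, ...])))
                  for vol in cse_volumes]
        level = "moderate" if np.mean(votes2) >= 0.5 else "mild"

    # Final stage: average the per-volume regression outputs and round.
    preds = [float(regressors[level].predict(vol[None, ...]).squeeze())
             for vol in cse_volumes]
    return level, int(round(np.mean(preds)))
```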

Results

We enrolled 77 participants, including 49 late-life depressed patients and 28 non-depressed comparison older adults, with a mean age of 67.7 ± 6.0 years. The LLD group had a lower education level and a higher HAM-D score than the comparison group. The average age of onset of the first depressive episode in the LLD group was 50.4 ± 11.6 years, with a mean of 3.3 ± 2.9 episodes. All the patients in the LLD group were not remitted from their current depressive episode, with a mean HAM-D score of 13.3 ± 3.6. Only five patients were not taking an antidepressant, while the rest received antidepressants during the study period, including 19 on selective serotonin reuptake inhibitors, eight on mirtazapine, seven on serotonin and norepinephrine reuptake inhibitors, six on agomelatine, and four on bupropion. No group difference was found in age, sex, or MMSE score (Table 1).

Table 1 Demographic data and between group comparisons

In the testing dataset, the proposed diagnostic network correctly classified all 13 participants. Four ROIs attained an accuracy rate above 85% in classification, including the superior frontal gyrus (left dorsolateral and right orbital parts), left insula, and right middle occipital gyrus (Fig. 2). A total of 20 ROIs reached an 80% accuracy rate (Supplementary Fig. 3).

Fig. 2

Nodes where the CSE volume achieved an accuracy rate above 85% in the classification model. SFGdor, dorsal superior frontal gyrus; INS, insula; ORBsup, superior frontal gyrus, orbital part; MOG, middle occipital gyrus

For the DSPN, participants were allocated to three corresponding regression models based on symptom severity. Of the 10 testing participants, 3 were in the severe, 3 in the moderate, and 4 in the mild depression group. Since we intentionally trained these three prediction models with overlapping cut-off points to allow some fault tolerance, three subjects were allocated to the wrong models. The root-mean-square error (RMSE) for predicting the HAM-D scales of the 10 subjects was 2.41 (Fig. 3). In a post hoc analysis, we manually allocated these three patients to the correct models, and the RMSE was reduced to 1.48. This indicates that these prediction models provide good HAM-D scale prediction under the appropriate regression models. In each of the three models, we ranked all 90 ROIs based on the RMSE values shown in Supplementary Fig. 4. The CSE volume of the left inferior parietal lobule performed best in the regression model of the severe depression group with an RMSE of 1.53, the left parahippocampal gyrus in the moderate group with an RMSE of 1.66, and the left postcentral gyrus in the mild group with an RMSE of 0.50 (Supplementary Fig. 4a–c).

Fig. 3

The root-mean-square error (RMSE) between true and predicted HAM-D scales obtained by our proposed scheme for the testing data (RMSE = 2.41).

Discussion

Based on the CSE analysis of resting-state fMRI data, we demonstrated that a 3D-CNN machine learning algorithm could distinguish late-life depressed patients from the comparison group and predict depression severity. The 3D CSE volumes in regions such as the superior frontal gyrus (left dorsolateral and right orbital parts), left insula, and right middle occipital gyrus could discriminate LLD from the comparison group with an accuracy rate above 85%. Regions in the left inferior parietal lobule, left parahippocampal gyrus, and left postcentral gyrus best predicted the HAM-D score in the severe, moderate, and mild depression groups, respectively. The combined entropy analysis and 3D CNN approach can reveal complementary and additional brain features in LLD.

Two nodes in the superior frontal gyrus were found to have high predictive power in classifying LLD from non-depressed older adults. Brain atrophy in the superior frontal gyrus has been found in a meta-analysis on LLD (Boccia et al., 2015). In particular, our finding in the dorsolateral prefrontal cortex (DLPFC) is consistent with prior research showing a persistent decrease in functional connectivity in the DLPFC even after treatment (Aizenstein et al., 2009). The anterior and posterior subdivisions of the dorsolateral SFG correlate with the default mode network (DMN) and with regions involved in cognitive control, respectively (Li et al., 2013), both of which are implicated in the pathophysiology of depression. Moreover, the orbital SFG is implicated in depression: its nodal efficiency, which measures the communication efficiency of a node within a network, has been found to be positively related to depression severity in a diffusion tensor imaging study (Qin et al., 2014). The orbital SFG is referred to as the medial orbitofrontal cortex (OFC) under newer brain parcellations (Heather Hsu et al., 2020; Rolls et al., 2015). Decreased activity in the medial OFC is associated with decreased reward experience and depressive symptoms, especially anhedonia and apathy, two common symptoms of LLD (Rolls, 2019). The insula shares a similar relationship with the OFC (Ghaziri et al., 2017) and is a key node in the salience network. Decreased connectivity between the salience network (insula) and the left executive control network (ECN) has been found to correlate with executive dysfunction in LLD, suggesting the insula’s crucial role in task-switching (Li et al., 2017). Furthermore, we also found the right middle occipital gyrus (MOG) to be highly predictive of LLD. A past study reported decreased functional connectivity in the right MOG within the ECN in patients remitted from LLD (Karim et al., 2017). In LLD, hyperconnectivity has been found in the DMN and in the auditory and visual networks (Eyre et al., 2016), highlighting the significance of occipital nodes in LLD.

Regarding depression severity prediction, we could not obtain an acceptable accuracy rate using only one model. Instead, three separate models were required for the high, moderate, and low depression groups. Also, the anchor points for our categorization into three groups differed from the conventional grouping defining mild (8–16), moderate (17–23), and severe depression (≥ 24) (Zimmerman et al., 2013). The conventional categorization performed poorly in our sample. However, the model formation is data-driven in nature and reflects the range of depression severity in our sample (HAM-D range: 8–20). The ROIs derived from these three models differed from each other, offering neurobiological evidence for this categorization and challenging the traditional cutoffs used to define depression severity categories. For example, the left inferior parietal lobule and left parahippocampal gyrus were the best-performing nodes for predicting symptom severity in the high and moderate depression groups, respectively. Both nodes belong to the DMN, whose heightened activity is central to the pathophysiology of depression (Sheline et al., 2009). In contrast, the CSE in the left postcentral gyrus best predicted depression severity in the mild depression group. White matter hyperintensity (WMH)-related structural dysconnectivity in the postcentral gyrus has been associated with slower processing speed and reduced attentional set-shifting in LLD (Respino et al., 2019). Moreover, increased functional connectivity between the right postcentral gyrus and the ECN was observed after treatment of LLD (Karim et al., 2017). All these studies suggest that the structural and functional integrity of the postcentral gyrus is associated with executive dysfunction in LLD. Thus, nodes from the DMN demonstrate higher predictive power for more severe depression, whereas nodes related to cognitive function do so for milder depression.

Unlike traditional linear functional connectivity analysis, we adopted an alternative nonlinear approach to analyze the functional connectivity of distinct brain regions (Hu & Shi, 2006). CSE is used to detect the asynchrony of signals between two brain regions, reflecting the complexity of local and distributed information processing within the brain. Entropy has also been applied to understand the consciousness of the brain (Carhart-Harris, 2018), where a certain range of entropy is required for the brain to maintain its function (i.e., a state called “criticality,” a critical range between order and disorder). In addition to self-organized criticality, scale-freeness is another attribute of the dynamic brain network (Hesse & Gross, 2014; Massobrio et al., 2015). The scale-free brain network follows a power-law distribution, resulting in only a few “hubs” in the brain with multiple connections to other regions and endowing the brain network with resilience toward random attacks (Aerts et al., 2016; Albert et al., 2000). This scale-free property of the brain network can be demonstrated by CSE analysis (Pritchard et al., 2014). Thus, these prior studies suggest that entropy analysis is suitable for capturing the scale-free and criticality features of the brain network. The notion of entropy is also implicated in understanding the formation of depression, a state regarded as one’s adaptive behavior to minimize uncertainty in life by reducing the entropy of sensory and physical states (Badcock et al., 2017).

Regarding limitations, first, the sample size was too small to cover each HAM-D scale for model training. Although the independent testing dataset used in this study can still measure generalization performance for unknown future cases, data from more samples would benefit model generalization. Furthermore, the small size of our testing dataset is noteworthy, as limited testing data could lead to misestimation of model performance (Flint et al., 2021). For a few subjects, the poll was won by a margin of only 10% of the 90 seed ROIs’ votes, and such narrow margins could lead to failure in depression diagnosis. Moreover, our modification using overlapping cut-off points for the HAM-D provides fault tolerance in the depression symptom prediction network. However, further studies including participants with a larger variance in depression symptoms are warranted. Third, we only used resting-state fMRI data. Future studies incorporating multi-modal imaging measures may yield superior model performance (Patel et al., 2015). One study did attempt to employ 3D convolutional and recurrent neural networks on structural and functional MRI data; however, the area under the receiver operating characteristic curve reached only 0.73 (Pominova et al., 2018). To date, numerous machine learning approaches have been applied extensively to neuroimaging data to classify depression (Gao et al., 2018), and resting-state fMRI appears to provide the highest accuracy rate compared with other neuroimaging modalities. Last but not least, we note the group differences in education level and antidepressant use in the LLD group. Speculatively, integrating these variables into the model may improve its performance. However, the aim of our study was to develop an automatic classification model based solely on resting-state fMRI data without relying on demographic or clinical variables. Also, it has been found that antidepressant treatment can obscure or reverse the brain changes due to depression (Zhuo et al., 2019). Were it not for the continuing antidepressant use in the LLD group, our proposed CSE-based CNN model might have provided more salient features with a higher accuracy rate. Nevertheless, future validation is warranted in treatment-naïve or medication-free older adults with LLD.

Conclusion

In conclusion, our results highlight the importance of including both the temporal (i.e., multiscale entropy analysis) and spatial (i.e., the 3D approach) features of fMRI data to construct an effective model. Our machine learning approach for neuroimaging data provides new insights into the neurobiology of LLD.