Introduction

Major depressive disorder (MDD) is predicted to be the leading cause of disability in high-income countries by the year 20301. It is important to understand the early-stage abnormalities of MDD in the processing and regulation of emotions. Widely-accepted models suggest that MDD is underpinned by structural and functional abnormalities in multiple neuronal circuits, such as the fronto-limbic circuitry2 and the default mode network (DMN)3, and are generally supported by evidence from neuroimaging, notably magnetic resonance imaging (MRI).

There are several MRI analytic approaches to quantifying structural abnormalities, including traditional hand-drawn regions of interest (ROIs) and whole-brain morphometrics. The ROI method has substantial anatomical validity, but has two major limitations: it is time-consuming and vulnerable to ROI selection bias4. There are two whole-brain analytic methods for quantifying structural abnormalities: voxel-based morphometry (VBM) and vertex-based morphometry. Vertex-based morphometry is often applied to cortex thickness. In the present work we wished to focus on GM volume, for which VBM is well-suited. VBM is an automated whole-brain technique which calculates local concentrations of GM in an unbiased way without a priori specification of ROIs5. GM volume reduction has been reported in anterior cingulate cortex (ACC)6, orbitofrontal cortex (OFC)7, 8, and dorsolateral prefrontal cortex (DLPFC)6, 9, which are all prefrontal regions involved in the automatic regulation of emotional behavior10. Structural alterations have also been reported in hippocampus11, 12 and amygdala9, the key regions of the limbic system theoretically identified as important in mood regulation in the pathophysiology of MDD. Additionally, a GM abnormality in the cerebellum13, which is involved in cognitive processing, has been reported in MDD14.

MRI methods for identifying functional abnormalities are broadly of two kinds: resting-state functional MRI and task-based functional MRI. Task-based functional MRI maps specific brain regions recruited during a target-detection task designed to evaluate responses to target stimuli15. However, task-related changes in neural activation represent only a small fraction of the brain’s total activity16, 17, because intrinsic activation is energetically more costly than responses to external stimuli18. Knowing how the brain allocates the majority of its resources is therefore essential for understanding neural mechanisms associated with MDD. In addition, there is a lack of agreement about task paradigms19. Resting-state MRI analytic methods to define the local features of the spontaneous BOLD signal include amplitude of low-frequency fluctuation (ALFF) and regional homogeneity (ReHo): ALFF quantifies the intensity of low-frequency oscillations in spontaneous neural activity20, 21 while ReHo reflects the statistical similarity of spontaneous neural activity among spatially adjacent brain tissues22. Because of its physiological correlates23, ALFF is a more direct index of regional spontaneous neuronal activity, and can be used to locate specific impaired brain regions24, 25. ALFF also helps to avoid the potential bias induced by selection of the ‘seed’ voxels or the number of components in resting-state functional connectivity analysis24, 26 such as graph theory, ROI-to-ROI matrix analysis, seed-to-voxel analysis and independent component analysis. Resting-state studies in MDD have reported increased ALFF in the frontal cortex, including ACC, OFC27, 28 and posterior cingulate cortex/precuneus29, as well as the fusiform gyrus29,30,31 and lingual gyrus30, 32, which have been thought to reflect the excessive self-referential processing of MDD. Decreased ALFF has been reported in the cerebellar hemispheres33, 34 and superior temporal gyrus32, 35, and this has been linked to deficits in cognitive control of emotional processing. Reduced ALFF in the OFC has also been reported in MDD31, 36.

To date, volumetric and resting-state functional differences have been inconsistent and are poorly replicated for some brain regions. This is partly explained by considerable variation between studies in sample size (limiting the power to detect subtle brain differences and yielding both false-positive and false-negative findings37), in patients’ demographic and clinical characteristics, and in imaging protocols. We aimed to conduct a whole-brain voxel-wise meta–analysis to explore in a preliminary way the most robust findings across a range of published VBM and ALFF studies. Furthermore, to report on multimodally affected brain regions (the frontal-limbic regions where both structural and functional alterations have been reported), we performed an additional meta-analysis to display abnormalities in both VBM and ALFF in a single map. There are, we suggest, two ways to look at this analysis. On the working assumption that both modalities reflect a common pathophysiology, it is important to know that putatively-affected brain regions show conjoint abnormalities of both GM and brain function in MDD. The most plausible interpretation is that functional abnormalities observed in MDD are mediated by the underlying structural abnormalities38, potentially by complementary mechanisms where the direction of structural and functional changes are opposite. Alternatively, clearly dissociated abnormalities may throw an interesting light on pathophysiology. One such study applied VBM and ALFF together in drug-naive MDD, finding decreased GM in the parietal-temporal regions and decreased ALFF in the temporal regions and cerebellum39; however, no overlap was observed in the same template39. One reason for such failure to detect conjoint GM and brain function abnormalities might be that, hypothetically, structural damage in one region might cause functional abnormality in another. Alternatively, it might be simply an artifact of the relatively small sample size. In either case, a multimodal meta-analysis approach to identifying conjoint abnormalities from VBM studies of GM volume and ALFF studies of brain activity in MDD should be illuminating.

Other factors may contribute to the variability among MRI results in MDD. Antidepressant medication might increase heterogeneity and limit the interpretability and generalizability of the results, especially in the light of evidence that drugs may have important effects, such as upregulating neurotrophin expression40, altering neuronal remodeling41 and protecting against GM loss42, 43, in both animal and human studies44,45,46. In addition, studies on the course of the illness have reported brain structure and function differences between patients with first-episode (FE) and recurrent depression. For instance, compared with patients with recurrent MDD and with healthy controls, patients with FE depression showed increased amygdala volume47. However, depressed subjects with multiple depressive episodes showed hippocampal volume reductions which were not found in FE patients48. In view of this we restricted our analysis to FE and drug-naive MDD patients to eliminate interference by repeated episodes and antidepressant treatment.

In the present meta-analysis, we provide an up-to-date quantitative summary of studies investigating GM and ALFF abnormalities in FE drug-naive MDD patients, using Seed-based d Mapping (formerly “Signed Differential Mapping”) (SDM)49, a new version of effect-size signed differential mapping (ES-SDM)50 which has previously been applied to e.g. studies of dementia with Lewy bodies51, childhood maltreatment52, alcohol dependence53, and migraine54. Furthermore, we used a multimodal meta-analytical method integrated into SDM which enables combination of the results of the separate meta-analyses conducted from studies using different modalities to detect brain regions which display both structural and functional abnormalities55; this has previously been applied to studies of subjects at familial high risk for schizophrenia56, obsessive-compulsive disorder57 and FE psychosis50, but this seems to be its first application in MDD.

In brief, we conducted separate meta-analyses of VBM studies and ALFF studies on FE drug-naive MDD, followed by a multimodal meta-analysis of VBM studies and ALFF studies on FE drug-naive MDD to determine whether individuals exhibit brain regions with both structural and functional abnormalities.

Results

Included studies and sample characteristics

Figure 1 shows a flow diagram of the identification and attrition of studies. The search strategy identified 2124 structural and 786 functional neuroimaging studies. Though we did not apply any language restriction in the search, all the abstracts yielded were in English; any articles in other languages were translated into English or Chinese for assessment. Of the 38 studies which met our inclusion criteria, we excluded 12 which reanalyzed previously published data, 1 in which coordinates were unavailable, and 2 which reported VBM and ALFF results synchronously, leaving 23 peer-reviewed and published original studies10, 11, 19, 30, 33,34,35,36, 39, 58,59,60,61,62,63,64,65,66,67,68,69,70,71. One of the ALFF studies analysed two different subgroups of MDD patients, namely early treatment responsive and nonresponsive patients33, both compared with the same HC group; each subgroup comparison was included as a data set in the present meta-analysis. Specifically, the structural meta-analysis included 15 data sets of 15 VBM studies with 471 FE drug-naive MDD subjects (194/277 male/female; mean age 34.0 years), matched with 521 controls (226/295 male/female; mean age 33.6 years); the resting state functional imaging meta-analysis included 11 data sets of 10 ALFF studies with a total of 261 FE drug-naive MDD subjects (126/135 male/female; mean age 31.9 years), matched with 278 controls (138/140 male/female; mean age 30.7 years). Table 1 and Table 2 summarise clinical and demographic data and technique details from all the included studies. The quality scores ranged from 9 to 12 (mean score 11.1), showing that these studies were of high quality. In no study was there any significant difference in age and sex between the MDD group and the HC.

Figure 1
figure 1

Meta-analysis of voxel-based morphometry and amplitude of low-frequency fluctuation studies in first-episode drug-naive patients with major depressive disorder. Study selection was done according to “Preferred reporting items for systematic reviews and meta-analysis” (PRISMA) guidelines. Abbreviations: ALFF, amplitude of low-frequency fluctuation; FE, first-episode; HC, healthy control; N, number; MDD, major depressive disorder; VBM, voxel-based morphometry.

Table 1 Demographic and clinical characteristics of subjects in the 23 voxel-based morphometry data sets included in the meta-analysis.
Table 2 Technique details of VBM and ALFF studies on MDD in meta-analysis.

Changes in regional grey matter

A group comparison of FE drug-naive MDD patients with HC across the 15 data sets in the main meta-analysis of VBM studies revealed decreased GM relative to controls in right DLPFC, right supplementary motor area (SMA), and right inferior temporal gyrus (ITG) extending to fusiform gyrus, and increased GM relative to controls in the right insula extending to putamen and striatum, left lateral OFC, left temporal pole (TP), and bilateral thalamus (Table 3 and Fig. 2).

Table 3 Regions of significant differences in GM and brain activity between patients with major depressive disorder and healthy controls.
Figure 2
figure 2

Areas of increased (red) and decreased (blue) grey matter or resting-state brain activity in first-episode drug-naive with major depressive disorder compared with healthy controls in the meta-analyses of (A) voxel-based morphometry studies and (B) amplitude of low-frequency fluctuation studies. Abbreviations: DLPFC, dorsal lateral prefrontal cortex; ITG, inferior temporal gyrus; L, left; PHG, parahippocampal gyrus; R, right; SMA, supplementary motor area; TP, temporal pole.

In whole-brain jackknife sensitivity analysis the findings of decreased GM in MDD patients in the right DLPFC, right SMA, and right ITG remained significant in all but 1 combination, and increased GM in the right insula, left lateral OFC, and left STG in all but 2 combinations. The right thalamus and left thalamus remained significant in all but 4 and 5 combinations, respectively (Table 3).

In analysis of heterogeneity the right insula and left TP with increased GM showed significant statistical heterogeneity between studies (p < 0.005), while the remaining regions with altered GM did not show significant between-study heterogeneity (p > 0.005) (see Supplementary Table S2).

In analysis of publication bias, the Egger test was significant in the right SMA (P = 0.034) and right ITG (P = 0.025) but not for right DLPFC (P = 0.051), right ITG (P = 0.112), right insula (P = 0.371), left lateral OFC (P = 0.645), left TP (P = 0.641), left thalamus (P = 0.971) or right thalamus (P = 0.936) in the VBM metaanalysis.

Changes in resting state regional brain activity

The main meta-analysis of the ALFF studies on MDD patients showed significantly enhanced brain activities in the bilateral SMA and left PHG extending to hippocampus and attenuated brain activities in the bilateral lateral OFC (Table 3 and Fig. 2).

In whole-brain jackknife sensitivity analysis the findings of attenuated brain activity in the bilateral lateral OFC and SMA were highly replicable, being preserved throughout all but 1 combinations of the data sets. The results in left PGH remained significant in all but 2 combinations (Table 3).

In analysis of heterogeneity, the regions with altered brain activities did not showed significant statistical heterogeneity between studies (p > 0.005).

In analysis of publication bias, the Egger test was significant for left SMA (P = 0.031), right SMA (P = 0.027) and right PGH (P = 0.031), but not for the left lateral OFC (P = 0.870) and right lateral OFC (P = 0.928) with decreased brain activities in the brain activity meta-analysis.

Multimodal analysis of grey matter and brain activity

The results were then summarised by putting structural and functional findings in a single meta-analytic map to demonstrate regions which showed both structural and functional abnormalities. This revealed increased GM with decreased brain activity in the left lateral OFC, and decreased GM with increased brain activity in the right SMA (Table 3 and Fig. 3). The left lateral OFC finding was observed in the two separate structural and functional meta-analyses and preserved in the jackknife sensitive analyses, and was without publication bias or heterogeneity. The right SMA finding was observed in the two separate structural and functional meta-analyses and preserved in the jackknife sensitive analyses, and was without heterogeneity, but failed in both publication bias analyses.

Figure 3
figure 3

Multimodal meta-analysis reveals (A) increased grey matter with decreased brain activity in the left OFC and (B) Decreased GM with increased brain activity in the right SMA in first-episode drug-naive patients with MDD compared with healthy controls. Abbreviations: L, left; MDD, major depressive disorder; R, right; SMA, supplementary motor area.

Subgroup meta-analyses

Details of the results of subgroup analysis are presented in Supplementary Materials.

Discussion

Aims and strengths of the study

This is to our knowledge the first multimodal neuroimaging meta-analysis which attempts to localise the neural substrates of MDD by combining information from whole-brain VBM studies investigating GM with ALFF studies of spontaneous brain activity.

Methodological strengths are the novel techniques combining features from coordinate meta-analytic approaches and standard meta-analytic methods, and the multimodal approach. The restriction to FE MDD patients helps distinguish the intrinsic brain features of the disease from potential effects of episode times, and the restriction to drug-naive MDD patients minimises the interference from medication effects.

Both anatomical and functional brain abnormalities in MDD were observed, characterised by decreased GM mainly localizing in the right DLPFC, right SMA, and right ITG extending to the fusiform gyrus, and increased GM in the right insula extending to putamen and striatum, left lateral OFC, left TP, and bilateral thalamus, along with increased brain activity in the bilateral SMA and left PHG extending to hippocampus, and decreased brain activity in the bilateral lateral OFC. Because of the publication bias in the right SMA and right ITG findings of the VBM study, as well as in the bilateral SMA and right PHG findings of the ALFF study, these results should be interpreted with some caution. The multimodal meta-analysis identified conjoint structural and functional differences in the left lateral OFC and right SMA in MDD.

These main findings can be thought of in terms of three circuits (DLPFC-striatum-thalamus circuit, lateral OFC-striatum-thalamus circuit, and SMA-striatum-thalamus circuit) broadly representing emotion, cognition and motor dysregulation respectively70.

Grey matter abnormalities in MDD

The most prominent finding was decreased GM in the right DLPFC, part of the central executive network which plays an important role in working memory and attention72, and is related to impaired cognitive function in FE drug-naive patients with MDD. This GM loss may reflect the reductions in glial cell density and neuronal size in the prefrontal cortex reported in postmortem studies73, 74. As the lateral prefrontal lobe is a well-known neural substrate involved in the pathophysiology of MDD, being associated with cognitive dysfunction75, 76, the GM loss in this region may be causally important. Furthermore, repetitive transcranial magnetic stimulation of the DLPFC is an established treatment for depression77. In rat models of depression, a lower expression of synaptic-function-related genes and correspondingly reduced number of synapses in the DLPFC has been reported78, and a similar phenomenon might underlie the decreased DLPFC volume in MDD.

The striatum has been associated with mood, cognitive processes and movement regulation79 and has connections with the DLPFC, OFC, SAM and temporal lobe70. The GM deficit in the putamen may contribute causally to the symptoms of MDD80, 81.

The thalamus is a complex structure, associated with the experience and expression of emotion in mood disorders12. We found symmetrical increased GM in thalamus in FE drug-naive MDD patients, consistent with a postmortem report of elevated neuron number in thalamus82, and also with a previous study in which increased GM in thalamus was related to pre-apoptotic osmotic changes or hypertrophy in FE drug-naive MDD patients68. However, structural studies and a previous meta-analysis have reported decreased GM in the thalamus83, 84. A possible explanation for this discrepancy may be that the latter enrolled data sets with a different course of illness or number of episodes. Consequently, we speculate that the increased volume of bilateral thalamus may be involved in the early stage of MDD, and is not likely to be the result of medication exposure.

MDD patients also showed increased GM in the right insula and left TP. The insula has extensive connections to several areas of the cortex and limbic system implicated in monitoring interceptive awareness85, 86, high-level cognitive control and attentional processes87. Increased insular activation to facial expressions of disgust in MDD may reflect an emotion processing bias88. Contrary to our finding, several studies reported reduced GM in insula25, 89, 90, and so this may be an effect of recurrent episodes91. Furthermore, increased GM in insula could be interpreted as resulting from neuroinflammation92: there is significantly elevated translocator protein density, an important aspect of neuroinflammation, in insula during a major depressive episode92. The TP is a visual and auditory-related brain region implicated in the processing of working memory and facial emotions93. Our findings are in line with previous reports of morphological alteration in the TP, an apparently early sign of MDD unlikely to result from treatment with antidepressants94, 95. Moreover, longitudinal MRI studies have reported progressive GM loss in the temporal lobe in MDD patients95, 96. Accordingly, the increased GM in right insula, bilateral thalamus and left temporal lobe might represent a specific character of early-stage MDD.

Regional brain activity abnormalities in MDD

The meta-analysis of ALFF studies found increased spontaneous brain activity in the left PHG extending to hippocampus. Both structures belong to the limbic system and play a central role in regulation of emotions, motivation, memory, affective dimension of pain6, 21, 23 and cognitive processes in MDD10. Surprisingly, no differences in GM volumes were detected in the hippocampus despite the fact that a variety of studies have reported abnormalities in that region12, 83. Possible reasons for this are differences in illness duration, medication status, age of onset and the number of episode. The volume deficit of hippocampus reportedly correlates with illness duration9, 84, and decreased hippocampus volume was detected in patients with long illness duration compared with short duration60. A newly-published meta-analysis suggests that the lower hippocampus volume is associated with the number of episodes, whilst no difference was detected between FE MDD patients and controls97. In our meta-analysis, all patients were FE and drug-naive, and most of studies were of short duration, which may explain why we found no structural hippocampus abnormality.

Subgroup meta-analyses

In addition to the results in the pooled meta-analysis, we found increased ALFF in the left posterior cingulate gyrus and right precuneus in subgroup meta-analyses of studies with large sample size. These regions belong to the DMN, which has a role in the balance between processing of external stimuli and internal and self-directed processing, which has long been thought to be involved in the pathophysiology of MDD. However, they failed in the pooled ALFF meta-analysis. This suggests that the detection of changes in DMN may be influenced by sample size; larger studies are needed to confirm this finding.

Conjoint abnormalities in grey matter and brain activity in MDD

The multimodal meta-analysis identified conjoint structural and functional differences in the left pars orbitalis (increased GM with decreased brain activity), which is a part of the inferior frontal gyrus (Brodmann area 47), belonging to lateral OFC. The OFC, being important parts of the affective network, are involved in the emotional processing of mental states98, 99. However, there is a difference between the medial part and lateral part, processing negative and positive emotion separately71. This is reconcilable with the conceptualization of MDD as a disorder of emotion regulation100, 101. However, a number of studies have found decreased GM in this region7, 102, corroborated by a previous meta-analysis of volumetric MRI studies79. It should be noted that most previous studies included patients on antidepressant treatment, while we included only studies of drug-naive patients. We hypothesise that increased GM may be related to temporal hypertrophy103, marking areas of early neuronal pathology without the confounding factors of repeat episodes and treatment. One study has reported volume being larger at illness onset, and then declining with multiple episodes or treatment in mood disorder103. Regarding brain activity, the presence of anxiety symptoms of MDD is reportedly associated with decreased OFC activation104. In MDD the severity of depression correlates negatively with activity in the left lateral OFC105. Reduced baseline resting state connectivity within the orbitofrontal component was predictive of clinical response in medication-free MDD patients106, 107. Thus, the imbalance between structure and brain activity may represent a distinctive alteration of FE drug-naive MDD patients.

In addition, conjoint structural and functional differences were found in the right SMA (decreased GM with increased brain activity), which forms part of the SMA-striatum-thalamus circuit. This is traditionally considered the cortical area necessary for voluntary movement as well as implicated in psychomotor retardation81, the key feature of MDD, but it also participates in cognitive activities such as working memory108, implicit learning ability109, and attention and executive function110. As most of these are impaired in MDD, it is tempting to infer a causal link. Our finding was in accord with previous studies relating the reduced regional volumes of the right SMA to psychomotor retardation in early-onset depression patients60, 109. If a primary decrease of GM volume in SMA were accompanied by a compensatory hyperfunctionality of the remaining GM, involving higher regional cerebral metabolism111 and cerebral blood flow112, this would likely increase local ALFF. Conversely, primary hyperfunction might lead to a decrease in GM by glutamate-induced ‘excitotoxicity’96. Of particular importance, a previous review indicated that the reduced GM volume in some structures may produce partial volume effects in functional images19. For instance, MDD subjects relative to controls show metabolic activity that appears reduced in the subgenual prefrontal cortex113. However, when this anatomical deficit is taken into account by correcting the metabolic data for the partial volume averaging effect associated with the corresponding GM reduction, metabolism instead appears increased in the subgenual prefrontal cortex in the unmedicated-depressed patients114. Whatever the pathophysiology, the regions identified by the multimodal meta-analysis could serve as a specific ROIs template for both individual postmortem histopathological and in vivo imaging studies.

Limitations

Firstly, combining numerous potentially underpowered studies with SDM meta-analysis using peak voxels, as we have done, may not reveal subtle widespread changes which are undetected by individual studies. A traditional meta-analysis, or an SDM meta-analysis using raw SPM images, gains much of its advantage by pooling raw data, including nonsignificant results, from all studies to increase power115. Our SDM meta-analysis using only significant co-ordinates as input is actually a tool for spatial integration of already-significant results. It is possible, given the hypothesis of subtle GM or brain activity abnormalities, that some areas of altered GM volume or brain activity do not reach significance in smaller studies but would prove significant if raw SPM maps were combined for SDM analysis.

Secondly, we discussed the findings that were not significant in the multimodal analysis as dissociated abnormalities in grey matter and brain activity in MDD. This dissociated distribution may not represent the real pattern of MDD abnormalities, because it may rather be due to failure to reach statistical significance in the multimodal analysis.

In the Egger test, we found publication bias in right SMA and right ITG in the VBM analysis, and in bilateral SMA and right PGH in the ALFF analysis, so another important limitation is the possibility of selective positive reporting and publication bias.

Finally, most of the primary studies have been so far conducted in China, thus limiting the generalizability of the current findings to other populations.

Summary

The present meta-analysis revealed a complex pattern of neural abnormalities in first-episode drug-naive MDD patients, characterised by conjoint and dissociated structural and functional brain abnormalities in brain regions involved in motor, cognition and emotional processing. These volumetric and functional alterations support the notion that multiple parallel basal ganglia-thalamocortical circuits70, together with other limbic regions (parahippocampus, hippocampus) contribute to the underlying pathophysiology of early-stage MDD. First-episode drug naive MDD patients showed increase in GM as well as decrease in brain activity in the left lateral OFC, and decrease in GM as well as increase in brain activity in right SMA, which could therefore serve as a specific ROI template for future studies. Of note, this study adds to Psychoradiology (https://radiopaedia.org/articles/psychoradiology), an emerging subspecialty of radiology, which seems primed to play a major clinical role in guiding diagnostic and treatment planning decisions in patients with mental disorder116, 117.

Methods

Study selection

Meta-analysis was conducted according to the Preferred Reporting Items for Systematic reviews and Meta-Analyses guidelines (PRISMA)118. A systematic strategy was used to search for relevant studies published in PubMed, Embase, Web of Science, Science Direct and Google Scholar. Candidate structural imaging studies were sought using the keywords “depression” or “depressive” or “unipolar depression” or “major depression” or “major depressive disorder” or “MDD” plus “voxel-based morphometry” or “VBM” or “voxel*” or “morphometry”. Resting state functional imaging studies were sought using the keywords “depression” or “depressive” or “unipolar depression” or “major depression” or “major depressive disorder” or “MDD” plus “amplitude of low-frequency fluctuation” or “ALFF” or “low-frequency fluctuation” or “LFF”. The search was conducted up to July 2016, with no time-span specified for date of publication. Language of publication was not a specific search criterion. The reference lists of these studies were checked to identify further studies for inclusion.

Structural neuroimaging studies were included according to the following criteria: 1) used VBM to analyze whole-brain GM changes in adult (age range 18 to 60 years) MDD patients, to minimise the effect of neurodevelopment and neurodegeneration as potential confounders; 2) compared MDD patients with healthy control (HC) subjects; 3) investigated first-episode and drug naive MDD patients, who had never received antidepressant medications before MRI scanning. Functional studies were included according to the following criteria: 1) used ALFF to analyze whole-brain resting state brain activity in adult MDD patients; 2) compared MDD patients with HC; 3) investigated first-episode and drug naive MDD patients. We excluded: 1) studies from which peak coordinates could not be retrieved from the published article or after contacting the authors; 2) studies in which different thresholds were used in different regions of the brain; 3) findings based on ROIs. For studies where multiple independent patient samples were compared with HC, the appropriate coordinates were included as separate data sets. For studies using overlapping samples, the study with the most subjects was included.

Three authors (W.N.W., Y.J.Z and X.Y.H.) independently conducted the literature search. The results were compared, any inconsistencies were discussed, and a consensus decision was obtained.

Quality assessment

We assessed the quality of the included studies using a 12-point checklist that focused on both the clinical and demographic aspects of individual study samples and on the imaging methodology. The checklist was based on previous meta-analytic studies91, 119, and included structural measures from MRI, modified to reflect critical variables that are important to assess VBM studies56 and resting state fMRI studies. This assessment included the quality of the diagnostic procedures, the demographic and clinical characterization, the prospective (or otherwise) nature of the patient and control studies, the sample size, the MRI acquisition parameters, the analysis technique and the quality of the reported results (see Supplementary Table S1). Although this checklist was not designed as an assessment tool, it provides an objective index of the rigor of individual studies. The quality scores are presented in Table 1.

Recorded variables

For each included study we recorded: sample size, gender and mean age of subjects; illness duration, depression symptom severity and mean number of episodes; drug status; the statistical threshold of the main findings, and the method employed to correct whole-brain results for multiple comparisons. These data are presented in Tables 1 and 2.

Standard meta-analysis of structural abnormalities

Separate voxel-based meta-analysis of regional GM abnormalities was conducted using the SDM software package57 (www.sdmproject.com), which implements a refinement of methods50, 120 which have been applied to neuroimaging studies of neurological and psychiatric disorders such as Alzheimer’s disease34, Attention-deficit/Hyperactivity Disorder121, late-life depression56 and MDD58. SDM uses the reported peak coordinates and effect sizes to recreate, based on the spatial correlation between neighbouring voxels, brain maps of the effect size of the GM differences between patient and comparison subjects, and accounts for sample size and variance as well as between-study heterogeneity. The SDM methods have been described in detail elsewhere50, 120, so we merely summarise the main features here. First, peak coordinates and effect sizes (derived, for example, from t values) of GM differences between MDD individuals and comparison subjects were extracted from each study. Any peaks not statistically significant at the whole brain level were excluded; thus, while different studies may employ different thresholds, we ensured that in each study the same statistical threshold was used throughout the brain. This avoids bias toward liberally threshold brain regions, which is common for ROIs. Second, a standard Montreal Neurological Institute map of the differences in GM was separately recreated for each study by means of an anisotropic Gaussian kernel, which assigns higher effect sizes to the voxels more correlated with peaks. This has been found to optimise the recreation of the effect size maps, and is robust because it does not depend on a full width at half-maximum49. Third, a map of the effect size variance was derived for each study from its effect size map and its sample size. Fourth, the mean map was obtained by voxel-wise calculation of the random-effects mean of the study maps, weighted by the sample size and variance of each study and the between-study heterogeneity50. Details of the effect size are presented in the online Supplementary Materials.

Considering possible methodological differences between the studies, we then performed subgroup meta-analyses included studies with large sample size (n > 30), studies with small sample size (n < 30), studies that utilized 1.5 T and 3.0 T MRI, studies with a correction for multiple comparisons or not, and patients with short duration (less than 6 months).

The main analysis was complemented with three analyses of robustness to ensure that only the most replicable and robust of the results were retained. First, a jackknife sensitivity analysis was performed, systematically repeating the meta-analyses excluding one study at a time: if a region remains significant in all or most of these combinations of studies, this finding is deemed highly replicable120. Second, a random-effects model with Q statistics was used to detect the statistical (between-studies) heterogeneity of individual clusters. Third, Egger tests were used to assess publication bias.

Standard meta-analysis of resting state functional abnormalities

The separate main meta-analysis and the analyses of robustness of regional resting state brain activity were methodologically identical to those of regional GM.

Multimodal Meta-Analysis

Finally, the meta-analyses of regional GM and resting state functional abnormalities were combined in order to detect those brain regions showing differences in both imaging modalities. We followed the approach described in Radua et al.55, which aims to obtain the overlap between the abnormal regions in the two modalities. In the current meta-analysis, the multimodal meta-analysis is used to detect those brain regions which display both structural and functional abnormalities. However, the exact relationship between these changes cannot be defined further.