Introduction

Understanding how genetic variability influences brain structure in adolescence may provide insights into the pathophysiology and biological constructs that impact normal brain development, cognition, and the development of neuropsychiatric disorders. Brain structures vary in heritability, suggesting that different genes may influence the structural development of different areas (Bis et al. 2012; den Braber et al. 2013; Stein et al. 2012) and the genetic influences may vary over time (Brouwer et al. 2017). Subcortical regions of the brain are associated with neuropsychiatric disorders including depression (Koolschijn et al. 2009), anxiety (Holzschneider and Mulert 2011), and schizophrenia (Shepherd et al. 2012), as well as intellectual functioning (Bohlken et al. 2014).

Studies, predominantly in adult populations, have investigated the heritability of subcortical volumes of anatomic structures in the brain (den Braber et al. 2013; Kremen et al. 2010). Hippocampal and intracranial volume have been associated with genes located on chromosome 12q24 (Bis et al. 2012; Stein et al. 2012; Yoon et al. 2011). A meta-analysis of twin studies reported high heritability from 0.52 for the right thalamus to 0.82 for the right putamen volume, but with a relatively small sample size and wide confidence intervals of 0.4–0.8 for the right thalamus (Blokland et al. 2012). Another twin study that looked at subcortical volumes over a 5 year period also found high heritability for the thalamus, caudate, and putamen (den Braber et al. 2013). Other studies on adult twins have demonstrated high heritability of total intracranial brain volume (Posthuma et al. 2000; Thompson et al. 2001).

The brain undergoes rapid development in the first two years of life and nears adult size by early childhood at around age six (Jansen et al. 2015), but more dynamic changes in cortical and subcortical regions involving gray and white matter continue to occur into adolescence and young adulthood (Lenroot and Giedd 2006). Prior work in neurodevelopment has suggested greater genetic heritability earlier in childhood for primary cortices (motor, somatosensory), and more heritability in later adolescence for structures unique to the human species such as the dorsolateral prefrontal cortex (Lenroot and Giedd 2006).

There are few reports on adolescents with respect to heritability of subcortical volumes. A study of twins scanned at ages 9 and 12 showed high heritability for intracranial volume, gray matter, and white matter volume from 0.77 to 0.91 (Peper et al. 2009). In another study of 8 year-olds, heritability ranged from 0.57 to 0.79 for gray matter, white matter, corpus callosum, and total cerebrum, but with higher heritability in the left cerebral hemisphere (Yoon et al. 2011). A larger study of 326 twins resulted in heritability measures of 0.65 for corpus callosum, 0.64 for basal ganglia, and 0.42 for the thalamus, but lower for the ventricles (0.17) and cerebellum (0.24) (Schmitt et al. 2007).

Phenotypic neurodevelopmental trajectory research shows that there are significant patterns of age-related change in subcortical volumes across adolescence (Herting et al. 2018); however, it is unclear which etiological influences underlying the phenotypes remain stable. Longitudinal twin research allows for testing whether the genetic and environmental influences on subcortical structure are stable in adolescence, or if there are new genetic influences that emerge. Anomalies in brain structure are a robust correlate to adolescent and adult psychopathology (Gurholt et al. 2022; Guyer 2020), and understanding the stable and emerging influences in brain development may provide insights into the relationship between genetics, brain development and mental health.

One previous longitudinal twin study found that subcortical volumes (thalamus, hippocampus, amygdala, putamen, caudate, pallidum, and nucleus accumbens) were highly heritable and entirely genetically stable from ages 9 to 12; however the sample size was limited (48 MZ pairs; 64 DZ pairs)(Swagerman et al. 2014).

ABCD study

While several studies have considered the heritability of the volumes of subcortical structures, very few have investigated their microstructure (Gillespie et al. 2017), which can be probed non-invasively using diffusion MRI. In this study, we leverage the Adolescent Brain Cognitive DevelopmentSM Study (ABCD Study) to investigate the heritability of both subcortical volume and microstructure in adolescents.

The ABCD study is a longitudinal study of brain development that includes extensive cognitive, behavioral, genetic and imaging data (Casey et al. 2018). The study is enriched for same sex twin pairs (Iacono et al. 2018), enabling the assessment of the genetic and environmental influences on a wide variety of traits over development. We use twin data to estimate the genetic and environmental influences on deep gray matter volume, assessed using conventional structural MRI, and microstructure, assessed using diffusion MRI.

The imaging battery included in ABCD, and repeated every two years, includes structural and diffusion MRI. Structural MRI scans have high (1 mm) spatial resolution, and sufficient gray/white matter contrast to enable segmentation of deep gray matter structures, and determination of their volumes. Diffusion MRI uses the motion of water molecules in the brain to provide information on the microscopic environment, and is sensitive to developmental changes related to, for example, myelination and cell density (Watts et al. 2003).

To our knowledge, this is the largest twin sample used to assess subcortical volume and microstructure heritability, as well as the first to longitudinally test the genetic and environmental contributions to subcortical microstructure in adolescence.

Data and methods

Subject selection

The ABCD data used in this report came from Curated Data Release 4.0. To maximize data consistency for the technically demanding diffusion acquisition, only data from twins scanned using a single MRI vendor (Siemens Healthineers AG) were included in the diffusion analysis. To maximize sample size, volumetric data for twins scanned on both Siemens and Philips (Philips HealthTech) scanners was included. Twin pairs were excluded if they had incomplete data, or if the structural and diffusion imaging data did not pass ABCD-recommended quality control, including a diffusion mean motion threshold of 1.5 mm. Subjects with a history of brain injury, cerebral palsy, muscular dystrophy, multiple sclerosis, or substance abuse were excluded. Demographic information for the two groups is given in Table 1.

Table 1 Demographics of participants (number of same-sex twin pairs) recruited and included in structural and diffusion analyses

MRI data acquisition

The MRI acquisition is described in more detail elsewhere (Casey et al. 2018). Briefly, structural images were acquired using a T1-weighted 3D MP-RAGE sequence with 1 mm isotropic spatial resolution, TE/TI/TR = 2.88/1060/2500ms, flip angle = 8°, sagittal field of view of 256 × 256 × 176 mm3. Real-time motion detection and correction using volumetric navigators was used to minimize the effects of head motion (Tisdall et al. 2012).

Diffusion-weighted images were acquired using a 2D spin-echo EPI sequence with b-values of 500 (6 directions), 1000 (15 directions), 2000 (15 directions), and 3000 (60 directions) s/mm2, as well as 6 reference acquisitions without diffusion weighting. Diffusion imaging was acquired with 1.7 mm isotropic spatial resolution, TE/TR = 88/4100ms, 81 slices with a multiband factor of 3, axial field of view of 240 × 240 mm2. Corresponding field maps were acquired to enable correction of EPI susceptibility distortions.

MRI data processing

Volumetric data

Imaging data was preprocessed as described previously (Hagler et al. 2019). Segmentation of subcortical structures was performed using FreeSurfer (Fischl et al. 2002), providing measures of the volume of each structure and masks for the aligned diffusion data. Subcortical volumes were normalized to the total intracranial volume.

Restriction spectrum imaging

A restriction spectrum imaging (RSI) (White et al. 2013) model was applied to the diffusion data to generate measures of the isotropic and directional restricted and hindered components of diffusion, as well as a free water component. The isotropic restricted component has been shown to be sensitive to tissue cellularity (White et al. 2013). Deep gray matter structures display low directional anisotropy, so we limited our analysis to the restricted normalized isotropic (RNI) component.

Diffusion tensor imaging

A conventional diffusion tensor imaging (DTI) model was also applied to generate measures of mean diffusivity (MD) to compare its sensitivity to the more sophisticated RSI model.

Zygosity status was genetically determined. All data was residualized to account for the effects of age, sex, and site.

Descriptive statistics

Intraclass correlations (ICCs) between MZ and DZ twin pairs were calculated as descriptive statistics to visualize the relative twin similarity for the phenotypes. Within subject Pearson correlations for phenotypes derived at baseline and year 2 were calculated to provide a proxy measure of the test-retest reliability of the imaging metrics. The relationship between the estimated heritability for each region/metric and the within-subject stability was calculated as a Pearson correlation.

Assumptions of the ACE model

The analysis presented is based on the ACE model. MZ twins share 100% of their genetic information, while DZ twins share approximately 50%. Additive genetic effects imply that the expected genetically-based correlation between DZ twins should be half that of MZ twins. Non-linear effects, such as those caused by genetic dominance, are not included in the model. The ACE model also incorporates the equal environments assumption (EEA), that the effect of the common environment is the same for MZ and DZ twins. Violation of this assumption would be expected to result in greater similarity of MZ twins relative to DZ twins, increasing the apparent heritability. The unique environment term models uncorrelated differences between twin pairs, including differences caused by measurement error.

Structural equation modeling

Structural Equation Modelling was used to assess the additive genetic (A), common environment (C), and unique environmental (E) contributions to tissue microstructure using the OpenMx (Boker et al. 2011) package in R (R version 3.6.1, R Core Team). A direct variance ACE model was used (Verhulst et al. 2019). Correlated factors models were employed to assess the stability of the variance components by decomposing the cross-time covariance of the trait into 3 latent sources of contributions (rA, rC, or rE)(Loehlin 1996). Likelihood-based confidence intervals are provided to account for the non-normal distribution of the variance components (Neale and Miller 1997). If the estimates’ confidence interval does not include 1.00, this is suggestive of novel etiological influences. Model fit was assessed using the criteria of root-mean-square error (RMSEA) < 0.06 and a Comparative Fit Index (CFI) > 0.95 (Hu and Bentler 1998). Chi-square difference tests were used to assess significance (p < 0.05) of the etiological correlations to account for the standard error invariance to model parameterization in genetic models (Neale et al. 1989).

Results

Subjects

The demographics of participants recruited and included in structural and diffusion analyses are shown in Table 1. For the volumetric measures at baseline, 284 MZ and 369 DZ twin pairs had acceptable quality data and were included in the analysis. For diffusion analysis, 148 MZ and 212 DZ twin pairs were included at baseline. At two-year follow-up the corresponding numbers were 207/247 and 134/158 for volume and diffusion respectively. The MZ and DZ groups did not differ significantly in age, sex, household income, parental education, or race/ethnicity (chi-squared test, p > 0.05).

Baseline results

Intraclass correlations (ICCs) for MZ and DZ twin pairs for each region and metric at baseline are shown in Fig. 1. For MZ twins, ICCs ranged from 0.610 (caudate) to 0.867 (pallidum) for RNI, 0.347 (hippocampus) to 0.673 (putamen) for MD, and 0.642 (nucleus accumbens) to 0.910 (brain stem) for volumes. With the exception of MD in the hippocampus and brainstem, the ICCs for DZ twins were significantly less than those from MZ twins (two sample t-test, p < 0.05).

Fig. 1
figure 1

Intraclass correlation (ICC) between monozygotic (MZ) and dizygotic (DZ) twins for restricted normalized isotropic (RNI, top) signal, mean diffusivity (MD, middle), and volumes (bottom) of subcortical regions at baseline

Structural equation modelling estimates of the additive genetic, common, and unique environmental contributions to each region and metric are shown in Table 2; Fig. 2 (top). The model fit statistics can be found in Supplementary Tables 1–3. Heritability estimates (estimate, 95% confidence interval) for RNI at baseline in the pallidum (0.859, 0.818–0.889), putamen (0.859, 0.819–0.889), and thalamus (0.855, 0.814–0.887) were especially high. For MD, heritability was generally lower than for RNI, but highest in the putamen (0.704, 0.622–0.768), pallidum (0.672, 0.596–0.734), and thalamus (0.670, 0.573–0.744). For structure volumes, the putamen (0.906, 0.875–0.914) and caudate (0.831, 0.688–0.992) were found to be highly heritable.

Table 2 Univariate variance component estimates from the correlated factor models
Fig. 2
figure 2

Additive genetic (A), common environmental (C), and unique environmental contributions to restricted normalized isotropic (RNI, left) signal, mean diffusivity (MD, middle), and volumes (right) in subcortical regions at baseline (top) and two-year follow-up (bottom). Negative estimates for contributions to MD in nucleus accumbens have been replaced with zeros for display

Two-year follow-up results

The contributions to each region and metric are shown in Table 2; Fig. 2 (bottom). The model fit statistics can be found in Supplementary Tables 1–3. Heritability estimates for RNI showed a similar pattern at two-year follow-up, with the pallidum (0.835, 0.787–0.871), putamen (0.874, 0.838–0.902) and thalamus (0.819, 0.767–0.857) again demonstrating the highest heritability. For MD, the amygdala, caudate, pallidum, putamen, and thalamus had similar heritabilities, with estimates in the range of 0.627 to 0.701. As at baseline, structure volumes were most highly heritable for the putamen (0.906, 0.885–0.923) and caudate (0.848, 0.701–1.011).

At two-year follow-up, significant (p < 0.05) contributions of the common environment to MD in the nucleus accumbens (0.465, 0.185–0.728) and the volume of the brainstem (0.225, 0.074–0.355) were observed.

Longitudinal model results

The correlated factors models assessed the stability of etiological influences on subcortical regions between measures at baseline and the two-year follow-up (Table 3). For RNI, the genetic correlations ranged from 0.743 (amygdala) to 0.970 (caudate). The hippocampus and caudate appear to be genetically stable across time (rA confidence interval includes 1), but the other phenotypes may have novel genetic influences (rA confidence interval do not include 1). The unique environmental correlations ranged from 0.183 (caudate) to 0.660 (amygdala). For MD, the genetic correlations ranged from 0.772 (ventral diencephalon) to 0.967 (amygdala). The confidence intervals for the brainstem, amygdala, hippocampus, pallidum, ventral diencephalon include 1.00, suggesting genetic stability across time. The rA and rC coefficients could not be estimated for the nucleus accumbens MD measure as there was not both a positive A or positive C variance component to correlate. The unique environmental correlations were − 0.038 (nucleus accumbens) to 0.527 (hippocampus). For volume, the genetic correlations were 0.865 (pallidum) to 1 (brainstem). The confidence intervals for the brainstem, nucleus accumbens, caudate, hippocampus, and ventral diencephalon included 1. The unique environmental correlations for the volumetric measures ranged from 0.292 (pallidum) to 0.610 (hippocampus).

Table 3 Correlations from the correlated factors model

Stability of measures and heritability estimates

The Pearson correlations between measures at baseline and two-year follow-up are shown in Fig. 3. For RNI, the correlations ranged from 0.717 (amygdala) to 0.879 (pallidum). For MD, the correlations were substantially lower, ranging from 0.388 (ventral diencephalon) to 0.751 (pallidum). The volume of the pallidum displayed the lowest correlation between time points (0.647), while the caudate was highest (0.931).

Fig. 3
figure 3

Pearson correlations between RSI, DTI and volumetric measures derived at baseline and two-year follow-up

Figure 4 shows that those measures that were stable between time points (r = 0.821–0.922, pallidum, putamen, thalamus for RNI, and brainstem, caudate, hippocampus and putamen for volume) demonstrated high heritability estimates (0.714–0.906 at baseline; 0.659–0.906 at two-year follow-up). Conversely, those measures that were least stable showed relatively low heritabilities (such as MD in the brain stem, nucleus accumbens, and ventral diencephalon). The Pearson correlation between heritability and stability was 0.800 (95% CI 0.604–0.905), suggesting that 64% on the variance in heritability across regions/metrics can be explained by the stability of the measurement.

Fig. 4
figure 4

Relationship between heritability determined at baseline and the stability of each metric between baseline and two-year follow-up

Discussion

The RNI metric based on an RSI model was more consistent between time points and generally yielded higher heritability estimates compared to MD derived from a diffusion tensor model. The RSI model is biologically based, and substantially more sophisticated than the DTI model, modelling the restricted, hindered, isotropic, anisotropic, and free fluid components separately. By comparison the DTI model combines these components, and the derived MD values are susceptible to contamination, for example from partial volume averaging of free fluid (which has a very high value of MD) with adjacent tissue due to poor tissue segmentation. RNI may be a more reliable microstructure measure to be used in the scope of twin research. A previous study (den Braber et al. 2013) noted poor 5-year retest reliability in the volumes of the nucleus accumbens and pallidum.

The highest microstructure heritability estimates were for the pallidum, putamen, and thalamus. To our knowledge, this is the first study to examine the etiology of subcortical microstructure in adolescents. A prior study (Gillespie et al. 2017) considered heritability of both subcortical MD and volume in middle-aged males. Their results for MD are similar to the present study, with some structures showing higher heritability in adults (caudate, pallidum, putamen), and some lower (amygdala, thalamus). Increasing heritability with age may represent the evolving influence of genetics over development, while decreasing may be due to diverging environmental impacts, especially during adulthood.

For volumes, the highest heritability estimates were for the brainstem, caudate, and putamen. The heritability of subcortical volumes were generally higher or comparable in our cohort compared to prior studies (Swagerman et al. 2014; Yoon et al. 2011). Our results are similar to those of den Braber (den Braber et al. 2013), who found the highest heritability in the left (0.88) and right (0.86) caudate in a cohort of young adults.

The consistency of high heritability estimates across RNI microstructure and volume measures provides evidence that individual variation in these subcortical measures is highly influenced by additive genetic effects. Anomalies in adolescent subcortical structure have been associated with negative outcomes, such as psychopathology (Gurholt et al. 2022; Guyer 2020). The high heritability of these phenotypes suggests they could serve as intermediary phenotypes to identify risk genes for heritable psychopathologies as well as provide insight into possible mechanisms (Glahn et al. 2007; Gurholt et al. 2022).

There was evidence of shared environmental effects influencing brainstem volume and mean diffusivity of the nucleus accumbens at follow-up year 2. However, the nucleus accumbens is often susceptible to low test-retest reliability due to its size and location and this finding should be approached with caution. The brainstem has been associated with neurodevelopmental disorders (Dadalko and Travers 2018). Future research could explore whether shared environmental risk factors, such as nutrition, share variance with the brainstem at this developmentally-sensitive age.

The hippocampus and caudate showed genetic stability (rA confidence intervals included 1) across all three imaging modalities. The putamen and thalamus displayed the possibility of novel genetic influences in all three modalities; however, given the magnitude of the genetic correlations, the genetic effects are mostly stable. All the other phenotypes demonstrated high genetic correlations but were inconsistent across modality on whether there were novel genetic influences. The putamen and thalamus displayed the possibility of novel genetic influences in all three modalities; however, given the magnitude of the genetic correlations, the genetic effects are mostly stable. All the other phenotypes demonstrated high genetic correlations but were inconsistent across modality on whether there were novel genetic influences.

The large genetic correlations across time are suggestive that genetic influences on subcortical development remain relatively stable from ages 10 to 12. Besides the hippocampus and caudate, most of the genetic correlations were less than one in at least one of the imaging modalities, suggesting some novel genetic influences emerging. These findings contradict one previous longitudinal twin study which found no new genetic effects on subcortical brain volumes from ages 9 to 12 (Swagerman et al. 2014); however, the current study had a larger sample size.

In addition to high genetic stability, the subcortical regions displayed unique environmental correlations across time, which were strongest in magnitude for the hippocampus, amygdala, and caudate. Unique environmental correlations can be indicative of a unique risk factor that is not explained by familial influences, such as a traumatic event. Environmental stressors in childhood have been implicated in atypical, sub-nuclei development (McCrory et al. 2012). The literature is especially abundant on structural alterations in the amygdala and hippocampus due to childhood adversity (Calem et al. 2017) and the possible exacerbating impacts these anomalies may have on later psychopathology (McCrory et al. 2012). The unique environmental correlations support that there may be events not captured by familial influence which impact sub-cortical development, and that not all E on subcortical structures is random measurement error. Future research can explore etiological correlations between these subcortical measures in relation to risk factors with psychopathology, such as childhood adversity.

One previous study found that the longitudinal change rate of childhood thalamus, pallidum, and amygdala volumes had a significant heritable component (Brouwer et al. 2017). Future research could explore the heritability of the change rate of subcortical microstructure.

Limitations

The sample size is an important limitation of twin studies that involve imaging, and this study is no exception. The uncertainties in the derived heritability estimates are large, especially for those regions with moderate or low heritability, perhaps limited by measurement uncertainty. Particularly at baseline (ages 9–10), subject motion can also be problematic, although the ABCD study benefits from the use of prospective motion correction for the structural data acquisition, combined with recommended imaging quality control criteria.

Measurement error is expected to reduce the correlations between MZ twins and DZ twins similarly, resulting in lower heritability estimates for metrics that have low test-retest reliability (Fig. 4, r = 0.8). This implies that the estimates of heritabilities presented may represent a lower bound on the true heritability.

While we controlled for the effect of sex, we did not specifically investigate it. Prior studies have postulated that there are no sex differences in subcortical volumes (den Braber et al. 2013; Vink et al. 2012) which may suggest that the same genes influence the same phenotype in both sexes. However, these studies concentrated on adults.

Conclusions

Volume and diffusion MRI metrics demonstrated substantial heritability across all subcortical regions in this sample both at baseline and two-year follow-up. The RNI component of diffusion derived from the RSI model was found to be more heritable than MD, perhaps due to its better test-retest reliability. Particularly high heritability of greater than 0.8 was found using the RSI model in the pallidum, putamen, and thalamus, and in volumes of the caudate and putamen. These findings were consistent across time points. The subcortical microstructures and volumes were largely genetically stable, with some small, novel genetic influences. Almost all of the phenotypes had cross-time unique environmental correlations, suggesting that a non-familial risk factor may impact subcortical development.