Introduction

Attention deficit/hyperactivity disorder (ADHD) is a neurodevelopmental disorder characterised by age-inappropriate levels of inattention and hyperactivity-impulsivity. ADHD symptoms have a strong genetic component, with heritability estimated at around 70 % (Burt 2009; Faraone et al. 2005; Wood et al. 2010b). The investigation of objectively measured physiological and neurobiological measures as potential biological correlates of the ADHD phenotypes may capture underlying processes that lie closer to ADHD genetic liability and delineate pathways from genes to ADHD behaviours.

Studies on children with diagnosed ADHD versus control groups examining overall diurnal cortisol levels and rhythm have reported significantly lower basal cortisol concentrations in morning sampling (Blomqvist et al. 2007; Isaksson et al. 2012; Ma et al. 2011), and significantly lower incidence of typical diurnal variation (defined as having a maximum and minimum cortisol level, respectively, at morning and evening sampling (Kaneko et al. 1993). Other studies on stress reactivity have reported increased cortisol productivity (White and Mulligan 2005), and conversely significantly lower pre- and post-stressor (IV insertion) cortisol concentrations (McCarthy et al. 2011) in children with ADHD, and blunted cortisol reactivity linked to persistent ADHD (King et al. 1998). In addition, one study reported a significantly higher incidence of atypical cortisol non-suppression to dexamethasone in ADHD versus control group (Kaneko et al. 1993). Overall, a recent meta-analysis of case–control studies of baseline cortisol found a significant, though modest effect (d = −0.31, p = 0.0001) of ADHD, with lower levels in ADHD versus controls (Scassellati et al. 2012). Some studies suggest that the covariation between ADHD and HPA-axis dysregulation is further complicated by ADHD subtype and/or comorbidity (Freitag et al. 2009; Hastings et al. 2009), but in the largest study to date with over 300 children (Isaksson et al. 2012), the negative association of morning cortisol measures with ADHD symptoms in children aged over 10 years did not differ according to ADHD subtype or co-occurring symptoms.

The heritability of HPA axis functioning has been investigated in twin studies. A review of twin studies (Bartels et al. 2003a) reported substantial variation in heritability estimates for basal cortisol measures (0–84 %), likely reflective of varying methodologies. Moreover, the authors found that none of the reviewed studies had adequate sample sizes to disentangle familial covariation into genetic versus shared environmental components. Pooling together five comparable twin studies yielded a heritability estimate of 62 % (Bartels et al. 2003a). In a subsequent study conducted on cortisol sampled over two consecutive days in 180 children aged 12, genetic factors were negligible for evening samples, with moderate heritability for cortisol sampled at awakening and before lunch, and highest (56–59 %) for samples collected up to an hour after awakening (Bartels et al. 2003b) A study, based on a subsample of same-sex twins selected from the Twins’ Early Development Study (TEDS; where the current sample is also derived from), found substantial heritability for single-point measures (around 60 %; pre-task baseline and two post-task values) and more moderate heritability for cortisol reactivity (44 %) to a computerised behavioural challenge (Steptoe et al. 2009).

The heritability of ADHD symptoms is comparatively well established (Burt 2009; Faraone et al. 2005; Wood et al. 2010a, b). Moreover, a series of studies on the TEDS sample suggest that stability in ADHD symptomatology is largely attributed to genetic effects (Kuntsi et al. 2005) and that the two ADHD symptom domains show substantial genetic overlap, as well as some degree of genetic distinction (Greven et al. 2011; McLoughlin et al. 2007). Furthermore, twin analyses indicate that, despite rater-specific effects, informant ratings largely index the same genetically influenced behaviours (Merwood et al. 2013).

However, the extent to which the genetic influences on cortisol measures are shared with genetic influences on ADHD has not yet been addressed and is a key objective of this study. If shared genetic risk is demonstrated, then indices of HPA-axis activity could be a biomarker for ADHD. Furthermore, if overlapping genetic effects are identified and can be distinguished as representing mediating genetic effects [as opposed to representing pleiotropic (alternative manifestations of the same genetic risk factors) genetic effects], then cortisol dysregulation can be identified as reflecting a causal pathway underlying ADHD and indicate a potential target for treatment and preventative interventions. Finally, the low levels of cortisol productivity may be related to under-arousal which has been theorised as underlying ADHD symptomatology [e.g. the state-regulatory (van der Meer, 2002) and the arousal-attention (Johnson et al. 2007; O’Connell et al. 2008) model].

This study investigated the association between ADHD affection status and cortisol activity in a sample of male adolescent twin pairs selected from a population-based sample according to consistent high and low ADHD symptoms (McLoughlin et al. 2014; Tye et al. 2012). Specifically, we aimed to explore the aetiological relationship between ADHD affection status (high versus low ADHD symptom scores) and the dynamics of cortisol productivity as captured by the intercept and slope variables of a linear growth curve model (GCM), while also modelling potential covariate effects (e.g. age, sampling conditions and demographic factors) on the means of these latent variables. We hypothesised that ADHD affection status would be associated with atypical cortisol productivity, specifically blunted cortisol activity for the group with high ADHD symptom scores. We had no a priori hypotheses regarding the aetiological relationship of ADHD affection status and cortisol activity.

Methodology

Sample

The sample was selected from TEDS, a birth cohort study, which invited all twins born in England and Wales between 1994 and 1996 to enrol (Trouton et al. 2002). Zygosity was determined using a zygosity questionnaire that has been shown to have 95 % accuracy (Price et al. 2000). Where zygosity was unclear from this questionnaire, DNA testing was conducted. The TEDS sample is representative of the general population in terms of parental education, ethnicity and employment status (Haworth et al. 2013).

The Neurophysiological study of Activity and Attention in Twins (NEAAT) subset used in this study consisted of 68 male twin pairs aged between 12 and 15. Twin pairs were selected based on latent class trajectory analysis of ADHD symptoms at ages 8, 12 and 14, using the 18 DSM-IV based ADHD items from the Conners’ Parent Rating Scale (Conners et al. 1998). This approach identified sub-groups of individuals who have been consistently rated by parents as having high or low ADHD symptoms and thus ensured selection of twin pairs consistently concordant or discordant for high levels of ADHD symptoms or unaffected controls (consistently low ADHD symptom ratings). Further information on selection of twins is given elsewhere (see McLoughlin et al. (2014) and Tye et al. (2012) for supplementary information). The selected sample consisted of 22 pairs concordant for high ADHD symptoms (11 monozygotic (MZ) and 11 dizygotic (DZ) pairs), 8 pairs discordant for ADHD symptoms (2 MZ and 6 DZ pairs) and 38 control pairs concordant for low ADHD symptoms (22 MZ and 16 DZ pairs). When subdividing our sample according to ADHD symptom ratings, 84 participants were classified as controls (low ADHD symptom scores) and 52 participants placed in the ‘high ADHD symptoms’ group.

Participating families gave their written informed consent and the study was approved by King’s College London Psychiatry, Nursing and Midwifery Research Ethics Sub-Committee (PNM/08/09-089).

Measures

Salivary cortisol

Salivary cortisol was obtained using the passive drool method at three time points during the cognitive-electroencephalography (EEG) session. The baseline measure was collected approximately 40 min after entering the EEG lab. Prior to baseline measurement, the child watched a cartoon while the EEG cap and electrodes were prepared for data collection and had two resting EEG periods (eyes open and closed; 3 min each). The child then completed three cognitive tasks while EEG was simultaneously measured (described below). On average, cortisol levels peak 20 min after stressor onset (Hirvikoski et al. 2009) and, therefore, to index participants’ cortisol response to the second cognitive task, the second sampling of saliva was obtained at the end of the last cognitive task (approximately an hour after baseline measurement). The final salivary sample was collected approximately 20 min later, to index response to the third cognitive task.

11 individuals (8 controls and 3 cases) were unable or did not agree to participate in saliva collection during the cognitive-EEG testing session. There were no significant differences in age or ADHD group status between those who did or did not participate. On the day of laboratory testing, four children had taken over-the-counter medication (e.g. pain killers) and an additional five children had taken steroid-based asthma inhalers. Two dichotomous variables were created [steroid-based medication (yes/no), and other medication (yes/no)]. There were no significant differences according to medication status for either variable, and, therefore, all participants’ samples were retained for analysis.

All uncentrifuged saliva samples were stored in a −80 °C medical freezer, until completion of the study when they were assayed for cortisol. Salivary samples from 10 participants were sent for preliminary analysis, with all remaining samples analysed in a single batch using a high-sensitivity chemiluminscence assay (Salimetrics, Cambridge, UK). The lower limit of detectable sensitivity was <0.003 µg/dL (micrograms per decilitre). One of the 375 analysed samples had insufficient saliva volume to test for cortisol. Coefficients of variance were below 10 % and, therefore, not subjected to re-testing. Cortisol concentrations were provided in µg/dL and converted into nmol/l (nanomoles per litre) by multiplying original values by 27.59.

Cortisol values were screened for extreme values. All samples from one control participant were higher than three standard deviations (SD) from the mean and designated as missing. Another control participant had one outlier (defined as a value >3 SD above the mean) and was winsorized (re-assigned with the highest individual corresponding value <3 SD above the mean).

Cognitive-EEG tasks

Task 1: cued continuous performance test (CPT-OX) with flankers 33 (11 min) (Doehnert et al. 2008; Valko et al. 2009). This well-validated cued CPT involves selective attention, preparation and vigilance for an infrequently occurring stimulus, plus an embedded go/no-go test. The stimuli consist of a black letter array formed of a centre letter flanked on each side by distractor letters. Participants are instructed to respond only to cue-target sequences (XOX-OXO), by pressing a button as quickly as possible with the index finger of their preferred hand.

Task 2: arrow flanker task (13 min) (Albrecht et al. 2005; McLoughlin et al. 2009, 2014). This standard arrow flanker task is designed to measure brain activity during conflict monitoring. Two flankers (black arrowheads above and below the position of a fixation mark) are first presented 100 ms before the central target black arrowhead appears. Participants have to press a response button corresponding to the direction indicated by the target arrow. On congruent trials, flanker and target arrowheads point in the same direction; on incongruent trials, they point in opposite directions. If the participants are being too careful and not making any errors, they are instructed to perform at a faster rate, with the intention that this will cause them to make errors. Conversely, if the participant is making too many errors, they are given feedback to be more careful in their responses.

Task 3: the fast task (20 min) (Andreou et al. 2007; Kuntsi et al. 2006). This four-choice reaction time task measures performance under baseline (slow, unrewarded) and fast-incentive conditions. Four empty circles first appear on the screen. After one of them (the target) is coloured in, participants have to press an appropriate response key as quickly as possible. In the fast-incentive condition the participants win a smiley face whenever responding faster than their own mean reaction time (MRT) during the baseline condition consecutively for three trials.

ADHD rating scales

ADHD ratings scores were derived from the 18-item DSM-IV based ADHD subscale of the Long Version of the Conners’ Parent Revised Rating Scale (Conners et al. 1998).

Statistical analysis

Genetic growth curve model

The relationship between ADHD affection status and cortisol measures taken during the cognitive-EEG session was tested in a linear growth curve model (GCM) which captures the dynamics of the cortisol activity (an index of HPA axis function) by means of two latent factors: the intercept [cortisol concentration at the beginning of the testing paradigm (I)] and the slope [rate of change in cortisol productivity over time (S)]. The loadings for the intercept were fixed at 1 and those of the slope to 0, 1 and 2 to reflect the linear trajectory of cortisol over the testing period (Fig. 1). The GCM captures each individual’s trajectory across all sampling points, rather than focussing on individual measures (Hagger-Johnson et al. 2010) and, therefore, does not require complete data for every participant and estimates both parameters simultaneously within the same model.

Fig. 1
figure 1

Mean salivary cortisol concentration by group status during cognitive-EEG testing. Mean raw cortisol concentration values shown in nmol/L during testing paradigm; Lab_1 was pre-task (baseline) measure; Lab_2 was obtained at the end of cognitive testing; Lab_3 was taken at end of entire assessment

The genetic nature of the twin sample further allowed estimation of the aetiological variance and covariance of these latent intercept and slope factors as well as ADHD affection status according to general biometrical genetic theory. In brief, twin studies enable us to disentangle the extent to which a trait is influenced by genetic factors (A), shared environmental factors (C), and non-shared environmental influences (E). This method relies on the difference in genetic relatedness between MZ twins, who share all of their genetic information, and DZ twins who share, on average, only 50 % of their additive genes. Multivariate genetic models estimate the underlying variance components of individual variables (or latent factors) and the aetiological components of phenotypic covariance between multiple traits (or factors) (Neale and Cardon 1992).

Main effects of covariates like age and sampling conditions were specified on the intercept and slope factor means while modelling the genetic architecture of these factors. The genetic and environmental variance and covariance parameters of the continuous cortisol measures and ordinal affection status were estimated using structural equation modelling in the OpenMx program (Boker et al. 2011). Using full information maximum likelihood estimation, parameters were obtained in a joint continuous-ordinal model in which a liability threshold model was assumed to express the risk for ADHD as normally distributed in the general population with the disorder occurring when a certain threshold at symptoms’ level is exceeded. However, to get unbiased estimates for the association with the cortisol factors, the non-random selection of individuals on ADHD required all its parameters to be fixed to population known values: h2 = 0.76, c2 = 0, e2 = 0.24 (derived from the mean heritability pooled across over 20 twin studies (Faraone et al. 2005); and the threshold to a lifetime risk of 5 % (Polanczyk et al. 2007). This model has been applied to schizophrenia and bipolar disorder in analyses on brain volumes (Rijsdijk et al. 2005) and neuropsychological measures (Toulopoulou et al. 2007), as well as on electrophysiological parameters on the same sample as reported here (McLoughlin et al. 2014; Tye et al. 2012).

Figure 2 presents the correlated factors solution of the fitted genetic Cholesky model on intercept, slope and ADHD factors, where the ordering of variables is arbitrary (Loehlin 1996). It gives the genetic (rA), shared environmental (rC) and nonshared environmental (rE) correlations (i.e. the extent to which the same genetic or environmental factors contribute to trait covariation). In addition, using the standardized path estimates, we calculated the proportion of the phenotypic overlap between slope and ADHD due to genetic effects (rph-A = √h 2 Slope × rA × √0.76) and the unique environmental effects (rph-E = √e 2 Slope × rE × √0.24) (for more details see Neale and Cardon 1992; Rijsdijk et al. 2005).

Fig. 2
figure 2

Genetic GCM for cortisol sampling during cognitive EEG-testing paradigm and ADHD affection status, with age effects incorporated in Intercept and slope factor means. LAB_1 baseline, LAB_2 sampled at end of cognitive tasks, LAB_3 sampled at end of cognitive-EEG assessment, As ADHD affection status; I intercept, S slope, Esp residual measurement error, µ i intercept mean; µ s slope mean, μ io overall mean of the intercept factor, μ so overall mean of the slope factor, β i age-dependent intercept mean, β s age-dependent slope mean; the variances of the intercept and slope factors are 0.26 and 0.03, respectively; model presented for one twin only for ease of presentation

Results

As expected, the ADHD group had significantly higher mean ADHD DSM-IV based symptom scores (Table 1), confirming that the two groups significantly differed according to key ADHD diagnostic criteria. Other group mean differences include that the group with high ADHD symptom scores was significantly younger than controls and were more likely to have participated during months with more light (March–September). In addition, on the day of the cognitive-EEG session, the high ADHD symptom group had significantly later awakening and later sampling times. The high ADHD symptom group had lower cortisol concentration levels throughout the cognitive-EEG session, although this was only significant at the last sampling point and did not retain significance (t = −1.34, p = 0.18) after regressing out potential confounders (age, awakening time, sampling time, and season of sampling).

Table 1 Group mean differences

The aetiological relationship between ADHD and cortisol activity was examined by testing the overlap with the intercept and slope factors, respectively, as measured in a latent growth factor model. We ran separate models controlling individually for potential confounding effects of all the sampling factors (age, awakening time, sampling time and season of sampling) on the latent factor means (mI = 1.31 and mS = −0.25). The only significant effect was found for age on the intercept (β I age = 0.24). Accordingly, only age was incorporated in the final model (Fig. 2). Familial factors largely explained individual differences on both the intercept [h 2 = 46 % (95 % CI 0/97 %), c 2 = 29 % (0/83 %) and e 2 = 25 % (3/66 %)] and the slope factor [h 2 = 21 % (0/99 %), c 2 = 64 % (0/98 %) and e 2 = 15 % (0/99 %)]. ADHD only significantly correlated with the slope factor [−0.49 (−0.98/−0.09)], implicating that cortisol concentrations decreased significantly faster for the affected group. The phenotypic overlap between the slope factor and ADHD affection status due to genetic effects (rph-A = √0.21 × −0.89 × √0.76) was −0.36 and that due to unique environmental effects (rph-E = √0.15 × −0.71 × √0.24) was -0.13. This means that the phenotypic overlap between cortisol change over time and ADHD status was predominantly (−0.36/−0.49 = 73 %) due to shared genetic factors (within the limitations of the fixed model which assumes no effects of C on ADHD affection status). The standardized estimates for the individual cortisol measures are presented in Table 2. Heritability estimates for the individual cortisol measures ranged from 22 to 33 %, but were not detectable as significant in this small sample. Estimates for shared and unique environmental components were largely similar across the individual cortisol measures. Unique environmental variance contributed to the growth factors, as well as significant residual variance (which includes measurement error).

Table 2 Aetiological components for individual cortisol sampled during cognitive-EEG testing paradigm

Discussion

In an analysis of adolescent twin pairs selected for developmentally stable high vs low ADHD symptoms, two key findings emerged that highlight the dynamics of cortisol productivity and its aetiological link to ADHD. First, while we did not find the expected group differences in all single-point measures across the trajectory of cortisol concentrations over laboratory testing, we found, using growth curve modelling (GCM), that cortisol levels declined at a faster rate for the high ADHD symptom group, with a significantly steeper slope, compared to controls. In contrast, the intercept was similar across groups. Our second main finding was that the significant overlap between ADHD and the slope factor was predominantly due to shared genetic factors.

Taken together, our findings suggest that significant group differences relate to dynamic (degree of change) indicators of HPA functioning during cognitive-EEG testing, rather than baseline levels. A methodological implication of our findings is that GCM represents an effective alternative means of examining cortisol activity over a given period of time, in line with increasing research that suggests that patterns of activity are generally not adequately captured by single-point or absolute-difference measures (Balodis et al. 2010).

Further, the rate of cortisol change in productivity, as indexed by the slope, captures a majority of the genetic risk in ADHD. The shared genetic risk is an important finding and suggests that cortisol dysregulation is a candidate endophenotype for the ADHD. It may be that blunted cortisol activity is a biological marker of the ADHD phenotype and a potential neurobiological pathway underlying ADHD symptomatology. In particular, if future studies can clarify if the overlapping genetic effects represent causal pathways (as opposed to pleiotropy), then indices of HPA-axis functioning may be identified as a potential target for treatment and preventative interventions. In relation to theoretical models of ADHD, attenuated cortisol levels have been suggested to reflect under-arousal, in line with arousal dysregulation models (Isaksson et al. 2012), such as the state-regulatory model (van der Meer 2002) or the arousal-attention model (Johnson et al. 2007; O’Connell et al. 2008). Our findings add to the previous findings by specifying the dynamics of cortisol productivity as a key process in ADHD, which future studies can examine in relation to cognitive-EEG and skin conductance measures of the proposed arousal dysregulation in ADHD.

It is important to emphasise that the current study on cognitive-EEG testing was part of a larger battery of assessment and was not intended to provoke major stress. Yet, across both groups the first sample yielded the highest cortisol concentrations (Table 1; Fig. 1), suggesting potentially anticipatory stress and HPA activation prior to entering the EEG lab. It is possible that the pattern of activity captured is reflecting only part of the cortisol-reactivity curve: subsequent recovery from the anticipation of entering the laboratory. Repeated sampling over a longer sampling time would allow a more detailed examination of the cortisol-reactivity pattern.

It is important to note that the cortisol concentrations sampled were skewed to the right and that maximum-likelihood statistical equational modelling requires data to be normally distributed. However, as the means and variances of the GCM factors rely on changes in means of the observed variables, any transformations that would change the natural order of the means (e.g. log transformation) would render the variables unsuitable for GCM. This is a limitation of the current analyses and could potentially be avoided using multiple threshold models, albeit with a severe loss of statistical power (Mehta et al. 2004). In addition, we were unable to distinguish between ADHD behavioural dimensions in our study, as children were selected for consistently high/low parental ratings across both symptom domains; yet the evidence from other studies suggests that blunted cortisol activity in ADHD may have a stronger relationship to hyperactivity-impulsivity, as opposed to inattention (Ma et al. 2011; Maldonado et al. 2009). Along the same lines, it was not possible to model any moderation (or main) effects of co-occurring behaviours, such as oppositional behaviours or anxiety-shy symptoms, on ADHD liability and the relationship with the slope factor, given the selected nature of the sample and the fixed parameters on ADHD to correct for selection. Although some studies have suggested that the covariation between ADHD and HPA-axis dysfunctioning may be complicated by comorbidity (particularly comorbid ODD), our previous twin analyses focusing on middle childhood suggested that hyperactivity-impulsivity and oppositional behaviours largely index the same aetiological liability (Wood et al. 2009), raising the possibility that associations with ADHD (particularly hyperactivity-impulsivity) and/or ODD may be capturing the same effect.

One of the main strengths of this study was that the sample was limited to include males only and to a restricted age range, to reduce variability and maximize sample homogeneity, although an important caveat is that we did not have information relating to pubertal status, which may be a significant source of variance in cortisol productivity. This study relied on parental ratings of ADHD, as opposed to a diagnosis made within a clinical setting, and so avoids biases associated with referral bias, comorbidity and the confounding effects of stimulant medication. Although this may limit the generalizability of our findings to non-population based studies, the overall evidence from family and twin studies strongly suggest that the clinical diagnosis of ADHD reflects the extreme and impairing tail end of one or more continuous dimensions of psychopathology (Chen et al. 2008; Levy et al. 1997). Moreover, a series of analyses have highlighted the comparability of our quantitative genetic findings across both diagnostic and dimensional approaches to the ADHD phenotype (Cheung et al. 2012, 2014; Kuntsi et al. 2010, 2014; Paloyelis et al. 2010; Wood et al. 2010a, 2011). Although our sample size was small, when studying low prevalence disorders in twin model fitting power can be increased by using a selected sample (Neale et al. 1994), as utilised in our study.

In conclusion, in a novel application of growth curve modelling (GCM) and the twin design to examine cortisol activity with ADHD affection status, we found that ADHD is specifically associated with the dynamics of cortisol productivity and that this overlap is predominantly due to shared genetic factors. Future research incorporating genotype, cognitive-EEG and skin conductance data will be able to further delineate the affected pathways. Increased awareness of the pathophysiological processes involved in ADHD may facilitate improvements in treatment and management of the disorder (Corominas et al. 2012).