The co-occurrence between attention deficit-hyperactivity disorder (ADHD), which is characterized by developmentally inappropriate levels of inattentive and/or hyperactive-impulsive behaviours (American Psychiatric Association (APA) 2000), and reading disability, whether defined as diagnostic categories or quantitative traits, is well documented (August and Garfinkel 1990; Dykman and Ackerman 1991; Trzesniewski et al. 2006; Willcutt and Pennington 2000a,b). It reflects a strong phenotypic association between reading disability and ADHD inattention symptoms, which has been largely attributed to shared genes (Martin et al. 2006; Willcutt and Pennington 2000a; Willcutt et al. 2000, 2007b). The present study, using a genetically informative design, extends this research by investigating to what extent the common genetic variability between reading difficulties and ADHD inattention symptoms is also shared with general cognitive ability, as measured by IQ.

Previous twin studies have shown that both reading ability/disability (Alarcon and DeFries 1997; Byrne et al. 2008; Gayan and Olson 2003; Harlaar et al. 2005a; Tiu et al. 2003; Wadsworth et al. 2000) and ADHD symptoms (Kuntsi et al. 2004; Wood et al. 2009) share genetic variability with IQ. Therefore, it is possible that any shared genetic variability between inattention symptoms and reading disability could reflect a common genetic association shared with general intelligence. This idea was supported by evidence that genetic and environmental influences shared with IQ accounted for the covariance between reading performance and a cognitive attention measure (Zumberge et al. 2007). However, it is not possible to generalize directly from cognitive attention processes to behavioural inattention problems (Marzocchi et al. 2009; Warner-Rogers et al. 2000). Similar manifestations of inattention problems may reflect diverse cognitive deficits, and the relationship between overt behavioural inattention and deficits in cognitive attention is neither simple nor direct (Marzocchi et al. 2009; Warner-Rogers et al. 2000).

Indeed, behavioural and genetic evidence at the overt symptom level suggests that shared aetiological influences between inattention symptoms and reading disability are likely to be independent of IQ. A recent study reported similar estimates of common genetic influences between a brief measure of hyperactive/inattentive behaviours and a composite measure of academic achievement (including reading) before and after adjusting the latter for IQ (Saudino and Plomin 2007). Other studies have shown that early inattention symptoms predicted later reading achievement even after controlling for prior reading ability and IQ (Rabiner and Coie 2000; Rabiner and Malone 2004).

There is a need to understand better the aetiology of the covariance between ADHD inattention symptoms and reading disability because, to be effective at improving reading performance in the context of inattention problems, intervention programs need to address the specific deficits giving rise to the covariance between these disorders (Rabiner and Malone 2004). If the covariance could be attributed to shared aetiological influences that are independent of IQ, this would indicate the presence of specific neurocognitive deficits contributing to these disorders that were independent of possible “generalist” mechanisms spanning cognitive processes and learning abilities/disabilities across domains (Haworth et al. 2009).

Bivariate studies allow the parsing of the covariance between two phenotypes into distinct genetic and environmental components. The genetic and environmental correlations provide estimates of the degree to which the covariance between two phenotypes reflects shared genes or environmental factors, respectively. The inclusion of additional variables of interest in the multivariate case makes possible the estimation of the extent to which the genetic (or environmental) overlap is further shared with the additional variables (such as IQ). Multivariate designs offer improved power by decreasing the rate of false positive type I error rate (by decreasing the number of tests) and by taking into account the covariance among traits for each individual (Hottenga and Boomsma 2008).

Existing studies have employed bivariate designs, examining the aetiology between reading deficits and a single ADHD subtype, or ADHD symptom dimension, at a time, excluding IQ (Martin et al. 2006; Willcutt et al. 2000, 2007b). In the present study we replicate and extend previous research by using a general population sample and employing a multivariate design, including measures of IQ and both ADHD inattention and hyperactivity-impulsivity symptoms. The use of an unselected, general population sample avoids possible selection biases associated with clinic-referred or selected community samples and allows the generalization of findings to the general population. Inattention, hyperactivity-impulsivity and reading disability are all considered to be the tails of normally distributed traits, reflecting the normal distribution of genetic risk in the population (Chen et al. 2008; Harlaar et al. 2005b; Levy et al. 1997; Shaywitz et al. 1992). Studying general population samples is useful in understanding the extremes, as quantitative genetic and epidemiological evidence supports the validity of making inferences from population data to clinical cases in ADHD (Chen et al. 2008).

Specifically, our study aimed to: (1) Confirm the substantially larger phenotypic and genetic correlations between reading difficulties and ADHD inattention symptoms, compared to hyperactivity-impulsivity symptoms. (2) Investigate, for the first time, to what extent the common genetic variability between ADHD inattention symptoms and reading difficulties is also shared with IQ and hyperactivity-impulsivity symptoms. (3) Assess how much of the variance in reading difficulties could be attributed to aetiological influences shared with the ADHD symptom domains and IQ, thus providing an estimate of their relative importance in understanding the aetiology of reading difficulties. (4) Finally, examine the possibility of gender differences in the aetiology for the covariation between reading difficulties and inattention symptoms.


Sample and Procedure

Participants are members of the Study of Activity and Impulsivity Levels in children (SAIL), a general population sample of twins aged 7.9–10.9 years. They were recruited from the Twins’ Early Development Study (TEDS; Trouton et al. 2002), a birth cohort study which had invited parents of all twins born in England and Wales during 1994–1996 to enrol. Despite attrition, the TEDS families continue to be reasonably representative of the UK population with respect to parental occupation, education and ethnicity (Oliver and Plomin 2006). Zygosity has been determined using a standard zygosity questionnaire which has shown 95% accuracy (Price et al. 2000).

TEDS families were invited to take part if they fulfilled the following SAIL project inclusion criteria: twins’ birthdates between September 1, 1995 and December 31, 1996; lived within a feasible travelling distance from the research centre; White European ethnic origin (to reduce population heterogeneity for molecular genetic studies); recent participation in TEDS, as indicated by return of questionnaires at either 4- or 7-year data collection point; no extreme pregnancy, perinatal difficulties, specific medical syndromes, chromosomal anomalies or epilepsy; not participating in other current TEDS substudies; and not on stimulant or other neuropsychiatric medications.

Of the 1,230 suitable families contacted, 672 families (55%) agreed to participate. Overall, the sample is as representative of the general population as is feasible for a study of this kind, and previous analyses on TEDS indicated that attrition was not due to ADHD symptoms. For example, Saudino et al. (2005) found that twins who participated at age 7 assessments were not significantly different in parent ratings of hyperactivity from lost twins at age 2 (t = 1.77; p = 0.08). However slight bias towards higher parental occupational classification, compared to the original TEDS sample, should be noted (39% of mothers and 52% of fathers in managerial or professional jobs, compared to 28% and 40%, respectively). Thirty-two children were subsequently excluded due to: IQ < 70, epilepsy, autism, obsessive-compulsive or other neurodevelopmental disorder, illness during testing or placement on stimulant medication for ADHD. The final sample consisted of 1312 individuals: 255 monozygotic (MZ) twin pairs, 183 same-sex dizygotic (DZ) and 206 opposite-sex DZ twin pairs, as well as 24 singletons coming from pairs with one of the twins excluded. Data for the 24 singleton twins were also used in the structural equation modelling (see Neale et al. 2003). Participants were invited to our research centre for cognitive assessment (see Kuntsi et al. 2006), where ratings on the Conners’ scale and the reading difficulties questionnaire were collected from parents. Teachers’ ratings on the Conners’ scale were obtained through post.

The mean age of the sample was 8.83 (SD = 0.67), and half of the sample were girls (N = 663, 50.5%). Children’s IQs ranged from 70 to 158 (M = 109.34, SD = 14.72). Parents of all participants gave informed consent following procedures approved by the Institute of Psychiatry Ethical Committee.


Wechsler Intelligence Scales for Children, Third Edition (WISC-III; Wechsler 1991)

The vocabulary, similarities, picture completion and block design subtests from the WISC-III were used to obtain an estimate of the child’s IQ (prorated following procedures described by Sattler (1992)).

Ratings of Inattention and Hyperactivity-Impulsivity

Parents and teachers were asked to complete the Long Versions of the Conners’ Parent and Teacher Rating Scales (Conners et al. 1998a,b). Teacher ratings were collected from the main class teacher for each child. Previous analyses on the TEDS sample indicated that the majority of twins had been rated by the same teacher (Saudino et al. 2005). In a study looking at scholastic achievement and hyperactivity/inattention in the TEDS parent sample, the aetiology of the covariance was similar for ratings provided by the same or different teachers or parents (Saudino and Plomin 2007). From both scales, we used the 9-item inattention and 9-item hyperactivity-impulsivity DSM-IV symptoms sub-scales. Inter-rater agreement for parent and teacher ratings was .46 (p < .001) for inattention and .40 (p < .001) for hyperactivity-impulsivity, which are comparable to those obtained in previous studies (Saudino et al. 2005; Thapar et al. 2000). We created DSM-IV composite inattention and hyperactivity-impulsivity scores by summing up standardized parent and teacher ratings on the corresponding subscales. Teacher ratings were missing for 151 individuals and parent ratings for two individuals. Those with missing teacher ratings on the Conners’ scale did not significantly differ from the rest of the sample on the reading difficulties questionnaire (F(1, 667) = 0.61, p = .43), in IQ (F(1, 667) = 3.55, p = .06) or parent hyperactivity-impulsivity ratings (F(1, 666) = 3.49, p = .06), but were slightly older (M = 9.12, SD = 0.71; F(1, 667) = 17.88, p < .001) and had slightly higher parent inattention ratings (M = 7.18, SD = 5.82; F(1, 666) = 5.01, p = .026) (corrected for the inclusion of non-independent data from twin pairs; Williams 2000). To avoid excluding existing data from these 153 individuals and consequently reducing the power of our study, and under the assumption that teacher data were missing at random, we imputed missing parent and teacher ratings using the existing teacher or parent ratings, respectively, with the “impute” command in STATA (version 10.0; StataCorp, 2007). In a few cases, missing responses to individual items in the Conners’ subscales were pro-rated: a summary score based on the mean of individual questions on the rest of the subscale was used if there was more than 75% completion for each subscale. In our sample, 11.8% of the participants had parent ratings and 4.6% of the participants had teacher ratings on the DSM-IV ADHD total scale indicating clinical significance (using as a cut-off a T-score > 65 as indicated in the manual) (Conners 1997). These estimates match or exceed the estimated ADHD prevalence rate in the population (Ford et al. 2003; Polanczyk et al. 2007).

Reading Difficulties Questionnaire (RDQ; Martin et al. 2006; Rommelse et al. 2009; Willcutt et al. 2010b)

This 6-item parent rating scale is part of an instrument screening for learning disorders. On a scale which ranges from 1 (“Never/not at all”) to 5 (“Always/a great deal”), parents are asked to report to what extent their child has difficulties with spelling, learning letter names or phonics (sounding words out), and to what extent their child reads slowly, below expectancy level or has required extra help at school. In the validation study, using four independent referred and general population samples (N = 4158), all items loaded on a single factor and the scale showed excellent internal consistency (mean Cronbach’s α = .90) and high inter-rater (r = .83) and 1-year test-retest (r = .81) reliabilities (Willcutt et al. 2010b). In that study RDQ showed high correlations with a range of objective reading and spelling measures (overall r = .64; CI:.60 to .68) but low correlations with measures of other learning difficulties (r = .07–.024), attesting to its good criterion and discriminant validity (Willcutt et al. 2010b). In our sample (using data collected as part of the TEDS project, Trouton et al. 2002), the RDQ showed similarly high correlations with a measure of word reading efficiency (r = −.63, p < .001, N = 301, unpublished observations; Test of word reading efficiency, Torgesen et al. 1999) and teacher ratings for reading attainment (r = −.58, p < .001, N = 975; unpublished observations). Moreover, RDQ scores have shown high heritability (h 2 = 53% to 83%) and high genetic correlations (−.71 to −.89) with a composite measure of reading performance (Astrom et al. 2009; Martin et al. 2006). Parent and teacher ratings of inattention and hyperactivity-impulsivity showed remarkably similar correlations with RDQ scores (inattention: parents:.41 (CI95:.37 to .46), teachers:.47 (CI95:.45 to .51); hyperactivity-impulsivity: parents:.22 (CI95:.17 to .23), teachers:.17 (CI95:.11 to .18)).


Overview of the Twin Method

In univariate analyses, correlations between members of a twin pair for each trait are used to apportion phenotypic variance to additive genetic (A), dominant genetic (D) or shared environment (C), and child-specific environment (E) components (which also subsumes measurement error) (Neale and Cardon 1992; Plomin et al. 2001). Based on the assumptions that (a) MZ twins are genetically identical and therefore share 100% of genetic variation, whereas DZ twins share, on average, 50% of their segregating alleles contributing to A and 25% contributing to D, and (b) both MZ and DZ pairs share 100% of their C but are discordant for E, the phenotypic variance for a trait is partitioned into constituent A, D or C and E influences. Greater phenotypic similarity between MZ twins compared to DZ twins suggests genetic influences on trait variance. If the phenotypic similarity of MZ twins is more than twice that of DZ twins, this suggests the presence of D, otherwise only A is suggested. DZ twin correlations greater than half the MZ twin correlations suggest the presence of C. The extent to which MZ twins are not 100% concordant for a trait reflects E (Rijsdijk and Sham 2002).

Structural equation modelling provides a tool for the formal estimation of variance components (A, C/D and E parameters) and for testing alternative models describing possible component contributions to trait variance or covariance. When only twin pairs reared together are used, the available information allows the estimation of only a C or D component at a time. In multivariate genetic analyses, as well as partitioning the phenotypic variance of single traits, it is also the covariance between traits that is decomposed into A, C/D and E influences following exactly the same logic as above and using the ratio of MZ:DZ differences in cross-twin cross-trait correlations, (e.g. inattention symptoms in twin 1 with reading difficulties in twin 2) (Rijsdijk and Sham 2002).

As multivariate models have increased power over univariate models (Schmitz et al. 1998), we do not present parameter estimates from univariate models. Univariate modelling was used to inform the choice of parameters for the multivariate models (e.g. the choice of C or D parameters) and to test for sex effects.

Structural Equation Modelling

Structural equation modelling was performed using Mx (Neale et al. 2003). Models were fitted to age- and sex-regressed standardized residual scores, which were logarithmically transformed to minimize skewness. All estimates are provided with 95% confidence intervals (the inclusion of zero indicates non-significance). The relative goodness of fit of the competing hierarchical (or nested) models was assessed using a likelihood ratio test. This was computed as the difference in the −2LL statistics of two models, which is distributed as a χ 2 with degrees of freedom (df) equal to the difference in the parameters estimated with each model. A significant χ² suggests a significant deterioration in fit for the more constrained model; if the χ² is not significant, the model with the fewer parameters is preferred for being more parsimonious. In the case of multivariate genetic analyses with large samples, the χ 2 difference from the saturated model is likely to be significant. In such cases, as well as when comparing models that are not nested, Akaike’s information criteria (AIC) can be used to assess the relative fit of models. The lowest AIC value indicates the best fitting model, given the data and the set of candidate models (Wagenmakers and Farrell 2004).

Saturated Phenotypic Model

This model fully describes the data using the maximum number of free parameters and provides a baseline comparison for subsequent genetic models. We constrained this model in accordance with the assumptions of the genetic method (that is, means and variances within traits and phenotypic correlations across traits were equated across twins in a pair and zygosity groups) to obtain phenotypic correlations representative of the whole sample while taking into account the non-independence of the data (i.e. data of related subjects).

Sex Effects

Qualitative sex differences are found where the nature of the A, C/D and E influences differs, i.e. different genes or different environmental influences underlie the variance in the trait for males and females. The data indicate quantitative sex differences if the magnitude of A, C/D and E influences underlying a trait are significantly different for males and females. Scalar sex differences are found where only unstandardized A, D/C and E estimates differ (but standardized estimates are the same), due to variance differences in the trait distribution between males and females. There were neither qualitative nor quantitative sex differences underlying the variance in traits, although scalar differences were observed for reading difficulties, and inattention and hyperactivity-impulsivity symptoms. Therefore, in the multivariate modelling male phenotypic variances for these traits were pre- and post-multiplied by a scaling factor. As there are no significant qualitative or quantitative differences in variance components between the sexes, MZ and DZ correlations are not presented for each sex. However, given the scalar differences between the sexes, means and standard deviations are broken down into sex- and zygosity-specific groups (Table 1).

Table 1 Cross-Twin Correlations (with 95% Confidence Intervals in Brackets) and Means (and Standard Deviations) for and Across IQ, Inattention, Hyperactivity-Impulsivity and Reading Difficulties Ratings

Parameter Selection for the Multivariate Models

In the univariate analyses, an ACE model provided the best fit for IQ, while ADE models (with scalar sex differences) fitted best for the remaining three phenotypes (as we would predict from the MZ:DZ ratios of cross-twin correlations for these traits, Table 1). In this study we were interested to assess the contribution of broad-sense shared genetic influences to the covariation between reading difficulties and inattention (that is, not distinguishing between A and D effects), and the extent to which they are independent from genetic effects also shared with IQ. Therefore, in the multivariate models we parsed the variance contributing to the covariation among the four phenotypes into A and E components, with A reflecting broad-sense genetic effects. Due to the lack of qualitative or quantitative sex differences in the univariate analyses beyond scalar differences, the computational intensity of modelling sex effects and additional power issues (Neale et al. 2006), only scalar differences between males and females were allowed in the multivariate models.

Multivariate Genetic analyses

A Cholesky triangular decomposition, which postulates a series of hierarchical genetic (A1–A4) and child-specific environment (E1–E4) factors, was used (Fig. 1). The order of the traits in the Cholesky model was decided a priori, with a view to estimating the aetiological influences that contribute to the covariance between reading difficulties and inattention symptoms independent of the other traits. A different ordering of the traits would produce the same fit of the model, but address different questions. The Cholesky model can be converted to the mathematically equivalent correlated factors solution (Fig. 2; Loehlin 1996), in which the order of traits is of no importance. This mathematical solution allows the estimation of the extent to which the same genes or environments contribute to the covariation between traits (i.e. the genetic and environmental correlations), irrespective of the extent to which they are shared with other traits in the model.

Fig. 1
figure 1

Multivariate Cholesky triangular decomposition with unstandardized parameter estimates. The best-fitting ACE(IQ)-AE model is presented (for twin one only)

Fig. 2
figure 2

Correlated factors solution of the best fitting ACE(IQ)-AE multivariate Cholesky decomposition model presented in Fig. 1 (for twin one only). Path estimates presented as square roots are the unstandardized parameter estimates (the basis of the square root provides the standardized estimate). Bidirectional arrows between latent genetic (A1–A4) and child-specific environment (E1–E4) factors show the genetic (rg) and child-specific environment (re) correlations between them, respectively. Dotted arrows denote non-significant estimates

For instance, in the Cholesky model, the proportion of the covariance between reading difficulties and inattention symptoms due to common genes (or environmental factors) which are not shared with hyperactivity-impulsivity symptoms and/or IQ can be estimated as the product of the paths linking these phenotypes with latent factor A3 and dividing by their covariance. In the same model, the proportion of the variance in reading difficulties which is not due to genetic (or environmental) effects shared with any of the other traits measured in this study is estimated by dividing the square of the path linking RDQ with latent genetic factor A4 by the variance in RDQ scores.

Two additional multivariate models were employed to address the question of whether the covariance between the four traits in our study could be attributed to genetic and child-specific environment effects that are common to all, either directly (common factor, independent-pathway model, Fig. 3a), or through a common latent factor (common-factor, common-pathway model, Fig. 3b).

Fig. 3
figure 3

Illustration of the multivariate independent pathways (a) and common pathway (b) ACE(IQ)-AE models which were compared to the multivariate Cholesky model presented in Fig. 1


  • Aim 1: Are the phenotypic and genetic correlations between reading difficulties and ADHD inattention symptoms larger compared to hyperactivity-impulsivity symptoms?

A medium phenotypic correlation between reading difficulties and inattention symptoms was observed, which was substantially and significantly higher than the correlation with hyperactivity-impulsivity symptoms (p<.001) (Table 2). The association between IQ and reading difficulties was similar to what has been previously reported in the larger TEDS sample using a composite index of reading ability (r = .43; Harlaar et al. 2005a). The correlation between IQ and ADHD inattention symptoms was small but significant, while the correlation with hyperactivity-impulsivity symptoms, although also significant, was less than half the size of the association with inattention. Heritability estimates ranged from 55% for inattention to 74% for reading difficulties (Table 3). The genetic correlation between reading difficulties and inattention was moderately high at 0.60, 2.5 times the size of the genetic correlation with hyperactivity-impulsivity (Table 3). The phenotypic correlation of reading difficulties with both inattention and hyperactivity-impulsivity symptoms was largely attributable to shared genetic effects (75% and 92% respectively; Table 3).

Table 2 Phenotypic Correlations (with 95% Confidence Intervals in Brackets) Across IQ, Inattention, Hyperactivity-Impulsivity and Reading Difficulties Ratings
Table 3 Standardized Parameter Estimates (with 95% Confidence Intervals) from the Correlated Factors Solution of the Full Cholesky Model Within and Across IQ, Hyperactivity-Impulsivity, Inattention and Reading Difficulties Ratings
  • Aim 2: To what extent do genes that are also shared with IQ and/or hyperactivity-impulsivity explain the covariance between reading difficulties and inattention symptoms?

Comparing the AIC values of the three multivariate models, including the common-pathway and independent-pathway models, showed that the ACE-AE Cholesky model provided the best fit to the data (Table 4). This indicated that the covariance between the four phenotypes could not be best explained by genetic and environmental factors that are common among all traits. Using the parameter estimates from the best-fitting Cholesky decomposition model (Fig. 1), we estimated that 45% of the covariance between reading difficulties and inattention was due to genetic effects not shared with any of the other phenotypes: (3.20*1.80)/[3.20*1.80+1.24*1.98+(−2.89)*(−0.47)+2.22*1.19+0.16*0.87+(−0.57)*(−0.65)]. Similarly, we calculated that a fifth (19%) of the covariance was due to genetic effects shared with both inattention and hyperactivity-impulsivity, but not IQ, and only 11% of the covariance was due to genetic effects shared among all four phenotypes. These estimates added up to 75%, which was the percentage of the phenotypic correlation between reading difficulties and inattention due to shared genetic effects (Table 3). The remaining 25% percent could be attributed to child-specific environment effects (21% not being shared with hyperactivity-impulsivity or IQ).

Table 4 Statistics for the Relative Fit of the Multivariate Genetic Models
  • Aim 3: To what extent can genetic and environmental effects shared with ADHD symptoms and IQ account for variance in reading difficulties?

Individual differences in reading difficulties scores were largely attributed to genetic factors (heritability h2=74%; Table 3). Of the total variance for reading difficulties, 31% [3.772/(3.772+3.202+1.242+(−2.89)2 +3.172+1.192+0.162+(−0.57)2)] (or 42% of the heritability) was attributed to unique genetic effects (see Fig. 1), 22% to genetic effects shared only with inattention (A3), 3% to genetic effects shared with both inattention and hyperactivity-impulsivity (A2), and 18% to genetic effects shared among all phenotypes (A1). Unique child—specific environment influences accounted for 22% of the total variance in reading difficulties, whereas the sum of child-specific environment influences shared with the other phenotypes accounted for only 4% of the total variance.

  • Aim 4: Are there possible gender differences in the aetiology for the covariation between reading difficulties and inattention symptoms?

The lack of qualitative and quantitative sex differences in the univariate analyses, together with the absence of any indication of sex differences in the cross-twin cross-trait correlations for reading difficulties and inattention symptoms between males (MZ: r = .39, 95% confidence intervals (CI95):.31 to .46; DZ: r = .02, CI95: −.15 to .19) and females (MZ: r = .41, CI95:.34 to .47; DZ: r = .04, CI95: −.12 to .20), provided no indication for further testing this aim in the multivariate analysis.


Using an unselected general population sample, we confirmed previous reports that reading difficulties show higher phenotypic and genetic correlations with ADHD inattention symptoms, compared to hyperactivity-impulsivity symptoms (Martin et al. 2006; Willcutt et al. 2000, 2007b). This study further showed, for the first time, that the covariance between reading difficulties and ADHD inattention symptoms is largely (66%) driven by aetiological factors (genetic, 45%, and child-specific environment, 21%) that are not shared with IQ or hyperactivity-impulsivity. Almost half (47%) of the variance in reading difficulties was attributed to components shared with other phenotypes, mainly genetic influences shared with inattention (22%) or inattention, hyperactivity-impulsivity and IQ (18%), attesting to the multifactorial nature of the deficits underlying reading disability (Bosse et al. 2007; Vellutino et al. 2004).

This study extends findings from bivariate twin studies that used extreme selected groups (Willcutt et al. 2000, 2007b) to the full distribution of ADHD symptom ratings and reading disability scores in the general population. The similarity of findings across selected and general population samples supports the notion that inattention and hyperactivity-impulsivity symptoms and reading disability are the tails of normally distributed traits, reflecting normally distributed risk factors in the population (Chen et al. 2008; Harlaar et al. 2005b; Levy et al. 1997; Shaywitz et al. 1992).

A novel finding in this study was that the aetiological factors linking reading difficulties with inattention symptoms were largely independent from factors affecting general cognitive ability. This is consistent with evidence from behavioural genetic (Saudino and Plomin 2007) and longitudinal studies (Rabiner and Coie 2000; Rabiner and Malone 2004). Moreover, accumulating evidence from reading disability studies has challenged the role of IQ as a useful marker in the diagnosis of reading disability and the prognosis of outcome (Stuebing et al. 2002; Vellutino et al. 2004). For example, IQ-discrepancy scores did not distinguish cases according to the potential benefit they would receive from an intervention program (Vellutino et al. 2004; Vellutino et al. 2000). In a behavioural intervention study, teacher attention ratings predicted reading outcome better than verbal IQ did (Stage et al. 2003).

Another novel finding in this study was that child-specific environment factors (including possible error) were uniquely shared between reading difficulties and inattention symptoms, making a significant contribution (21%) to their covariance. This confirms previous evidence from TEDS showing a child-specific environment contribution to the covariation between a brief ADHD measure and a composite measure of teacher-assessed scholastic achievement (including reading) (Saudino and Plomin 2007). Other studies, though, have reported minimal or no contributions of child-specific environment influences (Martin et al. 2006; Willcutt et al. 2000). Possibly, the use of the full range of scores that is available with an unselected general population sample and the multivariate design may have increased power to detect this effect. Saudino and Plomin (2007) suggested that such factors are likely to operate across family and school environments. An intriguing prospect for future research would be to replicate this finding and identify such factors using objective readings measures, as they might provide useful targets for intervention.

Consistent with previous reports (Saudino and Plomin 2007; Willcutt et al. 2000), our findings did not provide any indication for gender differences in the aetiology of the covariance between reading difficulties and inattention symptoms. Further, while our data confirmed the low phenotypic and genetic correlations between reading difficulties and hyperactivity-impulsivity symptoms, it suggested that the phenotypic association was almost entirely due to shared genetic effects (92%), unlike previous reports (Willcutt et al. 2000, 2007b). It is possible that the use of the full range of scores in an unselected sample may have made possible the detection of this effect in the present study.

The evidence from this and previous studies (Martin et al. 2006; Willcutt et al. 2000, 2007b) is supportive of the idea that pleiotropic gene effects drive the covariance between inattention symptoms and reading difficulties. Further support is provided by molecular genetic studies, which have identified genetic variants likely to increase susceptibility to both reading disability and ADHD or show bivariate linkage to both disorders (Couto et al. 2009; Gayan et al. 2005; Loo et al. 2004; Wigg et al. 2008; Willcutt et al. 2002). Current twin and genetic data cannot distinguish, though, between different causal pathways; phenotypic covariance could result either from deficits in distinct neurocognitive mechanisms underlying each disorder, or deficits in common processes acting as vulnerability factors for both disorders, or both. Existing evidence from neuropsychological, medical treatment and behavioural intervention studies, while consistent with the idea of pleiotropic gene effects, is supportive of both possible patterns of causal relatedness.

Neuropsychological deficits in the comorbid group are mostly a combination of deficits observed when either disorder is present alone (Bental and Tirosh 2007; Rommelse et al. 2009; Shanahan et al. 2006; Willcutt et al. 2001, 2005). Processing speed and rapid naming emerge as strong candidates for shared risk factors, sharing all the common genetic influences contributing to the covariance between performance in objective reading tests and ADHD symptoms (Willcutt et al. 2010a). The same executive and motor endophenotypes have also been associated with both reading difficulties and ADHD (Rommelse et al. 2009).

Pharmacological studies have shown that despite improvements in ADHD symptoms, methylphenidate and atomoxetine effects on reading performance depended on the nature of the drug, and possibly the reading test that was used. Methylphenidate improved reading performance only in cases of reading disability comorbid with ADHD, despite similar improvements in inattention symptoms in single-disorder groups (Keulers et al. 2007). Similarly, a behavioural intervention program failed to improve reading performance only in those cases with both inattention symptoms and reading difficulties (Rabiner and Malone 2004). This evidence is suggestive of deficits in neurocognitive processes specific to the comorbid cases. Improvement in reading performance effected by atomoxetine treatment either did not correlate (or did so weakly) with improvement in ADHD symptoms (Sumner et al. 2009), or atomoxetine did not affect reading performance despite reducing ADHD symptoms (de Jong et al. 2009). This dissociation of atomoxetine effects on reading performance and ADHD symptoms suggests that distinct neurocognitive mechanisms may also contribute to the covariance between these traits.

Nevertheless, alternative explanations would also be consistent with a shared genes account. Methodological/sampling issues, such as symptom overlap or shared method variance could mimic shared gene effects. The similarity of findings across studies using objective measures (Willcutt et al. 2000, 2007b) or parent ratings (this study and Martin et al. 2006) of reading difficulties makes it unlikely that such factors made an important contribution in this study. The use of only parent ratings of reading difficulties is a limitation of this study though. Cross-assortative mating could be another possible confounding factor, yet initial evidence (Faraone et al. 1993) was not replicated (Friedman et al. 2003). Besides, cross-assortative mating would actually decrease estimates of shared genetic effects (Willcutt et al. 2000); evidently, this cannot have played an important role in this study.

Another possible explanation that would mimic shared gene effects is the phenocopy hypothesis, whereby disorder B is caused by the deficits associated with disorder A, in the absence of the normal aetiological antecedents for the former (Pennington et al. 1993). Initially, based on the resemblance of the neurocognitive profile of the comorbid group with that of the pure reading disability group, it was suggested that ADHD is a secondary outcome of reading disability (Pennington et al. 1993). This hypothesis was not supported by later neuropsychological studies though (discussed earlier), or by evidence from separate lines of research. First, it is not consistent with evidence for small to moderate phenotypic and genetic correlations between inattention symptoms and prereading skills in preschool children, before the onset of formal reading instruction (Willcutt et al. 2007a,b). Second, ADHD symptom ratings before or at school entry predicted scholastic achievement in later years (Fantuzzo et al. 2003; Horn and Packard 1985; Merrell and Tymms 2001; Newman et al. 1998; Rabiner and Coie 2000). Third, this and other studies have found that parent as well as teacher ratings of ADHD symptoms predict reading difficulties, showing that this association was not restricted to the classroom (Fergusson and Horwood 1992; Saudino and Plomin 2007).

Yet this evidence is not inconsistent with the hypothesis that attention deficits could lead to reading difficulties. This idea makes intuitive sense, as it is plausible that inattention problems might interfere with reading instruction. It is known that inadequate experience or instruction in reading accounts for most cases of early reading difficulties, which can be rectified through additional tutoring (Vellutino et al. 2004). Such deficiencies would mimic deficits observed in cases of pure reading disability, which reflect cognitive deficits of biological origin (Vellutino et al. 2004). Inattention symptoms do indeed moderate the effect of early tutoring on reading development (Stage et al. 2003), and studies have shown that early inattention deficits could predict later reading achievement even after controlling for concurrent reading ability (Rabiner and Coie 2000; Rabiner and Malone 2004). In particular, Rabiner and Coie (2000) found that one third of a group of inattentive kindergarten children whose standardized reading achievement scores at grade 5 were substantially (>1 SD) below those of their peers had normal reading scores at the post-kindergarten assessment. However, existing evidence suggests that reading difficulties in the context of inattention symptoms may not be caused by inadequate tutoring, given the failure of a behavioural intervention program to improve reading performance only in the comorbid cases (Rabiner and Malone 2004).

The findings from the current study can only apply to reading difficulties and ADHD inattention symptoms as reflected in parent and teacher ratings; they cannot be extrapolated to any specific process in either domain. ADHD inattention ratings represent pervasive inattentiveness across different domains in real life over an extended period of time. Therefore our conclusions cannot be extended to any specific cognitive attention process, as overt behavioural problems cannot be linked in a simple and direct manner to possible underlying deficits in cognitive attention processes, while similar manifestations of inattention problems may reflect diverse cognitive deficits (Marzocchi et al. 2009; Warner-Rogers et al. 2000). Studies using specific cognitive attention measures may lead to different conclusions regarding the extent of shared aetiological influences with IQ, depending on what particular measure in used (Zumberge et al. 2007).

It is important to understand the causal links between inattention problems or deficits in cognitive attention processes and the development of normal reading skills or the act of reading. Reading disability is a multifactorial disorder (Bosse et al. 2007; Vellutino et al. 2004), and for behavioural interventions to be effective they need to address the specific aetiological factors involved in each case (e.g. focusing on self-regulation versus targeting a specific neurocognitive process) (Rabiner and Malone 2004; Stage et al. 2003). Inattention is unlikely to be associated with reading difficulties only at the behavioural level and, in line with existing research, our findings are consistent with the idea that specific neurocognitive processes underlie the covariance between these two traits, which reflects shared genetic and child-specific environment influences that are largely independent from IQ. Indeed, cognitive attentional processes are being recognized as being involved in many stages of the reading process (Bosse et al. 2007; Reynolds and Besner 2006; Shaywitz and Shaywitz 2008; Vidyasagar and Pammer 2010). In the future we need evidence from longitudinal twin studies, measuring objectively a range of specific reading skills as well as of processes likely to contribute to behavioural inattention problems. Such studies will identify with greater specificity the neurocognitive processes that link behavioural inattention with deficits in specific reading skills, and examine causal relationships between these traits though development, as well as examine whether the contribution of IQ differs throughout development.