Introduction

A robust positive correlation between height and intelligence, as measured by IQ tests,Footnote 1 has been established in the literature with little consensus regarding its cause (Humphreys et al. 1985; Johnson 1991; Kanazawa and Reyniers 2009; Tanner 1979; Teasdale et al. 1989; Wheeler et al. 2004). Both environmental and genetic explanations have been advanced. For instance, previous research has found that markers of prenatal quality and nutritional status during childhood are associated with height (Eide et al. 2005; Steckel 1995) and cognition in adulthood (See Gomez-Pinilla 2008; Martyn et al. 1996; Seidman et al. 1992; Sørensen et al. 1997), suggesting that early environmental factors may be responsible for the correlation between height and intelligence (Abbott et al. 1998). Other evidence in support of environmental channels is the decline in the correlation between height and intelligence over time observed in the Scandinavian countries (Teasdale et al. 1989; Sundet et al. 2005; Tuvemo et al. 1999). On the other hand, reported heritability estimates in adults for measures of intelligence (e.g. 75%) (Neisser et al. 1996) and height (e.g. 87–93%) (Silventoinen et al. 2003) are high, suggesting that the relationship may instead be genetically mediated. For example, growth hormone deficiency, which is sometimes caused by genetic mutations, is characterized by both short stature and cognitive impairments (van Dam et al. 2005). Evidence of a shared genetic architecture between brain volume, intracranial space, and height (Posthuma et al. 2000) may also be interpreted as evidence of genetic mediation of the height–intelligence correlation.

Two studies, using twin data, have attempted to estimate the components of the height–intelligence correlation that are due to genetic and environmental effects. The results were mixed. Silventoinen et al. (2006) found in several samples of Dutch twins that the association between height and intelligence is primarily genetic in origin. Sundet et al. (2005), using conscription data from a considerably larger and more representative sample of Norwegian twins, found that the association between height and intelligence is primarily explained by common environmental factors. Both papers use a standard bivariate ACE model to arrive at these conclusions.

The present study makes several contributions toward elucidating the sources of covariation between height and intelligence. First, we extend the bivariate ACE model to allow for assortative mating and derive a general formula for decomposing a cross-trait genetic correlation into components attributable to assortative mating and pleiotropy. This provides a clear theoretical framework for studying the correlation between two traits and is particularly useful when examining phenotypes for which there is high assortative mating, height and intelligence being prime examples. The model demonstrates how sensitive estimates from the bivariate ACE model are to assumptions about assortative mating. Second, we use a sample of male Swedish twins matched to conscription records to examine the relative importance of genetic and environmental factors in explaining the association between height and intelligence. The sample used is the largest to date for such a study. It includes an additional measure of cognitive function other than intelligence, measured during enlistment through interviews by a military psychologist. This measure, which we label military aptitude, has a strong predictive validity for labor market outcomes independent of intelligence, such as wages, earnings and unemployment (Lindqvist and Vestman forthcoming). We apply the standard bivariate ACE model to decompose the height–intelligence and the height–military aptitude correlations, but we caution that the resulting estimates are very sensitive to assumptions about assortative mating and illustrate the sensitivity of these estimates by reporting results for different assumed levels of assortative mating. Lastly, we report a significant within-family height–intelligence correlation \(\left(\hat{\rho} = 0{\text{.}}10\right),\) consistent with the hypothesis that pleiotropy (or linkage) accounts for part of the genetic correlation.

Method

Sample

Our data comes from two main sources: the Swedish Twin Registry (STR) and the Swedish National Service Administration (SNSA). The STR contains information on nearly all twin births in Sweden since 1886, and has been described in further detail elsewhere (Lichtenstein et al. 2006). The sample includes those individuals who have participated in at least one of the Twin Registry’s surveys. The primary datasource is SALT (Screening Across the Lifespan Twin study). This was a survey administered to all Swedish twins born between 1926 and 1958 and attained a response rate of 74%. Fifty percent of the subjects in the dataset are from the SALT cohort. The secondary source is the web-based survey STAGE (The Study of Twin Adults: Genes and Environment). This was a web-based survey administered between November 2005 and March 2006 to all twins born in Sweden between 1959 and 1985. It attained a response rate of 61%. Approximately 30% of our subjects are drawn from STAGE. Our final datasource comes from a survey sent out in 1973 to the same cohort as SALT (Lichtenstein et al. 2006).

We matched the Swedish twins to the conscription data provided by SNSA. All Swedish men are required by law to participate in a nationwide military conscription at the age of 18. Before 1990, exemptions were very rare. The actual drafting procedure can take several days during which recruits undergo medical and psychological examination. The basic structure of the administered intelligence test has remained unchanged during our study period, though minor changes took place in 1980 and 1994. Recruits take four subtests (logical, verbal, spatial and technical) which, for most of the study period, are graded on a scale from 0 to 40. These raw scores are converted to a ordinal variable ranging from 1 to 9. Carlstedt (2000) discusses the history of psychometric testing in the Swedish military and provides evidence that this test of intelligence is a good measure of general intelligence. Thus, this test differs from the AFQT, which focuses more on “crystallized” intelligence.

All conscripts also see a psychologist for a structured interview. The psychologist has access to background information on the interviewee, such as school grades, medical background, cognitive ability and answers to a battery of questions on friends, family, and life. In conducting the interview, the psychologist is required to follow a manual and, ultimately, to make an assessment of the prospective recruit’s capacity to handle stress in a war situation. In making the assessment, the psychologist considers an individual’s ability to function in a group, adapt to new environments, as well as his persistance and emotional stability. Motivation for doing the military service is not among the set of characteristics that is considered beneficial for succeeding in the military. The psychologist assigns each interviewee an ordinal score from 1 to 9, but again these are constructed from four raw scores, this time ranging from 1 to 5. Like the intelligence test score, the military aptitude score is subject to measurement error because of random influences on conscript performance and because conscripts may differ in their motivation for the military service.Footnote 2 Lindqvist and Vestman (forthcoming) provide a more detailed description of the personality measure used by SNSA.

For both intelligence and military aptitude, the raw scores underlying an individual’s ordinal score are available to us. The raw scores are percentile rank-transformed and then converted by taking the inverse of the standard normal distribution to produce normally distributed test scores. The transformation is done separately for each year, but when less than 100 pairs are available for a particular year, two adjoint years are pooled. Since the sample of women who enlist in the military comprises a small and highly self-selected group, this paper focuses exclusively on men. We restrict the sample to all male twin pairs for whom complete data on cognitive ability and military aptitude is available for both twins. This leaves 1246 pairs of monozygotic (MZ) twins and 1568 pairs of dizygotic (DZ) twins for analysis, all of which are born between 1950 and 1976. Descriptive statistics for the twins in our sample are presented in Table 1, disaggregated by zygosity. For expositional convenience, we report the ordinal instead of the normalized scores in the Table, even though the latter are used in the analyses that follow.

Table 1 Background variables

The bivariate ACE model, assortative mating and the cross-trait genetic correlation

The bivariate ACE model

We follow biometrical genetic theory (Falconer and Mackay 1981) and previous papers on this subject (Silventoinen et al. 2006; Sundet et al. 2005) to decompose the variance of a phenotype into shares attributable to additive genetic factors, common environment, and individual environment. Our empirical analysis uses the standard bivariate ACE model (Neale and Cardon 1992). Let \(Y^{i}=\left( Y_{1}^{i},Y_{2}^{i}\right)^{\prime}\) be a vector of two observable phenotypes of individual i, and suppose that

$$ Y^{i}={\bf a}A^{i}+{\bf c}C^{i}+{\bf e}E^{i}, $$

where a, c, and e are 2 × 2 matrices of coefficients and where \(A^{i}=\left( A_{1}^{i},A_{2}^{i}\right)^{\prime},\, C^{i}=\left( C_{1}^{i},C_{2}^{i}\right)^{\prime}\) and \(E^{i}=\left( E_{1}^{i},E_{2}^{i}\right)^{\prime}\) are, respectively, the latent additive genetic, common environmental, and individual environmental factors underlying traits \(Y_{1}^{i}\) and \(Y_{2}^{i}.\) Throughout, we make the standard assumption that A,  C, and E are mutually independent. The model assumes that all genetic variance is additive, thereby ruling out dominance and epistasis. Suppose further, without loss of generality, that all the variables have been standardized to have mean zero and unit variance. The correlation between the traits of two individuals i and j will then be equal to

$$ {\bf E} \left[ Y^{i}Y^{j \prime}\right] ={\bf aE}\left[ A^{i}A^{j\prime}\right] {\bf a}^{\prime}+{\bf cE}\left[ C^{i}C^{j\prime}\right] {\bf c}^{\prime}+{\bf eE}\left[ E^{i}E^{j\prime}\right] {\bf e}^{\prime}, $$
(1)

where E denotes the expectations operator. It follows from the standardization that the expectation of the product of two variables is simply their correlation. Table 2 summarizes a set of assumptions about E[A i A j],  E[C i C j] and E[E i E j]. These assumptions are customary in the behavior genetics literature, except for the presence of the \({\Upgamma}_{M}\) matrix which, as we will show, accounts for assortative mating at the additive genetic level. The usual bivariate ACE model does not account for assortative mating and thus implicitly assumes that \(\Upgamma_{M}\) is a zero matrix. In the next section of the paper, we show that assortative mating enters the model this way, through \(\Upgamma_{M}.\)

Table 2 Assumed values of E[A i A j′],  E[C i C j′] and E[E i E j′] for different relationships between individuals I and J

Behavior geneticists have previously studied models which allow for assortative mating, but they were mostly concerned with the case of a single phenotype (see, for instance, Eaves et al. 1978; Eaves and Heath 1981; Martin et al. 1986; Keller et al. 2009). Eaves et al. (1984) consider the multivariate case and develop a model in which there is assortative mating on a latent phenotype. We add to this literature by deriving formulas that describe, among others, the effects of assortative mating on the genetic correlation between two traits and by augmenting the bivariate ACE model to account for these effects.

The \({ \Upgamma }\) matrices in Table 2 have the following elements:

$$ \begin{aligned} \Upgamma_{A} & = \left(\begin{array}{cc} 1 & \rho_{A} \\ \rho_{A} & 1 \end{array} \right), \Upgamma_{C}=\left(\begin{array}{cc} 1 & \rho_{C} \\ \rho_{C} & 1 \end{array}\right), \Upgamma_{E}=\left(\begin{array}{cc} 1 & \rho_{E} \\ \rho_{E} & 1 \end{array}\right) \\ \hbox{and }\Upgamma_{M} &= \left(\begin{array}{cc} m_{11} & \bar{m}_{12} \\ \bar{m}_{12} & m_{22} \end{array} \right). \end{aligned} $$

As we show in the next section, \(\bar{m}_{12}=\frac{m_{12}+m_{21}}{2},\) where m kl is the correlation between the latent additive genetic factors underlying phenotype Y k in fathers and phenotype Y l in mothers (k,l \(\epsilon\){1, 2}). Thus, there is positive assortative mating at the additive genetic level for phenotypes k and l if m kl  > 0 and/or m lk  > 0.

The resulting model has 19 free parameters: ρ A ,  ρ C ,  ρ E ,  m 11,  m 12,  m 21,  m 22 and 4 free parameters for each of the matrices a, c and e. However, only nine moments can be computed from the data: the cross-trait covariance between the MZ twins and, for each trait, the covariance between the MZ twins; the corresponding three moments for the DZ twins; and the population variances of the two traits as well as the population cross-trait covariance. Since the number of parameters exceeds the number of independently informative equations, at least ten identifying assumptions need to be made. In the standard decomposition, it is assumed that a, c, and e are diagonal (i.e. that a 12 = a 21 = c 12c 21 = e 12 = e 21 = 0) and that \(\Upgamma_{M}=0.\) Under these assumptions, the remaining parameters are identified. The substantive implication of the diagonality assumption is that, while the latent factors underlying the two traits may be correlated, each latent factor may only influence its respective trait.Footnote 3 The restriction that \(\Upgamma_{M}=0\) means that there is no assortative mating (including cross-trait assortative mating) for phenotypes Y 1 and Y 2.Footnote 4 Below, we also discuss the consequence of assuming different, more realistic values of \(\Upgamma_{M}.\)

Estimates of the shares of the observed variance in either trait attributable to additive genetic factors (a 2), common environment (c 2), and unique environment (e 2) can be obtained from the above parameters. Examination of Eq. 1 and some algebra reveals that

$$ Var(Y_{k})={\bf E}\left[\left( Y_{k}^{i}\right)^{2}\right]=1=(a_{kk})^{2}+(c_{kk})^{2}+(e_{kk})^{2}. $$

We thus see that \(a_{k}^{2}=a_{kk}^{2},\, c_{k}^{2}=c_{kk}^{2},\) and \(e_{k}^{2}=e_{kk}^{2}\, (k=1,2).\) It also follows from Eq. 1 that the within-individual cross-trait correlation is given by

$$ corr(Y_{1},Y_{2})={\bf E}\left[ Y_{1}^{i}Y_{2}^{i}\right] =a_{11}a_{22}\rho_{A}+c_{11}c_{22}\rho_{C}+e_{11}e_{22}\rho_{E}. $$
(2)

The shares of the cross-trait phenotypic correlation attributable to additive genetic factors, common environment, and unique environment are thus \(\frac{a_{11}a_{22}\rho_{A}}{corr(Y_{1},Y_{2})}, \, \frac{c_{11}c_{22}\rho_{C}}{corr(Y_{1},Y_{2})},\) and \(\frac{e_{11}e_{22}\rho_{E}}{corr(Y_{1},Y_{2})},\) respectively.Footnote 5 These shares depend on the correlations between the latent factors underlying both traits (ρ A , ρ C , and ρ E ) and on the shares of observed phenotypic variance explained by each of the latent factors (a 2,  c 2, and e 2).

Assortative mating in the bivariate ACE model

We now show that assortative mating enters the bivariate ACE model as described in the previous section. For this, it will be useful to augment the model to include individual i’s father and mother. We assume that we are in genetic equilibrium and that all the parameters of the model are fixed across generations. We can write

$$ {\bf E}\left[A_{k}^{i}|A_{k}^{F_{i}},A_{k}^{M_{i}}\right]=\frac{1}{2} \left(A_{k}^{F_{i}}+A_{k}^{M_{i}}\right)\Longrightarrow A_{k}^{i}=\frac{1}{2} \left(A_{k}^{F_{i}}+A_{k}^{M_{i}}\right)+\epsilon_{k}^{i}, $$
(3)

where, as above, \(A_{k}^{i}\) is the latent additive genetic factor underlying phenotype \(Y_{k}^{i}\,(k=1,2)\) for individual i; F i and M i refer to i’s father and mother, respectively; and \(\epsilon_{k}^{i}\) is an error term independent of the parental genotypes. We rewrite the error term as

$$ \epsilon_{k}^{i}=\theta_{k}S_{k}^{i}, $$
(4)

where θ k is a normalizing constant and \({\bf E}\left[\left( S_{k}^{i}\right)^{2}\right]=1,\) thus adhering to the convention of working with standardized variables. The expression \(\theta_{k}S_{k}^{i}\) is the deviation due to Mendelian segregation (Otto et al. 1994). Finally, we let \({\bf E}\left[S_{1}^{i}S_{2}^{i}\right]=\rho_{S}\).Footnote 6 The first equality in Eq. 3 implies that \({\bf E} \left[S_{k}^{i}|A_{k}^{F_{i}}\right]={\bf E}\left[S_{k}^{i}|A_{k}^{M_{i}}\right]={\bf E} \left[S_{k}^{i}\right]=0,\) and thus \({\bf E}\left[S_{k}^{i}A_{k}^{F_{i}}\right]={\bf E} \left[S_{k}^{i}A_{k}^{M_{i}}\right]=0.\) For any pair of full siblings (including DZ twins) i and j, it follows that

$$ \begin{aligned} {\bf E}\left(A_{k}^{i}A_{l}^{j}\right) & = {\bf E}\left[ \left( \frac{1}{2} \left(A_{k}^{F_{i}}+A_{k}^{M_{i}}\right)+\theta_{k}S_{k}^{i}\right) \left( \frac{1}{2} \left(A_{l}^{Fj}+A_{l}^{M_{j}}\right)+\theta_{l}S_{l}^{j}\right) \right] \\ & = \frac{1}{4}\left( {\bf E}\left[ A_{k}^{F_{i}}A_{l}^{F_{j}}\right] + {\bf E}\left[ A_{k}^{M_{i}}A_{l}^{M_{j}}\right] + {\bf E}\left[ A_{k}^{F_{i}}A_{l}^{M_{j}}\right] + {\bf E}\left[ A_{k}^{M_{i}}A_{l}^{F_{j}}\right] \right) \\ & = \frac{1}{4} \left( \rho_{A,kl}+\rho_{A,kl}+m_{kl}+m_{lk}\right) ={{1 }\over {2}}\left( \rho_{A,kl}+\bar{m}_{kl}\right), \end{aligned} $$

where the second equality holds because \(S_{k}^{i}\) and \(S_{l}^{j}\) are uncorrelated with each other and with the other variables.Footnote 7 The third equality follows from the fact that F i  = F j and M i  = M j and from our assumption of genetic equilibrium, which implies that \({\bf E}\left[\left(A_{k}^{i}\right)^{2}\right]=1\) is constant across generations and thus that \({\bf E} \left[A_{k}^{i}A_{l}^{i}\right]=Cov\left(A_{k}^{i},A_{l}^{i}\right)=\rho_{A,kl}.\) Here, ρA,kl is the correlation between the latent additive genetic factors underlying phenotypes k and l.Footnote 8 Therefore, \({\bf E}\left[A^{i}A^{j\prime}\right]=\frac{1}{2} \left(\Upgamma_{A}+\Upgamma_{M}\right) .\)

So far, we have only considered assortative mating at the additive genetic level. The parameters of the matrix of genetic assortative mating correlations \(\Upgamma_{M}\) are not directly observable, so it would be useful to have a mapping relating \(\Upgamma_{M}\) to observable parameters. Unfortunately, to our knowledge, no one has yet derived such a mapping for the general case of unconstrained multivariate assortative mating.Footnote 9 Gianola (1982)Footnote 10 considers two special cases of interest to the livestock industry. The first case is when assortative mating is actively practiced on one phenotype only and a second phenotype is genetically correlated with the first due to pleiotropy (or linkage)—such as when, for instance, large bulls are mated with large cows, and the second trait is genetically correlated with cattle size. The second case is when mating pairs are assorted to have a certain correlation between phenotype X in males and phenotype Y in females—such as when, to use Gianola’s example, high milk production females are mated to fast growing males. Assortative mating for height and intelligence in humans is more complex and unlikely to fit either of these cases. We leave the derivation of a general mapping to future research. However, the above discussion should make it clear that it is important to investigate how sensitive the results from the standard bivariate decomposition are to the assumption that \(\Upgamma_{M}\) is equal to zero for traits with high assortative mating.

We now turn our attention to three additional features of the model which merit further exploration. First, we show that in the augmented model (in which the elements of the \(\Upgamma_{M}\) matrix are not constrained to be zero), the correlation between two traits can be decomposed into parts attributable to assortative mating, pleiotropy, common environment, and individual environment. Second, we investigate the bias which arises if a standard bivariate ACE model is estimated in the presence assortative mating. Finally, we consider how within-family correlations can be used to shed light on the sources of a phenotypic correlation.

Decomposition of ρ A

Observe that

$$ \begin{aligned} 1 &= {\bf E}\left[\left(A_{k}^{i}\right)^{2}\right] ={\bf E}\left[ \left( \frac{1}{2}\left( A_{k}^{Fi}+A_{k}^{M_{i}}\right) +\theta_{k}S_{k}\right)^{2} \right] \\ &=\frac{1}{2}\left(1+m_{kk}\right) +\left( \theta_{k}\right)^{2}\Longrightarrow \theta_{k}=\sqrt{\frac{1-m_{kk}}{2}},\, (k=1,\,2), \end{aligned} $$

and that

$$ \begin{aligned} \rho_{A} &={\bf E}\left[ A_{1}^{i}A_{2}^{i}\right] = {\bf E}\left[\left( \frac{1}{2}\left(A_{1}^{Fi}+A_{1}^{M_{i}}\right) +\theta_{1}S_{1}\right) \cdot \left( \frac{1}{2}\left( A_{2}^{Fi}+A_{2}^{M_{i}}\right) +\theta_{2}S_{2}\right) \right] ,\\ &=\frac{1}{2}\left( \rho_{A}+\bar{m}_{12}\right) +\theta_{1}\theta_{2}\rho_{S}. \end{aligned} $$

It follows that

$$ \rho_{A}=\bar{m}_{12}+\sqrt{( 1-m_{11}) ( 1-m_{22})} \rho_{S}. $$
(5)

Therefore, in equilibrium, the correlation between the latent additive genetic factors underlying phenotypes Y 1 and Y 2 is equal to the sum of a term accounting for cross-trait assortative mating and a term accounting for the genetic correlation arising from pleiotropy (or linkage). In the limiting case where there is no assortative mating, \(\rho_{A|m_{11}= \bar{m}_{12}=m_{22}=0}=\rho_{S}.\) Without assortative mating, the genetic correlation must be entirely attributable to pleiotropy (or linkage), and thus ρ s is the genetic correlation that is attributable to pleiotropy (or linkage). We can thus rewrite Eq. 5 as

$$ \rho_{A}=\bar{m}_{12}+\sqrt{(1-m_{11}) (1-m_{22})} \rho_{A,Pleiotropy} $$
(6)

In the limiting case where there is no pleiotropy, \(\rho_{A|\rho_{S}=0}= \bar{m}_{12}\) and the genetic correlation is entirely due to cross-trait assortative mating. Substituting Eq. 6 into Eq. 2 yields

$$ \begin{aligned} corr(Y_{1},Y_{2}) =& a_{11}a_{22}\bar{m}_{12}+a_{11}a_{22}\sqrt{( 1-m_{11}) ( 1-m_{22}) }\rho_{A,Pleiotropy}\\ &+c_{11}c_{22}\rho_{C}+e_{11}e_{22}\rho_{E}. \end{aligned} $$
(7)

The correlation between phenotypes Y 1 and Y 2 can thus be decomposed into parts attributable to assortative mating, pleiotropy, common environment, and individual environment.

A lower bound for the share of the genetic correlation

Observe that for a pair of MZ twins,

$$ {\bf E}^{MZ}\left[Y_{i}Y_{j}^{\prime}\right]={\bf a}\Upgamma_{A}{\bf a}^{\prime } +{\bf c}\Upgamma_{C}{\bf c}^{\prime}, $$

and that for a pair of DZ twins,

$$ {\bf E}^{DZ}\left[Y_{i}Y_{j}^{\prime}\right]=\frac{1}{2}{\bf a}\left( \Upgamma_{A}+\Upgamma_{M}\right) {\bf a}^{\prime } +{\bf c}\Upgamma_{C}{\bf c}^{\prime}, $$

implying that

$$ {\bf a}\Upgamma_{A}{\bf a}^{\prime }=2\left( {\bf E}^{MZ}\left[Y^{i}Y^{j\prime}\right]-{\bf E}^{DZ}\left[Y^{i}Y^{j\prime}\right]\right) +{\bf a }\Upgamma_{M}{\bf a}^{\prime}. $$

Computing the off-diagonal elements of the symmetric matrices on both sides of the above equation gives

$$ a_{11}a_{22}\rho_{A}=2\left( {\bf E}^{MZ}\left[Y_{1}^{i}Y_{2}^{j}\right]-{\bf E}^{DZ}\left[Y_{1}^{i}Y_{2}^{j}\right]\right) +a_{11}a_{22}\bar{m}_{12}. $$

Interestingly, when there is no cross-trait assortative mating at the genetic level \(\left(\bar{m}_{12}=0\right),\) the share of the cross-trait phenotypic correlation attributable to additive genetic factors \(\left(\frac{ a_{11}a_{22}\rho_{A}}{corr(Y_{1},Y_{2})}\right) \) does not depend on the same-trait assortative mating genetic correlations (m 11 and m 22). Thus, under the maintained assumptions of the model, if cross-trait assortative mating is nonnegative at the genetic level \(\left(\bar{m}_{12}\geq 0\right),\) estimates from the standard bivariate ACE model still provide a lower bound for the share of the cross-trait phenotypic correlation attributable to additive genetic factors. The cross-trait genetic correlation itself (ρ A ) is however a function of both the cross-trait and same-trait assortative mating genetic correlations, since the coefficients a 11 and a 22 depend on the latter.

Within-family analysis

The within-family correlation between traits Y 1 and Y 2 is defined as \(corr_{WF}(Y_{1},Y_{2})=corr\left(Y_{1}^{i}-Y_{1}^{j},Y_{2}^{i}-Y_{2}^{j}\right),\) where individuals i and j are either non-twin siblings or DZ twins. Thus,

$$ \begin{aligned} corr_{WF}(Y_{1},Y_{2})&= \frac{{\bf E}\left[ \left( Y_{1}^{i}-Y_{1}^{j}\right) \left( Y_{2}^{i}-Y_{2}^{j}\right) \right] }{\sqrt{ {\bf E}\left[Y_{1}^{i}-Y_{1}^{j}\right]^{2}{\bf E}\left[Y_{2}^{i}-Y_{2}^{j}\right]^{2}}}\\ &=\frac{{\bf E}\left[ \left( a_{11}A_{1}^{i}+e_{11}E_{1}^{i}-a_{11}A_{1}^{j}-e_{11}E_{1}^{j}\right) \left(a_{22}A_{2}^{i}+e_{22}E_{2}^{i}-a_{22}A_{2}^{j}-e_{22}E_{2}^{j}\right) \right]}{\sqrt{{\bf E} \left[a_{11}A_{1}^{i}+e_{11}E_{1}^{i}-a_{11}A_{1}^{j}-e_{11}E_{1}^{j}\right]^{2}{\bf E}\left[a_{22}A_{2}^{i}+e_{22}E_{2}^{i}-a_{22}A_{2}^{j}-e_{22}E_{2}^{j}\right]^{2}}}\\ &= \frac{{\bf E}\left[ a_{11}\theta_{1}\left( S_{1}^{i}-S_{1}^{j}\right) a_{22}\theta_{2}\left( S_{2}^{i}-S_{2}^{j}\right) +e_{11}\left( E_{1}^{i}-E_{1}^{j}\right) e_{22}\left( E_{2}^{i}-E_{2}^{j}\right) \right] }{\sqrt{{\bf E}\left[ a_{11}\theta_{1}\left( S_{1}^{i}-S_{1}^{j}\right) +e_{11}\left( E_{1}^{i}-E_{1}^{j}\right) \right]^{2}{\bf E}\left[ a_{22}\theta_{2}\left( S_{2}^{i}-S_{2}^{j}\right) +e_{22}\left( E_{2}^{i}-E_{2}^{j}\right) \right]^{2}}}\\ &=\frac{a_{11}a_{22}\theta_{1}\theta_{2}\rho_{S}+e_{11}e_{22}\rho_{E}}{\sqrt{\left(\left( a_{11}\theta_{1}\right)^{2}+\left( e_{11}\right)^{2}\right)\cdot \left(\left( a_{22}\theta_{2}\right)^{2}+\left( e_{22}\right)^{2}\right)}}. \end{aligned} $$
(8)

Therefore, cross-trait assortative mating does not affect the within-family correlation. Unless ρ S and ρ E have different signs, the empirical observation that corr WF (Y 1Y 2) ≅ 0 would rule out pleiotropy (or linkage) as a source of genetic correlation, thereby suggesting that assortative mating is responsible. Lastly, observe that expression Eq. 8 collapses nicely to Corr WF (Y 1Y 2) ≅  ρ S  = ρA,Pleiotropy when e 11 ≅ e 22 ≅ 0.

Results

The correlation between height and intelligence is 0.176 (SE 0.015) in our sample, which is similar to what has been previously reported in the literature. The correlation between height and military aptitude is 0.101 (SE 0.016). Estimates (and their asymptotic standard errors) of the parameters in the standard bivariate ACE model without assortative mating are reported in Table 3, separately for the height–intelligence and the height–military aptitude correlations. The analyses were run in MPLUS (Muthén and Muthén 2006), a numerical optimizer often used in behavior genetics. We followed standard practice and used maximum likelihood estimation under the assumption that the phenotypic variables come from a bivariate normal distribution.

Table 3 Results from the estimation of the bivariate ACE model for height–intelligence and for height–military aptitude

Results suggest that 77%, 16% and 7% of the observed variance in height is attributable to additive genetic, common environmental, and unique environmental factors, respectively. The corresponding figures are 62%, 20%, and 18% for intelligence, and 55%, 11%, and 35% for military aptitude. Our point estimates also suggest that the correlation between the latent genetic factors underlying the height and intelligence phenotypes (ρ A ) is a rather modest 0.08, and that the corresponding correlation from the height–military aptitude decomposition is 0.09. We obtain much larger estimates for the common environmental correlations (ρ C ): 0.59 for height–intelligence and 0.23 for height–military aptitude, but we note the fairly low precision of these estimates. Finally, we obtain modest but very significant estimates for the unique environment correlations (ρ E ) for both variance decompositions.

The estimated shares of the phenotypic cross-trait correlations accounted for by the covariation between A 1 and A 2,   C 1 and C 2, and E 1 and E 2 are presented in Table 4, along with their asymptotic standard errors, for both the height–intelligence and the height–military aptitude cases.

Table 4 Shares of the height–intelligence and height–military aptitude correlations accounted for by the three latent factors

These results suggest that 59% of the height–intelligence correlation is mediated by common environment, that 31% is due to additive genetic factors, and that 9% is due to unique environment. The results of the variance decomposition for height and military aptitude suggest that most of the correlation is accounted for by additive genetic factors (56%); the estimated share due to common environment appears quite large at 30%, but is not significantly different from zero.

Given the strong evidence for assortative mating for both height and intelligence, it is interesting to ask how sensitive these results are to the assumption that \(\Upgamma_{M}\) is a zero matrix. Estimates of the spousal phenotypic correlations for height and intelligence are both large and positive. For example, Bouchard and McGue (1981), in a review of the literature, report an average spousal correlation of 0.33 for intelligence, while Silventoinen et al. (2003) and Vandenberg (1972) report spousal correlations in the neighborhood of 0.30 for height. We know of no published estimates of the spousal height–intelligence phenotypic correlation, but data available to us from the Minnesota Twin Family Study suggests that the correlation is around 0.04.Footnote 11 Though there exists no mapping relating \(\Upgamma_{M}\) to observable parameters (as we noted above), we hope this information about spousal resemblance can provide some guidance as to what might be plausible parameter values. To have a better idea of the potential significance of the biasing effect of neglecting assortative mating, we estimated our augmented bivariate ACE model for several assumed values of \(\Upgamma_{M}.\) The results of this exercise are reported in Table 5.

Table 5 Parameter estimates for height–intelligence under different assumed values of the \(\Upgamma_{M}\) matrix

These results demonstrate that the point estimates from the standard bivariate ACE model are sensitive to assumptions about the spousal cross-trait genetic correlation. As the values of the off-diagonal parameters and the values of the diagonal parameters of \(\Upgamma_{M}\) matrix are progressively increased from 0 to 0.08 and from 0.10 to 0.25, respectively, the value of the genetic share goes from 0.314 to 0.905 and the value of the common environmental share declines from 0.592 to zero. These changes are quite dramatic and illustrate the sensitivity of the model to even small departures from the assumptions on which it is based. Nonetheless, as we demonstrated analytically above, in the presence of positive cross-trait assortative mating, the standard bivariate ACE model still provides a lower bound for the share of the cross-trait phenotypic correlation attributable to additive genetic factors. Table 5 also illustrates another well-known result, namely that as the diagonal elements of the \(\Upgamma_{M}\) matrix are progressively increased, the estimated heritabilities rise at the expense of the estimated shared environmental variance components.

Our framework also showed how the within-DZ correlation can be used to help shed light on the sources of the phenotypic correlation between height and intelligence. In our sample, the within-family height–intelligence correlation is 0.10 (SE 0.017). The within-family height–military aptitude correlation is 0.09 (SE 0.022).

Discussion

Our point estimates from the standard bivariate ACE model with zero assortative mating suggest that both environmental and genetic factors are responsible for the height–intelligence and the height–military aptitude correlations. Specifically, they suggest that common environment and additive genetic effects account for 59% and 31% of the height–intelligence correlation, respectively; the corresponding numbers for the height–military aptitude correlation are 30% and 56%. Our height–intelligence results are strikingly similar to those of Sundet et al. (2005): applying the standard bivariate ACE model to a sample of Norwegian conscripts, they found that the correlation between the latent common environmental factors which underlie height and intelligence was 0.56, only marginally lower than our point estimate of 0.59. They also estimated the heritability of height at 0.76 and the heritability of intelligence at 0.64, while our own estimates are 0.77 and 0.63, respectively.Footnote 12

This finding contrasts with those reported by Silventoinen et al. (2006) who found that for all cohorts in which there was an association between height and intelligence, the association was explained entirely by additive genetic correlation. One possible reason for the discrepancy between our results and those reported by Silventoinen et al. (2006) is the difference between the samples. Silventoinen et al. (2006) perform independent analyses on four cohorts of Dutch twins ranging in age from children to middle-age adults and in size from 156 to 567 twin pairs.

However, a more likely explanation is that Silventoinen et al. (2006) use a different model selection procedure. Procedurally, they do the following: (1) for each cohort, they begin by fitting univariate models to the data for both height and intelligence; (2) they compare the χ2-goodness-of-fit statistics of the models and select the best-fitting model for each trait; and (3) they use those best-fitting models for all subsequent analyses. They find that the univariate AE model offers adequate fit in every cohort for height and in almost every cohort for intelligence. Thus, in their bivariate model, they only include latent factors for additive genetic effects and for unique environment for height for every cohort, and they do likewise for intelligence in almost every cohort. As a result, there is no ρ C parameter in any of their bivariate models. Moreover, it appears that they drop the ρ E parameter from their model whenever this improves the fit. In other words, they decompose the height–intelligence correlation either into additive genetic and individual environmental effects, or uniquely into additive genetic effects. Dropping ρ C —the most important component of the height–intelligence correlation in both in Sundet et al.’s (2005) data and the results reported here—from their model, and sometimes also dropping ρ E , quite mechanically leads to the conclusion that ρ A is an important source of the correlation.Therefore, the confidence intervals presented by Silventoinen et al. (2006) do not reflect the uncertainty that stems from the model-selection procedure that is employed. It is not clear to us why the fact that a parameter is not statistically significant should justify the restriction that the parameter is equal to zero. This point is further elaborated on by Goldberger (2002).

We also derived the bivariate ACE model for the more general and realistic case where there is assortative mating at the genetic level. Sundet et al. (2005, p. 310) and Silventoinen et al. (2006, p. 587) note that their estimates may be biased if there is assortative mating for the studied traits. Our analysis demonstrates the sensitivity of the estimates from the bivariate ACE models to one of several problematic assumptions. Specifically, changing the assumed values of the cross-trait and same-trait spousal genetic assortative mating correlations from 0 to 0.08 and from 0.10 to 0.25, respectively, raises the estimated share of the height–intelligence correlation mediated by genes from 0.314 to 0.905. Given that there is ample uncertainty about the relationship between the elements of \(\Upgamma_{M}\) and the spousal trait and cross-trait correlations, this further reinforces our conclusion that for traits with assortative mating, bivariate ACE estimate must be approached with caution.

It is important to emphasize that we have only explored one dimension of the model selection uncertainty in multivariate behavior genetic models. The interpretation would be further complicated by attempts to allow for the latent factors to be correlated or for gene action to be non-additive, questions which we have not investigated here. As is well known, information on MZ and DZ twins alone is not sufficient to estimate models with latent factors for additive genetic effects, non-additive genetic effects, common environment, and individual environment. Since the twin model suffers from parameter indeterminacy when both dominance and common environmental effects are present (Keller and Coventry 2005), we suggest that future work use extended twin family samples with data on twins’ relatives, thus allowing richer models to be identified. Also, rapid advances in molecular genetics may soon make it possible to study the sources of covariation between traits from a new angle.

Despite our reservations, we believe that our framework provides valuable insights into what could plausibly be learned from studies of this kind. For example, Jensen (1980) notes that the observation that the within-family height–intelligence correlation is small and only marginally significant, in contrast to the between-family correlation, is consistent with the hypothesis that the height–intelligence correlation stems from assortative mating. This reasoning is consistent with expression Eq. 8. Johnson (1991), on the other hand, in a review of the literature, concludes that there is a marginally significant positive within-family height–intelligence correlation, and that this suggests that pleiotropy plays some role in the height–intelligence correlation. We also report a significant positive within-family height–intelligence correlation, adding further weight to the hypothesis that pleiotropy is at play. Though expression Eq. 8 shows that this within-family correlation could in part be the result of a correlation between the latent unique environment factors underlying height and intelligence rather than of pleiotropy, the conclusion that pleiotropy cannot account for part of the height–intelligence correlation appears to be premature.

Previous research provides little insight into the genetic mechanisms responsible for the association between height and various aspects of cognition. The difficulty of pin-pointing specific mechanisms is partly due to the fact that both height and intelligence are polygenic traits. While a number of promising markers for height have been identified, they only explain a small share of the heritable variation (Weedon and Frayling 2008). As for decoding the molecular genetic structure underlying intelligence, an extraordinarily complex trait, the problems appear to be even less tractable (Butcher et al. 2007; Deary et al. 2009). However, with the advent of genome-wide association technology, we are hopeful that key genetic markers will be identified resulting in a greater understanding of the genetics of the association between height and various aspects of cognition.

Concluding remark

We developed a general theoretical framework to study the correlation between two traits. We used this framework to extend the standard bivariate ACE model to account for assortative mating and to derive formulas to decompose a cross-trait genetic correlation into components attributable to assortative mating and pleiotropy and to decompose a cross-trait within-family correlation. Our results from the standard bivariate ACE model without assortative mating suggest that common environment explains most of the height–intelligence correlation. However, we caution that our estimates from the standard bivariate ACE model are sensitive to assumptions about assortative mating. In fact, we show that assuming more plausible values for the matrix of assortative mating correlations \(\Upgamma_{M}\) dramatically changes the results of the height–intelligence decomposition, implying that the correlation is primarily mediated by genetic factors rather than by common environment. Also, although we use the largest sample to date for this kind of analysis, and although our data comes from standardized measurements of male conscripts in a tight age range, our estimates are not very precise, as reflected by the size of the standard errors. This leads us to emphasize the difficulty of disentangling the sources of covariation between two traits with samples consisting only of MZ and DZ twins. Future work could benefit from the use of large extended twin family samples. Alternatively, given the rapid pace at which knowledge about the genome is increasing, it may be that such questions will soon be more easily tackled through an approach that seeks to understand how specific genes directly affect the phenotypes of interest.