Introduction

In behavior genetics, two types of genetic effects on a trait are considered, namely, additive genetic (A) effects and non-additive genetic (D or NA) effects. In addition, two types of environmental effects on a trait are considered, namely, shared environmental (C) effects and non-shared environmental (E) effects. Therefore, in behavior genetics, the above four effects on phenotypes are assumed. To estimate these effects, structural equation modeling (Bollen 1989; Eaves et al. 1978a, b) is used extensively.

Figure 1 illustrates the path diagram in which all four factors are included. This is called the univariate ACDE model. In Fig. 1, p 1 and p 2 are phenotypes of the same trait for twin 1 and twin 2. In addition, let a, c, d, and e be the additive genetic effect, the shared environmental effect, the sum of the dominance effect and the interaction effect between genes (epistatic effect), and the nonshared environmental effect, respectively.

Fig. 1
figure 1

The ACDE model

To estimate a, c, d, and e, SEM using 2nd-order (and sometimes 1st-order) moments are used in current behavior genetic models (Eaves et al. 1978a, b; Neale and Carldon 1992). A significant problem in current behavior genetics models is that the ACDE model in Fig. 1 cannot be identified. Therefore, reduced ACE or ADE models are actually used in most research. However, the use of these reduced models may lead to biased estimators of these effects (Eaves et al. 1978a; Grayson 1989). The purpose of the present paper is to identify the ACDE model using non-normal structural equation modeling (nnSEM; Shimizu and Kano 2008; Ozaki and Ando 2009) to overcome this problem.

The reason why the ACDE model cannot be identified is as follows. The expected covariance matrices of the ACDE model for MZ and DZ twins are as follows:

$$\begin{aligned} \Sigma_{MZ}&=\left[ \begin{array}{ll} E[p_{1}^{2}(mz)] & E[p_{1}p_{2}(mz)]\\ E[p_{1}p_{2}(mz)] & E[p_{2}^{2}(mz)] \end{array}\right]\\ &=\left[ \begin{array}{ll} a^{2}+c^{2}+d^{2}+e^{2} & a^{2}+c^{2}+d^{2}\\ a^{2}+c^{2}+d^{2} & a^{2}+c^{2}+d^{2}+e^{2} \end{array}\right] \end{aligned}$$
(1)
$$\begin{aligned} \Sigma_{DZ}&= \left[ \begin{array}{ll} E[p_{1}^{2}(dz)] & E[p_{1}p_{2}(dz)]\\ E[p_{1}p_{2}(dz)] & E[p_{2}^{2}(dz)] \end{array}\right]\\ &=\left[ \begin{array}{ll} a^{2}+c^{2}+d^{2}+e^{2} & 0.5a^{2}+c^{2}+0.25d^{2}\\ 0.5a^{2}+c^{2}+0.25d^{2} & a^{2}+c^{2}+d^{2}+e^{2} \end{array}\right] \end{aligned}$$
(2)

Here, E[p 1 p 2(mz)] is the expectation of p 1 p 2 of MZ twins (here, let both p 1 and p 2 be mean centered phenotypes). The variances of A, C, D, and E are fixed to 1s. This model cannot be identified because there are three unique elements (one variance and two covariances). However, there are four parameters in the model (number of degrees of freedom: −1). Therefore, instead of the ACDE model, the ACE or ADE models are actually used.

In general, when twice the correlation between the DZ pair (r dz ) exceeds the correlation between the MZ pair (r mz ), the ACE model, in which D is removed, is used. Otherwise, the ADE model, in which C is removed, is used. In the latter case, MZ pairs are much similar than DZ pairs. Therefore, genetic effects are expected to be more important than common environmental effects, and d is included in the model. However, this model fit strategy is used because of the statistical limitation described above.

Actually, when twice the correlation between the DZ pair does not exceed the correlation between the MZ pair, this indicates only that the d effect is smaller than twice the c effect, because

$$\begin{aligned} &r_{mz}<2r_{dz} \\ & \Longleftrightarrow a^{2}+c^{2}+d^{2}<a^{2}+2c^{2}+0.5d^{2} \\ & \Longleftrightarrow d^{2}<2c^{2}.\end{aligned}$$

Similarly when twice the correlation between the DZ pair exceeds the correlation between the MZ pair, the d effect is larger than twice the c effect. Keller et al. (2005) noted that r mz  > 2r dz suggests only that c is less powerful than d, and not that c is non-existent.

Including only three factors (ACE or ADE) often leads to incorrect results. Table 1 shows the biases in estimators (explained variance) for each factor when the ACE or ADE models are fit, even though all four factors actually affect a phenotype (which is realistic in most cases). The table shows that when the ACE model is fit, the explained variance for A is overestimated by 1.5d 2, C is underestimated by 0.5d 2, and D is underestimated by d 2, and when ADE model is fit, A is overestimated by 3c 2, C is underestimated by c 2, and D is underestimated by 2c 2 (as shown in the Appendix; see also Keller and Coventry 2005). In both the cases of the ACE and ADE models, the explained variance for E is estimated without bias. Therefore, both the ACE and ADE models yield biased estimators, except for E. In particular, C is underestimated whether the ACE model or the ADE model is fit. The constant results of small C effects in behavior genetics are to a degree due to the use of the ACE or ADE model (Grayson 1989).

Table 1 Biases in estimators when the ACE model or the ADE model is fit

Two numerical examples are given. First, in reality, if the explained percentage of variance for A, C, D, and E of a phenotype are all 0.25, then the MZ correlation is 0.75 (= 0.25 + 0.25 + 0.25) and the DZ correlation is 0.4375 (= 0.25 × 0.5 + 0.25 + 0.25 × 0.25). In this case, the ACE model is fit because r mz  < 2r dz . Therefore, D is removed. However, in reality, D contributes as much as A, C, and E.

Second, when the explained variance for A, C, D, and E of a phenotype are 0.1, 0.2, 0.6, and 0.1, respectively, the MZ correlation is 0.9 (= 0.1 + 0.2 + 0.6), and the DZ correlation is 0.4 (= 0.1 × 0.5 + 0.2 + 0.6 × 0.25). In this case, the ADE model is fit because r mz  > 2r dz . Therefore, C is removed. However, in reality, C has the second largest effect on a phenotype.

Table 1 and these two examples demonstrate that misspecification of the analysis model leads to incorrect results. In the present paper, in order to overcome this problem, the ACDE model is identified using nnSEM. The developed method provides more accurate estimators and is therefore the more accurate method for examining the etiology of variation in human behaviors. Eaves et al. (1978a) and Grayson (1989) noted that there are two primary problems in classical twin design, (which refers to behavior genetics research that uses only twin data and the ACE model or the ADE model is used), namely, (1) the inability to simultaneously estimate both C and D effects and (2) the inability to estimate higher-order epistasis effects. In addition, Keller and Coventry (2005) showed that problem (1) leads to larger bias in estimators than problem (2) (see also Coventry and Keller 2005). Therefore, although it does not perfectly explain variations in human behaviors, the ACDE model covers the primary behavioral factors.

nnSEM

The idea of using higher-order moments in SEM to accommodate non-normal variables was first suggested by Bentler (1983). Moreover, in 1985, a factor analysis model using non-normal variables and higher-order moments to eliminate rotational indeterminacy was developed by Mooijaart (1985). Therefore, the idea of using non-normal variables and higher-order moments is not a recent concept. However, Shimizu and Kano (2008) were the first to develop a simple regression model that uses non-normal variables and higher-order moments in the framework of SEM, where SEM using higher-order moments as well as (1st- and) 2nd-order moments is referred to as nnSEM. Also, Kano and Shimizu (2003) showed that using 2nd, 3rd, and 4th order moments a simple regression model considering confounding a latent variable can be identified.

And in the framework of Independent component analysis (ICA; Comon 1994), there are some researches using non-normally distributed disturbance variables. For example, Shimizu et al. (2006) showed that under the assumptions that the data generating process is linear, no confounding latent variable exists, and disturbance variables have non-normal distributions of non-zero variances, the best fit acyclic model can be identified. They called the model LiNGAM (a linear, non-gaussian, acyclic model). Hoyer et al. (2008) then showed that even if some confounding latent variables exist, parameter estimation of an acyclic model can be performed.

nnSEM has some advantages over SEM. For example, nnSEM can identify models that cannot be identified using SEM, because of a negative degree of freedom. Another advantage is that nnSEM can detect the direction of causation between two cross-sectional variables. Ozaki and Ando (2009) used this advantage to develop a method to detect the direction of causations between C factors and between E factors in bivariate behavior genetics models.

As an illustration of the former advantage, Toyoda (2007) showed that the spurious correlation model and the reciprocal causation model can be identified within the framework of nnSEM. The numbers of parameters of the spurious correlation model and the reciprocal causation model are both four (two path coefficients and two residual variances) when specifying models using only second-order moments. However, the number of unique elements of the covariance matrix between two variables is three. Therefore, neither the spurious correlation model nor the reciprocal causation model can be identified using SEM (number of degrees of freedom: −1). However, as shown in the Appendix of Ozaki and Ando (2009), nnSEM using second- and third-order moments can identify both models.

The two models can be identified using nnSEM because the use of third-order moments increases the number of moments of observed variables and therefore the number of degrees of freedom. Actually, the number of degrees of freedom of the spurious correlation model using nnSEM is 0 and that of the reciprocal causation model is 1. Therefore, these two models can be identified using nnSEM. In the present paper, using this advantage of nnSEM, the univariate ACDE model will be developed.

Model specifications

In addition to SEM using up to second-order moments, third-order moments are used in the present study. Sample third-order moments (\(s_{p_{1}^{3}}, s_{p_{1}^{2}p_{2}}, s_{p_{1}p_{2}^{2}},\;\text{and}\;s_{p_{2}^{3}} \)) are calculated as follows when there are two phenotypes p 1 and p 2 (for twin 1 and twin 2, respectively):

$$\begin{aligned} s_{p_{1}^{3}}&=\frac{1}{n}\sum_{i=1}^{n}(p_{i1}-\bar{p}_{1})^{3}\\ s_{p_{1}^{2}p_{2}}&=\frac{1}{n}\sum_{i=1}^{n}(p_{i1}-\bar{p}_{1})^{2}(p_{i2}-\bar{p}_{2})\\ s_{p_{1}p_{2}^{2}}&=\frac{1}{n}\sum_{i=1}^{n}(p_{i1}-\bar{p}_{1})(p_{i2}-\bar{p}_{2})^{2}\\ s_{p_{2}^{3}}&=\frac{1}{n}\sum_{i=1}^{n}(p_{i2}-\bar{p}_{2})^{3} \end{aligned}$$

where i indicates an observation (in this case, an observation pair). These sample statistics become information in the estimation of a, c, d, and e effects.

When three latent variables C, D, and E are assumed to follow non-normal distributions, the expected third-order moments for the MZ twins are as follows:

$$E[p_{1}^{3}(mz)]=c^3 \sigma_{C^{3}}+d^3 \sigma_{D^{3}}+e^3 \sigma_{E^{3}}$$
(3)
$$ E[p_{1}^{2}p_{2}(mz)]=c^3 \sigma_{C^{3}}+d^3 \sigma_{D^{3}} $$
(4)
$$ E[p_{1}p_{2}^{2}(mz)]=c^3 \sigma_{C^{3}}+d^3 \sigma_{D^{3}} $$
(5)
$$ E[p_{2}^{3}(mz)]=c^3 \sigma_{C^{3}}+d^3 \sigma_{D^{3}}+e^3 \sigma_{E^{3}} $$
(6)

In addition, the expected third-order moments for the DZ twins are as follows:

$$ E[p_{1}^{3}(dz)]=c^3 \sigma_{C^{3}}+d^3 \sigma_{D^{3}}+e^3 \sigma_{E^{3}} $$
(7)
$$ E[p_{1}^{2}p_{2}(dz)]=c^3 \sigma_{C^{3}}+d^3 \sigma_{D1^{2}D2} $$
(8)
$$ E[p_{1}p_{2}^{2}(dz)]=c^3 \sigma_{C^{3}}+d^3 \sigma_{D1D2^{2}} $$
(9)
$$ E[p_{2}^{3}(dz)]=c^3 \sigma_{C^{3}}+d^3 \sigma_{D^{3}}+e^3 \sigma_{E^{3}} $$
(10)

Note that A is assumed to follow a normal distribution when many loci affect the phenotype of interest (Fisher 1918). Here, \(\sigma_{C^{3}}, \sigma_{D^{3}},\hbox{and}\;\sigma_{E^{3}}\) are the skewnesses of C, D, and E, respectively. (In this case, the variances of these independent factors are fixed to 1s. Therefore, these parameters are the skewnesses of the factors.) \(\sigma_{D1^{2}D2}\) expresses the expected value of the second power of D of twin 1 times D of twin 2, and \(\sigma_{D1D2^{2}}\) expresses the expected value of D of twin 1 times the second power of D of twin 2, respectively. \(\sigma_{D1^{2}D2}\) and \(\sigma_{D1D2^{2}}\) are assumed to be equal, because the order of twins is arbitrary.

The above equations are calculated as follows. For example,

$$ \begin{aligned} E[p_{1}^{2}p_{2}(dz)]&=E[(aA_{1}+cC+dD_{1}+eE_{1})^{2}(aA_{2}+cC+dD_{2}+eE_{2})]\\ &=a^3\times 0+c^3 \sigma_{C^{3}}+d^3 \sigma_{D1^{2}D2}. \end{aligned} $$

Assumptions

However, the above model has not yet been identified. There are eight parameters (\(a, c, d, e, \sigma_{C^{3}}, \sigma_{D^{3}}, \sigma_{E^{3}}, \text{and}\;\sigma_{D1^{2}D2}=\sigma_{D1D2^{2}}\)) in the model. However, the number of unique equations is only six (three for second-order moments and three for third-order moments), resulting in the number of degrees of freedom being negative. Therefore, constraints are needed in order to identify the model. The following are models that can be identified.

  1. 1.

    C and E are non-normal, and D is normal. (Therefore, \(\sigma_{D^{3}}=0, \text{and}\;\sigma_{D1^{2}D2}=0.\)) Furthermore, the skewnesses of C and E are the same (\(\sigma_{^{3}}.\)) In this case, there are five parameters (and \(a, c, d, e, \text{and}\;\sigma_{^{3}}\)), and the number of unique equations is five (\(E[p_{1}^{2}(mz)]\) (Eq. 1) = \(E[p_{2}^{2}(mz)]\) (Eq. 1) = \(E[p_{1}^{2}(dz)]\) (Eq. 2) = \(E[p_{2}^{2}(dz)]\) (Eq. 2), \(E[p_{1}p_{2}(mz)]\) (Eq.1), \(E[p_{1}p_{2}(dz)]\) (Eq. 2), \(E[p_{1}^{3}(mz)]\) (Eq. 3) = \(E[p_{2}^{3}(mz)]\) (Eq. 6) = \(E[p_{1}^{3}(dz)]\) (Eq. 7) = \(E[p_{2}^{3}(dz)]\) (Eq. 10), and \(E[p_{1}^{2}p_{2}(mz)]\) (Eq. 4) = \(E[p_{1}p_{2}^{2}(mz)]\) (Eq. 5) = \(E[p_{1}^{2}p_{2}(dz)]\) (Eq. 8) = \(E[p_{1}p_{2}^{2}(dz)]\) (Eq. 9)), resulting in the number of degrees of freedom being 0. However, since there are 14 (six for second-order moments and eight for third-order moments) sample statistics and five parameters, the number of degrees of freedom used to calculate model fit indices is nine (=14 − 5). If the phenotypes are standardized, since the four variances are all 1s, the number of degrees of freedom is six (=11 − 5).

  2. 2.

    D and E are non-normal, and C is normal. (Therefore, \(\sigma_{C^{3}}=0.)\) Furthermore, the skewnesses of D and E are the same (\(\sigma_{^{3}}\)) and \(\sigma_{D1^{2}D2}=\sigma_{D1D2^{2}}\). In this case, there are six parameters (\(a, c, d, e, \sigma_{^{3}}, \text{and}\;\sigma_{D1^{2}D2}\)), and the number of unique equations is six, resulting in the number of degrees of freedom being 0. The number of degrees of freedom used to calculate model fit indices is eight (=14 − 6). If the phenotypes are standardized, then the number of degrees of freedom is five (=11 − 6).

  3. 3.

    D, C, and E are non-normal. Furthermore, the skewnesses of D, C, and E are the same (\(\sigma_{^{3}}\)). In this case, there are six parameters (\(a, c, d, e, \sigma_{^{3}}, \text{and}\;\sigma_{D1^{2}D2}\)), and the number of unique equations is six, resulting in the number of degrees of freedom being 0. The number of degrees of freedom used to calculate model fit indices is eight (=14 − 6). If the phenotypes are standardized, then the number of degrees of freedom is five (=11 − 6).

Note that if all four latent factors are distributed normally, nnSEM cannot estimate these effects. The non-normal distributional assumption on D is likely. For example, if genotypes AA and Aa have the same effect on a trait and the gene frequencies of A and a are both 0.5, then D has a skewed distribution because the genotypic frequency of aa is 0.25 and the genotypic frequency of AA  +  Aa is 0.75. When researchers use the present method, they should compare the fits of the three identified models, and the results of the best fit model should be interpreted. An example R script can be downloaded from http://www.010.upp.so-net.ne.jp/koken/bg.html.

Note also that the developed ACDE model can be used only with continuous data and is not suitable for binary or ordinal data because, in order to analyze the latter two types of data, the liability model that assumes latent variables that follow some normal distributions is usually used in Behavior Genetics. However, if all four latent variables (A, C, D, and E) are normally distributed, then nnSEM cannot be used, because it is impossible to calculate the 3rd order moments of the assumed normally distributed latent variables.

Estimator

The asymptotically distribution-free (ADF) method (Browne 1982, 1984) is used to estimate parameters. In the two-group case (MZ and DZ), the ADF estimator can be expressed as follows:

$$ F=n_{m}(\varvec{\sigma}_{m}(\varvec{\theta})-{\user2{s}}_{m})^{\prime}\user2{W}^{-1}_{m}(\varvec{\sigma}_{m}(\varvec{\theta})-{\user2{s}}_{m})+ n_{d}(\varvec{\sigma}_{d}(\varvec{\theta})-{\user2{s}}_{d})^{\prime}\user2{W}^{-1}_{d}(\varvec{\sigma}_{d}(\varvec{\theta})-{\user2{s}}_{d}) $$
(11)

Here, F is the function to be minimized, n is the sample size, σ(θ) is the expected second- and third-order moments vector, s is the observed second- and third-order moments vector, W is the weight matrix, and subscripts m and d denote MZ and DZ, respectively. Each element of W consists of n times the asymptotic covariance of estimators of moments of the observed phenotypes.

Simulation study

Settings

This simulation study examines the effects of two factors on the average biases and standard deviations of estimates of explained variance for A, C, D, and E. The two factors are (1) the true explained variance for A, C, D, and E (21 patterns, see Table 2), and (2) the skewness of C, D, and E (seven patterns, see Table 3). For all of the 147 (= 21 × 7) conditions, 300 replications were performed. Second- and third-order moments were used in the analyses. In each replication, the BICs of the three identified models are calculated, and the estimates of the best fit model were considered as the estimates in the replication. For the same generated data, when r mz  < 2r dz , the ACE model was fit, otherwise the ADE model was fit. This enables comparisons between the results of the current method (SEM) and the developed method (nnSEM). The average biases and standard deviations of the estimates were calculated for each condition. In all cases, the numbers of MZs and DZs were both 600. The ADF estimator was used for both nnSEM and SEM analyses. For nnSEM, second- and third-order moments were used, on the other hand, for SEM only second-order moments were used. The analysis program was written using the R programming language (Ihaka and Gentleman 1996; version 2.4.0). We wrote the estimator, which was not a pre-existing R package. However, the ‘BFGS’ method, which is a quasi-Newton method, in the ‘optim’ function was used for optimization calculation.

Twenty one patterns of the true explained variance were divided into two cases. In one case, all four factors affect a trait. Therefore, nnSEM is the true model. In the other case, three of the four factors (ACE or ADE) affect a trait. Therefore, both nnSEM and SEM are the true model. Figures 2 and 3 show the results for the former cases, and Figs. 4 and 5 show the results for the latter cases.

Fig. 2
figure 2

Biases (estimated value–true value) when all four factors affect a trait

Fig. 3
figure 3

Standard deviations of estimates when all four factors affect a trait

Fig. 4
figure 4

Biases (estimated value–true value) when three factors affect a trait

Fig. 5
figure 5

Standard deviations of estimates when three factors affect a trait

The seven patterns of skewness are shown in Table 3. These seven patterns can be divided into two cases, in which either (1) one of the three models identified is the true model (patterns 4, 5, and 7; denoted as T in Figs. 2, 3, 4, 5) or (2) none of the three models identified is the true model (patterns 1, 2, 3, and 6; denoted as F in Figs. 2, 3, 4, 5). In order to simplify the results, rather than seven patterns, the average of the results of the three T patterns and the average of the results of the four F patterns are presented in Figs. 2, 3, 4, 5. In all cases, the factor scores for A were generated from the standard normal distribution.

Table 2 True explained variances for A, C, D, and E
Table 3 Skewness of C–E

Results and discussion

Figures 2, 3, 4, 5 show the box plots for A, C, D, E, and G. Here, G indicates the results of A  +  D, which can be considered as the broad-sense heritability (Keller and Coventry 2005). In each figure, nnS indicates the case in which nnSEM was used for the analyses, and S indicates the case in which SEM was used. In addition, T and F in parentheses indicate whether one of the three models identified is the true model (T) or not (F). A, C, D, E, and G at the bottom of each figure indicate that the plots are the results for the corresponding factors. Figures 2 and 4 show the results for the average biases (estimated value–true value), and Figs. 3 and 5 show the results for the standard deviation of the estimates. The box plots are shown for the averages of the three T cases or the four F cases. For example, the box plot for A of the nnS(T) case in Fig. 2 was shown for the following values: 0.041, −0.060, −0.089, −0.145, −0.012, 0.031, 0.107, −0.080, 0.055, −0.016, −0.069, 0.098, and −0.004. The first value, 0.041, is the average of the 300 biases for A when using nnS(T) and the variance components for the four factors are the same (first case in Table 2). The results will be discussed based on (1) an examination of the estimation accuracy of the proposed method and (2) comparison of the estimation accuracies of the conventional method (SEM) and the proposed method (nnSEM).

Examination of estimation accuracy of the proposed method

The simulation study revealed the following characteristics of the proposed method. Note that the following interpretations are for T  +  F cases (T and F are pooled), except for point 11.

  1. 1.

    Figure 2 shows that when all four factors affect a trait, the explained variance of A tends to be slightly underestimated (−7.9% on average), and Fig. 3 shows that the average standard deviation of the estimates of the variance of A is 14.8% and occasionally exceeds 20%, which is a large value. Consequently, the variance of A is not accurately estimated under these conditions.

  2. 2.

    Figure 3 shows that the average standard deviation is larger in the T case (16.8%) than in the F case (12.8%). However, this does not mean that the F case yields better estimates than the T case, because the average bias is much smaller in the T case (−1.1%) than in the F case (−11.4%).

  3. 3.

    Figure 4 shows that when three of the four factors affect a trait, the explained variance of A tends to be somewhat underestimated (−19.5% on average), and Fig. 5 shows that the average standard deviation of the estimates of the variance of A is 10.8%, which is not a small value. Consequently, the variance of A is not accurately estimated under these conditions.

  4. 4.

    Figure 2 shows that when all of the four factors affect a trait, the biases of the explained variance of C are <5% in most cases and are at most 10.1%. Furthermore, Fig. 3 shows that the standard deviations of the estimates are <9%, which indicates that the variance of C is accurately estimated under these conditions.

  5. 5.

    Figure 4 shows that when three of the four factors affect a trait, the explained variance of C tends to be slightly overestimated (6.1% on average), and Fig. 5 shows that the average standard deviation of the estimates of the variance of C is 5.7%. Consequently, the variance of C is accurately estimated under these conditions.

  6. 6.

    Figure 2 shows that when all four factors affect a trait, the biases of the explained variance of D are <10% in most cases, and the explained variance of D tends to be overestimated. Figure 3 shows that standard deviation of the estimates is not larger than in the case of A. Consequently, the variance of D is not accurately estimated, but is estimated more accurately than the variance of A.

  7. 7.

    Figure 4 shows that when three of the four factors affect a trait, the biases of the explained variance of D are overestimated (12.0% on average). Figure 5 shows that the standard deviation of the estimates is not larger than in the case of A. Consequently, the variance of D is not accurately estimated, but is estimated more accurately than the variance of A. These results are the same as in the case in which all four factors affect a trait.

  8. 8.

    Figures 2, 3, 4, 5 show that whether three or four factors affect a trait, the biases of the explained variance of E are less than 5% in all cases. Furthermore, the standard deviations of the estimates are <5%, which indicates that the variance of E is very accurately estimated.

  9. 9.

    Figure 2 shows that when all four factors affect a trait, the biases of the average explained variance of G is −4%, and Fig. 3 shows that the standard deviations of the estimates are <10%.

  10. 10.

    Figure 4 shows that when all four factors affect a trait, the biases of the average explained variance of G is −7.5%, and Fig. 5 shows that the standard deviations of the estimates are <9%.

  11. 11.

    Figures 2, 3, 4, 5 show that whether three or four factors affect a trait, even if data is generated from a model that is not one of the three identified models (F cases), the results of biases and standard deviations of the estimates are not much larger than in the T cases, except for the biases for the A factor. This indicates that the proposed method can provide a similar level of accuracy for these two cases.

  12. 12.

    Figures 2, 3, 4, 5 show that if G = A + D is considered to be global effects of genes, then the explained variances of G, C, and E can be estimated accurately.

Comparison of the results obtained using SEM and nnSEM

The simulation study revealed the following characteristics. Note that the following interpretations are for T  +  F cases (T and F are pooled).

  1. 1.

    When all four factors affect a trait, for A, C, D, and G, there is a high probability of providing a smaller bias by nnSEM than by SEM under these conditions ((22/26) × 100 = 85% for A, (23/26) × 100 = 89% for C, (24/26) × 100 = 92% for D, and (21/26) × 100 = 81% for G).

  2. 2.

    In contrast, when three of the four factors affect a trait, for A, C, D, and G, SEM provided smaller biases than nnSEM in every case.

  3. 3.

    For E, both SEM and nnSEM provide very small bias whether three or four factors affect a trait.

  4. 4.

    The explained variance of C is always underestimated using SEM when all four factors affect a trait. However, this does not occur when using nnSEM.

  5. 5.

    SEM provides a smaller standard deviation of the estimates. Therefore, although nnSEM provides a smaller bias when all four factors affect a trait, a greater number of twins are needed in order to obtain the same level of standard deviation of the estimates using SEM.

Consequently, nnSEM was shown to provide better estimators than SEM in the cases in which four latent factors affect a trait. In particular, when G is considered as broad-sense heritability, G, C, and E can be estimated with biases of <5% in most cases for this simulation setting. When three latent factors affect a trait, SEM provides better estimators than nnSEM. However, even in these cases, C can be estimated with biases of <6.1% on average, E can be estimated with almost no bias, and G can be estimated with biases of <8% in most cases for this simulation setting using nnSEM.

Discussion

The ACE and ADE models yield biased estimates if the ACDE model is the true model. However, structural equation modeling cannot identify the ACDE model because of a negative degree of freedom. In the present paper, the ACDE model was developed using nnSEM. Simulation studies have suggested that the proposed method can decrease the biases and can be used even in cases in which three latent factors affect a trait if G, C, and E are used as outcomes. The effects of the MZ/DZ ratio on the results were not examined in the simulation study described above. However, a small simulation study was conducted, in which the cases of MZ:DZ = 800:400 and 400:800 were considered. The results showed that the MZ/DZ ratio has almost no effect on the estimates in this simulation setting.

In the present paper, although power calculation was not discussed, the power would be calculated by setting the true second- and third-order moments and analyzing the moments using a null model. This is the same method used to calculate the power in the SEM using Mx (Neale et al. 2006, p. 96).

There is another method of identifying the ACDE model. Kathryn et al. (2004) identified the ACDE model using sibling data, where one of the siblings is an adopted person, as well as twin data. Such sibling data also provides information of the etiology of human variation, because the siblings (one of whom is an adopted person) are thought to share the same environment, however, there is no genetic relationship between the siblings. Therefore, their similarity is thought to be caused by their shared environment. Unfortunately, gathering data for siblings for the case in which one sibling is adopted is very difficult and costly. In the present paper, however, nnSEM was used to identify the ACDE model in which higher-order moments are used as additional information. Since the method of the present paper can estimate a, c, d, and e effects using only twin data, the proposed method requires a much lower cost and is much easier to perform. Therefore, the proposed method contributes to behavior genetic research.

There are other factors that have possible effects on phenotypes, such as higher-order epistasis (Eaves et al. 1978a), assortative mating (people choose to marry persons similar to themselves; Eaves et al. 1984; Loelin and DeFries 1987), gene–environment correlation (Plomin et al. 1977; Eaves et al. 1977), and gene–environment interaction (Plomin et al. 1977; Eaves et al. 1977; Purcell 2002; Eaves and Erkanli 2003). The presence of higher-order epistasis makes the correlation between D factors of DZ twins lower than 0.25. The presence of assortative mating leads to overestimation of the effect of C and underestimation of the effect of A if these are not included in models. An example of gene–environment correlation is that genetically intellectual parents tend to provide an intellectual environment for their children, which in turn increases the intelligence of the children. The presence of positive gene–environment correlation also leads to overestimation of the effect of C and underestimation of the effect of A if these are not included in models. An extended twin-family design (ETFD, Truett et al. 1994) can estimate the effects of assortative mating and gene–environment correlation, but cannot estimate the higher-order epistatic effect or gene–environment interaction. Although Purcell (2002) proposed a relatively easy method of estimating gene–environment interaction, at present, there is no method that incorporates the higher-order epistatic effect in the model. Therefore, further research on developing a more exhaustive model is needed.

Higher-order epistasis is expressed as the correlation between D factors of DZ twins, and assortative mating is expressed as the correlation between A factors of DZ twins and D factors of DZ twins. gene–environment correlation is expressed as the correlation between A and C factors and between D and C factors, and gene–environment interaction is the interaction between one of the genetic factors and one of the environmental factors. Therefore, these four effects can be expressed as parameters associated with A, C, D, or E. Consequently, proper estimation of a, c, d, and e is an important basis for including these phenomena (such as assortative mating). Although, in the present paper, only the ACDE model was proposed, this is an important first step in the development of a more exhaustive model.