Abstract
Co-twin studies are an elegant and powerful design that allows controlling for the effect of confounding variables, including genetic and a range of environmental factors. There are several approaches to carry out this design. One of the methods commonly used, when contrasting continuous variables, is to calculate difference scores between members of a twin pair on two associated variables, in order to analyse the covariation of such differences. However, information regarding whether and how the different ways of estimating within-pair difference scores may impact the results is scant. This study aimed to compare the results obtained by different methods of data transformation when performing a co-twin study and test how the magnitude of the association changes using each of those approaches. Data was simulated using a direction of causation model and by fixing the effect size of causal path to low, medium, and high values. Within-pair difference scores were calculated as relative scores for diverse within-pair ordering conditions or absolute scores. Pearson’s correlations using relative difference scores vary across the established scenarios (how twins were ordered within pairs) and these discrepancies become larger as the within-twin correlation increases. Absolute difference scores tended to produce the lowest correlation in every condition. Our results show that both using absolute difference scores or ordering twins within pairs, may produce an artificial decrease in the magnitude of the studied association, obscuring the ability to detect patterns compatible with causation, which could lead to discrepancies across studies and erroneous conclusions.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Twin registries are an invaluable resource (Odintsova et al. 2018) for disentangling the relative role of genetic and environmental factors in all kinds of phenotypes (Friedman et al. 2021; Knopik et al. 2017). Twin data are not only useful for classic heritability analyses but can also be used to test a variety of alternative hypotheses. Among other purposes, twin data can be especially useful to test for causal associations (Gonggrijp et al. 2023; McAdams et al. 2021; McGue et al. 2010). A causal relationship between an exposure and an outcome implies that the former (e.g., risk factor) causes the latter (e.g., a disease or disorder). However, an association between an exposure and an outcome might be non-causal when the relationship is due to a confounding factor that influences both (Kendler et al. 1993), and determining whether an association is truly causal is frequently an elusive task. There is a wide consensus that randomized experiments are the gold standard for testing for causality. However, quite often the hypothesized causal variable cannot be properly manipulated due to practical, methodological, or ethical constraints (Verhulst & Estabrook 2012).
Co-twin control studies have been proposed as an alternative quasi-experimental method to facilitate causal inference (Gonggrijp et al. 2023; McAdams et al. 2021; McGue et al. 2010; Verhulst and Estabrook 2012). Although they are not equivalent to a randomized experiment, they give us a powerful and elegant design which allows a suitable control for a wide range of confounders, both genetic and environmental, known and unknown (Kendler 2017; Vitaro et al. 2009). That is, twin studies cannot prove that an exposure causes an outcome, but they can show whether the former continues to predict the latter after accounting for the effect of shared aetiology (McAdams et al. 2021). The logic of the co-twin design is based on a close match between members of a twin pair, which includes, by definition, genetic and shared environmental factors. Therefore, twin pairs discordant for an exposure can be selected and the probability of experiencing the outcome compared. Alternatively, twins can be selected for outcome discordance in order to analyse the probability of having been exposed to the hypothesized risk factor (Goldberg and Fisher 2005).
A co-twin approach is commonly used to examine potentially causal associations observed at individual level (i.e., without regard to twin-pair membership). The co-twin approach may identify several alternative interpretations of the observed correlations. First, the correlation is not causal, but likely due to familial or shared environmental confounding (e.g., socioeconomic status). In this case, associations between the exposure and outcome variables observed at the individual level will be the same in dizygotic (DZ) or monozygotic (MZ) twin pairs. Second, the correlation is not causal, but likely due to genetic confounding. Such would be the case when the association between the exposure and outcome variables remains significant for DZ twins but disappears in MZ twins. Finally, the association remains significant at all levels, individual or within DZ/MZ pairs, which is consistent with a causal effect. Therefore, if we artificially underestimate the correlations, it will be more difficult to identify potentially causal associations. Scenarios 1 and 2 may depict different degrees of confounding, which can be total or partial depending on the within-DZ/MZ pair effect (Kendler et al. 1993; Kujala et al. 2002; McGue et al. 2010).
There are various approaches to operationalise discordance in a co-twin control study. One way is to construct a dichotomous (discordant/concordant) variable, but in doing so one is necessarily discarding information about the degree of discordance. Alternatively, it is possible to calculate a continuous difference score between the members of a twin pair. Continuous measures are calculated by subtracting the score of one twin from the score of the co-twin. The logic behind this approach can be summarized with the following formula: \(E\left(\Delta {Y}_{i}\right)={\beta }_{w }E\left(\Delta {X}_{i}\right)\). The \({\beta }_{w}\) coefficient represents the expected difference between Twin 1 (T1) and Twin 2 (T2) on the outcome variable for each unit of change in the difference between T1 and T2 on the exposure variable (Vitaro et al. 2009). Correlations, different types of regression analyses, or structural equation modelling can then be used to analyse the association between difference scores on the exposure and the outcome to identify possible causal relationships (McAdams et al. 2021; Vitaro et al. 2009).
One of the most prominent differences is that for the dichotomous discordance strategy, a cut-off to select discordant twin pairs needs to be established. That is apparently straightforward for some conditions (e.g., T1 smokes but T2 does not), although reality tends to be not that simple (e.g., T1 started smoking last year and still does while T2 quit six months ago after 15 years of smoking). Using the difference scores approach allows researchers to include subtle gradient differences within twin pairs, thus preserving statistical power. Further, this retains the complex nuances inherent in continuously measured conditions such as neuroticism, weight, or sleep quality.
At first glance, it seems that the quantitative difference approach would be more appropriate when feasible. Yet, it also has some methodological boundaries that have not received enough attention to date. Specifically, difference scores are frequently computed in one of two ways: (1) as relative differences where the sign of the difference indicates which twin (T1 or T2) has the largest score on the phenotype; or (2) absolute differences where just the magnitude of the difference is considered. This differentiation may have major implications when analysing associations between variables that could be causally related.
Advice regarding how to compute within-pair difference scores in a co-twin design is scarce and contradictory. Rovine (1994) postulated that “when birth order is part of the hypothesis, a relative difference score (maintaining the birth order through the sign of the difference) is appropriate. When any difference is of importance and it does not matter which sibling has more of the characteristic, an absolute difference may be more appropriate”. Turkheimer and Waldron (2000) also stated that absolute differences can be computed when no ordering of the siblings is available or desirable. Other researchers, however, have suggested the use of relative differences computed by assigning twins randomly as T1 and T2 respectively (Vitaro et al. 2009; von Stumm and Plomin 2018). Nevertheless, both absolute and relative difference scores are common in the literature: (Barclay et al. 2013; Brendgen et al. 2017; Calais-Ferreira et al. 2021; Castelbaum et al. 2020; Isaksson et al. 2022; Kim et al. 2022; Klump et al. 2000; Korotana et al. 2018; Lundin Remnélius et al. 2022; Mosing et al. 2016; Torgersen and Janson 2002; von Stumm and Plomin 2018) Yet, the justification for using absolute or relative differences is often not clearly stated or even discussed in the manuscript. Furthermore, it is sometimes not clear if absolute difference scores are calculated for both the outcome and the exposure or just for one of them.
Relative difference scores appear to have an obvious advantage compared to absolute differences, given that they are not only informative about the magnitude of the difference (even though the mean tends to 0), but also about the direction of such difference. Yet, the magnitude of the association using relative difference scores with twins randomly assigned as T1 or T2 could differ from the use of absolute difference scores since the variability is reduced in the case of absolute as compared to relative difference scores. An alternative could be the ordering of twins such that the twin with the highest score is always assigned as T1. This should produce the same estimate of the regression slope as random assignment, for the line constrained through the origin (Carlin et al. 2005).
However, all those propositions remain speculative since no comparative exploration has yet been made about the discrepancies between the different approaches to data transformation. Results from co-twin studies using quantitative difference scores could vary depending on the type of scores (relative/absolute) and the twin-pair organisation (random/ordered) chosen. The aim of this study is to compare the results obtained by different methods of data transformation when performing a co-twin control study and to test how the association changes using each of those approaches. To that aim, we simulated different scenarios for a co-twin study, and we compared the association between two simulated variables in a co-twin design by varying the order of the twins for one of the variables (twins ordered such that T1 has the highest score in Variable 1, from 100 to 0% of the cases) and the kind of score (relative/absolute). This study aims to serve as a reference for performing a co-twin analysis by highlighting potential issues of selecting the either strategy, especially for those less familiar with the twin methodology.
Method
A simulation study was performed using R version 4.0.4 (R Core Team 2021). The script for this simulation is available at (https://osf.io/ms4uw/?view_only=c4b49846af38401cbe788a4bc9100678). Data was created to simulate a scenario with two continuous variables (hereon V1 and V2) for pairs of twins (hereon T1 and T2). Data for 10,000 MZ twin pairs and 10,000 DZ twin pairs were simulated with fixed means (equal to 0 for both twins). These data were generated from a Direction of Causation (DoC) model (Duffy and Martin 1994; Heath et al. 1993; Verhulst and Estabrook 2012). We set the additive genetic factors (A) of V1 to 0.33, common-shared environmental factors (C) to 0.33 and non-shared environmental factors (E) to 0.34, whereas for V2 we set A to 0.50, C to 0, and E to 0.50 to maximize the statistical power of the DoC model. We simulated data for three different scenarios with different correlations: high causal path (rV1,V2 = 0.6), medium causal path (rV1,V2 = 0.3) and low causal path (rV1,V2 = 0.1). Additionally, we also simulated data with a high causal path and adding genetic confounders (genetic correlation [rA: 0.3]). The same simulations were performed for common-shared environmental confounders (shared environment correlation [rC: 0.3]) and non-shared environmental confounders (non-shared environment correlation [rE: 0.3]).
To test the main hypothesis of this study, difference scores (i.e., V1T1–V1T2 and V2T1–V2T2) were calculated. Pearson’s correlation between the intra-pair differences for both variables were then estimated both using absolute and relative difference scores. Difference scores were calculated for a range of scenarios by intentionally modifying the frequency of twin pairs where T1 had a higher score than T2 on V1, from 100% (T1 is always the highest score) to 0% (T1 is always the lowest score) in 5% intervals. Note that the 100% and 0% conditions are almost equivalent to absolute difference scores and that the 50% condition would be equivalent to perfect random assignment of twin order, which are the main proposed strategies for absolute and relative difference scores (Vitaro et al. 2009). Regression models using absolute/relative difference scores were also fitted. Additionally a regression model including within-pair and between-pair effects was also fitted (see McGue et al. (2010) for full description).
Real Data Demonstration
Data on height and weight from the Murcia Twin Registry (MTR) in Spain, were used to demonstrate the importance of differences between the methods. Description about the MTR, recruitment procedures, ethics, and data collection is provided elsewhere (Ordoñana et al. 2019). Height in centimeters (mean: 162.4; SD = 10.2) and weight in kilograms (mean: 72.5; SD = 13.8) were selected as they are known to be correlated. These variables were collected using both objective methods (50% of the sample) and self-reported. Data from 654 complete same sex twin pairs were used for this test.
Results
Figure 1 displays Pearson’s correlations across the different conditions (i.e., low, medium and high causal association between V1 and V2). The results for both MZ and DZ twin pairs are combined as there were no confounders (i.e., rA, rC and rE equal to 0) that would bias the estimated correlations. We found ample variation across the different scenarios of twin ordering (i.e., from 0 to 100%); variation that becomes more pronounced as the phenotypic correlation increases. For the high causal association condition Pearson’s correlation between difference scores ranged from 0.41 (0 and 100%) to 0.60 (50%); for the medium causal association condition ranged from 0.19 (0 and 100%) to 0.31 (50%); and for the low causal association condition the range varied between 0.08 (0% and 100%) and 0.12 (50%). The Pearson’s correlation between absolute difference scores for V1 and V2 always showed, again, a slightly lower value compared to the 0% and 100% conditions (0.33, 0.10 and 0.04 for the high, medium and low causal conditions respectively). In summary, the results show that: 1) correlations between relative difference scores are notably stronger than absolute difference scores; 2) the magnitude of the correlations using relative difference scores vary based on the proportion of time the value of twin 1 is greater than the value for twin 2; and 3) differences between relative and absolute difference scores are magnified as causal associations increase (Fig. 1).
We extended our analyses to examine the impact of genetic and environmental confounding. Using the high causal association condition (phenotypic correlation = 0.6) framework, we introduced a genetic correlation between V1 and V2 (rA = 0.3) to mimic confounding genetic factors. Our results again showed the same pattern of results for MZ and DZ twins, but notably the within-person DZ twin correlation was higher due to the fact that DZ twins share only 50% of their genetic factors (Fig. 2). The same pattern of results was found when we introduced common-shared environmental confounders (rC = 0.3), as expected, the within-person correlations were the same for MZ and DZ twins since common-shared environmental factors are fully controlled in both conditions (Fig. 3). Finally, when we introduced non-shared environmental confounders (rE = 0.3), we found higher correlations as compared to the simulated value (phenotypic correlation = 0.6) (Fig. 4). As expected this increase was higher for the MZ condition as previously reported in the literature (Frisell et al. 2012).
Real Data Demonstration
Results using height and weight data from the MTR are very similar to those obtained using the medium causal association scenario. The matrix of correlations (MZ/DZ twins) was as follows:
Pearson’s correlation ranged from 0.20 (0%,100% and absolute difference scores conditions) to 0.29 (50% condition). These results show that depending on the method used (relative vs absolute difference scores) we can obtain a different value for the association between two phenotypes using the same empirical data (see Supplementary Fig. 1).
Regression Models
Results for regression models comparing relative difference scores (only for 50% or 100% conditions) with absolute difference scores are presented in Table 1. For each of the simulated causal conditions as well as the real data analysis, the associations in the 50% relative difference condition were larger than in either the 100% relative difference condition or the absolute difference condition, thus providing more power to detect a causal effect. Another common analytical method for conducting cotwin analyses is to fit within-pair and between-pair effects (Carlin et al. 2005; McGue et al. 2010). Notably, the point estimates for the within-pair effects matched the 50% relative difference estimates. Similar results were found when different scenarios of confounding (genetic, shared-environmental, and non-shared environmental) were included, according to zygosity, within the high causal association condition (Suppl. Table 1). Note that all the regression models were fitted including the intercept term. However, regression models using difference scores must always be fitted without the intercept term as it has been previously reported (Carlin et al. 2005)—if the intercept term is included and twins are not randomly ordered then the results will be biased, underestimating the association, as shown in Table 1 and Supplementary Table 1.
Discussion
The results from our simulation study and data demonstration suggest that the method used to calculate difference scores has a large impact on the magnitude of the association in co-twin studies and thus any subsequent conclusions. When difference scores are calculated from continuous variables, correlations between relative difference scores are larger than correlations between absolute difference scores. Therefore, the appropriateness of the difference scores has major implications for the conclusions that are drawn from analyses of genetically informative designs such as the co-twin method (Kendler 2017; Kujala et al. 2002; McAdams et al. 2021; McGue et al. 2010). As such, the appropriateness of the transformations and procedures carried out in these kinds of studies is by no means a trivial issue, and results from co-twin studies may vary depending on the method used to calculate difference scores. In our real data demonstration, we showed that the magnitude of the association between two variables (i.e., height and weight) is considerably decreased (≈30%) depending on the analytical strategy (from 0.29 to 0.20). This has clear implications, since using the wrong strategy could lead to non-significant associations when in fact the relationship is present after controlling for familial factors. Additionally, this miscalculation could lead to erroneous conclusions regarding the presence of genetic and/or environmental confounders when comparing the estimates with those obtained from the analysis of non-related samples.
Although the scientific literature about the preference of absolute versus relative difference scores within co-twin studies is scarce, it has been proposed that relative difference scores would be appropriate if birth order is part of the hypothesis whereas absolute difference scores would be appropriate when it does not matter which twin has more of the characteristic (Rovine 1994). However, this last method could be counterintuitive since the direction of the association disappears, which could explain why other researchers have used or recommended relative difference scores on a regular basis (Vitaro et al. 2009; von Stumm and Plomin 2018). We have observed, using both simulated as well as real data, that the magnitude of the association between difference scores varies substantially depending on the strategy used to calculate those scores.
The main finding of our study is that when using absolute difference scores or relative differences with twins ordered within pairs, the correlation between difference scores tends to be artificially reduced, impeding the power to detect causal associations. This results from a drop in the variability of the difference scores produced by the restriction of the range of values when negative scores are transformed into positive ones. Such reduction in variability is known to lead to a decrease in the magnitude of the product-moment correlation (Cohen et al. 2003) and, hence the correlation estimates become lower as more negative values (or positive values depending on the sign of the association) for the within-pair differences are removed in one of the variables. In the same course of action, the use of absolute instead of relative difference scores (whichever the twin ordering) will lower correlation estimates as the loss of variability in this case occurs for both variables.
Behavioural research has shown the importance of taking into account small effects since psychological traits are likely influenced by a multitude of causes (Götz et al. 2021). For the co-twin control method, this implies that subtle variations yielded by the strategy chosen for data organisation/computation could alter the results substantially, leading to discrepancies among studies or erroneous conclusions. In the light of our results, the most appropriate estimate appears to be that obtained by ordering twins randomly and computing relative difference scores. By doing so, the direction of the association (as opposed to absolute values) is preserved, and the risk of a deflated correlation estimate reduced, at the same time that data manipulation is reduced to a minimum. This becomes increasingly important as within-twin cross-trait correlation increases since the difference between estimates escalates accordingly.
This strategy is also not free from possible biases. In particular, care should be taken to ensure that no association exists between the exposure/outcome studied and other variables that could be related to twin ordering. Of course, this would not be the method of choice if twin order forms part of the hypothesis. Furthermore, it is important to note that non-shared environmental factors could bias the results, as shown in our simulation including non-shared environmental confounders (Fig. 4), the association between variables increases and this increase is higher for the MZ condition. This has been previously addressed in several publications highlighting that this bias could be even higher as compared to the unpaired estimate (Frisell et al. 2012; Saunders et al. 2019). Accordingly, the theoretical motivation for utilising absolute difference scores should be carefully justified when reporting results from co-twin studies. Measurement error should also be considered in this context, whereas non-shared environmental confounders could increase the magnitude of the association, measurement error has the opposite effect and could lead to conclude that the causal association is confounded by familial factors (Frisell et al. 2012; Gustavson et al. 2024).
In summary, co-twin control designs can shed light on the causal associations between variables, but the selection of absolute or relative difference scores can have a major impact on the study’s conclusions. In this study, we examined the discrepancies across methods used to calculate within-pair difference scores both using simulated and real data. Our results show that when twins are not randomly assigned or using absolute difference scores might underestimate the association due to a reduction in variability, which could therefore mislead the conclusions of the study. We find that the use of relative difference scores, with twins randomly assigned, appears to be the method of choice in co-twin control designs, as long as twin birth order is not part of the hypothesis. Furthermore, given their potential relevance for the results, the procedure used for computing the difference scores should be reported clearly in the methods of co-twin control studies.
Data Availability
All the results can be replicated using the provided syntax, except results using real data from the MTR that is available under restrictions.
Code Availability
Syntax for this study is available at (https://osf.io/ms4uw/?view_only=c4b49846af38401cbe788a4bc9100678).
References
Barclay NL, Eley TC, Parsons MJ, Willis TA, Gregory AM (2013) Monozygotic twin differences in non-shared environmental factors associated with chronotype. J Biol Rhythms 28(1):51–61. https://doi.org/10.1177/0748730412468698
Brendgen M, Ouellet-Morin I, Lupien SJ, Vitaro F, Dionne G, Boivin M (2017) Environmental influence of problematic social relationships on adolescents’ daily cortisol secretion: a monozygotic twin-difference study. Psychol Med 47(3):460–470. https://doi.org/10.1017/S003329171600252X
Calais-Ferreira L, Barreto ME, Mendonça E, Dite GS, Hickey M, Ferreira PH, Hopper JL (2021) Birthweight, gestational age and familial confounding in sex differences in infant mortality: a matched co-twin control study of Brazilian male-female twin pairs identified by population data linkage. Int J Epidemiol. https://doi.org/10.1093/ije/dyab242
Carlin JB, Gurrin LC, Sterne JA, Morley R, Dwyer T (2005) Regression models for twin studies: a critical review. Int J Epidemiol 34(5):1089–1099. https://doi.org/10.1093/ije/dyi153
Castelbaum L, Sylvester CM, Zhang Y, Yu Q, Constantino JN (2020) On the nature of monozygotic twin concordance and discordance for autistic trait severity: a quantitative analysis. Behav Genet 50(4):263–272. https://doi.org/10.1007/s10519-019-09987-2
Cohen J, Cohen P, West SG, Aiken LS (2003) Bivariate correlation and regression. In: Applied multiple regression/correlation analysis for the behavioral sciences, 3rd edn. Routledge, New York, pp 19–63. https://doi.org/10.4324/9780203774441
Duffy DL, Martin NG (1994) Inferring the direction of causation in cross-sectional twin data: theoretical and empirical considerations. Genet Epidemiol 11(6):483–502. https://doi.org/10.1002/gepi.1370110606
Friedman NP, Banich MT, Keller MC (2021) Twin studies to GWAS: there and back again. Trends Cogn Sci 25(10):855–869. https://doi.org/10.1016/j.tics.2021.06.007
Frisell T, Öberg S, Kuja-Halkola R, Sjölander A (2012) Sibling comparison designs: bias from non-shared confounders and measurement error. Epidemiology 23(5):713–720. https://doi.org/10.1097/EDE.0b013e31825fa230
Goldberg J, Fisher M (2005) Co-twin control methods. In: Everitt BS, Howell DC (eds) Encyclopedia of statistics in behavioral science. Wiley, Chichester, pp 415–418
Gonggrijp BMA, van de Weijer SGA, Bijleveld CCJH, van Dongen J, Boomsma DI (2023) The co-twin control design: implementation and methodological considerations. Twin Res Hum Genet. https://doi.org/10.1017/thg.2023.35
Götz FM, Gosling SD, Rentfrow PJ (2021) Small effects: The indispensable foundation for a cumulative psychological science. Perspect Psychol Sci. https://doi.org/10.1177/1745691620984483
Gustavson K, Torvik FA, Davey Smith G, Røysamb E, Eilertsen EM (2024) Familial confounding or measurement error? How to interpret findings from sibling and co-twin control studies. Eur J Epidemiol 39(6):587–603. https://doi.org/10.1007/s10654-024-01132-6
Heath AC, Kessler RC, Neale MC, Hewitt JK, Eaves LJ, Kendler KS (1993) Testing hypotheses about direction of causation using cross-sectional family data. Behav Genet 23(1):29–50. https://doi.org/10.1007/BF01067552
Isaksson J, Ruchkin V, Aho N, Lundin Remnélius K, Marschik PB, Bölte S (2022) Nonshared environmental factors in the aetiology of autism and other neurodevelopmental conditions: a monozygotic co-twin control study. Mol Autism 13(1):8. https://doi.org/10.1186/s13229-022-00487-5
Kendler KS (2017) Causal inference in psychiatric epidemiology. JAMA Psychiat 74(6):561–562. https://doi.org/10.1001/jamapsychiatry.2017.0502
Kendler KS, Neale MC, MacLean CJ, Heath AC, Eaves LJ, Kessler RC (1993) Smoking and major depression. A causal analysis. Arch Gen Psychiatry 50(1):36–43
Kim SY, Yoo DM, Kwon MJ, Kim JH, Bang WJ, Choi HG (2022) Differences in nutritional intake, total body fat, and BMI score between twins. Nutrients. https://doi.org/10.3390/nu14173655
Klump KL, Holly A, Iacono WG, McGue M, Willson LE (2000) Physical similarity and twin resemblance for eating attitudes and behaviors: a test of the equal environments assumption. Behav Genet 30(1):51–58
Knopik VS, Neiderhiser JM, DeFries JC, Plomin R (2017) Behavioral genetics, 7th edn. Worth Publishers, New York
Korotana LM, von Ranson KM, Wilson S, Iacono WG (2018) Reciprocal Associations between eating pathology and parent-daughter relationships across adolescence: A monozygotic twin differences study. Front Psychol 9:914. https://doi.org/10.3389/fpsyg.2018.00914
Kujala UM, Kaprio J, Koskenvuo M (2002) Modifiable risk factors as predictors of all-cause mortality: the roles of genetics and childhood environment. Am J Epidemiol 156(11):985–993
Lundin Remnélius K, Neufeld J, Isaksson J, Bölte S (2022) Eating problems in autistic females and males: a co-twin control study. J Autism Dev Disord 52(7):3153–3168. https://doi.org/10.1007/s10803-021-05198-z
McAdams TA, Rijsdijk FV, Zavos HMS, Pingault JB (2021) Twins and causal inference: leveraging nature’s experiment. Cold Spring Harb Perspect Med. https://doi.org/10.1101/cshperspect.a039552
McGue, M., Osler, M., & Christensen, K. (2010). Causal inference and observational research: the utility of twins. In Perspectives on psychological science (Vol. 5, pp. 546–556): NIH Public Access.
Mosing MA, Madison G, Pedersen NL, Ullén F (2016) Investigating cognitive transfer within the framework of music practice: genetic pleiotropy rather than causality. Dev Sci 19(3):504–512. https://doi.org/10.1111/desc.12306
Odintsova VV, Willemsen G, Dolan CV, Hottenga JJ, Martin NG, Slagboom PE, Boomsma DI (2018) Establishing a twin register: an invaluable resource for (behavior) genetic, epidemiological, biomarker, and “omics” studies. Twin Res Hum Genet 21(3):239–252. https://doi.org/10.1017/thg.2018.23
Ordoñana JR, Carrillo E, Colodro-Conde L, García-Palomo FJ, González-Javier F, Madrid-Valero JJ, Sánchez-Romera JF (2019) An update of twin research in Spain: the Murcia twin registry. Twin Res Hum Genet 22(6):667–671. https://doi.org/10.1017/thg.2019.60
R Core Team (2021) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/
Rovine MJ (1994) Estimating nonshared environment using sibling discrepancy scores. In: Hetherington EM, Reiss D, Plomin R (eds) The importance of nonshared environment on development. Eribaun, Hillsdale, pp 33–61
Saunders GRB, McGue M, Malone SM (2019) Sibling comparison designs: addressing confounding bias with inclusion of measured confounders. Twin Res Hum Genet 22(5):290–296. https://doi.org/10.1017/thg.2019.67
Torgersen AM, Janson H (2002) Why do identical twins differ in personality: shared environment reconsidered. Twin Res 5(1):44–52. https://doi.org/10.1375/1369052022893
Turkheimer E, Waldron M (2000) Nonshared environment: a theoretical, methodological, and quantitative review. Psychol Bull 126(1):78–95. https://doi.org/10.1037/0033-2909.126.1.78
Verhulst B, Estabrook R (2012) Using genetic information to test causal relationships in cross-sectional data. J Theor Polit 24(3):328–344. https://doi.org/10.1177/0951629812439348
Vitaro F, Brendgen M, Arseneault L (2009) The discordant MZ-twin method: one step closer to the holy grail of causality. Int J Behav Dev 33(4):376–382. https://doi.org/10.1177/0165025409340805
von Stumm S, Plomin R (2018) Monozygotic twin differences in school performance are stable and systematic. Develop Sci. https://doi.org/10.1111/desc.12694
Funding
Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. Ministerio de Ciencia, Innovación y Universidades—Spain (RTI2018-095185-B-I00) co-funded by European Regional Development Fund (FEDER). The funder had no role in the study design, data collection, data analyses, writing or the decision to submit this manuscript for publication.
Author information
Authors and Affiliations
Contributions
JJMV and JRO conceptualized the original idea and constructed the methodology. JJMV and JRO participated in data collection (real twin data analyses). JJMV and BV in consultation with JALL performed the simulation and statistical analyses. JJMV and JRO wrote the original manuscript in consultation with BV and JALL. All authors have read and agreed to the final version of the manuscript.
Corresponding authors
Ethics declarations
Conflicts of interest
Juan J. Madrid-Valero, Brad. Verhulst, José A. López-López and Juan R. Ordoñana declare no conflict of interest.
Ethics Approval
The MTR protocols and instruments, as well as the data collection procedures and the analysis derivatives thereof, have been approved by the Research Ethics Committee of the University of Murcia and meet the legal requirements of confidentiality and protection of personal data. Participants provided written informed consent when interviewed in person or oral consent when a telephone interview was used.
Informed Consent
Participants provided written informed consent when interviewed in person or oral consent by telephone interview.
Additional information
Edited by Stephen Petrill.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
10519_2024_10196_MOESM1_ESM.tif
Supplementary file1 (TIF 4572 KB) Association between height and weight for MZ and DZ twins from the Murcia Twin Registry. The result for the absolute difference score is arbitrarily plotted at the 50% point.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Madrid-Valero, J.J., Verhulst, B., López-López, J.A. et al. Calculating Within-Pair Difference Scores in the Co-twin Control Design. Effects of Alternative Strategies. Behav Genet 54, 426–435 (2024). https://doi.org/10.1007/s10519-024-10196-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10519-024-10196-9