Abstract
We introduce a new method for analyzing twin data called quantile regression. Through the application presented here, quantile regression is able to assess the genetic and environmental etiology of any skill or ability, at multiple points in the distribution of that skill or ability. This method is compared to the Cherny et al. (Behav Genet 22:153–162, 1992) method in an application to four different reading-related outcomes in 304 pairs of first-grade same sex twins enrolled in the Western Reserve Reading Project. Findings across the two methods were similar; both indicated some variation across the distribution of the genetic and shared environmental influences on non-word reading. However, quantile regression provides more details about the location and size of the measured effect. Applications of the technique are discussed.
Similar content being viewed by others
References
Catts HW, Petscher Y, Schatschneider C, Bridges MS, Mendoza K (2009) Floor effects associated with universal screening and their impact on the early identification of reading disabilities. J Learn Disabil 42:163–176
Cherny SS, Cardon LR, Fulker DW, DeFries JC (1992) Differential heritability across levels of cognitive ability. Behav Genet 22(2):153–162
Clifford CA, Hopper JL, Fulker DW, Murray RM (1984) A genetic and environmental analysis of a twin family study of alcohol use, anxiety, and depression. Genet Epidemiol 1:63–79
Cohen J (1988) Statistical power analysis for the behavioral sciences, 2nd edn. Lawrence Earlbaum Associates, Hillsdale, NJ
Deater-Deckard K, Reiss D, Hetherington EM, Plomin R (1997) Dimensions and disorders of adolescent adjustment: a quantitative genetic analysis of unselected samples and selected extremes. J Child Psychol Psychiatry 38(5):515–525. doi:10.1111/j.1469-7610.1997.tb01538.x
DeFries JC, Fulker DW (1985) Multiple regression analysis of twin data. Behav Genet 15(5):467–473
DeFries JC, Fulker DW (1988) Multiple regression analysis of twin data: etiology of deviant scores versus individual differences. Acta Genet Med Gemellol Twin Res 37(3–4):205–216
Eaves L (2006) Genotype x environment interaction in psychopathology: fact or artifact? Twin Res Hum Genet 9(1):1–8
Firpo S (2007) Efficient semiparametric estimation of quantile treatment effects. Econometrica 75(1):259–276
Goldsmith HH (1991) A zygosity questionnaire for young twins: a research note. Behav Genet 21(3):257–269
Gottesman II, Shields J (1973) Genetic theorizing and schitzophrenia. Br J Psychiatry 122:15–30
Koenker R, Bassett G (1978) Regression quantiles. Econometrica 46(1):33–50
Koenker R, Hallock KF (2001) Quantile regression. J Econ Perspect 15(4):143–156
Koenker R, Machado JAF (1999) Goodness of Fit and related inference processes for quantile regression. J Am Stat Assoc 94(448):1296–1310
LaBuda MC, DeFries JC (1990) Genetic etiology of reading disability: evidence from a twin study. In: Pavlidis GT (ed) Perspectives on dyslexia. Wiley, New York, pp 47–76
LaBuda MC, DeFries JC, Fulker DW (1986) Multiple regression analysis of twin data obtained from selected samples. Genet Epidemiol 3:425–433
Lubke GH, Dolan CV, Neale MC (2004) Implications of absense of measurement invariance for detecting sex limitation and genotype by environment interaction. Twin Res Hum Genet 7:292–298
Petrill SA, Deater-Deckard K, Thompson LA, DeThorne LS, Schatschneider C (2006) Reading skills in early readers: genetic and shared environmental influences. J Learn Disabil 39(1):48–55
Petscher Y, Kim Y (2011) The utility and accuracy of oral reading fluency score types in predicting reading comprehension. J School Psychol 49:107–129
Plomin R, Kovas Y (2005) Generalist genes and learning disabilities. Psychol Bull 131(4):592–617
Plomin R, DeFries JC, McClearn GE, McGuffin P (2008) Behavioral genetics, 5th edn. Worth Publishers, New York
Purcell S (2002) Variance components models for gene-environment interaction in twin analysis. Twin Res 5(6):554–571
Reeves EB, Lowe J (2009) Quantile regression: an education policy research tool. South Rural Sociol 24(1):175–199
Robertson C, Salter W (1997) The phonological awareness test. LinguiSystems, Inc., East Moline, IL
Rodgers JL, Muster M, Rowe DC (2001) Genetic and environmental influences on delinquency: DF analysis of NLSY kinship data. J Quant Criminol 17(2):145–168
Thomdike RL, Hagen EP, Sattler JM (1986) Stanford-Binet intelligence scale: fourth edition (Technical Manual). Riverside, Chicago
Wicherts JM, Johnson W (2009) Group differences in the heritability of items and test scores. Proc R Soc B. doi:10.1098/rspb.2009.0238
Woodcock RW (1987) Woodcock reading mastery tests—revised. American Guidance Service, Circle Pines
Acknowledgments
Special thanks to Dr. Yaacov Petscher and the anonymous reviewers for helping to shape the paper. This research was supported by the National Institute of Child Health and Human Development (NICHD) Grant HD038075 to The Ohio State University. The content of this publication does not necessarily reflect the views or policies of the NICHD, and mention of trade names, commercial products, or organizations does not imply endorsement by the United States government.
Author information
Authors and Affiliations
Corresponding author
Additional information
Edited by Stacey Cherny.
Appendices
Appendix 1
SAS code: Data setup
************************************************************************* | |
This code requires a data set that has one variable (column) per twin. In this first data preparation step, the data is called in, and the scores on the desired variable for twin 1 and their cotwin (twin 2) are renamed as “Var1” and “Var2”. | |
*************************************************************************; | |
libname df ‘C:\yourpath\’; Data abc; set df.your_dataset; Var1 = your_twin1_variable; Var2 = your_twin2_variable; run; | |
************************************************************************* | |
For this next step, change the words “your_zygosity” to the name of the zygosity variable in your dataset. Change “MZ” to read however MZ is coded in your dataset (i.e., 1). Do the same for DZ. | |
The “proc standard” z-scores the variables (var1 & var2) prior to entry in the analysis. | |
*************************************************************************; | |
data abc2; set abc; if (your_zygosity = MZ) then rel = 1; if (your_zygosity = DZ) then rel = 0.5; run; | |
proc standard data = abc2 m=0 std=1 out = z1; var var1 var2; run; | |
*************************************************************************; |
Appendix 2
SAS code: Cherny method
************************************************************************* | |
The Cherny method can be conducted using the GLM procedure. This is done by using twin 1’s score (var1) to predict twin 2’s score (var2), along with an interaction of twin 1’s score with degree of relatedness (established in Appendix 1). | |
*************************************************************************; | |
proc glm data = abc2; | |
model var2 = var1 rel var1*var1 var1*rel var1*var1*rel; | |
run; quit; | |
************************************************************************* | |
From these results, the parameter estimate associated with: | |
“var1” represents the proportion variance attributable to shared environment | |
“var1*rel” represents the proportion of variance attributable to heritability | |
“var1*var1” represents the linear change of shared environment across the distribution | |
“var1*var1*rel” represents the linear change of heritability across the distribution | |
*************************************************************************; |
Appendix 3
SAS code: Quantile regression
************************************************************************* | |
After setting up the data, run the following code to obtain estimates of heritability and shared environmental effects. This is done using the variable “Rel,” created in step 2 of Appendix 1 (indicating the degree of relatedness). | |
The results are presented graphically via the ODS statements. | |
Replace “quantile = all” with the quantiles desired. For the analyses in the present study, this syntax read: quantile = 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 | |
*************************************************************************; | |
ods html; ods graphics on; proc quantreg ci=sparsity; model var1 = var2 | rel/quantile= all plot = quantplot; run; ods graphics off; ods html close; | |
************************************************************************* | |
The shaded areas on the graph represent 95% confidence intervals of differences from zero. The graph labeled “var2” represents shared environment. The graph labeled “var2*zyg” represents heritability. | |
*************************************************************************; |
Rights and permissions
About this article
Cite this article
Logan, J.A.R., Petrill, S.A., Hart, S.A. et al. Heritability Across the Distribution: An Application of Quantile Regression. Behav Genet 42, 256–267 (2012). https://doi.org/10.1007/s10519-011-9497-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10519-011-9497-7