Skip to main content
Log in

Heritability Across the Distribution: An Application of Quantile Regression

  • Original Research
  • Published:
Behavior Genetics Aims and scope Submit manuscript

Abstract

We introduce a new method for analyzing twin data called quantile regression. Through the application presented here, quantile regression is able to assess the genetic and environmental etiology of any skill or ability, at multiple points in the distribution of that skill or ability. This method is compared to the Cherny et al. (Behav Genet 22:153–162, 1992) method in an application to four different reading-related outcomes in 304 pairs of first-grade same sex twins enrolled in the Western Reserve Reading Project. Findings across the two methods were similar; both indicated some variation across the distribution of the genetic and shared environmental influences on non-word reading. However, quantile regression provides more details about the location and size of the measured effect. Applications of the technique are discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Catts HW, Petscher Y, Schatschneider C, Bridges MS, Mendoza K (2009) Floor effects associated with universal screening and their impact on the early identification of reading disabilities. J Learn Disabil 42:163–176

    Article  PubMed  Google Scholar 

  • Cherny SS, Cardon LR, Fulker DW, DeFries JC (1992) Differential heritability across levels of cognitive ability. Behav Genet 22(2):153–162

    Article  PubMed  Google Scholar 

  • Clifford CA, Hopper JL, Fulker DW, Murray RM (1984) A genetic and environmental analysis of a twin family study of alcohol use, anxiety, and depression. Genet Epidemiol 1:63–79

    Article  PubMed  Google Scholar 

  • Cohen J (1988) Statistical power analysis for the behavioral sciences, 2nd edn. Lawrence Earlbaum Associates, Hillsdale, NJ

  • Deater-Deckard K, Reiss D, Hetherington EM, Plomin R (1997) Dimensions and disorders of adolescent adjustment: a quantitative genetic analysis of unselected samples and selected extremes. J Child Psychol Psychiatry 38(5):515–525. doi:10.1111/j.1469-7610.1997.tb01538.x

    Article  PubMed  Google Scholar 

  • DeFries JC, Fulker DW (1985) Multiple regression analysis of twin data. Behav Genet 15(5):467–473

    Article  PubMed  Google Scholar 

  • DeFries JC, Fulker DW (1988) Multiple regression analysis of twin data: etiology of deviant scores versus individual differences. Acta Genet Med Gemellol Twin Res 37(3–4):205–216

    Google Scholar 

  • Eaves L (2006) Genotype x environment interaction in psychopathology: fact or artifact? Twin Res Hum Genet 9(1):1–8

    Article  PubMed  Google Scholar 

  • Firpo S (2007) Efficient semiparametric estimation of quantile treatment effects. Econometrica 75(1):259–276

    Article  Google Scholar 

  • Goldsmith HH (1991) A zygosity questionnaire for young twins: a research note. Behav Genet 21(3):257–269

    Article  PubMed  Google Scholar 

  • Gottesman II, Shields J (1973) Genetic theorizing and schitzophrenia. Br J Psychiatry 122:15–30

    Article  PubMed  Google Scholar 

  • Koenker R, Bassett G (1978) Regression quantiles. Econometrica 46(1):33–50

    Article  Google Scholar 

  • Koenker R, Hallock KF (2001) Quantile regression. J Econ Perspect 15(4):143–156

    Article  Google Scholar 

  • Koenker R, Machado JAF (1999) Goodness of Fit and related inference processes for quantile regression. J Am Stat Assoc 94(448):1296–1310

    Article  Google Scholar 

  • LaBuda MC, DeFries JC (1990) Genetic etiology of reading disability: evidence from a twin study. In: Pavlidis GT (ed) Perspectives on dyslexia. Wiley, New York, pp 47–76

    Google Scholar 

  • LaBuda MC, DeFries JC, Fulker DW (1986) Multiple regression analysis of twin data obtained from selected samples. Genet Epidemiol 3:425–433

    Article  PubMed  Google Scholar 

  • Lubke GH, Dolan CV, Neale MC (2004) Implications of absense of measurement invariance for detecting sex limitation and genotype by environment interaction. Twin Res Hum Genet 7:292–298

    Article  Google Scholar 

  • Petrill SA, Deater-Deckard K, Thompson LA, DeThorne LS, Schatschneider C (2006) Reading skills in early readers: genetic and shared environmental influences. J Learn Disabil 39(1):48–55

    Article  PubMed  Google Scholar 

  • Petscher Y, Kim Y (2011) The utility and accuracy of oral reading fluency score types in predicting reading comprehension. J School Psychol 49:107–129

    Article  Google Scholar 

  • Plomin R, Kovas Y (2005) Generalist genes and learning disabilities. Psychol Bull 131(4):592–617

    Article  PubMed  Google Scholar 

  • Plomin R, DeFries JC, McClearn GE, McGuffin P (2008) Behavioral genetics, 5th edn. Worth Publishers, New York

    Google Scholar 

  • Purcell S (2002) Variance components models for gene-environment interaction in twin analysis. Twin Res 5(6):554–571

    Article  PubMed  Google Scholar 

  • Reeves EB, Lowe J (2009) Quantile regression: an education policy research tool. South Rural Sociol 24(1):175–199

    Google Scholar 

  • Robertson C, Salter W (1997) The phonological awareness test. LinguiSystems, Inc., East Moline, IL

  • Rodgers JL, Muster M, Rowe DC (2001) Genetic and environmental influences on delinquency: DF analysis of NLSY kinship data. J Quant Criminol 17(2):145–168

    Article  Google Scholar 

  • Thomdike RL, Hagen EP, Sattler JM (1986) Stanford-Binet intelligence scale: fourth edition (Technical Manual). Riverside, Chicago

  • Wicherts JM, Johnson W (2009) Group differences in the heritability of items and test scores. Proc R Soc B. doi:10.1098/rspb.2009.0238

  • Woodcock RW (1987) Woodcock reading mastery tests—revised. American Guidance Service, Circle Pines

    Google Scholar 

Download references

Acknowledgments

Special thanks to Dr. Yaacov Petscher and the anonymous reviewers for helping to shape the paper. This research was supported by the National Institute of Child Health and Human Development (NICHD) Grant HD038075 to The Ohio State University. The content of this publication does not necessarily reflect the views or policies of the NICHD, and mention of trade names, commercial products, or organizations does not imply endorsement by the United States government.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jessica A. R. Logan.

Additional information

Edited by Stacey Cherny.

Appendices

Appendix 1

SAS code: Data setup

*************************************************************************

This code requires a data set that has one variable (column) per twin. In this first data preparation step, the data is called in, and the scores on the desired variable for twin 1 and their cotwin (twin 2) are renamed as “Var1” and “Var2”.

*************************************************************************;

libname df ‘C:\yourpath\’;

Data abc; set df.your_dataset;

Var1 = your_twin1_variable;

Var2 = your_twin2_variable; run;

*************************************************************************

For this next step, change the words “your_zygosity” to the name of the zygosity variable in your dataset. Change “MZ” to read however MZ is coded in your dataset (i.e., 1). Do the same for DZ.

The “proc standard” z-scores the variables (var1 & var2) prior to entry in the analysis.

*************************************************************************;

data abc2; set abc;

if (your_zygosity = MZ) then rel = 1;

if (your_zygosity = DZ) then rel = 0.5;

run;

proc standard data = abc2 m=0 std=1 out = z1;

var var1 var2; run;

*************************************************************************;

Appendix 2

SAS code: Cherny method

*************************************************************************

The Cherny method can be conducted using the GLM procedure. This is done by using twin 1’s score (var1) to predict twin 2’s score (var2), along with an interaction of twin 1’s score with degree of relatedness (established in Appendix 1).

*************************************************************************;

proc glm data = abc2;

model var2 = var1 rel var1*var1 var1*rel var1*var1*rel;

run; quit;

*************************************************************************

From these results, the parameter estimate associated with:

        “var1” represents the proportion variance attributable to shared environment

        “var1*rel” represents the proportion of variance attributable to heritability

        “var1*var1” represents the linear change of shared environment across the distribution

        “var1*var1*rel” represents the linear change of heritability across the distribution

*************************************************************************;

Appendix 3

SAS code: Quantile regression

*************************************************************************

After setting up the data, run the following code to obtain estimates of heritability and shared environmental effects.

This is done using the variable “Rel,” created in step 2 of Appendix 1 (indicating the degree of relatedness).

The results are presented graphically via the ODS statements.

Replace “quantile = all” with the quantiles desired. For the analyses in the present study, this syntax read:

quantile = 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90

*************************************************************************;

ods html;

ods graphics on;

proc quantreg ci=sparsity;

model var1 = var2 | rel/quantile= all plot = quantplot;

run;

ods graphics off;

ods html close;

*************************************************************************

The shaded areas on the graph represent 95% confidence intervals of differences from zero.

The graph labeled “var2” represents shared environment.

The graph labeled “var2*zyg” represents heritability.

*************************************************************************;

Rights and permissions

Reprints and permissions

About this article

Cite this article

Logan, J.A.R., Petrill, S.A., Hart, S.A. et al. Heritability Across the Distribution: An Application of Quantile Regression. Behav Genet 42, 256–267 (2012). https://doi.org/10.1007/s10519-011-9497-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10519-011-9497-7

Keywords

Navigation