Efficient Calculation of Empirical P-values for Genome-Wide Linkage Analysis Through Weighted Permutation

Medland, Sarah E.; Schmitt, James E.; Webb, Bradley T.; Kuo, Po-Hsiu; Neale, Michael C.

doi:10.1007/s10519-008-9229-9

Efficient Calculation of Empirical P-values for Genome-Wide Linkage Analysis Through Weighted Permutation

Original Research
Published: 23 September 2008

Volume 39, pages 91–100, (2009)
Cite this article

Behavior Genetics Aims and scope Submit manuscript

Sarah E. Medland^1,2,3,
James E. Schmitt^1,3,
Bradley T. Webb^1,4,
Po-Hsiu Kuo^1,3 &
…
Michael C. Neale^1,3,5,6

279 Accesses
5 Citations
1 Altmetric
Explore all metrics

Abstract

Linkage analysis in multivariate or longitudinal context presents both statistical and computational challenges. The permutation test can be used to avoid some of the statistical challenges, but it substantially adds to the computational burden. Utilizing the distributional dependencies between \( \hat{\pi}\) (defined as the proportion of alleles at a locus that are identical by descent (IBD) for a pairs of relatives, at a given locus) and the permutation test we report a new method of efficient permutation. In summary, the distribution of \( \hat{\pi } \) for a sample of relatives at locus x is estimated as a weighted mixture of \( \hat{\pi } \) drawn from a pool of ‘representative’ \( \hat{\pi } \) distributions observed at other loci. This weighting scheme is then used to sample from the distribution of the permutation tests at the representative loci to obtain an empirical P-value at locus x (which is asymptotically distributed as the permutation test at loci x). This weighted mixture approach greatly reduces the number of permutation tests required for genome-wide scanning, making it suitable for use in multivariate and other computationally intensive linkage analyses. In addition, because the distribution of \( \hat{\pi } \) is a property of the genotypic data for a given sample and is independent of the phenotypic data, the weighting scheme can be applied to any phenotype (or combination of phenotypes) collected from that sample. We demonstrate the validity of this approach through simulation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Violating the normality assumption may be the lesser of two evils

Article Open access 07 May 2021

Overview of Statistical Methods for Genome-Wide Association Studies (GWAS)

VARista: a free web platform for streamlined whole-genome variant analysis across T2T, hg38, and hg19

Article 12 April 2024

References

Abecasis GR, Cherny SS, Cookson WO, Cardon LR (2002) Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 30(1):97–101. doi:10.1038/ng786
Article PubMed CAS Google Scholar
Amos CI (1994) Robust variance-components approach for assessing genetic linkage in pedigrees. Am J Hum Genet 54(3):535–543
PubMed CAS Google Scholar
Amos C, de Andrade M, Zhu D (2001) Comparison of multivariate tests for genetic linkage. Hum Hered 51(3):133–144. doi:10.1159/000053334
Article PubMed CAS Google Scholar
Besag J, Clifford P (1991) Sequential Monte Carlo P-values. Biometrika 79:301–304
Google Scholar
Churchill GA, Doerge RW (1994) Empirical threshold values for quantitative trait mapping. Genetics 138(3):963–971
PubMed CAS Google Scholar
Cornish E, Fisher RA (1938) Moments and cumulants in the specification of distributions. Rev Inst Int Statist 5:307–320. doi:10.2307/1400905
Article Google Scholar
Gentry J, Kagen M (2006). Genefinder: Finds genes that have similar patterns of expression: R: Bioconductor. http://rss.acs.unt.edu/Rdoc/library/genefilter/html/genefinder.html. Accessed 28 Jan 2008
Iturria SJ, Williams JT, Almasy L, Dyer TD, Blangero J (1999) An empirical test of the significance of an observed quantitative trait locus effect that preserves additive genetic variation. Genet Epidemiol 17(suppl 1):S169–S173
PubMed Google Scholar
Kong A, Cox NJ (1997) Allele-sharing models: LOD scores and accurate linkage tests. Am J Hum Genet 61(5):1179–1188. doi:10.1086/301592
Article PubMed CAS Google Scholar
Kuo PH, Neale MC, Riley BP, Patterson DG, Walsh D, Prescott CA et al (2007) A genome-wide linkage analysis for the personality trait neuroticism in the Irish affected sib-pair study of alcohol dependence. Am J Med Genet B Neuropsychiatr Genet 144(4):463–468. doi:10.1002/ajmg.b.30478
Google Scholar
Kuo PH, Neale MC, Riley BP, Webb BT, Sullivan PF, Vittum J et al (2006) Identification of susceptibility loci for alcohol-related traits in the Irish affected sib pair study of alcohol dependence. Alcohol Clin Exp Res 30(11):1807–1816. doi:10.1111/j.1530-0277.2006.00217.x
Article PubMed CAS Google Scholar
Li M, Boehnke M, Abecasis GR, Song PX (2006) Quantitative trait linkage analysis using Gaussian copulas. Genetics 173(4):2317–2327. doi:10.1534/genetics.105.054650
Article PubMed CAS Google Scholar
Neale BM, Sham PC (2004) The future of association studies: gene-based analysis and replication. Am J Hum Genet 75(3):353–362. doi:10.1086/423901
Article PubMed CAS Google Scholar
Neale MC, Lubke G, Aggen SH, Dolan CV (2005) Problems with using sum scores for estimating variance components: contamination and measurement non-invariance. Twin Res Human Genet 8(6):553–568. doi:10.1375/twin.8.6.553
Article Google Scholar
Neale MC, Aggen SH, Maes HH, Kubarych TS, Schmitt JE (2006a) Methodological issues in the assessment of substance use phenotypes. Addict Behav 31(6):1010–1034. doi:10.1016/j.addbeh.2006.03.047
Article PubMed Google Scholar
Neale MC, Boker SM, Xie G, Maes HH (2006b) Mx: statistical modeling, 6th edn. Richmond, VA 23298: Department of Psychiatry, VCU. http://www.vcu.edu/mx/. Accessed 28 Jan 2008
Ott J (1989) Computer-simulation methods in human linkage analysis. Proc Natl Acad Sci USA 86(11):4175–4178. doi:10.1073/pnas.86.11.4175
Article PubMed CAS Google Scholar
Peng J, Siegmund D (2006) QTL mapping under ascertainment. Ann Hum Genet 70(Pt 6):867–881. doi:10.1111/j.1469-1809.2006.00286.x
Article PubMed CAS Google Scholar
Prescott C, Sullivan P, Kuo P, Webb B, Vittum J, Patterson D et al (2006) Geome-wide linkage study in the Irish affected sib pair study of alcohol dependence: evidence for a susceptibility region for symptoms of alcohol dependence on chromosome 4. Mol Psychiatr 11:603–611. doi:10.1038/sj.mp.4001811
Article CAS Google Scholar
Self S, Liang K (1987) Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. J Am Stat Assn 82:605–610. doi:10.2307/2289471
Article Google Scholar
Song KK, Weeks DE, Sobel E, Feingold E (2004) Efficient simulation of P values for linkage analysis. Genet Epidemiol 26(2):88–96. doi:10.1002/gepi.10296
Article PubMed Google Scholar
Terwilliger JD, Ott J (1992) A multisample bootstrap approach to the estimation of maximized-over-models lod score distributions. Cytogenet Cell Genet 59(2–3):142–144. doi:10.1159/000133228
Article PubMed CAS Google Scholar
Visscher PM (2006) A note on the asymptotic distribution of likelihood ratio tests to test variance components. Twin Res Human Genet 9(4):490–495. doi:10.1375/twin.9.4.490
Article Google Scholar
Visscher PM, Medland SE, Ferreira MA, Morley KI, Zhu G, Cornes BK et al (2006) Assumption-free estimation of heritability from genome-wide identity-by-descent sharing between full siblings. PLOS Genet 2(3):e41. doi:10.1371/journal.pgen.0020041
Article PubMed Google Scholar
Wan Y, Cohen J, Guerra R (1997) A permutation test for the robust sib-pair linkage method. Ann Hum Genet 61(Pt 1):79–87. doi:10.1017/S0003480096005957
PubMed CAS Google Scholar
Wang T, Elston RC (2007) Regression-based multivariate linkage analysis with an application to blood pressure and body mass index. Ann Hum Genet 71(Pt 1):96–106. doi:10.1111/j.1469-1809.2006.00303.x
Article PubMed CAS Google Scholar
Wigginton JE, Abecasis GR (2006) An evaluation of the replicate pool method: quick estimation of genome-wide linkage peak P-values. Genet Epidemiol 30(4):320–332. doi:10.1002/gepi.20147
Article PubMed Google Scholar
Zou F, Fine JP, Hu J, Lin DY (2004) An efficient resampling method for assessing genome-wide statistical significance in mapping quantitative trait loci. Genetics 168(4):2307–2316. doi:10.1534/genetics.104.031427
Article PubMed CAS Google Scholar

Download references

Acknowledgments

This research was supported in part by NIH (USA) grant DA18673 awarded to MCN. SEM is supported by an Australian NHMRC Sidney Sax fellowship (443036).The authors would like to thank the reviewers for their helpful comments.

Author information

Authors and Affiliations

Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA, USA
Sarah E. Medland, James E. Schmitt, Bradley T. Webb, Po-Hsiu Kuo & Michael C. Neale
Genetic Epidemiology, Queensland Institute of Medical Research, Brisbane, Australia
Sarah E. Medland
Department of Psychiatry, Medical College of Virginia, Virginia Commonwealth University, P.O. Box 980126, Richmond, VA, 23298-0126, USA
Sarah E. Medland, James E. Schmitt, Po-Hsiu Kuo & Michael C. Neale
Department of Pharmacy, Virginia Commonwealth University, Richmond, VA, USA
Bradley T. Webb
Department of Human Genetics, Virginia Commonwealth University, Richmond, VA, USA
Michael C. Neale
Department of Psychology, Virginia Commonwealth University, Richmond, VA, USA
Michael C. Neale

Authors

Sarah E. Medland
View author publications
You can also search for this author in PubMed Google Scholar
James E. Schmitt
View author publications
You can also search for this author in PubMed Google Scholar
Bradley T. Webb
View author publications
You can also search for this author in PubMed Google Scholar
Po-Hsiu Kuo
View author publications
You can also search for this author in PubMed Google Scholar
Michael C. Neale
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sarah E. Medland.

Additional information

Edited by David Allison.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Medland, S.E., Schmitt, J.E., Webb, B.T. et al. Efficient Calculation of Empirical P-values for Genome-Wide Linkage Analysis Through Weighted Permutation. Behav Genet 39, 91–100 (2009). https://doi.org/10.1007/s10519-008-9229-9

Download citation

Received: 28 January 2008
Accepted: 03 September 2008
Published: 23 September 2008
Issue Date: January 2009
DOI: https://doi.org/10.1007/s10519-008-9229-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Efficient Calculation of Empirical P-values for Genome-Wide Linkage Analysis Through Weighted Permutation

Abstract

Access this article

Similar content being viewed by others

Violating the normality assumption may be the lesser of two evils

Overview of Statistical Methods for Genome-Wide Association Studies (GWAS)

VARista: a free web platform for streamlined whole-genome variant analysis across T2T, hg38, and hg19

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Efficient Calculation of Empirical P-values for Genome-Wide Linkage Analysis Through Weighted Permutation

Abstract

Access this article

Similar content being viewed by others

Violating the normality assumption may be the lesser of two evils

Overview of Statistical Methods for Genome-Wide Association Studies (GWAS)

VARista: a free web platform for streamlined whole-genome variant analysis across T2T, hg38, and hg19

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation