Abstract
Linkage analysis in multivariate or longitudinal context presents both statistical and computational challenges. The permutation test can be used to avoid some of the statistical challenges, but it substantially adds to the computational burden. Utilizing the distributional dependencies between \( \hat{\pi}\) (defined as the proportion of alleles at a locus that are identical by descent (IBD) for a pairs of relatives, at a given locus) and the permutation test we report a new method of efficient permutation. In summary, the distribution of \( \hat{\pi } \) for a sample of relatives at locus x is estimated as a weighted mixture of \( \hat{\pi } \) drawn from a pool of ‘representative’ \( \hat{\pi } \) distributions observed at other loci. This weighting scheme is then used to sample from the distribution of the permutation tests at the representative loci to obtain an empirical P-value at locus x (which is asymptotically distributed as the permutation test at loci x). This weighted mixture approach greatly reduces the number of permutation tests required for genome-wide scanning, making it suitable for use in multivariate and other computationally intensive linkage analyses. In addition, because the distribution of \( \hat{\pi } \) is a property of the genotypic data for a given sample and is independent of the phenotypic data, the weighting scheme can be applied to any phenotype (or combination of phenotypes) collected from that sample. We demonstrate the validity of this approach through simulation.
Similar content being viewed by others
References
Abecasis GR, Cherny SS, Cookson WO, Cardon LR (2002) Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 30(1):97–101. doi:10.1038/ng786
Amos CI (1994) Robust variance-components approach for assessing genetic linkage in pedigrees. Am J Hum Genet 54(3):535–543
Amos C, de Andrade M, Zhu D (2001) Comparison of multivariate tests for genetic linkage. Hum Hered 51(3):133–144. doi:10.1159/000053334
Besag J, Clifford P (1991) Sequential Monte Carlo P-values. Biometrika 79:301–304
Churchill GA, Doerge RW (1994) Empirical threshold values for quantitative trait mapping. Genetics 138(3):963–971
Cornish E, Fisher RA (1938) Moments and cumulants in the specification of distributions. Rev Inst Int Statist 5:307–320. doi:10.2307/1400905
Gentry J, Kagen M (2006). Genefinder: Finds genes that have similar patterns of expression: R: Bioconductor. http://rss.acs.unt.edu/Rdoc/library/genefilter/html/genefinder.html. Accessed 28 Jan 2008
Iturria SJ, Williams JT, Almasy L, Dyer TD, Blangero J (1999) An empirical test of the significance of an observed quantitative trait locus effect that preserves additive genetic variation. Genet Epidemiol 17(suppl 1):S169–S173
Kong A, Cox NJ (1997) Allele-sharing models: LOD scores and accurate linkage tests. Am J Hum Genet 61(5):1179–1188. doi:10.1086/301592
Kuo PH, Neale MC, Riley BP, Patterson DG, Walsh D, Prescott CA et al (2007) A genome-wide linkage analysis for the personality trait neuroticism in the Irish affected sib-pair study of alcohol dependence. Am J Med Genet B Neuropsychiatr Genet 144(4):463–468. doi:10.1002/ajmg.b.30478
Kuo PH, Neale MC, Riley BP, Webb BT, Sullivan PF, Vittum J et al (2006) Identification of susceptibility loci for alcohol-related traits in the Irish affected sib pair study of alcohol dependence. Alcohol Clin Exp Res 30(11):1807–1816. doi:10.1111/j.1530-0277.2006.00217.x
Li M, Boehnke M, Abecasis GR, Song PX (2006) Quantitative trait linkage analysis using Gaussian copulas. Genetics 173(4):2317–2327. doi:10.1534/genetics.105.054650
Neale BM, Sham PC (2004) The future of association studies: gene-based analysis and replication. Am J Hum Genet 75(3):353–362. doi:10.1086/423901
Neale MC, Lubke G, Aggen SH, Dolan CV (2005) Problems with using sum scores for estimating variance components: contamination and measurement non-invariance. Twin Res Human Genet 8(6):553–568. doi:10.1375/twin.8.6.553
Neale MC, Aggen SH, Maes HH, Kubarych TS, Schmitt JE (2006a) Methodological issues in the assessment of substance use phenotypes. Addict Behav 31(6):1010–1034. doi:10.1016/j.addbeh.2006.03.047
Neale MC, Boker SM, Xie G, Maes HH (2006b) Mx: statistical modeling, 6th edn. Richmond, VA 23298: Department of Psychiatry, VCU. http://www.vcu.edu/mx/. Accessed 28 Jan 2008
Ott J (1989) Computer-simulation methods in human linkage analysis. Proc Natl Acad Sci USA 86(11):4175–4178. doi:10.1073/pnas.86.11.4175
Peng J, Siegmund D (2006) QTL mapping under ascertainment. Ann Hum Genet 70(Pt 6):867–881. doi:10.1111/j.1469-1809.2006.00286.x
Prescott C, Sullivan P, Kuo P, Webb B, Vittum J, Patterson D et al (2006) Geome-wide linkage study in the Irish affected sib pair study of alcohol dependence: evidence for a susceptibility region for symptoms of alcohol dependence on chromosome 4. Mol Psychiatr 11:603–611. doi:10.1038/sj.mp.4001811
Self S, Liang K (1987) Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. J Am Stat Assn 82:605–610. doi:10.2307/2289471
Song KK, Weeks DE, Sobel E, Feingold E (2004) Efficient simulation of P values for linkage analysis. Genet Epidemiol 26(2):88–96. doi:10.1002/gepi.10296
Terwilliger JD, Ott J (1992) A multisample bootstrap approach to the estimation of maximized-over-models lod score distributions. Cytogenet Cell Genet 59(2–3):142–144. doi:10.1159/000133228
Visscher PM (2006) A note on the asymptotic distribution of likelihood ratio tests to test variance components. Twin Res Human Genet 9(4):490–495. doi:10.1375/twin.9.4.490
Visscher PM, Medland SE, Ferreira MA, Morley KI, Zhu G, Cornes BK et al (2006) Assumption-free estimation of heritability from genome-wide identity-by-descent sharing between full siblings. PLOS Genet 2(3):e41. doi:10.1371/journal.pgen.0020041
Wan Y, Cohen J, Guerra R (1997) A permutation test for the robust sib-pair linkage method. Ann Hum Genet 61(Pt 1):79–87. doi:10.1017/S0003480096005957
Wang T, Elston RC (2007) Regression-based multivariate linkage analysis with an application to blood pressure and body mass index. Ann Hum Genet 71(Pt 1):96–106. doi:10.1111/j.1469-1809.2006.00303.x
Wigginton JE, Abecasis GR (2006) An evaluation of the replicate pool method: quick estimation of genome-wide linkage peak P-values. Genet Epidemiol 30(4):320–332. doi:10.1002/gepi.20147
Zou F, Fine JP, Hu J, Lin DY (2004) An efficient resampling method for assessing genome-wide statistical significance in mapping quantitative trait loci. Genetics 168(4):2307–2316. doi:10.1534/genetics.104.031427
Acknowledgments
This research was supported in part by NIH (USA) grant DA18673 awarded to MCN. SEM is supported by an Australian NHMRC Sidney Sax fellowship (443036).The authors would like to thank the reviewers for their helpful comments.
Author information
Authors and Affiliations
Corresponding author
Additional information
Edited by David Allison.
Rights and permissions
About this article
Cite this article
Medland, S.E., Schmitt, J.E., Webb, B.T. et al. Efficient Calculation of Empirical P-values for Genome-Wide Linkage Analysis Through Weighted Permutation. Behav Genet 39, 91–100 (2009). https://doi.org/10.1007/s10519-008-9229-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10519-008-9229-9