Detecting genomic clustering of risk variants from sequence data: cases versus controls

Schaid, Daniel J.; Sinnwell, Jason P.; McDonnell, Shannon K.; Thibodeau, Stephen N.

doi:10.1007/s00439-013-1335-y

Detecting genomic clustering of risk variants from sequence data: cases versus controls

Original Investigation
Published: 11 July 2013

Volume 132, pages 1301–1309, (2013)
Cite this article

Human Genetics Aims and scope Submit manuscript

Daniel J. Schaid¹,
Jason P. Sinnwell¹,
Shannon K. McDonnell¹ &
…
Stephen N. Thibodeau²

12 Citations
1 Altmetric
Explore all metrics

Abstract

As the ability to measure dense genetic markers approaches the limit of the DNA sequence itself, taking advantage of possible clustering of genetic variants in, and around, a gene would benefit genetic association analyses, and likely provide biological insights. The greatest benefit might be realized when multiple rare variants cluster in a functional region. Several statistical tests have been developed, one of which is based on the popular Kulldorff scan statistic for spatial clustering of disease. We extended another popular spatial clustering method—Tango’s statistic—to genomic sequence data. An advantage of Tango’s method is that it is rapid to compute, and when single test statistic is computed, its distribution is well approximated by a scaled χ ² distribution, making computation of p values very rapid. We compared the Type-I error rates and power of several clustering statistics, as well as the omnibus sequence kernel association test. Although our version of Tango’s statistic, which we call “Kernel Distance” statistic, took approximately half the time to compute than the Kulldorff scan statistic, it had slightly less power than the scan statistic. Our results showed that the Ionita-Laza version of Kulldorff’s scan statistic had the greatest power over a range of clustering scenarios.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts

Article 18 May 2020

Detecting Multiethnic Rare Variants

Interpretation of Manhattan Plots and Other Outputs of Genome-Wide Association Studies

References

Asimit J, Zeggini E (2010) Rare variant association analysis methods for complex traits. Annu Rev Genet 44:293–308
Article PubMed CAS Google Scholar
Bansal V, Libiger O, Torkamani A, Schork NJ (2010) Statistical analysis strategies for association studies involving rare variants. Nat Rev Genet 11(11):773–785
Article PubMed CAS Google Scholar
Basu S, Pan W (2010) Comparison of statistical tests for disease association with rare variants. Genet Epidem 35:606–619
Article Google Scholar
Breslow NE, Day NE (1980) The analysis of case–control studies. Inter Agency Res Cancer, Lyon
Google Scholar
Fier H, Won S, Prokopenko D, Alchawa T, Ludwig KU, Fimmers R, Silverman EK, Pagano M, Mangold E, Lange C (2012) ‘Location, location, location’: a spatial approach for rare variant analysis and an application to a study on non-syndromic cleft lip with or without cleft palate. Bioinformatics 28(23):3027–3033
Article PubMed CAS Google Scholar
Ionita-Laza I, Makarov V, Buxbaum JD (2012) Scan-statistic approach identifies clusters of rare disease variants in LRP2, a gene linked and associated with autism spectrum disorders, in three datasets. Amer J Hum Genet 90(6):1002–1013
Article PubMed CAS Google Scholar
Kulldorff M (2007) A spatial scan statistic. Commun Stat Theo Meth 26:1481–1496
Article Google Scholar
Kwee LC, Liu D, Lin X, Ghosh D, Epstein MP (2008) A powerful and flexible multilocus association test for quantitative traits. Am J Hum Genet 82(2):386–397
Article PubMed CAS Google Scholar
Lee S, Emond MJ, Bamshad MJ, Barnes KC, Rieder MJ, Nickerson DA, Christiani DC, Wurfel MM, Lin X (2012a) Optimal unified approach for rare-variant association testing with application to small-sample case–control whole-exome sequencing studies. Am J Hum Genet 91(2):224–237
Article PubMed CAS Google Scholar
Lee S, Wu MC, Lin X (2012b) Optimal tests for rare variant effects in sequencing association studies. Biostatistics 13(4):762–775
Article PubMed Google Scholar
Mantel N (1967) The detection of disease clustering and a generalized regression approach. Can Res 27:209–220
CAS Google Scholar
Ruppert D, Wand P, Carroll R (2005) Semi parametric regression. Cambridge University Press, Cambridge
Google Scholar
Tango T (1984) The detection of clustering of disease in time. Biometrics 40:15–26
Article PubMed CAS Google Scholar
Tango T (2000) A test for spatial disease clustering adjusted for multiple testing. Stat Med 19(2):191–204
Article PubMed CAS Google Scholar
Tango T (2010) Statistical methods for disease clustering. Springer, New York
Book Google Scholar
Wu MC, Lee S, Cai T, Li Y, Boehnke M, Lin X (2011) Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet 89(1):82–93
Article PubMed CAS Google Scholar

Download references

Acknowledgments

This research was supported by the US Public Health Service, National Institutes of Health (NIH), contract Grant Number GM065450 (DJS, JPS).

Author information

Authors and Affiliations

Division of Biomedical Statistics and Informatics, Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Daniel J. Schaid, Jason P. Sinnwell & Shannon K. McDonnell
Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
Stephen N. Thibodeau

Authors

Daniel J. Schaid
View author publications
You can also search for this author in PubMed Google Scholar
Jason P. Sinnwell
View author publications
You can also search for this author in PubMed Google Scholar
Shannon K. McDonnell
View author publications
You can also search for this author in PubMed Google Scholar
Stephen N. Thibodeau
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daniel J. Schaid.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Schaid, D.J., Sinnwell, J.P., McDonnell, S.K. et al. Detecting genomic clustering of risk variants from sequence data: cases versus controls. Hum Genet 132, 1301–1309 (2013). https://doi.org/10.1007/s00439-013-1335-y

Download citation

Received: 03 April 2013
Accepted: 02 July 2013
Published: 11 July 2013
Issue Date: November 2013
DOI: https://doi.org/10.1007/s00439-013-1335-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Detecting genomic clustering of risk variants from sequence data: cases versus controls

Abstract

Access this article

Similar content being viewed by others

Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts

Detecting Multiethnic Rare Variants

Interpretation of Manhattan Plots and Other Outputs of Genome-Wide Association Studies

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Detecting genomic clustering of risk variants from sequence data: cases versus controls

Abstract

Access this article

Similar content being viewed by others

Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts

Detecting Multiethnic Rare Variants

Interpretation of Manhattan Plots and Other Outputs of Genome-Wide Association Studies

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation