A Compositional Approach to Allele Sharing Analysis

Galván-Femenía, I.; Graffelman, J.; Barceló-i-Vidal, C.

doi:10.1007/978-3-319-44811-4_5

Part of the book series: Springer Proceedings in Mathematics & Statistics (PROMS, volume 187)

Included in the following conference series:

International Workshop on Compositional Data Analysis

Abstract

Relatedness is of great interest in population-based genetic association studies. These studies search for genetic factors related to disease. Many statistical methods used in population-based genetic association studies (such as standard regression models, t-tests, and logistic regression) assume that the observations (individuals) are independent. These techniques can fail if independence is not satisfied. Allele sharing is a powerful data analysis technique for analyzing the degree of dependence in diploid species. Two individuals can share 0, 1, or 2 alleles for any genetic marker. This sharing may be assessed for alleles identical by state (IBS) or identical by descent (IBD). Starting from IBS alleles, it is possible to detect the type of relationship between a pair of individuals by using graphical methods. Typical allele sharing analysis consists of plotting the fraction of loci sharing 2 IBS alleles versus the fraction sharing 0 IBS alleles. Compositional data analysis can be applied to allele sharing analysis because the proportions of loci sharing 0, 1, or 2 IBS alleles (denoted by \(p_0\), \(p_1\), and \(p_2\)) form a 3-part composition. This chapter provides a graphical method to detect family relationships by plotting the isometric log-ratio transformation of \(p_0\), \(p_1\), and \(p_2\). On the other hand, the probabilities of sharing 0, 1, or 2 IBD alleles (denoted by \(k_0, k_1, k_2\)), which are termed Cotterman’s coefficients, depend on the relatedness: monozygotic twins, full siblings, parent-offspring, avuncular, first cousins, etc. It is possible to infer the type of family relationship of a pair of individuals by using maximum likelihood methods. As a result, the estimated vector \({\hat{\varvec{k}}}=(\hat{k}_0, \hat{k}_1,\hat{k}_2)\) for each pair of individuals forms a 3-part composition and can be plotted in a ternary diagram to identify the degree of relatedness.
An R package has been developed for the study of genetic relatedness based on genetic markers such as microsatellites and single nucleotide polymorphisms from human populations, and is used for the computations and graphics of this contribution.
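
As an illustration of the IBS part of the abstract, the following is a minimal Python sketch and not the authors' R package. It assumes biallelic markers coded as allele counts 0, 1, or 2, in which case the number of IBS alleles shared at a locus is 2 minus the absolute genotype difference. The function names `ibs_proportions` and `ilr3` and the toy genotypes are hypothetical. The particular orthonormal basis used for the isometric log-ratio coordinates is also an assumption, since the abstract does not say which basis the chapter uses, and the transformation is only defined when all three proportions are strictly positive.

```python
import math

def ibs_proportions(g1, g2):
    """Return (p0, p1, p2): fractions of loci sharing 0, 1, or 2 IBS alleles.

    Assumes biallelic genotypes coded as allele counts 0/1/2 (an assumption
    of this sketch), so shared IBS alleles at a locus = 2 - |g1 - g2|.
    """
    if len(g1) != len(g2) or len(g1) == 0:
        raise ValueError("genotype vectors must be non-empty and of equal length")
    counts = [0, 0, 0]
    for a, b in zip(g1, g2):
        counts[2 - abs(a - b)] += 1
    n = len(g1)
    return tuple(c / n for c in counts)

def ilr3(p0, p1, p2):
    """Isometric log-ratio coordinates of a 3-part composition.

    Uses one possible orthonormal basis (assumed here, not taken from the
    chapter): z1 contrasts p0 with p1, z2 contrasts their geometric mean
    with p2. All parts must be strictly positive.
    """
    if min(p0, p1, p2) <= 0:
        raise ValueError("ilr is undefined when a part is zero or negative")
    z1 = math.log(p0 / p1) / math.sqrt(2)
    z2 = math.sqrt(2 / 3) * math.log(math.sqrt(p0 * p1) / p2)
    return z1, z2

# Toy genotypes for one pair of individuals (hypothetical data).
ind_a = [0, 1, 2, 2, 1, 0, 1, 2, 0, 1]
ind_b = [2, 1, 2, 1, 1, 0, 0, 2, 0, 2]

p0, p1, p2 = ibs_proportions(ind_a, ind_b)
z1, z2 = ilr3(p0, p1, p2)
print(p0, p1, p2)
print(z1, z2)
```

Plotting `z1` against `z2` for every pair of individuals would give a scatterplot in unconstrained real coordinates. Pairs with a zero proportion (for example parent-offspring pairs, which always share at least one IBS allele) would need a zero-handling step that this sketch does not attempt.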

Acknowledgments

We thank the referees and the editors for their comments on the manuscript. This study was supported by grant CODARSS MTM2012-33236 (2013–2015) of the Spanish Ministry of Education and Science.

Author information

Authors and Affiliations

Department of Computer Science, Applied Mathematics and Statistics, Universitat de Girona, Campus Montilivi P-IV, 17071, Girona, Spain
I. Galván-Femenía & C. Barceló-i-Vidal
Department of Statistics and Operations Research, Universitat Politècnica de Catalunya, Avinguda Diagonal 647, 6th Floor, 08028, Barcelona, Spain
J. Graffelman

Corresponding author

Correspondence to I. Galván-Femenía.

Editor information

Editors and Affiliations

Department of Computer Science and Applied Mathematics, University of Girona, Girona, Spain
Josep Antoni Martín-Fernández
Department of Computer Science and Applied Mathematics, University of Girona, Girona, Spain
Santiago Thió-Henestrosa

Cite this paper

Galván-Femenía, I., Graffelman, J., Barceló-i-Vidal, C. (2016). A Compositional Approach to Allele Sharing Analysis. In: Martín-Fernández, J., Thió-Henestrosa, S. (eds) Compositional Data Analysis. CoDaWork 2015. Springer Proceedings in Mathematics & Statistics, vol 187. Springer, Cham. https://doi.org/10.1007/978-3-319-44811-4_5

DOI: https://doi.org/10.1007/978-3-319-44811-4_5
Published: 20 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-44810-7
Online ISBN: 978-3-319-44811-4
eBook Packages: Mathematics and Statistics (R0)
