, Volume 23, Issue 1, pp 111129
First online:
Bayesian spatial modeling of genetic population structure
 Jukka CoranderAffiliated withDepartment of Mathematics and Statistics, University of Helsinki Email author
 , Jukka SirénAffiliated withDepartment of Mathematics and Statistics, University of Helsinki
 , Elja ArjasAffiliated withDepartment of Mathematics and Statistics, University of Helsinki
Rent the article at a discount
Rent now* Final gross prices may vary according to local VAT.
Get AccessAbstract
Natural populations of living organisms often have complex histories consisting of phases of expansion and decline, and the migratory patterns within them may fluctuate over space and time. When parts of a population become relatively isolated, e.g., due to geographical barriers, stochastic forces reshape certain DNA characteristics of the individuals over generations such that they reflect the restricted migration and mating/reproduction patterns. Such populations are typically termed as genetically structured and they may be statistically represented in terms of several clusters between which DNA variations differ clearly from each other. When detailed knowledge of the ancestry of a natural population is lacking, the DNA characteristics of a sample of current generation individuals often provide a wealth of information in this respect. Several statistical approaches to modelbased clustering of such data have been introduced, and in particular, the Bayesian approach to modeling the genetic structure of a population has attained a vivid interest among biologists. However, the possibility of utilizing spatial information from sampled individuals in the inference about genetic clusters has been incorporated into such analyses only very recently. While the standard Bayesian hierarchical modeling techniques through Markov chain Monte Carlo simulation provide flexible means for describing even subtle patterns in data, they may also result in computationally challenging procedures in practical data analysis. Here we develop a method for modeling the spatial genetic structure using a combination of analytical and stochastic methods. We achieve this by extending a novel theory of Bayesian predictive classification with the spatial information available, described here in terms of a colored Voronoi tessellation over the sample domain. Our results for real and simulated data sets illustrate well the benefits of incorporating spatial information to such an analysis.
Keywords
Bayesian inference Genetic structure Spatial modeling Statistical learning theory Unsupervised classification Title
 Bayesian spatial modeling of genetic population structure
 Journal

Computational Statistics
Volume 23, Issue 1 , pp 111129
 Cover Date
 200801
 DOI
 10.1007/s001800070072x
 Print ISSN
 09434062
 Online ISSN
 16139658
 Publisher
 SpringerVerlag
 Additional Links
 Topics
 Keywords

 Bayesian inference
 Genetic structure
 Spatial modeling
 Statistical learning theory
 Unsupervised classification
 Industry Sectors
 Authors

 Jukka Corander ^{(1)}
 Jukka Sirén ^{(1)}
 Elja Arjas ^{(1)}
 Author Affiliations

 1. Department of Mathematics and Statistics, University of Helsinki, P. O. Box 68, 00014, Helsinki, Finland