On the Inference of Ancestries in Admixed Populations
Inference of ancestral information in recently admixed populations, in which every individual is composed of a mixed ancestry (e.g., African Americans in the US), is a challenging problem. Several previous model-based approaches have used hidden Markov models (HMM) to model the problem, however, the Markov Chain Monte Carlo (MCMC) algorithms underlying these models converge slowly on realistic datasets. While retaining the HMM as a model, we show that a combination of an accurate fast initialization and a local hill-climb in likelihood results in significantly improved estimates of ancestry. We studied this approach in two scenarios—the inference of locus-specific ancestries in a population that is assumed to originate from two unknown ancestral populations, and the inference of allele frequencies in one ancestral population given those in another.
KeywordsHide Markov Model Markov Chain Monte Carlo Ancestral Population Inference Algorithm Markov Chain Monte Carlo Algorithm
Unable to display preview. Download preview PDF.
- 3.Pritchard, J., Stephens, M., Donnelly, P.: Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000)Google Scholar
- 4.Falush, D., Stephens, M., Pritchard, J.K.: Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164, 1567–1587 (2003)Google Scholar
- 8.Sankararaman, S., Sridhar, S., Kimmel, G., Halperin, E.: Estimating local ancestry in admixed populations. American Journal of Human Genetics (to appear)Google Scholar
- 9.Huang, X., Acero, A., Hon, H.-W.: Spoken Language Processing. Prentice-Hall, Upper Saddle River (2001)Google Scholar
- 12.Nachman, M., Crowell, S.: Estimate of the mutation rate per nucleotide in humans. Genetics 156, 297–304 (2000)Google Scholar