On the Inference of Ancestries in Admixed Populations
- Cite this paper as:
- Sankararaman S., Kimmel G., Halperin E., Jordan M.I. (2008) On the Inference of Ancestries in Admixed Populations. In: Vingron M., Wong L. (eds) Research in Computational Molecular Biology. RECOMB 2008. Lecture Notes in Computer Science, vol 4955. Springer, Berlin, Heidelberg
Inference of ancestral information in recently admixed populations, in which every individual is composed of a mixed ancestry (e.g., African Americans in the US), is a challenging problem. Several previous model-based approaches have used hidden Markov models (HMM) to model the problem, however, the Markov Chain Monte Carlo (MCMC) algorithms underlying these models converge slowly on realistic datasets. While retaining the HMM as a model, we show that a combination of an accurate fast initialization and a local hill-climb in likelihood results in significantly improved estimates of ancestry. We studied this approach in two scenarios—the inference of locus-specific ancestries in a population that is assumed to originate from two unknown ancestral populations, and the inference of allele frequencies in one ancestral population given those in another.
Unable to display preview. Download preview PDF.