Haplotype Inference on Pedigrees with Recombinations and Mutations
Haplotype Inference (HI) is a computational challenge of crucial importance in a range of genetic studies, such as functional genomics, pharmacogenetics and population genetics. Pedigrees have been shown a valuable data that allows us to infer haplotypes from genotypes more accurately than population data, since Mendelian inheritance restricts the set of possible solutions. In order to overcome the limitations of classic statistical haplotyping methods, a combinatorial formulation of the HI problem on pedigrees has been proposed in the literature, called Minimum-Recombinant Haplotype Configuration (MRHC) problem, that allows a single type of genetic variation events, namely recombinations. In this work, we define a new problem, called Minimum-Change Haplotype Configuration (MRHC), that extends the MRHC formulation by allowing also a second type of natural variation events: mutations. We propose an efficient and accurate heuristic algorithm for MRHC based on an L-reduction to a well-known coding problem. Our heuristic can also be used to solve the original MRHC problem and it can take advantage of additional knowledge about the input genotypes, such as the presence of recombination hotspots and different rates of recombinations and mutations. Finally, we present an extensive experimental evaluation and comparison of our heuristic algorithm with several other state-of-the-art methods for HI on pedigrees under several simulated scenarios.
Unable to display preview. Download preview PDF.
- 6.Gallager, R.G.: Low-Density Parity-Check Codes. MIT Press, Cambridge (1963)Google Scholar
- 11.Pearl, J.: Reverend Bayes on inference engines: A distributed hierarchical approach. In: Proc. of the American Ass. of Artificial Intelligence National Conference on AI, Pittsburgh, PA, pp. 133–136 (1982)Google Scholar
- 13.Sobel, E., Lange, K.: Descent graphs in pedigree analysis: applications to haplotyping, location scores, and marker-sharing statistics. American J. of Human Genetics 58(6), 1323–1337 (1996)Google Scholar
- 14.The International HapMap Consortium: A second generation human haplotype map of over 3.1 million SNPs. Nature 449(7164), 851–861 (2007)Google Scholar