Haplotype Inference for Pedigrees with Few Recombinations

  • B. KirkpatrickEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9683)


Pedigrees, or family trees, are graphs of family relationships that are used to study inheritance. A fundamental problem in computational biology is to find, for a pedigree with n individuals genotyped at every site, a set of Mendelian-consistent haplotypes that have the minimum number of recombinations. This is an \(\mathsf {NP}\)-hard problem and some pedigrees can have thousands of individuals and hundreds of thousands of sites.

This paper formulates this problem as a optimization on a graph and introduces a tailored algorithm with a running time of \(O(n^{(k+2)}m^{6k})\) for n individuals, m sites, and k recombinations. Since there are generally only 1-2 recombinations per chromosome in each meiosis, k is small enough to make this algorithm practically relevant.


Pedigrees Haplotype inference Minimum recombination haplotype configuration (MRHC) 



BK thanks M. Mnich at the Cluster of Excellence, Saarland University, Saarbrücken, Germany for critical reading of the manuscript. BK thanks arXiv for pre-print publication of the full manuscript [10].


  1. 1.
    Abecasis, G., Cherny, S., Cookson, W., Cardon, L.: Merlin-rapid analysis of dense genetic maps using sparse gene flow trees. Nat. Genet. 30, 97–101 (2002)CrossRefGoogle Scholar
  2. 2.
    Browning, S., Browning, B.: On reducing the statespace of hidden Markov models for the identity by descent process. Theoret. Popul. Biol. 62(1), 1–8 (2002)CrossRefzbMATHGoogle Scholar
  3. 3.
    Coop, G., Wen, X., Ober, C., Pritchard, J.K., Przeworski, M.: High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans. Science 319(5868), 1395–1398 (2008)CrossRefGoogle Scholar
  4. 4.
    Doan, D.D., Evans, P.A.: Fixed-parameter algorithm for haplotype inferences on general pedigrees with small number of sites. In: Moulton, V., Singh, M. (eds.) WABI 2010. LNCS, vol. 6293, pp. 124–135. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  5. 5.
    Doan, D., Evans, P.: An FPT haplotyping algorithm on pedigrees with a small number of sites. Algorithms Mol. Biol. 6, 1–8 (2011)CrossRefGoogle Scholar
  6. 6.
    Fishelson, M., Dovgolevsky, N., Geiger, D.: Maximum likelihood haplotyping for general pedigrees. Hum. Hered. 59, 41–60 (2005)CrossRefGoogle Scholar
  7. 7.
    Geiger, D., Meek, C., Wexler, Y.: Speeding up HMM algorithms for genetic linkage analysis via chain reductions of the state space. Bioinformatics 25(12), i196 (2009)CrossRefGoogle Scholar
  8. 8.
    Geiger, D., Meek, C., Wexler, Y.: Speeding up HMM algorithms for genetic linkage analysis via chain reductions of the state space. Bioinformatics 25(12), i196–i203 (2009)CrossRefGoogle Scholar
  9. 9.
    Iverson, K.E.: A Programming Language. Wiley, New York (1962)CrossRefzbMATHGoogle Scholar
  10. 10.
    Kirkpatrick, B.: Haplotype inference for pedigrees with few recombinations. arXiv 1602.04270 (2016).
  11. 11.
    Lauritzen, S.L., Sheehan, N.A.: Graphical models for genetic analysis. Stat. Sci. 18(4), 489–514 (2003)MathSciNetCrossRefzbMATHGoogle Scholar
  12. 12.
    Li, J., Jiang, T.: Computing the minimum recombinant haplotype configuration from incomplete genotype data on a pedigree by integer linear programming. J. Comput. Biol. 12(6), 719–739 (2005)CrossRefGoogle Scholar
  13. 13.
    O’Connell, J., Gurdasani, D., et al.: A general approach for haplotype phasing across the full spectrum of relatedness. PLoS Genet 10(4), e1004234 (2014)CrossRefGoogle Scholar
  14. 14.
    Pirola, Y., Bonizzoni, P., Jiang, T.: An efficient algorithm for haplotype inference on pedigrees with recombinations and mutations. IEEE/ACM Trans. Comput. Biol. Bioinform. 9(1), 12–25 (2012)CrossRefGoogle Scholar
  15. 15.
    Risch, N., Merikangas, K.: The future of genetic studies of complex human diseases. Science 273(5281), 1516–1517 (1996)CrossRefGoogle Scholar
  16. 16.
    Sobel, E., Lange, K.: Descent graphs in pedigree analysis: applications to haplotyping, location scores, and marker-sharing statistics. Am. J. Hum. Genet. 58(6), 1323–1337 (1996)Google Scholar
  17. 17.
    Steel, M., Hein, J.: Reconstructing pedigrees: a combinatorial perspective. J. Theoret. Biol. 240(3), 360–367 (2006)MathSciNetCrossRefGoogle Scholar
  18. 18.
    Thornton, T., McPeek, M.: Case-control association testing with related individuals: a more powerful quasi-likelihood score test. Am. J. Hum. Genet. 81, 321–337 (2007)CrossRefGoogle Scholar
  19. 19.
    Wang, W.B., Jiang, T.: Inferring haplotypes from genotypes on a pedigree with mutations, genotyping errors and missing alleles. J. Bioinform. Comput. Biol. 9, 339–365 (2011)CrossRefGoogle Scholar
  20. 20.
    Xiao, J., Lou, T., Jiang, T.: An efficient algorithm for haplotype inference on pedigrees with a small number of recombinants. Algorithmica 62(3), 951–981 (2012)MathSciNetCrossRefzbMATHGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  1. 1.Intrepid Net ComputingDillonUSA

Personalised recommendations