International Workshop on Parameterized and Exact Computation

IWPEC 2004: Parameterized and Exact Computation pp 174-186

Perfect Path Phylogeny Haplotyping with Missing Data Is Fixed-Parameter Tractable

  • Jens Gramm
  • Till Nierhoff
  • Till Tantau
Conference paper

DOI: 10.1007/978-3-540-28639-4_16

Volume 3162 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Gramm J., Nierhoff T., Tantau T. (2004) Perfect Path Phylogeny Haplotyping with Missing Data Is Fixed-Parameter Tractable. In: Downey R., Fellows M., Dehne F. (eds) Parameterized and Exact Computation. IWPEC 2004. Lecture Notes in Computer Science, vol 3162. Springer, Berlin, Heidelberg

Abstract

Haplotyping via perfect phylogeny is a method for retrieving haplotypes from genotypes. Fast algorithms are known for computing perfect phylogenies from complete and error-free input instances—these instances can be organized as a genotype matrix whose rows are the genotypes and whose columns are the single nucleotide polymorphisms under consideration. Unfortunately, in the more realistic setting of missing entries in the genotype matrix, even restricted forms of the perfect phylogeny haplotyping problem become NP-hard. We show that haplotyping via perfect phylogeny with missing data becomes computationally tractable when imposing additional biologically motivated constraints. Firstly, we focus on asking for perfect phylogenies that are paths, which is motivated by the discovery that yin-yang haplotypes span large parts of the human genome. A yin-yang haplotype implies that every corresponding perfect phylogeny has to be a path. Secondly, we assume that the number of missing entries in every column of the input genotype matrix is bounded. We show that the perfect path phylogeny haplotyping problem is fixed-parameter tractable when we consider the maximum number of missing entries per column of the genotype matrix as parameter. The restrictions we impose are met by a majority of the problem instances encountered in publicly available human genome data.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Jens Gramm
    • 1
  • Till Nierhoff
    • 1
  • Till Tantau
    • 1
  1. 1.International Computer Science InstituteBerkeleyUSA