Complexity of Approximating Closest Substring Problems
The closest substring problem, in which a short string is sought that minimizes the number of mismatches between it and each of a given set of strings, is a minimization problem with a polynomial-time approximation scheme. In this paper, both this problem and its maximization complement, in which the number of matches is maximized instead, are examined, and bounds on their hardness of approximation are proved. Related problems that differ only in their objective functions, seeking to maximize either the number of strings covered by the substring or the length of the substring, are also examined, and bounds on their approximability are proved. For the last of these problems, length maximization, the approximation bound of 2 is proved to be tight by presenting a 2-approximation algorithm.
Keywords: Approximation algorithms, Hardness of approximation, Closest Substring
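To make the minimization objective concrete, the following is a minimal brute-force sketch and not an algorithm from the paper. It assumes the usual formulation in which a candidate of a fixed length is scored by its worst-case Hamming distance to the best-matching window of each input string. The names `hamming`, `distance_to_string`, and `closest_substring` are hypothetical, and the exhaustive search over all candidates is exponential in the candidate length, so it is useful only for tiny instances.

```python
from itertools import product


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def distance_to_string(candidate, text):
    """Fewest mismatches between candidate and any same-length window of text."""
    k = len(candidate)
    return min(hamming(candidate, text[i:i + k]) for i in range(len(text) - k + 1))


def closest_substring(strings, length, alphabet=None):
    """Exhaustively find a string of the given length whose worst-case
    distance to the input strings is smallest (assumed min-max objective).

    Returns (candidate, radius); ties go to the lexicographically first candidate.
    """
    if length < 1 or any(len(s) < length for s in strings):
        raise ValueError("length must be between 1 and the shortest input length")
    if alphabet is None:
        # Assumption: only symbols that occur in the inputs need to be tried.
        alphabet = sorted(set("".join(strings)))
    best, best_radius = None, None
    for chars in product(alphabet, repeat=length):
        candidate = "".join(chars)
        radius = max(distance_to_string(candidate, s) for s in strings)
        if best_radius is None or radius < best_radius:
            best, best_radius = candidate, radius
    return best, best_radius


if __name__ == "__main__":
    print(closest_substring(["abcab", "bbcaa", "cabca"], 3))
```

Under this formulation, the maximization complement mentioned in the abstract corresponds to maximizing `length - radius`, the number of matching positions. The two objectives share the same optimal solutions but can behave differently under approximation.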