Abstract
Given a string w over a finite alphabet Σ and an integer K, can w be partitioned into strings of length at most K, such that there are no collisions? We refer to this question as the string partition problem and show it is NP-complete for various definitions of collision and for a number of interesting restrictions including |Σ| = 2. This establishes the hardness of an important problem in contemporary synthetic biology, namely, oligo design for gene synthesis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chlichlia, K., Schirrmacher, V., Sandaltzopoulos, R.: Cancer immunotherapy: Battling tumors with gene vaccines. Current Medicinal Chemistry - Anti-Inflammatory & Anti-Allergy Agents 4, 353–365 (2005)
Christie, D.A., Irving, R.W.: Sorting strings by reversals and by transpositions. SIAM Journal on Discrete Mathematics 14(2), 193–206 (2001)
Cid-Arregui, A., Juarez, V., Hausen, H.Z.: A Synthetic E7 Gene of Human Papillomavirus Type 16 That Yields Enhanced Expression of the Protein in Mammalian Cells and Is Useful for DNA Immunization Studies. Journal of Virology 77(8), 4928–4937 (2003)
Condon, A., Maňuch, J., Thachuk, C.: Complexity of a Collision-Aware String Partition Problem and Its Relation to Oligo Design for Gene Synthesis. In: Hu, X., Wang, J. (eds.) COCOON 2008. LNCS, vol. 5092, pp. 265–275. Springer, Heidelberg (2008)
Cox, J.C., Lape, J., Sayed, M.A., Hellinga, H.W.: Protein fabrication automation. Protein Science 16(3), 379–390 (2007)
Eriksen, N.: (1+ ε)-Approximation of sorting by reversals and transpositions. Theoretical Computer Science 289(1), 517–529 (2002)
Gibson, D., Benders, G., Andrews-Pfannkoch, C., Denisova, E., Baden-Tillson, H., Zaveri, J., Stockwell, T., Brownley, A., Thomas, D., Algire, M., et al.: Complete Chemical Synthesis, Assembly, and Cloning of a Mycoplasma genitalium Genome. Science 319(5867), 1215–1220 (2008)
Goldstein, A., Kolman, P., Zheng, J.: Minimum common string partition problem: Hardness and approximations. The Electronic Journal of Combinatorics 12(R50), 1 (2005)
Hannenhalli, S.: Polynomial algorithm for computing translocation distance between genomes. Discrete Applied Mathematics 71(1), 137–151 (1996)
Hartman, T.: A Simpler 1.5-Approximation Algorithm for Sorting by Transpositions. In: Baeza-Yates, R., Chávez, E., Crochemore, M. (eds.) CPM 2003. LNCS, vol. 2676, pp. 156–169. Springer, Heidelberg (2003)
Karp, R.M.: Mapping the genome: some combinatorial problems arising in molecular biology. In: STOC 1993: Proceedings of the Twenty-Fifth Annual ACM Symposium on Theory of Computing, pp. 278–285. ACM, New York (1993)
Kumar, D., Gustafsson, C., Klessig, D.F.: Validation of RNAi silencing specificity using synthetic genes: salicylic acid-binding protein 2 is required for innate immunity in plants. Plant J. 45(5), 863–868 (2006)
Lin, C.T., Tsai, Y.C., He, L., Calizo, R., Chou, H.H., Chang, T.C., Soong, Y.K., Hung, C.F., Lai, C.H.: A DNA vaccine encoding a codon-optimized human papillomavirus type 16 E6 gene enhances CTL response and anti-tumor activity. J. Biomed. Sci. 13(4), 481–488 (2006)
Myers, E., Sutton, G., Delcher, A., Dew, I., Fasulo, D., Flanigan, M., Kravitz, S., Mobarry, C., Reinert, K., Remington, K., et al.: A whole-genome assembly of Drosophila. Science 287(5461), 2196 (2000)
Papadimitriou, C.H.: Computational Complexity. Addison-Wesley (1994)
Pevzner, P., Tang, H., Waterman, M.: An Eulerian path approach to DNA fragment assembly. Proceedings of the National Academy of Sciences of the United States of America 98(17), 9748 (2001)
Roden, R., Wu, T.: Preventative and therapeutic vaccines for cervical cancer. Expert Review of Vaccines 2(4), 495–516 (2003)
Stemmer, W.P., Crameri, A., Ha, K.D., Brennan, T.M., Heyneker, H.L.: Single-step assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides. Gene 164(1), 49–53 (1995)
Yancopoulos, S., Attie, O., Friedberg, R.: Efficient sorting of genomic permutations by translocation, inversion and block interchange. Bioinformatics 21(16), 3340–3346 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Condon, A., Maňuch, J., Thachuk, C. (2012). The Complexity of String Partitioning. In: Kärkkäinen, J., Stoye, J. (eds) Combinatorial Pattern Matching. CPM 2012. Lecture Notes in Computer Science, vol 7354. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31265-6_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-31265-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31264-9
Online ISBN: 978-3-642-31265-6
eBook Packages: Computer ScienceComputer Science (R0)