Skip to main content

Starting from Scratch: Growing Longest Common Subsequences with Evolution

  • Conference paper
Parallel Problem Solving from Nature - PPSN IX (PPSN 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4193))

Included in the following conference series:

Abstract

An evolutionary algorithm (EA) usually initializes its population with random genotypes, which represent random solutions to the target problem instance. If the problem is one of constrained optimization, an initial population whose genotypes all represent empty solutions might allow the EA to grow valid solutions as much as search for them and thereby identify good solutions more quickly. This is the case in a genetic algorithm (GA) for the longest common subsequence problem, which seeks the length of a longest subsequence common to each of a set of given strings. The GA encodes sequences as binary strings that indicate subsequences of the shortest or first given string. In tests on a variety of problem instances, the GA always identifies an optimum subsequence, but on most instances, the GA reaches an optimum more quickly when its initial population encodes empty sequences than when its initial genotypes represent random sequences.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hinkemeyer, B., Julstrom, B.A.: A genetic algorithm for the longest common subsequence problem. In: Proceedings of the 2006 Genetic and Evolutionary Computation Conference, The ACM Press, New York (2006)

    Google Scholar 

  2. Maier, D.: The complexity of some problems on subsequences and supersequences. Journal of the ACM 25, 322–336 (1978)

    Article  MathSciNet  Google Scholar 

  3. Irving, R., Fraser, C.: Two algorithms for the longest common subsequence of three (or more) strings. In: Proceedings of the 3rd Annual Symposium on Combinatorial Pattern Matching, pp. 214–229. Springer, New York (1992)

    Chapter  Google Scholar 

  4. Fogel, G., Chellapilla, K., Fogel, D.: Identification of coding regions in DNA sequences using evolved neural networks. In: Evolutionary Computation in Bioinformatics, Morgan Kaufmann, San Francisco (2003)

    MATH  Google Scholar 

  5. Forrest, S., Javornik, B., Smith, R., Perelson, A.: Using genetic algorithms to explore pattern recognition in the immune system. Evolutionary Computation 1, 191–211 (1993)

    Article  Google Scholar 

  6. Pei, M., Goodman, E.D., Punch, W.F.: Pattern discovery from data using genetic algorithms. In: Proceedings of 1st Pacific-Asia Conference on Knowledge Discovery & Data Mining (PAKDD 1997) (1997)

    Google Scholar 

  7. Shankar, P.: V Diff - A file comparer with visual output (2003) (Retrieved October 25, 2004), On-line at: http://www.codeproject.com/cpp/vdiff.asp?df=100&forumid=14638&exp=0&select=825191

  8. Clough, P.: Plagiarism in natural and programming languages: an overview of current tools and technologies (2000) (Retrieved October 13, 2004), On-line at: http://www.dcs.shef.ac.uk/~cloughie/papers/Plagiarism.pdf

  9. Grant, T., Baker, K.: Identifying reliable, valid markers of authorship: A response to Chaski. Forensic Linguistics 8, 66–79 (2001)

    Google Scholar 

  10. Lovis, C., Baud, R.H.: Fast exact string pattern-matching algorithms adapted to the characteristics of the medical language. Journal of the American Medical Informatics Association 7, 378–391 (2000)

    Article  Google Scholar 

  11. Losee, R.: Learning syntactic rules and tags with genetic algorithms. Information Processing & Management 32, 185–197 (1996)

    Article  Google Scholar 

  12. Louis, S.: Learning to improve genetic algorithm performance on job shop scheduling (2002) (Retrieved October 21, 2004), On-line at: http://www.cs.unr.edu/~sushil/papers/conference/newpapers/2002/gecco2002/cigarScheduling/cigarScheduling/cigarScheduling.html

  13. Navarro, G., Raffinot, M.: Flexible Pattern Matching in Strings. Cambridge University Press, New York (2002)

    Book  Google Scholar 

  14. Bergroth, L., Hakonen, H., Raita, T.: A survey of longest common subsequence algorithms. In: Proceedings of the Seventh International Symposium on String Processing and Information Retrieval (2000)

    Google Scholar 

  15. Koza, J.R.: Genetic Programming: On the programming of computers by means of natural selection. The MIT Press, Cambridge (1992)

    MATH  Google Scholar 

  16. Banzhaf, W., Nordin, P., Keller, R.E., Francone, F.D.: Genetic programming: An introduction. Morgan Kaufmann, San Francisco (1998)

    Book  Google Scholar 

  17. Koza, J.R., Bennett, F.H., Andre, D., Keane, M.A.: Evolution using genetic programming of a low-distortion 96 decibel operational amplifier. In: Bryant, B., Carroll, J., Oppenheim, D., Hightower, J., George, K.M. (eds.) Applied Computing 1997. Proceedings of the 1997 ACM Symposium on Applied Computing, pp. 207–216. ACM Press, New York (1997)

    Google Scholar 

  18. Koza, J.R., Bennett, F.H., Andre, D., Keane, M.A., Dunlap, F.: Automated systhesis of analog electrical circuits by means of genetic programming. IEEE Transactions on Evolutionary Computation 1, 109–128 (1997)

    Article  Google Scholar 

  19. Koza, J.R., Bennett, F.H., Andre, D., Keane, M.A.: Genetic Programming III: Darwinian Invention and Problem Solving. Morgan Kaufmann, San Francisco (1999)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Julstrom, B.A., Hinkemeyer, B. (2006). Starting from Scratch: Growing Longest Common Subsequences with Evolution. In: Runarsson, T.P., Beyer, HG., Burke, E., Merelo-Guervós, J.J., Whitley, L.D., Yao, X. (eds) Parallel Problem Solving from Nature - PPSN IX. PPSN 2006. Lecture Notes in Computer Science, vol 4193. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11844297_94

Download citation

  • DOI: https://doi.org/10.1007/11844297_94

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-38990-3

  • Online ISBN: 978-3-540-38991-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics