Correlogram-Based Method for Comparing Biological Sequences

  • Debasis Mitra
  • Gandhali Samant
  • Kuntal Sengupta
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4031)


In this article we have proposed an abstract representation for a sequence using a constant sized 3D matrix. Subsequently the representation may be utilized for many analytical purposes. We have attempted to use it for comparing sequences, and analyzed the method’s asymptotic complexity. Providing a metric for sequence comparison is an underlying operation to many bioinformatics applications. In order to show the effectiveness of the proposed sequence comparison technique we have generated some phylogeny over two sets of bio-sequences and compared them with the ones available in literature. The results prove that our technique is comparable to the standard ones. The technique, called the correlogram-based method, is borrowed from the image analysis area. We have also done some experiments with synthetically generated sequences in order to compare correlogram-based method with the well-known dynamic programming method. Finally, we have discussed some other possibilities on how our method can be used or extended.


Dynamic Program Biological Sequence Dynamic Program Method Phylogeny Tree Equine Influenza Virus 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. Journal of Molecular. Biology 215, 403–410 (1990)Google Scholar
  2. Bertorelle, G., Barbujanit, G.: Analysis of DNA diversity by spatial auto correlation. Genetics 140(2), 811–819 (1995)Google Scholar
  3. Chapman, M.S., Rossmann, M.G.: Structure, Sequence and Function Correlations among Parvoviruses. Viriology 194(2), 491–508 (1993)CrossRefGoogle Scholar
  4. Huang, J., Mitra, M., Zhu, W.J., Zabih, R.: Image Indexing using color correlograms. International Journal of Computer Vision 35(3), 245–268 (1999)CrossRefGoogle Scholar
  5. Lai, A.C.K., Rogers, K.M., Glaser, A., Tudor, L., Chambers, T.: Alternate circulation of recent equine-2 influenza viruses (H3N8) from two distinct lineages in the United States. Virus Res. 100(2), 159–164 (2004)CrossRefGoogle Scholar
  6. Macchiato, M.F., Cuomo, V., Tramontano, A.: Determination of the autocorrelation orders of proteins. Genetics 140, 811–819 (1995)Google Scholar
  7. Rosenberg, M.S., Subramanian, S., Kumar, S.: Patterns of Transitional Mutation Biases Within and Among Mammalian Genomes. Mol. Biol. Evol. 20(6), 988–993 (2003)CrossRefGoogle Scholar
  8. Samant, G., Mitra, D.: Correlogram method for Comparing Bio-Sequences, Florida Institute of Technology Technical Report No. CS-2006-01 (2005),
  9. Smith, T.F., Waterman, M.S.: Identification of common molecular sequences. Journal of Molecular Biology 147, 195–197 (1981)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Debasis Mitra
    • 1
  • Gandhali Samant
    • 1
  • Kuntal Sengupta
    • 2
  1. 1.Department of Computer SciencesFlorida Institute of TechnologyMelbourneUSA
  2. 2.Authentec CorporationMelbourneUSA

Personalised recommendations