Coding Region Prediction in Genomic Sequences Using a Combination of Digital Signal Processing Approaches

  • Aníbal Rodríguez Fuentes
  • Juan V. Lorenzo Ginori
  • Ricardo Grau Ábalo
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4756)


Identifying protein coding regions in DNA sequences is a basic step in locating genes. Several approaches based on signal processing tools have been applied to this problem in an effort to achieve more accurate predictions. This paper presents a new predictor that improves on the efficacy of three existing predictors that use the Fourier Transform to predict coding regions, and that can be computed using an algorithm that reduces the computational load. ROC curves are used to demonstrate the efficacy of the proposed predictor, based on results computed for 25 DNA sequences from three different organisms.
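For readers unfamiliar with Fourier-based coding region prediction, the following is a minimal sketch of the classical period-3 spectral measure on which such predictors are commonly built. It is not the predictor proposed in this paper, and it does not include the load-reducing algorithm mentioned above. The function names `period3_power` and `sliding_period3`, the default window length of 351 bases, and the step of 3 bases are illustrative assumptions, not values taken from the paper.

```python
import cmath
from typing import List


def period3_power(window: str) -> float:
    """Spectral power at frequency 1/3 for one window of DNA.

    Each base (A, C, G, T) is mapped to a binary indicator sequence, the
    DFT coefficient at k = N/3 is computed for each one, and the squared
    magnitudes are summed. This is a generic measure, assumed here for
    illustration only.
    """
    total = 0.0
    for base in "ACGT":
        # DFT coefficient at k = N/3: sum of exp(-j*2*pi*i/3) over the
        # positions i where this base occurs.
        coeff = sum(
            cmath.exp(-2j * cmath.pi * i / 3)
            for i, ch in enumerate(window)
            if ch == base
        )
        total += abs(coeff) ** 2
    return total


def sliding_period3(seq: str, window: int = 351, step: int = 3) -> List[float]:
    """Evaluate period3_power on windows slid along the sequence.

    The window and step defaults are hypothetical choices; peaks in the
    returned profile suggest (but do not prove) coding regions.
    """
    seq = seq.upper()
    return [
        period3_power(seq[start:start + window])
        for start in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    periodic = "ATG" * 50      # strong period-3 structure
    uniform = "A" * 150        # no period-3 structure
    print(period3_power(periodic), period3_power(uniform))
```

In this sketch a strictly repeating codon gives a large value, while a homopolymer whose length is a multiple of three gives a value near zero. A threshold on the sliding profile would then turn the measure into a binary coding/non-coding decision, and sweeping that threshold is what produces a ROC curve.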


Bioinformatics; Digital Signal Processing; Fourier Transform; Coding region prediction; Computational load reduction



Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Aníbal Rodríguez Fuentes (1)
  • Juan V. Lorenzo Ginori (1)
  • Ricardo Grau Ábalo (2)
  1. Center for Studies on Electronics and Information Technologies
  2. Center for Studies on Informatics, Universidad Central “Marta Abreu” de Las Villas, Carretera a Camajuaní, km 5 1/2, Santa Clara, VC, CP 54830, Cuba
