Segmentation of DNA into Coding and Noncoding Regions Based on Inter-STOP Symbols Distances
In this study we set out to explore the potential of inter-genomic symbol distances for finding coding regions in DNA sequences. We use the distance between STOP symbols in the DNA sequence and a chi-square statistic to evaluate the nonhomogeneity of the three possible reading frames. The results of this exploratory study suggest that the inter-STOP symbols distance has a strong ability to discriminate coding regions.
Keywords: inter-STOP symbols distance, DNA, coding regions, chi-square
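The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the standard STOP codons (TAA, TAG, TGA), measures distances in codons between consecutive STOP codons within one reading frame, and compares the three frames with a Pearson chi-square homogeneity statistic over binned distances. The function names `inter_stop_distances` and `chi_square_homogeneity` and the `max_bin` pooling parameter are hypothetical choices made for this illustration.

```python
from collections import Counter

# Assumption: STOP codons of the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def inter_stop_distances(seq, frame):
    """Distances (in codons) between consecutive STOP codons in one frame.

    `frame` is the offset 0, 1 or 2 at which codon reading starts.
    """
    seq = seq.upper()
    positions = [
        i for i in range(frame, len(seq) - 2, 3)
        if seq[i:i + 3] in STOP_CODONS
    ]
    return [(b - a) // 3 for a, b in zip(positions, positions[1:])]


def chi_square_homogeneity(samples, max_bin):
    """Pearson chi-square statistic for homogeneity of distance samples.

    Each sample is a list of distances; distances >= max_bin are pooled
    into the last bin. Cells with zero expected count are skipped.
    """
    rows = []
    for sample in samples:
        counts = Counter(min(d, max_bin) for d in sample)
        rows.append([counts.get(k, 0) for k in range(1, max_bin + 1)])

    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    total = sum(row_totals)

    stat = 0.0
    for row, row_total in zip(rows, row_totals):
        for observed, col_total in zip(row, col_totals):
            expected = row_total * col_total / total if total else 0.0
            if expected > 0:
                stat += (observed - expected) ** 2 / expected
    return stat


def frame_nonhomogeneity(seq, max_bin=50):
    """Chi-square statistic comparing the three reading frames of `seq`."""
    samples = [inter_stop_distances(seq, f) for f in range(3)]
    return chi_square_homogeneity(samples, max_bin)
```

Under these assumptions, a large value of `frame_nonhomogeneity` on a stretch of sequence indicates that the inter-STOP distance distributions of the three frames differ, as would be expected when one frame carries a long open reading frame. Any decision threshold or windowing scheme would have to be chosen separately.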