Skip to main content

9.1 9.1 Introduction

Analysis of the DNA sequences of genes is important and necessary in the study of genetics, yet we still can not comprehend the whole meanings of DNA sequences. Many methods can be applied to their analysis, and the artificial neural network is one of the most frequent methods used to obtain the characteristic features of DNA sequences. Here we show how, though trial application, artificial neural networks acquire the features of splicing sites of DNA sequences.

When a protein is produced from a DNA chain, the sequence of the DNA is transcribed to the messenger RNA (mRNA). Then transfer RNA (tRNA) binds to the codons of the mRNA. Finally, the amino acids are polymerized to the protein. Not all the codons in the corresponding portion of DNA sequence are always translated to amino acids, however. After transcription to premature mRNA, the splicing process removes the noncoding sequences, called introns, and splices together with the remaining parts of premature mRNA,...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  • R. Breathnach and P. Chambon. Organization and expression of eucaryotic split genes coding for proteins. Annu. Rev. Biochem. 50: 349–383, 1981.

    Article  Google Scholar 

  • K. L. Denninghoff and R. W. Gatterdam. On the undecidability of splincing systems. Intern. J. Computer Math. 27: 133–145, 1989.

    Article  Google Scholar 

  • R. Carhart, J. Moore and A. Engelberg. Strategene, an expert assistant for genetic engineering research. Proceedings of the Third Annual Artificial Intelligence and Advanced Computer Technology Conference, 166–173, 1987.

    Google Scholar 

  • K. Culik and T. Harju. Splicing semigroups of dominoes and DNA. Discrete Appl. Math. 31: 261–267, 1991.

    Article  MathSciNet  MATH  Google Scholar 

  • A. Lapedes, C. Barnes, C. Burks, R. Farber and K. Sirotkin. SFI Studies in the Sciences of Complexity. Computer and DNA 7: 157–182, 1989.

    Google Scholar 

  • E. C. Uberbacher and R. J. Mural. Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach. Proc. Natl. Acad. Sci. USA 88: 11261–11265, 1991.

    Article  Google Scholar 

  • H. Furutani, K. Yamamoto, Y. Kitazoe and H. Ogura. Analysis of thalassemia betaglobin gene by neural network: prediction of abnormal splice. MEDINFO 89. Preceeding of the Sixth Conference on Medical Informatics, pp. 96–100, 1989.

    Google Scholar 

  • H. Ogura, H. Agata, M. Xie, T. Odaka and H. Furutani. A study of learning splice sites of DNA sequence by neural networks. Comput. Biol. Med. 27: 67–75, 1997.

    Article  Google Scholar 

  • T. Odaka, H. Agata, H. Furutani and H. Ogura. A general purpose neural network simulator system for medical data processing. J. Med. Sys. 18: 305–314, 1994.

    Article  Google Scholar 

  • F. Giannelli et al. Haemophilxia b: database of point mutations and short additions and deletions-second edition. Nucleic Acids Res. 19: 2193–2219, 1991.

    Google Scholar 

  • J. D. Hirst and M. J. E. Sternberg. Prediction of structural and functional features of protein and nucleic acid sequences by artificial neural networks. Biochemistry 31: 7211–7218, 1992.

    Article  Google Scholar 

  • M. E. Mulligan and W. R. McClure. Analysis of the occurence of promoter-sites in DNA. Nucleic Acids Res. 14: 109–126, 1986.

    Article  Google Scholar 

  • M. C. O’Neill. Training back-propagation neural networks to define and detect DNA-binding sites. Nucleic Acids Res. 19: 313–318, 1991.

    Article  Google Scholar 

  • M. C. O’Neill. Escherichia coli promoters: neural networks develop distinct descriptions in learning to search for promoters of different spacing classes. Nucleic Acids Res. 20: 3471–3477, 1992.

    Article  MathSciNet  Google Scholar 

  • I. Mahadevan and I. Ghosh. Analysis of E. coli promoter structures using neural networks. Nucleic Acids Res. 22: 2158–2165, 1994.

    Article  Google Scholar 

  • N. I. Larsen, J. Engelbrecht and S. Brunak. Analysis of eukaryotic promoter sequences reveals a systematically occurring CT-signal. Nucleic Acids Res. 23: 1223–1230, 1995.

    Article  Google Scholar 

  • P. Bucher. The Eukaryotic Promoter Database EPD EMBL Nucleotide Sequence Data Library, 1992.

    Google Scholar 

  • N. Qian and T. J. Sejnowski. Predicting the secondary structure of globular proteins using neural network models. J. Molec. Biol. 202: 865–884, 1988.

    Article  Google Scholar 

  • L. H. Holley and M. Karplus. Protein secondary structure prediction with a neural network. Proc. Nat. Acad. Sci. USA 86: 768–774, 1989.

    Article  Google Scholar 

  • J. M. Chandonia and M. Karplus. Neural networks for secondary structure and structural class predictions. Protein Sci. 4: 275–285, 1995.

    Article  Google Scholar 

  • J. M. Chandonia and M. Karplus. The importance of larger data sets for protein secondary structure prediction with neural networks. Protein Sci. 5: 768–774, 1996.

    Article  Google Scholar 

  • B. Rost and C. Sander. Improved prediction of protein secondary structure by use of sequence profiles and neural networks. Proc. Natl. Acad. Sci. USA 90: 7558–7562, 1993.

    Article  Google Scholar 

Download references

Authors

Editor information

Cornelius T. Leondes

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Kluwer Academic Publishers

About this entry

Cite this entry

Leondes, C.T. (2003). An Application of Artificial Neural Networks to DNA Sequence Analysis. In: Leondes, C.T. (eds) Computational Methods in Biophysics, Biomaterials, Biotechnology and Medical Systems. Springer, Boston, MA. https://doi.org/10.1007/0-306-48329-7_9

Download citation

  • DOI: https://doi.org/10.1007/0-306-48329-7_9

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4020-7110-2

  • Online ISBN: 978-0-306-48329-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics