Neural Network Tree for Identification of Splice Junction and Protein Coding Region in DNA

  • Pradipta Maji
  • Sushmita Paul


This chapter presents the design of a hybrid learning model, termed as neural network tree (NNTree), for identification of splice-junction and protein coding region in DNA sequences. It incorporates the advantages of both decision tree and neural network. An NNTree is a decision tree, where each nonterminal node contains a neural network. The versatility of this method is illustrated through its application in splice-junction and gene identification problems. Extensive experimental results establish that the NNTree produces more accurate classifier than that have previously been obtained for a range of different sequence lengths; thereby indicating a cost-effective alternative in splice-junction and protein coding region identification problems. .


Classification Accuracy Leaf Node Intermediate Node Terminal Node Current Node 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Blaisdell BE (1983) A prevalent persistent global nonrandomness that distinguishes coding and non-coding eucaryotic nuclear dna sequence. J Mol Evol 19(2):122–133CrossRefMathSciNetGoogle Scholar
  2. 2.
    Breathnach RJ, Mandel JL, Chambon P (1977) Ovalbumin gene is split in chicken DNA. Nature 270:314–319CrossRefGoogle Scholar
  3. 3.
    Cheeseman P, Stutz J (1996) Bayesian classification (AutoClass): theory and results. In: Fayyad UM, Piatetsky-Shapiro G, Smith P, Uthurusamy R (eds) Advances in knowledge discovery and data mining. AAAI/MIT Press, Cambridge, pp 153–180Google Scholar
  4. 4.
    Farber R, Lapedes A, Sirotkin K (1992) Determination of eucaryotic protein coding regions using neural networks and information theory. J Mol Biol 226(2):471–479CrossRefGoogle Scholar
  5. 5.
    Fickett J (1982) Recognition of protein coding regions in DNA sequences. Nucleic Acids Res 10(17):5303–5318CrossRefGoogle Scholar
  6. 6.
    Fickett J, Tung CS (1992) Assessment of protein coding measures. Nucleic Acids Res 20(24):6441–6450CrossRefGoogle Scholar
  7. 7.
    Guo H, Gelfand SB (1992) Classification trees with neural network feature extraction. IEEE Trans Neural Networks 3(6):923–933CrossRefGoogle Scholar
  8. 8.
    Hertz J, Krogh A, Palmer RG (1991) Introduction to the theory of neural computation. Addison Wesley, Santa Fe institute studies in the sciences of complexityGoogle Scholar
  9. 9.
    Koza JR (1994) Genetic programming-II: automatic discovery of reusable programs. MIT Press, Cambridge, ISBN 0262111896Google Scholar
  10. 10.
    Lippmann R (1987) An introduction to computing with neural nets. IEEE Acoust Speech Signal Process Mag 4(2):4–22Google Scholar
  11. 11.
    Maji P (2008) Efficient design of neural network tree using a new splitting criterion. Neurocomputing 71(4–6):787–800CrossRefGoogle Scholar
  12. 12.
    Maji P, Das C (2008) Pattern classification using NNtree: design and application for biological data set. J Intell Syst 17(1–3):51–71Google Scholar
  13. 13.
    Maji P, Shaw C, Ganguly N, Sikdar BK, Chaudhuri PP (2003) Theory and application of cellular automata for pattern classification. Fundamenta Informaticae 58:321–354MATHMathSciNetGoogle Scholar
  14. 14.
    Michie D, Spiegelhalter DJ, Taylor CC (1994) Machine learning, neural and statistical classification. Ellis Horwood, ChichesterGoogle Scholar
  15. 15.
    Murty SK, Kasif S, Salzberg S (1994) A system for identification of oblique decision trees. J Artif Intell Res 2(1):1–32Google Scholar
  16. 16.
    Murty SK, Kasif S, Salzberg S, Beigel R (1993) OC1: randomized induction of oblique decision trees. In: Proceedings of the 11th national conference on artificial intelligence, AAAI/MIT Press, pp 322–327Google Scholar
  17. 17.
    Qian N, Sejnowski TJ (1988) Predicting the secondary structure of globular proteins using neural network models. J Mol Biol 202(4):865–884CrossRefGoogle Scholar
  18. 18.
    Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann, San FransiscoGoogle Scholar
  19. 19.
    Schmitz GP, Aldrich C, Gouws FS (1999) ANN-DT: an algorithm for extraction of decision trees from artificial neural networks. IEEE Trans Neural Networks 10(6):1392–1401CrossRefGoogle Scholar
  20. 20.
    Sethi IK (1990) Entropy nets: from decision trees to neural networks. Proc IEEE 78(10):1605–1613CrossRefGoogle Scholar
  21. 21.
    Sethi IK, Yoo JH (1997) Structure-driven induction of decision tree classifiers through neural learning. Pattern Recogn 30(11):1893–1904CrossRefGoogle Scholar
  22. 22.
    Song HH, Lee SW (1998) A self-organizing neural network tree for large-set pattern classification. IEEE Trans Neural Networks 9(6):369–380CrossRefGoogle Scholar
  23. 23.
    Tay ALP, Zurada JM, Wong LP, Xu J (2007) The hierarchical fast learning artificial neural network (hieflann): an autonomous platform for hierarchical neural network construction. IEEE Trans Neural Networks 18(6):1645–1657CrossRefGoogle Scholar
  24. 24.
    Tsukimoto H (2000) Extracting rules from trained neural networks. IEEE Trans Neural Networks 11(2):377–389CrossRefGoogle Scholar
  25. 25.
    Uberbacher E, Mural R (1991) Locating protein-coding regions in human dna sequences by a multiple sensor-neural network approach. Proc Nat Acad Sci USA 88(24):11,261–11,265Google Scholar
  26. 26.
    Wilamowski BM, Yu H (2010) Neural network learning without backpropagation. IEEE Trans Neural Networks 21(11):1793–1803CrossRefGoogle Scholar
  27. 27.
    Zhao QF (2000) Neural network tree: integration of symbolic and nonsymbolic approaches. Technical Report of IEICE Google Scholar
  28. 28.
    Zhao QF (2001) Evolutionary design of neural network tree-integration of decision tree, neural network and GA. In: Proceedings of the IEEE congress on evolutionary computation, pp. 240–244Google Scholar
  29. 29.
    Zhao QF (2001) Training and retraining of neural network trees. In: Proceedings of the INNS IEEE international joint conference on neural networks, pp. 726–731Google Scholar
  30. 30.
    Zhou ZH, Chen ZQ (2002) Hybrid decision tree. Knowl-Based Syst 15(8):515–528CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  1. 1.Indian Statistical InstituteKolkataIndia

Personalised recommendations