Abstract
Parallel Cascade Identification (PCI) has been used successfully to build dynamic nonlinear systems that address diverse challenges in bioinformatics. PCI may be used to identify either single-input single-output (SISO) or multi-input single-output (MISO) models. Although SISO PCI models have typically sufficed, it has been suggested that MISO PCI systems could also be used to form bioinformatics classifiers, and they were successfully applied in one study. This paper reports the first systematic comparison of MISO and SISO PCI classifiers. We give the motivation for using the MISO structure and briefly review the construction of MISO parallel cascade models. To compare the accuracy of SISO and MISO PCI classifiers, genetic algorithms are applied to optimize the model architecture on a number of equivalent single-input and multi-input biological training datasets. By evaluating both model structures on independent test datasets, we establish that MISO PCI can build classifiers as accurate as those resulting from SISO PCI models. Moreover, we discuss and illustrate the benefits of the MISO approach, including a significant reduction in training and testing times and the ability to automatically adjust the weighting of individual inputs according to their information content.
Acknowledgments
This study was supported by a grant from the Natural Sciences and Engineering Research Council of Canada. The authors thank the reviewers for their astute comments which substantially improved the quality of this paper.
Cite this article
Green, J.R., Korenberg, M.J. On the Advantages of Multi-Input Single-Output Parallel Cascade Classifiers. Ann Biomed Eng 34, 709–716 (2006). https://doi.org/10.1007/s10439-006-9080-1
DOI: https://doi.org/10.1007/s10439-006-9080-1