A General Method for Combining Predictors Tested on Protein Secondary Structure Prediction

  • Conference paper

Part of the Perspectives in Neural Computing book series (PERSPECT.NEURAL)

Abstract

Ensemble methods, which combine several classifiers, have been applied successfully to reduce the generalization error of machine learning methods. In most ensemble methods the members are combined by a weighted sum of their outputs, called the linear average predictor. The logarithmic opinion pool instead combines the ensemble members multiplicatively, which treats their outputs as independent probabilities. The advantage of the logarithmic opinion pool is its connection to the Kullback-Leibler error function, which can be decomposed into two terms: an average of the errors of the ensemble members, and the ambiguity. The ambiguity is independent of the target function and can be estimated using unlabeled data. The decomposition makes it possible to obtain an unbiased estimate of the generalization error of the ensemble while still training on the full training set. These properties can be used to improve classification. The logarithmic opinion pool ensemble method is tested on the prediction of protein secondary structure. The focus is on how much improvement the general ensemble method can give rather than on outperforming existing methods, because the latter typically involves several more steps of refinement.
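
The combination rule and the decomposition described in the abstract can be illustrated with a small numerical sketch. The full text is not available here, so the code below is not the authors' implementation. It assumes the usual form of a logarithmic opinion pool, in which the pooled probability of class c is proportional to the product of the member probabilities p_a(c) raised to non-negative weights w_a that sum to one, and it assumes the ambiguity is the weighted Kullback-Leibler divergence from the pooled distribution to each member. Under those assumptions the ensemble error equals the weighted average member error minus the ambiguity. The function names, the weights, and the three-class example values are hypothetical, and every member probability is assumed to be strictly positive.

```python
import math


def log_opinion_pool(member_probs, weights):
    """Pool member distributions multiplicatively: p(c) is proportional to prod_a p_a(c)**w_a.

    Assumes every member probability is strictly positive and the weights sum to 1.
    """
    n_classes = len(member_probs[0])
    # Weighted sum of log-probabilities per class (the log of the unnormalised product).
    log_unnorm = [
        sum(w * math.log(p[c]) for p, w in zip(member_probs, weights))
        for c in range(n_classes)
    ]
    z = sum(math.exp(v) for v in log_unnorm)  # normalisation constant
    return [math.exp(v) / z for v in log_unnorm]


def kl(p, q):
    """Kullback-Leibler divergence KL(p || q); terms with p_i == 0 contribute 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def ambiguity(member_probs, weights):
    """Weighted KL divergence from the pooled distribution to each member.

    No target appears here, so it can be computed on unlabeled inputs.
    """
    pooled = log_opinion_pool(member_probs, weights)
    return sum(w * kl(pooled, p) for p, w in zip(member_probs, weights))


# Hypothetical example: three members predicting three classes
# (for instance helix, strand, coil) for a single residue.
members = [
    [0.7, 0.2, 0.1],
    [0.5, 0.3, 0.2],
    [0.6, 0.1, 0.3],
]
weights = [0.5, 0.3, 0.2]
target = [1.0, 0.0, 0.0]  # assumed true class: the first one

pooled = log_opinion_pool(members, weights)
ensemble_error = kl(target, pooled)
mean_member_error = sum(w * kl(target, p) for p, w in zip(members, weights))
amb = ambiguity(members, weights)

print("pooled distribution:", pooled)
print("ensemble error:", ensemble_error)
print("average member error minus ambiguity:", mean_member_error - amb)
```

The two printed error values agree up to floating-point rounding. Because the ambiguity is never negative, the pooled predictor in this sketch is never worse than the weighted average of its members, and the ambiguity term itself needs no labels.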

  • Chapter length: 6 pages

Copyright information

© 2000 Springer-Verlag London

About this paper

Cite this paper

Hansen, J.V., Krogh, A. (2000). A General Method for Combining Predictors Tested on Protein Secondary Structure Prediction. In: Malmgren, H., Borga, M., Niklasson, L. (eds) Artificial Neural Networks in Medicine and Biology. Perspectives in Neural Computing. Springer, London. https://doi.org/10.1007/978-1-4471-0513-8_39

  • DOI: https://doi.org/10.1007/978-1-4471-0513-8_39

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-85233-289-1

  • Online ISBN: 978-1-4471-0513-8

  • eBook Packages: Springer Book Archive