Amino Acids

, Volume 43, Issue 2, pp 657–665

Wavelet images and Chou’s pseudo amino acid composition for protein classification

Original Article

DOI: 10.1007/s00726-011-1114-9

Cite this article as:
Nanni, L., Brahnam, S. & Lumini, A. Amino Acids (2012) 43: 657. doi:10.1007/s00726-011-1114-9

Abstract

The last decade has seen an explosion in the collection of protein data. To actualize the potential offered by this wealth of data, it is important to develop machine systems capable of classifying and extracting features from proteins. Reliable machine systems for protein classification offer many benefits, including the promise of finding novel drugs and vaccines. In developing our system, we analyze and compare several feature extraction methods used in protein classification that are based on the calculation of texture descriptors starting from a wavelet representation of the protein. We then feed these texture-based representations of the protein into an Adaboost ensemble of neural network or a support vector machine classifier. In addition, we perform experiments that combine our feature extraction methods with a standard method that is based on the Chou’s pseudo amino acid composition. Using several datasets, we show that our best approach outperforms standard methods. The Matlab code of the proposed protein descriptors is available at http://bias.csr.unibo.it/nanni/wave.rar.

Keywords

Proteins classification Machine learning Ensemble of classifiers Support vector machines 

Copyright information

© Springer-Verlag 2011

Authors and Affiliations

  • Loris Nanni
    • 1
  • Sheryl Brahnam
    • 2
  • Alessandra Lumini
    • 3
  1. 1.Department of Information EngineeringUniversity of PaduaPadovaItaly
  2. 2.Computer Information SystemsMissouri State UniversitySpringfieldUSA
  3. 3.Department of ElectronicInformatics and Systems (DEIS), Università di BolognaCesenaItaly

Personalised recommendations