Using Multi-scale Glide Zoom Window Feature Extraction Approach to Predict Protein Homo-oligomer Types

  • QiPeng Li
  • Shao Wu Zhang
  • Quan Pan
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5265)

Abstract

The concept of multi-scale glide zoom window was proposed and a novel approach of multi-scale glide zoom window feature extraction was used for predicting protein homo-oligomers. Based on the concept of multi-scale glide zoom window, we choose two scale glide zoom window: whole protein sequence glide zoom window and kin amino acid glide zoom window, and for every scale glide zoom window, three feature vectors of amino acids distance sum, amino acids mean distance and amino acids distribution, were extracted. A series of feature sets were constructed by combining these feature vectors with amino acids composition to form pseudo amino acid compositions (PseAAC). The support vector machine (SVM) was used as base classifier. The 75.37% total accuracy is arrived in jackknife test in the weighted factor conditions, which is 10.05% higher than that of conventional amino acid composition method in same condition. The results show that multi-scale glide zoom window method of extracting feature vectors from protein sequence is effective and feasible, and the feature vectors of multi-scale glide zoom window may contain more protein structure information.

Keywords

Multi-scale glide zoom window feature extraction pseudo amino acid compositions homo-oligomer 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Chou, K.C.: Review: Low-frequency Collective Motion in Biomacromolecules and Its Biological Functions. Biophys. Chem. 30, 3–48 (1988)CrossRefPubMedGoogle Scholar
  2. 2.
    Chou, K.C.: Review: Structural Bioinformatics and Its Impact to Biomedical Science. Curr. Med. Chem. 11, 2105–2134 (2004e)CrossRefPubMedGoogle Scholar
  3. 3.
    Chou, K.C.: Molecular Therapeutic Target for Type-2 Diabetes. J. Proteome. Res. 3, 1284–1288 (2004a)CrossRefPubMedGoogle Scholar
  4. 4.
    Chou, K.C.: Insights from Modelling Three-dimensional Structures of the Human Potassium and Sodium Channels. J. Proteome. Res. 3, 856–861 (2004b)CrossRefPubMedGoogle Scholar
  5. 5.
    Chou, K.C.: Insights from Modelling the 3D Structure of the Extracellular Domain of Alpha7 Nicotinic Acetylcholine Receptor. Biochem. Biophys. Res. Commun. 319, 433–438 (2004c)CrossRefPubMedGoogle Scholar
  6. 6.
    Chou, K.C.: Modelling Extracellular Domains of GABA-A Receptors: Subtypes 1, 2, 3, and 5. Biochem. Biophys. Res. Commun. 316, 636–642 (2004d)CrossRefPubMedGoogle Scholar
  7. 7.
    Oxenoid, K., Chou, J.J.: The Structure of Phospholamban Pentamer Reveals a Channel-Like Architecture in Membranes. Proc. Natl. Acad. Sci. USA 102, 10870–10875 (2005)CrossRefPubMedPubMedCentralGoogle Scholar
  8. 8.
    Anfinsen, C.B., Haber., E., Sela, M., White, F.H.: The Kinetics of the Formation of Native Ribonuclease During Oxidation of the Reduced Polypeptide Chain. Proc. Natl. Acad. Sci. USA 47, 1309–1314 (1961)CrossRefPubMedPubMedCentralGoogle Scholar
  9. 9.
    Anfisen, C.B.: Principles that Govern the Folding of Protein Chains. Science 181, 223–230 (1973)CrossRefGoogle Scholar
  10. 10.
    Garian, R.: Prediction of Quaternary Structure from Primary Structure. Bioinformatics 17, 551–556 (2001)CrossRefPubMedGoogle Scholar
  11. 11.
    Chou, K.C., Cai, Y.D.: Predicting Protein Quaternary Structure by Pseudo Amino Acid Composition. Proteins Struct. Func. Gene. 53, 282–289 (2003b)CrossRefGoogle Scholar
  12. 12.
    Zhang, S.W., Quan, P., Zhang, H.C., Zhang, Y.L., Wang, H.Y.: Classification of Protein Quaternary Structure with Support Vector Machine. Bioinformatics 19, 2390–2396 (2003)CrossRefPubMedGoogle Scholar
  13. 13.
    Zhang, S.W., Pan., Q., Zhang., H.-C., Shao., Z.-C., Shi, J.-Y.: Prediction of Protein Homo-oligomer Types by Pseudo Amino Acid Composition: Approached with an Improved Feature Extraction and Naive Bayes Feature Fusion. Amino Acids 30, 461–468 (2006)CrossRefPubMedGoogle Scholar
  14. 14.
    Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)CrossRefGoogle Scholar
  15. 15.
    Vapnik, V. (ed.): Statistical Learning Theory. Wiley, New York (1998)Google Scholar
  16. 16.
    Bairoch, A., Apweiler, R.: The SWISS-PROT Protein Data Bank and Its New Supplement TrEMBL. Nucleic Acids Res. 24, 21–25 (1996)CrossRefPubMedPubMedCentralGoogle Scholar
  17. 17.
    Chou, K.C., Zhang, C.T.: Prediction of Protein Structural Classes. Crit. Rev. Biochem. Mol. Biol. 30, 275–349 (1995) (review)CrossRefPubMedGoogle Scholar
  18. 18.
    Zhou, G.P., Assa-Munt, N.: Some Insights Into Protein Structural Class Prediction. Proteins Struct. Funct. Genet. 44, 57–59 (2001)CrossRefPubMedGoogle Scholar
  19. 19.
    Fasman, G.D. (ed.): Handbook of Biochemistry and Molecular Biology, 3rd edn. CRC Press, Boca Raton (1976)Google Scholar
  20. 20.
    Bahar, I., Atilgan, A.R., Jernigan, R.L., Erman, B.: Understanding the Recognition of Protein Structural Classes by Amino Acid Composition. Proteins 29, 172–185 (1997)CrossRefPubMedGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • QiPeng Li
    • 1
  • Shao Wu Zhang
    • 1
  • Quan Pan
    • 1
  1. 1.School of Automation/School of MechatronicsNorthwestern Polytechnical UniversityXi’anChina

Personalised recommendations