Amino Acids

, Volume 38, Issue 4, pp 1201–1208 | Cite as

Predicting subcellular location of apoptosis proteins based on wavelet transform and support vector machine

  • Jian-Ding Qiu
  • San-Hua Luo
  • Jian-Hua Huang
  • Xing-Yu Sun
  • Ru-Ping Liang
Original Article

Abstract

Apoptosis proteins have a central role in the development and homeostasis of an organism. These proteins are very important for understanding the mechanism of programmed cell death. As a result of genome and other sequencing projects, the gap between the number of known apoptosis protein sequences and the number of known apoptosis protein structures is widening rapidly. Because of this extremely unbalanced state, it would be worthwhile to develop a fast and reliable method to identify their subcellular locations so as to gain better insight into their biological functions. In view of this, a new method, in which the support vector machine combines with discrete wavelet transform, has been developed to predict the subcellular location of apoptosis proteins. The results obtained by the jackknife test were quite promising, and indicated that the proposed method can remarkably improve the prediction accuracy of subcellular locations, and might also become a useful high-throughput tool in characterizing other attributes of proteins, such as enzyme class, membrane protein type, and nuclear receptor subfamily according to their sequences.

Keywords

Apoptosis protein Subcellular location Discrete wavelet transform Support vector machines Hydrophobicity 

Supplementary material

726_2009_331_MOESM1_ESM.doc (59 kb)
Supplementary material 1 (DOC 59 kb)

References

  1. Adams JM, Cory S (1998) The Bcl-2 protein family: arbiters of cell survival. Science 281:1322–1326CrossRefPubMedGoogle Scholar
  2. Barinaga M (1998) Stroke-damaged neurons may commit cellular suicide. Science 281:1302–1303CrossRefPubMedGoogle Scholar
  3. Bhasin M, Raghava GPS (2004a) Classification of nuclear receptors based on amino acid composition and dipeptide composition. J Biol Chem 279:23262–23266CrossRefPubMedGoogle Scholar
  4. Bhasin M, Raghava GPS (2004b) ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST. Nucleic Acids Res 32:W414–W419CrossRefPubMedGoogle Scholar
  5. Bulashevska A, Eils R (2006) Predicting protein subcellular locations using hierarchical ensemble of Bayesian classifiers based on Markov chains. BMC Bioinf 7:298CrossRefGoogle Scholar
  6. Cedano J, Aloy P, Pérez-Pons JA, Querol E (1997) Relation between amino acid composition and cellular location of proteins. J Mol Biol 266:594–600CrossRefPubMedGoogle Scholar
  7. Chang CC, Lin CJ (2002) Training nu-support vector regression: theory and algorithms. Neural Comput 14:1959–1977CrossRefPubMedGoogle Scholar
  8. Chen YL, Li QZ (2004) Prediction of the subcellular location apoptosis proteins using the algorithm of measure of diversity. Acta Sci Nat Univ NeiMongol 25:413–417Google Scholar
  9. Chen YL, Li QZ (2007a) Prediction of subcellular location of apoptosis proteins. J Theor Biol 245:775–783CrossRefPubMedGoogle Scholar
  10. Chen YL, Li QZ (2007b) Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition. J Theor Biol 248:377–381CrossRefPubMedGoogle Scholar
  11. Chou KC (2001) Prediction of protein cellular attributes using pseudo amino acid composition. Proteins Struct Funct Genet 43:246–255CrossRefPubMedGoogle Scholar
  12. Chou KC (2004) Review: structural bioinformatics and its impact to biomedical science. Curr Med Chem 11:2105–2134PubMedGoogle Scholar
  13. Chou KC, Cai YD (2003) A new hybrid approach to predict subcellular localization of proteins by incorporating gene oncology composition. Biochem Biophys Res Commun 311:743–747CrossRefPubMedGoogle Scholar
  14. Chou KC, Cai YD (2004) Predicting protein structural class by functional domain composition. Biochem Biophys Res Commun 321:1007–1009CrossRefPubMedGoogle Scholar
  15. Chou KC, Shen HB (2006) Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic k-nearest neighbor classifiers. J Proteome Res 5:1888–1897CrossRefPubMedGoogle Scholar
  16. Chou KC, Shen HB (2007) Euk-mPLoc: a fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites. J Proteome Res 6:1728–1734CrossRefPubMedGoogle Scholar
  17. Chou KC, Zhang CT (1995) Prediction of proteins structural classes. Crit Rev Biochem Mol Biol 30:275–349CrossRefPubMedGoogle Scholar
  18. Chou JJ, Matsuo H, Duan H, Wagner G (1998) Solution structure of the RAIDD CARD and model for CARD/CARD interaction in caspase-2 and caspase-9 recruitment. Cell 94:171–180CrossRefPubMedGoogle Scholar
  19. Chou JJ, Li H, Salvesen GS, Yuan J, Wagner G (1999) Solution structure of BID, an intracellular amplifier of apoptotic signaling. Cell 96:615–624CrossRefPubMedGoogle Scholar
  20. Daubechies I (1992) Ten lectures on wavelets. In: CBMS-NSF regional conference series in applied mathematics. SIAMGoogle Scholar
  21. Ding YS, Zhang TL (2008) Using Chou’s pseudo amino acid composition to predict subcellular localization of apoptosis proteins: an approach with immune genetic algorithm-based ensemble classifier. Pattern Recogn Lett 29:1887–1892CrossRefGoogle Scholar
  22. Eisenberg D, McLachlan AD (1986) Solvation energy in protein folding and binding. Nature 319:199–203CrossRefPubMedGoogle Scholar
  23. Eisenberg D, Weiss RM, Terwilliger TC (1984) The hydrophobic moment detects periodicity in protein hydrophobicity. Proc Natl Acad Sci USA 81:140–144CrossRefPubMedGoogle Scholar
  24. Evan G, Littlewood T (1998) A matter of life and cell death. Science 281:1317–1322CrossRefPubMedGoogle Scholar
  25. Fauchere JL, Pliska V (1983) Transformational homologies in amino acid sequence. Eur J Med Chem 18:369–375Google Scholar
  26. Feng ZP (2001) Prediction of the subcellular location of prokaryotic proteins based on a new representation of the amino acid composition. Biopolymers 58:491–499CrossRefPubMedGoogle Scholar
  27. Grunbaum FA (1992) An introduction to wavelets. Science 257:821–822CrossRefPubMedGoogle Scholar
  28. Hirakawa H, Muta S, Kuhara S (1999) The hydrophobic cores of proteins predicted by wavelet analysis. Bioinformatics 15:141–148CrossRefPubMedGoogle Scholar
  29. Huang Y, Li Y (2004) Prediction of protein subcellular locations using fuzzy k-NN method. Bioinformatics 20:21–28CrossRefPubMedGoogle Scholar
  30. Huang J, Shi F, Zhou HB (2005) Support vector machine for predicting apoptosis proteins types by incorporating protein instability index. China J Bioinf 3:121–123Google Scholar
  31. Jacobson MD, Weil M, Raff MC (1997) Programmed cell death in animal development. Cell 88:347–354CrossRefPubMedGoogle Scholar
  32. Joachims T (1999) Making large-scale SVM learning practical. In: Scholkopf B, Burges C, Smola A (eds) Advances in kernel methods—support vector learning. MIT Press, CambridgeGoogle Scholar
  33. Kerr JF, Wyllie AH, Currie AR (1972) Apoptosis: a basic biological phenomenon with wide-ranging implications in tissue kinetics. Br J Cancer 26:239–257PubMedGoogle Scholar
  34. Kreßel UH (1999) Pairwise classification and support vector machines. In: Schölkopf B, Burges CJ, Smola AJ (eds) Advances in kernel methods: support vector learning. MIT Press, CambridgeGoogle Scholar
  35. Kyte J, Doolittle RF (1982) A simple method for displaying the hydropathic character of a protein. J Mol Biol 157:105–132CrossRefPubMedGoogle Scholar
  36. Mallat SG (1989) A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Mach Intell 11:674–693CrossRefGoogle Scholar
  37. Mallat SG (1999) A wavelet tour of signal processing. Academic Press, San DiegoGoogle Scholar
  38. Nakashima H, Nishikawa K (1994) Discrimination of intracellular and extracellular proteins using amino acid composition and residue pair frequencies. J Mol Biol 238:54–61CrossRefPubMedGoogle Scholar
  39. Park KJ, Kanehisa M (2003) Prediction of protein subcellular locations by support vector machines using compositions of amino acid and amino acid pairs. Bioinformatics 19:1656–1663CrossRefPubMedGoogle Scholar
  40. Platt J, Cristianini N, Shawe-Taylor J (2000) Large margin DAGs for multiclass classification. Adv Neural Inf Proc Syst 12:547–553Google Scholar
  41. Qiu JD, Liang RP, Zou XY, Mo JY (2003) Prediction of protein secondary structure based on continuous wavelet transform. Talanta 61:285–293CrossRefPubMedGoogle Scholar
  42. Qiu JD, Liang RP, Zou XY, Mo JY (2004) Prediction of transmembrane proteins based on the continuous wavelet transform. J Chem Inf Comput Sci 44:741–747PubMedGoogle Scholar
  43. Qiu JD, Huang JH, Liang RP, Luo SH (2009a) Prediction of G-protein-coupled receptor classes based on the concept of Chou’s pseudo amino acid composition: an approach from discrete wavelet transform. Anal Biochem 390:68–73CrossRefPubMedGoogle Scholar
  44. Qiu JD, Luo SH, Huang JH, Liang RP (2009b) Using support vector machines for prediction of protein structural classes based on discrete wavelet transform. J Comput Chem 30:1344–1350CrossRefPubMedGoogle Scholar
  45. Qiu JD, Luo SH, Huang JH, Liang RP (2009c) Using support vector machines to distinguish enzymes: approached by incorporating wavelet transform. J Theor Biol 256:625–631CrossRefPubMedGoogle Scholar
  46. Reed JC, Paternostro G (1999) Postmitochondrial regulation of apoptosis during heart failure. Proc Natl Acad Sci USA 96:7614–7616CrossRefPubMedGoogle Scholar
  47. Schulz JB, Weller M, Moskowitz MA (1999) Caspases as treatment targets in stroke and neurodegenerative diseases. Ann Neurol 45:421–429CrossRefPubMedGoogle Scholar
  48. Shen HB, Chou KC (2007) Hum-mPLoc: an ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites. Biochem Biophys Res Commun 355:1006–1011CrossRefPubMedGoogle Scholar
  49. Shi JY, Zhang SW, Pan Q, Cheng YM, Xie J (2007) Prediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition. Amino Acids 33(1):69–74CrossRefPubMedGoogle Scholar
  50. Suzuki M, Youle RJ, Tjandra N (2000) Structure of Bax: coregulation of dimer formation and intracellular localization. Cell 103:645–654CrossRefPubMedGoogle Scholar
  51. Vapnik V (1995) The nature of statistical learning theory. Springer, New YorkGoogle Scholar
  52. Vapnik V (1998) Statistical learning theory. Wiley, New YorkGoogle Scholar
  53. Vaux DL, Heacker G, Strasser A (1994) An evolutionary perspective on apoptosis. Cell 76:77–779CrossRefGoogle Scholar
  54. Walczak B (2000) Wavelets in chemistry. Elsevier, AmsterdamGoogle Scholar
  55. Zhang ZH, Wang ZH, Zhang ZR, Wang YX (2006) A novel method for apoptosis protein subcellular localization prediction combining encoding based on grouped weight and support vector machine. FEBS Lett 580:6169–6174CrossRefPubMedGoogle Scholar
  56. Zhang SW, Zhang YL, Yang YF, Zhao CH, Pan Q (2008) Using the concept of Chou’s pseudo amino acid composition to predict protein subcellular localization: an approach by incorporating evolutionary information and von Neumann entropies. Amino Acids 34:565–572CrossRefPubMedGoogle Scholar
  57. Zhou GP, Doctor K (2003) Subcellular location prediction of apoptosis proteins. Proteins Struct Funct Genet 50:44–48CrossRefPubMedGoogle Scholar
  58. Zhou P, Chou JJ, Olea RS, Yuan J, Wagner G (1999) Solution structure of Apaf-1 CARD and its interaction with caspase-9 CARD: a structural basis for specific adaptor/caspase interaction. Proc Natl Acad Sci USA 96:11265–11270CrossRefPubMedGoogle Scholar
  59. Zhou XB, Chen C, Li ZC, Zou XY (2008) Improved prediction of subcellular location for apoptosis proteins by the dual-layer support vector machine. Amino Acids 35:383–388CrossRefPubMedGoogle Scholar

Copyright information

© Springer-Verlag 2009

Authors and Affiliations

  • Jian-Ding Qiu
    • 1
    • 2
  • San-Hua Luo
    • 1
  • Jian-Hua Huang
    • 1
  • Xing-Yu Sun
    • 1
  • Ru-Ping Liang
    • 1
  1. 1.Department of ChemistryNanchang UniversityNanchangPeople’s Republic of China
  2. 2.Department of Chemical EngineeringPingxiang CollegePingxiangPeople’s Republic of China

Personalised recommendations