Skip to main content

Advertisement

Log in

Prediction of Apoptosis Protein’s Subcellular Localization by Fusing Two Different Descriptors Based on Evolutionary Information

  • Regular Article
  • Published:
Acta Biotheoretica Aims and scope Submit manuscript

Abstract

The apoptosis protein has a central role in the development and the homeostasis of an organism. Obtaining information about the subcellular localization of apoptosis protein is very helpful to understand the apoptosis mechanism and the function of this protein. Prediction of apoptosis protein’s subcellular localization is a challenging task, and currently the existing feature extraction methods mainly rely on the protein’s primary sequence. In this paper we develop a feature extraction model based on two different descriptors of evolutionary information, which contains the 192 frequencies of triplet codons (FTC) in the RNA sequence derived from the protein’s primary sequence and the 190 features from a detrended forward moving-average cross-correlation analysis (DFMCA) based on a position-specific scoring matrix (PSSM) generated by the PSI-BLAST program. Hence, this model is called FTC-DFMCA-PSSM. A 382-dimensional (382D) feature vector is constructed on the ZD98, ZW225 and CL317 datasets. Then a support vector machine is adopted as classifier, and the jackknife cross-validation test method is used for evaluating the accuracy. The overall prediction accuracies are further improved by an objective and rigorous jackknife test. Our model not only broadens the source of the feature information, but also provides a more accurate and reliable automated calculation method for the prediction of apoptosis protein’s subcellular localization.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Adams JM, Cory S (1998) The Bcl-2 protein family: arbiters of cell survival. Science 281:1322–1326

    Article  Google Scholar 

  • Altschul SF, Schaffer TL, Madden AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402

    Article  Google Scholar 

  • Boeckmann B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, Gasteiger E, Martin MJ, Michoud K, O’Donovan C, Phan I, Pilbout S, Schneider M (2003) The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res 31:365–370

    Article  Google Scholar 

  • Bulashevska A, Eils R (2006) Predicting protein subcellular locations using hierarchical ensemble of Bayesian classifiers based on Markov chains. BMC Bioinform 7:298

    Article  Google Scholar 

  • Carbone A (2009) Detrending moving average algorithm: a brief review, Science and Technology for Humanity (TIC-STH), pp 691–696

  • Chang CC, Lin CJ (2001) LIBSVM: a library for support vector machines

  • Chen YL, Li QZ (2004) Prediction of the subcellular location apoptosis proteins using the algorithm of measure of diversity. Acta Sci Nat Univ NeiMongol 25:413–417

    Google Scholar 

  • Chen YL, Li QZ (2007a) Prediction of the subcellular location of apoptosis proteins. J Theor Biol 245:775–783

    Article  Google Scholar 

  • Chen YL, Li QZ (2007b) Prediction of apoptosis protein’s subcellular location using improved hybrid approach and pseudo-amino acid composition. J Theor Biol 248:377–381

    Article  Google Scholar 

  • Chen J, Xu HM, He PA, Dai Q, Yao YH (2016) A multiple information fusion method for predicting subcellular locations of two different types of bacterial protein simultaneously. Biosystem 139:37–45

    Article  Google Scholar 

  • Cheng X, Xiao X, Chou KC (2017a) pLoc-mPlant: predict subcellular localization of multi-location plant proteins via incorporating the optimal GO information into general PseAAC. Mol BioSyst 13:1722–1727

    Article  Google Scholar 

  • Cheng X, Xiao X, Chou KC (2017b) pLoc-mVirus: predict subcellular localization of multi-location virus proteins by incorporating the optimal GO information into general PseAAC. Gene 628:315–321

    Article  Google Scholar 

  • Chou KC, Shen HB (2007) Review: Recent progress in protein subcellular location prediction. Anal Biochem 370:1–16

    Article  Google Scholar 

  • Dehzangi A, Heffernan R, Sharma A, Lyons J, Paliwal K, Sattar A (2015) Gram-positive and Gram-negative protein subcellular localization by incorporating evolutionary-based descriptors into Chou’s general PseAAC. J Theor Biol 364:284–294

    Article  Google Scholar 

  • Ding SY, Yan SJ, Qi SH, Li Y, Yao YH (2014) A protein structural classes prediction method based on PSI-BLAST profile. J Theor Biol 353:19–23

    Article  Google Scholar 

  • Eisenhaber F, Bork P (1998) Wanted: subcellular localization of proteins based on sequence. Trends Cell Biol 8:169–170

    Article  Google Scholar 

  • Evan G, Littlewood T (1998) A matter of life and cell death. Science 281:1317–1322

    Article  Google Scholar 

  • Fan GL, Li QZ (2012) Predict mycobacterial proteins subcellular locations by incorporating pseudo-average chemical shift into the general form of Chou’s pseudo amino acid composition. J Theor Biol 304:88–95

    Article  Google Scholar 

  • He LY, Chen SP (2011) A new approach to quantify power-law cross-correlation and its application to commodity markets. Phys A 390:3806–3814

    Article  Google Scholar 

  • Huang C, Yuan JQ (2013) Using radial basis function on the general form of Chou’s pseudo amino acid composition and PSSM to predict subcellular locations of proteins with both single and multiple sites. Biosystem 113:50–57

    Article  Google Scholar 

  • Huang J, Shi F, Zhou HB (2005) Support vector machine for predicting apoptosis proteins types by incorporating protein instability index. China J Bioinform 3:121–123

    Google Scholar 

  • Jacobson MD, Weil M, Raff MC (1997) Programmed cell death in animal development. Cell 88:347–354

    Article  Google Scholar 

  • Liao B, Hu QM, Cai LJ, Chen HW, Zhu W (2014) RNA-transverse and longitudinal protein sequence encoding: an encoding method for protein sequence and its application. J Comput Theor Nanosci 11:1169–1173

    Article  Google Scholar 

  • Lin H, Wang H, Ding H, Chen YL, Li QZ (2009) Prediction of subcellular localization of apoptosis protein using Chou’s pseudo amino acid composition. Acta Biotheor 57:321–330

    Article  Google Scholar 

  • Liu T, Zheng X, Wang C, Wang J (2010) Prediction of subcellular location of apoptosis proteins using pseudo amino acid composition: an approach from auto covariance transformation. Protein Peptide Lett 17:1263–1269

    Article  Google Scholar 

  • Podobnik B, Stanley HE (2008) Detrended cross-correlation analysis: a new method for analyzing two non-stationary time series. Phys Rev Lett 100:084102.1–084102.4

    Article  Google Scholar 

  • Qiu JD, Luo SH, Huang JH, Sun XY, Liang RP (2010) Predicting subcellular location of apoptosis proteins based on wavelet transform and support vector machine. Amino Acids 38:1201–1208

    Article  Google Scholar 

  • Reed JC, Paternostro G (1999) Postmitochondrial regulation of apoptosis during heart failure. Proc Natl Acad Sci USA 96:7614–7616

    Article  Google Scholar 

  • Schulz JB, Weller M, Moskowitz MA (1999) Caspases as treatment targets in stroke and neurodegenerative diseases. Ann Neurol 45:421–429

    Article  Google Scholar 

  • Suzuki M, Youle RJ, Tjandra N (2000) Structure of Bax: coregulation of dimmer formation and intracellular location. Cell 103:645–654

    Article  Google Scholar 

  • Vandewalle N, Ausloos M (1998) Crossing of two mobile averages: a method for measuring the roughness exponent. Phys Rev E 58:6832–6834

    Article  Google Scholar 

  • Vapnik V (1995) The nature of statistical learning theory, 1st edn. Springer, New York

    Book  Google Scholar 

  • Wang GL, Dunbrack RL Jr (2003) PISCES: a protein sequence culling server. Bioinformatics 19:1589–1591

    Article  Google Scholar 

  • Wang JR, Wang C, Cao JJ, Liu XQ, Yao YH, Dai Q (2015a) Prediction of protein structural classes for low-similarity sequences using reduced PSSM and position-based secondary structural features. Gene 554:241–248

    Article  Google Scholar 

  • Wang X, Zhang WW, Zhang QW, Li GZ (2015b) MultiP-SChlo: multi-label protein subchloroplast localization prediction with Chou’s pseudo amino acid composition and a novel multi-label classifier. Bioinformatics 31:2639–2645

    Article  Google Scholar 

  • Wang X, Li H, Wang R, Zhang QW, Zhang WW, Gan Y (2017) MultiP-Apo: a multilabel predictor for identifying subcellular locations of apoptosis proteins. Comput Intell Neurosci 2017:9183796.1–9183796.10

    Google Scholar 

  • Yang JY, Chen X (2011) Improving taxonomy-based protein fold recognition by using global and local features. Proteins 79:2053–2064

    Article  Google Scholar 

  • Yao YH, Shi ZX, Dai Q (2014) Apoptosis protein’s subcellular location prediction based on position-specific scoring matrix. J Comput Theor Nanosci 11:2073–2078

    Article  Google Scholar 

  • Yao YH, Xu HM, He PQ, Dai Q (2015) Recent advances on prediction of protein subcellular localization. Mini Rev Org Chem 12:481–492

    Article  Google Scholar 

  • Zhang ZH, Wang ZH, Zhang ZR, Wang YX (2006) A novel method for apoptosis protein’s subcellular localization prediction combining encoding based on grouped weight and support vector machine. FEBS Lett 580:6169–6174

    Article  Google Scholar 

  • Zhang L, Liao B, Li DC, Zhu W (2009) A novel representation for apoptosis protein subcellular localization prediction using support vector machine. J Theor Biol 259:361–365

    Article  Google Scholar 

  • Zhang LC, Kong L, Han XD, Lv JF (2016) Structural class prediction of protein using novel feature extraction method from chaos game representation of predicted secondary structure. J Theor Biol 400:1–10

    Article  Google Scholar 

  • Zhou GP, Doctor K (2003) Subcellular location prediction of apoptosis proteins. Proteins Struct Funct Genet 50:44–48

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (No. 11601407), the Fundamental Research Funds for the Central Universities (Nos. JB160711 and JBG160703) and Doctoral Scientific Research Foundation of Xi’an Polytechnic University (No. BS1710).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Yunyun Liang or Shengli Zhang.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liang, Y., Zhang, S. Prediction of Apoptosis Protein’s Subcellular Localization by Fusing Two Different Descriptors Based on Evolutionary Information. Acta Biotheor 66, 61–78 (2018). https://doi.org/10.1007/s10441-018-9319-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10441-018-9319-x

Keywords

Navigation