Abstract
The apoptosis protein has a central role in the development and the homeostasis of an organism. Obtaining information about the subcellular localization of apoptosis protein is very helpful to understand the apoptosis mechanism and the function of this protein. Prediction of apoptosis protein’s subcellular localization is a challenging task, and currently the existing feature extraction methods mainly rely on the protein’s primary sequence. In this paper we develop a feature extraction model based on two different descriptors of evolutionary information, which contains the 192 frequencies of triplet codons (FTC) in the RNA sequence derived from the protein’s primary sequence and the 190 features from a detrended forward moving-average cross-correlation analysis (DFMCA) based on a position-specific scoring matrix (PSSM) generated by the PSI-BLAST program. Hence, this model is called FTC-DFMCA-PSSM. A 382-dimensional (382D) feature vector is constructed on the ZD98, ZW225 and CL317 datasets. Then a support vector machine is adopted as classifier, and the jackknife cross-validation test method is used for evaluating the accuracy. The overall prediction accuracies are further improved by an objective and rigorous jackknife test. Our model not only broadens the source of the feature information, but also provides a more accurate and reliable automated calculation method for the prediction of apoptosis protein’s subcellular localization.
Similar content being viewed by others
References
Adams JM, Cory S (1998) The Bcl-2 protein family: arbiters of cell survival. Science 281:1322–1326
Altschul SF, Schaffer TL, Madden AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Boeckmann B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, Gasteiger E, Martin MJ, Michoud K, O’Donovan C, Phan I, Pilbout S, Schneider M (2003) The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res 31:365–370
Bulashevska A, Eils R (2006) Predicting protein subcellular locations using hierarchical ensemble of Bayesian classifiers based on Markov chains. BMC Bioinform 7:298
Carbone A (2009) Detrending moving average algorithm: a brief review, Science and Technology for Humanity (TIC-STH), pp 691–696
Chang CC, Lin CJ (2001) LIBSVM: a library for support vector machines
Chen YL, Li QZ (2004) Prediction of the subcellular location apoptosis proteins using the algorithm of measure of diversity. Acta Sci Nat Univ NeiMongol 25:413–417
Chen YL, Li QZ (2007a) Prediction of the subcellular location of apoptosis proteins. J Theor Biol 245:775–783
Chen YL, Li QZ (2007b) Prediction of apoptosis protein’s subcellular location using improved hybrid approach and pseudo-amino acid composition. J Theor Biol 248:377–381
Chen J, Xu HM, He PA, Dai Q, Yao YH (2016) A multiple information fusion method for predicting subcellular locations of two different types of bacterial protein simultaneously. Biosystem 139:37–45
Cheng X, Xiao X, Chou KC (2017a) pLoc-mPlant: predict subcellular localization of multi-location plant proteins via incorporating the optimal GO information into general PseAAC. Mol BioSyst 13:1722–1727
Cheng X, Xiao X, Chou KC (2017b) pLoc-mVirus: predict subcellular localization of multi-location virus proteins by incorporating the optimal GO information into general PseAAC. Gene 628:315–321
Chou KC, Shen HB (2007) Review: Recent progress in protein subcellular location prediction. Anal Biochem 370:1–16
Dehzangi A, Heffernan R, Sharma A, Lyons J, Paliwal K, Sattar A (2015) Gram-positive and Gram-negative protein subcellular localization by incorporating evolutionary-based descriptors into Chou’s general PseAAC. J Theor Biol 364:284–294
Ding SY, Yan SJ, Qi SH, Li Y, Yao YH (2014) A protein structural classes prediction method based on PSI-BLAST profile. J Theor Biol 353:19–23
Eisenhaber F, Bork P (1998) Wanted: subcellular localization of proteins based on sequence. Trends Cell Biol 8:169–170
Evan G, Littlewood T (1998) A matter of life and cell death. Science 281:1317–1322
Fan GL, Li QZ (2012) Predict mycobacterial proteins subcellular locations by incorporating pseudo-average chemical shift into the general form of Chou’s pseudo amino acid composition. J Theor Biol 304:88–95
He LY, Chen SP (2011) A new approach to quantify power-law cross-correlation and its application to commodity markets. Phys A 390:3806–3814
Huang C, Yuan JQ (2013) Using radial basis function on the general form of Chou’s pseudo amino acid composition and PSSM to predict subcellular locations of proteins with both single and multiple sites. Biosystem 113:50–57
Huang J, Shi F, Zhou HB (2005) Support vector machine for predicting apoptosis proteins types by incorporating protein instability index. China J Bioinform 3:121–123
Jacobson MD, Weil M, Raff MC (1997) Programmed cell death in animal development. Cell 88:347–354
Liao B, Hu QM, Cai LJ, Chen HW, Zhu W (2014) RNA-transverse and longitudinal protein sequence encoding: an encoding method for protein sequence and its application. J Comput Theor Nanosci 11:1169–1173
Lin H, Wang H, Ding H, Chen YL, Li QZ (2009) Prediction of subcellular localization of apoptosis protein using Chou’s pseudo amino acid composition. Acta Biotheor 57:321–330
Liu T, Zheng X, Wang C, Wang J (2010) Prediction of subcellular location of apoptosis proteins using pseudo amino acid composition: an approach from auto covariance transformation. Protein Peptide Lett 17:1263–1269
Podobnik B, Stanley HE (2008) Detrended cross-correlation analysis: a new method for analyzing two non-stationary time series. Phys Rev Lett 100:084102.1–084102.4
Qiu JD, Luo SH, Huang JH, Sun XY, Liang RP (2010) Predicting subcellular location of apoptosis proteins based on wavelet transform and support vector machine. Amino Acids 38:1201–1208
Reed JC, Paternostro G (1999) Postmitochondrial regulation of apoptosis during heart failure. Proc Natl Acad Sci USA 96:7614–7616
Schulz JB, Weller M, Moskowitz MA (1999) Caspases as treatment targets in stroke and neurodegenerative diseases. Ann Neurol 45:421–429
Suzuki M, Youle RJ, Tjandra N (2000) Structure of Bax: coregulation of dimmer formation and intracellular location. Cell 103:645–654
Vandewalle N, Ausloos M (1998) Crossing of two mobile averages: a method for measuring the roughness exponent. Phys Rev E 58:6832–6834
Vapnik V (1995) The nature of statistical learning theory, 1st edn. Springer, New York
Wang GL, Dunbrack RL Jr (2003) PISCES: a protein sequence culling server. Bioinformatics 19:1589–1591
Wang JR, Wang C, Cao JJ, Liu XQ, Yao YH, Dai Q (2015a) Prediction of protein structural classes for low-similarity sequences using reduced PSSM and position-based secondary structural features. Gene 554:241–248
Wang X, Zhang WW, Zhang QW, Li GZ (2015b) MultiP-SChlo: multi-label protein subchloroplast localization prediction with Chou’s pseudo amino acid composition and a novel multi-label classifier. Bioinformatics 31:2639–2645
Wang X, Li H, Wang R, Zhang QW, Zhang WW, Gan Y (2017) MultiP-Apo: a multilabel predictor for identifying subcellular locations of apoptosis proteins. Comput Intell Neurosci 2017:9183796.1–9183796.10
Yang JY, Chen X (2011) Improving taxonomy-based protein fold recognition by using global and local features. Proteins 79:2053–2064
Yao YH, Shi ZX, Dai Q (2014) Apoptosis protein’s subcellular location prediction based on position-specific scoring matrix. J Comput Theor Nanosci 11:2073–2078
Yao YH, Xu HM, He PQ, Dai Q (2015) Recent advances on prediction of protein subcellular localization. Mini Rev Org Chem 12:481–492
Zhang ZH, Wang ZH, Zhang ZR, Wang YX (2006) A novel method for apoptosis protein’s subcellular localization prediction combining encoding based on grouped weight and support vector machine. FEBS Lett 580:6169–6174
Zhang L, Liao B, Li DC, Zhu W (2009) A novel representation for apoptosis protein subcellular localization prediction using support vector machine. J Theor Biol 259:361–365
Zhang LC, Kong L, Han XD, Lv JF (2016) Structural class prediction of protein using novel feature extraction method from chaos game representation of predicted secondary structure. J Theor Biol 400:1–10
Zhou GP, Doctor K (2003) Subcellular location prediction of apoptosis proteins. Proteins Struct Funct Genet 50:44–48
Acknowledgements
This work was supported by the National Natural Science Foundation of China (No. 11601407), the Fundamental Research Funds for the Central Universities (Nos. JB160711 and JBG160703) and Doctoral Scientific Research Foundation of Xi’an Polytechnic University (No. BS1710).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Rights and permissions
About this article
Cite this article
Liang, Y., Zhang, S. Prediction of Apoptosis Protein’s Subcellular Localization by Fusing Two Different Descriptors Based on Evolutionary Information. Acta Biotheor 66, 61–78 (2018). https://doi.org/10.1007/s10441-018-9319-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10441-018-9319-x