Abstract
Palmitoylation is one of the most important post-translational modifications involving molecular signalling activities. Two simple methods have been developed very recently for predicting palmitoylation sites, but the sensitivity (the prediction accuracy of palmitoylation sites) of both methods is low (< 65%). A regularised bio-basis function neural network is implemented in this paper aiming to improve the sensitivity. A set of protein sequences with experimentally determined palmitoylation sites are downloaded from NCBI for the study. The protein-oriented cross-validation strategy is used for proper model construction. The experiments show that the regularised bio-basis function neural network significantly outperforms the two existing methods as well as the support vector machine and the radial basis function neural network. Specifically the sensitivity has been significantly improved with a slightly improved specificity (the prediction accuracy of non-palmitoylation sites).
Chapter PDF
Similar content being viewed by others
References
Veit, M., Schmidt, M.F.G.: Palmitoylation of viral and cellular proteins. In: Schmidt, M.F.G. (Hrsg.) Influenza Viruses, Grosse-Verlag, Berlin (2006)
Navarro-Lerida, I., Alvarez-Barrientos, A., Rodríguez-Crespo, I.: N-terminal palmitoylation within the appropriate amino acid environment conveys on NOS2 the ability to progress along the intracellular sorting pathways. Journal of Cell Science 119, 1558–1596 (2006)
Kurayoshi, M., et al.: Post-translational palmitoylation and glycosylation of Wnt-5a are necessary for its signaling.
Smotrys, J.E., Linder, M.E.: Palmitoylation of intracellular signaling proteins: regulation and function. Annu. Rev. Biochem. 73, 559–587 (2004)
Li, M., et al.: Palmitoylation of the murine leukemia virus envelope protein is critical for lipid raft association and surface expression. J. Virol. 76, 11845–11852 (2002)
Yu, G., et al.: Palmitoylation and Polymerization of Hepatitis C Virus NS4B Protein. Journal of Virology 80, 6013–6023 (2006)
Peng, Y., Tang, F., Weisman, L.S.: Palmitoylation plays a role in targeting Vac8p to specific membrane subdomains. Traffic 7, 1378 (2006)
Poorman, R.A., et al.: A cumulative specificity model for protease from human immunodeficiency virus types 1 and 2, inferred from statistical analysis of an extended substrate data base. J. Biol. Chem. 22, 14554–14561 (1991)
Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77, 257–286 (1989)
Nakata, K., Maizel, J.V.: Prediction of operator-binding protein by discriminant analysis. Gene Anal. Tech. 6, 111–119 (1989)
Chen, C.P., Rost, B.: State-of-the-art in membrane protein prediction. Applied Bioinformatics 1, 21–35 (2002)
Senawongse, P., Dalby, A., Yang, Z.R.: Predicting the phosphorylation sites using hidden Markov models and Machine Learning methods. Journal of Chemical Information and Computer Science 45, 1147–1152 (2005)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)
Scholkopf, B.: The kernel trick for distances. Technical Report. Microsoft Research (May 2000)
Nielsen, M., et al.: Reliable prediction of T-cell epitopes using neural networks with novel sequence representations. Protein Science 12, 1007–1017 (2003)
Hansen, J.E., et al.: Prediction of O-glycosylation of mammalian proteins: specificity patterns of UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase. Biochem. J. 30, 801–813 (1995)
Gutteridge, A., Bartlett, G.J., Thornton, J.M.: Using a neural network and spatial clustering to predict the location of active sites in enzymes. Journal of Molecular Biology 330, 719–734 (2003)
Blom, N., Gammeltoft, S., Brunak, S.: Sequence and structure based prediction of eukaryotic protein phosphorylation sites. J. Mol. Biol. 24, 1351–1362 (1999)
Ehrlich, L., et al.: Prediction of protein hydration sites from sequence by modular neural networks. Protein Eng. 11, 11–19 (1998)
Zien, A., et al.: Engineering support vector machine kernels that recognize translation initiation sites. Bioinformatics 16, 799–807 (2000)
Kim, J.H., et al.: Prediction of phosphorylation sites using SVMs. Bioinformatics 20, 3179–3184 (2006)
Zhao, Y., et al.: Application of support vector machines for T-cell epitopes prediction. Bioinformatics 19, 1978–1984 (2003)
Koike, A., Takagi, T.: Prediction of protein-protein interaction sites using support vector machines. Protein Eng. Des. Sel. 17, 165–173 (2004)
Zhou, F., et al.: CSS-Palm: palmitoylation site prediction with a clustering and scoring strategy (CSS). Bioinformatics 22, 894–896 (2006)
Xue, Y., et al.: NBA-Palm: prediction of palmitoylation site implemented in Naïe Bayes algorithm. BMC Bioinformatics 7, 1–10 (2006)
Qian, N., Sejnowski, T.: Predicting the secondary structure of globular proteins using neural network models. In: Proceeding of Int J. Conf. On Neural Networks, pp. 865–884 (1998)
Thomson, R., et al.: Characterising proteolytic cleavage site activity using bio-basis function neural networks. Bioinformatics 19, 1741–1747 (2003)
Yang, Z.R., Thomson, R.: Bio-basis function neural network for prediction of protease cleavage sites in proteins. IEEE Trans. on Neural Networks 16, 263–274 (2005)
You, L., Garwicz, D., Rognvaldsson, T.: Comprehensive bioinformatic analysis of the specificity of human immunodeficiency virus type 1 protease. Journal of Virology 79, 12477–12486 (2005)
Yang, Z.R., Berry, E.: A novel neural learning algorithm for protease cleavage site prediction. Journal of Bioinformatics and Computational Biology 2, 511–531 (2004)
Thomson, R., Esnouf, R.: Predict disordered proteins using bio-basis function neural networks. In: Yang, Z.R., Yin, H., Everson, R.M. (eds.) IDEAL 2004. LNCS, vol. 3177, pp. 19–27. Springer, Heidelberg (2004)
Yang, Z.R., et al.: RONN: use of the bio-basis function neural network technique for the detection of natively disordered regions in proteins. Bioinformatics 21, 3369–3376 (2005)
Berry, E., Dalby, A., Yang, Z.R.: Reduced bio-basis function neural networks in prediction of phosphorylation sites, a comparative study. Computational Biology and Chemistry 28, 75–85 (2004)
Yang, Z.R., Chou, K.C.: Predicting the O-linkage sites in glycoproteins using bio-basis function neural networks. Bioinformatics 20, 903–908 (2004)
Yang, Z.R.: Prediction of caspase cleavage sites using Bayesian bio-basis function neural networks. Bioinformatics 21, 1831–1837 (2005)
Yang, Z.R.: Mining SARS-CoV protease cleavage data using decision trees, a novel method for decisive template searching. Bioinformatics 21, 2644–2650 (2005)
Sidhu, A., Yang, Z.R.: Prediction of signal peptides using bio-basis function neural networks and decision trees. Applied Bioinformatics 5, 13–19 (2006)
Yang, Z.R.: Orthogonal kernel machine in prediction of functional sites in preteins. IEEE Trans. on Systems, Man and Cybernetics 35, 100–106 (2005)
Yang, Z.R., Johnathan, F.: Predict T-cell epitopes using bio-support vector machines. Journal of Chemical Information and Computer Sciences 45, 1142–1148 (2005)
Neumaier, A.: Solving ill-conditioned and singular linear systems: A tutorial on regularization. SIAM Review 40, 636–666 (1998)
Girosi, F., Jones, M., Poggio, T.: Regularization Theory and Neural Networks Architectures. Neural Computation 7, 219–269 (1995)
Bishop, C.: Neural Networks for Pattern Recognition. Oxford Press, Oxford (1995)
Dayhoff, M.O., Schwartz, R.M., Orcutt, B.C.: A model of evolutionary change in proteins. Matrices for detecting distant relationships. Atlas of protein sequence and structure 5, 345–358 (1978)
Henikoff, S., Henikoff, J.G.: Amino acid Substitution matrices from protein blocks. Proc. Natl. Acad. Sci. 89, 10915–10919 (1992)
Johnson, M.S., Overington, J.P.: A structural basis for sequence comparisons-an evaluation of scoring methodologies. Journal Molecular Biology 233, 716–738 (1993)
Matthews, B.W.: Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim. Biophys. Acta 405, 442–451 (1975)
Schneider, T.D., Stephens, R.M.: Sequence Logos: A new way to display consensus sequences. Nucleic Acids Res. 18, 6097–6100 (1990)
Metz, C.E.: Basic principles of ROC analysis. Seminars in Nuclear Medicine 8, 283–298 (1978)
Yang, Z.R.: Predicting Hepatitis C virus protease cleavage sites using generalised linear indicator regression models. IEEE Trans. on Biomedical Engineering 53, 2119–2123 (2006)
Yang, Z.R.: A probabilistic peptide machine for predicting Hepatitis C virus protease cleavage sites. IEEE Trans. on Information Technology in Biomedicine (in press)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, Z.R. (2007). Predicting Palmitoylation Sites Using a Regularised Bio-basis Function Neural Network. In: Măndoiu, I., Zelikovsky, A. (eds) Bioinformatics Research and Applications. ISBRA 2007. Lecture Notes in Computer Science(), vol 4463. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72031-7_37
Download citation
DOI: https://doi.org/10.1007/978-3-540-72031-7_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72030-0
Online ISBN: 978-3-540-72031-7
eBook Packages: Computer ScienceComputer Science (R0)