Prediction of linear B-cell epitopes using amino acid pair antigenicity scale
First Online: 26 January 2007 Received: 15 October 2006 Accepted: 28 November 2006 DOI:
Cite this article as: Chen, J., Liu, H., Yang, J. et al. Amino Acids (2007) 33: 423. doi:10.1007/s00726-006-0485-9 Summary.
Identification of antigenic sites on proteins is of vital importance for developing synthetic peptide vaccines, immunodiagnostic tests and antibody production. Currently, most of the prediction algorithms rely on amino acid propensity scales using a sliding window approach. These methods are oversimplified and yield poor predicted results in practice. In this paper, a novel scale, called the amino acid pair (AAP) antigenicity scale, is proposed that is based on the finding that B-cell epitopes favor particular AAPs. It is demonstrated that, using SVM (support vector machine) classifier, the AAP antigenicity scale approach has much better performance than the existing scales based on the single amino acid propensity. The AAP antigenicity scale can reflect some special sequence-coupled feature in the B-cell epitopes, which is the essence why the new approach is superior to the existing ones. It is anticipated that with the continuous increase of the known epitope data, the power of the AAP antigenicity scale approach will be further enhanced.
Keywords: B-cell epitope – AAP antigenicity scale – SVM classifier Abbreviations: AAP
amino acid pair
support vector machine
receiver operating characteristics
Electronic supplementary material
Supplementary material is available in the online version of this article at
and is accessible for authorised users. 10.1007/s00726-006-0485-9 References Alix, AJ 1999 Predictive estimation of protein linear epitopes by using the program PEOPLE Vaccine 18 311 314 PubMed CrossRef Blythe, MJ, Flower, DR 2005 Benchmarking B cell epitope prediction: underperformance of existing methods Protein Sci 14 246 248 PubMed CrossRef Cao, Y, Liu, S, Zhang, L, Qin, J, Wang, J, Tang, K 2006 Prediction of protein structural class with rough sets BMC Bioinformatics 7 20 PubMed CrossRef Chen, C, Zhou, X, Tian, Y, Zou, X, Cai, P 2006 Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network Anal Biochem 357 116 121 PubMed CrossRef Chou, KC 1993 A vectorized sequence-coupling model for predicting HIV protease cleavage sites in proteins J Biol Chem 268 16938 16948 PubMed Chou, KC 1995 A sequence-coupled vector-projection model for predicting the specificity of GalNAc-transferase Protein Sci 4 1365 1383 PubMed CrossRef Chou, KC 1996 Review: prediction of HIV protease cleavage sites in proteins Anal Biochem 233 1 14 PubMed CrossRef Chou, KC 1997a Prediction and classification of alpha-turn types Biopolymers 42 837 853 CrossRef Chou, KC 1997b Prediction of beta-turns in proteins J Peptide Res 49 120 144 CrossRef Chou, KC 1999 Using pair-coupled amino acid composition to predict protein secondary structure content J Protein Chem 18 473 480 PubMed CrossRef Chou, KC 2000 Review: prediction of tight turns and their types in proteins Anal Biochem 286 1 16 PubMed CrossRef Chou, KC 2001a Prediction of signal peptides using scaled window Peptides 22 1973 1979 CrossRef Chou, KC 2001b Using subsite coupling to predict signal peptides Protein Eng 14 75 79 CrossRef Chou, KC 2002 Review: prediction of protein signal sequences Curr Protein Peptide Sci 3 615 622 CrossRef Chou, KC, Blinn, JR 1997 Classification and prediction of beta-turn types J Protein Chem 16 575 595 PubMed CrossRef Chou, KC, Shen, HB 2006a Hum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization Biochem Biophys Res Commun 347 150 157 CrossRef Chou, KC, Shen, HB 2006b Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-nearest neighbor classifiers J Proteome Res 5 1888 1897 CrossRef Chou, KC, Zhang, CT 1993 Studies on the specificity of HIV protease: an application of Markov chain theory J Protein Chem 12 709 724 PubMed CrossRef Chou, KC, Zhang, CT 1995 Review: prediction of protein structural classes Crit Rev Biochem Mol Biol 30 275 349 PubMed Chou, PY, Fasman, GD 1978 Prediction of secondary structure of proteins from amino acid sequences Adv Enzymol Rel Subjects Biochem 47 45 148 Delacour, H, Servonnet, A, Perrot, A, Vigezzi, JF, Ramirez, JM 2005 ROC (receiver operating characteristics) curve: principles and application in biology Ann Biol Clin (Paris) 63 145 154 Emini, EA, Hughes, JV, Perlow, DS, Boger, J 1985 Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide J Virol 55 836 839 PubMed Feng, ZP 2001 Prediction of the subcellular location of prokaryotic proteins based on a new representation of the amino acid composition Biopolymers 58 491 499 PubMed CrossRef Gao, QB, Wang, ZZ, Yan, C, Du, YH 2005a Prediction of protein subcellular location using a combined feature of sequence FEBS Lett 579 3444 3448 CrossRef Gao, Y, Shao, SH, Xiao, X, Ding, YS, Huang, YS, Huang, ZD, Chou, KC 2005b Using pseudo amino acid composition to predict protein subcellular location: approached with Lyapunov index, Bessel function, and Chebyshev filter Amino Acids 28 373 376 CrossRef Guo, YZ, Li, M, Lu, M, Wen, Z, Wang, K, Li, G, Wu, J 2006 Classifying G protein-coupled receptors and nuclear receptors based on protein power spectrum from fast Fourier transform Amino Acids 30 397 402 PubMed CrossRef Karplus, PA, Schulz, GE 1985 Prediction of chain flexibility in proteins – a tool for the selection of peptide antigens Naturwissenschaften 72 212 213 CrossRef Kolaskar, AS, Tongaonkar, PC 1990 A semi-empirical method for prediction of antigenic determinants on protein antigens FEBS Lett 276 172 174 PubMed CrossRef Liu, H, Yang, J, Ling, JG, Chou, KC 2005 Prediction of protein signal sequences and their cleavage sites by statistical rulers Biochem Biophys Res Commun 338 1005 1011 PubMed CrossRef Liu, W, Chou, KC 1999 Protein secondary structural content prediction Protein Eng 12 1041 1050 PubMed CrossRef Odorico, M, Pellequer, JL 2003 BEPITOPE: predicting the location of continuous epitopes and patterns in proteins J Mol Recogn 16 20 22 CrossRef Parker, JM, Guo, D, Hodges, RS 1986 New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites Biochemistry 25 5425 5432 PubMed CrossRef Saha, S, Bhasin, M, Raghava, GP 2005 Bcipep: a database of B-cell epitopes BMC Genomics 6 79 PubMed CrossRef Scholkopf, B, Sung, KK, Burges, CJC, Girosi, F, Niyogi, P, Poggio, T, Vapnik, V 1997 Comparing support vector machines with Gaussian kernels to radial basis function classifiers IEEE Trans Sign Proc 45 2758 2765 CrossRef Sollner, J 2006 Selection and combination of machine learning classifiers for prediction of linear B-cell epitopes on proteins J Mol Recogn 19 209 214 CrossRef Sollner, J, Mayer, B 2006 Machine learning approaches for prediction of linear B-cell epitopes on proteins J Mol Recogn 19 200 208 CrossRef Sun, XD, Huang, RB 2006 Prediction of protein structural classes using support vector machines Amino Acids 30 469 475 PubMed CrossRef Vapnik, V 1998Statistical learning theory Wiley-Interscience New York
Wen Z, Li M, Li Y, Guo Y, Wang K (2007) Delaunay triangulation with partial least squares projection to latent structures: a model for G-protein coupled receptors classification and fast structure recognition. Amino Acids (in press) (DOI: 10.1007/s00726-006-0341-y)
Xiao, X, Shao, S, Ding, Y, Huang, Z, Huang, Y, Chou, KC 2005 Using complexity measure factor to predict protein subcellular location Amino Acids 28 57 61 PubMed CrossRef Zhang, CT, Chou, KC 1993 An alternate-subsite-coupled model for predicting HIV protease cleavage sites in proteins Protein Eng 7 65 73 CrossRef Zhang, SW, Pan, Q, Zhang, HC, Shao, ZC, Shi, JY 2006 Prediction protein homo-oligomer types by pseudo amino acid composition: approached with an improved feature extraction and naive Bayes feature fusion Amino Acids 30 461 468 PubMed CrossRef Zhou, GP 1998 An intriguing controversy over protein structural class prediction J Protein Chem 17 729 738 PubMed CrossRef Zhou, GP, Doctor, K 2003 Subcellular location prediction of apoptosis proteins Proteins Struct Funct Genet 50 44 48 PubMed CrossRef