Skip to main content
Log in

Prediction of linear B-cell epitopes using amino acid pair antigenicity scale

  • Published:
Amino Acids Aims and scope Submit manuscript

Summary.

Identification of antigenic sites on proteins is of vital importance for developing synthetic peptide vaccines, immunodiagnostic tests and antibody production. Currently, most of the prediction algorithms rely on amino acid propensity scales using a sliding window approach. These methods are oversimplified and yield poor predicted results in practice. In this paper, a novel scale, called the amino acid pair (AAP) antigenicity scale, is proposed that is based on the finding that B-cell epitopes favor particular AAPs. It is demonstrated that, using SVM (support vector machine) classifier, the AAP antigenicity scale approach has much better performance than the existing scales based on the single amino acid propensity. The AAP antigenicity scale can reflect some special sequence-coupled feature in the B-cell epitopes, which is the essence why the new approach is superior to the existing ones. It is anticipated that with the continuous increase of the known epitope data, the power of the AAP antigenicity scale approach will be further enhanced.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Abbreviations

AAP:

amino acid pair

SVM:

support vector machine

ROC:

receiver operating characteristics

References

  • AJ Alix (1999) ArticleTitlePredictive estimation of protein linear epitopes by using the program PEOPLE Vaccine 18 311–314 Occurrence Handle10506656 Occurrence Handle10.1016/S0264-410X(99)00329-1 Occurrence Handle1:CAS:528:DyaK1MXnsVegur8%3D

    Article  PubMed  CAS  Google Scholar 

  • MJ Blythe DR Flower (2005) ArticleTitleBenchmarking B cell epitope prediction: underperformance of existing methods Protein Sci 14 246–248 Occurrence Handle15576553 Occurrence Handle10.1110/ps.041059505 Occurrence Handle1:CAS:528:DC%2BD2MXhtFWrug%3D%3D

    Article  PubMed  CAS  Google Scholar 

  • Y Cao S Liu L Zhang J Qin J Wang K Tang (2006) ArticleTitlePrediction of protein structural class with rough sets BMC Bioinformatics 7 20 Occurrence Handle16412240 Occurrence Handle10.1186/1471-2105-7-20 Occurrence Handle1:CAS:528:DC%2BD28Xht1Sks7s%3D

    Article  PubMed  CAS  Google Scholar 

  • C Chen X Zhou Y Tian X Zou P Cai (2006) ArticleTitlePredicting protein structural class with pseudo-amino acid composition and support vector machine fusion network Anal Biochem 357 116–121 Occurrence Handle16920060 Occurrence Handle10.1016/j.ab.2006.07.022 Occurrence Handle1:CAS:528:DC%2BD28XpsVOgs78%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou (1993) ArticleTitleA vectorized sequence-coupling model for predicting HIV protease cleavage sites in proteins J Biol Chem 268 16938–16948 Occurrence Handle8349584 Occurrence Handle1:CAS:528:DyaK3sXlvFygsbY%3D

    PubMed  CAS  Google Scholar 

  • KC Chou (1995) ArticleTitleA sequence-coupled vector-projection model for predicting the specificity of GalNAc-transferase Protein Sci 4 1365–1383 Occurrence Handle7670379 Occurrence Handle1:CAS:528:DyaK2MXms1Knsr0%3D Occurrence Handle10.1002/pro.5560040712

    Article  PubMed  CAS  Google Scholar 

  • KC Chou (1996) ArticleTitleReview: prediction of HIV protease cleavage sites in proteins Anal Biochem 233 1–14 Occurrence Handle8789141 Occurrence Handle10.1006/abio.1996.0001 Occurrence Handle1:CAS:528:DyaK28XmtVKmsA%3D%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou (1997a) ArticleTitlePrediction and classification of alpha-turn types Biopolymers 42 837–853 Occurrence Handle10.1002/(SICI)1097-0282(199712)42:7<837::AID-BIP9>3.0.CO;2-U Occurrence Handle1:CAS:528:DyaK2sXnsFWqu7g%3D

    Article  CAS  Google Scholar 

  • KC Chou (1997b) ArticleTitlePrediction of beta-turns in proteins J Peptide Res 49 120–144 Occurrence Handle1:CAS:528:DyaK2sXisVOjtrk%3D Occurrence Handle10.1111/j.1399-3011.1997.tb00608.x

    Article  CAS  Google Scholar 

  • KC Chou (1999) ArticleTitleUsing pair-coupled amino acid composition to predict protein secondary structure content J Protein Chem 18 473–480 Occurrence Handle10449044 Occurrence Handle10.1023/A:1020696810938 Occurrence Handle1:CAS:528:DyaK1MXkvFajtLg%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou (2000) ArticleTitleReview: prediction of tight turns and their types in proteins Anal Biochem 286 1–16 Occurrence Handle11038267 Occurrence Handle10.1006/abio.2000.4757 Occurrence Handle1:CAS:528:DC%2BD3cXntlKrsL0%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou (2001a) ArticleTitlePrediction of signal peptides using scaled window Peptides 22 1973–1979 Occurrence Handle10.1016/S0196-9781(01)00540-X Occurrence Handle1:CAS:528:DC%2BD38Xitlaguw%3D%3D

    Article  CAS  Google Scholar 

  • KC Chou (2001b) ArticleTitleUsing subsite coupling to predict signal peptides Protein Eng 14 75–79 Occurrence Handle10.1093/protein/14.2.75 Occurrence Handle1:CAS:528:DC%2BD3MXjsVektrs%3D

    Article  CAS  Google Scholar 

  • KC Chou (2002) ArticleTitleReview: prediction of protein signal sequences Curr Protein Peptide Sci 3 615–622 Occurrence Handle10.2174/1389203023380468 Occurrence Handle1:CAS:528:DC%2BD38XosF2ku7w%3D

    Article  CAS  Google Scholar 

  • KC Chou JR Blinn (1997) ArticleTitleClassification and prediction of beta-turn types J Protein Chem 16 575–595 Occurrence Handle9263121 Occurrence Handle10.1023/A:1026366706677 Occurrence Handle1:CAS:528:DyaK2sXltlWgtrg%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou HB Shen (2006a) ArticleTitleHum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization Biochem Biophys Res Commun 347 150–157 Occurrence Handle10.1016/j.bbrc.2006.06.059 Occurrence Handle1:CAS:528:DC%2BD28Xmslyrsbc%3D

    Article  CAS  Google Scholar 

  • KC Chou HB Shen (2006b) ArticleTitlePredicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-nearest neighbor classifiers J Proteome Res 5 1888–1897 Occurrence Handle10.1021/pr060167c Occurrence Handle1:CAS:528:DC%2BD28XmvVeitr0%3D

    Article  CAS  Google Scholar 

  • KC Chou CT Zhang (1993) ArticleTitleStudies on the specificity of HIV protease: an application of Markov chain theory J Protein Chem 12 709–724 Occurrence Handle8136021 Occurrence Handle10.1007/BF01024929 Occurrence Handle1:CAS:528:DyaK2cXht1ygs7Y%3D

    Article  PubMed  CAS  Google Scholar 

  • KC Chou CT Zhang (1995) ArticleTitleReview: prediction of protein structural classes Crit Rev Biochem Mol Biol 30 275–349 Occurrence Handle7587280 Occurrence Handle1:CAS:528:DyaK2MXosFentb8%3D

    PubMed  CAS  Google Scholar 

  • PY Chou GD Fasman (1978) ArticleTitlePrediction of secondary structure of proteins from amino acid sequences Adv Enzymol Rel Subjects Biochem 47 45–148 Occurrence Handle1:CAS:528:DyaE1MXkvFOjtQ%3D%3D

    CAS  Google Scholar 

  • H Delacour A Servonnet A Perrot JF Vigezzi JM Ramirez (2005) ArticleTitleROC (receiver operating characteristics) curve: principles and application in biology Ann Biol Clin (Paris) 63 145–154 Occurrence Handle1:STN:280:DC%2BD2M7mslWitQ%3D%3D

    CAS  Google Scholar 

  • EA Emini JV Hughes DS Perlow J Boger (1985) ArticleTitleInduction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide J Virol 55 836–839 Occurrence Handle2991600 Occurrence Handle1:CAS:528:DyaL2MXlsFyksbY%3D

    PubMed  CAS  Google Scholar 

  • ZP Feng (2001) ArticleTitlePrediction of the subcellular location of prokaryotic proteins based on a new representation of the amino acid composition Biopolymers 58 491–499 Occurrence Handle11241220 Occurrence Handle10.1002/1097-0282(20010415)58:5<491::AID-BIP1024>3.0.CO;2-I Occurrence Handle1:CAS:528:DC%2BD3MXisVSntb8%3D

    Article  PubMed  CAS  Google Scholar 

  • QB Gao ZZ Wang C Yan YH Du (2005a) ArticleTitlePrediction of protein subcellular location using a combined feature of sequence FEBS Lett 579 3444–3448 Occurrence Handle10.1016/j.febslet.2005.05.021 Occurrence Handle1:CAS:528:DC%2BD2MXlt1KjsL0%3D

    Article  CAS  Google Scholar 

  • Y Gao SH Shao X Xiao YS Ding YS Huang ZD Huang KC Chou (2005b) ArticleTitleUsing pseudo amino acid composition to predict protein subcellular location: approached with Lyapunov index, Bessel function, and Chebyshev filter Amino Acids 28 373–376 Occurrence Handle10.1007/s00726-005-0206-9 Occurrence Handle1:CAS:528:DC%2BD2MXlt1Kmurw%3D

    Article  CAS  Google Scholar 

  • YZ Guo M Li M Lu Z Wen K Wang G Li J Wu (2006) ArticleTitleClassifying G protein-coupled receptors and nuclear receptors based on protein power spectrum from fast Fourier transform Amino Acids 30 397–402 Occurrence Handle16773242 Occurrence Handle10.1007/s00726-006-0332-z Occurrence Handle1:CAS:528:DC%2BD28Xls1egs7o%3D

    Article  PubMed  CAS  Google Scholar 

  • PA Karplus GE Schulz (1985) ArticleTitlePrediction of chain flexibility in proteins – a tool for the selection of peptide antigens Naturwissenschaften 72 212–213 Occurrence Handle10.1007/BF01195768 Occurrence Handle1:CAS:528:DyaL2MXktVeitbs%3D

    Article  CAS  Google Scholar 

  • AS Kolaskar PC Tongaonkar (1990) ArticleTitleA semi-empirical method for prediction of antigenic determinants on protein antigens FEBS Lett 276 172–174 Occurrence Handle1702393 Occurrence Handle10.1016/0014-5793(90)80535-Q Occurrence Handle1:CAS:528:DyaK3MXktVGntw%3D%3D

    Article  PubMed  CAS  Google Scholar 

  • H Liu J Yang JG Ling KC Chou (2005) ArticleTitlePrediction of protein signal sequences and their cleavage sites by statistical rulers Biochem Biophys Res Commun 338 1005–1011 Occurrence Handle16256954 Occurrence Handle10.1016/j.bbrc.2005.10.046 Occurrence Handle1:CAS:528:DC%2BD2MXht1Wjur3F

    Article  PubMed  CAS  Google Scholar 

  • W Liu KC Chou (1999) ArticleTitleProtein secondary structural content prediction Protein Eng 12 1041–1050 Occurrence Handle10611397 Occurrence Handle10.1093/protein/12.12.1041 Occurrence Handle1:CAS:528:DC%2BD3cXmt1agug%3D%3D

    Article  PubMed  CAS  Google Scholar 

  • M Odorico JL Pellequer (2003) ArticleTitleBEPITOPE: predicting the location of continuous epitopes and patterns in proteins J Mol Recogn 16 20–22 Occurrence Handle10.1002/jmr.602 Occurrence Handle1:CAS:528:DC%2BD3sXhtFKnsb8%3D

    Article  CAS  Google Scholar 

  • JM Parker D Guo RS Hodges (1986) ArticleTitleNew hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites Biochemistry 25 5425–5432 Occurrence Handle2430611 Occurrence Handle10.1021/bi00367a013 Occurrence Handle1:CAS:528:DyaL28XlsFWjtb4%3D

    Article  PubMed  CAS  Google Scholar 

  • S Saha M Bhasin GP Raghava (2005) ArticleTitleBcipep: a database of B-cell epitopes BMC Genomics 6 79 Occurrence Handle15921533 Occurrence Handle10.1186/1471-2164-6-79 Occurrence Handle1:CAS:528:DC%2BD2MXltlyjsLg%3D

    Article  PubMed  CAS  Google Scholar 

  • B Scholkopf KK Sung CJC Burges F Girosi P Niyogi T Poggio V Vapnik (1997) ArticleTitleComparing support vector machines with Gaussian kernels to radial basis function classifiers IEEE Trans Sign Proc 45 2758–2765 Occurrence Handle10.1109/78.650102

    Article  Google Scholar 

  • J Sollner (2006) ArticleTitleSelection and combination of machine learning classifiers for prediction of linear B-cell epitopes on proteins J Mol Recogn 19 209–214 Occurrence Handle10.1002/jmr.770 Occurrence Handle1:CAS:528:DC%2BD28XlvVOmt7g%3D

    Article  CAS  Google Scholar 

  • J Sollner B Mayer (2006) ArticleTitleMachine learning approaches for prediction of linear B-cell epitopes on proteins J Mol Recogn 19 200–208 Occurrence Handle10.1002/jmr.771 Occurrence Handle1:CAS:528:DC%2BD28XlvVOmt7s%3D

    Article  CAS  Google Scholar 

  • XD Sun RB Huang (2006) ArticleTitlePrediction of protein structural classes using support vector machines Amino Acids 30 469–475 Occurrence Handle16622605 Occurrence Handle10.1007/s00726-005-0239-0 Occurrence Handle1:CAS:528:DC%2BD28Xls1ehu7c%3D

    Article  PubMed  CAS  Google Scholar 

  • V Vapnik (1998) Statistical learning theory Wiley-Interscience New York

    Google Scholar 

  • Wen Z, Li M, Li Y, Guo Y, Wang K (2007) Delaunay triangulation with partial least squares projection to latent structures: a model for G-protein coupled receptors classification and fast structure recognition. Amino Acids (in press) (DOI: 10.1007/s00726-006-0341-y)

  • X Xiao S Shao Y Ding Z Huang Y Huang KC Chou (2005) ArticleTitleUsing complexity measure factor to predict protein subcellular location Amino Acids 28 57–61 Occurrence Handle15611847 Occurrence Handle10.1007/s00726-004-0148-7 Occurrence Handle1:CAS:528:DC%2BD2MXhsVKqsro%3D

    Article  PubMed  CAS  Google Scholar 

  • CT Zhang KC Chou (1993) ArticleTitleAn alternate-subsite-coupled model for predicting HIV protease cleavage sites in proteins Protein Eng 7 65–73 Occurrence Handle10.1093/protein/7.1.65

    Article  Google Scholar 

  • SW Zhang Q Pan HC Zhang ZC Shao JY Shi (2006) ArticleTitlePrediction protein homo-oligomer types by pseudo amino acid composition: approached with an improved feature extraction and naive Bayes feature fusion Amino Acids 30 461–468 Occurrence Handle16773245 Occurrence Handle10.1007/s00726-006-0263-8 Occurrence Handle1:CAS:528:DC%2BD28Xls1egsr0%3D

    Article  PubMed  CAS  Google Scholar 

  • GP Zhou (1998) ArticleTitleAn intriguing controversy over protein structural class prediction J Protein Chem 17 729–738 Occurrence Handle9988519 Occurrence Handle10.1023/A:1020713915365 Occurrence Handle1:CAS:528:DyaK1MXnslaltw%3D%3D

    Article  PubMed  CAS  Google Scholar 

  • GP Zhou K Doctor (2003) ArticleTitleSubcellular location prediction of apoptosis proteins Proteins Struct Funct Genet 50 44–48 Occurrence Handle12471598 Occurrence Handle10.1002/prot.10251 Occurrence Handle1:CAS:528:DC%2BD3sXlsVKmug%3D%3D

    Article  PubMed  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Electronic Supplementary Material

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, J., Liu, H., Yang, J. et al. Prediction of linear B-cell epitopes using amino acid pair antigenicity scale. Amino Acids 33, 423–428 (2007). https://doi.org/10.1007/s00726-006-0485-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00726-006-0485-9

Navigation