Building MHC Class II Epitope Predictor Using Machine Learning Approaches

Protocol
Part of the Methods in Molecular Biology book series (MIMB, volume 1268)

Abstract

Identification of T-cell epitopes binding to MHC class II molecules is an important step in epitope-based vaccine development. This process has since been accelerated with the use of bioinformatics tools to aid in the prediction of peptide binding to MHC class II molecules and also to systematically scan for candidate peptides in antigenic proteins. There have been many prediction software developed over the years using various methods and algorithms and they are becoming increasingly sophisticated. Here, we illustrate the use of machine learning algorithms to train on MHC class II peptide data represented by feature vectors describing their amino acid physicochemical properties. The developed prediction model can then be used to predict new peptide data.

Key words

MHC Antigens/peptides/epitopes CTD Machine learning 

References

  1. 1.
    Lafuente EM, Reche PA (2009) Prediction of MHC-peptide binding: a systematic and comprehensive overview. Curr Pharm Des 15(28):3209–3220PubMedCrossRefGoogle Scholar
  2. 2.
    Tong JC, Tan TW, Ranganathan S (2007) Methods and protocols for prediction of immunogenic epitopes. Brief Bioinform 8(2):96–108PubMedCrossRefGoogle Scholar
  3. 3.
    Wang P, Sidney J, Dow C et al (2008) A systematic assessment of MHC class II peptide binding predictions and evaluation of a consensus approach. PLoS Comput Biol 4(4):e1000048PubMedCentralPubMedCrossRefGoogle Scholar
  4. 4.
    Lata S, Bhasin M, Raghava GP (2009) MHCBN 4.0: a database of MHC/TAP binding peptides and T-cell epitopes. BMC Res Notes 2:61PubMedCentralPubMedCrossRefGoogle Scholar
  5. 5.
    Vita R, Zarebski L, Greenbaum JA et al (2010) The immune epitope database 2.0. Nucleic Acids Res 38(Database issue):D854–D862PubMedCentralPubMedCrossRefGoogle Scholar
  6. 6.
    Hall M, Frank E, Holmes G et al (2009) The WEKA data mining software: an update. ACM SIGKDD Explorations Newslett 11(1):10–18CrossRefGoogle Scholar
  7. 7.
    Dubchak I, Muchnik I, Mayor C et al (1999) Recognition of a protein fold in the context of the Structural Classification of Proteins (SCOP) classification. Proteins 35(4):401–407PubMedCrossRefGoogle Scholar
  8. 8.
    El-Manzalawy Y, Dobbs D, Honavar V (2008) On evaluating MHC-II binding peptide prediction methods. PLoS One 3(9):e3268PubMedCentralPubMedCrossRefGoogle Scholar
  9. 9.
    Li ZR, Lin HH, Han LY et al (2006) PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence. Nucleic Acids Res 34(Web Server issue):W32–W37PubMedCentralPubMedCrossRefGoogle Scholar
  10. 10.
    Tomii K, Kanehisa M (1996) Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. Protein Eng 9(1):27–36PubMedCrossRefGoogle Scholar
  11. 11.
    Cui J, Han LY, Lin HH et al (2007) Prediction of MHC-binding peptides of flexible lengths from sequence-derived structural and physicochemical properties. Mol Immunol 44(5):866–877PubMedCrossRefGoogle Scholar
  12. 12.
    Gowthaman U, Agrewala JN (2008) In silico tools for predicting peptides binding to HLA-class II molecules: more confusion than conclusion. J Proteome Res 7(1):154–163PubMedCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2015

Authors and Affiliations

  • Loan Ping Eng
    • 1
  • Tin Wee Tan
    • 1
  • Joo Chuan Tong
    • 1
    • 2
  1. 1.Department of BiochemistryNational University of SingaporeSingaporeSingapore
  2. 2.Institute of High Performance ComputingSingaporeSingapore

Personalised recommendations