Abstract
Malaria disease is caused by the transmission of Plasmodium, through the bite of a female A nopheles mosquito. Although the Plasmodium life-cycle has been extensively characterized, relatively little is known about sporozoite interaction with host organelles and proteins. Individuals that survive continuous exposure to infection do eventually develop clinical immunity, suggesting that a vaccine against asexual blood stage of the parasite is achievable. The merozoite surface protein (MSP119) of Plasmodium yoelii was considered as the target protein for epitope prediction using the computational approaches. The T-cell and B-cell epitopes for MSP119 were predicted using a variety of computational tools. Out of these predicted epitopes, the epitopes being expressed by the protozoa were identified. The 3D structures of T-cell epitopes (MHC-I and MHC-II) were modeled by homology modeling method followed by validation using the SAVES server. Further, the MHC molecules were identified and their 3D structures were retrieved from the Protein Data Bank. The protein–protein docking of modeled epitopes with respective MHC molecules were also carried out. Total Six T-cell epitopes (‘ELSEHYYDRY’, ‘LLIITIVFNI’, ‘MMYHIYKLK’, ‘IYQAMYNVIF’, ‘SEEDMPADDF’, ‘YVLLQNSTI’) for MSP119 have been identified as promising vaccine candidates. Furthermore, six B-cell epitopes (‘QPTET’, ‘SEETE’, ‘SDKYNKKKP’, ‘KEKKKE’, ‘CKKNKA’, ‘THPDNT’) have also been identified as potential epitopes. In future, these predicted epitopes might be exploited in vaccine development against malarial infection.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Malaria, the most widespread disease throughout the globe is also caused by transmission of Plasmodium yeolii which is a rodent strain of malarial parasite, i.e., Plasmodium (Chauhan 1996). The malarial parasite maturation occurs within the erythrocyte (Orengo et al. 2008). In the final phase, a large plasmodial protein becomes a significant surface protein of merozoite which is synthesized in all malarial species (Schwartz et al. 2012). Studies on P. yoelii has shown that protein is found in a processed form on the merozoite surface, as a result of proteolytic cleavage of the large precursor molecule, hence called Merozoite surface protein (MSP119) (Holder et al. 1992; Blair and Carucci 2005). This disease is spread across the globe causing hundreds of millions of clinical infections and at least a million deaths per annum. An effective way to control and finally eradicate malaria is by developing cheap and effective vaccines (Brady et al. 2001; Giles 2005). Previous studies established a strong association between this population’s antibody responses to Merozoite surface protein-1 (antigens) against malarial parasite in humans (Woehlbier et al. 2010). Therefore, antigenic MSP119 protein can be exploited for the epitope prediction, and, consequently, vaccine development against malaria. Epitopes are the immunologically active regions of an immunogen that binds to the antigen-specific membrane receptor on lymphocytes to secret antibodies. The T-cells and B-cells recognize different epitopes on the same antigenic molecule.
Thus, epitopes against MSP119 protein could be predicted using the computational techniques (Yadav and Rana 2011; Yadav and Mishra 2012) that might further become good candidate(s) for the vaccine development against malaria (Doolan et al. 2003). A HLA supertype allowed the identification of epitopes capable of binding multiple HLA molecules and offered the prospect of designing broadly reactive epitope based vaccines (Sette and Sidney 1998). An arsenal of eight major supertypes (HLA-A1, -A2, -A3/-A11, -A24, -B7, -B44, -DR, and -DRB) protected the coverage of more than 99 % of any human population, at the level of both MHC class-I and class-II molecules (Southwood et al. 1998). Over the past few years to investigate the potential candidates for vaccine development, epitope-based predictions have been enormously used (Sette and Peters 2007; Yadav et al. 2011; Purcell et al. 2007). The merozoite surface protein (MSP119) of P. yoelii is a rodent malarial immunogenic protein (Carlton et al. 2002). The aim of the present study was to predict T-cell and B-cell epitopes for MSP119, respectively, using the computational techniques, and to predict the 3D structures of the best predicted epitopes (small peptides) followed by protein–protein docking study with respective MHC alleles of Homo sapiens.
2 Materials and methods
2.1 T-cell and B-cell epitope prediction
The protein sequence of merozoite surface protein (MSP119) of P. yoelii was retrieved from the UniprotKB database (ID: P13828) (http://www.uniprot.org) (Magrane and UniProt Consortium 2011). A variety of tools/servers were used for T-cell epitope predictions which include: (1) IEDB analysis tool based on Artificial Neural Network (ANNs) (http://tools.immuneepitope.org) (Neilson et al. 2003). (2) ComPred based on combination of Artificial Neural Networks (ANNs) and Quantitative Matrices (QM) (http://www.imetech.res.in/reghav/nhlapred/comp.html) (Lata et al. 2007). (3) SMM based on linear programming (http://zlab.bu.edu/SMM/) (Peters et al. 2003). (4) SVMHC based on the support vector machine (SVM) (http://www.sbc.su.se/svmhc/new.cgi) (Doones and Elofsson 2002). (5) SYFPEITHI based on peptide binding motif (http://www.uni-tuebingen.de/uni/kxi/) (Rammensee 1999). The B-cell epitopes were also predicted using two methods, i.e., the Bepipred linear epitope prediction method (Larsen et al. 2006) and Emini surface accessibility prediction method (Emini et al. 1985) (http://tools.iedb.org/bcell/). For each method, top 5 scoring predicted epitopes were selected. The step-wise selection of supertypes and peptides are depicted in Fig. 1.
2.2 Population coverage and allergenicity prediction
For the analysis of population coverage for the MHC class-I and class-II alleles of the selected epitopes, IEDB-Population Coverage Calculation tool was used (Bui et al. 2006). To predict the allergenicity of antigenic MSP119 protein in advance stage, the web-based AllerHunter server was also used (Muh et al. 2009), so that risk of failure can be avoided.
2.3 Modeling and docking of predicted epitopes
The 3D structures of top scoring epitopes (small peptides) for MHC-I and MHC molecules were modeled and predicted using the Pepstr server (http://www.imitech.res.in/raghava/pepstr/) (Kaur et al. 2007). Subsequently, protein–protein docking was carried out for the top scoring epitopes with their respective human MHC receptors. The molecular docking algorithms based on the shape complementarity principles were used by PatchDock server (http://bioinfo3d.cs.tau.ac.il/Patchdock) (Duhovny et al. 1997). The 3D structures of the MHC alleles, i.e., HLA-A*01:01 (pdb id:1W72), HLA-A*02:01(pdb id: 1AKJ), HLA-A*03:01 (pdb id: 2XPG), HLA-A*24:02 (3VXN), HLA-B*44:02 (pdb id: 1SYS), and HLA-DRB1*0101 (pdb id: 3S5L) were retrieved from the Protein Data Bank (http://www.rcsb.org/pdb) (Jardetzky et al. 1994). To perform the protein–protein docking with these MHC alleles, the receptors were prepared by removing all HETATOMs and solvents from the pdb files. The 3D structures of epitopes modeled by Pepstr server were docked with their respective MHC allele (s) in 10 conformations (Berman 2008). Out of these 10 conformations, the conformation with the minimum-binding free energy score was selected as the most stable MHC-epitope complex (Vajda and Kozakov 2009). All the docked complexes were visualized in 3D space using the CHIMERA tool (Pettersen et al. 2004).
3 Results and discussion
3.1 Epitope prediction
In the present work, several potential epitopes have been predicted for the Merozoite surface protein (MSP119) in P. yoelii which has been reported as potential antigenic protein (Holder et al. 1992; Blair and Carucci 2005; Woehlbier et al. 2010). For the T-cell, five epitope prediction algorithms (methods), and for the B-cell, two prediction methods were employed, respectively (Table 1).
3.2 Population coverage and allergenicity prediction
The population coverage of the MHC alleles (class-I and II) of the predicted epitopes were also calculated (Fig. 2).
Different Plasmodium yeolii affected regions were selected for the evaluation of the population coverage of the proposed epitopes. It calculated that the 73.73 fraction of east-Asian, 50.60 fraction of Indian, 92.41 fraction of Europeans, and 82.51 fraction of north-American individuals were responded to epitopes with selected MHC restrictions. The sequence-based allergenicity prediction was also carried out using the AllerHunter tool, which predicted the query sequence as a non-allergen with score of 0.0. [(sensitivity) SE = 91.6 %, (specificity) SP = 87.1 %].
3.3 Modeling and docking of predicted epitopes
Subsequent to T-cell epitope (MHC-I and MHC-II) prediction, the 3D structures of high ranking epitope was modeled which are prerequisite for the protein–protein docking studies. The binding free energy scores (docked energy) with MHC class-I and class-II molecules, respectively, were calculated using the PatchDock server. The T-cell epitopes identified against different alleles of MHC class-I and class-II molecules along with their docking energies (KJ/mol) are shown in Table 1.
After docking the highest ranked epitopes with HLA-A*01:01 allele (PDB ID: 1W72), it was found that the epitope ‘ELSEHYYDRY’ showed least energy score (−430.50 kJ/mol) which shows highest binding affinity for the MHC class-I receptor (Fig. 3).
For MHC class-I allele HLA-A*02:0, the most potential epitopes were docked with the receptor (PDB ID: 1AKJ), and it was found that ‘LLIITLIVFNI’ epitope was possessing highest binding affinity towards the MHC-I receptor (−496.01 kJ/mol) (Fig. 4).
The 3D structure of ‘MMYHIYKLK’ epitope was docked with MHC class-I receptor (HLA-A*03:01 allele) (PDB ID: 2XPG) which showed the lowest docking energy of -579.41 kJ/mol. The lower docking energy reveals higher binding affinity for HLA-A*03:01 allele (Fig. 5).
The highest ranked epitope ‘IYQAMYNVIF’ for MHC class-I HLA-A*24:02 allele was docked with the receptor (PDB ID: 3VXN), and the docking energy was found to be −375.63 kJ/mol (Fig. 6).
Finally, the highest ranking epitope ‘SEEDMPADDF’ MHC class-I HLA-B*44:02 allele was docked with the receptor (PDB ID: 1SYS), and the energy score was found to be −388.56 kJ/mol, which shows highest binding affinity for the MHC-I receptor (Fig. 7).
The HLA-DRB1*0101 allele of MHC class-II is lone allele whose 3D structure was present in the PDB database. The epitopes for above allele were predicted using the SVMHC tool which implements SYFPEITHI algorithm. For HLA-DRB1*0101 allele, top four predicted epitopes were carefully chosen as being the most suitable for further analysis. The 3D structures of all these epitopes were predicted followed by receptor-peptide docking. Out of four predicted epitopes, ‘YVLLQNSTI’ epitope was having lowest docking energy score (−424.77 kJ/mol) as compared to others. The lower energy score reveals highest binding affinity for the MHC-II receptor (PDB ID: 3S5L). The docked complex of epitope ‘YVLLQNSTI’ with HLA-DRB1*0101 allele of human HMC Class-II receptor are depicted in Fig. 8.
The B-cell epitopes for the MSP119 were also predicted using two methods, i.e., Bepipred linear epitope prediction method and Emini surface accessibility prediction method, respectively.
The BepiPred predicts the location of linear B-cell epitopes using a combination of a hidden Markov model and a propensity scale method. The epitopes are ranked according to their peptide score (Table 2).
Using the Emini surface accessibility method, the B-cell epitopes for MSP119 were again predicted for the hydrophillic antigenic epitope (Table 3). In this method, the calculation was based on surface accessibility scale on a product instead of an addition within the window.
All the B-cell epitopes predicted by the Bepipred linear epitope prediction and Emini surface accessibility prediction methods were further analyzed. After comparing all the B-cell epitopes predicted from above two methods, it was found that the small peptides ‘QPTET’, ‘SEETE’, ‘SDKYNKKKP’, ‘KEKKKE’, ‘CKKNKA’, and ‘THPDNT’ were commonly predicted potential B-cell epitopes for MSP119 against P. yoelii (Table 4).
The linear B-cell epitopes typically vary 5–20 amino acids in length (Morris 2007). All the predicted B-cell epitopes possessing less than 5 amino acids were filtered out. In Table 3 (S. No. 4, 10), two epitopes are having exceptionally high number of amino acid residues, but they possess common epitope(s), such as ‘SDKYNKKKP’ and ‘THPDNT’, respectively, predicted by the Emini surface accessibility method. Since these two epitopes were commonly predicted by both methods, therefore they have also been considered as potential B-cell epitope.
All the top ranked predicted epitopes for MHC-I as well as MHC-II might be synthesized in wet laboratories, which can be exploited for vaccine development. Interestingly, all six commonly predicted epitopes for B-cells are also promising vaccine candidates against malarial infections.
4 Conclusion
The T-cell (MHC-I and MHC-II) and B-cell epitopes for MSP119 of P. yoelii were predicted using various tools/servers. After 3D structure prediction of top ranked predicted epitopes followed by receptor-peptide docking studies, it was found that the epitope ‘IYQAMYNVIF’ was possessing highest binding affinity for MHC-I receptor. On the other hand epitope, ‘YVLLQNSTI’ has shown highest binding affinity for MHC-II receptor. For the B-cell, six small peptides ‘QPTET’, ‘SEETE’, ‘SDKYNKKKP’, ‘KEKKKE’, ‘CKKNKA’, and ‘THPDNT’ were predicted as most promising epitope. In future, these predicted epitopes can be synthesized in wet laboratory and might be exploited as potential candidate(s) for the vaccine development against malaria.
References
Berman HM (2008) The protein data bank: a historical perspective. Acta Crystallogr A A64(1):88–95
Blair PL, Carucci DJ (2005) Functional proteome and expression analysis of sporozoite and hepatic stages of malaria development. Microbiol Immunol 295:417–438
Brady PC, Shimp LR, Miles PA, Whitmore M, Stowers WA (2001) High-Level production and purification of P30P2MSP119, an important vaccine antigen for malaria, expressed in the methylotropic yeast pichia pastoris. Protein Expr Purif 23:468–475
Bui HH, Sidney J, Dinh K, Southwood S, Newman MJ, Sette A (2006) Predicting population coverage of T-cell epitope-based diagnostics and vaccines. BMC Bioinformatics 7:153
Carlton JM, Angiuoli SV, Suh BB et al (2002) Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii. Nature 419(6906):512–519
Chauhan VS (1996) Progress towards malaria vaccine. Curr Sci 71(12):967–975
Doolan DL, Hoffman SL, Southwood S et al (2003) Degenerate cytotoxic T cell epitopes from P. falciparum restricted by multiple HLA-A and HLA-B supertype alleles. Immunity 7:97–112
Doones P, Elofsson A (2002) Prediction of MHC class-I binding peptides, using SVMHC. BMC Bioinform 3:25
Duhovny A, Lawrence CM, Cupo S, Zaller DM, Wiley DC (1997) X-ray crystal structure of HLA-DR4 (DRAp0101, DRB1p0401) complexed with a peptide from human collagen II. Immunity 7:473–481
Emini EA, Hughes JV, Perlow DS, Boger J (1985) Induction of hepatitis a virus-neutralizing antibody by a virus-specific synthetic peptide. J Virol 55(3):836–839
Giles C (2005) why do not we have a malaria vaccine? (http://www.malariavaccine.org/malvac-vaccine-faqs.php)
Holder A, Blackman JA, Burghaus M, Chappe AJ, Ling TI, Shai S (1992) A malarial merozoite surface protein-structure, processing and function. Rio Jan 87:37–42
Jardetzky TS, Brown JH, Gorga JC, Stern LJ (1994) Three-dimensional structure of a human major histocompatibility molecule complexed with superantigen. Nature 368:711–718
Kaur H, Garg A, Raghava GPS (2007) PEPstr: a de novo method for tertiary structure prediction of small bioactive peptides. Protein Pept Lett 14:626–630
Larsen JE, Lund O, Nielsen M (2006) Improved method for predicting linear B-cell epitopes. Immunome Res 2:2
Lata S, Bhasin M, Raghava GPS (2007) Application of machine learning technique in predicting MHC binders. Methods Mol Biol 409:201–215
Magrane M and UniProt Consortium (2011) Uniprot knowledgebase: a hub of integrated protein data. Database 10:1093–1102
Morris GE (2007) Epitope mapping: B cell epitopes. Encyclopedia of life sciences, Wiley, pp 1–3. doi:10.1002/9780470015902.a0002624.pub2. http://www.els.net
Muh HC, Tong JC, Tammi MT (2009) AllerHunter: a SVM-pairwise system for assessment of allergenicity and allergic cross-reactivity in proteins. PLoS One 4(6):e5861
Neilson AK, Labaied M, Kappe SH, Matuschewski K (2003) The immune epitope database: a historical retrospective of the first decade. Immunology 433:164–167
Orengo J, Wong KC, Ocaña-Morgner C, Rodriguez A (2008) A Plasmodium yoelii soluble factor inhibits the phenotypic maturation of dendritic cells. Malar J 7:254
Peters B, Tong W, Sidney J, Sette A (2003) Examining the independent binding assumption for binding for peptide epitopes to MHC-I molecule. Bioinformatics 19:1765–1772
Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE (2004) UCSF chimera–a visualization system for exploratory research and analysis. J Comput Chem 25(13):1605–1612
Purcell AW, McCluskey J, Rossjohn J (2007) More than one reason to rethink the use of peptides in vaccine design. Nat Rev Drug Discov 6(5):404–414
Rammensee H (1999) SYFPEITHI: database for MHC ligands and peptide motifs. Immunogenetics 50(3–4):213–219
Schwartz L, Brown GV, Genton V, Moorthy VS (2012) A review of malaria vaccine clinical projects based on the WHO rainbow table. Malar J 11:11
Sette A, Peters B (2007) Immune epitope mapping in the post-genomic era: lessons for vaccine development. Curr Opin Immunol 19:106–110
Sette A, Sidney J (1998) HLA supertypes and supermotifs: a functional perspective on HLA polymorphism. Curr Opin Immunol 10:478–482
Southwood S, Sidney J, Kondo A, del Guercio MF, Appella S, Hoffman RT, Kubo RW, Chesnut HM, Grey A (1998) Several common HLA-DR types share largely overlapping peptide binding repertoires. J Immunol 160:3363–3373
Vajda S, Kozakov D (2009) Convergence and combination of methods in protein–protein docking. Curr Opin Struct Biol 19(2):164–170
Woehlbier U, Epp C, Hackett F, Blackman MJ, Bujard H (2010) Antibodies against multiple merozoite surface antigens of the human malaria parasite Plasmodium falciparum inhibit parasite maturation and red blood cell invasion. Malar J 9:77
Yadav PK, Mishra M (2012) Computational epitope prediction and docking studies of glycoprotein-G in Nipah virus. Int J Bioinform Biolog Sci 1:55–56
Yadav PK, Rana J (2011) Computer aided epitope prediction for glycoprotein-B in human cytomegalovirus. Elix Bio Phys 39:5021–5025
Yadav PK, Singh R, Jain PA, Singh S, Gautam B, Farmer R (2011) In silico epitope prediction for glycoprotein D in human herpese simplex virus-1. Int J Pharm Sci Rev Res 7(2):148–153
Acknowledgments
Authors wish to acknowledge the Department of Computational Biology and Bioinformatics, Sam Higginbottom Institute of Agriculture, Technology and Sciences (Deemed University), Allahabad, India, for providing the facilities to conduct this research work.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Authors declare that there is no conflict of interest.
Rights and permissions
About this article
Cite this article
Dhusia, K., Kesarwani, P. & Yadav, P.K. Epitope prediction for MSP119 protein in Plasmodium yeolii using computational approaches. Netw Model Anal Health Inform Bioinforma 5, 19 (2016). https://doi.org/10.1007/s13721-016-0127-4
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13721-016-0127-4