Introduction

Nipah virus (NiV) was isolated in 1999 and was identified as the etiological agent responsible for an outbreak of severe respiratory disease and fatal encephalitis in Malaysia and Singapore in pigs and humans (Chua et al. 1999). During the first NiV outbreak, the virus infected both pigs and humans, in addition to a small number of cats, dogs and horses (Chua et al. 2000; Epstein et al. 2006). NiV, a member of the family Paramyxoviridae, possesses a negative-sense, non-segmented RNA genome that is 18,246 nt (Malaysian isolate) or 18,252 nt (Bangladesh isolate) in length (Harcourt et al. 2005). It has six transcription units that encode six structural proteins, the nucleocapsid (N), phosphoprotein (P), matrix protein (M), fusion protein (F), glycoprotein (G) and polymerase (L). Similar to other paramyxoviruses, the P gene of NiV expresses four proteins, namely P, V, W and C (Harcourt et al. 2000; Wang et al. 2001).

In Bangladesh, 135 probable or confirmed cases of NiV infection in humans were identified from 2001 through 2008; 98 (73 %) were fatal (Luby et al. 2009). Active Nipah virus encephalitis surveillance identified an encephalitis cluster and sporadic cases in Faridpur, Bangladesh, in January 2010. 16 case patients were identified, in which 14 of these patients died (Sazzad et al. 2013).

Vaccination is the most effective of all the medical interventions to save human and animal lives and to increase production (Horzinek 1999; Tang et al. 2012a). Compared to the conventional vaccines, peptide- or epitope-based vaccines are easy to produce, more specific, cost effective, less time consuming and also safe (Kumar et al. 2013). It is well established that T cells play a critical role in inducing cellular immune response against foreign antigens but they recognize antigenic fragments only when they are associated with major histocompatibility complex (MHC) molecules exposed on surface of all vertebrate cells (Shekhar et al. 2012; Mohabatkar and Mohammadzadegan 2007). Immunoinformatics approach uses computational algorithms to predict potential vaccine candidates or T-cell epitopes. The advantage of a peptide- or epitope-based vaccine is the ability to deliver high doses of the potential immunogen and at a low cost (Von Hoff et al. 2005; Tang et al. 2012b). Viral protein which could act as a vaccine candidate must be surface-exposed, antigenic and responsible for pathogenicity (Cerdino-Tarraga et al. 2003; Verma et al. 2011).

Materials and methods

The amino acid sequence of Nucleocapsid, phosphoprotein, matrix, fusion, glycoprotein, L protein, W protein, V protein and C protein was retrieved from the protein sequence database of NCBI (http://www.ncbi.nlm.nih.gov/protein) and their accession number is shown in Table 1.

Table 1 Scores generated by ProPred for MHC class I

Prediction of MHC class I binding peptides

The prediction of promiscuous MHC class I binding peptides was done using a popular immunoinformatic tool ProPred I (Singh and Raghava 2001). It is an online web tool which uses matrix-based method that allows the prediction of MHC-binding sites in an antigenic sequence for MHC class I alleles. It also allows the prediction of the standard proteasome and immunoproteasome cleavage sites in an antigenic sequence. The simultaneous prediction of MHC binders and proteasome cleavage sites in an antigenic sequence leads to the identification of potential T-cell epitopes.

Structure-based modeling of T-cell epitopes

The PEPstr (peptide tertiary structure prediction server) server (Kaur et al. 2007) predicts the tertiary structure of small peptides with sequence length varying between 7 and 25 residues. The prediction strategy is based on the realization that β-turn is an important and consistent feature of small peptides in addition to regular structures. Thus, the methods use both the regular secondary structure information predicted from PSIPRED and β-turns information predicted from BetaTurns. The side-chain angles are placed using standard backbone-dependent rotamer library. The structure is further refined with energy minimization and molecular dynamic simulations using Amber version 6.

Modeling and validation of MHC I alleles

The IMGT/HLA database (http://www.ebi.ac.uk/ipd/imgt/hla/intro.html) (Robinson et al. 2013) currently contains 10,103 allele sequences. In addition to the physical sequences, the database contains detailed information concerning the material from which the sequence was derived and data on the validation of the sequences. The IMGT/HLA database allows you to retrieve information upon a specific HLA allele (http://www.ebi.ac.uk/ipd/imgt/hla/allele.html) as named in the WHO Nomenclature Committee Reports. 3D structures of alleles were retrieved from IMGT/HLA database. Some of the allele’s structures, which are not presented in IMGT/HLA database, were modeled with the help of MODELLER 9.10. The stereochemical qualities of the alleles were checked by PROCHECK (Laskowski et al. 1993).

Molecular docking

Docking of peptides and alleles structure was carried out using AutoDock 4.2 (Goodsell and Olson 1990; Morris et al. 1998). Gasteiger charges were added to the ligand and maximum six numbers of active torsion are given to the lead compound using AutoDock tool (http://autodock.scripps.edu/resources/adt). Kollaman charges and solvation term were added to the protein structure using AutoDock tool. The Grid for docking calculation was centered to cover the protein-binding site residues and accommodate ligand to move freely. During the docking procedure, a Lamarckian genetic algorithm (LGA) was used for flexible ligand rigid protein docking calculation. Docking parameters were as follows: 30 docking trials, population size of 150, maximum number of energy evaluation ranges of 250,000, maximum number of generations is 27,000, mutation rate of 0.02, cross-over rate of 0.8, other docking parameters were set to the software’s default values.

Molecular dynamics simulation of epitope and HLA allele complex

Molecular dynamics simulation was done using the NAMD graphical interface module (James et al. 2005) incorporated visual molecular dynamics (VMD 1.9.2) (Humphrey et al. 1996). A protein structure file (psf) stores structural information of the protein, such as various types of bonding interactions. The psf was created from the initial pdb and topology files. The psfgen package of VMD is used to create this. To create a psf, we will first make a pgn file, which will be the target of psfgen. After running psfgen, two new files were generated protein pdb and protein psf and by accessing PSF and PDB files; NAMD generated the trajectory DCD file. Root mean square deviation (RMSD) of the complex was completed using rmsd tcl source file from the Tk console and finally rmsd dat was saved and accessed in Microsoft office excel 2007.

Results and discussion

Prediction and analysis of MHC class I binding peptides

The Nucleocapsid peptide VPATNSPEL at position 38–46 showed ProPred score of 1200 with HLA-B7 MHCI allele. The V protein peptide NPTAVPFTL at position 186–194 showed ProPred score of 2000 with the HLA-B*2705 allele. And the fusion protein peptide LLFVFGPNL at position 209–217 showed ProPred score of 2678.131 with the HLA-A2 allele. ProPred scores of peptides with MHC I alleles are shown in Table 1.

Docking energy determination by AutoDock

3-D coordinate files of allele were obtained through IMGT/HLA database or model through MODELLER (Table 2) were validated using PROCHECK tool. After that, binding simulation studies show that nucleocapsid epitope VPATNSPEL with HLA-B7 allele, V protein epitope NPTAVPFTL with HLA-B*2705 allele as well as fusion epitope LLFVFGPNL with HLA-A2 allele formed stable HLA–peptide complexes with the energy minimization values of −5.07, −3.13 and −3.11 kcal/mol, respectively (Table 3). After docking studies, we determined the number of H bonds present in the stable complex formed.

Table 2 List of class I MHC alleles considered in this study for prediction of binding peptides
Table 3 Docking result of epitopes with allele structures

Using AutoDock, it was found that three H-bonds were present in peptide VPATNSPEL-HLA-B7 allele complex as shown in (Fig. 1), first H-bond formed between allele residue ASP30:O with epitope amino acid THR4:OG1, second H-bond formed between allele residue ARG48:NH2 with epitope amino acid GLU8:OE1 and third H-bond formed between allele residue TYR27:OH with epitope amino acid ALA3:O, whereas one H-bond was present in V protein peptide NPTAVPFTL - HLA-B*2705 allele complex (Fig. 2) via residue ASP29:O with epitope amino acid ASN1:HN1. Fusion protein peptide LLFVFGPNL - HLA-A2 allele complex depicting position of amino acids along with formation of one H-bond with allele residue ASP122ASP122:OD2 with epitope amino acid LEU1:N (Fig. 3).

Fig. 1
figure 1

Docked nucleocapsid protein peptide VPATNSPEL-HLA-B7 allele complex depicting position of amino acids along with the formation of 3 H bonds with ASP30, TYR27 and ARG48

Fig. 2
figure 2

Docked V protein peptide NPTAVPFTL–HLA-B*2705 allele complex depicting position of amino acids along with the formation of 1 H bond with ASP29

Fig. 3
figure 3

Docked fusion protein peptide LLFVFGPNL–HLA-A2 allele complex depicting position of amino acids along with the formation of 1 H bond with ASP122

Molecular dynamics simulation of peptide–allele complex through NAMD

The peptide–allele complexes formed by AutoDock were subjected to molecular dynamics simulation and RMSD. Nucleocapsid epitope VPATNSPEL-HLA-B7 allele complex displayed the highest peak at RMSD value of 1.16 Å (Fig. 4). V protein peptide NPTAVPFTL-HLA-B*2705 allele complex resulted in highest peak at RMSD value of 0.46 Å (Fig. 5). And epitope LLFVFGPNL-HLA-A2 allele complex resulted in highest peak at RMSD value of 0.47 Å (Fig. 6).

Fig. 4
figure 4

Graph displaying molecular dynamic simulation of Nucleocapsid peptide–allele complex, resulted in highest peak at 1.16 Å

Fig. 5
figure 5

Graph displaying molecular dynamic simulation of V protein peptide–allele complex, resulted in highest peak at 0.46 Å

Fig. 6
figure 6

Graph displaying molecular dynamic simulation of fusion protein peptide–allele complex, resulted in highest peak at 0.47 Å

Nipah virus (NiV) was associated with highly lethal febrile encephalitis in humans and a predominantly respiratory disease in pigs. Periodic deadly outbreaks, documentation of person-to-person transmission, and the potential of this virus as an agent of agroterror reinforce the need for effective means of therapy and prevention (Walpita et al. 2011). The current study incorporates immunoinformatics approach for reducing the time consumed in the long array of experiments to avoid hit and trial sets. Walpita et al. (2011) describe the vaccine potential of NiV proteins glycoprotein, fusion and matrix. Sakib et al. (2014) designed epitope-based peptides for the utility of vaccine development by targeting glycoprotein G and envelope protein F of Nipah virus (NiV) that, respectively, facilitates attachment and fusion of NiV with host cells. AutoDock resulted in good binding affinity along with H bonds at default parameters. The molecular dynamics simulation showed that complex formed between a peptide and allele was attaining proper stability by creating a parallelism in RMSD over a time window. The mentioned peptides can be either isolated or formulated for further in vitro and in vivo testings.

Conclusion

The conclusion drawn from the present study is that the three epitopes VPATNSPEL, NPTAVPFTL and LLFVFGPNL of nucleocapsid, V protein and fusion protein, respectively, have considerable binding with HLA-B7, HLA-B*2705 and HLA-A2MHC class I allele and low-energy minimization values providing stability to the peptide–MHC complex. These peptide constructs may further be undergone wet laboratory studies for the development of targeted vaccine against Nipah virus.