Introduction

The current global coronavirus pandemic has now affected more than 213 countries and territories across the globe (Countries where Coronavirus has spread-Worldometer 2021). SARS-CoV-2 is said to have originated in Wuhan, China in December 2019 and gradually spread across the world affecting more than 24 million people (WHO Coronavirus Disease (COVID-19) 2021). The past coronavirus infections namely SARS (Severe Acute Respiratory Syndrome) and MERS (Middle Eastern Respiratory Syndrome) affected many parts of the world. This called for scientists to find various measures of treatments for viral diseases, even though no effective drug or vaccine accounts for full recovery in all patients (Lai et al. 2020). In the current study, some antimicrobial peptides (AMPs) are suggested with proper In-Silico studies as effective therapeutic agents against the SARS and SARS-CoV-2 infection. There are various classes of AMPs out of which the Human defensins which are produced and secreted by different mucosal and epithelial cells show antimicrobial properties against viral diseases (Amerikova et al. 2019). AMPs are now widely being studied as an alternative to antibiotics for rapid response. It is one of the first pathways to get upregulated during the defence action (Ahmed et al. 2019). The role of AMP with influenza A has also been extensively studied (Hsieh and Hartshorn 2016). The study of structural and non-structural proteins is important to understand how these viruses interact with the human body. In this paper, we targeted the structural nucleocapsid protein of SARS and SARS-CoV-2. The protein, coded by the 9th ORF of SARS, is helical in shape and plays a major role in pathogenesis, RNA packaging, and replication of the virus. It is a 46 kDa protein with 422 amino acids (Surjit and Lal 2008). It induces an antigenic and immunogenetic response in the human body (Peng et al. 2006). Along with the envelope M protein, the nucleocapsid protein aids in genomic condensation and packaging. It is also expressed in abundance in case of infection thus making it a crucial target to study (Surjit and Lal 2008; Verheije et al. 2010; Ahlquist et al. 2003). We chose nucleocapsid (N gene) over other SARS structural proteins because it is more stable and conserved. It has the fewer mutation and over 90% amino acid homology. In general, the nucleocapsid protein/ gene of coronavirus families are expressed in abundance and highly immunogenic when an infection occurs (Chang et al. 2014; Chen et al. 2020; Pang et al. 2020; Drosten et al. 2003; Marra et al. 2003). The nucleocapsid protein also shows an antigenic response for the T-cell receptors in terms of proliferation and cytotoxic activity in term of vaccine (Grifoni 2020; Cong, et al. 2019; Dutta et al. 2008; Kang, et al. 2020; Lee et al. 2010). There are around 186 AMPs which are antiviral in nature and a total of more than 3000 AMPs present in the antimicrobial peptide database (APD3) (Elnagdy and AlKhazindar 2020). Viral entry into the cell is blocked by human defensin 5 (HD5), which is a potential blocker for SARS-Cov-2 (Maiti 2020).In many kinds of infections by various pathogens, AMPs are one of the first immune pathways which are upregulated in vivo. Of these various classes, the two best AMPs studied till now are the defensins and human cathelicidin LL-37. Antimicrobial peptides like defensins have shown response against blocking cell receptor by binding to it in the case of HIV, inhibiting the protein kinase C signalling pathway in case of HSV, Transferrin has shown their activity by inhibiting cellular trafficking in case of Hantavirus, Rotaviruses etc. (Ahmed et al. 2019; Agier et al. 2015). Although AMPs are usually studied against bacterial infections, we aim to find AMPs for viruses as well for a specific target site namely the nucleocapsid protein of SARS and SARS-CoV-2. Studies have shown that antimicrobial peptides usually affect viral entry into a cell, although its effects have been seen at other sites as well. They generate an innate immune response which is crucial to keeping infections at bay. Such studies were conducted using AMPs in response to the dengue virus using the human neutrophils and further examination by qRT-PCR (Ahmed et al. 2019; Castañeda-Sánchez et al. 2016).The activity of AMPs is decided by their amino acid composition, net charge, size, hydrophobicity and hydrophilicity etc. which have been considered in our study as well which are important from the structural and conformational point of view. It is desired for the selected AMP to be stable (Cunha et al. 2017). Our work focuses on the binding efficacy of AMPs which hinders the activity of nucleocapsid protein of the viruses. Using the docking and dynamic simulation techniques, we aim to find the two most suitable and probable AMPs which will best block the pathway for the viruses.

Materials and methodology

Retrieval of the viral protein sequences (Nucleocapsid) of SARS and SARS-COV-2-CoV

The amino acid sequences of viral nucleocapsid strains (SARS, SARS-CoV-2) were retrieved from UniProtserver (Bateman 2019) (Scheme 1). Their PDB structures were selected from the RCSB PDB database (Burley et al. 2019).

Scheme 1
scheme 1

Schematic flowchart depicting overall methodology of the workflow

Database screening of antimicrobial peptides

A cluster of AMP's was retrieved from the APD3 server (Antimicrobial Peptide Database) (Wang et al. 2016). The screening of the database was made based on certain criteria such as –length of the peptide should be between 20 to 55 AA(The length of peptide correlated with inhibitory action. The longer the length of peptide, better antiviral activity has been observed as compared to sequences of shorter length (Du et al. 2017), an abundance of basic residue in the peptide, the net charge of the peptide should be more than or equal to zero, as the viral membrane is negatively charged, non-toxicity concerning mammalian cells and interfacial activity of peptide including-

  1. a)

    Wimley-White interfacial hydrophobicity scale (WWIHS > 0)

It is an experimentally calculated free energy scale which helps to determine the propensity of individual amino acids in peptide sequences to partition from water into a phosphatidylcholine interface.

  1. b)

    Interfacial helical hydrophobic moment (iHHM > 0)

It describes the degree to which a peptide can be segregated into hydrophilic and hydrophobic phases when folded in an alpha- helix.

A peptide with a large iHHM can interact strongly with membranes as a helix due to partitioning–folding coupling even with a WWIHS score that is not positive overall (Galdiero et al. 2015).

Structure prediction of the selected AMPs

The tertiary structure of the selected AMPs was generated by MODELLER and PEPFOLD server depending upon peptide length (Šali and Blundell 1993; Camproux et al. 2004). If the peptide lengths greater than 30AA, MODELLER was used and for peptide, less than 30AA in length, a PEPFOLD peptide structure prediction server was employed.

The generated models were then evaluated based on the Ramachandran Plot generated by ProCheck (Laskowski et al. 1993).

Docking simulation studies

To check the binding affinity between the selected AMP’s and the nucleocapsid protein receptor of SARS and SARS-COV-2-CoV (2CJR and 6M3M respectively), docking simulation was performed using the ClusPro v2.0 server. This server assesses massive datasets of putative complexes and gives a pre-set number of favourable docked poses.

For peptide-protein docking, the selected AMPs were set as ligands and docked with both receptors 2CJR and 6M3M. This server rotates the ligand with 70,000 rotations and for each rotation, it translates the ligand in x, y, and z axis relative to the receptor on a grid (Vajda et al. 2016).

Further, Autodock Vina (Trott et al. 2009) was used for docking simulation analysis on the best-docked poses that were obtained from the ClusPro v2.0 server to obtain the accurate interactions between the receptor nucleocapsid proteins and ligand anti-microbial peptides.

Molecular dynamic simulation

Molecular dynamic simulation and energy minimization was done for the AMPs and the nucleocapsid receptor complex using GROMACS. All simulated structures were centered typically in a dodecahedron box with a minimum distance of 2.0 nm between each atom of the protein and the box to reduce and the SPC216 water model was used to solvate the system (Abraham et al. 2015). The solvated protein was subsequently neutralized by Clions, where 58.21 ions were used for SARS and SARS—2, respectively. Covalent bonds involving hydrogen atoms were constrained using the LINCS algorithm, and long-range electrostatic interactions were treated with the Particle-Mesh Ewald (PME) method employing a real-space cutoff of 10 Å. Energy minimization was executed by the steepest descent method and the conjugate gradient method for the subsequent 50,000 steps (NIH Lincs Program 2021). The system was initially temporarily minimized with backbone atoms confined to the previous coordinates to remove close contacts, and the restrained system was gradually heated to 300 K under constant volume conditions at 100 ps. Each system was equilibrated for 50 ns using the constant isothermal-isobaric (NPT) ensemble at 1 bar and 300 K without any restraints. The Parrinello-Rahman barostat and a V-rescale thermostat were used with an integration time step of 2 fs. Lennard–Jones interactions were calculated using a cutoff of 1 Å. The pair lists were updated every 400 steps. All simulations were performed using GROMACS 2019.1 along with the OPLS– AA force field. To assess the stability of the selected model, a 50 ns MD simulation was performed using the GROMACS software version 5.0. Before the MD simulation, it must be ensured that the system has no steric clashes or inappropriate geometry. The structure is relaxed through a process called energy minimization. Energy minimization of the system is an important step to relax the structure. The complex system energy minimization was done for 100 ps till the force reaches < 1000 kJ/mol. Equilibrating the solvent and ions around the protein is also an important step before the dynamic simulation calculation.

Results and discussion

Filtration of antimicrobial peptides and modelling

Based upon the criteria for database screening, a total of 37 peptides were filtered out of which, ultimately ten antimicrobial peptides were selected based on the iHHM score. The AMPs were modelled using the PEPFOLD server and MODELLER. The validity of each of those models was checked using the Ramachandran plot generated by the Procheck program. The shortlisted AMPs consisted of the majority of residues in the favoured region, with the peptide ATCYCRTGRCATRESLSGVCEISGRLYRLCCR (Human defensin-5) possessing the highest favourability percentage. The Ramachandran plots can be seen in Fig. 1 of supplementary material.

Fig. 1
figure 1

Molecular docking simulation using ClusPro v2.0 A SARS nucleocapsid strain docked with AP00180 B SARS nucleocapsid strain docked with AP00223 C SARS- CoV- 2 nucleocapsid strain docked with AP01061 D SARS- CoV-2 nucleocapsid strain docked with AP00549

Docking simulation and evaluation of the best complexes

Molecular docking simulation between nucleocapsid protein of SARS (PDB ID:2CJR) and SARS-CoV-2 (PDB ID: 6M3M) and the ten antimicrobial peptides with the help of the ClusPro server was implemented. For SARS, the top two least energy weighted scores were that of AP00180 and AP00223 and for SARS-COV-2, the best-weighted scores were that of AP01061 as well as AP00549 (Table 1). Further docking was implemented on Autodock Vina, where the poses obtained from the ClusPro v2.0 server analysed for their receptor binding domain as well as the interacting bonds between the receptor and the ligand.

Table 1 Lowest energy weighed score of docked complexes using ClusPro

Peptide-protein complex interaction studies

After the selection of the top two lowest weighted score AMPs, the docked structures were analysed for the type of interaction that exists between the nucleocapsid protein receptor and the antimicrobial protein–ligand. The table provides information regarding the specific interaction between the peptide-protein complex as well as the ΔG score between the interacting chains of nucleocapsid and AMP. Interacting bonds that were observed between the receptors and their specific ligands were hydrogen bonds, alkyl bond, salt bridges and pi-sigma bonds. For SARS, human defensin 5 (AP00180) chains B, C and D were in close interaction with the chains E and F of the nucleocapsid protein. The chains A and B of RatNP-2 rat defensin (AP00223) were in close interaction with A, B, C, E and F chains of SARS nucleocapsid. In both cases, conventional hydrogen bonds were in the predominant number. Circulin D (AP01061) was interacting with the SARS-COV-2 nucleocapsid chains A as well as C and varying bond types between individual amino acids. And lastly, Plectasin (AP00549) was interacting with the chains A and C of SARS-COV-2 nucleocapsid and classical hydrogen bonds being in the highest number as seen in Fig. 1. The interaction between the receptor amino acid molecules and ligand AMP molecules is shown in Table 2 of supplementary material along with their respective bond category.

Table 2 Shortlisted antimicrobial peptides and their Ramachandran Plot statistics

Molecular docking simulation using Autodock Vina provided the amino acids of nucleocapsid proteins of SARS and SARS-CoV-2 which were directly interacting with the amino acids of their respective antimicrobial peptides. A total of 6 electrostatic bonds, 11 conventional hydrogen bonds and 8 carbon-hydrogen bonds were found to be interacting between SARS nucleocapsid and AP00180. For SARS and AP00223, 2 electrostatic bonds, 6 conventional hydrogen bonds and 2 carbon-hydrogen bonds were observed. For SARS-CoV-2 and AP00549, 6 conventional hydrogen bonds and 3 hydrophobic bonds were observed and finally, for SARS-CoV-2 and AP01061, 4 conventional hydrogen bonds and 1 hydrophobic bond were observed.

As per Table 3 we can observe that the binding affinity score of SARS-CoV with AP00549 and AP01061 and of SARS-CoV-2 with AP00180 and AP00223 are favourable. It was decided to proceed with only those binding affinity scores which showed high favourability in both-ClusPro as well as Autodock Vina.

Table 3 Grid-box dimensions and binding affinity values of Autodock Vina

Autodock Vina employs Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm that uses local optimization methods for identifying active sites in the target protein. The default size of the grid box wherein the docking would take place is calculated using protein–ligand complex structures. A docking box is initially constructed which encloses the particular ligand and then the box size increases in random direction that ensures that the minimum length in any direction is atleast 22.5 Å. The specific active site of the nucleocapsid protein was generated by the algorithm to provide best suitable interaction (Trott et al. 2009).

Thus, the type of interaction that was predominant between the receptor proteins and antimicrobial peptide ligands were conventional hydrogen bonds, as shown in Table 4 and the most frequent interactions as shown in Fig. 2. The highest interacting bond predominant between the receptor molecule and the AMPs was conventional hydrogen bond. The most prominent interacting amino acids as per the distance given in the table between the receptors and the ligand include Lys85 and Arg13 (Conventional Hydrogen Bond), Ala110 and Glu21(hydrogen and electrostatic interaction) and Lys85 and Cys30 (carbon-hydrogen bond) between AP00180 and SARS nucleocapsid protein. For SARS nucleocapsid and AP00223, Arg13 and Glu111 (hydrogen and electrostatic interaction), Lys85 and Thr8 (Conventional Hydrogen Bond) and Lys 114 and Phe12 (Carbon Hydrogen Bonds) were the most prominent amino acid interactions. For SARS-CoV-2 nucleocapsid protein and AP00549 Asn196 and Asn513 (Conventional Hydrogen Bond) was the most promising hydrogen bonding interaction. For AP01061, Tyr29 and Asn91, as well as Thr256 and Gly5 (Conventional Hydrogen Bond), showed the most promising hydrogen bond interaction as in Fig. 3 and results after molecular docking are seen in Fig. 4

Table 4 Amino acid interactions of receptor nucleocapsid protein and their respective antimicrobial peptides
Fig. 2
figure 2

Total count of most frequent interactions of SARS-CoV and SARS-CoV-2 with their respective AMPs

Fig. 3
figure 3

Molecular docking simulation using Autodock Vina and interacting bonds (conventional hydrogen bonds: green; hydrophobic interactions: pink). A SARS nucleocapsid strain docked with AP00180, B SARS nucleocapsid strain docked with AP00223, C SARS-CoV-2 nucleocapsid strain docked with AP01061, D SARS-CoV-2 nucleocapsid strain docked with AP00549

Fig. 4
figure 4

Molecular dynamic simulation of SARS with AP00180and SARS- CoV-2 with AP00549, (A, D) number of hydrogen bonds and pairs within 0.35 nm, (B, E) Distribution of H bonds across the interaction, (C, F) RMSD of the docked complex

Molecular dynamic simulation analysis

Taking the ClusPro weighted scores as well as Prodigy ΔG values of the four shortlisted AMPs, the two AMPs- one for SARS and one for SARS-COV-2 were further filtered and analysed for molecular dynamic simulation. MD simulation is an appropriate step to ensure the stability of the nucleocapsid protein-AMP complex. SARS -AP00180 complex achieved the force and potential energy in 1840 steps with the energy drift of − 469,527 kJ/mol, whereas SARS -AP00549 achieved the same in 1758 steps with the energy drift of − 254,944 kJ/mol. Potential energies are given in Table 5 and Fig. 5. Constant temperature, pressure and density were maintained for 100 ps (Fig. 6). The final MD simulation was done for 50 ns.

Table 5 Energy minimization values of the final two docked complexes
Fig. 5
figure 5

Molecular dynamic simulation plots by GROMACS. RMSD, RMSF, Radius of gyration and Energy Minimization for SARS-AP00180 (AD) and SARS-CoV-2-AP00549 (EH) respectively

Fig. 6
figure 6

Molecular dynamic simulation plots by GROMACS. Temperature, pressure, and density for SARS-AP00180 (AC), SARS-CoV-2- AP00549 (DF) respectively

The stability of the molecules was examined based on the RMSD, RMSF and Rg values. The average, maximum and minimum RMSD, RMSF, and Rg are presented in Table 6. The RMSD values of each frame corresponding to time are given in Fig. 4b. RMSD values indicated that both the systems achieved equilibrium in an early phase of a 50 ns trajectory. The difference between the highest and lowest RMSD values of SARS -AP00180 was 1.424 nm, which was quite close to the average value (2.273 nm), demonstrating that the backbone of the SARS -AP00180 complex did not fluctuate much. On the other hand, the SARS-CoV-2 -AP00549 complex system showed a similar rate of RMSD. The difference of the highest and lowest RMSD values of SARS-CoV-2 -AP00549 was 1.473 nm, which was close to the average value (2.887 nm), demonstrating the stability of the SARS-CoV-2 -AP00549 complex throughout the 50 ns MD simulation.

Table 6 Maximum, minimum, and average values of SARS and SARS-CoV-2

RMSF determine the flexibility of any residue in the SARS -AP00180 and SARS-CoV-2 -AP00549 complex system. The fluctuation of the individual atom explained based on the RMSF values obtained from the 50 ns MD simulation of both the complex systems. The maximum, minimum, and average RMSF values for SARS -AP00180 were 1.354, 0.069 and 0.362 nm, respectively whereas, the maximum, minimum, and average RMSF values for SARS-CoV-2 -AP00549 were 3.08,0.069 and 0.386 nm, respectively can be seen in Table 5. The low average RMSF value suggested that individual atom exhibited stability in the dynamic state of both the systems throughout the MD simulation. The plot of the RMSF values vs. the atom number is shown in Fig. 5. The figures indicate that atoms of the SARS -AP00180 complex system at the positions of about 1500 ~ 3000 and 6500–7000 and, atoms of SARS-CoV-2 -AP00549 complex system at the positions of about 7500 ~ 9000 fluctuated somewhat relative to the others. The remaining atoms in both the complex systems were found to be quite stable during the MD simulation.

Rg for the SARS -AP00180 complex and SARS-CoV-2 -AP00549 complex were 2.634 nm and 1.744 nm, respectively. Rg values showed no change from the beginning of the simulation to the end of the simulation, demonstrating that both the complexes remain stable throughout the 50 ns trajectory. The Rg values corresponding to the simulation shown in Fig. 5 were used to facilitate the interpretation of the secondary structure.

The hydrogen bonds were stable during the 15 ns run as seen in Fig. 7.

Fig. 7
figure 7

Molecular Dynamic simulation of SARS-CoV and SARS-CoV-2: Hydrogen bonds of the docked complexes represent the number of hydrogen bonds as a function of time. Pairs within 0.35 nm represent—the number of atom pairs that satisfy the distance criterion but not the angle criterion for hydrogen bonding

Conclusion

In the current work, we have employed a docking, scoring algorithm and dynamic simulation for combined peptide-protein binding search using an AMPs database. Our computational study confirms to provide two most suitable antimicrobial peptides which show positive interaction with nucleocapsid protein and may potentially hinder the activity of the viral protein. This nucleocapsid protein plays a major role in infecting a cell by RNA packaging and replication of the virus. Plectasin (AP00549) showed the lowest energy score (− 1183 kcal/mol) and highest favourability percentage (76.7%) making it efficient for SARS-CoV-2 viral intervention. Human Defensin-5 (AP00180) showed the lowest energy score (− 1063.7) and highest favourability percentage (92.9%) making it efficient for SARS. From our results, it may confirm that these AMPs may be suitable for inhibiting SARS-CoV-2, and SARS virus infection. With the simulation studies, we tried to create a natural environment for the peptide to interact with the protein. The stability of the molecules was examined based on the RMSD, RMSF and radius of gyration values. The difference between the highest and lowest RMSD values of the SARS-CoV-2-AP00549 complex and SARS-AP00180 complex determined that the backbone did not fluctuate throughout the 50 ns MD simulation. For RMSF, the maximum, minimum and average values were used to determine the overall stability of the docked structure and as for Rg values, the complexes exhibited no change from the beginning of the simulation to the end of the simulation, demonstrating the stability of both the complexes throughout the 50 ns trajectory. In-silico studies suggest a potent and successful interaction with the active site of nucleocapsid with the antimicrobial peptide. These molecules can be further taken for the next step of research and all in vitro experiments will be carried out at room temperature that is the reason for conducting MD simulations at room temperature. Other complementary experiments also carried out at the same temperature so that the structures and energies from different MD simulations can be collected into a single ensemble for analysis. The structure and interaction insights with the AMPs provided in this study may provide a deep understanding for developing anti-SARS-CoV-2 inhibitors. However, the results are preliminary and certainly need experimental validation using in vitro and in vivo studies. Specific assays are needed to confirm the mechanism of action.