Introduction

Acinetobacter baumannii is an aerobic, pleomorphic, non-motile, typically short, almost round, and rod-shaped Gram-negative bacterium (Perez et al. 2007). It can cause infections in parts of the body such as blood, urinary tract, and lungs (Eliopoulos et al. 2008). As an opportunistic pathogen, A. baumannii has a high incidence among patients in the intensive care unit (ICU) setting and immunocompromised individuals particularly those who have experienced a prolonged hospital stay (Valencia et al. 2009). In addition, the frequency of A. baumannii infections has been enhancing gently (Dijkshoorn et al. 2007). A. baumannii is able to grow at various temperatures and pH conditions. Also, this organism can exploits different types of both carbon and energy sources. These properties cause A. baumannii can persist in either moist or dry conditions in the hospital environment. This hardiness, combined with its resistance to many antimicrobial drugs, contributes to spread it in the hospital setting (Obeidat et al. 2014).

This phenomenon of multidrug-resistant (MDR) pathogens has become a serious concern about nosocomial and community infections (Dijkshoorn et al. 2007; Tohidinia et al. 2020). During 1970s, A. baumannii is thought to have been sensitive to wide range of antibiotics, but today the pathogen emerges immense resistance to most first-line antibiotics (Howard et al. 2012). Pathogenesis in A. baumannii infections is a result of several virulence factors such as porins, capsules, and cell wall lipopolysaccharide, enzymes, biofilm production, motility, and iron-acquisition systems (Eijkelkamp et al. 2014). These virulence factors contribute to the organism to resist in hard environmental conditions and enable development of intense infections (Ayoub Moubareck and Hammoudi Halat 2020). In addition, various resistance mechanisms in this pathogen are identified which resistance with major classes of antibiotics including penicillins, cephalosporins, carbapenems, most aminoglycosides, fluoroquinolones, chloramphenicol, and tetracyclines (Kyriakidis et al. 2021). These mechanism including wide array of antibiotic-hydrolyzing enzymes, efflux pump changes, impermeability, and antibiotic target mutations which gives a unique ability to the organism to maintain a multidrug-resistant phenotype (Asif et al. 2018; Espinal et al. 2019). According to the report of the Centers for Disease Control and Prevention, in 2017, Carbapenem-resistant Acinetobacter caused an estimated 8500 infections in hospitalized patients, and 700 people died as a result. CDC has reported a list of urgent threats bacteria that have the danger of antibiotic resistance. This list includes Carbapenem-resistant Acinetobacter, Candida auris, Clostridioides difficile, Carbapenem-resistant Enterobacterales, Drug-resistant Neisseria gonorrhea. Carbapenem-resistant Acinetobacter has been reported in the first line of this serious danger (Prevention 2019).

Carbapenem is a category of highly effective antibiotic drugs generally used for the treatment of severe or high-risk bacterial infections (Shaker and Shaaban 2017). The growing trend of Carbapenem resistance in A. baumannii recently has become a grave concern around the world because it limits the range of therapeutic alternatives (Higgins et al. 2010). In the context of the increasing worldwide occurrence of Carbapenem-resistant A. baumannii strains, Non-enzymatic molecular mechanisms act in cooperation with Carbapenem-hydrolyzing β-lactamases to provide an intense resistance. However, this process has not been described clearly (Catel-Ferreira et al. 2011). MDR strains showed a porin-mediated mechanism relating to the Carbapenem resistance-associated outer membrane protein, CarO (Catel-Ferreira et al. 2011; Mussi et al. 2007; Siroy et al. 2005). CarO protein generally is an eight‐stranded β‐barrel protein of 29 kDa which plays a key role in selective uptake of the ornithine, other basic amino acids, and uptake of Carbapenem (Catel-Ferreira et al. 2011). Mutations in the CarO gene can alter the structure or decrease the expression of the porin, resulting in a decline of antibiotics entry into the bacteria (Poirel et al. 2006; Zahn et al. 2015). In This study CarO protein was evaluated by available bioinformatics tools for designing an efficient multi-epitope subunit vaccine for the induction of immune responses against A. baumannii infections.

One of the critical issues in immunization is the presence of epitopes including B cells, HTL, and CTL that are recognized by immune system cells and capable of stimulating an immune response (Catel-Ferreira et al. 2011). The epitopes are the parts of a protein that have two shapes, continuous and discontinuous epitopes. Discontinuous epitopes consist of the residues from distant parts of the sequence that collectively made up the epitope and they will usually not recognized individually by antibodies appointed against the discontinuous epitope, therefore these epitopes are not suitable for the multi-epitope vaccine. However, continuous epitopes, also called sequential epitopes, are conformation independent, so they have the ability to be recognized by antibodies individually and independently (Chandra and Singh 2012; Rubinstein et al. 2008). Epitopes can induce a more direct and potent immune response, than the response induced by the whole protein. Therefore, the identification of antigenic epitopes can be more effective and safe in vaccine design (Palatnik-de-Sousa et al. 2018).

Immunoinformatics plays a key role in vaccine design and antibody production (Kazi et al. 2018). In the past, antibody design and vaccine development are expensive and time-consuming. Nowadays, advances in the field of bioinformatics have provided practical tools which can be used to lessen the time and cost for vaccine and antibody design (Anderson et al. 2013; Kazi et al. 2018). The Immunoinformatics approach allows the identification of the immunogenic epitopes from the pathogen genomes. Also, the ideal and immunogenic parts could be developed as potential vaccine candidates to trigger protective immune responses in the hosts (Ali et al. 2017).

Adjuvants are agents or drugs that have few or no antigenic effects or properties but can help enhance the efficacy and immunogenicity of vaccines (Mishra et al. 2020). In many studies, several kinds of adjuvants were used that had significant effects on the immunogenicity of the vaccine (Cayatte et al. 2017; Doener et al. 2019; Gasparini et al. 2001; Suschak et al. 2017; Yang et al. 2020). Adjuvants L7/L12 ribosomal protein (Behbahani 2020; Saadi et al. 2017), beta-defensin (Gupta et al. 2020; Wang et al. 2020), HBHA protein (Kumar et al. 2019; Lei et al. 2020), and HBHA conserved sequence(Rashidian et al. 2020; Solanki and Tiwari 2018) recently been identified as efficient and helpful adjuvants, and we used them in this study.

Infections caused by A. baumannii are becoming extremely difficult to prevent and treat because infection control strategies against the spread of A. baumannii are facing new multidrug-resistant (Rasooli et al. 2020). This study identifies the potential and immunogenic epitopes of CarO protein by bioinformatics tools. Then, linkers and adjuvant were added to the selected epitopes to design final multi-epitope vaccines and was selected the best vaccine construct based on allergenicity, antigenicity, and solubility features. The 3D structure of the protein was predicted, and its affinity to different HLA, TLR2, and TLR4 was investigated by molecular docking. Finally, the vaccine coding gene expression in E. coli was evaluated, and in silico cloning was performed in the pET28a (+) vector. This selected vaccine construct can be developed to cure the infection caused by the A. baumannii.

Method

Sequence Availability and Alignment

The CarO protein sequence was retrieved from NCBI at http://www.ncbi.nlm.nih.gov/protein and saved in FASTA format for further analyses (Pruitt et al. 2005); then Protein BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi) against reference sequence on the database was performed using Blosum80 matrix to collect homologous sequences between different bacteria (Tatusova and Madden 1999). Because BLOSUM80 created for highly similar alignments and is used to find related proteins (Wheeler 2003). PRALINE server at https://www.ibi.vu.nl/programs/pralinewww/ was used for generating of alignments. This server based on homology-extended multiple alignments and the integration of predicted secondary structure information with iteration capabilities methods which have many advantages and have made this server stronger (Simossis and Heringa 2005). Alignments can demonstrate the conservancy of the protein residues among various strains (Simossis and Heringa 2003). The suitable vaccine candidate should able effective against all strains of a given pathogen. The conservancy of the amino acids among bacteria other than Acinetobacter implies probable cross-reactivity levels.

Prediction T Cell MHC Class I Epitope

The recognition of MHC-I antigenic peptide which exposed at the target cell surface is crucial for the adaptive immune response (Nguyen van Binh et al. 2006). NetCTL server at http://www.cbs.dtu.dk/services/NetCTL/ was used to predict MHC class I epitopes. In this server, the method integrates prediction of peptide MHC class I binding, proteasomal C terminal cleavage, and TAP transport efficiency (Larsen et al. 2005). Prediction threshold value 0.75 was set for epitope identification. Epitopes were chosen with a combinatorial score > 1 for further analyses (Larsen et al. 2007). Also, IEDB server at http://tools.iedb.org/mhci/ was used to predict peptide binding to MHC class I molecules based on the consensus method. Identified T-cell epitopes with HLA alleles were selected by percentile rank and IC50 values. The lower the percentile rank was shown the higher interaction between the peptide and MHC molecules. In Addition, high binding epitopes have an IC50 value below 50 nM, and weak binding peptides and IC50 value below 500 nM. So, Epitopes with both IC50 value < 50 nM and percentile rank ≤ 0.2 were selected for more analyses (Vita et al. 2019).

Prediction Class I Antigenic, Toxicity and Immunogenicity

A second screening step including their important features such as antigenicity and allergenicity which were checked by AllergenFP server at http://ddg-pharmfac.net/AllergenFP/ and Vaxijen server at http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html. AllergenFP is generally utilized for the prediction of allergenicity of epitopes for vaccine design (Dimitrov et al. 2014). VaxiJen server used to predict antigenic properties of the peptide at a threshold value of 0.5. This server depends on Auto Cross Covariance (ACC) transformation, and alignment-independent predicted antigenic epitopes by their physicochemical behavior (Doytchinova and Flower 2007). In the following, Epitope/MHC complex should have the ability to evoke an immune response. So, we have used MHC I immunogenicity prediction tool in the IEDB server (Calis et al. 2013). Default parameters were selected and the epitopes which gained the positive value were chosen.

Prediction T Cell MHC Class II Epitope and Analysis Antigenic and Toxicity

To identify MHC-II binding epitopes two servers were employed. IEDB server at http://tools.iedb.org/mhci/ were used to compute MHC-II binding epitopes based on the consensus method. The consensus method uses a combination of both stabilization matrix alignment method and average relative binding matrix method. Predicted epitopes were selected relying on percentile rank and IC50 values (percentile rank ≤ 0.2 and IC50 value < 50 nM) (Jensen et al. 2018). NetMHCII 2.3 server at http://www.cbs.dtu.dk/services/NetMHCII/ was used to predict binding of peptides to HLA-DR, HLA-DQ and HLA-DP using artificial neural networks (Kaushik and Therapeutics 2019). Among predicted strong binding epitopes were chosen those had score < 1 for further analyses. Antigenicity, toxicity, and allergenicity were evaluated for predicted epitopes. The antigenicity and allergenicity of the epitopes were calculated by VaxiJen and AllergenFP server respectively (Dimitrov et al. 2014; Doytchinova and Flower 2007). In addition, mutation and toxicity features were checked by ToxinPred server at http://crdd.osdd.net/raghava/toxinpred/. In the silico method, ToxinPred was used to predict the toxicity of peptides and about how to design a peptide or protein with desired toxicity by mutating a minimum number of amino acids (Gupta et al. 2013).

Prediction and Screening of Liner B-Cell Epitopes

The recognition of B-cell epitopes has a key role in vaccine design, immunodiagnostic tests, and antibody production. There are various servers to predict B cell epitopes. Since each server performs based on specific algorithms and exclusive methods in epitope prediction. Therefore, the usage of several tools to predict linear B-cell epitopes in protein sequences are more valid. Ellipro server at http://tools.immuneepitope.org was employed to predict linear antibody epitopes based on a protein antigen's 3D structure (Ponomarenko et al. 2008). Also, liner B cell epitopes were predicted using an artificial neural network by ABCpred server at http://crdd.osdd.net (Saha and Raghava 2006a, b) In addition, SVMTriP (http://sysbio.unl.edu/SVMTriP/) was used in the prediction of the B cell linear epitopes. This server is a Support Vector Machine which used to predict linear antigenic epitopes which combine the Tri-peptide similarity and Propensity scores (SVMTriP) (Yao et al. 2012). For more identification of continuous B-cell epitopes from the CarO protein, Bcpred server at http://ailab.ist.psu.edu/bcpred/predict.html was utilized and the epitopes size of 20 amino acids as well as the specificity threshold of 0.75% were considered. The Bcpred server for predicting employs a subsequence kernel-based SVM classifier (EL‐Manzalawy et al. 2008). Selected epitopes from the servers mentioned above were screened for antigenicity and allergenicity properties (Dimitrov et al. 2014; Doytchinova and Flower 2007).

Construction of Model Vaccine

To construct the novel vaccine with low toxicity, allergenicity, and high immunogenicity, we have analyzed the different combinations of sequence constructs. During this novel multi-epitope subunit vaccine construction. Firstly, CTL, HTL, and B cell epitopes were joined with the help of amino acid linkers HEYGAEALERAG and GGGS respectively (Hasan et al. 2019; Nezafat et al. 2014). Secondly, for enhancing the immunogenicity of this sequence constructs were added the four different adjuvants L7/L12 ribosomal protein (Mallick et al. 2007), beta-defensin (Dar et al. 2015), HBHA protein (Nezafat et al. 2014), and HBHA conserved sequence respectively (Rana and Akhter 2016). Adjuvants sequences were added with the help of EAAAK linkers at both N- and C-terminal of Adjuvants (Saadi et al. 2017). PADRE peptide sequences were also incorporated along with the adjuvants by GGGS linker. PADRE peptide induced CD4+ T-cells that improve efficacy and potency of peptide vaccine (Wu et al. 2010).

Allergenicity, Antigenicity and Solubility Evaluation

The Allergenicity of vaccine constructs was predicted by AlgPred server (https://webs.iiitd.edu.in/raghava/algpred/submission.html) which based the SVM module on amino acid composition. A threshold score of prediction has been considered to be − 0.40. Prediction score less than the threshold value point displays the non-allergic feature of the peptide (Saha and Raghava 2006a, b). VaxiJen server and ANTIGENpro program (Kalita et al. 2020) at http://scratch.proteomics.ics.uci.edu were employed for the prediction of vaccine antigenicity behavior. In addition, SOLpro online server at http://scratch.proteomics.ics.uci.edu was utilized to predict the propensity of the protein to be soluble upon overexpression in Escherichia coli. This server uses a two-stage SVM architecture method to evaluate the protein solubility (Magnan et al. 2009).

Secondary Structure Prediction

The PSIPRED at http://bioinf.cs.ucl.ac.uk/psipred/ was used to predict the secondary structure of the vaccine construct. PSIPRED is a simple and accurate secondary structure prediction method, incorporating two feed-forward neural networks that perform an analysis on output obtained from PSI-BLAST (McGuffin et al. 2000). In addition, Phyre2 server at http://www.sbg.bio.ic.ac.uk/~phyre2/html/page.cgi?id=index employed to validate the PSIPRED predictions (Kelley et al. 2015).

Vaccine 3D Modeling, Refinement, and Validation

Homology modeling is the construction of an atomic model of a purpose protein based solely on the target's amino acid sequence and the experimentally determined structures of homologous proteins, referred to as templates (Kelley et al. 2015). While Ab initio prediction is the challenging attempt to predict protein structures based only on sequence information and without using templates (Yang and Zhang 2015). There are many tools and servers that are used for homology modeling and Ab initio prediction. There is no single modeling program or server which is superior in every aspect to others (Dolan et al. 2011). The SWISS-MODEL server at http://swiss-model.expasy.org/ is a fully automated protein structure homology-modeling server (Kiefer et al. 2009). I-TASSER server at https://zhanglab.dcmb.med.umich.edu/I-TASSER/ is based on hierarchical method to protein structure prediction and structure-based function annotation (Zhang 2008).

GMQE (Global Model Quality Estimation) is a quality estimation which combines properties from the target–template alignment and the template search method. QMEAN, which stands for Qualitative Model Energy Analysis, is a composite scoring function describing the major geometrical aspects of protein structures. All 3D models of the protein built by SWISS-MODEL server, were qualitatively estimated by GMQE and QMEAN scores (Schwede et al. 2003). In addition, C-score is a confidence score for estimating the quality of predicted models by I-TASSER. It is calculated based on the significance of threading template alignments and the convergence parameters of the structure assembly simulations (Yang et al. 2015). Finally, Qualitative evaluation of 3D models was done by ProSA at https://prosa.services.came.sbg.ac.at. ProSA specifically faces the needs confronted in the authentication of protein structures acquired from X-ray analysis, NMR spectroscopy, and hypothetical estimations (Wiederstein and Sippl 2007). Rampage at http://mordred.bioc.cam.ac.uk/rapper/rampage.php was also employed for estimation of model quality using Ramachandran plot which is an algorithm for atomic level, high-resolution protein structure improvement (Hooft et al. 1997).

Molecular Docking

The aim of molecular docking is to give a prediction of the binding affinity between a receptor molecule and ligand by computation methods (Ballester and Mitchell 2010). The vaccine construct 3 (V3) was made by I-TASSER server. In addition, PDB IDs of different HLA alleles were downloaded from the RCSB at https://www.rcsb.org/ (Rose et al. 2010). Molecular docking of V3 with 6 various HLA alleles including 1A6A(HLA-DR B1*03:01), 3C5J(HLA-DR B3*02:02), 1H15(HLA-DR B5*01:01), 2FSE(HLA-DRB1*01:01), 2Q6W(HLA-DR B3*01:01), and 2SEB(HLA-DRB1*04:01) was carried out by PatchDock server (Schneidman-Duhovny et al. 2005) at https://bioinfo3d.cs.tau.ac.il/PatchDock/ to represent HLA-peptide interactions. PatchDock predicted a number of 266 solutions that were again subjected to the FireDock server (http://bioinfo3d.cs.tau.ac.il/FireDock/) to refine and re-scoring of molecular docking score (Andrusier et al. 2007). TLRs are the most important membrane-bound PRRs, playing a critical role in the recognition of invading pathogens by the innate immune system. A. baumannii interacts with host cells mainly by engaging TLR2 and TLR4 (Chen 2020). In the same way, the PatchDock server was used to docking of vaccine construct (V3) with TLR2 (PDB ID 6NIG) and TLR4 (PDB ID 4G8A) and then the top 10 models were refined and re-scored by FireDock server. The docking models of TLRs with vaccine construct (V3) were displayed by the HADDOCK 2.4 server at https://wenmr.science.uu.nl/ (Van Zundert et al. 2016). The best cluster was chosen from the docked clusters based on the lowest HADDOCK score. Finally, the interacting residues, hydrogen bonds and hydrogen bonds distances between the vaccine and the TLRs were mapped using PDBsum at http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=index.html (Laskowski 2001).

Prediction of Various Physicochemical Properties

The functionally characterize of vaccine sequence was predicted by Expasy ProtParam server (https://web.expasy.org/protparam/). This server has determined the physicochemical characteristics such as the molecular weight, number of amino acids, PI values, hydropathicity GRAVY values, instability index, aliphatic index, and estimated half-life of the protein. Calculate of these physicochemical properties is on the basis of pK values of different amino acids. The instability index value shows protein is stable or unstable. The instability index value is < 40 for stable protein and > 40 for unstable protein. The aliphatic index determines the volume occupied by the aliphatic side chains of protein (Gasteiger et al. 2005).

Codon Optimization of the Vaccine Construct and In-Silico Cloning

It is important to consider codon optimization when performing expression studies because codon optimization plays a critical role in the success of protein expression (Mauro and Chappell 2018). Java Codon Adaptation Tool (JCAT) at http://www.jcat.de/ was employed to adapt the codon usage of vaccine construct to E. coli host strain 12. Vaccine amino acid sequence was back-translated to DNA sequence and then it was adapted for codon usage to E. coli. The adaptation was based on Codon Adaptation Index values (CAI). The rho-independent transcription terminators, prokaryotic ribosome binding sites, and cleavage sites of some restriction enzymes were avoided during the operation performed by JCAT server (Grote et al. 2005). In addition, to clone the adapted gene sequence of the final vaccine construct (V3) in E. coli pET28a vector (Ali et al. 2021), using Snapgene tool at https://www.snapgene.com/ to ensure the vaccine construct expression (Arumugam 2021) and BglII and ApaI restriction sites were introduced to the N and C-terminals of the sequence, respectively.

Result

The flow chart of experimental process which summarizes an overview of the methodology used in this study is shown in Fig. 1.

Fig. 1
figure 1

Flow chart showing an overview of our methodology

Protein Sequencing and Alignments

CarO protein sequence with accession No. AKL79738.1 is available as a query for protein BLAST against Acinetobacter producing a set of sequences containing various species of Acinetobacter. Out of 100 alignments, 20 sequences with the highest similarity (identity ≥ 90%, query coverage: 100% and E value: 0) were selected as T-COFFEE alignment input. BLAST search of Acinetobacter revealed numerous sequences of CarO. These hits showed that the CarO protein is specific for Acinetobacter species. The image of PRALINE alignment results with conservation color scheme is shown in Fig. 2.

Fig. 2
figure 2

CarO sequence alignment with 20 sequences obtained from protein BLAST against Acinetobacter. The superposition was made with the T-coffee program and adjusted manually. Residues conservancy is depicted by blue to pink colors

Selection of Potent MHC-I Epitopes in Protein and Analysis Eminent Features of Epitopes

On the basis of the high combinatorial score, the best T-cell epitopes were predicted by NetCTL server using the protein sequence. The software identified 36 epitopes in CarO protein. Peptides with combinatorial score > 1 were selected to other analyses (Table 1). In IEDB server, 9 epitopes were elicited with both IC < 200 nM and percentile rank ≤ 0.2 were subjected to afterward analysis (Table 1). After finalizing selected epitopes of both NetCTL and IEDB based above conditions, their important features including allergenicity, antigenicity, and immunogenicity Checked (Table 1). The sequences of IAPYLGFGF, KRIGNGDTL, WQANPYVGL and ALLWQANPY were considered as epitopes of T cells that interact with MHC Class-I and have the most important epitopes due to their features such as Non-allergen, positive scores in immunogenicity, and high scores in antigenicity.

Table 1 CTL epitopes (MHC I peptides) of CarO protein with their antigenicity, allergenicity and immunogenicity scores

Selection of MHC-II Epitopes and Prediction Their Antigenic and Toxicity

In addition to MHC-I epitope prediction, CarO protein was subjected to MHC-II binding prediction, using IEDB and NetMHCII server. The epitopes that elicited higher affinity (IC < 200 nM) and percentile rank ≤ 0.2 were subjected to afterward analysis (Table 2). NetMHCII server predicted more than 200 regions that could have an affinity for MHCII proteins. We selected 17 epitopes because these regions showed strong binding with a score < 1 and promising characteristics. After this process, selected epitopes from both servers were analyzed based on their essential features including mutation, toxicity, allergenicity, and antigenicity (Table 2). Finally, the sequences of GNGDTLSIDGKNYQQ, PWAQGLYIAAGAAYL, WAQGLYIAAGAAYLD, AQGLYIAAGAAYLDN, GLYIAAGAAYLDNDY, QGLYIAAGAAYLDND, ANEIRNDNKYEWMPV, GKMSYKNDIAPYLGF, KMSYKNDIAPYLGFG, RGKMSYKNDIAPYLG, DIAPYLGFGFAPKIS, and YLGFGFAPKISKNWG as the epitopes of T cells that interact with MHC Class-II alleles were considered as the most essential epitopes due to features of Non-allergen, non-toxin and high score in antigenicity.

Table 2 HTL epitopes (MHC-II peptides) of CarO protein with their antigenicity, allergenicity, toxicity and mutation scores.

Linear B Cell Epitope Prediction

The analysis of the sequences by different bioinformatics programs for T and B cells yielded approximately 100 epitopes. Svmtrip predicted 3 Linear B cell epitopes based on the score. The epitopes recommended by this server are "AMADEAVVHDSYAFDKNQLI", "GAAYLDNDYDLAKRIGNGDT", and "WQANPYVGLA LGYNGGDISW". 6 linear along B cell epitopes were predicted by ElliPro software. The sequences and pictures of 6 linear epitopes with the highest PI (protrusion index) are shown in Fig. 3 and Table 3 (Rows 7–12).

Fig. 3
figure 3

Epitope mapping on 3D models. Discovery Studio Visualizer 2.5.5 software was used. From 1 to 6 pictures, 6 linear epitopes with the highest PI score predicted by Ellipro server are shown

Table 3 B-cell epitopes of CarO protein with their antigenicity and allergenicity scores

ABCpred result showed 65 hits of 16 metric peptide sequences as B-cell epitopes ranking based on scores. The best epitopes predicted by the server with a score above 0.85 are the sequences of "ACINETACTERAMANN", "EVGAYYTGNPKVELTQ", "RAEVGTTGYGGALLWQ", "EAVVHDSYAFDKNQLI", "GGDISWTDD VSVNGTK", "DNKYEWMPVGKVGVNF", "PVTGNPTSAQDAVDKE", "TQYNLAPVTGNPTSAQ", and "GQEGGVRGKMSYKNDI". BCpred server predicted 6 B-cell epitopes. The scores of epitopes ranging from 0.787 to 0.994 where a high score represents the higher probability of becoming B-cell epitopes and higher affinity for the B-cell receptor. The total promising linear B cell epitopes which were mentioned above were analyzed by the Vaxijen server (≥ 0.4 score) and AllergenFP server in order to participate in the vaccine construct.

Comparison of All the Predicted Epitopes to Select the Final Epitopes

After collecting the immunoinformatics information with the help of T-cell prediction and B-cell prediction, the high scores of HTL, CTL, and B cell epitopes were predicted and analyzed for their essential feature. Finally, to make the final multi-epitope construct, 4 CTL epitopes, 4 HTL epitopes, and 4 B cell epitopes due to features of non-Allergen, non-toxic, and higher score in antigenicity (Table 4).

Table 4 The final selection of CTL, HTL and B-cell epitopes for multi-epitope vaccine construct

Construction of Final Vaccine with Different Adjuvants, Linkers, and PADRE Sequence

A total of four vaccine constructs were prepared and further analyzed. All four vaccine constructs were added with the respective adjuvants with the help of EAAAK linker. MHC-I, MHC-II, B-epitopes, and PADRE sequence were joined together by HEYGAEALERAG and GGGS linkers (Fig. 4). Details of the vaccine constructs have been mentioned in Table 5.

Fig. 4
figure 4

Schematic diagram of multi-epitope vaccine

Table 5 Allergenicity prediction of all vaccine construct using Algpred, Vaxijen and scratch protein predictor server

Prediction Features of Different Vaccine Constructs

Antigenicity of the vaccine proteins was predicted using VaxiJen server and ANTIGENpro. All Constructs were found as potential vaccine candidates with high antigenicity scores and the ability to stimulate an efficient immune response (Table 5). Also all 4 constructs showed the solubility more than 0.7, which indicated that the vaccine construct will be highly soluble during its heterologous expression in the E. coli (Table 5). But, among four constructs, the only vaccine with beta-defensin (V3) was found to be non-allergic in nature (Allergenicity < − 0.4). So, the V3 construct was chosen for further analysis.

Prediction Secondary Structure

PSIPRED and Phyre2 servers were used to predict the secondary structure of the construct V3. The predicted structure of the protein confirmed to have 68.70% alpha-helix, 27.48% sheet, and 3.82% coil structure (Fig. 5). The results from Phyre2 confirmed the results of the PSIPRED program.

Fig. 5
figure 5

Secondary structure prediction of vaccine construct (V3) using PESIPRED server

Vaccine 3D Modeling, Refinement, and Validation

Swiss model and I-TASSER software were recruited for homology modeling and Ab initio prediction. The former prepared 12 different models and the latter introduced 5 models. Among the structures predicted by the Swiss model, the best structures were selected based on the QMEAN and GMQE scores. QMEAN scores about zero show good agreement between the model structure and experimental structures of similar size and scores of − 4.0 or below indicate models with low quality. In addition, the GMQE score is expressed as a number between zero and one, reflecting the expected accuracy of a model built with that alignment and template. Higher numbers illustrate higher reliability. Two 3D models have shown GMQE with scores − 1.41 and − 1.38 and QMEAN with scores of 0.05 and 0.04 respectively. The best structure among five models of I-TASSER were measured on the basis of C-score. The C-score is normally in [− 5, 2] and It should be noted that the higher this score, the more accurate the model quality. Since the C-score of model 2 is higher (= − 0.83) than other models, is expected to have good quality. Finally, three predicted 3D models were selected for further analyses.

The quality of the 3D models were evaluated by Rampage and Prosa servers. Based on the result of Ramachandran plot and Prosa the best model was chosen from the predicted structure by I-TASSER (Fig. 6a). We showed the Cartoon diagram of a computational structural model of the vaccine in the following figure by using Molecular Operating Environment Software. This figure displays that the sequence of each epitope is properly folded and exposed (Fig. 6b). This structure of vaccine construct show which 87.438% of the residues are in favored regions. Also, Prosa revealed that the predicted model was among other acceptable proteins with z score = − 5.27 (Fig. 7).

Fig. 6
figure 6

a The best model of epitope vaccine (V3) predicted by I-TASSER server. b All of the regions of the vaccine for display on the scaffold are labeled and colored in representing adjuvants_yellow, PADRE_purple, B-cell epitopes_green, Linker_gray, HTL_blue, and CTL_pink (Color figure online)

Fig. 7
figure 7

Model evaluation. a Ramachandran plot of final epitope vaccine (V3) model. Number of residues in favored region: 355 (87.438%). Number of residues in allowed region: 41(10.099%). Number of residues in outlier region: 10 (2.463%). b Prosa protein structure analysis results. Z score = − 5.27. Overall quality of the ultimate model is acceptable

Molecular Docking

Docking analysis was employed between the vaccine construct 3 and different HLA alleles (Table 6). Construct V3 displayed biologically significant results and HLA-DR B1*01:01 found to be superior among different HLA in terms of free binding energy. Moreover, docking was also used to predict the binding affinity of the designed vaccine construct with human TLR2 and TLR4 receptors using PatchDock and Firedock servers respectively. FireDock output refinement of PatchDock server showed the lowest global energy of − 27.61 and − 50.74 for TLR2 and TLR4 respectively. In order to display docking of protein–ligand complexes was used HADDOCK. The top cluster with the lowest HADDOCK score of − 100.1 ± − 5.4 and − 160.9 ± − 29.4 suggests a good binding affinity between the vaccine and the TLR2 and TLR4. The docked complexes along with hydrogen bonds and distance of hydrogen bonds are shown in Figs. 8 and 9 and Tables 7 and 8.

Table 6 Prediction binding energy of vaccine construct (V3) with selected HLA molecules by PatchDock, Firedock
Fig. 8
figure 8

a Showing TLR2/vaccine docked complex. Vaccine construct 3 is shown in blue colour while TLR2 is shown in orange colour. Interacting residues and active pockets between docked TLR2/vaccine are displayed with purple and yellow filling space model respectively. b Few prominent hydrogen bonds within vaccine-TLR2 complex are focused (Color figure online)

Fig. 9
figure 9

a Showing TLR4/vaccine docked complex. Vaccine construct 3 is shown in blue colour while TLR4 is shown in orange colour. Interacting residues and active pockets between docked TLR4/vaccine are displayed with purple and yellow filling space model respectively. b Few prominent hydrogen bonds within vaccine-TLR4 complex are focused (Color figure online)

Table 7 List of residues involved in hydrogen bonds and hydrogen bonds distance in TLR2-vaccine interactions
Table 8 List of residues involved in hydrogen bonds and hydrogen bonds distance in TLR4-vaccine interactions

Physicochemical Analysis of Vaccine Construct (V3)

The vaccine construct (V3) was characterized on the basis of various physicochemical properties using ProtParam tool. The molecular weight of the vaccine construct was found 51.57 kDa which shows its good antigenic potential. GRAVY (a hydropathic index) was − 0.527 (a negative value), which represents the hydrophilic nature of constructs. The calculated instability index of the protein was 23.43 which classified it as a stable one. The estimated half-life of the constructed vaccine was expected to be 1 h in mammalian reticulocytes in vitro while more than 10 h in E. coli in vivo. Aliphatic value 71.90 also showed that the vaccine construct has a good characteristic to initiate an immunogenic reaction.

In-Silico Cloning of Chimeric Vaccine Construct (V3) for Its Heterologous Expression

Reverse translation and codon optimization were performed by considering the expression system of the host in order to express the multi-epitope vaccine. The optimum range of GC content was considered to be between 30 and 70%. The GC content of the optimized codons (53.95%) was significant. Also, the high value of the CAI score shows a high expression of the gene. Our construct has a codon adaptation index (CAI) of 1. Recognition sites for BglII and ApaI restriction enzymes were added to the 5′ and 3′ end of the optimized gene and then the final vaccine construct (V3) with a total length of 1465 bp was cloned into pET28a vector for its heterologous cloning and expression in E. coli (Fig. 10). Polyhistidine-tag is an amino acid motif in proteins that typically consists of at least six histidines (His) residues which is often at the N- or C-terminus with spaced or attached to inserted protein. His-tag will be used to purify protein in further experiments. In addition, the His-tag is part of the pET28a vector structure.

Fig. 10
figure 10

In silico cloning of the gene sequence of the final vaccine construct V3 into pET28a (+) expression vector. The target sequence was inserted between BglII (401) and ApaI (1334)

Discussion

The emergence of multidrug resistance (MDR) A. baumannii is recognized as a serious health concern in hospitals across the world (Savov et al. 2019). CarO from A. baumannii is a membrane porin that uptake small molecules such as l-ornithine, Carbapenem, and other basic amino acids (Labrador-Herrera et al. 2020; Tohidinia et al. 2020). In a study, researchers were demonstrated outer membrane vesicles (OMVs) of A. baumannii had able to induce protective immunity against A. baumannii. Among OMV proteins, CarO was identified as a highly immunogenic protein (McConnell et al. 2011). Currently, there is no definite cure to protect against the A. baumannii due to increase MDR. Therefore, it is essential to take preventive measures against it (Solanki and Tiwari 2018). The employment of bioinformatics tools in the field of vaccinology can be aiding the selection of potential vaccine candidates and simplify the optimization of the chosen immunogen (Gupta and Kumar 2020; Tohidinia and Sefid 2020).

The entire proteome of the A. baumannii ATCC 19606 strain was retrieved from NCBI and the result of BLAST from various sequences of CarO showed it is specific to Acinetobacter species. Alignment of CarO sequences from various strains indicates that such vaccines are able to trigger antibodies with complete specificity for A. baumannii and CarO is a conserved protein among Acinetobacter species. Therefore, it can one of the best choices for vaccine design. A multi-epitope vaccine consists of several B and T epitopes with sensible linkers in order to avoid the generation of junctional epitopes and increase the antigen presentation process (Forouharmehr 2021; Shey et al. 2019). A multi-epitope vaccine can trigger both humoral and cell-mediated immunity (Ali et al. 2017). The cell-mediated immune response is mediated by T-cells whereas the humoral immune response is mediated by produced B-cells (Channappanavar et al. 2014; Kozakiewicz et al. 2013). Therefore, the correct and accurate selection of TCL, HTL, and B-cell epitopes to design a multi-epitope vaccine has a direct relationship with the immune system response.

Approximately 300 epitopes of CarO protein were predicted by IEDB with recommended methods and NetCTL server to be MHC I peptides that can bind to a large number of HLA-A and HLA-B alleles with a high binding affinity. 350 epitopes of CarO protein were also predicted by using the MHC-II binding prediction tool of IEDB and NetMHCII server as MHC II peptides. But the most important epitopes were selected among the identified epitopes with certain scores mentioned in the method section and then their important characteristics were examined. One of the most important features considered for selecting CTL epitopes was the Vaxijen score. The higher Vaxijen score indicating the higher antigenicity of an amino acid sequence and is able to further stimulate the immune system (Saha et al. 2017). In this study, only CTL epitopes with non-allergenic and positive immunogenicity behavior were selected for further analysis. In addition, the characteristics of the allergenicity, antigenicity, toxicity, and mutation were examined for the selection of HTL epitopes. Finally, according to all these characteristics, sequences IAPYLGFGF, KRIGNGDTL, WQANPYVGL, and ALLWQANPY as CTL epitopes and GNGDTLSIDGKNYQQ, ANEIRNDNKYEWMPV, DIAPYLGFGFAPKIS, and YLGFGFAPKISKNWG as HTL epitopes were selected to participate in the structure of the vaccine. Amino acid sequences that are linear in shape are called continuous or linear epitopes (Abboud et al. 2009) which were predicted by ElliPro, ABCpred, Svmtrip, and BCpred servers. Through the epitope prediction servers carried out in this project we were able to compile 82 potential linear B cell epitopes in CarO protein. Among promising linear B cell epitopes, four B cell epitopes were selected based on their features such as antigenicity and allergenicity.

In the construction of vaccines, the selected epitopes were connected by HEYGAEALERAG and GGGS linkers. Each of these constructions was connected to 4 different adjuvants with the EAAAK linker. Finally, after examining the allergenicity, antigenicity, and solubility behaviors vaccine construct 3 which had HBHA adjuvant was selected due to non-allergic in nature and high allergenicity and antigenicity score to prediction the second and third structures.

To ensure effective binding between HLA molecules and vaccine construct 3, a docking study was performed. The vaccine construct 3 was subjected to the PatchDock server as a receptor and 6 different HLA alleles including 1A6A(HLA-DR B1*03:01), 3C5J(HLA-DR B3*02:02), 1H15(HLA-DR B5*01:01), 2FSE(HLA-DRB1*01:01), 2Q6W(HLA-DR B3*01:01), and 2SEB(HLA-DRB1*04:01) as a ligand were selected for docking analysis. All the selected HLA showed low binding energy with V3 which was biologically significant. HLA-DR B1*01:01 was found to be best considering the free binding energy. Moreover, to display interactions among human TLRs (TLR2 and TLR4) and the vaccine structure 3, HADDOCK server were applied for molecular docking. Docking results demonstrated that the multi-epitope vaccine we designed has the potential to bind the mention receptors. Also, the PDBsum server showed 22 and 23 hydrogen bonds between the vaccine and TLR2 and TLR4 respectively. Hydrogen bonds can increase the strength and stability of a vaccine-receptor interaction substantially (Fox et al. 2013). Physicochemical properties of the V3 sequence were analyzed using ProtParam server which showed nature of V3 is appropriate for vaccine construct. Finally, the vaccine construct V3 was reverse transcribed and adapted for E. coli strain K12 former to insertion within pET28a (+) vector for its cloning and expression. In our study, our in silico results were based on various databases. We suggest using model animals for further analysis and experimental validation of this designed vaccine.