In silico anti-quorum sensing activities of phytocompounds of Psidiumguajava in Salmonellaenterica serovar Typhi

Biofilm contributes hugely to the persistence of typhoid fever in human population and quorum sensing (QS) is an integral mechanism involved in biofilms. Interruption of the QS network has therefore been put forward as one of the important anti-virulence strategies. Methanol extract of Psidium guajava leaves has been confirmed to possess antibacterial and anti-biofilm activities against Salmonella Typhi. This study therefore aimed at investigating the interactions of phytocompounds previously identified in the extract with selected QS proteins of S. Typhi in silico. Appropriate formats of compounds were retrieved and translated using online web servers. Quantitative estimate of drug-likeness, as well as absorption, distribution, metabolism, excretion and toxicity profiles of the compounds, were assessed on ADMETlab 2.0. Three-dimensional structures of two QS proteins of S. Typhi were obtained from Protein Data Bank while others were modelled on SWISS-MODEL. Selected compounds (ligands) were docked with the four proteins via AutoDock 1.5.6 and analyzed on Discovery studio. Eight, out of the seventy-two, phyto-compounds of methanol extract of P. guajava possess desirable drug-likeness (QED > 0.67). Three of them have toxic characteristics and thus, were removed from further consideration. Molecular docking revealed that, of the 5 ligands docked against the proteins, only Benzeneethanamine, 4-methoxy- and Cyclopentadecanone, 2-hydroxy- had affinities for the proteins of interest. The affinity of Cyclopenftadecanone,2-hydroxy- for each of the proteins is higher than that of Benzeneethanamine,4-methoxy- with hydrogen bonds contributing significantly to the interactions. Benzeneethanamine, 4-methoxy- and Cyclopentadecanone,2-hydroxy- from Psidium guajava leaves possess inhibitory properties against QS proteins of S. Typhi.


Introduction
Salmonella enterica serovar Typhi (S. Typhi) is a foodborne pathogen responsible for causing typhoid fever, a disease that constitutes a major global health concern [3]. A major feature of S. Typhi infection is its host restriction to humans which means that the infection cycle begins and ends within the human host [16]. Typhoid fever has persisted in the human population through the development of carrier status by some infected individuals who continue to transmit the causative bacterium even after recovering from the clinical symptoms of the disease. Formation of biofilm in the epithelial cells of the gall bladder as well as gallstones, where present, have been implicated in the carrier status and antimicrobial resistance in typhoid fever [18,19].
A biofilm is a community (colony) of microorganisms that adhere to each other and to a solid (inert or live) surface that is encased in a self-initiated protective extracellular matrix, containing polysaccharides, proteins, 1 3 extracellular microbial DNA and enzymes, ions and nutrients from the environment [4,7,9,19]. Biofilm contributes to the failure of anti-infective therapy by serving as a reservoir of resistant microorganisms as bacteria in biofilms are up to 10 3 more resistant to antibiotics than their planktonic counterparts [36]. Due to the heterogeneous nature of biofilms, several mechanisms have been reported for increased antimicrobial resistance in biofilm structures. These include (1) Low diffusion of antibiotics across the extracellular matrix resulting in low intracellular concentration of antibiotic(s). This is due to the presence of glycocalyx, an integral part of bacterial biofilm that accumulates antimicrobial substances and limits the transportation to the organisms within the colony [38], (2) phenotypic and physiological changes due to slow growth rate and starvation responses (oxygen, nutrient deprivation or environmental stress-antibiotics are not active against dormant cells; (3) the expression of efflux pumps (EP) that decrease intracellular antimicrobial concentration; (4) the emergence of persister cells, a biofilmspecific phenotype which are multi drug-tolerant cells that have not acquired genetic resistance [4,7,13,14], Soto et al. [39]; (5) alteration in gene expression and (6) increase in horizontal transmission of resistance elements (Koopman et al., [25]). S. enterica is known to form biofilms on both living and non-living surfaces [19]. This characteristic has been reported to be very important for the survival of pathogenic bacteria within the human body as both innate and adaptive immune responses might not be able to eliminate pathogens within the well-established biofilm [9]. Enteric pathogens, salmonella inclusive, are resistant to the bile's detergent-like property and therefore can survive in the gall bladder. In addition, they respond to the bile environment by regulating other resistance-related genes [19]. An important mechanism for the formation of bacterial biofilm is quorum sensing [37].
Quorum sensing or signalling (QS) is a cell population density-dependent phenomenon that defines intra-and inter-specie cellular communications between bacteria which enables them to respond and adjust to their environment by regulation of certain gene expressions [31,32,37]. In Salmonella spp, QS is responsible for bile adaptation, enhancement of persister cells and biofilm (Walawalkar et al., [43]), expression of flagella genes [11] and normal expression of salmonella pathogenicity island-1 (SPI-1), a virulence factor), [10]. QS is also involved in the production of secondary metabolites and stress adaptation mechanisms such as bacterial competition systems including the secretion systems (SS) [32]. QS in bacteria is mediated by the production of signalling protein molecules referred to as auto-inducers (AI). AIs are normally produced at an elemental level during the exponential growth phase of bacteria. However, at a cell density-dependent threshold concentration (i.e. the quorum level), they accumulate intra-cellularly via ATP-binding cassette (ABC) transporter and activate receptor proteins (transcription regulators) leading to a cascade of processes that results in the regulation of certain gene expression [46].
Due to the integral involvement of QS in the formation and maintenance of biofilm, interruption of the QS network has been put forward as one of the important antivirulence strategies [36] used to target biofilm-forming bacterial by any of the following means: (1.) inhibition of synthesis of QS signalling molecules,(2.) blocking the transport of AI into the cell; (3.) inhibition of intracellular processing of AI and (4.) inhibition of receptor proteins.
There abounds, in the literature, evidence of anti-bacterial and anti-biofilm properties of plant materials that can be harnessed for combating antimicrobial resistance of pathogenic organisms by interrupting the biofilm formation system. For instance, various parts of P. guajava have been reported for anti-biofilm activities against Staphylococcus aureus (250 μg/mL of pulp extract) [12], Streptococcus mutans and Streptococcus gordonii (0.78 mg/mL of leave extract) [28]. Fu et al. also reported that aqueous extract of Herba patriniae exhibited an inhibitory effect on almost all the studied biofilm-related genes of Pseudomonas aeruginosa with a significant reduction in the formation of biofilm as well as an alteration in the structure of the mature biofilms. Exopolysaccharide production and swarming motility were also impaired in the disfavour of biofilm formation [15].
Psidium guajava L (family-Myrtaceae; common name-guava), a fruit-bearing plant of the tropics, has been variously reported for traditional treatment of typhoid fever [30] as well as other antibacterial activities. Methanol extract of P. guajava leaves has also been confirmed to possess antibacterial and anti-biofilm activities against S. Typhi [1]. Previously, we identified the phyto-compounds in the methanol extract of P. guajava leaves and their interactions with a protein involved in replication of S. Typhi [2]. In this study, we investigate the interaction between those compounds and key proteins involved in QS in the typhoid bacteria to identify molecules that could be investigated further in the quest for a new drug for tackling typhoid fever.

Methods
An overview of the stages involved in this study is presented in Fig. 1. The final docking scores of these phytochemicals are compared to those of ciprofloxacin which is currently used for the treatment of typhoid fever.

Ligands preparation
The ligands are compounds previously identified in the methanol extract of P. guajava leaves [2]. All seventy-two identified compounds were considered and coded 1-72. The Simplified Molecular-Input Line-Entry Specification System (SMILES) formats of the ligands were obtained from PubChem [22] and translated to PDB format using the CACTUS online translator [8]. Ligands were converted to PDBQT formats on OpenbabelUI software.

Drug-likeness and ADMET profiling of ligands
The drug-likeness or otherwise of each compound was assessed via the QED (Quantitative Estimate of Druglikeness). ADMET (Absorption, Distribution, Metabolism, Excretion and Toxicity) profiles of compounds with desirable QED values were also predicted. Both QED and ADMET were evaluated using ADMETlab 2.0 online webserver [47]. Only compounds with desirable QED and no predicted toxicity were selected for molecular docking.

Retrieval from Protein Data Bank
The three-dimensional crystal structures (Fig. 2) of two QS proteins of S. Typhi were obtained from PDB (Protein Data Bank) (LuxS, PDB Id: 5v2w; LsrB, PDB Id: 5GTA).

Homology modelling
Two proteins (LsrK and SdiA) whose 3-D structures were not available in PDB at the time of retrieval, were modelled using the SWISS-MODEL homology webserver [44]. Amino acid FASTA sequences of LsrK (530aa, Accession: QKX96401.1) and SdiA (240aa, Accession: QKX97843.1) were retrieved from the website of the National Centre for Biotechnology Information for modelling. SWISS-MODEL performs all the stages of modelling viz: (i.) template

Molecular docking
To study the interaction between selected compounds (ligands) and the selected QS proteins of S. Typhi (protein), molecular docking was carried out using AutoDock Vina 1.5.6 software. The proteins were first prepared by adding polar hydrogen atoms, calculating Kollman charges (Table 1) and saving them in PDBQT format on the AutoDock tool [40]. The grid box was set to cover the entire surface of the protein to allow for blind docking. The prepared and optimized ligands were blindly docked in the grid box of the protein to allow ligands to find any favourable binding site on the protein. Ligands were ranked based on their binding affinities with higher negative values signifying better affinity. Ligand-protein interactions were visualized using Discovery Studio Visualizer 2020 software.

Homology modelling
LsrK of S. Typhi was modelled using a crystal structure of Escherichia coli K-12 LsrK and HPr complex (PDB id: 5YA0) as a template, having a sequence identity of 83.02% and a GMQE (Global Model Quality Estimate) value of 0.89 with QMEAN (Qualitative Model Energy ANalysis) of 0.88 ± 0.05. Analysis of the Ramachandran plot of the modelled protein for the ϕ and ψ angles of individual amino acid residues revealed that 95.55% of residues were present in the favoured region while 3.64% and 0.81% residues were in the allowed and outlier regions respectively (Fig. 3). Apart from the Ramachandran outliers, the protein also has Rotamer outliers of about 1.01%. The total number of bad angles present in the chain is 78 out of 5,252. The template for SdiA of S. Typhi was a 3-D crystal structure of E. coli SdiA in complex with 3-oxo-C6-homoserine lactone (PDB id: 4Y15) with a sequence identity of 72.08% and a GMQE value of 0.88 with QMEAN of 0.85 ± 0.05. The Ramachandran plot and the 3-D structure of the model are presented in Fig. 4. Analysis of the Ramachandran plot of the modelled protein for the ϕ and ψ angles of individual amino acid residues revealed that 97.01% and 2.99% residues were present in the favoured and allowed regions respectively. There was no Ramachandran outlier while Rotamer outliers were about 0.48%. The total number of bad angles present in the chain is 16 out of 5398.

Molecular docking
Molecular docking revealed that only two compounds (Benzeneethanamine, 4-methoxy-and Cyclopentadecanone, 2-hydroxy-) have an interaction with each of the four proteins studied ( Table 4). The energy of affinities of these two compounds was only lower than that of ciprofloxacin in their interactions with SdiA. For all the proteins, Cyclopentadecanone, 2-hydroxy-has a better (lower) affinity energy than Benzeneethanamine, 4-methoxy-. Analysis of the docking output (Table 5 and Fig. 5) revealed that, except for SdiA, Benzeneethanamine, 4-methoxy-interacts with chain A of each protein with more hydrogen bonds than with any other type of (non-covalent) bonds. Almost all the interactions are with polar amino acids. For Cyclopentadecanone, 2-hydroxy-, analysis of the docking output (Table 6 and

Discussion
The process of drug discovery has been made easier with the help of computational drug designing which uses various computational methods to predict interactions between potential drug targets and small molecules to determine the best lead compounds. Virtual screening of the drug-likeness of small molecules is being estimated using the physicochemical properties of such molecules. Many researchers have come up with different sets of rules regarding the suitability or otherwise of compounds for lead-likeness based on their physicochemical properties, however, such rules are based on a few of those properties. For example, Lipinski's Rule of 5 is based on 4 of those parameters (molecular weight, LogP, Number of hydrogen bond acceptors and donors) [26]. Conversely, QED is a multi-criteria optimization that offers quantitative metrics based on 8 physicochemical parameters and offers a clear-cut distinction between desirable and undesirable compounds (Bickerton et al. [5]). Although many of the compounds studied did not violate more than one of Lipinski's Rule of 5, only eight of them have the 'desirable' value of QED and were thus selected for ADMET profiling. This would ensure that scarce resources are not wasted on molecules which are likely to be abandoned at a later stage of drug design.
Given the pivotal role played by the QS network in the development of both carrier status and antibiotic resistance in typhoid fever, and indeed other important bacterial infections, QS inhibitors are gaining more popularity as prospective alternatives in combating bacterial diseases (Jiang et al., [23]). Proteins in the QS network of S. Typhi selected for study are LuxS protein (synthesis of AI-2), LsrB (binding and transport of AI-2 into the cell), LsrK (intracellular phosphorylation of AI-2) and SdiA (transcription factor which is the receptor of the processed AI). Although LuxS does not affect the growth of Salmonella spp., it plays a role in the formation of biofilm as it catalyzes a crucial step in the synthesis of AI-2. It also controls the expression of polarization of the bacterial phase variation [33], the catalytic site of LuxS comprises a Zn 2+ coordinated by HIS: A54, HIS: A58 and CYS: A126. These amino acids are crucial for its catalysis [20]. Our molecular docking results show that Benzeneethanamine, 4-methoxy-bind HIS: A58 of LuxS with both Pi-Akyl and Pi-Pi T-shaped bonds and this could inhibit the protein. There is also a hydrogen bond at the neighbouring GLUA:57. Also, Benzeneethanamine, 4-methoxy-also interacts with TRY B:95, ALA B:110 and GLY B:118 while Cyclopentadec anone, 2-hydroxy-interacts with ASP A:80, all of which form part of the binding site of SdiA [17]. Better interactions of the compounds with SdiA (− 6.20 kcal/mol for Benzeneethanamine, 4methoxy-and − 7.70 kcal/mol for Cyclopentadecanone, 2-hydroxy-) relative to ciprofloxacin (− 4.30 kcal/mol) indicate that they fit well into the binding pockets of the protein and makes them promising S. Typhi QS inhibitors as inactivation of the processed AI receptors has been proven to reduce virulence in some pathogenic bacteria [45,48]. Interestingly Cyclopentadecanone, 2-hydroxyhas been previously reported to inhibit DNA gyrase, another important protein involved in the replication of S. Typhi [2]. Thus, Cyclopentadecanone, 2-hydroxy-, and Benzeneethanamine, 4-methoxy-are being considered for lead optimization.

Conclusion
QED, being a multi-criteria optimization, gave a numerical output which made it straightforward in determining the drug-likeness of the phytocompounds studied. Only eight out of seventy-two compounds had drug-likeness desirability (QED > 0.67). Prediction of ADMET profile accords the removal of compounds that could later be withdrawn if made into drugs. This led to discontinuation with three compounds due to predicted human hepatotoxicity, drug-induced liver injury and carcinogenicity. Benzeneethanamine, 4-methoxy-and Cyclopentadecanone, 2-hydroxy-from P. guajava leaves show remarkable interaction with all four QS proteins of S. Typhi studied with negative binding energies. Future research focused on molecular dynamics of the interactions as well as optimization of the two compounds will give more insight into the possibility of exploring them through in vitro and in vivo experiments against the deadly salmonella specie.
Author contributions All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by TDO. The first draft of the manuscript was written by TDO and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Funding
The authors did not receive support from any organization for the submitted work.
Data availability All data generated during this study are contained in this published article.

Conflict of interest
The authors declare no conflict of interest.

Ethical approval Not applicable.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creat iveco mmons. org/ licen ses/ by/4. 0/.