Identification of novel mazEF/pemIK family toxin-antitoxin loci and their distribution in the Staphylococcus genus

Bukowski, Michal; Hyz, Karolina; Janczak, Monika; Hydzik, Marcin; Dubin, Grzegorz; Wladyka, Benedykt

doi:10.1038/s41598-017-13857-4

Identification of novel mazEF/pemIK family toxin-antitoxin loci and their distribution in the Staphylococcus genus

Article
Open access
Published: 18 October 2017

Volume 7, article number 13462, (2017)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Identification of novel mazEF/pemIK family toxin-antitoxin loci and their distribution in the Staphylococcus genus

Download PDF

2907 Accesses
10 Citations
1 Altmetric
Explore all metrics

Abstract

The versatile roles of toxin-antitoxin (TA) systems in bacterial physiology and pathogenesis have been investigated for more than three decades. Diverse TA loci in Bacteria and Archaea have been identified in genome-wide studies. The advent of massive parallel sequencing has substantially expanded the number of known bacterial genomic sequences over the last 5 years. In staphylococci, this has translated into an impressive increase from a few tens to a several thousands of available genomes, which has allowed us for the re-evalution of prior conclusions. In this study, we analysed the distribution of mazEF/pemIK family TA system operons in available staphylococcal genomes and their prevalence in mobile genetic elements. 10 novel m azEF/pemIK homologues were identified, each with a corresponding toxin that plays a potentially different and undetermined physiological role. A detailed characterisation of these TA systems would be exceptionally useful. Of particular interest are those associated with an SCCmec mobile genetic element (responsible for multidrug resistance transmission) or representing the joint horizontal transfer of TA systems and determinants of vancomycin resistance from enterococci. The involvement of TA systems in maintaining mobile genetic elements and the associations between novel mazEF/pemIK loci and those which carry drug resistance genes highlight their potential medical importance.

Single-molecule sequencing reveals the molecular basis of multidrug-resistance in ST772 methicillin-resistant Staphylococcus aureus

Article Open access 16 May 2015

Prediction of Type II Toxin-Antitoxin Loci in Klebsiella pneumoniae Genome Sequences

Article 10 December 2015

A megaplasmid family driving dissemination of multidrug resistance in Pseudomonas

Article Open access 13 March 2020

Introduction

Toxin-antitoxin (TA) systems are widespread among bacteria, but their physiological roles have only recently been revealed^1,2,3,4,5. The involvement of these systems in the stress response and the induction of dormancy have been described, and they have also been linked to virulence. However, given the number, variety and broad distribution of TA systems, it is clear that our current understanding only touches the tip of the iceberg.

TA systems were initially described as components of low-copy number plasmids that are involved in the stable propagation of these genetic elements in bacterial populations^6,7,8. Because the presence of a long-lived toxin is deleterious, it was thought that cells that are unable to produce a labile antitoxin (i.e. as a result of the loss of the encoding plasmid) would be eliminated. However, the more recent discovery that numerous TA systems are encoded within bacterial chromosomes has sparked a largely unsettled debate regarding their broader role in bacterial physiology^{9,10,11,12,13}.

In contrast to our insufficient understanding of the physiological importance of TA systems, the molecular mechanisms of their activity are relatively well understood. In the majority of cases TA systems consist of two components that are ordered within a single operon with the antitoxin gene preceding the toxin gene. Their open reading frames are often partially overlapping. Six types of TA systems have been distinguished based on the nature of the antitoxin (RNA or protein) and its mechanism of action, whereas the toxin component is in all known cases a protein^3,14. Type II TA systems are composed of protein antitoxin and toxin pairs and are the most prevalent and best-characterised TAs¹⁵. Under normal conditions, the toxin and antitoxin exist in equilibrium, and the toxin remains dormant. Any disturbance to this equilibrium, regardless of the inducting agent, manifests as a reduction in antitoxin stability, which results in the toxin being unleashed. TA toxins can interfere with important cellular processes, including replication and translation, or induce damage to the cell membrane. There is a significant controversy regarding whether this process reflects a fine-tuned regulatory mechanism or results in unspecific cell “poisoning”. In any case, as a consequence of the toxin’s activity, the cell enters a dormant state. When plasmid loss occurs, the resulting prolonged dormancy leads to cell death^2,16. However, although rescue mechanisms should exist for the majority of TA systems, in which dormancy is used as an important survival mechanism, the purpose of dormancy remains a matter of lively debate.

The number of TA systems that are encoded in the genome of a particular species of bacteria varies significantly and can range from none (e.g., in Mycobacterium leprae) to several dozen (e.g., in M. tuberculosis)^17,18. The rationale for this phenomenon is largely unknown, but more TA systems are carried by virulent pathogens than their non-virulent counterparts, despite the size of their genome, which is usually smaller in virulent species¹⁹. Despite the significant variety that has been observed among TA systems, certain characteristic features allow bioinformatics to be successfully used to identify novel systems^20,21,22,23. A large, relatively recent study by Makarova et al. revealed that there are numerous type II TA systems in prokaryotes²⁴. Massive, parallel sequencing has since provided a large amount of genetic data, which prompted us to re-evaluate the distribution and penetrance of TA systems in medically important bacteria in the Staphylococcus genus.

Staphylococci are Gram-positive bacteria that are associated with humans and other animals. The majority of species coexist with their hosts as harmless commensals, but S. aureus and S. pseudintermedius are both dangerous and opportunistic pathogens that are characterised by their increasing drug resistance^25,26. Current data indicate that staphylococci carry relatively few TA systems. Two paralogues of the YefM/YoeB (yefM/yoeB) system were identified in S. aureus ²⁷. MazEF (mazEF-Sa) was found in both S. aureus and S. eqorum ^10,28. Moreover, the PemIK (pemIK-Sa1) system, which is related to the canonical mazEF-Sa system, was identified in a plasmid that was initially isolated form S. aureus CH91 and later found in the chromosomes of S. pseudintermedius ¹². These results suggest that this system is mobile. It therefore seems that there are likely to be more such examples in staphylococci.

Here, we analysed currently available staphylococcal genomes to assess the distribution of mazEF/pemIK family TA system operons and their prevalence in mobile genetic elements. Five computational approaches were evaluated which enabled us to identify 10 novel mazEF/pemIK systems. A chromosomally encoded mazEF-Sa was found in virtually every evaluated staphylococcal genome, suggesting that this element is physiologically important to this group. PemIK systems were less frequent but more diverse. Interestingly, pemIK loci were often located in mobile genetic elements in the vicinity of drug resistance genes. Moreover, pemIK-Sa1 was preferentially identified in poultry-associated strains.

Results

General results

All 6,132 staphylococcal genomes (representing 36 species) that were published at the time of the study were screened for mazEF/pemIK loci. Data collection was substantially biased towards S. aureus (5,540 genomes, 90%) because more sequences were available for this clinically significant pathogen. A total of 22,321 open reading frames (ORFs) were identified on average (standard deviation, SD: 2,918) in each genome. Extensive blastp, psiblast, rpsblast and deltablast searches resulted in the identification of 73, 83, 101 and 110 two-gene clusters, respectively, and 4, 14, 38 and 41 of these clusters, respectively, were the unique result of a single approach. All clusters were characterised by a high average co-localisation coefficient for member genes (0.65, SD 0.25; 0.69, SD 0.25; 0.70, SD 0.25 and 0.71, SD 0.26; respectively). Remarkably, all four of the approaches returned exactly the same set of pemIK homologues (Fig. 1; Suppl. tab. 4). The same results were obtained when tools designed to detect moderate (blastp) and distant (psiblast, rpsblast and deltablast) homologues were used, indicating that staphylococcal mazEF/pemIK loci comprise a well-defined group of relatively closely related homologues. Among all 12 homologues, only 2 have been described previously, demonstrating the utility of our approach. None of the unique clusters met the exclusion criteria except for several that originated from a DELTA-BLAST search. These contained bicistronic operons that encoded toxins of known families that had never previously been detected in staphylococci, including five Fic/Doc toxins and one each of the following groups: PIN-domain, ParE and Xre toxins. The identification of such distant homologues indicated that our exclusion criteria were sufficiently broad to account for the majority (if not all) of the mazEF/pemIK loci in the analysed genomes. PemK-Sa6 is so distantly related to the other analysed proteins that its toxin domain was not detected using rpsblast, which relies on direct domain identification within a protein sequence. The operon was, however, identified by all other approaches we used, demonstrating the power of parallel analysis.

Distribution and genetic context of mazEF/pemIK loci

The distribution of different TA homologues among species and strains of staphylococci was non-uniform (Tables 1 and 2). Chromosomally encoded mazEF-Sa was present in virtually every analysed staphylococcal genome (99.82% of the tested strains) and preceded the operon that encoded the alternative RNA polymerase σ^B subunit. We believe that the lack of mazEF-Sa in the remaining 11 of 6,132 tested strains is most likely the result of incompleteness in shotgun sequencing data. pemIK-Sa1 was also characterised by a broad species distribution (positive in 12 out of 36 species tested), but its penetrance was relatively low (only 1.06% of all tested strains). Its low penetrance may be associated with the plasmid-encoded character of pemIK-Sa1. However, in certain species (e.g., S. delphini, S. intermedius and S. pseudintermedius), these loci were localised in the chromosome. The 65 identified pemIK-Sa1 loci that were present in the examined genetic material could be divided into 4 general groups (representing 57 genomes in total; Fig. 2). In the remaining 8 strains, the context was different in each strain (Figs 3 and 4b). Operons related to beta-lactam, arsenic or mercury resistance were identified in the close vicinity of many pemIK-Sa1 loci (Figs 2 and 3b,c), but the significance of this finding remains unknown. Additionally, the close neighbourhood of many of the pemIK-Sa1 loci contained genes that encode factors known to be involved in DNA mobilisation, such as DNA invertases, resolvases and transposases (Figs 2a,b,d and 3a–d), suggesting that these elements are potentially mobile. In plasmids (and contigs of likely plasmid origin), the pemIK-Sa1 loci often coexisted with yefM/yoeB loci (Figs 2a,d and 3e), which also belong to class II TA systems. However, the activity of the yefM/yoeB loci involves ribosome-dependent endoribonuclease toxins²⁷. In one instance, two pemIK-Sa1 loci neighboured a pemIK-Swar locus (Fig. 4b).

Table 1 Summary of mazEF/pemIK loci occurrence among the analysed species and strains.

Full size table

Table 2 Brief summary of the distribution of mazEF/pemIK loci among the analysed strains.

Full size table

All other identified loci were characterised by restricted distribution and/or penetrance. A set of mazEF/pemIK loci that was characteristic of a single species was identified (Fig. 5). The pemIK-Smic locus is encoded in the S. microti chromosome near a multi-drug transporter (Fig. 5a). pemIK-Sa2 – Sa5 loci were characteristic of S. aureus (Fig. 5b–e) and were characterised by very low penetrance. Even though a large number of S. aureus genomic sequences are available, these loci were identified in only one or a few strains, implying that pemIK-Sa2 – Sa5 may later be discovered in other species, provided that sufficient effort is made to expand sequencing in these species.

The low prevalence mazEF/pemIK homologues are interesting in terms of the evolution of these genetic elements. pemIK-Sa2 is an example of a horizontal transfer among unrelated bacterial species. This TA system is relatively common in the mobile genetic elements of Agrobacterium, Neorhizobium and Rhizobium species (Fig. 5b). Close homologues (70–80% sequence similarity) have been found in Burkholderia, Paraburkholderia and Pseudomonas, but none have been identified in species that are more closely related to staphylococci, either evolutionarily or according to an ecological niche. This finding demonstrates that TA systems may switch hosts in a highly unrestricted manner. Even more interesting, pemIK-Sa3 was found in only a single Staphylococcus strain (Fig. 5c). This operon was relatively widespread in the plasmids of E. coli, K. pneumonie, S. dysenteriae and S. enterolytica but was never previously identified in a Gram-positive bacterium. Whether this reflects a relatively recent interspecies jump or is simply an effect of contamination of sequencing sample remains to be determined.

pemIK-Sa4 and Sa5 represent horizontal transfer from enterococci. Both occurred together in the putative plasmid contigs of the three following strains: NRS2 and two vancomycin-resistant Staphylococcus aureus (VRSA) strains, VRS2 and VRS7. The pemIK-Sa4 ancestor was found in the pEF123 plasmid of Enterococcus faecalis EF123, whereas the pemIK-Sa5 loci were present in the pTEF1 plasmid of E. faecalis V583 and plasmid 3 of E. faecium DO. Interestingly, the genomes of NRS2, VRS2 and VRS7 contained other putative plasmid contigs that likely originated from the pLG2 and pWZ909 plasmids of E. faecalis, and pJEG040 of E. faecium. Even more compelling, the two latter plasmids carry determinants of vancomycin resistance and therefore merit a more extended analysis of their history of transfer and evolution. In terms of MazF/PemK toxin evolution, the PemK-Sa4 and PemK-Sa5 sequences are not closely related (Fig. 1). It is interesting that similar to a number of pemIK-Sa1 loci, the pemIK-Sa4 loci are located in close proximity to loci of another TA system, dinJ/yafQ, whereas pemK-Sa5 are found in the direct vicinity of a transposase gene (Fig. 5d,e).

Although it was identified in only one S. aureus genome, pemIK-Sa6 is exceptionally interesting. Here, the TA locus was found in the staphylococcal cassette chromosome SCCmec_Al16, which was previously described in S. pseudintermedius AI16²⁹ and is in close vicinity to a teicoplanin resistance-related gene (Fig. 6). SCCmecs are mobile genetic elements that have been associated with multi-drug resistance in methicillin-resistant Staphylococcus aureus (MRSA)³⁰, but their association with TA systems has not been previously reported. PemK-Sa6 is so distantly related to other analysed toxins that its toxin domain has not been detected in rpsblast, which relies on direct domain identification within a protein sequence. The operon was, however, identified by all other approaches we used, demonstrating the power of parallel analysis. Because PemK-Sa6 does not contain a PemK domain that was detectable using domain homology searches, these loci were originally annotated as two unrelated hypothetical proteins in the genome of S. aureus UCIM147 and in SCCmec _AI16 of S. pseudintermedius AI16, demonstrating the utility of focused studies similar to this one. Although the significance of the close genetic linkage observed between pemIK-Sa4, Sa5 and Sa6 loci and antibiotic resistance elements in MRSA/VRSA remains unclear, these findings clearly merit further investigation.

pemIK-Smic was found only in S. microti and is described above in more detail. The other identified loci were present in more species and were named after the most prevalent one. pemIK-Scap was identified in two S. capitis strains and one strain of S. epidermidis in a plasmid and two likely plasmid contigs, respectively. These loci were not characterised by a common genetic context (Fig. 7), suggesting that they are likely mobile. pemIK-Scar was identified in the chromosomes of S. carnosus, S. condimenti and S. simulans. Unlike pemIK-Scap, pemIK-Scar was present in all of the species within a common genetic context. This conserved cassette exceeds ~4 kbp and contains, in addition to pemIK-Scar, two ribosomal protein (S18 and S6)-encoding genes, a putative DNA-binding protein gene, thermonuclease and carbamoyltransferase genes (Fig. 8). pemIK-Ssci was identified in several S. sciuri strains in likely plasmid contigs. Interestingly, two such loci were present in a single contig. Additionally, pemIK-Ssci was identified in a single contig in a S. intermedius strain in which it neighboured a fosfomycin resistance gene (Fig. 9). Finally, pemIK-Swar was identified in several S. warneri plasmids and likely plasmid-derived sequences and a single plasmid sequence that was derived from S. aureus (Fig. 4a). Additionally, three of these loci were located in putative megaplasmids. One such megaplasmid that was identified in S. warnerii contained two pemIK-Swar loci and was also neighboured by one pemIK-Sa1 locus. This instance provides a unique example in which three pemIK loci were located in a single plasmid. Another pemIK-Swar-containing megaplasmid was identified in S. aureus and found to contain only a single TA locus (Fig. 4b).

Host specificity of pemIK-Sa1

We next sought to determine whether any detectable species preference could be associated with any particular TA system. The mazEF-Sa loci were present in all examined strains. Loci other than pemIK-Sa1 were excluded from analysis because of their low penetrance, which precluded a meaningful statistical analysis. Therefore, the analysis was limited to pemIK-Sa1. The operon of this TA system was first described in the pCH91 plasmid of a poultry-associated strain¹². Here, we show that the loci have a clear preference for strains that originate from animals (i.e. non-human origin strains). The reference distribution of all strains across all carrier hosts was significantly different from the distribution of pemIK-Sa1-carrying strains (χ ² test p = 0.5, Suppl. Fig. 1). Among the analysed pemIK-Sa1-positive strains (65 in total), there was a significant host preference for turkey (Meleagris gallopavo) and house and steppe mice (Mus musculus, Mus spicilegus). This correlation was also reflected in yet another analysis. A phylogenetic tree for all tested strains that was constructed using a gapless rpoB alignment (Suppl. Fig. 2) contained a distinct branch of closely related strains (46 in total) that were isolated within a short period of time in a single geographical location (Germany) from chickens, cows, humans, and turkeys. The relationships within this group are so close that the strains were indistinguishable in an rpoB analysis and by spa typing³¹ apart from individual cases. Interestingly, within this uniform group, the pemIK-Sa1 loci were found only among poultry isolates. This result is significant because the probability of such a chance distribution is only 5%.

Unique features of mazEF-Sa

The 100% penetrance observed for the mazEF-Sa loci in staphylococci is exceptional when compared to our results for other pemIK loci. A low diversity of MazF-Sa sequences was evident in our analysis of evolutionary distances (Fig. 1) and represents another substantial distinction between the mazEF-Sa and other pemIK loci, in which the toxins are much more diverse. Interestingly, the results of a phylogenetic tree constructed using a gapless multiple sequence alignment of mazF-Sa sequences (Suppl. Fig. 4) showed that these sequences had the highest log-likelihood of and nearly the same topology as the trees based on rpoB and saoC. Hence, the diversity observed within mazF-Sa genes reflects phylogenetic relatedness among species and had a precision comparable to that obtained using rpoB (Suppl. Fig. 2) or saoC (Suppl. Fig. 3), which are genes with proven utility for tracing phylogenetic relatedness among staphylococci^32,33. However, the relatedness among the PemK toxin sequences was not similar to the phylogenetic relatedness between strains and species (Fig. 1). Overall, these data indicate that the mazEF-Sa was acquired in a distant past and is currently stably co-evolving within staphylococcal genomes, whereas pemIK loci were spread by the horizontal transfer and evolved separately in their host genomes.

In vitro evaluation of the distribution of mazEF/pemIK loci in a diverse collection of staphylococcal strains

Although it is most unlikely to occur when using a dataset with a size and diversity similar to that used in this study, one could argue that sequencing or selection bias may have affected the distributions observed in our results. We therefore evaluated our results by experimentally testing the distribution of selected loci in a diverse collection of staphylococcal strains. We designed specific primers based on sequence clustering to detect particular mazEF/pemIK loci. PCR was used to determine whether they were present or absent in the genetic material of the tested strains. We carefully designed degenerate primers (Suppl. Table 3) to detect mazEF-Sa loci in the tested strains. Because of the high degree of diversity we observed in the sequences, it was not possible to design common primers that would always detect pemIK-Sa1 loci. Instead, we used a mix of multiple primers that were designed to detect different subgroups, and the results confirmed that a single species possessed pemIK-Smic and that multiple species possessed pemIK-Sa1 and pemIK-Scar. In this latter group, the distribution among different spcies was generally supportive of the results of our previous in silico analysis. We were also able to detect the presence of pemIK-Ssci and pemIK-Swar in single strains (Fig. 10; Suppl. Fig. 5).

Discussion

Results of homology search approaches

The significant impact of TA systems on bacterial physiology has been extensively investigated over the last three decades. Multiple genome-wide attempts have been made in an attempt to identify and classify new loci in various bacterial species. As a result of the development of massive parallel sequencing techniques, there has been a substantial increase in the number of complete bacterial genomic sequences in the last 5 years. Moreover, the last comprehensive analysis of staphylococcal genomes was performed 8 years ago by Makarova et al.²⁴; that is, before the era of massive sequencing. Even though the dataset analysed by Makarova and collaborators included all known archaeal and bacterial genomes, only 18 staphylococcal genomes were available at that time, whereas more than 6,000 were available when this study was initiated. The Makarova study identified only a single mazEF/pemIK family TA system, mazEF-Sa, in staphylococci and concluded that the entire Bacillales phylum was not abundant in TA loci²⁴. Until recently, only a single additional mazEF/pemIK homologue, pemIK-Sa1, was described in staphylococci¹². The recent acquisition of a large number of genomic sequences for staphylococci provided us with a fantastic opportunity to challenge these views and re-evaluate the status of mazEF/pemIK TA systems in this genus.

Analysing a large number of biological sequences requires time-efficient analytical tools, such as the heuristic approaches implemented in BLAST family algorithms. BLAST was first introduced in 1990³⁴ and has since evolved to include a broad array of tools^35,36,37. Approaches based on these tools now dominate database search routines. In this study, we used and compared the results of the most relevant approaches.

Our aim in performing a conservative cascade search using a protein BLAST approach was to identify loci characterised by close relatedness. This search was conducted based on a simple and intuitive parameter involving sequence similarity in which the threshold was arbitrarily set to 50%. Sequence similarity is inherently associated with each pair of sequences and is independent of score matrix and database size, opposed to the E-value (and the mathematically interrelated score value), which is the main statistic of significance provided with BLAST results and is calculated based on random simulations performed since the introduction of gapped BLAST³⁵. Protein BLAST is moreover not well tailored for searches for remote homologues^35,38,39. Only an iterated cascade search, which turns the outcomes of one search into a query for another search, enables the discovery of distant homologues. Our use of this approach resulted in the identification of 10 new mazEF/pemIK homologues within the analysed dataset of all available staphylococcal genomes. We considered this group a reference for extensive approaches aimed at discovering less-related sequences that present a risk of false positive results.

Many approaches incorporated in subsequent analyses have used BLAST tools and less stringent E-value thresholds. The E-value represents the number of hits with a score at least as high as the given one that would be expected purely by a chance³⁷. Hence, generously defining the E-value cutoff increases the hit rate but at the expense of false positives (i.e., randomly correlated sequences). In this regard, our approaches resembled those previously used by Sevin et al.²¹ and Makarova et al.²⁴. Obtained results were filtered to identify sequence pairs that encoded potential mazEF/pemIK TA systems. Our first approach was to use protein BLAST. The second approach involved PSI-BLAST, which tests for homologous sequences in an iterative fashion. In the first iteration, a simple protein BLAST search is performed, and all of the results that exceed a certain inclusion threshold are aligned and subsequently used to prepare a position-specific score matrix (PSSM). The PSSM serves as a query and is refined after each iteration. This approach allows the incorporation of information related to the substitution frequencies that are intrinsic to a particular family of protein sequences instead of a general substitution matrix, such as BLSUM62, which is the default matrix that is used for protein BLAST. Additional approaches used in this study included RPS-BLAST, which tests a query sequence against a database of PSSMs that have been predefined for various protein families, and DELTA-BLAST, which first queries a PSSM database with a sequence and then uses matched PSSMs to search a protein database. Use of the PSI-BLAST, RPS-BLAST and DELTA-BLAST approaches facilitate the discovery of distant homologues, whereas PSI-BLAST is carefully optimised to limit false positives to the greatest extent possible^35,39,40. This approach, especially the inclusion of PSI-BLAST-based cascade searches, is why remarkable levels of sensitivity and specificity can be achieved⁴¹. Creating a PSSM from scratch provides a considerable advantage that was demonstrated during our search for PemK-Sa6 sequences. Even though this sequence shares significant homology with the other staphylococcal PemK toxin sequences described in this study, no NCBI CDD conservative domain was detected within the sequence, likely because of specific amino acid substitutions (Fig. 11). This characteristic resulted in a false negative in the RPS search; indeed, PemK-Sa6 was only found because it is located in close vicinity to PemI-Sa6, an antitoxin with a detectable MazE domain. Moreover, PSI-BLAST readily identified PemK-Sa6 with no additional assumptions regarding its genetic neighbourhood. These findings for PemK-Sa6 demonstrate that although all extensive approaches return a similar general list of results (hence, only a single, arbitrarily chosen one is usually used, as in^21,24,42), a comprehensive analysis requires the parallel use of all of these approaches to ensure the identification of all rare examples.

Staphylococci carry at least 12 independent mazEF/pemIK homologues

In the end, all of the extensive methods tested in this study yielded results similar to those achieved using a conservative cascade search with protein BLAST. We identified 10 novel mazEF/pemIK homologues in addition to the previously known two. These results clearly indicate that in staphylococci, the mazF/pemK family forms a distinct and coherent group of relatively closely related members. Interestingly, the protein sequences of antitoxins within different TA families vary substantially and are therefore of limited use in finding homologues and devising a system of classification for TA systems. This proposal is in agreement with the previous findings of Makarova et al.²⁴ and supported by the findings presented here (Fig. 11).

Our threshold of 80% toxin protein sequence similarity to identify novel mazEF/pemIK homologues yielded sets (loci) in which the two most dissimilar sequences were of 95% similarity. This approach is compelling because the results of using this system of classifying mazEF/pemIK loci corresponded to their actual genomic context and the similarity threshold was above the similarity levels observed among toxin sequences encoded by distinct homologous loci, which is 78% between the two most similar observed sequences that belonged to different mazEF/pemIK homologues.

mazEF-Sa is undoubtedly unique because it is widespread throughout all staphylococcal genomes. Moreover, the sequence of the MazF-Sa toxin is far more conserved among different species than the sequences encoded by other pemIK loci. Additionally, we found that the phylogenetic relatedness of mazF-Sa sequences corresponded to the phylogenetic relatedness of the respective genomes. In fact, the chromosomal locations and strong coupling between the functions of the mazEF-Sa operon and the alternative sigma subunit B of RNA polymerase (SigB, σ^B)⁴³ suggest that there has been a long period of coevolution between these sequences. The mazEF-Sa operon proceeds the rsbUVWsigB operon, and the first is necessary for the full activity of SigB, whereas SigB negatively regulates mazEF-Sa transcription⁴⁴. Both operons are co-transcribed^44,45. Finally, the functions of mazEF-Sa that are associated with stress-induced persister cell formation and beta-lactam sensitivity⁴⁶ are highly correlated with the functions of SigB under stress conditions⁴⁷.

Maintenance of mobile genetic elements and pemIK loci

pemIK loci were initially recognised as plasmid maintenance systems^6,7,8. The loss of a TA-carrying plasmid leads to toxin release by degradation of the unstable antitoxin. A cell lacking a pemIK locus is unable to replenish the antitoxin pool, resulting in inhibited growth and eventual cell death. Hence, daughter cells that would not inherit a TA-encoding plasmid are not produced. The DNA maintenance property of TA systems is broader than that of the plasmids alone. Whereas most staphylococcal pemIK loci are located in plasmids (including megaplasmids or multiple loci per plasmid), pemIK loci have a stabilising effect on the genetic neighbourhood within the same bacterial chromosome^48,49. A pemIK-Sa2 locus is present in a chromid (a chromosome-like replicon that may exceed 1 Mbp in size) in Neorhizobium ⁵⁰. If pemIKs are capable of promoting the maintenance of these large genetic elements, it may also have a significant influence on the maintenance of antibiotic resistance. The pemIK loci have been found to be located in close proximity to antibiotic-resistance genes or in mobile genetic elements that carry such genes, especially the SSCmec element and probably plasmids that carry vancomycin-resistance genes. The maintenance of these genetic elements and the role played by TA systems in this process is clinically relevant in the context of MRSA and VRSA strains. Of particular interest are pemIK-Sa4 and pemIK-Sa5 loci. In S. aureus, vancomycin resistance has been proposed to arise as a result of horizontal transfer from vancomycin-resistant enterococci (VRE)^51,52. The exchange of fragments among plasmids and the aggregation of fragments into megaplasmids appear to be common occurrences, as shown in Fig. 4b for pDFAARGOS_115 of S. warneri. In this context, it seems likely that both the pemIK-Sa4 and pemIK-Sa5 loci were transferred from enterococci along with vancomycin-resistance genes, and these are likely embedded into the same putative plasmids. Although the pemIK loci have been identified next to vancomycin-resistance determinants in different putative plasmid contigs in VRSA strains, the potential functional relationship between these elements requires further investigation. It seems likely that particular pemIK TA systems preserve functionality even when transferred to distantly related species. We believe so because the non-coding regions vary significantly more between different species than the coding sequences (Suppl. Fig. 6). The non-coding sequences contain potential promoters which must adapt to particular species while the coding regions remain stable, further suggesting evolutionary pressure to preserve function.

Regulatory role of mazEF/pemIK homologues

In addition to its role in maintaining genetic elements, multiple lines of indirect evidence suggest that PemK toxins play a regulatory role in modulating the bacterial transcriptome and therefore gene expression. This regulation seems particularly relevant during stress-induction in TA systems and during persister formation by pathogenic bacteria^{53,54,55,56,57}. This presumed regulatory role is likely to be related to the endoribonucleolytic activity of the PemK toxin. PemKs have been demonstrated to be ribosome-independent mRNA interferases, but the role of this process in the regulation of orchestrated gene expression remains elusive at the experimental level. S. aureus MazF-Sa and its homologue form in S. equorum show stringent specificity for the pentanucleotide sequence UACAU^10,28. PemK-Sa1 recognises the tetranucleotide sequence UAUU¹². MazF homologues in other bacteria recognise sequences between 3 and 7 nucleotides in length^{9,10,58,59,60}. It is unlikely that such stringent specificity evolved in enzymes that are physiologically involved in unspecific degradation of total RNA. It has instead been speculated that many PemK toxins target specific gene pools, whereas other pools evolved that did not contain the target sites. Hence, PemKs could thereby globally regulate gene expression upon TA system activation^9,10,12. In the results presented, a broad array of staphylococcal pemIK loci were identified (Fig. 1).

The protein sequence similarity among different MazF/PemK homologues varies from 22 to 78%. A question arises whether our clustering criteria relate to functional differences? Even though very few well characterized examples are available, the 45% similarity associates with different cleavage specificities of MazF-Sa and PemK-Sa1^10,12. To the opposite, 85% similarity characterizes two proteins of identical specificity, MazF-Sa and MazF-Bs^10,61. The above levels correspond to the homology criteria adopted in our work according to which toxins from different strains were classified into a single or different groups. It is thus likely that most, if not all newly uncovered MazF/PemK homologues target different RNA sequences. But are they functional at all? Analysis in the context of structures of MazF-Bs and MazF-Ec^62,63 demonstrates that the novel homologues preserve some conserved residues, including those responsible for substrate binding and catalysis (Fig. 11) suggesting functional role. These conclusions are however highly speculative and future experimental characterization is clearly necessary.

Host preference of pemIK loci

With the exception of the pemIK-Sa1 loci, the low frequency of most of the pemIK loci in staphylococci did not enable us to determine host preference. It was previously suggested that pemIK-Sa1 exhibits a preference for non-human hosts, especially poultry⁶⁴. This TA system was first identified in the pCH91 plasmid, which is a homologue of pAvX, a plasmid characteristic of poultry strains of S. aureus ⁶⁴. The results of our current investigation demonstrate that pemIK-Sa1 occurs in a number of different contexts and primarily in plasmids and putative plasmid contigs. However, the host preference of pemIK-Sa1 is not completely clear from our statistical analysis. This question is sufficiently compelling to be considered in future studies.

Methods

Bioinformatic analyses

Staphylococcal genomic sequences, including both complete and shotgun contigs, were retrieved from the GenBank database⁶⁵. The complete list of analysed sequences, including accession numbers, is provided in Suppl. Table 1. Sequences from other species that were used in the accessory analyses are listed in Suppl. Table 2. The computational analysis was performed using self-developed Python/IPython scripts using standalone NCBI BLAST + tools 2.3.0⁶⁶ with default parameter values unless otherwise indicated. Open reading frames (ORFs) were identified and translated using multi-threaded functions developed in C++ and deployed as a Python module. A naïve search for all possible ORFs with a minimal length of 100 bp was performed for all sequences. The alternative start codons ATA, ATC, ATG, ATT, CTG, GTG and TTG were considered in parallel to the canonical ATG. To minimise redundancy, only the longest variant of each ORF was further analysed. The ORFs were translated to protein sequences using a bacterial codon table 11⁶⁷. Translated ORFs were probed for homology to well-characterised toxins and antitoxins in the MazEF/PemIK family, including MazEF-Sa¹⁰ and PemIK-Sa1¹². Five different approaches that were based on functionalities provided by BLAST+ tools were used as described below.

Conservative cascade search using protein BLAST

A BLAST database was created using makeblastdb for each set of translated ORFs representing a single bacterial strain. Each database was queried using the protein sequences of MazEF-Sa and PemIK-Sa1 components using the blastp tool with the E-value threshold of 0.1. The threshold for homologues was arbitrary set at a sequence similarity greater than 0.80 (corresponding to 0.60 sequence identity) and a match-to-query length ratio between 0.55 and 1.65. Matching sequences that met the length ratio requirement but had similarity below 0.80 but greater than 0.50 were further manually analysed. The presence in a two-gene operon and an arrangement typical of TA systems (i.e., genes partially overlapping or less than 100 bp apart) were initially used as criteria to classify such loci as a hypothetical TA system. Further criteria included the presence of a complete MazF/PemK domain (defined according to NCBI Conserved Domain Database v3.14)⁶⁸ in the putative toxin sequence or its length within 80 to 150 amino acids (aa) or the presence of a complete MazE domain in the putative antitoxin sequence or its length within a range of 50 to 100 aa. Newly discovered TA homologues were used to iterate the procedure until convergence, which was defined as when no new putative mazEF/pemIK loci were identified.

Extensive search using protein BLAST

Preparation for and the initial search of the database were performed using the same method as was used for the conservative cascade method. For each sequence match, protein sequences encoded by adjacent ORFs and arranged in a manner typical of TA systems (ORFs overlapping or closer than 100 bases) were added to the result list. This ensured the inclusion of potential TA system components in instances wherein only a single one was closely matched to the query sequence. The resulting set of protein sequences was clustered using blastclust with a length coverage threshold of 0.55 and a sequence identity threshold of 60%. Pairs of clusters containing ORFs that co-localised at any frequency in the analysed genomes were classified as potential TA systems. The final inclusion criteria were identical to those used in the conservative cascade method.

PSI-BLAST search

The procedure was identical to that used for the extensive search except that psiblast was used instead of blastp. PSI-BLAST is specifically tailored to identify distant homologues. To achieve this goal, it uses a position-specific matrix that is updated during each step of the iterative search procedure, unlike protocols that use a predefined score matrix (i.e., for a protein BLAST).

RPS-BLAST search

An RPS database containing the domain profiles that are present in components of mazEF/pemIK TA systems (e.g., COG2336, COG2337, pfam04014 and pfam02452) was created using makeprofiledb. The database was queried for all translated ORFs using rpsblast with the E-value threshold of 0.001. The ORF sets were prepared and subsequent steps were identical to those used to perform the conservative cascade method.

DELTA-BLAST search

DELTA database was created in a manner comparable to that used for the RPS database. The database search and the processing of results were performed in a manner identical to that used to perform the extensive search method except that deltablast was used instead of blastp.

Analysis of the genetic neighbourhood of mazEF/pemIK family loci

The genetic context of each identified TA loci was manually analysed using a graphical interface for the BLAST tools and implemented in CLC Main Workbench (CLC Bio/Qiagen). The analysis was based on the results of blast searches, annotation browsing, multiple sequence alignments and phylogenetic tree construction, and this allowed us to define particular genetic contexts. The figures were prepared based on graphics that were created in CLC Main Workbench and further processed using GIMP.

Construction of phylogenetic trees

The sequences for the MazF/PemK toxins identified in all relevant strains, the mazF-Sa, rpoB and saoC genes in strains carrying the pemIK-Sa1, belonging to a group of closely related strains that were isolated in Germany and included in the BioSample database^69,70, were used to analyse phylogenetic relationships. Toxins determine the biological activity of a particular TA system, and their sequences are less variable among different strains. Furthermore, one antitoxin family may be coupled with just a few toxin families²⁴. For these reasons, only toxin sequences were used in the phylogenetic analyses. All unique sequences were aligned in CLC Main Workbench (see supplementary data for detailed alignment parameters). Segments of the alignment containing gaps were removed, as were additional duplicates that arose after gaps were removed. Based on the resulting alignments, phylogenetic trees were constructed using the Maximum Likelihood Phylogeny tool in CLC Main Workbench. The model was chosen using the Model Testing tool (see the supplementary data for the details associated with the tree construction method).

Host distribution analysis

For all analysed strains, the collection date, geographical location and host name were retrieved from the BioSample database using self-developed Python scripts and the NCBI esearch and efetch tools^71,72. BioSample accession numbers were extracted from cross-referenced fields obtained from GenBank records whenever available. In the remaining cases (161 strains), the BioSample database was queried using the organism name and the strain/isolate signature provided within the GenBank record.

Experimental identification of mazEF/pemIK loci

Staphylococcus strains were obtained from international reference collections, including ATCC, BCCM/LMG and DSMZ, and the Polish Collection of Microorganisms (PCM, Wroclaw, Poland), as indicated by strain signatures, and from the collections of the Department of Microbiology, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University^33,73 (Table 3). The bacteria were cultured overnight in tryptic soy broth (TSB, Sigma-Aldrich) at 37 °C at 180 RPM. To isolate DNA, bacterial pellets obtained from 2 ml cultures were suspended in 200 μl of EC buffer (6 mM Tris-HCl pH 7.6; 1 M NaCl; 100 mM EDTA; 0.5% Brij; and 0.5% sarkosyl) supplemented with 1 μl of RNase A (10 mg/ml, Thermo Scientific) and 1 μl lysostaphin (10 mg/ml, Preparatis) and incubated for 1 h at 37 °C. After the lysis was completed, the DNA was isolated using a Genomic Mini Kit (A&A Biotechnology) according to the manufacturer’s protocol. Fragments of different mazEF/pemIK loci were PCR amplified from 50-100 ng genomic DNA using specific primers (1 μM; Suppl. Table 3) and the following PCR cycling parameters: initial denaturation (94 °C, 2 min), cycle-specific denaturation (94 °C, 30 sec), an annealing temperature appropriate for a particular set of primers (Suppl. Table 3) for 30 sec., and extension (74 °C, 1 min). The three latter steps were repeated 29 times and were then followed by a polishing step (74 °C, 10 min). DNA Polymerase (1 U, A&A Biotechnology) and manufacturer-supplied buffers were used. The PCR products were separated on 1% agarose gels in TAE buffer (40 mM Tris, 20 mM acetic acid, and 1 mM EDTA).

Table 3 List of strains used in PCR screens.

Full size table

Declarations

Availability of data and materials

All data supporting the conclusions presented in this article are available at the GenBank FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/bacteria/) or the NCBI Nucleotide website (https://www.ncbi.nlm.nih.gov/nucleotide/). The accession numbers of the analysed genomes are summarised in Suppl. Table 1.

References

Bukowski, M., Rojowska, A. & Wladyka, B. Prokaryotic toxin-antitoxin systems–the role in bacterial physiology and application in molecular biology. Acta Biochim Pol. 58, 1–9 (2011).
CAS PubMed Google Scholar
Schuster, C. F. & Bertram, R. Toxin-antitoxin systems are ubiquitous and versatile modulators of prokaryotic cell fate. FEMS Microbiol Lett. 340, 73–85 (2013).
Article CAS PubMed Google Scholar
Unterholzner, S. J., Poppenberger, B. & Rozhon, W. Toxin–antitoxin systems: biology, identification, and application. Mob Genet Elements 3, e26219 (2013).
Article PubMed PubMed Central Google Scholar
Wen, Y., Behiels, E. & Devreese, B. Toxin-antitoxin systems: their role in persistence, biofilm formation, and pathogenicity. Pathog Dis. 70, 240–9 (2014).
Article CAS PubMed Google Scholar
Schuster, C. & Bertram, R. Toxin-antitoxin systems of Staphylococcus aureus. Toxins (Basel). 8, 140 (2016).
Article PubMed Central Google Scholar
Gerdes, K. et al. Mechanism of postsegregational killing by the hok gene product of the parB system of plasmid R1 and its homology with the relF gene product of the E. coli relB operon. EMBO J. 5, 2023–9 (1986).
CAS PubMed PubMed Central Google Scholar
Tsuchimoto, S., Ohtsubo, H. & Ohtsubo, E. Two genes, pemK and pemI, responsible for stable maintenance of resistance plasmid R100. J Bacteriol. 170, 1461–6 (1988).
Article CAS PubMed PubMed Central Google Scholar
Sobecky, P. A., Easter, C. L., Bear, P. D. & Helinski, D. R. Characterization of the stable maintenance properties of the par region of broad-host-range plasmid RK2. J Bacteriol. 178, 2086–93 (1996).
Article CAS PubMed PubMed Central Google Scholar
Zhu, L. et al. The mRNA interferases, MazF-mt3 and MazF-mt7 from Mycobacterium tuberculosis target unique pentad sequences in single-stranded RNA. Mol Microbiol. 69, 559–69 (2008).
Article CAS PubMed Google Scholar
Zhu, L. et al. Staphylococcus aureus MazF specifically cleaves a pentad sequence, UACAU, which is unusually abundant in the mRNA for pathogenic adhesive factor SraP. J Bacteriol. 191, 3248–55 (2009).
Article CAS PubMed PubMed Central Google Scholar
Van Melderen, L., Saavedra De & Bast, M. Bacterial toxin-antitoxin systems: more than selfish entities? PLoS Genet. 5, e1000437 (2009).
Article PubMed PubMed Central Google Scholar
Bukowski, M. et al. A regulatory role for Staphylococcus aureus toxin-antitoxin system PemIKSa. Nat Commun. 4, 2012 (2013).
Article PubMed Google Scholar
Fernández-García, L. et al. Toxin-antitoxin systems in clinical pathogens. Toxins (Basel). 8, 227 (2016).
Article PubMed Central Google Scholar
Page, R. & Peti, W. Toxin-antitoxin systems in bacterial growth arrest and persistence. Nat Chem Biol. 12, 208–14 (2016).
Article CAS PubMed Google Scholar
Lee, K.-Y. & Lee, B.-J. Structure, biology, and therapeutic application of toxin–antitoxin systems in pathogenic bacteria. Toxins (Basel). 8, 305 (2016).
Article PubMed Central Google Scholar
Pimentel, B. et al. Toxin kid uncouples DNA replication and cell division to enforce retention of plasmid R1 in Escherichia coli cells. Proc Natl Acad Sci USA 111, 2734–9 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Ramage, H. R. et al. Comprehensive functional analysis of mycobacterium tuberculosis toxin-antitoxin systems: implications for pathogenesis, stress responses, and evolution. PLoS Genet. 5, e1000767 (2009).
Article PubMed PubMed Central Google Scholar
Sala, A., Bordes, P. & Genevaux, P. Multiple toxin-antitoxin systems in Mycobacterium tuberculosis. Toxins (Basel). 6, 1002–20 (2014).
Article PubMed PubMed Central Google Scholar
Georgiades, K. & Raoult, D. Genomes of the most dangerous epidemic bacteria have a virulence repertoire characterized by fewer genes but more toxin-antitoxin modules. PLoS One. 6, e17962 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Brown, J. M. & Shaw, K. J. A novel family of Escherichia coli toxin-antitoxin gene pairs. J Bacteriol. 185, 6600–8 (2003).
Article CAS PubMed PubMed Central Google Scholar
Sevin, E. W. & Barloy-Hubler, F. RASTA-Bacteria: a web-based tool for identifying toxin-antitoxin loci in prokaryotes. Genome Biol. 8, R155 (2007).
Article PubMed PubMed Central Google Scholar
Leplae, R. et al. Diversity of bacterial type II toxin–antitoxin systems: a comprehensive search and functional analysis of novel families. Nucleic Acids Res. 39, 5513–25 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sberro, H. et al. Discovery of functional toxin/antitoxin systems in bacteria by shotgun cloning. Mol Cell. 50, 136–48 (2013).
Article CAS PubMed PubMed Central Google Scholar
Makarova, K. S., Wolf, Y. I. & Koonin, E. V. Comprehensive comparative-genomic analysis of type 2 toxin-antitoxin systems and related mobile stress response systems in prokaryotes. Biol Direct. 4, 19 (2009).
Article PubMed PubMed Central Google Scholar
Tong, S. Y. C., Davis, J. S., Eichenberger, E., Holland, T. L. & Fowler, V. G. Staphylococcus aureus infections: epidemiology, pathophysiology, clinical manifestations, and management. Clin Microbiol Rev. 28, 603–61 (2015).
Article PubMed PubMed Central Google Scholar
Pires Dos Santos, T., Damborg, P., Moodley, A. & Guardabassi, L. Systematic review on global epidemiology of methicillin-resistant Staphylococcus pseudintermedius: inference of population structure from multilocus sequence typing data. Front Microbiol. 7, 1599 (2016).
Article PubMed PubMed Central Google Scholar
Yoshizumi, S. et al. Staphylococcus aureus YoeB homologues inhibit translation initiation. J Bacteriol. 191, 5868–72 (2009).
Article CAS PubMed PubMed Central Google Scholar
Schuster, C. F. et al. Characterization of a mazEF toxin-antitoxin homologue from Staphylococcus equorum. J Bacteriol. 195, 115–25 (2013).
Article CAS PubMed PubMed Central Google Scholar
Chanchaithong, P., Prapasarakul, N., Perreten, V. & Schwendener, S. Characterization of a novel composite staphylococcal cassette chromosome mec in methicillin-resistant Staphylococcus pseudintermedius from Thailand. Antimicrob Agents Chemother. 60, 1153–7 (2016).
Article CAS PubMed PubMed Central Google Scholar
Malachowa, N. & Deleo, F. R. Mobile genetic elements of Staphylococcus aureus. Cell Mol Life Sci. 67, 3057–3071 (2010).
Article CAS PubMed PubMed Central Google Scholar
Koreen, L. et al. N. spa typing method for discriminating among Staphylococcus aureus isolates: implications for use of a single marker to detect genetic micro- and macrovariation. J Clin Microbiol. 42, 792–9 (2004).
Article CAS PubMed PubMed Central Google Scholar
Drancourt, M. & Raoult, D. rpoB gene sequence-based identification of Staphylococcus species. J Clin Microbiol. 40, 1333–8 (2002).
Article CAS PubMed PubMed Central Google Scholar
Bukowski, M. et al. Species determination within Staphylococcus genus by extended PCR-restriction fragment length polymorphism of saoC gene. FEMS Microbiol Lett. 362, 1–11 (2015).
Article PubMed Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J Mol Biol. 215, 403–10 (1990).
Article CAS PubMed Google Scholar
Altschul, S. F. et al. Gapped BLAST and PS I-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–402 (1997).
Article CAS PubMed PubMed Central Google Scholar
Altschul, S. F. & Koonin, E. V. Iterated profile searches with PSI-BLAST—a tool for discovery in protein databases. Trends Biochem Sci. 23, 444–7 (1998).
Article CAS PubMed Google Scholar
Boratyn, G. M. et al. Domain enhanced lookup time accelerated BLAST. Biol Direct. 7, 12 (2012).
Article CAS PubMed PubMed Central Google Scholar
Henikoff, S. & Henikoff, J. G. Embedding strategies for effective use of information from multiple sequence alignments. Protein Sci. 6, 698–705 (1997).
Article CAS PubMed PubMed Central Google Scholar
Friedberg, I., Kaplan, T. & Margalit, H. Evaluation of PSI-BLAST alignment accuracy in comparison to structural alignments. Protein Sci. 9, 2278–84 (2000).
Article CAS PubMed PubMed Central Google Scholar
Jones, D. T. & Swindells, M. B. Getting the most from PSI-BLAST. Trends Biochem Sci. 27, 161–4 (2002).
Article CAS PubMed Google Scholar
Kaushik, S. et al. Improved detection of remote homologues using cascade PSI-BLAST: influence of neighbouring protein families on sequence coverage. PLoS One. 8, e56449 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Wei, Y.-Q., Bi, D.-X., Wei, D.-Q. & Ou, H.-Y. Prediction of type II toxin-antitoxin loci in Klebsiella pneumoniae genome sequences. Interdiscip Sci Comput Life Sci. 8, 1–7 (2015).
CAS Google Scholar
Kullik, I. & Giachino, P. The alternative sigma factor sigmaB in Staphylococcus aureus: regulation of the sigB operon in response to growth phase and heat shock. Arch Microbiol. 167, 151–9 (1997).
Article CAS PubMed Google Scholar
Donegan, N. P. & Cheung, A. L. Regulation of the mazEF toxin-antitoxin module in Staphylococcus aureus and its impact on sigB expression. J Bacteriol. 191, 2795–805 (2009).
Article CAS PubMed PubMed Central Google Scholar
Senn, M. M. et al. Molecular analysis and organization of the sigmaB operon in Staphylococcus aureus. J Bacteriol. 187, 8006–19 (2005).
Article CAS PubMed PubMed Central Google Scholar
Schuster C. F. et al. The MazEF Toxin-antitoxin system alters the β-Lactam susceptibility of Staphylococcus aureus. Hayes F, editor. PLoS One. 10, e0126118 (2015).
Chan, P. F., Foster, S. J., Ingham, E. & Clements, M. O. The Staphylococcus aureus alternative sigma factor sigma B controls the environmental stress response but not starvation survival or pathogenicity in a mouse abscess model. J Bacteriol. 180, 6082–9 (1998).
CAS PubMed PubMed Central Google Scholar
Christensen-Dalsgaard, M. & Gerdes, K. Two higBA loci in the Vibrio cholerae superintegron encode mRNA cleaving enzymes and can stabilize plasmids. Mol Microbiol. 62, 397–411 (2006).
Article CAS PubMed Google Scholar
Szekeres, S., Dauti, M., Wilde, C., Mazel, D. & Rowe-Magnus, D. A. Chromosomal toxin-antitoxin loci can diminish large-scale genome reductions in the absence of selection. Mol Microbiol. 63, 1588–605 (2007).
Article CAS PubMed Google Scholar
Harrison, P. W., Lower, R. P. J., Kim, N. K. D. & Young, J. P. W. Introducing the bacterial “chromid”: not a chromosome, not a plasmid. Trends Microbiol. 18, 141–8 (2010).
Article CAS PubMed Google Scholar
Flannagan, S. E. et al. Plasmid content of a vancomycin-resistant Enterococcus faecalis isolate from a patient also colonized by Staphylococcus aureus with a VanA phenotype. Antimicrob Agents Chemother. 47, 3954–9 (2003).
Article CAS PubMed PubMed Central Google Scholar
de Niederhäusern, S. et al. Vancomycin-resistance transferability from VanA Enterococci to Staphylococcus aureus. Curr Microbiol. 62, 1363–7 (2011).
Article PubMed Google Scholar
Lou, C., Li, Z. & Ouyang, Q. A molecular model for persister in E. coli. J Theor Biol. 255, 205–9 (2008).
Article CAS PubMed Google Scholar
Helaine, S. et al. Internalization of Salmonella by macrophages induces formation of nonreplicating persisters. Science. 343, 204–8 (2014).
Article ADS CAS PubMed Google Scholar
Maisonneuve, E. & Gerdes, K. Molecular mechanisms underlying bacterial persisters. Cell. 157, 539–48 (2014).
Article CAS PubMed Google Scholar
Zhang, Y. Persisters., persistent infections and the Yin-Yang model. Emerg. Microbes Infect. 3, e3 (2014).
Article CAS Google Scholar
Fasani, R. A. & Savageau, M. A. Unrelated toxin-antitoxin systems cooperate to induce persistence. J R Soc Interface. 12, 20150130 (2015).
Article PubMed PubMed Central Google Scholar
Zhang, Y. et al. MazF cleaves cellular mRNAs specifically at ACA to block protein synthesis in Escherichia coli. Mol Cell. 12, 913–23 (2003).
Article CAS PubMed Google Scholar
Rothenbacher, F. P. et al. Clostridium difficile MazF toxin exhibits selective, not global, mRNA cleavage. J Bacteriol. 194, 3464–74 (2012).
Article CAS PubMed PubMed Central Google Scholar
Yamaguchi, Y., Nariya, H., Park, J. H. & Inouye, M. Inhibition of specific gene expressions by protein-mediated mRNA interference. Nat Commun. 3, 607 (2012).
Article PubMed Google Scholar
Park, J. H., Yamaguchi, Y. & Inouye, M. Bacillus subtilis MazF‐bs (EndoA) is a UACAU‐specific mRNA interferase. FEBS Lett. 585, 2526–32 (2011).
Article CAS PubMed PubMed Central Google Scholar
Simanshu, D. K., Yamaguchi, Y., Park, J. H., Inouye, M. & Patel, D. J. Structural basis of mRNA recognition and cleavage by toxin MazF and its regulation by antitoxin MazE in Bacillus subtilis. Mol Cell. 52, 447–58 (2013).
Article CAS PubMed PubMed Central Google Scholar
Zorzini, V. et al. Substrate recognition and activity regulation of the Escherichia coli mRNA endonuclease MazF. J Biol Chem. 291, 10950–60 (2016).
Article CAS PubMed PubMed Central Google Scholar
Lowder, B. V. et al. Recent human-to-poultry host jump, adaptation, and pandemic spread of Staphylococcus aureus. Proc Natl Acad Sci USA. 106, 19545–50 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Bacterial Genomes at GenBank Database. National Center for Biotechnology Information. 2016. ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/bacteria/. Accessed 2016 Jul 20. (2016).
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics. 10, 421 (2009).
Article PubMed PubMed Central Google Scholar
Codon Tables. National Center for Biotechnology Information. 2016. https://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi. Accessed 2016 Dec 17. (2016).
Marchler-Bauer, A. et al. CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Res. 31, 383–7 (2003).
Article CAS PubMed PubMed Central Google Scholar
Barrett, T., Clark, K., Gevorgyan, R., Gorelenkov, V., Gribov, E. & Karsch-Mizrachi, I. et al. BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata. Nucleic Acids Res. 40, D57–63 (2012).
Article CAS PubMed Google Scholar
BioSample Database. National Center for Biotechnology Information. 2016. https://www.ncbi.nlm.nih.gov/biosample. Accessed 2016 Dec 17.
Maglott, D., Ostell, J., Pruitt, K. D. & Tatusova, T. Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res. 33, D54–8 (2005).
Article CAS PubMed Google Scholar
Wheeler, D. L. et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 45, D39–45 (2005).
ADS Google Scholar
Polakowska, K. et al. The virulence of Staphylococcus aureus correlates with strain genotype in a chicken embryo model but not a nematode model. Microbes Infect. 14, 1352–62 (2012).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank prof. Jacek Miedzobrodzki for providing bacterial strains from the collection of the Department of Microbiology, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University. The Faculty of Biochemistry, Biophysics and Biotechnology of Jagiellonian University is a partner of the Leading National Research Center (KNOW) and is supported by the Ministry of Science and Higher Education. This research was supported by funds granted by the National Science Centre (NCN, Poland) based on the decision no. DEC-2014/13/B/NZ1/00043 (to BW).

Author information

Authors and Affiliations

Department of Analytical Biochemistry, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, Krakow, Poland
Michal Bukowski, Karolina Hyz, Monika Janczak, Marcin Hydzik & Benedykt Wladyka
Malopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
Grzegorz Dubin
Department of Microbiology, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, Krakow, Poland
Grzegorz Dubin

Authors

Michal Bukowski
View author publications
You can also search for this author in PubMed Google Scholar
Karolina Hyz
View author publications
You can also search for this author in PubMed Google Scholar
Monika Janczak
View author publications
You can also search for this author in PubMed Google Scholar
Marcin Hydzik
View author publications
You can also search for this author in PubMed Google Scholar
Grzegorz Dubin
View author publications
You can also search for this author in PubMed Google Scholar
Benedykt Wladyka
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.B. prepared the environment for these experiments, optimised and performed the computational analyses, and designed the primers. K.H., M.J., and M.H. optimised and performed the PCR screens. M.B., K.H., M.J., M.H., G.D. and B.W. analysed and interpreted the results. M.B. prepared the figures and tables. M.B., G.D. and B.W. wrote the manuscript. All authors revised the manuscript and agreed to be accountable for all aspects of the work presented herein.

Corresponding author

Correspondence to Benedykt Wladyka.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary data

Supplementary tables

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bukowski, M., Hyz, K., Janczak, M. et al. Identification of novel mazEF/pemIK family toxin-antitoxin loci and their distribution in the Staphylococcus genus. Sci Rep 7, 13462 (2017). https://doi.org/10.1038/s41598-017-13857-4

Download citation

Received: 15 June 2017
Accepted: 02 October 2017
Published: 18 October 2017
DOI: https://doi.org/10.1038/s41598-017-13857-4
Springer Nature Limited

Identification of novel mazEF/pemIK family toxin-antitoxin loci and their distribution in the Staphylococcus genus

Abstract

Similar content being viewed by others

Single-molecule sequencing reveals the molecular basis of multidrug-resistance in ST772 methicillin-resistant Staphylococcus aureus

Prediction of Type II Toxin-Antitoxin Loci in Klebsiella pneumoniae Genome Sequences

A megaplasmid family driving dissemination of multidrug resistance in Pseudomonas

Introduction

Results

General results

Distribution and genetic context of mazEF/pemIK loci

Host specificity of pemIK-Sa1

Unique features of mazEF-Sa

In vitro evaluation of the distribution of mazEF/pemIK loci in a diverse collection of staphylococcal strains

Discussion

Results of homology search approaches

Staphylococci carry at least 12 independent mazEF/pemIK homologues

Maintenance of mobile genetic elements and pemIK loci

Regulatory role of mazEF/pemIK homologues

Host preference of pemIK loci

Methods

Bioinformatic analyses

Conservative cascade search using protein BLAST

Extensive search using protein BLAST

PSI-BLAST search

RPS-BLAST search

DELTA-BLAST search

Analysis of the genetic neighbourhood of mazEF/pemIK family loci

Construction of phylogenetic trees

Host distribution analysis

Experimental identification of mazEF/pemIK loci

Declarations

Availability of data and materials

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Supplementary data

Supplementary tables

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation