Comparative analysis reveals the Genomic Islands in Pasteurella multocida population genetics: on Symbiosis and adaptability
Pasteurella multocida (P. multocida) is a widespread opportunistic pathogen that infects human and various animals. Genomic Islands (GIs) are one of the most important mobile components that quickly help bacteria acquire large fragments of foreign genes. However, the effects of GIs on P. multocida are unknown in the evolution of bacterial populations.
Ten avian-sourced P. multocida obtained through high-throughput sequencing together with 104 publicly available P. multocida genomes were used to analyse their population genetics, thus constructed a pan-genome containing 3948 protein-coding genes. Through the pan-genome, the open evolutionary pattern of P. multocida was revealed, and the functional components of 944 core genes, 2439 accessory genes and 565 unique genes were analysed. In addition, a total of 280 GIs were predicted in all strains. Combined with the pan-genome of P. multocida, the GIs accounted for 5.8% of the core genes in the pan-genome, mainly related to functional metabolic activities; the accessory genes accounted for 42.3%, mainly for the enrichment of adaptive genes; and the unique genes accounted for 35.4%, containing some defence mechanism-related genes.
The effects of GIs on the population genetics of P. multocida evolution and adaptation to the environment are reflected by the proportion and function of the pan-genome acquired from GIs, and the large quantities of GI data will aid in additional population genetics studies.
KeywordsPasteurella multocida Genomic islands Population genetics Pan-genome Symbiosis Adaptability
American Type Culture Collection
Conserved Domains Database
Coding DNA Sequences
Cluster of Orthologous Groups
China Veterinary Culture Collection Center
horizontal gene transfer
Integrative and Conjugative Elements
Kyoto Encyclopedia of Genes and Genomes
Locally collinear block
Multi Locus Sequence Typing
National Center for Biotechnology Information
PHAge Search Tool Enhanced Release
Web Gene Ontology Annotation Plot
As the technology-developing capacity and the scale of genomic data developes, population genetics and molecular evolution have become central analytical disciplines [1, 2, 3]. The question of how genetic structure and variation in species affect selective forces and adapt to novel conditions has been foundational to the field of population genetics [4, 5]. Gene islands (GIs) are an integrative mobile element, containing large genomic regions (> 10 kb) acquired through horizontal gene transfer (HGT) between different hosts, that might increase the versatility and adaptability of the recipient and enable certain bacteria to occupy entirely new niches. GIs act on for population genetics by facilitating gene exchange within or among species [6, 7]. Lysogenic bacteriophages and plasmids are important enablers of gene flow [8, 9]. They promote the formation of GIs and confer superior mobility as removable genetic components that move through site-specific recombination and integrating into the corresponding chromosomes . Integrative and conjugative elements (ICEs) are a special type of GIs that promote the mobilization of degenerative, immobile GIs and contributing to lateral gene flow [11, 12]. In general, GIs are divided into pathogenicity, resistance, xenobiotic-degradation, metabolic, symbiosis or fitness islands by the environmental context of the bacterium and the one or more important functional genes among them [13, 14].
Pasteurella multocida (P. multocida) is a causative agent of economically crucial diseases spanning the entire world and is a facultative pathogen of Gram-negative, atrichous, nonspore-bearing but encapsulated coccobacillus . Germs are the sole pathogenic agent that cause pasteurellosis in diversified animals, including atrophic rhinitis (AR) and swine plague hogs, fowl cholera (FC), haemorrhagic septicaemia (HS) of bovines, infectious pneumonia in sheep and snuffles in rabbits . Oliveira Filho (2018) demonstrated that P. multocida is able to cause pneumonia in pig without other pathogen (bacteria /virus) . And scratches or bites from infected dogs or cats can sporadically cause human infection . Currently, there have been studies on the virulence genes of P. multocida , with prospective analysis including forward selection genes and host-virulence interaction genes in the pan-genome [20, 21], and HS-specific genes involving disease-specific diagnostic tests and possible resistance genes [16, 22]. Moreover, there are also a few reports of P. multocida GIs, the resistant island ICEPmu1 from strain 36,950 , and the resistant islands from strains TX1 and BUKK, which are similar to ICEPmu1 .
However, the extremely dynamic pan-genome of facultative pathogens is largely derived from mobile elements and phage-derived GIs, representing for some taxa the major evolutionary route of facultative pathogens [9, 24]. The close correlation between GIs and bacteriophages and the characteristics of GIs’ large mobile components require more attention. For the facultative pathogens of P. multocida, the GI proportions and effects on P. multocida population genetics were explored through comparative genomics between the pan-genome and GIs in this article.
Genomic characteristic of P. multocida
The quantitative data of 10 sequenced avian-sourced strains and 104 reported P. multocida strains, 17 complete and 97 draft, are shown in Additional file 1: Table S1. Apart from containing gap aligned areas, the average genome length of the donors was 2,323,144 bp, ranging from 2.20 to 2.70 Mbp. The 114 strains were from 12 different countries. Of all the strains, 18 strains were isolated from poultry, 66 from bovines, 17 from rabbits, 5 from swine, 2 from goats, 1 from canis, 1 from alpaca and 4 from humans. The quantity of the encoded proteins ranged from 1903 (Razi 0002, avian) to 2313 (FDAARGOS_261, human); the average GC content was 40.29%, while the minimum was 36.9% (HN141013, DY120818, avian; CIRMBP-0760, Oryctolagus cuniculus) and the maximum was 40.7% (P1062, HS SKN01; cattle); the amount of rRNA ranged from 1 (2000, cattle) to 19 (the great majority), and tRNA ranged from 16 (PmP1933, cattle) to 59 (PmOH1905, canis; Razi_Pm0001, cattle); the pseudogenes ranged in magnitude disparately from 6 (Pm70, avian) to 210 (1500E, avian) (Additional file 2: Table S2). Simultaneously, the scanty rest was interrogative and unfilled portions were uncharted in the database.
The pan-genome analysis across all strains
Alignment and phylogenetic analysis of 114 strains
Genomic islands in P. multocida
GI regions of all strains were predicted and screened. Precise positioning and functional comments have been applied to parse the complete genomes of 17 P. multocida, shown in Additional file 6: File S1. The exact GI location was determined for 17 complete genome sequences according to the search for direct repeat GI sequences and whether they contained the basic GI origin DNA, such as hypothetical protein, phage-related protein, and integrase; the integrase type and integration site were counted and collated (Additional file 7: Table S4). On average, complete genomes from P. multocida strains harbor 36 GIs. In the results, there were 2 RGI, one of which contained a large number of integrative conjugative elements (36,950, ICEPmul), 23 PAIs, 3 metabolic islands, 4 symbiosis islands, together with 1 pathogenicity and symbiosis island, 2 pathogenicity and metabolic islands, and 1 pathogenicity and resistance island. The GI integrases in P. multocida mostly belong to the reorganization of tyrosine enzyme/integrase families, while a few belong to the Rve family. Ninety-seven draft genomes have been analysed with a similar method using P. multocida ATCC 43137 as reference. The regions of GIs without precise starting point were predicted and the interrupted GI fragments were discarded totally because of the limitations of the strains sequence themselves. Hence, 280 GIs were taken, 123 of which were PAIs, 27 RGIs, 97 symbiosis islands, 4 metabolic islands, 7 pathogenicity and metabolic islands, 1 pathogenicity and symbiosis island and 21 pathogenicity and resistance islands (Additional file 7: Table S4 and Additional file 8: Table S5).
Consequently, there were 280 GIs in this experiment. The virulence-related islands accounted for 44% (123 in total) of the all predicted results if the multifunctional GIs were excluded (the above statistical results that were not part of the data of PAIs), or 54% (152 in total) if the multifunctional GIs were included. In addition, the symbiosis islands also occupied a large proportion at 37% (97) or 35% (98). ICE accounted for 11% (32) of the total.
Prophage regions in P. multocida
Temperate phages are important factors in bacterial evolution and the formation of new pathogens and sometimes can be converted into virulent phage to kill bacteria [26, 27, 28]. The prophages of 114 P. multocida strains were predicted. The prophage-related elements were analysed and 324 phages were identified and described with their general characteristics, including 153 intact bacteriophages, 36 questionable bacteriophages and the 133 incomplete (Additional file 9: File S2 and Additional file 10: Table S7). Only the bacteriophage region in the genome of 93,002 is not present in all strains. According to the formation mechanism of GI, the phage is likely the precursor to the formation of GIs; that is, the phage could lose self-replicating genes and large segments of HGT by integrase, transposase, recombinase and so on between the strains in the form of GIs . Due to the differences in software rearrangement in the prediction of the draft genome strains, only the GI regions of the 17 complete genome strains were compared with their predicted prophage area, and 63.3% (31) of the possible phage regions were found to be approximately the same as those of the GIs (Additional file 10: Table S7). Of these, 64.5% (20) were intact, 22.6% (7) potential and 12.9% (4) incomplete prophage regions.
The particular correlation of the population genome and GIs
The prophage was an important source of GIs, and a significant element of GIs was the integrase/recombinase/transposase . Of the predicted strain phage, 66 strains contained integrase/recombinase/transposase. Therefore, the phylogenetic tree based on the mobile elements in these prophage of 66 strains was used as a proxy for the phylogeny of the GIs. In the Additional file 12: Figure S5, the arc was marked as the same part compared to the phylogenetic tree based on the core genes, the green arc indicating the strains were in the same large branch, the black arc indicating them were scattered but all in a same branch, respectively. The phylogenetic relationship exhibited by the important mobile elements was basically consistent with the phylogenetic relationship of the core genes.
In the bioenergetics-constrained prokaryotic genome size, the adaptive evolutionary process of populations is revealed through the pan-genome [29, 30]. As all examples of genome reduction to date have been limited to endosymbionts or pathogens with a host-dependent lifestyle, we studied the open pan-genome of 114 P. multocida taxa (Fig. 1); our conclusions were similar to Hurtado et al  conclusions, indicating that this type of strain strengthens the ability of the “survival of the fittest” alone . That is they are the pathogens of non-host dependent lifestyle from the open evolutionary pattern of P. multocida. The bacteria could expand their gene pool through a variety of mobile elements including GIs. HGT events with beneficial effects would increase the fitness of the acceptors, eventually being fixed in the population; whereas acquired genes being redundant or detrimental are subject to purifying selection, accumulating mutations before being pseudogeneticized or lost . In the process, new genes with small fluctuations and the growing number of unique genes are the embodiment of the ability to obtain foreign elements stably and regularly (Fig. 2). However, differences in the gene levels of each strain might be due to geographical and ecological isolation, as shown in Fig. 4, where most of the similarly located P. multocida strains were in the same large branch, and most of the same host-derived were in the same phylogenetic tree. This contradicts Cao, et al.  who proposed phylogeny that showed no correlation with other species except for serotype. Moustafa, et al.  discovered that only the phylogeny of bovine strains was related to geographic location and host source. The phylogenetic tree of 114 strains based on core genes was compared with the phylogenetic tree of 146 P. multocida constructed by an alignment-free method with NCBI using feature (or l-mer) frequency profiles (FFP) of whole genomes . As shown in Additional file 13: Figure S3, an arc of the same colour represents the same large branch. This comparison demonstrated that the loci of all strains in the phylogenetic tree had a high degree of similarity, except for the canis-sourced strain OH1905. OH1905 was not located on a single branch as on NCBI, but was located on the same large branch as P-1662 and some of the avian-source strains. This may be due to the different strains of the screening results making a slight change in common genes determined by these data, for example there are 306_PMUL, OH4807 in Additional file 13: Figure S3-A, while FDAARGOS_219 in Additional file 13: Figure S3-B. Genomic data for only one strain uploaded by a different research institution was retained in this paper, such as ATCC 11039 and X73, NCTC 10322 and ATCC 43137, and P-1059 and ATCC 15742. According to the MLST results (Additional file 14: Figure S4), OH1905 and P-1662 belong to two different MLST CC (clonal complexes), although both of them were unknown. This showed that they have a certain difference in gene level, so they should not be in the same branch but in line with their host source differences. Therefore, in this paper, the phylogeny of P. multocida accurately showed close relationships with geographical location and host origin.
Unquestionably, GIs are due to unique aspects of bacterial behaviour, and each of these important new genes brings a new possibility to this population . As shown in Additional file 15: Table S8, the toxin HipA gene was closely related to the dormancy and persistence of bacteria, as well as an increase in antibiotic resistance; this gene was only present in the strain P1933 (WP_016533466.1) and the pathogenicity and resistance island GIPm3480_1 of the strains 3480 and GIPmHNO6_1 [36, 37]. The toxin HipA gene has been reported in Escherichia coli (E. coli), Haemophilus parasuis and is widely found in fungi but has not been reported in Pasteurella [38, 39, 40]. Simultaneously, GIPm3480_1 also possessed a gene for a CopG family transcriptional regulator that plays an antitoxin role similar to plasmid pEMB1 in E. coli . GIPm3480_1 and multiple other strains contain the HNH endonuclease gene only carried by bacterium-infected COS phage. The HNH-TerS-TerL supramolecular complex in the GIs plus a complete capsid could cleave the cos-site of the phage. This is helpful to the island invasion packaging mechanism of the phage by certain helper phages to be excised, replicated, and packaged in an encapsulated form that can infect phage particles, thereby making GIs with no mobility inherent to their mobility [42, 43]. GIPmOH1905_1 and GIRazi_Pm0001_2 contained transcriptional regulator AlpA regulating multidrug/metal ion transporter genes and oxidative stress modulators . In addition, GIPmHNO7_1 contained LPS-related proteins, iron uptake-related antibiotic biosynthesis monooxygenase, an anti-tetracycline gene, and a large number of ICE-related genes. GIPmHN06_3 contained dermonecrotic toxin. GIRazi_Pm0001_3 contained TonB-dependent receptors related to iron transport . GIPm36950_1 contained an aminoglycoside antibiotic-resistance gene, aminoglycoside 3′-phosphotransferase, a beta-lactamase type resistance gene, a macrolide antibiotic resistance gene, tetracycline resistance genes, a nucleotidyl transferase gene, and multidrug efflux system-related proteins. It has been reported that this GI is ICEPmu1 with multiple drug resistance [23, 46]. According to statistics, there are multiple strains containing the ppGpp gene associated with starvation survival and virulence . Xenobiotic response element (XRE) is a family transcriptional regulator involved in the stress response , and HicAB, a type II toxin-antitoxin system similar to E. coli HicAB, affects the growth and death of the strains . The multitude of data and unique functions of GIs once again suggest this mobile element is likely related to the pathogenicity of strains and the host interactions in the process of population evolution .
Although the effect of HGT on prokaryotic evolution has been discovered , the precise interplay of bacterial GIs on population genomes is still poorly understood. Using P. multocida as an example, GIs accounted for 5.8% of the core genes in the pan-genome, 42.3% of the accessory genes, and 35.4% of the unique genes. The existence of a small number of core genes in the GI may be because these genes are not only widely present in Pasteurella, other types of bacteria, or even fungi. Therefore, when the entire GI is transferred to P. multocida, the genes with higher frequencies in the foreign strains were also transferred; however, because of their unique location characteristics at both ends of the GI, it is more likely that the host’s own genes have moved to the GI area after the GI was transferred to the host. Moreover, 42.3% of the accessory genes were derived from GIs. Most of the genes that have similar functional proportions to the accessory genes, possibly due to the greater demand for such functional categories during the survival and reproduction of these strains or the fact that most of the strains themselves contain many of the functional categories associated with biological processes to make GIs more likely to contain such genes. Gene enrichment of GIs within the accessory genes is an important concept, and these GI genes were fixed in the genome to enhance the invasiveness, anti-stress pathways and population genetic changes.
The unique genes were mostly exogenous, very likely to reflect the environmental pressure that the strains faced. As shown in Additional file 16: Table S9, there were 200 unique genes contributed by the GIs to strain 36,950, including most of the drug-resistant genes associated with the specific phenotype of the strain, such as aminoglycosides, macrolides, and beta-lactams. Most of the GI-related genes found in HS_SKN01 were related to ICE. The GI-related genes found in TX1 included many ICE-related proteins and genes involved in the metabolism of bacterial cell walls, ABC transporters, ATP-binding proteins containing both ATPase and permease components of an ABC-type multidrug transport system, cobalt transporters, membrane proteins, and TetR/AcrR family transcriptional regulator control genes involved in a variety of processes including antibiotic production, osmotic stress response, efflux pump expression, and multidrug resistance [50, 51, 52]. The unique genes of P-2095 located in the GI region included dTDP-4-dehydrorhamnose reductase associated with cell wall/membrane/envelope biogenesis . Additionally, FDAARGOS_261 had the highest GC% (40.7%) of all strains and significantly higher number of unique genes than those of other strains (Additional file 2: Table S2), but most of these genes are hypothetical proteins. There were a number of hypothetical proteins with undefined functional categories besides FDAARGOS_261, so more research and data are needed to determine the uniqueness of these components.
In addition to the above unique parts of GIs for the population genome, the GIs might also have an impact on the population genetics of P. multocida. The matching of 63.3% prophage with the 17 complete GIs suggested the prophage was a possible mechanism of GIs. The phylogenetic relationship of GIs was demonstrated by the phylogenetic tree based on prophage (Additional file 14: Figure S4), in which 79% (49) strains exhibited the same phylogenetic relationship as the core genes. The remaining parts that did not show the same clustering results, but 18%(11) strains with close phylogenetic relationship (such as FDAARGOS_218 and ATCC15742, CIRMBP-0760 and CIRMBP-0758 etc.) were closely related to each other. Thus, the GIs might be correlated with the phylogeny of P. multocida, remained to be investigated more in detail. The availability of additional data about the strains could pave the way to quantitative insights into the evolution of these bacteria.
We demonstrated that the pan-genome components of P. multocida were disparately affected by GIs, which played an important role in the amplification of the pan-genome of bacteria flora, especially the genes related to virulence, tolerability and environmental adaptability. These types of genes enable the bacteria to transfer benefits to themselves through the horizontal GI transfer, thereby enhancing symbiosis and adaptation of the bacteria.
Through pan-genome analysis, the phylogeny of P. multocida was found to be closely related to host source and geographical location. The GI genes found in the population genome of P. multocida were scrutinized. We identified 280 GIs, accounting for 5.8% of the core genes in the pan-genome, 42.3% of the accessory genes and 35.4% of the unique genes. The identified genes were mainly related to strain virulence and environmental adaptation. Therefore, in the context of population genetics of P. multocida, further studies analysing the contribution of GIs to strain adaptation and evolutionary benefit, followed by a large quantity of GI data analysis in population genome will lead to a more diverse understanding of strain adaptation and evolution.
Materials and methods
Source of genome sequences
Three isolates were identified as P. multocida, namely DY120818, HN141014 and RCAD0276, from China that originated from dead duck livers were utilized, as well as six avian origin P. multocida strains obtained from the American Type Culture Collection (ATCC) and one P. multocida CVCC44801 (C48) obtained from the China Veterinary Culture Collection Center (CVCC). Sequencing of the 10 isolates was conducted using an Illumina HiSeq 2500. High-quality genomic DNA was extracted with a TIANamp Bacteria DNA Kit (Tiangen Biotech Co, Ltd., Beijing, China). Illumina reads were corrected by Quake . De Novo assembly was performed using two different assembly tools: Velvet v1.2.09  together with VelvetOptimiser . The VelvetOptimiser was used to optimize assembly parameters automatically, and SPAdes version 3.1.0  was used with default parameters. The other 104 P. multocida complete genome sequences were retrieved from GenBank at NCBI . The genome completeness of all strains was evaluated using the CheckM14 genome quality estimator .
Calculation of pan-genome size and gene sets
BPGA, a fast pan-genome analysis tool, was used to analyse the related genomes of all strains . The cut-off of Blast is 50% without paralogous clustering. The median sizes of the pan-genomes data were generated using the mathematical model heap’s law, npan = kpanNγ . The N is the number of genomes, both kpan and γ are free parameters, and npan is the total number of all non-redundant gene families in the pan-genome. We retrieved the orthologous genes of various genomes and their classifications as core genes, accessory genes, unique genes and new genes.
The sequence typing (ST) and clonal complexes (CCs) of 114 P. multocida were determined by means of BioNumerics software resorting a temporary evaluation license from Applied Maths . In Multi Locus Sequence Typing (MLST), an ST is uniquely determined by the allelic profiles, and CCs differing at one of the seven MLST loci are grouped into STs that share a recent common ancestor .
The construction of the phylogenetic tree
The nucleotide sequences corresponding to the core genes of 114 strains were concatenated to construct a core genome concatamer. The core genome concatamer was aligned using the MAFFT online service  (default parameters) and then a neighbour-joining (NJ) trees was constructed using MEGA programme on the basis of the Jensen-Shannon divergences matrix from each type of genome partition . The p-distance of the tree based on core genes was calculated as 0.020, and the average Jukes-Canter (JC) distance was 0.009. Branch support was computed by bootstrap with 1000 replicates, and “Maximum Composite Likelihood” model was used to construct the resulting graph. The MLST sequence was analysed in the same way with a p-distance of 0.010 and a JC distance of less than 1, with 1000 bootstrapping replications. And the phylogenetic tree based on integrase/recombinase/transposase of prophage was constructed in the same way. Finally, the tree was recomputed to determine the disproportionation branches with > 50 uncertainty, and these branches were placed in the same larger branch and converted into a consistent tree in the graph.
Identification of genomic islands
The whole genome sequences across all strains were upload to IslandViewer4 , where the potential GI regions were predicted by at least one method and expounded explicitly. All data for the 114 taxa were reinterpreted with the Kyoto Encyclopedia of Genes and Genomes (KEGG)  and Cluster of Orthologous Groups (COG) . The approximate multi-collinearity of 17 complete genomes of P. multocida were compared to each other by means of Mauve, which was utilized to analyse transmutation and development among strains such as inversion, rearrangement, insertion and deletion in the shape of locally collinear blocks (LCBs) [68, 69]. The existing GIs were preliminarily screened on the basis of the unique or minimal number of comparative genomic characteristics of GI by Mauve . The hypothetical protein data output was validated again by BLAST and the Conserved Domains Database (CDD)  of NCBI to further determine the functional categories. Furthermore, the 17 obtained GIs demonstrated precise locations according to the various characteristics, including common GI themes, abnormal GC content, direct repeat (DR) structure flanking, several genes encoding traits that may increase bacterial adaptability or fitness under certain growth conditions, fragmented insertion sequence (IS) elements and other mobility-related genes (e.g., essential functional integrase (Int), recombinase or transposase); the functions of these genes are involved in the insertion and deletion of the region that is flanked by DR structures and are absent from the chromosome with which they are closely associated [6, 72]. The detailed information of the GIs of 114 strains as Additional file 6: File S1 could be seen in https://figshare.com/s/405c87082feea840e2da (DOI: https://doi.org/10.6084/m9.figshare.7440329).
Calculation of intersection between GI and pan-genome
These genes present on the GIs in the core genes, the accessory genes and unique genes. The common part of the core genes and GIs were analysed by the COG database for their functional clustering results, and the common parts of the unique genes and GIs were the same. In the background of the newest (2018-03-01) gene ontology files, the shared parts of the accessory genes and the GIs were used to obtain the gene enrichment at the 3rd level of GO in the native format with the Web Gene Ontology Annotation Plot (WEGO) .
Identification of prophage regions
Bacteriophages (phages) are bacterial viruses that infect bacteria, fungi, and actinomyces, while the prophage is the nucleic acid of a mild phage which is integrated into the host genome . According to PHASTER (PHAge Search Tool Enhanced Release) , a web of online analysis using the previously retrieved genome sequences, prophage sequences within all bacterial genomes were identified, annotated and forecasted successfully. The detailed information of the predicted prophage of 114 strains as Additional file 9: File S2 could be seen in https://figshare.com/s/405c87082feea840e2da (DOI: https://doi.org/10.6084/m9.figshare.7440329).
This work was supported by the National Key Research and Development Program of China under Grant No. 2017YFD050080; China Agricultural Research System under Grant No. CARS-42-17; Special Fund for Key Laboratory of Animal Disease and Human Health of Sichuan Province under Grant No. 2016JPT0004; Sichuan Veterinary Medicine and Drug Innovation Group of China Agricultural Research System (CARS-SVDIP).
Availability of data and materials
The data supporting the results of this paper is available in the NCBI WGS database. The accession numbers of P. multocida genomes are included in Additional file 1: Table S1. The detailed information of the GIs and prophage of 114 strains (Additional file 6: File S1 and Additional file 9: File S2) could be seen in https://figshare.com/s/405c87082feea840e2da (DOI: https://doi.org/10.6084/m9.figshare.7440329).
DZ and AC conceived and designed the project; JH, YL, LZ, YYu and XC cultured bacteria and extracted genomic DNA; ZY, ML, JR, SC and MW assembled and annotated the genomes; JH, XZ, QY and YW performed pan-genome and prophage regions analysis; JH, ZY and SZ performed genomic islands identification and calculated the intersection between GI and pan-genome; JH, DZ and YY performed MLST analysis and the construction of the phylogenetic tree; JH and DZ drafted and revised the manuscript. All authors have read and approved the final version manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 1.Pool J, Hellmann I, Jensen J, et al. Population genetics of genome-scale sequence variation. Genome Res. 2010. https://doi.org/10.1101/gr.079509.108.
- 2.Hartl DL, Clark AG. Principles of population genetics. J Animal Breeding Genet. 1982;34(1):148.Google Scholar
- 3.Ewens WJ. Mathematical population genetics. Biometrics. 1979;36(3):2241–304.Google Scholar
- 4.Ewens WJ. Evolutionary genetics. Oxford University Press. 1989;96(19):10559–61.Google Scholar
- 5.Charles Darwin Esq. F.R.S. F.L.S. &c F.G.S, Esq A W. On the tendency of species to form varieties; and on the perpetuation of varieties and species by natural means of selection. Zool J Linnean Soc. 2001;3(9):45–62.Google Scholar
- 7.Wolf JB, Ellegren H. Making sense of genomic islands of differentiation in light of speciation. Nat Rev Genet. 2016. https://doi.org/10.1038/nrg.2016.133.
- 11.Van JDM, Sentchilo V. Genomic islands and the evolution of catabolic pathways in bacteria. Curr Opin Biotechnol. 2003;14(3):248–54.Google Scholar
- 16.Moustafa AM, Seemann T, Gladman S, et al. Comparative genomic analysis of Asian Haemorrhagic Septicaemia-associated strains of Pasteurella multocida identifies more than 90 Haemorrhagic Septicaemia-specific genes. PLoS One. 2015;10(7):1–16.Google Scholar
- 18.Mutters R, Christensen H, Bisgaard M. Pasteurella: John Wiley & Sons, Inc; 2015. https://doi.org/10.1002/9781118960608.gbm012011-23.
- 19.Aktories K, Orth JHC, Adler B, et al. Pasteurella multocida: molecular biology, toxins and infection. Current Topics in Microbiology & Immunology. 2012;361:v.Google Scholar
- 24.Hoetzinger M, Schmidt J, Jezberová J, et al. Microdiversification of a Pelagic Polynucleobacter Species Is Mainly Driven by Acquisition of Genomic Islands from a Partially Interspecific Gene Pool[J]. Applied & Environmental Microbiology. 2016;83(3):AEM.02266–16.Google Scholar
- 27.Wagner PL, Waldor MK. Bacteriophage control of bacterial virulence. Infection & Immunity. 2002;70(8):3985–93.Google Scholar
- 31.Hurtado R, Carhuaricra D, Soares S, et al. Pan-genomic approach shows insight of genetic divergence and pathogenic-adaptation of Pasteurella multocida. Gene. 2018:193–206. https://www.sciencedirect.com/science/article/pii/S037811191830581X.
- 38.Che Y, Chen R, Wang L, et al. Bioinformatic analysis of toxin protein gene hipA from Haemophilus parasuis. Chinese Agricultural Science Bulletin. 2016;32(2):1–4.Google Scholar
- 41.Lee HJ, Jin HM, Park MS, et al. Recovery of plasmid pEMB1, whose toxin-antitoxin system stabilizes an ampicillin resistance-conferring β-lactamase gene in Escherichia coli, from natural environments. Applied & Environmental Microbiology. 2015;81(1):40–7.Google Scholar
- 42.Quilespuchalt N, Carpena N, Alonso JC, et al. Staphylococcal pathogenicity island DNA packaging system involving cos-site packaging and phage-encoded HNH endonucleases. Proc Natl Acad Sci U S A. 2014;111(16):6016–21.Google Scholar
- 46.Davies J. Bacterial resistance to aminoglycoside antibiotics. Trends in microbiology. 1971;124 Suppl(6):S7.Google Scholar
- 50.Pai CH, Wu HJ, Lin CH, et al. Structure and mechanism of Escherichia coli glutathionylspermidine amidase belonging to the family of cysteine; histidine-dependent amidohydrolases/peptidases. Protein Sci. 2015;20(3):557–66.Google Scholar
- 56.VelvetOptimiser. http://bioinformatics.net.au/software.velvetoptimiser.shtml. Accessed 19 May 2016.
- 58.Mizrachi I. GenBank: the nucleotide sequence database: National Center for Biotechnology Information. https://www.ncbi.nlm.nih.gov/genome/genomes/. Accessed 18 Dec 2017
- 61.BioNumerics version 7.6 created by Applied Maths NV. Available from http://www.applied-maths.com. Accessed 5 Mar 2018.
- 62.Donati C, Hiller NL, Tettelin H, et al. Structure and dynamics of the pan-genome of Streptococcus pneumoniae. Genome Biol. 2010;12(10):1–1.Google Scholar
- 63.Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization.[J]. Briefings in Bioinformatics. 2017. https://academic.oup.com/bib/advance-article/doi/10.1093/bib/bbx108/4106928.
- 65.Bertelli C, Laird MR, Williams KP, et al. IslandViewer 4: expanded prediction of genomic islands for larger-scale datasets. Nucleic Acids Res. http://www.pathogenomics.sfu.ca/islandviewer/. Accessed 27 Jan 2018.
- 67.Galperin MY, Makarova KS, Wolf YI, et al. Expanded microbial genome coverage and improved protein family annotation in the COG database. Nucleic Acids Res. 2015;43(Database issue):261–9.Google Scholar
- 71.Marchlerbauer A, Bo Y, Han L, et al. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures:[J]. Nucleic Acids Res. 2017;45(Database issue):D200–3.Google Scholar
- 74.Agrawal P, Soni V. Bacteriophage, a process for the isolation thereof, and a universal growth medium useful in the process thereof. US; 2002. https://patents.google.com/patent/US6482632. Accessed 18 Apr 2018
- 75.David A, Grant JR, Ana M, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic acids research. 2016;44(Web Server issue):W16–21.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.