Introduction

Although N. meningitidis gives rise to 1.2 million cases of meningitis and severe sepsis disease each year1, asymptomatic colonisation of the human oropharynx is common, with population carriage rates of 10–30%2. As invasive disease does not contribute to person-to-person transmission, the meningococcus is an example of an ‘accidental’ pathogen3. Carried meningococci are highly diverse at loci encoding both antigens and metabolic functions, with much of this diversity generated by genetic reassortment, mediated by horizontal genetic transfer (HGT). Despite this diversity, meningococcal populations are highly structured into distinct genealogical groups or lineages, which are recognised by multilocus sequence typing (MLST) as ‘clonal complexes’ (ccs), which comprise closely related sequence types (STs)4. Epidemic meningococcal disease is caused by a subset of ccs, the ‘hyperinvasive lineages’, which persist for decades and during geographical spread5,6. Notwithstanding their propensity to cause invasive disease, hyperinvasive meningococci must also be efficient at asymptomatic transmission and several theoretical frameworks have been proposed to explain their emergence and persistence, including ‘strain structure theory’ which posits that the invasive phenotype can be stable in highly transmissible lineages3.

Since 1905, many of the largest recorded epidemics of meningococcal disease have occurred in the sub-Saharan African Meningitis Belt, mostly caused by serogroup A meningococci belonging to cc1, cc4, and cc5 (A:cc1; A:cc4; and A:cc5; Figure S1)7. These meningococci have also been responsible for a number of pandemics throughout the 20th century8, with A:cc1 meningococci dominant in Africa until they were replaced by A:cc5 meningococci during the Hajj-associated epidemics of 1987, part of the third A:cc5 global pandemic. Since that time, A:cc5 organisms have caused successive epidemics in the meningitis belt8 (Fig. 1A), including a large outbreak in 1997–1998, with more than 250,000 cases9. A:cc5 epidemic across the belt in the early 2000s coincided with the emergence of the novel cc5 genotypes A:cc5:ST-7 and A:cc5:ST-2859, which were identical to the original genotype (A:cc5:ST-5) at their principal antigens10,11,12. These disease outbreaks in the African Meningitis Belt prompted the development and implementation of the serogroup A conjugate vaccine PsA-TT (MenAfriVacTM)13, a polysaccharide–tetanus toxoid conjugate vaccine which targets the polysaccharide capsule of serogroup A meningococci.

Figure 1: Geographic, genetic and antigenic diversity of 153 serogroup A isolates belonging to the ST-5 complex.
figure 1

(A) Global spread of the ST-5 complex in three successive pandemics (red: first pandemic wave; blue: second pandemic wave; green: third pandemic wave; yellow: ST-2859). (B) Allele-based phylogenetic network of 153 whole genomes of the ST-5 complex. (C) Pie-charts of allelic diversity of six major sub-capsular antigens across the 153 isolates. Most isolates had a highly consistent antigenic profile, with a single dominant allele found for each antigen (PorA: 97.4% = allele P1.20,9; FetA: 97.4% = allele F3-1; PorB: 92.1% = allele 3–47; NadA: 99.3% = allele 7; OpcA: 96.1% = allele 3; fHbp: 83.1% = allele 39). PorA, FetA, PorB, NadA and fHbp have all been shown to induce an immune response and deployed in various protein-based vaccines15,16. See also Table S7. The maps were created using mapchart.net (www.mapchart.net).

The reasons for repeated pandemics and epidemics of antigenically highly uniform but distinct serogroup A meningococci remain poorly understood. Such marked levels of genetic and antigenic uniformity are unusual among meningococci, which exhibit high levels of genetic and antigenic diversity14 and the global spread of A:cc5 meningococcal variants provides an opportunity to study the emergence of new pandemic lineages with limited genetic variation, in an organism which is usually highly diverse as a consequence of extensive HGT. Understanding this stability has important implications for the continued use of the PsA-TT vaccine, the success of which depends on the antigenic stability of hyperinvasive serogroup A meningococci13. Using a gene-by-gene whole genome MLST (wgMLST) approach, we analysed 153 sets of WGS data from A:cc5 meningococci representative of the various A:cc5 pandemics and epidemics, to elucidate the diversification of this lineage over time and to investigate possible genetic factors driving the evolution of distinct variants.

Results and Discussion

We refer to the three A:cc5 pandemics and the outbreaks caused by A:cc5:ST-2859 as four separate “epidemic waves”. There was allelic variation at 50.1% (999/1993) of loci across the 153 meningococcal genomes analysed, but within epidemic waves the genomes were highly uniform, with an average of 67.2% of loci exhibiting identical alleles within each wave, and 87.2% of variants exhibiting alleles which were identical across >90% of all alleles within each wave. Eight major immunogenic sub-capsular antigens, many of which have been used in licensed protein-based meningococcal vaccines15,16, were highly conserved, with >99% of the A:cc5 meningococci exhibiting identical alleles (Fig. 1C; Table S7). Previous studies identified differences in six antigens (transferrin-binding protein B, IgA1 protease, OpaB, OpaD, FetA) and the lgt gene (involved in lipooligopolysaccharide synthesis) between isolates of the second and third pandemics8,17,18,19, and differences in Maf adhesins and pilin glycosylation loci among ST-7 and ST-2859 isolates20. The fact that the A:cc5 meningococci isolates manifest identical alleles at loci encoding six major antigens, which are known to elicit protective immune responses as assessed by serum bactericidal antibodies21, and which are also known to vary among meningococci to avoid protective immunity in Africa and elsewhere22, suggests that the evasion of host immune responses is unlikely to have played a major role in the emergence of the epidemic waves in Africa.

Although the genes encoding the major immunogenic antigens were highly conserved, a number of allelic differences were observed among the epidemic waves: there were 39 loci with alleles specific to the first pandemic; 69 loci with alleles specific to the second pandemic; and 95 loci with alleles specific to the third pandemic (Tables S2–4). A total of 73 loci were specific to ST-2859 genomes, with 14 loci absent compared to ST-5 isolates (Fig. 1B; Tables S5–6). The variable loci were annotated as encoding: metabolic functions (40.0%); genetic information processing (16.2%); environmental information processing (12.3%); other functions (6.6%); antigenic genes (3.7%); and cellular processes (2.2%); and there was no characterised function for 18.3% of these loci (Fig. 2A). The variable loci were dispersed around the chromosome, but when plotted consecutively against their position in a reference genome, allelic changes at several contiguous loci were observed (Fig. 2B), suggesting the introgression of large genomic regions (of up to 16.2 kb) from other meningococci via HGT in the emergence of these strains.

Figure 2: Functional characterisation and location of alleles specific to each epidemic wave.
figure 2

(A) Functional characterisations of alleles specific to each epidemic wave. (B) Plots of successive allelic changes against their position in the reference genome for each epidemic wave, with an accompanying plot of these changes annotated on the circular chromosome of Z2491 (or WUE2594 for ST-2859). Letters indicate areas of allelic changes which are adjacent on the chromosome. (C) Genomic areas of putative recombination between ST-2859 and ST-11/ST-167 strains. Comparison plots show hypothetical donor strains on the bottom level (either reference strain FAM18, an isolate of serogroup C ST-11, or M12 240332, a serogroup Y ST-167 complex strain), ST-2859 on the central level (isolate ERR052831), and reference strain WUE 2594 (serogroup A, ST-5 complex) on the top level (representative of the recipient strain). The arrows signify putative areas of recombination, and correspond to higher sequence identity shared between FAM18/M12 240332 and ST-2859, than the ancestral WUE 2594 strain and ST2859.

These putatively introgressed loci were examined for possible sources of HGT by comparison to allelic variation recorded in the PubMLST.org/neisseria database. Exact nucleotide matches to recorded meningococcal isolates in the database were observed at eight contiguous groups of loci (areas A, C, D, E, F, G, H and I in ST-2859 isolates; areas G and H in second pandemic wave isolates; and area H in third pandemic wave isolates; see Table 1, Fig. 2C). Areas G and H from the second pandemic, as well as Area H in the third pandemic, had exact matches to large numbers of globally distributed isolates belonging to cc11 hyperinvasive meningococci23. The oldest matching cc11 isolates dated from 1964, several years in advance of the second A:cc5 pandemic, which was consistent with these introgressions originating in cc11. The regions of contiguous allelic changes within ST-2859 isolates contained exact matches to isolates belonging to multiple hyperinvasive clonal complexes. The alleles from seven of these regions were present in globally-distributed cc11 isolates and were universally present in a group of W:cc11 strains circulating in Burkina Faso and Niger in 2001 and 2002. These W:cc11 isolates therefore represent plausible relatives of the donor strains involved in the emergence of the A:cc5:ST-2859 strain in sub-Saharan Africa. This timescale is consistent with the phylogenetic analyses of Lamelas et al., who calculated that the ST-2859 lineage emerged in Burkina Faso in 200020. Area F of A:cc5:ST-2859 also matched isolates from another lineage typically expressing serogroup Y, Y:cc167, with the earliest allele identified in an isolate from 1940. These data are consistent with the epidemic A:cc5:ST-2859 strain arising from multiple introgression events involving several hyperinvasive meningococci in a short time, which accounts for the rapid accumulation of diversity and the long branch lengths in allele-based phylogenies (Fig. 1).

Table 1 Loci of putative recombined areas with sequence matches to other hyperinvasive clonal complexes.

Importantly these introgression events did not involve the major antigens, which are very different in these other lineages, as would be expected in an immune selection model. In addition, most of the introgressed alleles were present in older isolates in the PubMLST database (Table 1), with the earliest dating from 1937, demonstrating that the introgressed alleles are long-lived, having circulated over periods of several decades at least. HGT occurs frequently among carried meningococci3,14; however, given that the hyperinvasive meningococci represent a small minority of the carried population in Africa24, as elsewhere, the acquisition of multiple contiguous identical alleles by one hyperinvasive meningococcus from another, appears to be highly unlikely and is suggestive of a selective process. It is well established that rare HGT events, between and within Neisseria species, can be amplified in meningococcal populations by factors such as antibiotic25 use and immunological pressure26, and the data presented here are consistent with selection for tracts of metabolic gene variants within the A:cc5 genome.

The majority of putatively introgressed genes were annotated as having metabolic functions (62.5%, 25 out of 40 loci; 50% of all 50 loci, including those with unknown functions) (Figure S2). Differences in metabolic genes can contribute to the emergence of epidemic strains in a number of ways. First, there is increasing support for the idea that metabolic genes play important roles in pathogenesis and virulence in both meningococci27 and other bacterial pathogens28. Such metabolic adaptation could allow meningococci to exploit alternative host resources in invasive disseminated infections29, for example, and differences have been shown in the expression of metabolic genes between meningococci adhering to lung epithelial cells and growing in blood27,30,31. It is also plausible that differences in metabolic efficiency lead to differences in transmissibility among strains, as strains assimilate metabolites at different rates. Indeed, a recent study27 showed a significantly higher in vitro growth rate among meningococcal strains from hyperinvasive lineages compared to those from carried lineages.

Analysis of the predicted metabolic functions of the introgressed genes indicated that the largest category was energy metabolism (30%) (Figure S3). Notably, introgressed area G from isolates of the second pandemic encoded several subunits from the Na+-translocating NADH-quinone reductase complex, which carries out key redox reactions of the electron transport pathway, and are predicted to interact in the same functional network (Figure S4). Although we are not aware of any experimental data for the functional significance of Na+-translocating NADH-quinone reductase complex in N. meningitidis, there are experimental data from other Gram Negative pathogens suggesting that genetic variants of the Na+-translocating NADH-quinone reductase complex have different affinities for sodium ions in different species, thus influencing rates of NADH oxidation and energy transduction32. It is plausible that differences in energy metabolism among strains from the same species could result in differences in transmission phenotype. Given the importance of rapid growth of meningococci in the blood stream in the development of IMD in individuals, these changes also have implications for virulent phenotypes. Several of the other allelic differences were in loci assigned to genetic information processing functions (16.2%). Further, the functions of the genes in the genetic information processing category pertain to DNA replication, transcription, translation and repair, and it is therefore plausible that allelic differences may influence growth and replication rates33,34.

A predominant paradigm for changes in the frequency of epidemic clones is that antigenic genes vary over time by diversifying immune selection, while metabolic genes exhibit stabilising selection for conservation of function; indeed, the emergence of virulent strains in pathogenic bacteria has been associated with the import of, or mutation within, antibiotic resistance genes, antigens or virulence factors20. An alternate paradigm, however, posits that antigenic genes can be stable over time in non-overlapping combinations, as a consequence of between-host competition35. Such non-overlapping patterns have been observed among the PorA, FetA and Opa antigens of N. meningitidis, with many identical epitope combinations maintained over several decades5,36,37. By contrast, metabolic genes should evolve over time as a result of intense ecological competition within the host, so as to gain a competitive advantage against strains inhabiting the same antigenic niche: a phenomenon referred to as metabolic shift38. Our findings are consistent with mathematical models of immunological and ecological competition in N. meningitidis and also Streptococcus pneumoniae, which assume that metabolic differences between bacterial strains can lead to small differences in transmission fitness3,38. Simulations have shown successive replacement over time by strains with increasing metabolic fitness but similar antigenic properties.

Invasive disease is caused by a minority of meningococcal genotypes, the hyperinvasive lineages, which show strong temporal and geographic stability over decades and during global spread. Here, we show that HGT events among diverse long-lived hyperinvasive lineages (e.g. introgression into A:cc5:ST-5 from C:cc11:ST-11) can lead to the emergence of new epidemic strains (i.e. A:cc5:ST-2859). Our observations suggest that introgression events from one hyperinvasive lineage to another contribute to increased transmission fitness and/or increased virulence leading to the replacement of the previous variant. The introgressed genes, which constitute the majority of genetic differences among epidemic waves, encode metabolic functions, with many of the introgressed alleles present in other hyperinvasive isolates dating back several decades before the emergence of A:cc5:ST-2859.

Although immune escape may have played a minor role, our findings suggest that the most important events in the emergence of the A:cc5:ST-7 and A:cc5:ST-2859 epidemics were the acquisition of metabolic gene variants which have affected complex phenotypes, especially transmission fitness and possibly virulence. Such effects have been postulated in other pathogens38 and have potential implications for understanding epidemic bacteria and their prevention by vaccination as they indicate that large scale introgression events that alter the relationship between metabolic and antigenic types occur as a consequence of HGT. In principle such events could be induced by immune selection imposed by mass immunisation against principle antigens and on-going disease surveillance with genomic analysis is required to guard against such an eventuality.

Methods

The sequence reads for all ST-5 complex meningococcal genomes available as of 17/10/13 were downloaded from the European Nucleotide Archive (http://www.ebi.ac.uk/). The genomes were sequenced at the Wellcome Trust Sanger Institute in Cambridge, UK and the Institute for Genomic Sciences in Maryland, US (Table S1)20. Reads were assembled using an automated pipeline based on the Velvet algorithm, version 1.2.01. Annotation was carried out using the “autotagger” feature of the Bacterial Isolates Genome Sequence Database (BIGSdb) software39, which scans deposited sequences against defined loci in an automated BLAST process. The whole genome sequence data were compared using the BIGSdb Genome Comparator tool, which is implemented on the PubMLST website (www.pubmlst.org). The coding sequences within the annotation were extracted and compared against the reference strain Z2491 (accession number AL157959) using default parameters. Through Genome Comparator, unique allele sequences were labelled consecutively, allowing the identification of shared and unique alleles between isolates. Loci with alleles specific to each pandemic wave were identified, and functionally characterised according to the KEGG Orthology (KO) groupings of the KEGG database (www.kegg.jp). Genes with uncharacterised functions which did not fall into a KO category were blasted in the PFAM database (www.pfam.sanger.ac.uk), and functionally characterised accordingly. Genes without significant hits in the PFAM search were designated with an “unknown” function. Genome Comparator was run, in addition, with the ST-5 strain WUE 2594 (accession number FR774048) as a reference, to identify alleles specific to the ST-2859 strains.

Allele sequences were blasted against the PubMLST database to find the appropriate NEIS number and allele, which were then scanned in the database for sequence matches in other meningococci. The PubMLST database represents a large repository of whole genome data, with over 3700 whole genomes at the time of writing, from a variety of clonal complexes spanning a 78 year period. The Neighbour Net networks in Figures 1 and S1 were created using SplitsTree v440. Annotated plots of the genome (Fig. 2B) were created using the programme CG view41. The Artemis Comparison Tool42, funded by the Wellcome Trust, was used to create Fig. 2C.

Additional Information

How to cite this article: Watkins, E. R. and Maiden, M. C. J. Metabolic shift in the emergence of hyperinvasive pandemic meningococcal lineages. Sci. Rep. 7, 41126; doi: 10.1038/srep41126 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.