Keywords

1 Introduction

Campylobacter is one of the most common causes of foodborne infections worldwide (Kaakoush et al. 2015). To date, the genus Campylobacter includes 32 formally described species and 9 subspecies (Costa and Iraola 2019) and is part of the natural microbiota in the intestines of farm and wild animals (Altekruse et al. 1999). The most commonly known species are Campylobacter jejuni and Campylobacter coli that are mainly associated with campylobacteriosis in humans (Møller Nielsen 1997; Gillespie et al. 2002). Campylobacter lari, Campylobacter concisus and Campylobacter upsaliensis are less important for human gastrointestinal infections, but still can be frequently isolated from clinically relevant samples (Man 2011). Most notably, their multi-host lifestyles and ability for adaptation make C. jejuni and C. coli dangerous pathogens that are typically transmitted through the food chain (Oyarzabal and Backert 2012). Mainly spread through undercooked chicken meet or raw milk, these bacteria infect around 550 million people annually as reported by the World Health Organization (WHO), resulting in worldwide healthcare costs and economy loss of billions of dollars (Kaakoush et al. 2015).

Since the first complete genome sequence of the Campylobacter species C. jejuni published in 2000 (Parkhill et al. 2000), the functionality of whole genome sequencing (WGS) such as next-generation sequencing (NGS) or long read sequencing technology, namely Oxford Nanopore Technology (ONT) and Pacific Bioscience (PacBio), has massively improved. Time-consuming and low-resolution methods like pulsed-field gel electrophoresis (PFGE) (Yan et al. 1991; Potturi-Venkata et al. 2007) and flaA typing (Nachamkin et al. 1993) have been replaced by multilocus sequence typing (MLST) or whole/core genome MLST (wgMLST, cgMLST) that have since been frequently used for epidemiological studies (Tagini and Greub 2017). Instead of only analyzing a small part of the genome, e.g., a single gene (flaA typing) or MLST, which accounts for only 0.2% of the genome (Sheppard and Maiden 2015), wgMLST differentiates isolates by using all coding regions of the genomes incorporating hundreds of genes. This high discriminatory power even allows to link transmission events in epidemiological studies. Thus, high-throughput sequencing has become a time- and cost-effective method for typing, transmission-tracing, evolutionary analyses and surveillance of Campylobacter.

Besides comprehensive typing methods, NGS provides a broad range of possibilities to study genetic variations with respect to phenotypic difference. Powerful tools such as pan-genomic studies (Medini et al. 2005) or genome-wide association studies (GWAS), which were recently applied to microbial genomics (Falush 2016), allow very detailed correlation of the presence/absence and the allelic variants of all genes within a bacterial species population with specific phenotypes (see Sect. 4.1 below). These WGS-driven approaches enable researchers to effectively study the important aspects of host-specificity and adaptation of Campylobacter and help to understand the transmission and emergence of Campylobacter infections.

In this review, we give a broad overview of the historical evolution of Campylobacter and how the current population structure has been formed by niche adaptation together with inter- and intra-Campylobacter species recombination. Furthermore, we describe the huge potential of high-throughput and computational methods used to study relationships of Campylobacter strains in an agricultural and clinical environment that have provided new evidence regarding host and niche segregation.

2 Evolution Theory and Concepts for the Genus Campylobacter

In order to understand evolutionary and ecological processes within bacterial evolution, it is important to measure the molecular rate of mutations per replication event, also known as a molecular clock (Duchêne et al. 2016). The mutation rate of bacteria can be influenced by several different evolutionary processes such as selection pressure, genetic drift or the bottleneck effect that might play an important role in a host-adapted species like C. jejuni (Toft and Andersson 2010). The general approach of Ochman and Wilson (1987) to analyze the molecular clock is based on ancestral diversification calculated by 1% divergence in 16S rRNA nucleotides per 50 million years. Using this method, the divergence time of the genus Campylobacter was estimated to have started around 10 million years ago and clade formation of C. coli around 2.5 million years ago (Sheppard and Maiden 2015). However, Campylobacter was identified to evolve more rapidly than Escherichia coli and Salmonella Typhimurium, which have been used by Ochman and Wilson. Campylobacter has an unusually high rate of recombination, as horizontal gene transfer was estimated to generate two times more genetic diversity than de novo mutations (Wilson et al. 2009). Furthermore, bacterial lineages accumulate genetic substitutions more rapidly while they undergo adaptive evolution (Eyre-Walker and Keightley 2007). For all these reasons, Wilson et al. (2009) proposed a novel approach to estimate divergence in Campylobacter population by applying a more rapid rate of the molecular clock. They estimated the divergence of C. coli and C. jejuni to 6,580 years ago, with 95% confidence intervals (CI) of 3,580–12,400. This estimate fits within the time frame of the first domestication of wild animals during the agricultural revolution (Neolithic Revolution). The Neolithic Revolution started around 10,000–12,000 BC in the Middle East and spread to central Europe 3,000–5,000 BC, providing novel niches and possibilities to emerge for commensal and pathogenic bacteria (Mira et al. 2006). The divergence of C. coli into three distinct clades was estimated to 1,000–1,700 years ago, and clonal complexes of C. jejuni started to evolve 400 years ago (Fig. 1). This timeline indicates that the emergence of C. jejuni and C. coli as individual species is a very recent event compared to E. coli where the main population without members of related genera has been formed around five million years ago (Wirth et al. 2006).

Fig. 1
figure 1

Schematic representation of an evolutionary scenario of C. coli and C. jejuni (adapted from Sheppard et al. 2013a). C. coli and C. jejuni separated into two species. Due to different ecological niches C. coli differentiated into three clades (I–III) (Sheppard et al. 2008, 2013a). Recent recombination between strains from C. coli clade I and C. jejuni lead to the development of C. coli hybrid strains with substantial genomic introgression from C. jejuni (Sheppard et al. 2008; Golz et al. 2020)

Independent of the model used, it is clear that the clonal complexes and clade forming lineages separated after the ancestral split of the genus into these major species that currently play a significant role in clinical and foodborne diseases. However, the development of two distinct species did not force a strict recombination barrier between them (Sheppard et al. 2013a). While the speciation within the genus Campylobacter was probably triggered by the agricultural revolution thousands of years ago, methods of the modern food industry, globalization or environmental changes form novel evolutionary niches and selection pressure for bacteria in general (de Mazancourt et al. 2008; Van Alfen 2015; Caniça et al. 2019). In case of Campylobacter, there is evidence that C. coli started to converge toward C. jejuni due to a change in their ecology, e.g., by colonizing the same niche or host (Sheppard et al. 2008), which has been facilitating recombination between these species, as will be discussed below.

3 Population Structure

Since Campylobacter spp. have become more and more relevant for public health, high-throughput molecular typing plays an important role in surveillance programs and outbreak control. Most importantly, MLST and NGS provide a generic approach and, additionally, have a massive impact on understanding the population structure of Campylobacter. MLST is a generic scheme based on allelic variants from seven housekeeping genes used to classify bacteria into related or distant lineages (Maiden et al. 1998). C. jejuni and C. coli are characterized by the same MLST scheme which analyzes allelic variants of the same orthologous loci in both species, enabling the possibility of directly comparing the species with each other (Dingle et al. 2001; Miller et al. 2005). With the advent of high-throughput NGS, epidemiological studies made use of more detailed and complex schemes and methods developed for comparative genomics, which generated in-depth knowledge about the population structure of microbes. In this section, we will describe the population structure of both C. jejuni and C. coli that have an average nucleotide identity (ANI) of 85% (Fig. 2) (Dingle et al. 2005). Furthermore, we will give an overview of recombination events between these species, which resulted in the emergence of “hybrid” strains.

Fig. 2
figure 2

Graphical visualization of pairwise ANI values of C. coli and C. jejuni genomes. C. coli Clade I (yellow), Clade II (red) and Clade III (purple) are clearly separated based on ANI. C. jejuni (turquoise) and C. coli are distinct species, with approximately 85% ANI. Hybrid strains formed a separate cluster but were classified as C. coli based on 97% ANI, in contrast to 88% ANI between the hybrid strains and C. jejuni. Data were taken from (Sheppard et al. 2013a, b; Golz et al. 2020). ANI was calculated using FastANI (Jain et al. 2018) and visualized with pheatmap (Kolde 2015)

3.1 Diversity and Population Structure of C. Jejuni and C. Coli

C. jejuni is a natural part of the gut microbiota in a wide range of hosts such as chicken, cattle, pigs or wild birds and can also be found in environmental reservoirs such as water (Altekruse et al. 1999). This multi-host lifestyle is reflected by its broad diversity, which can even be detected by a low-resolution method like MLST, representing less than 1% of the genomic DNA in Campylobacter. Based on phylogenetic analyses (Fig. 3), resulting from a concatenated alignment of the genes used for cgMLST, C. jejuni forms a weak clonal complex structure (Dingle et al. 2001; Suerbaum et al. 2001). The clonal complexes CC-45 and CC-21 harbor the most relevant clinical and outbreak strains and are among the most prevalent isolates at PubMLST database (https://pubmlst.org/), with 24% and 9% of the entries, respectively, emphasizing their importance. Isolates belonging to these complexes are known to be “host-generalist” that can colonize cattle, chicken or human hosts (Manning et al. 2003; Dearlove et al. 2016). Their ability to switch rapidly between hosts makes them a dangerous threat for human health through consumption of contaminated milk and of undercooked chicken products. Geographical signatures in Campylobacter are relatively weak as “identical” host associated lineages emerge all over the world (Pascoe et al. 2017). However, the frequency of specific STs can vary between countries. For example, ST-22 has been identified in Finland (Revez et al. 2011), ST‐4526 in Japan (Asakura et al. 2012), and ST-190 and ST-474 were observed to emerge rapidly in New Zealand (McTAVISH et al. 2008; Mohan et al. 2013). Besides host generalists, repetivitve some lineages of host specialists can also cause human infections through food products. Those include CC-42 and CC-61 that are associated with cattle and sheep (Colles et al. 2003), and several different STs and CCs associated with chicken, including CC-257, CC-353 or CC-443 (Sheppard et al. 2011a, 2014). Other lineages such as CC-177 and CC-682 can be isolated from wild birds and water, causing the so-called water-born Campylobacter infections (Colles et al. 2009; Mohan et al. 2013). C. jejuni also shows a high level of diversity within the same barn or herd—e.g., isolates belonging to more than 10 distinct CCs have been found within a single chicken flock (Colles et al. 2008; Vidal et al. 2016). However, C. jejuni CCs may be subject to a strong recombination barrier even if they colonize the same host (Sheppard et al. 2014). This might be forced by a niche separation within the same host, due to subsequent colonization events at different time points, which limit the horizontal gene transfer (Sheppard and Maiden 2015).

Fig. 3
figure 3

Core genome-based phylogeny of C. coli, C. jejuni and hybrid strains. C. jejuni (turquoise) shows a diverse lineage-specific population structure with CC-21 and CC-45 (both host generalists), CC-42 and CC-61 (predominantly isolated from cattle), and CC-353 (from chicken). C. coli shows a three-clade structure with Clade I (yellow: from clinical- and farm-related sources), Clade II (purple) and Clade III (red: both from waterfowl and water samples). Clade I mainly consists of CC-828. Hybrid genomes with high DNA introgression from C. jejuni are colored in blue. Data were taken from (Sheppard et al. 2013a, b; Golz et al. 2020), and the phylogenetic tree was created with FastTree v2.1 (Price et al. 2010) based on 874 core genes including 123,227 variable sites

Even isolates assigned to the same ST based on the seven housekeeping genes can vary to great extent in their genetic diversity. For example, 16 strains assigned to ST-45 that were isolated during an outbreak in Finland formed three distinct strain clusters in wgMLST. Out of approximately 1200 shared loci, these clusters differed from each other by alleles in 293, in 414, and in 453 loci, respectively, indicating the presence of clearly different strains. In contrast, within the individual strain clusters the genomes differed by between zero and eighteen loci, suggesting clonal descent of those isolates (Kovanen et al. 2014). The other frequently isolated STs from this outbreak, including ST-230, ST-267 and ST-677, showed a maximum of 40 different alleles among genome clusters within each ST (Kovanen et al. 2014).

In contrast to C. jejuni, C. coli forms three distinct clades (I-III) (Figs. 1, 2 and 3), colonizing different ecological niches. Isolates from clade I are generally associated with an agricultural origin, whereas isolates belonging to clade II or clade III can most likely be found in environmental sources like water (Sheppard et al. 2008, 2013a; Skarp-de Haan et al. 2014). To date, around 81% of the genotyped isolates included in the PubMLST database belong to clonal complex CC-828 of clade I, reflecting the clinical relevance and industrial importance of this lineage (Miller et al. 2006; Thakur et al. 2006; Cody et al. 2012; Nohra et al. 2016). The second-most predominant clonal complex, also part of clade I, is CC-1150, comprising around 5% of C. coli isolates submitted to the PubMLST database. Clade I has a lower rate of diversity compared to C. coli clade II, to C. coli clade III, or to the general population structure of C. jejuni (Duim et al. 1999; Dingle et al. 2005; Sheppard et al. 2010b). The relatively low variation within the housekeeping genes as well as the lack of a proper lineage separation, especially in clade I, indicate the effect of a recent bottleneck and thus an early phase of lineage separation in the C. coli population (Sheppard et al. 2010b). Due to the distinct ecological niches, an ecological recombination barrier might have led to the development of three clades in C. coli (Sheppard et al. 2010b). However, recombination between C. coli clade I and C. jejuni resulted in hybrid strains (Figs. 1 and  3), as has been shown in several studies (Sheppard et al. 2008, 2013a; Sheppard and Maiden 2015; Golz et al. 2020).

3.2 Inter Species Recombination and Hybrid Species

Bacterial evolution is highly influenced by horizontal or lateral gene transfer (HGT or LGT) through transformation, transduction or conjugation. For recombination events, one has to distinguish between DNA introgression of complete genes or gene loci and intragenic recombination between loci leading to new mosaic allelic variants. Mosaic alleles consist of sequence content derived from different evolutionary and ancestral backgrounds (Smith 1992). As previously mentioned, early inter-species recombination, especially between C. jejuni and C. coli, plays a major role in the evolution of the genus Campylobacter, which might compensate for the small genome size of this genus (Suerbaum et al. 2001). Indeed, about 18.6% of the allelic variants of the seven MLST genes in C. coli exhibit C. jejuni ancestry, whereas just 2.3% of C. jejuni alleles were acquired from C. coli, indicating asymmetric gene flow between the two species (Sheppard et al. 2008). A more detailed analysis of the mosaic ancestry patterns among the seven housekeeping genes revealed an average inter-species gene flow of around 8.3% from C. jejuni to C. coli clade I, but less than 0.5% from C. coli clade I to C. jejuni (Sheppard et al. 2011b). Even in C. coli clade I, the genome-wide DNA introgression rate differs substantially among the predominant clonal complexes. CC-828 showed an overall introgression of approximately 10% whereas CC-1150 was found to contain up to 23% of its genome acquired from C. jejuni in agriculture-associated samples. Recombination mainly happened in agriculturally relevant isolates rather than in non-agricultural C. coli isolates and thus might be an important adaptation and niche aggregation factor. In C. coli clade II and clade III, genome-wide recombination with C. jejuni played a minor role as those isolated had only 0.2–1.2% inferred C. jejuni ancestry (Sheppard et al. 2013a).

Apart from the single allele exchanges, it is possible that multiple loci in the genomes have been exchanged between C. jejuni and C. coli. This would lead to the appearance of several hybrid strains (Fig. 1) that cannot clearly be identified by routine polymerase chain reaction (PCR) typing with single species differentiation marker genes and need to be investigated further by WGS. Several of such untypeable Campylobacter strains were isolated from egg shells of chickens in Germany (Golz et al. 2020). These isolates showed a DNA introgression of up to 15% from C. jejuni. However, they were still identified as C. coli as they exhibited 97% average nucleotide identity with C. coli clade I, but only 88% ANI with C. jejuni (Fig. 2). Furthermore, detailed genome analysis provided evidence that these recombination events are not distributed randomly across the chromosome. Instead, they particularly affect genes that are involved in general stress response, in DNA repair and in cell wall synthesis mechanisms and thus might enhance the fitness of C. coli for survival under harsh environmental conditions.

3.3 Additional Species

C. jejuni and C. coli are the most prevalent species concerning food contamination and clinical Campylobacter infections. Besides these, 13 additional Campylobacter species, sporadically causing clinically relevant symptoms, have been summarized (Costa and Iraola 2019). In the following subsection, we exemplarily describe the population structure of C. lari, C. upsaliensis and C. concisus that are frequently found in gastroenteritis patients (Man 2011).

C. lari is usually found in coastal regions and marine environments. It is mainly associated with shorebirds, like gulls, albatrosses, redshanks, to name a few, but also with marine mammals and shellfish, and occasionally causes gastroenteritis infections (Costa and Iraola 2019). However, the species definition of C. lari is an ongoing process, and several C. lari-like species have been described, including Campylobacter insulaenigrae, Campylobacter peloridis, Campylobacter subantarcticus and Campylobacter volucris. In 2009, C. lari was divided into two subspecies, namely C. lari subsp. lari and C. lari subsp. concheus (Debruyne et al. 2009). All C. lari and C. lari-like species are summarized as Campylobacter lari group (Miller et al. 2014).

C. concisus colonizes the human oral cavity and consists of two genetically distinct genomospecies (GS1 and GS2) that cannot be distinguished on the phenotypic level despite DNA binding values of only 42–50% in DNA-DNA hybridization experiments (Vandamme et al. 1989; Aabenhus et al. 2005). However, both genomospecies include multiple strains that have been isolated from healthy as well as diarrheic patients, which makes it difficult to make a general assumption on its pathogenicity (Chung et al. 2016). In particular, C. concisus GS2 seems to be more pathogenic as it is more often isolated from clinical patients with bloody diarrhea (Kalischuk and Inglis 2011). In addition, a recent study discovered novel genomic markers and a specific plasmid which are associated with C. concisus GS2 from patients suffering from Crohn’s Disease (Liu et al. 2018).

C. upsaliensis is commonly found in domestic animals like cats and dogs (Goossens et al. 1990), but has also been isolated all over the world from clinical cases of bloody diarrhea (Bourke et al. 1998). This Campylobacter species is closely related to C. coli and C. jejuni based on 16S rRNA comparison (Vandamme et al. 1991). In contrast to C. concisus, C. upsaliensis shows a homogenous population structure with 80–96% DNA-DNA hybridization between strains (Sandstedt et al. 1983), even though it possesses a high degree of diversity on a genotypic level (Lentzsch et al. 2004). Besides this, little is known about the emergence of C. upsaliensis, which needs to be investigated in further studies.

4 Host Association of Campylobacter

Comparative genomic methods not only had a major influence on our understanding of population structures, but also advanced our knowledge and understanding of host adaptive mechanisms of Campylobacter. Besides the MLST and cgMLST schemes, (pan-genome) approaches and genome-wide association studies have opened the door for large-scale genome analyses of these traits.

4.1 Impact of Genomic High-Throughput Methods

Pan-genomic analyses have become powerful tools to study a variety of bacterial species (Rouli et al. 2015). The term “pan-genome” describes the entire set of genes composed of core and accessory genes within a bacterial population. Genes that occur in at least 99% of the population are marked as core genes whereas accessory genes only have to occur at least once in the population. Core genes mostly encode proteins that are involved in housekeeping functions of the organisms. Accessory genes on the other hand can have an adaptive function toward a specific environment or selection pressure and are usually acquired by HGT. Therefore, it is highly probable that these parts of the genome are involved in niche or host adaptation of Campylobacter. CgMLST and wgMLST make use of the concept of pan-genomes and establish a novel typing scheme for bacterial strains that, in contrast to MLST, includes all core genes of a species and thereby provides a high resolution by comprising the whole genetic diversity (Sheppard et al. 2013b). Similar to the MLST scheme for C. jejuni and C. coli, the cgMLST scheme combines C. jejuni and C. coli and utilizes 1343 gene loci to describe the genetic variation among the strains (Cody et al. 2017).

Due to decreasing costs in WGS and a subsequent increase in bacterial genome sequencing, the concept of GWAS has emerged in the field of microbial genomics (Chen and Shapiro 2015; Lees and Bentley 2016). GWAS is a statistical concept to compare two different phenotypes in order to identify trait-associated genomic compounds. This can be generally used to analyze epidemiology-, resistance- or, in case of C. jejuni host-related determinants based on WGS data. Different methods have been developed to apply this method either on entire genes, k-mer (word of length k), or single nucleotide polymorphism (SNP) level to bacterial populations. In comparison to GWAS tools that are made for human genetic research, these take into account the clonal and lineage-related phylogenetic structure of bacterial populations (Brynildsrud et al. 2016; Power et al. 2017). In order to investigate the host association of C. jejuni, a couple of GWAS have been applied in this field of research, mainly for the clinically relevant lineages CC-21 and CC-45 (Sheppard et al. 2013b; Yahara et al. 2017; Thépault et al. 2017; Buchanan et al. 2017). These complexes contain isolates from different hosts of predominantly avian and ruminant origin. Thus, these strains need to adapt frequently to varying environments. For example, chicken and cattle hosts substantially differ in their body temperature, pH level or in the microbiome of their digestive tract. In addition, bacterial cells are exposed to oxidative stress outside the host gut (Kim et al. 2015) during transmission to a new host. Intentionally, many of these studies used a gene-by-gene approach (Yahara et al. 2017; Buchanan et al. 2017), whereas others also keep in mind that core genome adaptation might play a role in host adaptation, especially in host-adapted lineages. Therefore, a k-mer approach (Sheppard et al. 2013b; Lees et al. 2018) can not only be applied in order to detect the presence of entire genes but also to identify specific alleles of core genes that may be involved in host adaptation.

4.2 Source Attribution in Clinical and Agricultural Setting

Host-adapted clonal lineages can be observed in several different bacterial pathogens, such as C. jejuni, Staphylococcus aureus or Salmonella enterica on different genetic levels (2010a, 2011a; Weinert et al. 2012; Hayward et al. 2016; Sheppard et al. 2018). Gene sets of these lineages are affected by several factors, including the host, the composition of food, and by antibiotics and interactions with the host microbiome that can either lead to a temporary or to a permanent adaptation. Genetic mechanisms like DNA replication errors that lead to point mutations, insertions, deletions or recombination events may result in rapid adaptation and the formation of host-specific lineages. In general, Campylobacter species are distributed differentially among livestock animals; C. coli is dominant in pig-associated samples (Thakur et al. 2006) whereas C. jejuni is more abundant in cattle and chicken hosts. Additionally, there might also exist a geographic factor. For example, Campylobacter cases in France are more likely to be caused by isolates from ruminant hosts than in other countries (Thépault et al. 2017).

Several studies investigated host adaptation, especially from C. jejuni, as this species shows a well-defined lineage separation based on MLST data that distinguish the population into host-specialist and host-generalist clonal complexes (Sheppard et al. 2014). Several colonization studies revealed that modification and differential transcription of motility genes in C. jejuni play a key role in adaptation and transmission (Hermans et al. 2011; de Vries et al. 2017; Ren et al. 2018). These data were supported by in vitro experiments as well as by genomic data and by RNA sequencing. Apart from traditional WGS analysis, the novel concepts of GWAS provided great in-depth knowledge about host adaptation, colonization and clinically relevant factors of C. jejuni. The group of Sheppard and co-workers discovered multiple genes involved in vitamin B5 biosynthesis and iron uptake within cattle-related strains of the CC-45 complex by applying a k-mer-based GWAS (Sheppard et al. 2013b). These genes might be related to different nutrition of cattle host in comparison to poultry. Independently, the same genes have also been detected within a set of 25 diagnostic marker genes by a pan-genome approach leading to the identification of clinically relevant C. jejuni isolates with up to 90% accuracy (Buchanan et al. 2017). However, even strains of the clinically relevant complexes CC-21 and CC-45, isolated from poultry processing chains, show substantially different genotypes and carry different genes involved in lipooligosaccharide synthesis (kpsC, kpsD), metabolic processes (glmS), oxidative stress response (nuoK and fumC) as well as genes involved in nucleotide salvage (cj1377c) and antimicrobial resistance like efflux proteins (cj1375) (Yahara et al. 2017). A pan-genomic approach by Thèpault and co-workers identified 15 additional host-segregation markers in C. jejuni isolates from France that might aid to determine the source of clinical cases. Those genes are mainly involved in metabolic processes and nucleotide metabolism. These markers had been utilized to trace back the source of C. jejuni infections with an average accuracy of 80.7% for chicken-induced cases and of 68.2% for ruminant-caused cases (Thépault et al. 2017). While numerous studies have focused on the source of Campylobacter infections, less work has been dedicated to understanding the genetic mechanisms behind livestock- and environment-specific STs in chickens, cattle or water sources. However, this might generate valuable insights into the evolution and relevant host-specific factors of Campylobacter in order to deal with the spread and contamination in livestock environment and further understand the process of adaptation toward clinically relevant pathogens.

4.3 Relevance for Public Health (Applications)

WGS has not only improved our general understanding and knowledge of bacterial populations, adaptations and recombination to date, but is also an important part of routine high-throughput diagnostics for hospitals, for animal husbandry and for surveillance programs of foodborne diseases (Gerner-Smidt et al. 2019). Due to these programs, it is possible to detect a sudden increase in case numbers within a specific time interval. When applying WGS-driven approaches to outbreak detection and source tracking, it is important to distinguish between geographically restricted point-source outbreaks and clusters of cases that are not necessarily related to each other geographically (Llarena et al. 2017). Most outbreaks are diffuse and show a spatial and time-dependent clustering of Campylobacter genotypes or subtypes within livestock and clinical cases (Llarena et al. 2017). These outbreaks can spread across several countries, but can be linked to contaminated food products with a low level of contamination. The difficulty in detecting these outbreaks is to be able to distinguish them from sporadic Campylobacter cases and to handle the high rate of genetic exchange and recombination within the species (Llarena et al. 2017). This might be achieved by WGS-based molecular characterization in combination with wgMLST or cgMLST that provide the necessary resolution for the genomic comparisons of closely related strains (Deurenberg et al. 2017). For example, a recent wgMLST-based study on the genomic diversity of C. jejuni isolates from Israel detected 29 diffuse clusters of genetically related strains that have shown a low variance in allelic differences (Rokney et al. 2018). Importantly, this study further identified adapted clones that kept causing infections over the span of several years. Another study from Finland showed that C. jejuni infections, which increased during the summer, were mainly related to three STs with 16 to 37 allelic differences between the cluster, and thus, due to the short period of time, probably belonged to the same source (Kovanen et al. 2014).

In addition to such diffuse outbreaks, point-source outbreaks can also occur; however, those are less frequent and are usually locally restricted. They are mostly related to restaurant meals (Glashower et al. 2017), canteen food (Moffatt et al. 2016) or farming communities (Forbes et al. 2009) and are associated with a high level of contamination within the food products. An appropriate methodology to identify these outbreaks is based on single nucleotide variants (SNVs), because diversity in general should be low and resulting in only a small amount of allelic variants. This approach has been successfully applied in several studies. Moffatt et al. showed a high level of identity in a chicken-related outbreak in Australia with two different genotypes with a SNV difference of only 3–8 SNPs and 30 SNPs, respectively. Additionally, several studies conducted by Revez and colleagues (Revez et al. 2014) demonstrated how wgMLST can be applied in outbreak investigations and source tracing. Patient isolates from milk-born outbreaks shared 1432 loci with isolates from a milk source and only showed three SNPs difference between the strains. Just like for many other bacterial pathogens, WGS-based methods provide a great benefit for Campylobacter-related public health applications. However, in contrast to other bacteria, the high species diversity of Campylobacter within the same host often requires an adapted approach.

5 Concluding Remarks

The species of genus Campylobacter show a very individual population structure ranging from less clonal diversity to strictly separated clonal lineages. Horizontal gene transfer and recombination events may occur at various levels within the individual population but also between the Campylobacter species. Even “hybrid” strains exist that contain large proportion of genomic elements from two species. Modern next-generation sequencing-based methods paved the way for high-resolution molecular typing of outbreak and disease-related strains by applying a standardized typing scheme based on the whole core genome and, additionally, the pangenome. Further, they also allowed the identification of genomic factors that contribute to host adaptation of individual lineages on the gene and allele level and to trace the source of several Campylobacter lineages. This contribution to tracing and unraveling transmission and infection chains results in important public health applications to contain this important zoonotic pathogen.