Seasonality and the effects of weather on Campylobacter infections
Campylobacteriosis is a major public health concern. The weather factors that influence spatial and seasonal distributions are not fully understood.
To investigate the impacts of temperature and rainfall on Campylobacter infections in England and Wales, cases of Campylobacter were linked to local temperature and rainfall at laboratory postcodes in the 30 days before the specimen date. Methods for investigation included a comparative conditional incidence, wavelet, clustering, and time series analyses.
The increase of Campylobacter infections in the late spring was significantly linked to temperature two weeks before, with an increase in conditional incidence of 0.175 cases per 100,000 per week for weeks 17 to 24; the relationship to temperature was not linear. Generalized structural time series model revealed that changes in temperature accounted for 33.3% of the expected cases of Campylobacteriosis, with an indication of the direction and relevant temperature range. Wavelet analysis showed a strong annual cycle with additional harmonics at four and six months. Cluster analysis showed three clusters of seasonality with geographic similarities representing metropolitan, rural, and other areas.
The association of Campylobacteriosis with temperature is likely to be indirect. High-resolution spatial temporal linkage of weather parameters and cases is important in improving weather associations with infectious diseases. The primary driver of Campylobacter incidence remains to be determined; other avenues, such as insect contamination of chicken flocks through poor biosecurity should be explored.
KeywordsCampylobacter Time series Temperature Rainfall Climate change Environmental health
Comparative Conditional Incidence
Generalized structural time series model
Geographic Information System
Lower Layer Super Output Areas
Public Health England
Strategic Health Authorities
Ward’s Minimum Variance Clustering
The distinctive seasonal pattern of Campylobacter incidence has suggested that seasonal changes in the environment contribute to this pattern. The examination of the numbers of Campylobacter in the small intestine and caeca of chicken over a 12-month period showed significant seasonal variation in the carriage rate which was associated with sunshine and temperature . The analysis of surveillance data from Europe, Canada, Australia and New Zealand found a lack of clear and consistent association with rainfall, humidity, sunshine, and temperature . The analysis of Campylobacter data from Denmark, Sweden, Austria, Finland, New Zealand, Scotland, and Wales showed consistent seasonal patterns of infection with more cases in the summer, but with cases peaking at different times of the year . Campylobacter rates in England and Wales were associated with temperature, sunshine, and rainfall, with temperature being the most significant variable and a 5% increase in Campylobacter cases was associated with a 1 °C rise in temperature to a maximum temperature of 14 °C [4, 5, 6].
An examination of the relationship between Campylobacter cases (along with Salmonella and pathogenic Escherichia coli) in Canada found an increase in cases of 2.2% for every degree of increase in temperature, with a low temperature threshold of 10 °C; another Canadian study modelled Campylobacter and temperature and found an association with temperature above a threshold of 10 °C [7, 8]. Campylobacter incidence in Massachusetts closely followed the peak in annual temperature with a lag of 2–14 days, and in Georgia (US) a strong association was found between temperature and Campylobacter infections [9, 10].
The impact of weather parameters on Campylobacter differed between temperate and sub-tropical areas of Australia . A spatial examination of Campylobacter in New Zealand found no association with climate variables. Temperature did not appear to drive the seasonality, and there was little association with rainfall [12, 13, 14]. Jore et al.  showed a relationship between temperature with both human infections and chicken colonisation in six European countries. There has also been work in Switzerland suggesting the seasonality of Campylobacter is partly explained by contamination of chicken broilers .
While an association with weather in general, and temperature in particular, appears to be a relatively common finding in these studies, nevertheless the link is not as clear, consistent and direct as with Salmonella; nor does it have a well-defined mechanism. Nichols  and Ekdahl et al.  suggested that flies might explain the epidemiology of Campylobacter, through the direct transfer of contaminated material from raw meats or faeces to ready-to-eat foods, and/or by the seasonal contamination of chicken flocks [19, 20, 21]. Nichols  proposed a fly transmission model that explained the seasonal distribution by the impact of temperature on non-biting fly numbers, as measured by the development time of Musca domestica larvae. Evidence suggested that the use of fly screens on chicken production houses could reduce the seasonal contamination of chickens and subsequent human disease significantly . Similar responses to temperature in both chicken broiler carcase contaminations with Campylobacter and human infections with Campylobacter cases in six different European Countries, implying that the seasonal relationship between human infection and temperature is more likely a result of exposure to chicken .
One of the additional complicating factors in the Campylobacter transmission in most developed countries is the significant percentage of cases related to foreign travel, and the seasonality of this additional factor [22, 23]. Hartnack et al.  highlighted the necessity of quantifying the transmission of Campylobacter from broiler to humans and to include climatic factors in order to gain further insight in its epidemiology. Sterk et al.  used a runoff simulator to analyse the effect of climate change on Campylobacter (along with Cryptosporidium) runoff and human infection risks in the Netherlands; and found that climate change has little overall impact on runoff of Campylobacter (along with Cryptosporidium) from land to the surface waters.
Within climate change research, there are three reasons for undertaking this work. Firstly to demonstrate evidence of the potential short-term consequences on human health from climate change; secondly to understand what factors influence the occurrence of infectious diseases; and thirdly to provide models which can forecast future disease risks. The incidence of Campylobacter in the UK demonstrated both a strong and regular seasonality, and a variation in the number of cases per year [4, 6, 26]. This raises a question of whether the seasonal patterns of Campylobacter have changed over time as a result of climate change and other environmental factors.
Campylobacter surveillance data were extracted from the Second Generation Surveillance System (SGSS) database of Public Health England from 1989 to 2014. A case was defined as a human faecal sample submitted from a patient that was positive for Campylobacter. All Campylobacter surveillance data were laboratory-confirmed cases and were predominantly based on the isolation of Campylobacter from faeces using selective culture media in a micro aerobic environment; membrane filter, PCR or other approaches made up a very small percentage of cases. A few sentinel laboratory sites conducted screening using a range of PCR targets during the 2012 Olympic Games. Standard antimicrobial testing methods, used in primary diagnostic laboratories, were predominantly disc diffusion methods. Where speciation was reported, it was conducted in diagnostic laboratories using conventional phenotypic methods . Most cases were thought to be symptomatic and included patients with extra intestinal infections . The symptoms were those recorded on the physician request form and generally are in the simplistic form of “D & V” or something equivalent.
The meteorological data were supplied by the Met Office and held on the Medical and Environmental Data a Mash-up Infrastructure (MEDMI) platform (see https://www.data-mashup.org.uk).
Reported cases between 2005 and 2009 were linked to local weather parameters at the laboratory postcode locations; namely: maximum, minimum and average daily temperature and daily rainfall in 30 days before the specimen date using a previously published approach that prevented patient identification . The linkage was conducted on laboratory postcode because this was more complete than patient residential postcode and preserved patient confidentiality. An additional study  validated that the weather parameters at the laboratory postcode correlate well with the weather at the domestic postcode of cases.
Comparative conditional incidence
GIS analysis was performed on cases with Lower Layer Super Output Areas (LSOA) using ArcGIS 10.2 for Desktop by ESRI. A map of Campylobacter incidence between 2005 and 2009 from patient’s residential postcode data was produced. Some areas had poor residential postcode completeness resulting in areas with an apparent lower incidence. Comparative conditional incidence (CCI) is a new approach which uses cases and LSOA populations to examine differences between disease incidence at different temperature and rainfall in the 30 days before the specimen date within equal sized datasets. CCI is a sum of cases per week within all LSOAs with same temperature divided by sum of population in the same LSOAs. It is conditional as the weather variables were available for the weeks and LSOAs where there were positive cases and excluded patients without a residential postcode as it was not possible to establish the LSOA.
To detect potential time-varying seasonality in Campylobacter that might result from Salmonella control measures in chicken production, such as improved biosecurity practices, we used wavelet analysis. Campylobacter daily counts between 1989 and 2009 were adjusted from the day of the year using a seven-day moving average and systematic adjustments for the reduced reporting over bank holidays and for long-term trends as previously reported . The analyses were conducted in R using ‘waveletComp’ .
Generalized structural time series model
Campylobacter cases across ten geographical regions in England and Wales were examined using the generalized structural time series model (GEST)  with a negative binomial distribution, where the natural logarithm of the expected cases of Campylobacter was decomposed into baseline, trend and seasonal components. To investigate the effect of weather variables on the development of Campylobacter cases, explanatory variables of the average temperature (°C) and the total rainfall (mm), of one, two, three and four weeks before diagnosis were included in the model. The temperature variable was examined using fixed and varying-coefficient analyses in order to account for variations in the relationship through the year.
Methods, time, geography, and reason for choice in the analyses
Reason for choice
GIS incidence mapping
Lower Layer Super Output Areas (LSOAs)
Individual resident postcodes (required for mapping by population) were not available before 2005.
Local linkage of cases and weather
Local linkage of cases and weather variables through the laboratory postcode was available from 2005 to 2009 in ten geographical regions.
Comparative conditional incidence
Lack of negative cases, positive cases were linked to weather variables from 2005 to 2009.
Local linkage of cases and weather data through the laboratory postcode was available from 2005 to 2009 in ten geographical regions.
England and Wales
The cases were not linked to weather variables. Health and weather data were analysed separately.
Ward’s minimum variance clustering & GEST
Strategic Health Authorities postcode
To increase the number of regions from ten to thirty-three sub-regions and determine the clustering of geographic similarities of the seasonality. Cases were not linked to weather variables.
The CCI was estimated using datasets that split cases into ten equal parts by temperature and rainfall. The highest CCI values were found for the minimum temperature (9 °C) two weeks before the specimen date, average temperature (16 °C) two weeks before the specimen date and maximum temperature (21 °C) two weeks before the specimen date (Fig. 1c). The cases at the height of summer when temperatures were highest had lower cases than a few weeks earlier. Between average temperatures of 5 °C and 15 °C, there was a 0.09 case increase in the incidence per 100,000 per week (Fig. 1d). It is unclear how this relates to the actual incidence as the LSOA per weeks that did not have a Campylobacter case which could not be included. Using four week periods, average temperatures two weeks before and splitting the data for each time period into two equal halves by temperature, demonstrated that the relationship between temperature and incidence in each time period differed, forming a cyclic loop (Fig. 1e). For weeks 17 to 20 and 21 to 24 there was an increase in the CCI of 0.175 cases per 100,000 per week for each degree increase in the temperature two weeks before (Fig. 1e). A similar approach using average rainfall over the previous four weeks showed no consistent links with the CCI (Fig. 1f). Adjusting the number of data points to give larger or smaller percentages of the data in each point affected the R2 value of the CCI on all data, with optimum results for 200 data points or less (Fig. 1g).
GEST with single coefficients for temperature and rainfall found a statistically significant association between Campylobacter and temperature of two weeks before the case diagnosis (see Additional file 1: Table S1). The relationship between Campylobacter and temperature was positive, as for every degree increase in average weekly temperature (°C) of two weeks before the specimen date there is an increase of 0.73% in the mean number of Campylobacter cases. For rainfall, there was a slightly significant association between Campylobacter and total rainfall of one week before the case diagnosis. The relationship was negative, as for every millimetre increase in total rainfall of one week before the specimen date there is a decrease of 0.24% in the mean number of Campylobacter cases.
Additional file 1: Table S2 provides clear evidence for an association between Campylobacter and average, maximum and minimum temperatures of two weeks before the case diagnosis in most geographical regions of England. Wales did not show significant association. All the regions showed statistically significant increasing trend in cases except the East of England. Additional file 1: Figure S1 shows the weekly cases of Campylobacter in England and Wales with the fitted mean in red (panel a), and the decomposition of the fitted mean into fitted trend (panel e), fitted seasonality (panel f), fitted linear effect of average temperature (panel c) and fitted linear effect of total rainfall (panel d). Expected cases of Campylobacter increased with the average temperature two weeks before diagnosis above a threshold of 11 °C. The relative change in the expected cases of Campylobacter due to temperature and rainfall in England and Wales were 7.7% and − 1.9% respectively and the effect of seasonality was − 3.4% (see Additional file 1: Table S3).
The Ward’s minimum variance clustering of the fitted seasonality of the GEST showed three main clusters: metropolitan (London and Birmingham), rural (Wales and the North East), and other geographical areas (Fig. 4g). When cases were plotted by week of year for the three clusters and the same time period, the seasonal distribution differed significantly (Fig. 4h).
The epidemiology of Campylobacter in humans remains a subject of importance because it is the commonest bacterial cause of diarrhoea in most developed countries, and because intervention requires an understanding of disease transmission pathways and possible association with climate and other environmental change. Within the context of climate change research, the approaches to linking weather parameters with infectious disease data commonly rely on an assumption that the relationships are continuous throughout the year, sometimes assuming a threshold value above which the continuous relationship changes. Here, we used different methods to demonstrate that Campylobacter incidence changes dynamically through the season, and we were able to decouple the relationship of temperature from the seasonality. Most notable was the very steep increase in the number of cases diagnosed between weeks 19 and 24, although there was still a relationship to temperature before this (weeks 1-18) as well as later in the year (weeks 29-53). The distribution was also consistent with the direct and/or indirect fly transmission hypotheses [17, 18, 19], with the fly populations increasing during the late spring period resulting in transmission passing from faeces or other contaminated material to either food and/or chicken flocks (as a result of poor biosecurity). The expression of weekly incidence by the temperature during that week in an LSOA and averaged for all weeks with one or more Campylobacter cases, gives a means of examining the relationship between case numbers and temperature that is partially decoupled from seasonality and geography.
The GEST provides a useful approach for modelling over-dispersed counts with a negative binomial distribution. It allows for the extraction of hidden trend and seasonality and estimating the effect of weather parameters. The use of multilevel factors for temperature separated into 13 four week periods allowed the contribution of temperature two weeks prior to the diagnosis to be estimated, explaining that 33.3% of the contribution to the annual burden of Campylobacter (calculated as relative change in the predictive mean) was attributable to temperature, compared to 7.7% using a single coefficient value. The remaining regular seasonality across the five years was small, and was limited to holiday periods, perhaps reflecting surveillance bias. The contribution of rainfall to the incidence of Campylobacter was small. While being able to predict cases from the temperature in a much more complete way than the GEST model with a fixed coefficient, the GEST with the varying coefficient model did not provide an explanation of the causes of these changes, and suggested that other drivers linked to temperature were important.
When the GEST model was applied to weekly number of cases of Campylobacter from 1989 to 2009 in thirty three Strategic Health Authorities, it allowed the fitted seasonality to be used to group sub-regional SHAs into three geographical clusters.
The distribution of cases across geographically distinct sub-regions demonstrated strong differences in the strength of the seasonal pattern, but not the week when cases increase. The seasonality was strongest in Wales and the North East, and weakest and a bit more delayed in the large urban areas of London and Birmingham, with other geographic areas distributed in between. This implies that there are local factors that also influence Campylobacter disease occurrence. These could be demographic (e.g. more young people in cities), different risk profiles (e.g. more exposure to manures in the countryside; greater foreign travel in cities), and/or different impacts of weather locally (e.g. fly populations causing direct transmission to food or indirect transmission through the contamination of chicken flocks especially in rural areas) [17, 18, 21, 30].
Wavelet analysis showed that Campylobacter cases could be interpreted as the super-imposition of three main harmonics of annual, semi-annual and 4-months periods. The origin of these harmonics is not clear; however, annual and higher order (1/2 year, 1/3 year, etc.) harmonics, whose relative importance depends on the geographic region, are typical features of a range of climatic variables. Potential other drivers could be other environmental variables, reporting bias, human behaviour, and/or intrinsic seasonal patterns in animal/vector reservoirs (i.e. the contamination of chickens needs to be investigated). In this type of study the quality of data is important. We used laboratory postcode as a surrogate for patient residence postcode to anonymise the data and improve data completeness in linking cases to weather parameters in 30 days before the specimen date. Djennad et al.  demonstrated that it is valid to use laboratory postcode as a proxy for patient postcode. The algorithm to locally link case and weather was limited to a five-year period, for three temperature parameters and one rainfall parameter and had a lag of 30 days. Another limitation is that the data linkage was in cases only, with the result that weeks without cases were not recorded.
Being able to link cases to an LSOA allows the CCI to be applied to compare incidence using a range of weather parameters. This method utilised the geographic variation in weather parameters by combining data from small areas and their populations that had the same weather conditions. The CCI is limited by the weather data only being linked to cases only and not to the underlying population. While the approach provides a useful way of examining weather variables that are de-coupled from seasonality by grouping cases per week and their underlying populations at a particular temperature or rainfall level together as a combined incidence, it is clear that some weeks and areas will be left out because there are no Campylobacter cases. It would be expected that a measurement of the true incidence would give a result that differed somewhat from the CCI because the true incidence would include all the weeks with negative results not just those with positive ones. An extended version of this that uses weather parameters for both cases and populations has the potential to provide a more robust representation of the incidence of Campylobacter at different temperatures through the year.
The CCI and GEST approaches gave good results in examining environmental drivers for Campylobacter. Adding other weather parameters to these models should further improve the predictive power of these models or methods. As an example, the development of standard weather database and algorithms through the MEDMI project (https://www.data-mashup.org.uk) facilitates data linkage of environment and health data such that different analyses may be examined in a more systematic way for a wider range of infectious diseases . The PHE Campylobacter dataset is sufficiently large, nevertheless the epidemiology of Campylobacter remains enigmatic and interventions have had limited success to date. It is therefore an important organism to study to elucidate methodologies that have power and utility in explaining infectious disease and climate change epidemiology for other infectious diseases in the future.
The study provides strong association between Campylobacter and temperature. Using a range of statistical methods, the study suggests that temperature and/or rainfall alone cannot explain the entire seasonal variation of Campylobacteriosis risk in England and Wales. Further research should investigate if the temporal dependency of the relationship between Campylobacter incidence and temperature on the week might be driven by other environmental variables, or perhaps by an intrinsic seasonality in the dynamics of the bacterial population in the environment or in the zoonotic reservoir or potential vectors such as flies.
We thank Dr. Robert Rigby, Professor Mikis Stasinopoulos, Professor Ben Armstrong and Dr. Shakoor Hajat for their insightful comments and feedback on statistical analysis and methodology.
The research was funded in part by the UK Medical Research Council (MRC) and UK Natural Environment Research Council (NERC) for the MEDMI Project; the National Institute for Health Research Health Protection Research Unit (NIHR HPRU) in Environmental Change and Health at the London School of Hygiene and Tropical Medicine in partnership with Public Health England, and in collaboration with the University of Exeter, University College London and the Met Office. It also included collaboration with the NIHR HPRU in Gastrointestinal Infection Programme. The funders had no role in the design of the study; collection, analysis, and interpretation of data; and in writing the manuscript. The views expressed are those of the author(s) and not necessarily those of the NERC, MRC, NHS, the NIHR, the Department of Health or Public Health England.
Availability of data and materials
The datasets on MEDMI are described at https://www.data-mashup.org.uk/data/data-library/ and include the SGSS infectious disease dataset. Permissions are required to access these datasets and users require an account to be set up as described at https://www.data-mashup.org.uk/data/accessing-data/.
All authors were involved in the conception and design of the study and interpretation of the results. AD and TB were responsible for the mathematical modelling including time series analysis. GL was responsible for wavelet analysis and interpretation of the results. CS was responsible for temperature and rainfall linkages with Campylobacter. CL was responsible for LSOA linkage from postcode. AD, IL and FCG were responsible for cluster analysis. RE was responsible for producing the incidence map. CH and GN were responsible for cumulative incidence at different temperatures and CCI method. IL, JS, SK were responsible for epidemiological input. LEF and AK were the project consultants and lead. GN was responsible for the conception, design of the article and statistical input. All authors revised the manuscript critically and contributed to the final draft. All authors read and approved the final manuscript.
Ethics approval and consent to participate
PHE has legal permission, provided by Regulation 3 of The Health Service (Control of Patient Information) Regulations 2002, to process patient confidential information for national surveillance of communicable diseases and as such, individual patient consent was not required.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 6.Nichols GL, Richardson JF, Sheppard SK, Lane C, Sarran C. Campylobacter epidemiology: a descriptive study reviewing 1 million cases in England and Wales between 1989 and 2011. BMJ Open. 2012;2:e001179. https://doi.org/10.1136/bmjopen-2012-001179.
- 27.Djennad A, Lo Iacono G, Sarran C, et al. A comparison of weather variables linked to infectious disease patterns using laboratory addresses and patient residence addresses. BMC Infect Dis. 2018;18(189):2–9.Google Scholar
- 28.Rösch A, Schmidbauer H. WaveletComp: Computational Wavelet Analysis. R package version 1.0, https://CRAN.R-project.org/package=WaveletComp. 2014.
- 29.Djennad A, Rigby R, Stasinopoulos D, Voudouris V, Eilers P. Beyond location and dispersion models: the generalized structural time series model with applications. Munich Pers REPIC Repos. 2015:1–33.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.