Background

Each year influenza causes a substantial burden of illness. Even in a non-pandemic year, influenza is estimated to cause between 250,000 and 500,000 deaths worldwide [1]. In some years, the burden can be much higher. Serological studies estimated that in the second wave of the 2009 pandemic, in the United Kingdom (UK) 49% of under five year olds, 59% of 5-14 year olds, 35% of 15-24 year olds, and 25% of 25-44 year olds were infected [2]. While many cases of influenza can be mild, serious complications can occur, particularly in the very young, very old, and individuals with pre-existing health conditions [1]. In order to help reduce the impact of influenza, it is important to understand who is most likely to be infected. Despite the high burden of influenza there is little quantitative information about risk factors that may inform potential preventive strategies.

The incidence of influenza is not generally recorded by public health surveillance, since it is impossible to conclusively know if an individual has influenza without virological confirmation, which is very rarely performed. As a result, most reports are of cases of influenza like illness, or ILI.

Traditional ILI monitoring relies on general practitioner (GP) and hospital reports. These methods, although useful, have the main limitation that they require individuals to seek health care [3]. Given that individuals with ILI who seek health care are not a random sample of cases, but are likely to differ in a number of characteristics (for example gender, socio-economic class, severity of illness), there is a high likelihood of selection bias. Additionally, during pandemics or other periods of high incidence, uncomplicated cases may be discouraged from presenting to health care, which would further distort the picture emerging from routine (health-care dependent) surveillance [4]. Finally it is difficult to link reports of ILI to behavioural and biological risk factors, since much of this information will be absent from GP records. As a result estimates of the incidence of influenza and of associated risk factors based on routine surveillance may be unreliable and unrepresentative.

An alternative is to measure the incidence and risk-factors via community cohort surveys. The internet allows these to be conducted at relatively low cost and in a way that is convenient for participants. Hence this method has become increasingly used across Europe [57] and elsewhere [8, 9], to estimate the incidence of ILI, and to understand the epidemiology of ILI and the effectiveness of control efforts [10]. Each week, participants are asked to record the presence or absence of symptoms, by which means both a numerator and denominator are obtained. Background information collected when participants register allows real-time estimates of regional incidence by age group and assessment of the role of other risk factors.

The UK web-based surveillance “Flusurvey” (https://flusurvey.org.uk/) was launched in July 2009 during the H1N1 pandemic [11], and has now run for five years. Flusurvey is linked to Influenzanet (https://www.influenzanet.eu/) a Europe-wide ILI monitoring system, currently running in nine countries. Here we present UK specific incidence estimates and risk factor analyses for ILI from the 2012-2013 season.

Methods

Participants include any resident of the UK recruited into Flusurvey between November 22nd 2012 and April 14th 2013. Flusurvey was approved by the London School of Hygiene & Tropical Medicine Ethics Committee (Application number 5530).

Structure of the survey

Upon initial registration, participants were asked a set of background questions about their age, gender, socio-economic status, household composition, geographical location, vaccine status, use of public transport, employment, and educational status. Participants were also asked questions about pre-existing health-conditions including use of medication for diabetes, asthma and other respiratory diseases, heart disease, kidney disorders, and immune-compromising conditions. Further details about the background questions are available upon request. In addition to being able to register themselves, participants were able to register on behalf of others (e.g. members of their household). Participants who had registered for Flusurvey in previous years were required to fill in their background information again to ensure it was up to date. During the course of the season we reminded participants in the weekly email to update their vaccination status if necessary

Each week a reminder email was sent asking participants to complete a symptoms survey, whether or not they had any symptoms. Participants were asked to indicate on a list of symptoms which, if any, they had experienced in the past week or since their last symptoms survey report. Participants who reported any symptoms were asked a set of follow-up questions, detailing onset date, suddenness of onset, and health seeking behaviours. The full list of symptoms is found in Table 1.

Table 1 List of symptoms in the survey

Demographics

Our sample was compared to the UK population to assess its representativeness. Background demographic data for the UK was taken from the office of National Statistics website [12]. Representativeness with respect to vaccination uptake was assessed using influenza vaccine uptake reports from the Public Health England (PHE) Website [13].

Case definitions

A diagnosis of influenza-like-illness (ILI) was made based on participants’ self-reported symptoms. Since there is no case definition of ILI upon which everyone agrees, and recognising the possible sensitivity of any results to the precise definition chosen, in this study we consider two different definitions of ILI. First, ILIECDC, using European Centre for Disease Prevention and Control (ECDC) definition which required: Sudden onset of symptoms; at least one of fever or chills, malaise, headache, muscle pain; at least one of cough, sore throat, shortness of breath. In the Additional file 1: Appendix we consider a stricter ILI definition which was the ECDC definition but additionally required a fever.

Data management

Participants received a reminder each week to complete their symptoms survey. However, not all participants responded on the day the reminder was sent, so completed surveys are not evenly spread. On less than six percent of occasions participants submitted multiple symptoms surveys on the same day. Multiple responses by individuals on the same day were treated as follows: if there was one symptom surveys that included a report of ILIECDC that one was kept, and if not, the most recent one was kept. If an individual reported symptoms in consecutive surveys, they were asked whether the symptoms belonged to the same episode of illness. If symptoms were reported to be from the same episode, they were only included as an incident case once. As it was believed that individuals were more likely to register for Flusurvey when they had an ILI, upon enrolment, the first symptoms survey submitted by each participant was dropped when estimating weekly incidence. For risk factor analyses, only individuals who had submitted at least two symptoms surveys were included in the sample.

Statistical analysis

Measure of disease frequency

Data were analysed using R-studio version 0.97.237 and R 2.15.2. Unadjusted incidence was calculated both as a participant’s risk of reporting ILI at least once in a season, as well as a risk per symptoms survey. The overall risk was calculated by total number of positive reports over total number of reports. Weekly ILIECDC incidence by age group was plotted alongside actual recorded percent positive confirmed influenza swabs collected by PHE’s Respiratory DataMart system [14].

Risk factor analysis

For risk factor analysis, we considered factors potentially related to the chance of an individual reporting ILI at least once during the season. Risk factors included geographical area, vaccine status, pre-existing health conditions, employment, use of public transportation, household makeup, exposure to children, gender and age group.

Data were analysed using a generalised linear model using a binomial regression; odds ratios (and 95% confidence intervals) were reported as measures of association.

The odds of reporting ILI at least once were adjusted by the number of reports an individual submitted; age group (under 18, 18-24, 25-34, 35-44, 45-64, 65+); gender; influenza vaccination status; region (13 NHS regions); employment status (studying, working full or part time, homemaker, unemployed); whether the participant was in contact with groups of 10 or more children (children defined as under the age of 18) in the course of a typical day; whether the participant lived with children; pet ownership; having pre-existing health conditions (asthma, diabetes, kidney disease, heart disease, immuno-compromising conditions); allergies; smoking (any number of cigarettes); regular use of public transport. Because some of the covariates such as smoking and interacting with a child may have differences in effect in participants under the age of 18, a sensitivity analysis was conducted removing children from the analysis.

Results

Demographics

5943 participants were recruited. Of those 1511 only completed the symptoms survey once, leaving 4532 participants that were included in the analysis. Participants’ characteristics are shown in Tables 2 and 3. Participants included in the analysis completed a median of nine symptoms surveys and a mean of 9.9 symptoms surveys.

Table 2 Characteristics of all participants, and participants with at least two reports included in the study
Table 3 Proportion of participants by region and risk factors

Females were over-represented in our sample (64% female). Children were under-represented (6.2% of our sample versus 21% of the UK population), and individuals aged 35-64 were over-represented. About ten percent of all participants (456) were registered by proxy however due to the anonymous nature of our survey we do not know about how many individuals participated from the same household, for this reason we did not cluster by household.

There were regional differences in the sample, with London being over-represented and Northern Ireland being the least well represented region in the survey (Table 3).

Participants had a higher rate of vaccination than the UK population (Table 4), suggesting that Flusurvey participants may have more awareness of influenza than the UK population in general. Table 5 shows number of reports submitted per week.

Table 4 Vaccination characteristics of participants with at least two reports compared to overall UK population
Table 5 Number of submitted reports by week

Unadjusted ILI incidence

46.1% (95% CI 44.6, 47.5) of individuals reported at least one episode of ILIECDC during the season, and 6.0% (95% CI 5.7, 6.2) of symptom reports were positive for ILIECDC. Incidence was higher in females, with 49.6% (95% CI 47.8, 51.5) of females reporting at least one ILIECDC compared to 39.7% (95% CI 37.3, 42.2) of males. 6.9% (95% CI 6.6, 7.2) of symptoms surveys submitted by females were positive for ILIECDC, compared to 4.6% (95% CI 4.3, 4.9) of surveys submitted by males. Participants over 65 had the lowest positive proportion of total reports: 33% (95% CI 29.1, 37.0) of individuals had at least one episode of ILIECDC and 2.9% (95% CI 2.5, 3.2) of all surveys were positive; participants between 45 and 65 had the highest proportion of having at least one report of ILIECDC (50.1%, 95% CI 47.8, 52.5); participants under the age of 18 had the highest proportion (7.9%, 95% CI 6.8, 9.1) of positive symptoms surveys (Table 6). Analysis of ILI as measured using the stricter ILIfever found similar patterns (see Additional file 1: Appendix 1).

Table 6 Overall and weekly incidence of ILI using ECDC and fever definitions

The highest incidence of ILI in any age group came the week ending December 23, in children under the age of 18. In the following week, which coincided with school closures for the Christmas holidays, there was a substantial fall in cases in under 18 year olds, but a small rise in incidence in the older age groups. With the re-opening of schools, there was a corresponding increase in cases of ILI in the youngest age group. The older age groups, particularly the 19-45 year olds, had a similar pattern to the under 18 year olds, but slightly delayed. This pattern was repeated on a smaller scale when the schools broke up for Easter holidays, with the older age groups having a slightly delayed peak. Figure 1 shows weekly incidence by age group, with shaded areas representing school holidays, although we note that there is some local variation in holiday dates (http://www.halftermdates.co.uk/). The bottom portion of the graph gives the percent positivity of laboratory confirmed Influenza A and Influenza B reported by PHE’s Respiratory DataMart system [14]. The increase of ILI just before the Christmas holidays corresponds to an increase of Influenza B recorded by PHE [14]. The peak in ILI incidence later in the season corresponds to an increase in cases of Influenza A. The first week of reporting is not included on Figure 1 because first week reporting may be inflated due to participants reporting illness over a potentially longer time period.

Figure 1
figure 1

Weekly incidence by age group. Highlighted areas show school holidays (taken from http://www.halftermdates.co.uk/ and may not be representative of entire population). The lower panel shows the proportion of samples testing positive for Influenza A and Influenza B. The Respiratory DataMart System is a laboratory surveillance tool to monitor influenza and other respiratory viruses based on collated lab results from a network of Public Health England (PHE) and NHS laboratories in England. Respiratory swabs from primary and secondary care are tested for a variety of viruses using real time polymerase chain reaction (RT-PCR) assays. Weekly results are published in the PHE National Influenza Report (http://www.hpa.org.uk/Topics/InfectiousDiseases/InfectionsAZ/SeasonalInfluenza/EpidemiologicalData/03influsweeklyreportpdfonly/) First week of reporting is not included as responses may include all cases of ILI since the previous season.

Risk factor analysis

Results of multivariate analysis for factors influencing the risk of reporting at least one episode of ILIECDC are found in Table 7. Working with groups of patients and the elderly was not found to have an effect on having ILIECDC. Our findings suggest that not being vaccinated against influenza was associated with the highest odds of getting ILIECDC at least once (OR 1.94, 95% CI 1.66, 2.27). Having an underlying health condition such as asthma or diabetes also substantially increased participant’s odds of reporting at least one ILIECDC (OR 1.53, 95% CI 1.27, 1.83). Other factors showing an association included being female (OR 1.52, 95% CI 1.33, 1.73) having daily contact with large groups of children (OR 1.47, 95% CI 1.22, 1.77), smoking (OR 1.32, 95% CI 1.07, 1.64), and being aged between 35 and 64. (35-44 OR 1.62, 95% CI 1.11, 2.08; 45-64 OR 1.55, 95% CI 1.14, 2.10).

Table 7 Risk factors of having at least one ILI using ECDC definition

People who took public transport were at no greater odds of reporting ILIECDC than people who did not (OR 0.91, 95% CI 0.78, 1.05). Likewise, there was no statistically significant difference in the odds of reporting ILIECDC by region. Multivariate results using the stricter ILIfever definition were similar, and can be found in Additional file 1: Appendix 2. Multivariate results removing children from the analysis were also similar and are found in Additional file 1: Appendix 3.

Discussion

Our results show that online ILI surveillance can be an effective tool for measuring ILI incidence, and that volunteers are willing to take part in ILI surveillance. The analyses shown in this paper highlight the advantages of community surveillance of ILI compared to traditional GP surveillance techniques. Six percent of all symptom reports were positive for ILI in the 2012-2013 flu season based on a sample of 4523 participants. When standardised by age to the UK population, this corresponds to an incidence of 6.3 per 100 person weeks. As in previous years, we showed that incidence was lowest in participants over 65. This is in part because individuals over 65 were more likely to be vaccinated. There were no significant regional differences. Our results show that people with underlying health conditions were more likely to report ILI, a similar result to that found using telephone surveillance in the United States [15].

In our study, females were more likely to report ILI. This pattern was still evident when adjusting for living with children, or for daily contact with groups of children (which would include teachers and nursery workers). This same pattern was observed in a study of healthy adults in Australia [16]. We speculate that females may be more likely to take care of symptomatic individuals, and therefore be exposed to illness or that the different influenza rates in females may be due to physiological differences [17].

The highest rates of ILI were in the youngest age category, in agreement with other community-based findings [18]. The decline in incidence is coincident with Christmas holidays, and the resurgence of cases in this age group in the new year, after schools had reopened, is similar to other data [7, 1921]. The observation that incidence in the 19-45 year olds lagged behind the youngest age group, suggests that under 18s may have brought the illness back to their families.

Our results also show that taking public transport does not increase your risk of reporting an ILI regardless of definition used. Analyses from other European countries and over different seasons confirm this finding [Van Noort S, Codeço C, Kopperschaar C, Van Ranst M, Gomes M: Influenzanet: ILI trends, behaviour and risk factors in cohorts of internet volunteers, submitted, [22], though others have found the opposite [23].

Our results suggest that influenza vaccination gives some protection against ILI. With any definition of ILI considered, not being vaccinated was the greatest predictor of reporting an ILI. This result is consistent with results from previous years [10]. Some ILI was still reported in participants who reported vaccination. This can in part be attributed to the fact that the vaccine is only effective against influenza and we are measuring ILI some of which will not be due to influenza. Restricting the analysis of our data to the largest peak of virologically confirmed influenza (9th of December through 30th of December), the OR for being unvaccianted increased to 2.14 (95% CI 1.68, 2.74).

Strengths and weaknesses of the study

The use of internet based surveillance has a number of limitations.

All of our data are based on self-reports of symptoms. As such we are unable to comment on influenza, only on ILI. However, most GP surveillance is also based on ILIs since most cases are not virologically confirmed. There is little information on the specificity and sensitivity of ILI during the influenza season. As shown by Figure 1, our measured incidence of ILI corresponded well to virological surveillance recorded by PHE. Ideally, weekly swabbing of Flusurvey participants would be used to assess the proportion of ILI cases that are confirmed influenza. Currently the costs of swabbing have not allowed this. However we would like to attempt this in the future.

Our sample does not represent a random sample; by nature of its design it is biased towards internet users, and people who are willing to fill in surveys. Additionally Flusurvey participants are more likely to be from London, be female, have risk factors and be vaccinated than the UK general population. Other methods such as telephone surveillance can be used to overcome these limitations [15, 24], however these surveillance techniques are more expensive, time absorbing, have representativeness problems of their own, and still have the problem of self-reports of ILI.

Our decision to exclude first reports in incidence estimates may have caused us to underestimate the overall amount of ILI in our sample. When including first reports, 7.8% (95% CI 7.6, 8.0) of total symptom reports were positive. When removing first reports 6.0% (95% CI 5.7, 6.2) were positive. However, the survey asks participants if they had had any symptoms since the last time they logged in, so participants’ first reports may reflect symptoms that occurred outside the current flu season.

Our risk factor analysis is restricted to the questions that are asked at the beginning of the survey. Our multivariate analysis suggests that being under the age of 18 is not a risk factor for reporting ILI. However, being in contact with groups of children is. Similarly, living with children was not seen to be a risk factor, but this is colinear with being a child. As a result, due to the wording of this question, it is difficult to understand the implications of these results. If we had asked the question in another way, for example, are you an adult who works with children, we may have been better able to separate the effects of working with children versus being a child. It is plausible that living with children is a risk factor, but the colinearity with being a child masks this relationship. Sensitivity analysis showed no substantial difference in results when children were removed from the analysis.

Flusurvey does have many strengths. In effect the survey consists of an online cohort. We were able to attribute illness directly (although anonymously) to individuals, thus understanding their individual risk factors. This is in comparison to other online influenza tracking sights such as Flu Near You (https://flunearyou.org/) which is run in the United States. They do not ask background questions about individual risk factors, and can therefore only report on influenza prevalence. Flutracking (https://www.flutracking.net/), based in Australia, only asks about gender, age, and working with patients. Only Flusurvey and the other members of Influenzanet ask in-depth questions about risk factors. The data are available in real-time allowing rapid examination of these risk factors and the effectiveness of control programmes. Finally, the size of the cohort can be expanded at very little additional cost.

Conclusion

Overall we found that 6.0 percent of all symptoms surveys were positive for ILI in the 2012-2013 flu season based on a sample of 4523 participants. Age standardised to the UK population this corresponds to an incidence of 6.3 per 100 person weeks. Females reported higher incidence of ILI than males, and the highest incidence was in under 18 year olds. This internet-based cohort has confirmed that failure to be vaccinated was the most important risk factor for ILI during the 2012-13 influenza season in the UK. It has also demonstrated that public transport use does not appear to be a risk-factor for ILI, whereas smoking and having a pre-existing health condition are. Such participatory surveillance systems have the ability to provide reliable information to policy-makers in close to real-time.