A maximum curvature method for estimating epidemic onset of seasonal influenza in Japan
Abstract
Background
Detecting the onset of influenza epidemic is important for epidemiological surveillance and for investigating the factors driving spatiotemporal transmission patterns. Most approaches define the epidemic onset based on thresholds, which use subjective criteria and are specific to individual surveillance systems.
Methods
We applied the empirical threshold method (ETM), together with two non-thresholding methods, including the maximum curvature method (MCM) that we proposed and the segmented regression method (SRM), to determine onsets of influenza epidemics in each prefecture of Japan, using sentinel surveillance data of influenza-like illness (ILI) from 2012/2013 through 2017/2018. Performance of the MCM and SRM was evaluated, in terms of epidemic onset, end, and duration, with those derived from the ETM using the nationwide epidemic onset indicator of 1.0 ILI case per sentinel per week.
Results
The MCM and SRM yielded complete estimates for each of Japan’s 47 prefectures. In contrast, ETM estimates for Kagoshima during 2012/2013 and for Okinawa during all six influenza seasons, except 2013/2014, were invalid. The MCM showed better agreement in all estimates with the ETM than the SRM (R^{2} = 0.82, p < 0.001 vs. R^{2} = 0.34, p < 0.001 for epidemic onset; R^{2} = 0.18, p < 0.001 vs. R^{2} = 0.05, p < 0.001 for epidemic end; R^{2} = 0.28, p < 0.001 vs. R^{2} < 0.01, p = 0.35 for epidemic duration). Prefecture-specific thresholds for epidemic onset and end were established using the MCM.
Conclusions
The Japanese national epidemic onset threshold is not applicable to all prefectures, particularly Okinawa. The MCM could be used to establish prefecture-specific epidemic thresholds that faithfully characterize influenza activity, serving as useful complements to the influenza surveillance system in Japan.
Keywords
Japan Influenza surveillance Epidemic threshold Non-thresholding method Segmented regression Maximum curvature method MCMAbbreviations
- C/S/W
ILI case(s) per sentinel per week
- ETM
empirical threshold method
- IDWR
Infectious Disease Weekly Report
- ILI
influenza-like illness
- MCM
maximum curvature method
- MEM
moving epidemic method
- MLRM
moving logistic regression method
- SMI
sentinel medical institution
- SRM
segmented regression method
Background
Influenza is a common respiratory infectious disease that imposes significant morbidity and mortality impact on public health [1]. Every year, seasonal influenza epidemics are estimated to cause about 3 to 5 million cases of severe illness and up to 650,000 deaths globally [2], placing a substantial burden on health services. To curb these epidemics, the beginning of major influenza activity in each season must be declared. A timely alert of the onset of seasonal influenza epidemic could allow health communities to activate appropriate influenza response plans and prepare for a subsequent dramatic increase in incidence and utilization of health services [3]. In temperate regions such as Japan, seasonal influenza epidemics are expected to occur during winter [4, 5]; however, the exact onset, duration, and severity of these epidemics are not known because of annual differences in the circulating virus strains, population immunity, human mobility, as well as environmental and other factors [6, 7, 8]. Therefore, an intuitive and reliable method for estimating epidemic onset is of great interest to public health decision makers because it can help public health agencies to timely respond to the upcoming epidemic peak.
The epidemic onset is technically defined as the time when the incidence exceeds the epidemic threshold [9]. Hence, the algorithm behind the calculation of the epidemic threshold becomes the key to detecting epidemic onset. Without a consensus for calculating epidemic thresholds, a range of approaches with varying complexity have been proposed [6, 8, 10]. The simplest but the most subjective option is to empirically specify a fixed threshold for the epidemic by visual inspection of observations [6, 11, 12, 13, 14]. A slightly more quantitative manner of determining a fixed epidemic threshold is to use simple statistics, e.g., mean or median [15, 16, 17, 18, 19]. One class of widely used methods for obtaining time-varying epidemic thresholds stem from the periodic regression model proposed by Serfling in 1963 [20]. A variety of Serfling-like regression models have since been developed to detect the onset [15, 21, 22, 23] and peak timing [24] of influenza epidemics, and to characterize the seasonal patterns of influenza [25, 26, 27]. The Serfling regression model fits the non-epidemic data from previous years and predicts a baseline curve, above which a certain increase is considered the epidemic threshold. However, these Serfling-type approaches have several drawbacks. Firstly, epidemic and non-epidemic periods are required to be predefined based on subjective criteria [28], such as manual removal of epidemic peaks, the proportion of influenza-like illness (ILI) patients among all outpatients (ILI proportion), the proportion of laboratory specimens from ILI patients testing positive for influenza (positive proportion), and so on. The precise determination of epidemic and non-epidemic periods is actually the onset that we would like to estimate. Secondly, the baseline curve is estimated relying on long-term (usually the 5 or more previous years) historical data [13]. Finally, the quantities added to the baseline are varied and not standardized [15, 22].
Several studies have attempted to define epidemic thresholds, taking into account properties of the epidemic curve, e.g., the rate of increase in the number of cases. Nobre and Stroup [29] detected the epidemic onset using the exponential smoothing technique and properties of numerical derivatives of the epidemic curve. This method does not require long-term historical data and can be applied to surveillance series of less than a year; however, prequisites include that the chosen polynomial model must fit the data well, and exploratory analysis is required to choose the parameters of the exponential smoothing model. The World Health Organization (WHO) Regional Office for Europe and the European Center for Disease Prevention and Control have implemented the moving epidemic method (MEM) to determine the baseline influenza activity and epidemic thresholds for influenza surveillance in Europe [8]. The MEM calculates the epidemic start and end after the optimum epidemic duration is firstly found with the slope of the maximum accumulated rates percentage curve less than a predefined criterion δ. Although the MEM can be used for analyzing a single influenza season with as few as 33 weeks of observations, the determination of δ is difficult as it is country-specific. Recently Cheng et al. [30] developed a moving logistic regression method (MLRM) to determine the thresholds of seasonal influenza epidemics across 30 provinces in mainland China. The MLRM approximates the cumulative epidemic curve by a logistic regression model. Following the MEM, the MLRM chooses the optimum epidemic duration with a slight change of R^{2} < 0.01. However, the application of MLRM is limited to symmetric epidemic waves and is not appropriate to asymmetric or bimodal epidemic waves.
While the predominant approaches to detecting epidemic onset are based on thresholds, a few non-thresholding methods have been proposed for estimating epidemic onset. To study the spatiotemporal transmission patterns of influenza, Charu et al. [31] and Geoghegan et al. [7] determined the onset time of epidemics using the segmented regression model (SRM). They fitted a segmented regression model to the first half of the epidemic curve (i.e., the weekly time series of ILI before the peak), where the breakpoint quantifies an abrupt change in incidence and its timing corresponds to the epidemic onset. The SRM does not rely on any threshold and can be applied to a single influenza season without requirements for historical data because it defines epidemic onset totally based on the properties of the epidemic curve.
Charu et al. [31] also demonstrated excellent agreement between influenza epidemic onset estimates derived by the SRM and the Serfling regression model in the United States (US). However, the consistency between epidemic onsets estimated by the SRM and other threshold-based methods using other influenza surveillance systems remains unknown. The lack of reliable information on epidemic onset observations limits the execution of such evaluations. Since 2000, the national epidemic threshold for sentinel surveillance of ILI in Japan has been empirically defined as 1.0 ILI case per sentinel per week (C/S/W) [32, 33]. This epidemic threshold successfully captures a unique feature of the epidemic curve, which means that once the threshold is exceeded, the weekly number of ILI cases increases rapidly and consistently until peaking [34]. Hence, those onsets derived by this empirical threshold method (ETM) for influenza epidemics in Japan can be used as a reference standard for assessing other approaches to estimating epidemic onsets.
The thresholds for the onset and end of influenza epidemic are supposed to vary across Japanese prefectures [35]. Yet, no appropriate epidemic threshold exists for each prefecture. We propose a novel statistical method, the maximum curvature method (MCM), to determine prefecture-specific onsets of influenza epidemics in Japan. This method is based on the maximum curvature of the epidemic curve, which makes the best use of the epidemic curve’s unique feature and retains the advantages of non-thresholding methods for estimating epidemic onset. As we focus on the non-thresholding methods, in this study, epidemic onset estimates derived by both the MCM and SRM are evaluated in comparison with the reference epidemic onsets obtained by the ETM with a fixed value of 1.0 C/S/W. Finally, prefecture-specific thresholds for epidemic onset and end are established using the MCM.
Methods
Study area and ILI surveillance data
Japan is a bow-shaped strip of islands, stretching from 24°N to 46°N for approximately 2400 km. At its widest point, Japan is no more than 230 km across. Japan is divided into 47 prefectures for local administration. Hokkaido is the northernmost prefecture; Okinawa is the southernmost prefecture. Most regions of Japan lie in the temperate zone with humid subtropical climate. However, Japan’s climate varies from a cool humid continental climate in the north, such as in northern Hokkaido, to a warm tropical rainforest climate in the south, such as in Ishigaki, Okinawa.
Influenza (excluding avian influenza and pandemic influenza, e.g. novel influenza or re-emerging influenza) is subject to sentinel surveillance under the National Epidemiological Surveillance for Infectious Disease in Japan. The number of patients diagnosed with ILI is reported from approximately 5000 sentinel medical institutions (SMIs) (3000 for pediatrics and 2000 for internal medicine) across Japan on a weekly basis (ISO 8601 week date system according to the Weeks Ending Log [36]). The criteria for reporting ILI used by SMIs have been previously described elsewhere [37]. The data are aggregated at the National Institute of Infectious Diseases into weekly total number of cases and weekly average number of cases per sentinel for both the national and prefectural levels [37]. The surveillance data tables are published on the website of the Infectious Disease Weekly Report (IDWR) [38] every Tuesday. A detailed description of infectious diseases surveillance system in Japan has been made available [39].
In our study, an influenza season was defined to range anywhere from week 35 in September of each year up to week 34 in August of the following year. We downloaded IDWR surveillance data tables from week 35 of 2012 to week 34 of 2018 (from 2012-09-02 to 2018-08-26 in terms of week ending date). Our study period covered six influenza seasons from 2012/2013 through 2017/2018 (Additional file 1: Fig. S1). Only the weekly number of ILI cases per sentinel was used in the following estimation of epidemic onsets, so as to be compatible with the empirical epidemic threshold.
Methods for estimating epidemic onset
We estimated the onset time of influenza epidemics in each prefecture for each of the six influenza seasons from 2012/2013 to 2017/2018 using three methods: the ETM, SRM, and MCM. The epidemic end is equivalent to the epidemic onset in reverse chronological order. The duration of an epidemic is defined as the period from its onset time to its ending time. Therefore, we focused on describing the algorithm for estimating epidemic onset.
The empirical threshold method (ETM)
The ETM defines an epidemic as occurring when the weekly number of ILI cases per sentinel has been reported to exceed a prespecified threshold Y_{0} for three consecutive weeks [40]. The first week of the three consecutive weeks corresponds to the epidemic onset. We used the criterion Y_{0} = 1.0 C/S/W, which is the threshold for the nationwide onset of an influenza epidemic in Japan. This threshold was empirically defined in the year 2000 based on more than 10 years of observations from sentinel surveillance of influenza in Japan [34]. The details of implementing the ETM are described in the Additional file 1: Text S1 and Fig. S2.
The segmented regression method (SRM)
Different from the above threshold-based method, the SRM fits piecewise linear models to determine the breakpoint in the first half of the epidemic curve, which corresponds to the epidemic onset. In other words, the breakpoint is the optimal knot location with the maximal difference-in-slope between the two fitted straight lines (Additional file 1: Figure S3). To find the optimal breakpoint, the log-likelihood function for the breakpoint is maximized. Further details of using the SRM to determine epidemic onset refer to [7, 31]. We implemented the SRM using the R package segmented [41], and the procedure is summarized in the Additional file 1: Text S2. An illustration of the SRM is shown in Additional file 1: Figure S3.
The maximum curvature method (MCM)
Given the unique feature of the epidemic curve in Japan, it may be more appropriate to identify the epidemic onset in terms of curvature. Therefore, we developed the MCM to detect epidemic onset and end. Inspired by the SRM definition of epidemic onset as the point of maximum change in the slope, the MCM defines epidemic onset as the point of maximum curvature located within the increasing phase of the epidemic curve. Likewise, epidemic end is defined as the point of maximum curvature located within the decreasing phase of the epidemic curve. To reduce the effect of small fluctuations in the epidemic curve, instead of directly calculating the osculating circle at each point on the curve, the MCM fits a least-squares circle to the n points around it. n ≥ 3 because three points are required to determine a circle and n is odd for the sake of symmetry. The curvature of the fitted circle only measures how fast the epidemic curve is changing direction at a given point. We further used the directional angle of the tangent vector at the given point to indicate its changing direction. In the first half of the epidemic curve, the point with maximum curvature and a directional angle between [0°, 90°] is defined as the epidemic onset; in the second half, the point with maximum curvature and a directional angle between [270°, 360°] is determined as the epidemic end. Any possible points that occur above an upper threshold, h C/S/W, are eliminated, because they are already in an epidemic state.
Let {y_{t}, t = 1, 2, … , T} denote the weekly epidemic curve of an influenza season with T weeks, where y_{t} is the number of ILI cases per sentinel reported at week t, which is referred to as intensity hereafter, for the sake of simplicity. The steps for using the MCM to detect epidemic onset and end are as follows.
Step 1. At a given point K (t, y_{t})(t = 1, 2, … , T), a circle with center \( O\ \left({t}_{\mathrm{c}},{y}_{t_{\mathrm{c}}}\right) \) and radius r is determined by least-squares fitting to n points \( \left(t-\frac{n-1}{2},{y}_{t-\frac{n-1}{2}}\right),\dots, \left(t+\frac{n-1}{2},{y}_{t+\frac{n-1}{2}}\right) \) surrounding K, using the algorithm proposed by Pratt [42]. When K is at the edge of the epidemic curve (\( t=1,\dots, \frac{n-1}{2}\ \mathrm{or}\ t=T-\frac{n-3}{2},\dots, T \)), the first (or last) two points of the epidemic curve are linearly extrapolated to pad the curve with \( \frac{n-1}{2} \) extra points. The raw curvature C_{t} at K is the reciprocal of the radius r.
Step 2. The tangent point \( P\ \left(\widehat{t},\widehat{y_t}\right) \) closest to K, is determined by intersecting the line OK with the fitted circle. The directional angle θ_{t} (in degrees) of the tangent vector \( \overrightarrow{PQ} \) is then calculated.
Step 4. Find the points with the maximum filtered curvature \( {t}_o=\underset{t=1,\dots, {t}_p}{\arg \max}\left\{{C}_t^{\prime}\right\} \) and \( {t}_e=\underset{t={t}_p,\dots, T}{\arg \max}\left\{{C}_t^{\prime}\right\} \) for each half of the epidemic curve.
Step 5. The coordinates of the tangent point at \( \left(\widehat{t_o},\widehat{y_{t_o}}\right) \) correspond to the epidemic onset and the epidemic onset intensity. Likewise, the coordinates of the tangent point at \( \left(\widehat{t_e},\widehat{y_{t_e}}\right) \) correspond to the epidemic end and the epidemic ending intensity.
Comparison of epidemic characteristic parameters derived by different methods
For each season, epidemic characteristic parameters including epidemic onset, end, duration, and intensities at epidemic onset and end were estimated using the above ETM, SRM, and MCM, nationally and for each prefecture. The threshold for the nationwide onset of an influenza epidemic in Japan has been empirically defined as 1.0 C/S/W since 2000 [34]. However, the prefecture-specific thresholds for epidemic onsets have yet to be determined. We presumed that the epidemic onset thresholds at prefecture level would be similar to the national threshold and thus specified Y_{0} to be 1.0 C/S/W when using the ETM to estimate epidemic characteristic parameters for each prefecture. Owing to the continued success of the nationwide epidemic onset indicator in Japan, estimates of the ETM using this indicator were used as the reference standard, against which epidemic characteristic parameter estimates using the other two methods were compared. A sensitivity analysis varying n (3, 5, and 7) and h (4.0, 6.0, 8.0, and 10.0) was performed to examine the MCM’s robustness. For each combination of n and h, epidemic characteristic parameters estimated by the MCM were also compared with those from the ETM.
Establishment of prefecture-specific thresholds for epidemic onset and end
With the epidemic characteristic parameters estimated by the MCM (n = 5, h = 5.0) in hand, the prefecture-specific thresholds for epidemic onset were calculated by averaging the epidemic onset intensities over the six available seasons, 2012/2013 to 2017/2018. The prefecture-specific epidemic ending thresholds were also calculated using the same procedure.
All methods and analyses were implemented in R 3.4.2 [43]. The datasets and codes are available under MIT license at the GitHub repository [44].
Results
Descriptive statistics of epidemic characteristic parameter estimates
Summary statistics of epidemic characteristic parameters estimated by the ETM, SRM, and MCM from 2012/2013 to 2017/2018
Parameters | Methods | 2012/2013 | 2013/2014 | 2014/2015 | 2015/2016 | 2016/2017 | 2017/2018 | Mean |
---|---|---|---|---|---|---|---|---|
Onset^{a} (weeks) | ETM | 16.4 | 17.0 | 13.9 | 19.3 | 12.2 | 12.6 | 15.2 |
SRM | 18.9 | 19.2 | 15.8 | 20.6 | 17.9 | 17.1 | 18.2 | |
MCM | 16.0 | 17.0 | 13.2 | 18.5 | 12.7 | 12.4 | 15.0 | |
End (weeks) | ETM | 38.7 | 37.7 | 36.5 | 37.6 | 37.7 | 34.3 | 37.1 |
SRM | 29.4 | 34.6 | 26.5 | 34.4 | 30.1 | 29.4 | 30.7 | |
MCM | 34.3 | 37.5 | 32.4 | 38.1 | 36.3 | 34.1 | 35.5 | |
Duration (weeks) | ETM | 23.0 | 21.8 | 23.5 | 19.1 | 26.3 | 22.6 | 22.7 |
SRM | 11.5 | 16.3 | 11.7 | 14.8 | 13.3 | 13.3 | 13.5 | |
MCM | 19.3 | 21.6 | 20.2 | 20.6 | 24.6 | 22.7 | 21.5 | |
Onset intensity^{b} | ETM | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
SRM | 4.25 | 3.90 | 5.08 | 3.30 | 7.95 | 9.87 | 5.72 | |
MCM | 0.70 | 0.87 | 0.50 | 0.61 | 1.13 | 0.84 | 0.78 | |
Ending intensity | ETM | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
SRM | 7.39 | 4.04 | 8.74 | 5.13 | 8.05 | 8.06 | 6.90 | |
MCM | 1.99 | 1.07 | 1.96 | 0.66 | 1.67 | 1.01 | 1.40 | |
Dominant subtype^{c} | A(H3) | A(H1N1)pdm09 | A(H3) | A(H1N1)pdm09 | A(H3) | B/Yamagata | – |
Prefectures with invalid epidemic characteristic parameters estimated by the ETM
Season | Onset | End | Duration | Onset intensity | Ending intensity |
---|---|---|---|---|---|
2012/2013 | Kagoshima Okinawa | Kagoshima Okinawa | Kagoshima Okinawa | ||
2013/2014 | |||||
2014/2015 | Okinawa | Okinawa | Okinawa | ||
2015/2016 | Okinawa | Okinawa | Okinawa | ||
2016/2017 | Okinawa | Okinawa | Okinawa | ||
2017/2018 | Okinawa | Okinawa | Okinawa | Okinawa | Okinawa |
Agreement between the SRM, MCM and ETM on epidemic onset, end, and duration estimates
Agreement of epidemic characteristic parameters determined by the ETM and MCM using different n and h
Parameters | Onset | End | Duration | ||||||
---|---|---|---|---|---|---|---|---|---|
n = 3 | n = 5 | n = 7 | n = 3 | n = 5 | n = 7 | n = 3 | n = 5 | n = 7 | |
h = 4.0 | 0.43^{a} | 0.81 | 0.82 | 0.20 | 0.19 | 0.14 | 0.17 | 0.29 | 0.27 |
h = 6.0 | 0.41 | 0.81 | 0.80 | 0.13 | 0.16 | 0.13 | 0.10 | 0.24 | 0.24 |
h = 8.0 | 0.41 | 0.81 | 0.80 | 0.11 | 0.10 | 0.08 | 0.09 | 0.18 | 0.18 |
h = 10.0 | 0.35 | 0.71 | 0.68 | 0.10 | 0.11 | 0.08 | 0.07 | 0.14 | 0.15 |
Prefecture-specific epidemic onset and ending thresholds
Discussion
In this study, three methods including the ETM, SRM, and MCM, were used to estimate epidemic characteristic parameters for each of the 47 prefectures in Japan during each of the six influenza seasons from 2012/2013 to 2017/2018. Among them, the ETM is a thresholding method to detect epidemic onset based on the nationwide epidemic onset threshold of 1.0 C/S/W. The SRM is an existing non-thresholding method for capturing the breakpoint of the epidemic curve as the epidemic onset. The MCM is also a non-thresholding method that we proposed to detect epidemic onset based on the maximum curvature of the epidemic curve. Proper evaluations of methods for detecting epidemic onset are often impaired because of a lack of suitable datasets with reliable information on the occurrence of epidemics [29]. To address this issue, in the present study, estimates from the ETM were used as reference standards to evaluate the performance of the other two methods.
The incompleteness of ETM estimates suggests that the empirical epidemic threshold is not appropriate for the levels of influenza activity observed in prefectures located at or near the southernmost part of Japan, such as Okinawa and Kagoshima (Table 2). The severe lack of valid ETM estimates in Okinawa resulted from a level of background influenza activity that was higher than the empirical epidemic threshold of 1.0 C/S/W. It has been recognized that background influenza activity is high throughout the year in tropical regions [51]. Hence, the influenza seasonality is less defined in Okinawa, where the lowest influenza activity usually occurs later than in other, more northern prefectures (Additional file 1: Figure S5). By contrast, the epidemic onset and ending thresholds (1.9 and 2.6 C/S/W) for Okinawa established using the proposed MCM were the largest, and much higher than those of other prefectures and the empirical epidemic threshold of 1.0 C/S/W (Fig. 4), faithfully reflecting the characteristics of influenza epidemics in Okinawa.
The epidemic curves in all prefectures were asymmetrical because when approaching the epidemic end, the second half of the epidemic curve was relatively gentle compared with the first half, as demonstrated in the 2014/2015 season (Additional file 1: Figure S5). This asymmetry of the epidemic curve not only explains why better agreement with the ETM was achieved for epidemic onset than for epidemic end, regardless of the method used, but also suggests that thresholds for epidemic onset and end are likely to be different and should be established individually. The high consistency between the MCM and ETM guarantees the continuity of using epidemic thresholds derived by the MCM in the Japanese sentinel surveillance system for influenza. Although the prefecture-specific thresholds for epidemic onset and end were established using the only six available influenza seasons, these thresholds can be further refined as more data become available in the future. In addition to the mean statistic used in the present study, other procedures for calculating the thresholds [8] are worth exploring.
The IQRs of the epidemic ending intensities derived by the MCM during 2012/2013, 2014/2015, and 2016/2017 were wider than those during the other three seasons (Additional file 1: Figure S4). This may be explained by the severity of epidemics. In Japan, the 2012/2013, 2014/2015, and 2016/2017 influenza seasons were characterized by the predominance of the A(H3) subtype whereas the dominant virus subtypes in the other three seasons were A(H1N1)pdm09 and B/Yamagata. Seasonal influenza epidemics dominated by A(H3N2) subtype are generally more severe than those dominated by A(H1N1) and B [52], which may affect the shape of the epidemic curve. Therefore, establishment of epidemic thresholds, particularly the epidemic ending thresholds, could incorporate information on the dominant influenza virus subtype.
The proposed MCM has several properties that make it broadly applicable for estimating epidemic onset in public health surveillance. First, the MCM is intuitive as it defines epidemic onset by capturing the local point with maximum curvature. The MCM is a non-thresholding approach to determining epidemic onset that is based entirely on the shape of the epidemic curve. During implementation of the MCM, an upper threshold h is prespecified to limit the search scope for points. However, the sensitivity analysis suggests that the MCM is robust to h for a wide range (Table 3). Therefore, this threshold is not required to be as precise as Y_{0} in the ETM, and is easy to be set. Moreover, it also provides the flexibility to adjust the search scope for points according to the background levels of influenza activity. These properties together with the success of Okinawa give the MCM the potential to estimate epidemic characteristic parameters in the subtropics and tropics where various respiratory pathogens that can cause acute respiratory illness, such as respiratory syncytial virus, parainfluenza virus etc., circulate year round [18]. Consequently, the patterns of influenza in subtropical and tropical regions are complex with year-round high background rate of acute respiratory illness [51] and lack of apparent ILI seasonality [18]. The recent experience of establishing influenza epidemic thresholds in Cambodia using the WHO method [19] suggests that unlike in temperate regions, the ILI syndromic surveillance data was less useful for setting thresholds [18]. Therefore, priority to virological surveillance data, such as the positive proportion [30], the product of the ILI proportion and the positive proportion, should be given when applying the MCM to establish thresholds for influenza epidemics in subtropical and tropical regions.
Second, in contrast to the widely used Serfling-like regression models requiring long series of historical data to estimate model parameters [13, 20, 22, 26], parameters of the MCM are prespecified. This means the MCM can be applied in areas with limited historical data and in analyzing influenza pandemics that usually last for a single season. Epidemic onsets determined using empirical thresholds [12], Serfling-type regression model [21], and the SRM [7, 31] have been used to investigate spatial transmission of both influenza pandemics and epidemics. New insights into the spatial transmission of influenza may be gained using the MCM as it defines epidemic onset totally based on the properties of the epidemic curve.
Third, although the calculation in the MCM is more complex than that in the SRM, the estimates derived using our novel MCM were in much better agreement with those derived using the ETM. The high consistency between epidemic onsets derived by the ETM and MCM implies that curve properties, such as the curvature, may have been taken into consideration during the determination of the national epidemic onset indicator in Japan. A comparison conducted by Charu et al. [31] showed excellent agreement between estimates of influenza epidemic onset in the US derived by the SRM and Serfling-like regression method, which in essence determines epidemic onset based on thresholds. In constrast, the agreement between the ETM and SRM was poor in Japan. This may be linked to the differences in sentinel surveillance systems for influenza in the US and Japan.
Finally, the MCM is robust not only to model parameters n and h but also to the partitioning of the influenza seasons and the determination of the epidemic peak. Regarding the estimation of epidemic onset, the MCM calculates the curvature at each point by fitting a least-square circle using only n points around the current one. While searching for the local point of maximum curvature, the MCM also takes into account the changing direction of the curvature at each point, which ensures that only points in the ascending phase of the epidemic curve are targeted. In contrast, the SRM fits two broken lines, using all points in the first half of the epidemic curve. Therefore, when the influenza season begins and ends could have an impact on the epidemic onset estimate. In the present study, it was appropriate to define the start of each influenza season as week 35 with the exception of Okinawa during 2012/2013, 2014/2015, and 2016/2017 (Additional file 1: Figure S3 and S5). For example, during 2012/2013 in Okinawa, the influenza season should have been defined to start around week 44. The first broken line fitted by the SRM included approximately the last 10 weeks of the previous influenza season, which resulted in a biased epidemic onset estimate toward earlier weeks. In this case, the curvatures for these weeks is filtered out by the MCM as their directional angles were not between [0°, 90°] (Fig. 2C and D). Furthermore, taking the direction of curvature into consideration may enable the MCM to overcome the constraint of the MLRM [30] and to be applicable to multiple epidemic waves of influenza observed in subtropical and tropical regions, such as southern China [25]. In addition, the SRM is more sensitive to the determination of the epidemic peak timing than the MCM. However, epidemic peaks may suffer from large fluctuations, such as the sharp decrease in ILI activity during the National Day Holiday in the 2009 pandemic in China [53]. Under such circumstances, the SRM will result in a large bias in the epidemic onset estimates.
There are several limitations to the proposed MCM that deserve consideration. First, the MCM can only be used in retrospective analysis of epidemics because data from later weeks are required for fitting the least-square circles. Second, the MCM implicitly relies on the smoothness of the epidemic curve. For epidemic curves with small fluctuations, we can address this limitation by increasing the number of points (e.g., n = 7) used for fitting least-square circles. For irregular epidemic curves with large and frequent fluctuations, techniques such as Savitzky-Golay filtering [54], among others, may be used to smooth the epidemic curve before applying the MCM. Finally, in comparison with the SRM, the MCM cannot provide confidence intervals for epidemic onset estimates, which limits the ability of the MCM to take uncertainties into account.
Conclusions
In conclusion, our findings indicate that the nationwide epidemic onset threshold of 1.0 C/S/W currently used in the sentinel system for influenza surveillance in Japan should be adjusted for each prefecture, especially for Okinawa. The proposed MCM shows better agreement with the ETM than the SRM and performs very well in the context of Japanese influenza surveillance. The prefecture-specific thresholds for epidemic onset and end established using the MCM could serve as useful complements to the influenza surveillance system in Japan. Further research should be undertaken to evaluate the applicability of the MCM in different public health surveillance systems or in tropical and subtropical zones, and in detecting the onset of influenza pandemics.
Notes
Acknowledgments
JC is sincerely grateful to Cecile Viboud from the Fogarty International Center, National Institutes of Health, USA, for her support during the visit of JC as a predoctoral fellow.
Funding
This work was supported by the National Research Program of the Ministry of Science and Technology of China (2016YFA0600104), donations from Delos Living LLC and the Cyrus Tang Foundation to Tsinghua University, the National Natural Science Foundation of China (81673234), the Beijing Natural Science Foundation (JQ18025), and the Young Elite Scientist Sponsorship Program by CAST(YESS) (2018QNRC001). The funders had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Availability of data and materials
The datasets and R codes for reproducing the methods and analyses used in the present study are available under MIT license at the GitHub repository, https://github.com/caijun/MCM.
Authors’ contributions
JC and Bi.X conceived and designed the study. JC and BZ collected the data. JC, BZ, BoX, HT and Bi.X analyzed the data and interpreted the results. JC wrote the first draft of the manuscript. BZ, BoX, KC, GC and HT revised the manuscript and contributed important intellectual content. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable [44].
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary material
References
- 1.World Health Organization. Influenza (seasonal) fact sheet [Internet]. Geneva. 2018. Available from: http://www.who.int/mediacentre/factsheets/fs211/en/. Accessed 21 March 2018.
- 2.Iuliano AD, Roguski KM, Chang HH, Muscatello DJ, Palekar R, Tempia S, et al. Estimates of global seasonal influenza-associated respiratory mortality: a modelling study. Lancet. 2018;391(10127):1285–300.PubMedGoogle Scholar
- 3.Won M, Marques-Pita M, Louro C, Gonçalves-Sá J. Early and real-time detection of seasonal influenza onset. PLoS Comput Biol. 2017;13(2):e1005330.PubMedPubMedCentralGoogle Scholar
- 4.Lipsitch M, Viboud C. Influenza seasonality: lifting the fog. Proc Natl Acad Sci. 2009;106(10):3645–6.PubMedGoogle Scholar
- 5.Tamerius J, Nelson MI, Zhou SZ, Viboud C, Miller MA, Alonso WJ. Global influenza seasonality: reconciling patterns across temperate and tropical regions. Environ Health Perspect. 2011;119(4):439.PubMedGoogle Scholar
- 6.Tay EL, Grant K, Kirk M, Mounts A, Kelly H. Exploring a proposed WHO method to determine thresholds for seasonal influenza surveillance. PLoS One. 2013;8(10):e77244.PubMedPubMedCentralGoogle Scholar
- 7.Geoghegan JL, Saavedra AF, Duchêne S, Sullivan S, Barr I, Holmes EC. Continental synchronicity of human influenza virus epidemics despite climactic variation. PLoS Pathog. 2018;14(1):e1006780.PubMedPubMedCentralGoogle Scholar
- 8.Vega T, Lozano Jose E, Meerhoff T, Snacken R, Mott J, Ortiz de Lejarazu R, et al. Influenza surveillance in Europe: establishing epidemic thresholds by the moving epidemic method. Influenza Other Respir Viruses. 2012;7(4):546–58.PubMedPubMedCentralGoogle Scholar
- 9.Centers for Disease Control and Prevention. Principles of epidemiology in public health practice: an introduction to applied epidemiology and biostatistics. Atlanta, GA: US Dept. of health and human services, Centers for Disease Control and Prevention (CDC), Office of Workforce and Career Development; 2012.Google Scholar
- 10.Unkel S, Farrington CP, Garthwaite Paul H, Robertson C, Andrews N. Statistical methods for the prospective detection of infectious disease outbreaks: a review. Journal of the Royal Statistical Society: Series A (Statistics in Society). 2011;175(1):49–82.Google Scholar
- 11.Watts CG, Andrews RM, Druce JD, Kelly HA. Establishing thresholds for influenza surveillance in Victoria. Aust N Z J Public Health. 2007;27(4):409–12.Google Scholar
- 12.Eggo RM, Cauchemez S, Ferguson NM. Spatial dynamics of the 1918 influenza pandemic in England. Wales and the United States Journal of the Royal Society Interface. 2010.Google Scholar
- 13.Cowling BJ, Wong IOL, Ho L-M, Riley S, Leung GM. Methods for monitoring influenza surveillance data. Int J Epidemiol. 2006;35(5):1314–21.PubMedGoogle Scholar
- 14.Yang P, Duan W, Lv M, Shi W, Peng X, Wang X, et al. Review of an influenza surveillance system, Beijing, People's Republic of China. Emerging Infectious Disease. 2009;15(10):1603.Google Scholar
- 15.Centers for Disease Control and Prevention, National Center for Immunization and Respiratory Diseases (NCIRD). Overview of influenza surveillance in the United States [Internet]. 2017. Available from: https://www.cdc.gov/flu/weekly/overview.htm . Accessed 2 August 2018.
- 16.Baumeister E, Duque J, Varela T, Palekar R, Couto P, Savy V, et al. Timing of respiratory syncytial virus and influenza epidemic activity in five regions of Argentina, 2007-2016. Influenza Other Respir Viruses. 2018;0(0):1–8.Google Scholar
- 17.Azziz Baumgartner E, Dao CN, Nasreen S, Bhuiyan MU, Mah-E-Muneer S, Mamun AA, et al. Seasonality, timing, and climate drivers of influenza activity worldwide. J Infect Dis. 2012;206(6):838–46.PubMedGoogle Scholar
- 18.Ly S, Arashiro T, Ieng V, Tsuyuoka R, Parry A, Horwood P, et al. Establishing seasonal and alert influenza thresholds in Cambodia using the WHO method: implications for effective utilization of influenza surveillance in the tropics and subtropics. Western Pacific Surveillance and Response Journal : WPSAR. 2017;8(1):22–32.PubMedGoogle Scholar
- 19.World Health Organization. WHO global epidemiological surveillance standards for influenza. Geneva: World Health Organization; 2014. 84 pGoogle Scholar
- 20.Serfling RE. Methods for current statistical analysis of excess pneumonia-influenza deaths. Public Health Rep. 1963;78(6):494–506.PubMedPubMedCentralGoogle Scholar
- 21.Gog JR, Ballesteros S, Viboud C, Simonsen L, Bjornstad ON, Shaman J, et al. Spatial transmission of 2009 pandemic influenza in the US. PLoS Comput Biol. 2014;10(6):e1003635.PubMedPubMedCentralGoogle Scholar
- 22.Costagliola D, Flahault A, Galinec D, Garnerin P, Menares J. Valleron AJ. A routine tool for detection and assessment of epidemics of influenza-like syndromes in France. Am J Public Health. 1991;81(1):97–9.PubMedPubMedCentralGoogle Scholar
- 23.Olson DR, Konty KJ, Paladini M, Viboud C, Simonsen L. Reassessing Google flu trends data for detection of seasonal and pandemic influenza: a comparative epidemiological study at three geographic scales. PLoS Comput Biol. 2013;9(10):e1003256.PubMedPubMedCentralGoogle Scholar
- 24.Wang X, Wu S, MacIntyre CR, Zhang H, Shi W, Peng X, et al. Using an adjusted Serfling regression model to improve the early warning at the arrival of peak timing of influenza in Beijing. PLoS One. 2015;10(3):e0119923.PubMedPubMedCentralGoogle Scholar
- 25.Yu H, Alonso WJ, Feng L, Tan Y, Shu Y, Yang W, et al. Characterization of regional influenza seasonality patterns in China and implications for vaccination strategies: spatio-temporal modeling of surveillance data. PLoS Med. 2013;10(11):e1001552.PubMedPubMedCentralGoogle Scholar
- 26.Wenger JB, Naumova EN. Seasonal synchronization of influenza in the United States older adult population. PLoS One. 2010;5(4):e10187.PubMedPubMedCentralGoogle Scholar
- 27.Liu X-X, Li Y, Zhu Y, Zhang J, Li X, Zhang J, et al. Seasonal pattern of influenza activity in a subtropical city, China, 2010–2015. Sci Rep. 2017;7(1):17534.PubMedPubMedCentralGoogle Scholar
- 28.Amorós R, Conesa D, Martinez-Beneito MA, López-Quılez A. Statistical methods for detecting the onset of influenza outbreaks: a review. REVSTAT–statistical. Journal. 2015;13(1):41–62.Google Scholar
- 29.Nobre FF. Stroup DF. A monitoring system to detect changes in public health surveillance data. Int J Epidemiol. 1994;23(2):408–18.PubMedGoogle Scholar
- 30.Cheng X, Chen T, Yang Y, Yang J, Wang D, Hu G, et al. Using an innovative method to develop the threshold of seasonal influenza epidemic in China. PLoS One. 2018;13(8):e0202880.PubMedPubMedCentralGoogle Scholar
- 31.Charu V, Zeger S, Gog J, Bjørnstad ON, Kissler S, Simonsen L, et al. Human mobility and the spatial transmission of influenza in the United States. PLoS Comput Biol. 2017;13(2):e1005382.PubMedPubMedCentralGoogle Scholar
- 32.Ministry of Health, Labour and Welfare. National Institute of Infectious Diseases. Influenza, 2000/01 season. Japan. Infectious Agents Surveillance Report (IASR). 2001;22(12):309–10.Google Scholar
- 33.Ministry of Health, Labour and Welfare. National Institute of Infectious Diseases. Influenza in 2001/02 season. Japan. Infectious Agents Surveillance Report (IASR). 2002;23(12):307–8.Google Scholar
- 34.Gu Y, Shimada T, Yasui Y, Tada Y, Kaku M, Okabe N. National surveillance of influenza-associated encephalopathy in Japan over six years, before and during the 2009–2010 influenza pandemic. PLoS One. 2013;8(1):e54786.PubMedPubMedCentralGoogle Scholar
- 35.Hashimoto S, Murakami Y, Taniguchi K, Nagai M. Detection of epidemics in their early stage through infectious disease surveillance. Int J Epidemiol. 2000;29(5):905–10.PubMedGoogle Scholar
- 36.National Institute of Infectious Diseases (NIID) of Japan. Weeks Ending Log [Internet]. 2018. Available from: https://www.niid.go.jp/niid/en/calendar-e.html. Accessed 8 August 2018.
- 37.Zaraket H, Saito R. Japanese surveillance systems and treatment for influenza. Current Treatment Options in Infectious Diseases. 2016;8(4):311–28.PubMedPubMedCentralGoogle Scholar
- 38.National Institute of Infectious Diseases (NIID) of Japan. Infectious Disease Weekly Report (IDWR) [Internet]. 2018. Available from: https://www.niid.go.jp/niid/en/idwr-e.html. Accessed 1 September 2018.
- 39.National Institute of Infectious Diseases (NIID) of Japan. Infectious disease surveillance system in Japan [Internet]. 2018. Available from: https://www.niid.go.jp/niid/ja/nesid-program-summary.html. Accessed 1 September 2018.
- 40.Shoji M, Katayama K, Sano K. Absolute humidity as a deterministic factor affecting seasonal influenza epidemics in Japan. Tohoku J Exp Med. 2011;224(4):251–6.PubMedGoogle Scholar
- 41.Muggeo VMR. Segmented: an R package to fit regression models with broken-line relationships. R news. 2008;8(1):20–5.Google Scholar
- 42.Pratt V. Direct least-squares fitting of algebraic surfaces. ACM SIGGRAPH Computer Graphics; 1987: ACM.Google Scholar
- 43.R Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2017.Google Scholar
- 44.Cai J. Datasets and codes from a maximum curvature method for estimating epidemic onset of seasonal influenza in Japan [internet]. 2018. Available from: https://github.com/caijun/MCM. Accessed 1 October 2018.
- 45.Ministry of Health, Labour and Welfare. National Institute of Infectious Diseases. 2012/13 influenza season. Japan. Infectious Agents Surveillance Report (IASR). 2013;34(11):325–7.Google Scholar
- 46.Ministry of Health, Labour and Welfare. National Institute of Infectious Diseases. 2013/14 influenza season. Japan. Infectious Agents Surveillance Report (IASR). 2014;35(11):251–3.Google Scholar
- 47.Ministry of Health, Labour and Welfare. National Institute of Infectious Diseases. Influenza 2014/15 season. Japan Infectious Agents Surveillance Report (IASR). 2015;36(11):199–201.Google Scholar
- 48.Ministry of Health, Labour and Welfare. National Institute of Infectious Diseases. Influenza 2015/16 season. Japan. Infectious Agents Surveillance Report (IASR). 2016;37(11):211–2.Google Scholar
- 49.Ministry of Health, Labour and Welfare. National Institute of Infectious Diseases. Influenza 2016/17 season. Japan. Infectious Agents Surveillance Report (IASR). 2017;38(11):209–11.Google Scholar
- 50.Ministry of Health, Labour and Welfare. National Institute of Infectious Diseases. Influenza 2017/18 season. Japan. Infectious Agents Surveillance Report (IASR). 2018;39(11):181–3.Google Scholar
- 51.Viboud C, Alonso WJ, Simonsen L. Influenza in tropical regions. PLoS Med. 2006;3(4):e89.PubMedPubMedCentralGoogle Scholar
- 52.Greene SK, Ionides EL, Wilson ML. Patterns of influenza-associated mortality among US elderly by geographic region and virus subtype, 1968–1998. Am J Epidemiol. 2006;163(4):316–26.PubMedGoogle Scholar
- 53.Yu H, Cauchemez S, Donnelly CA, Zhou L, Feng L, Xiang N, et al. Transmission dynamics, border entry screening, and school holidays during the 2009 influenza a (H1N1) pandemic, China. Emerging Infectious Disease. 2012;18(5):758.Google Scholar
- 54.Savitzky A, Golay MJE. Smoothing and differentiation of data by simplified least squares procedures. Anal Chem. 1964;36(8):1627–39.Google Scholar
Copyright information
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.