Abstract
Hemorrhagic fever with renal syndrome (HFRS) is seriously endemic in China with 70%~90% of the notified cases worldwide and showing an epidemic tendency of upturn in recent years. Early detection for its future epidemic trends plays a pivotal role in combating this threat. In this scenario, our study investigates the suitability for application in analyzing and forecasting the epidemic tendencies based on the monthly HFRS morbidity data from 2005 through 2019 using the nonlinear model-based self-exciting threshold autoregressive (SETAR) and logistic smooth transition autoregressive (LSTAR) methods. The experimental results manifested that the SETAR and LSTAR approaches presented smaller values among the performance measures in both two forecasting subsamples, when compared with the most extensively used seasonal autoregressive integrated moving average (SARIMA) method, and the former slightly outperformed the latter. Descriptive statistics showed an epidemic tendency of downturn with average annual percent change (AAPC) of −5.640% in overall HFRS, however, an upward trend with an AAPC = 1.213% was observed since 2016 and according to the forecasts using the SETAR, it would seemingly experience an outbreak of HFRS in China in December 2019. Remarkably, there were dual-peak patterns in HFRS incidence with a strong one occurring in November until January of the following year, additionally, a weak one in May and June annually. Therefore, the SETAR and LSTAR approaches may be a potential useful tool in analyzing the temporal behaviors of HFRS in China.
Similar content being viewed by others
Introduction
Hemorrhagic fever with renal syndrome (HFRS) is a rodent-borne contagious disease caused by several distinct families of Hantaviruses, which can lead to various degrees of fever, shock, congestion, bleeding, and acute renal failure1. Currently, this disease globally occurs in more than seventy countries, and an approximate 70%~90% notification was reported in China2, where HFRS is still considered a serious public health problem due to its highly endemic in 28 of 31 provinces, municipal districts and autonomous regions with about 20,000–50,000 incident cases per year3,4,5, leading to a fatality rate of around 3%~10%6, despite many efforts, such as effective rodent control, vaccination, and environmental management, in reducing HFRS-related incidence over the past decades7. In China, the pathogenic agents of HFRS predominantly include Hantaan virus (HTNV) and Seoul virus (SEOV), though other viruses can be involved4,8,9. Since 1990s, under the intervention-driven strategies, the notified HFRS cases have begun to decline2, but its epidemic trends of HFRS seemingly show a recurring sign in recent years9,10,11,12,13. Therefore, to facilitate to offer a quantitative and explicit direction for the prevention and control of HFRS, a forecasting model with strong robustness and high accuracy to understand its epidemic trajectories is required.
At present, many forecasting methods that act as effective policy-supportive tools have widely been adopted to assess and analyze the temporal patterns of the incidence of contagious diseases, such as pertussis14, HFRS3, pulmonary tuberculosis15, influenza16, syphilis17, etc. Of them, the most commonly used model is the seasonal autoregressive integrated moving average (SARIMA) method that essentially belongs to a linear model3,17. However, what is most often encountered in practice is that the data-generating process is highly nonlinear, especially for the morbidity series of infectious diseases because such data often include complicated traits of seasonality, secular trend, cyclicity, and stochastic fluctuation15,18. At this time, the linear methods simulated to such complicated nonlinear data frequently fail to obtain satisfactory forecasting performance, whereas the nonlinear methods may do better in that they can better capture the underlying dynamic mechanism of the target series18,19. Currently, numerous nonlinear techniques have been recommended to evaluate and analyze the temporal patterns of the incidence of contagious diseases, such as artificial neural networks (ANNS)19, support vector machine (SVM)20, autoregressive conditional heteroscedasticity (ARCH)21, Error-Trend-Seasonal (ETS) approaches22, etc. However, the popular non-linear regime-switching models such as self-exciting threshold autoregressive (SETAR) and logistic smooth transition autoregressive (LSTAR) specifications so far remain unexplored for the incidence time series forecasting of contagious diseases. Therefore, in the setting of the epidemic status of HFRS in China, the aim is to investigate their forecasting abilities of the SETAR and LSTAR approaches to the HFRS incidence data. Meanwhile, their predictive powers were compared with the SARIMA method to detect the best-performing one that can act as an effective policy-supportive tool for the prevention and control of HFRS.
Materials and Methods
Data source
In this time series analysis, the monthly new cases of HFRS from January 1, 2005 through September 31, 2019 were collected from the national notifiable infectious disease surveillance system (NNIDSS), and the population data were extracted from National Bureau of Statistics (http://www.stats.gov.cn/tjsj/ndsj/). In China (The geographical distribution of China is shown in Fig. 1), the clinically diagnosed or laboratory-confirmed HFRS cases based on the diagnostic criteria for notifiable infectious diseases must be registered on the NNIDSS within 24 hours. A case was confirmed based on the following diagnostic principles23: 1) Epidemiological exposure histories. A person has a history of living in the epidemic area or has a direct or indirect contact history with the infected rodents or their excreta (such as feces, saliva, and urine) and secretions within 2 months before the onset of this disease. 2) Clinical manifestations. A person is characterized by gastrointestinal symptoms (such as asthenia, nausea, vomiting, abdominal pain, and diarrhea) and manifestations of capillary damage (such as hyperemia, exudation, and hemorrhage), coupled with hypotension shock or renal damage. 3) Laboratory test. The person with at least one of the laboratory test results in addition to the above 1) and 2) can be confirmed: a positive result for the serum specific IgM antibody, or a 4-time increment for the serum specific IgG antibody in convalescent period than in acute period, or hantavirus RNA detected from the patients, or hantavirus isolated from the patients. We obtained all the data in an anonymous format, without access to any initial information identifying patients, and thus the ethical approval was not needed.
Building SARIMA model
Owing to the seasonal variation of infectious diseases, the SARIMA approach was often built to simulate and forecast their epidemic levels5. This approach is composed of seasonal and non-seasonal parts and can be written as SARIMA(p, d, q)(P, D, Q)s, in which p, d, and q signify the non-seasonal autoregressive (AR) order, the non-seasonal differenced times, and the non-seasonal moving average (MA) order, respectively; P, D, and Q represent the seasonal AR (SAR) order, the seasonal differenced times, and the seasonal MA (SMA) order, respectively; S denotes the length of seasonal pattern (S = 12 in this work)5. The development of the SARIMA approach was followed by four procedures: Initially, we judged whether the HFRS morbidity series was stationary by plotting its sequence graph and performing an augmented Dickey-Fuller (ADF) test22,24. If a nonstationary series was shown, the transformed techniques including logarithm or square root, or/and difference were employed to make it stationary19. Secondly, the autocorrelation function (ACF) and partial autocorrelation function (PACF) diagrams were applied to choose its plausible parameters of this model3. Subsequently, we determined the preferred SARIMA approach. Among the possible models, the one that presented the lowest values of the Bayesian information criterion (BIC) and Akaike information criterion (AIC), together with the maximum value of the Log-likelihood was considered as the best-fitting22. Finally, we further conducted a checking for its parameters and residuals of this optimal model. Once all parameters displayed statistical significances (p < 0.05) and the residuals showed a white noise series under the Ljung-Box test (p > 0.05), meaning that this best-undertaking SARIMA model can be used to perform forecasting19. Otherwise, the above-mentioned modeling steps should be repeated until the best model was found.
Developing regime-switching models
Due to the data-generating process that is often highly nonlinear, which results in an increasing interest in nonlinear techniques modeled to time series18. Of these techniques, the regime-switching methods are significantly popular because they are apt to evaluate and interpret, and capable of producing interesting nonlinearities and rich dynamics25,26. These models describe a class of nonlinear regression featuring piecewise linear specifications and regime switching, and are commonly divided into two categories based on the transition function27: it is called the SETAR method when using the first-order exponential function; another is called the LSTAR method that uses the logistic function. Both methods have the characteristics of asymmetric cycle27. Among them, the LSTAR method allows the expansion and contraction regimes to possess various dynamics, with a smooth transition from one to another. Instead, the SETAR method indicates that different regimes have similar dynamics, whereas the pattern in the transition period may be varied when the process crosses the corresponding threshold27. The formula of a two-regime SETAR (2, p1, p2) method with delay d can be written as28.
where p1 and p2 represent the autoregressive orders of these two submodels, respectively; d denotes the delay parameter; r is the threshold value. Further, this representation can be extended to three or more regimes.
The formula of a two-regime LSTAR (2, p1, p2) method with delay d can be defined as29.
where the p1, p2, and r have the same meanings described in the SETAR method; \(G({Z}_{t},r,th)\) is the logistic function, its location and scale parameters are th and 1/r, respectively.
In this research, the preferred SETAR model was selected based on the pooled AIC = AIC (low regime model) +AIC (high regime model), a lower value frequently corresponded to the best-fitting model, but a close pooled AIC value was very competitive, which should also be tried. The optimal LSTAR model was chosen on the basis of the AIC and BIC values, in which the one that had lower values of both two indices was the best-undertaking.
Performance comparison
We used four statistical measures of the mean absolute deviation (MAD), the mean absolute percentage error (MAPE), the root mean squared error (RMSE), and the mean error rate (MER) to evaluate the accuracy of the forecasts among methods. Typically, the method that presented the lowest value among the above-mentioned measures should be deemed as the optimal.
where \({Y}_{i}\) stands for the original HFRS incidence values, \({\hat{Y}}_{i}\) denotes the forecasts from the three models, \({\bar{Y}}_{i}\) signifies the mean of the original values, \(N\) represents the number of forecasts.
Statistical process
In this study, we classified the observed series into training and testing subsets, among which the observed series between January 1, 2005 and December 31, 2018 (training subset) was used to fit the models, and then selecting the optimal models to forecast the rest of data (testing subset). Meanwhile, an additional training subset from January 1, 2005 and December 31, 2017 and testing subset from January 1, 2018 to September 31, 2019 were provided to account for the models’ uncertainty. The SARIMA, SETAR, and LSTAR methods were erected using the statistical packages of “forecast,” “fUnitRoots,” “TSA,” “tsDyn” and “tseries” of R3.4.3 (R Development Core Team, Vienna, Austria). Additionally, we detected the nonlinearity of the HRFS morbidity series by applying a Brock-Dechert-Scheinkman (BDS) test to the errors of the optimal SARIMA approach30, and using a Lagrangian Multiplier (LM) test to examine whether there existed conditional heteroskedastic behavior and volatility (ARCH effect) in the residual sequence yielded by these three models22. A two-sided p < 0.05 suggests a statistical significance.
Results
Statistical description
Throughout the study period, the reported HFRS cases totaled 181,402, resulting in an annualized and monthly morbidity rates of 0.924 and 0.076 per 100,000 persons, respectively. The original incidence series and the decomposition of this series into trend, seasonal pattern, and irregular component are displayed in Fig. 2 and Supplementary Fig. S1, indicating that together HFRS incidence displayed a downward trend with average annual percent change (AAPC) of −5.640%, and yet the variation trend seemed to show a natural cyclical pattern with 3–5 years’ fluctuations: morbidity rate dramatically dropped from 1.704 to 0.690 per 100,000 persons in the period 2005–2009, with AAPC = − 19.029%; then it climbed to 1.028 per 100,000 persons in 2012, with AAPC = 9.037% relative to the level of 2009; immediately afterward the trend was decreasing between 2012 and 2016 (1.028 to 0.671 per 100,000 population), with AAPC = − 2.793%; and then with an AAPC = 1.213% from 2016 to 2018. And the HFRS incidence series was strongly seasonal with a cycle of 12 months, where a semi-annual seasonal pattern was observed, with a strong peak occurring from November to January of the following year and a weak one in May and June annually, while a trough was observed in August and September per year (Fig. 2 and Supplementary Fig. S2).
The best-performing SARIMA method
Before modeling the training samples from January 1, 2005 through December 31, 2018, the ADF test was applied to the data (ADF = − 3.621, p < 0.001), being indicative of a stationary series, which met the requirement of the SARIMA method establishment. However, it appeared that there was an unstable variance and mean in this series over time (Fig. 2B). Accordingly, the logarithmic and square root transformations were applied to the series to stabilize its variance, indicating a similar trend between these two series (Supplementary Fig. S3). After an attempt, it seemed that the logarithmic transformation was more suitable for the SARIMA model construction. Subsequently, the seasonal and nonseasonal differences were performed to reduce its trend and seasonality of this processed series (Supplementary Figs S4-S6). Now, the transformation and differencing have made the data achieve completely stationary. Based on the ACF and PACF plots of this stationary series, several possible SARIMA methods were chosen (Table 1). Further, the results from the goodness of fit tests intimated that the SARIMA(0,1,3)(0,1,1)12 tended to be the best-fitting model, as this model had the lowest values of AIC = − 851.561 and BIC = − 836.344, together with the maximum value of Log-likelihood=430.781, and the parameters of this model indicated a significant difference at the 5% level (Table 2). Moreover, a greater p-value than 0.05 under the Ljung-Box test meant that the residual series successfully accomplished white noise (Fig. 3). In addition, the LM test indicated that the ARCH effects existed in the original observed data were also eliminated (Table 3). This optimal model passed all required checking, and thus can be utilized to perform projections for the future (Table 4). Likewise, following the modeling steps, we conducted a sensitivity analysis using the additional training subset from January 1, 2005 to December 31, 2017 to verify the model’s uncertainty. The obtained best-conducting SARIMA model and its goodness of fit testing results are summarized in Supplementary Tables S1-S3 and Fig. S7.
The best-performing regime-switching methods
The results of the BDS test are displayed in Table 5, all statistics revealed a p-value less than 0.05, being suggestive of a highly nonlinear mechanism of the data. Consequently, it is necessary to establish the model-based nonlinear SETAR and LSTAR methods fitted to the HFRS incidence series. In this work, we used the grid search to detect the appropriate parameters (d, p1, and p2) for these two methods. After trying over and over again, we found that the nominal AIC was smallest when the delay parameter d = 2 (d = 1, 2, 3, 4, and 5 corresponded to the nominal AIC = − 756.0, −852.4, −798.5, −753.3, and −781.0, respectively), and as shown in Supplementary Table S4, suggesting that the pooled AIC had the lowest value of −754.045 when p1 and p2 were 3 and 5, respectively, in the SETAR method, and yet the p1 = 4 and p2 = 5 were competitive. Thus, an approximation of these possible parameters of the SETAR method to the HFRS incidence series was attempted, the comparative results are given in Table 6, and the mimic performance measures of the SETAR(2,4,5) method provided smaller values than that of the SETAR(2,3,5) method. The results hinted that the SETAR(2,4,5) method seemed more suitable for our data (Supplementary Table S5), and the statistical checking results for the residuals from this method are shown in Table 3 and Fig. 4. Further, we tested the preferred three-regime SETAR method, which produced a poorer performance with MAPE = 21.820% than the best-fitting two-regime. Consequently, we selected the two-regime model as the optimal in our study. In the meantime, we could also get the best-fitting LSTAR(2,4,5) approach using the grid search (Table 6, Fig. 5 and Supplementary Tables S5-S6). Whereafter, the out-of-data forecasts can be made by using these two best-undertaking approaches (Table 4). Similarly, the preferred SETAR and LSTAR approaches used to account for the models’ uncertainty can be established based on the above-mentioned steps, and all results are listed in Supplementary Tables S7-S9 and Figs S8-S9.
Measuring for forecasting accuracy
The comparative results of the out-of-sample forecasting are presented in Table 7. As can be seen from the data, the SETAR and LSTAR approaches visibly provided smaller values among the measures of MAE, MAPE, RMSE, and MER in both two forecasting sets, and the SETAR approach was slightly superior to the LSTAR method in view of the above four indices. Looking at Fig. 6, compared with the SARIMA model, also indicating that the SETAR and LSTAR methods could better capture the dynamic dependent structure of the data. In the light of these results, we thus constructed the SETAR model depending on the entire HFRS incidence data to undertake a projection into June 2021, and the 95% predictive intervals were resorted to simulation with 5,000 sizes (Fig. 7). According to the predictive results, it appeared that there would be a likelihood of HFRS outbreak in December 2019 since its forecast in this month was out of the 95% uncertainty intervals.
Discussion
Recently, the recurring risk of HFRS has been an increasing concern in China9,10,11,12,13. Forecasting based on high accuracy models may provide a useful aid in the development of a preventive and control system, as well as the reallocation of the limited resources. In this study, we established SETAR method and LSTAR approach to analyze and forecast the temporal tendencies of HFRS, moreover, the predictive abilities of the frequent use of SARIMA method and above used methods were compared. The time series analysis results demonstrated a valuable estimation for the 9-data-ahead (short-term) and the 21-data-ahead (long-term) predictions using the SETAR and LSTAR approaches, which provided more accurate and robust predictions for the HFRS morbidity series relative to the SARIMA approach, additionally, the SETAR model seemed to slightly overmatch the LSTAR model in the predictive power. Furthermore, given the MAPE value that is often used to measure the accuracy of a prediction31, suggesting no significant deterioration in the long-term prediction performance in comparison to that in the short-term predictions (the MAPE values were 0.2388 vs. 0.2412 in the SETAR method and 0.2406 vs. 0.2604 in the LSTAR method). Our investigation meant that the predictive performances of these two methods maintained robustness, and they can be recommended as a useful tool in understanding and predicting the epidemic patterns of HFRS, which will be of fundamental importance for the prevention and control of this disease. What’s more, we observed that the SARIMA method showed an unacceptable level of accuracy with the predictive period increased, which further confirmed that the SARIMA approach is suited to evaluate the short-term temporal levels of a time series16.
The SARIMA method assumes that there exists certain linear link between the future epidemic trajectories of the target time series and the changing state of its historical data32, thus, it has been emerged as the most popular model to perform a forecast for the future by considering the overall trends and seasonal pattern of a time series with seasonality or non-seasonality16,32. For example, Cong et al. built a SARIMA(1,0,0)(0,1,1)12 method to forecast the epidemic patterns of influenza incidence in China16. Fu et al. developed a SARIMA(0,0,2)(0,1,1)12 method to conduct a forecast for the incidence series of hand-foot-mouth disease in Zhejiang Province33. Albeit this method frequently provides a good approximation to the target time series, it suffers from weaknesses in handling nonlinear patterns and which is only suitable for undertaking short-term forecasting16,17. All in all, this is in agreement with our findings. In the real world, the development and occurrence of diseases are associated with many drivers, which make the relationship between the observations show nonlinear modes. Therefore, it is necessary to test the data-generating mechanism prior to establishing a model for the target time series. In this study, the BDS test was applied to the residuals produced by the SARIMA method to detect nonlinearity of the HFRS incidence series, suggesting a notable nonlinear tendency. In this context, nonlinear methods are seemingly appropriate. To our best knowledge, the model-based nonlinear SETAR and LSTAR approaches were the first time employed to predict HFRS incidence in the forecasting domain of infectious diseases, and our experimental results also demonstrated their usefulness in the prediction for the HFRS morbidity series. However, for one thing, considering the different types of non-linear modes in data-generating process18, further investigations are required to evaluate the suitability for forecasting other infectious diseases; for another, other nonlinear statistical models (including ANNS19, ARCH21, SVM20, etc.) have also been established to study the temporal behaviors of infectious diseases. Collectively, further comparisons between our used methods and the above discussed models are also required in order to find the most accurate one that captures the nonlinear relationship. Besides, also notice that mathematical epidemiology has played an important role in the understanding of infectious disease transmission in human populations in the past decades34. Among which, the mechanistic models based on time series (such as susceptible exposed infectious recovered (SEIR) model or SIR model) have widely been used to model the transmission dynamics of contagious diseases such as measles35, coronavirus disease 201936, dengue fever37, HFRS2, etc, in that the SEIR or SIR model can easily explain inhomogeneous mixing in a phenomenological manner by considering the nonlinear dependence of contact rates between susceptible population and hosts38, and thus can be used to assess the key parameters of infectious processes and clarify the potential processes driving the transmission dynamics of infectious diseases39. However, the SEIR or SIR is a deterministic model with the assumption that the infectious persons are independently and randomly mixing with all other persons2. As mentioned above, in practice, the transmission of contagious diseases is limited and affected by varying indeterminate divers (e.g., climatic variability, seasonal variation, variations in pathogens, or government policy)2,40,41. Under such circumstances that the morbidity data are often inclined to show uncertainty and nonlinearity32, the SEIR or SIR model may obtain unsatisfactory forecasting results. At this time, our used regime-switching methods may be more suitable and more convincing, because these nonlinear models assign multiple potential drivers and comprehensive effects of uncertainty factors that may drive the disease occurrence and development to a univariate time series, and then performing prediction by identifying the potential relationships between the future state of the incidence series and the past and present internal rules of the historical series. Moreover, the regime-switching methods with the advantages of low-cost data collection and extensive application in practice are easy to develop (based only on intrinsic variables) and can obtain relatively satisfactory predictive accuracy as evidenced by our experiment results. Despite these advantages of the regime-switching models, much work is still required to compare the real forecasting effects between mechanistic models and regime-switching models.
Our research manifested a downward trend in HFRS morbidity in the whole study period, which is similar to that observed in some countries in Asia42. This may mainly be attributed to the government’s continued efforts such as the implementation of a series of rodents’ control measures, the improvement of living standards, the increased urbanization and farm mechanization, and the development of the targeted vaccine and so forth7. Under these efforts, some achievements have been attained, but we observed that the epidemic trend started to rise since 2016, which is seemingly not until December 2019 that such an increase will reach the climax with the highest incidence of 0.191 per 100,000 people according to our predictions using the SETAR method, and there may be a risk of outbreak. Regarding the substantial increase in HRFS morbidity, one plausible explanation may be related to the effects of climatic change which has posed a serious threat on the global scale1,43. HFRS has been identified as a climate-sensitive disease because weather variability has a direct or indirect impact on the rodent population dynamics, such as reproductive rates and incubation period, crop output that serves as the foremost food sources for rodents, and viral exposure opportunities in predisposed population1,11,43; another main reason may emanate from the fact that periodic outbreak is among the most important epidemiological characteristics of HFRS44. Previous work has reported a natural cyclical pattern in HFRS morbidity with around 7–12 years44,45, this phenomenon was also observed in our work, despite with a periodic outbreak being 3–5 years. Besides, new Hantavirus subtypes may also be associated with this sudden increase since a recent study has shown that the emergence of new Seoul viruses raises new challenges to fight against HFRS11. Also, investigations into other plausible causes still go on.
Understanding the seasonal distribution of infectious diseases is of great significance for the analysis and estimation of the diseases’ transmission patterns. Our analytical results exhibited a strong seasonality in HFRS morbidity with a dual-peak pattern, where a strong peak was observed in November until January in the next year and a weak one in May and June per year. The observation fits well with that reported in most areas of China, such as Qingdao11, Zibo46, Zhejiang47, Changsha45, Heilongjiang4, Shenyang7, Hubei44, Liaoning and Anhui43, and also concurs well in Korea48, but inconsistent with that reported in Guangzhou (which peaked in February until May)12. Such a significant difference in seasonal behaviors is predominantly responsive to climatic and demographic factors in the northern hemisphere city, Guangzhou, and its climate is characterized by a wet of high temperatures and a high humidity index, which is significantly different from other areas in China12. In China, the double peak activities in HRFS morbidity may be mainly reeling from its etiologic factors and climatic factors2,10,11,44. Earlier work has found that the HTNV-related HFRS infections are reported all through the year, yet most of them occur in fall and winter whereas the SEOV-related cases are typically observed in spring, and these two pathogenic agents are predominantly spread by A. agrarius and R. norvegicus rodents, respectively10,44. Significantly, climatic drivers, such as temperature, relative humidity, precipitation, etc., can affect hosts’ reproduction and thus causing the transmission of HFRS2,11,44. For instance, the relationship between temperature and relative humidity and HFRS epidemic exhibits a U-shaped curve11, which is in agreement with the peak activities present in winter and summer in HFRS incidence.
This study focused on an investigation into the suitability for application in analyzing and forecasting the epidemic trends in HFRS morbidity using the SETAR and LSTAR methods and has shown their usefulness. Nevertheless, several potential shortfalls need to be considered. Firstly, the under-reporting and under-diagnosis may still be inevitable, in spite of the well-monitored data quality regarding infectious diseases in China, Secondly, we only collected the monthly and yearly HFRS cases absent from some detailed information (such as age, sex, and occupation) due to their unavailability, which precludes further stratified or sensitivity analysis that accounts for the models’ uncertainty. Thirdly, whether these methods are applicable to study HFRS epidemic in other areas needs to be further verified. Finally, in application, these methods entail to be updated with the newly aggregated data in order to maintain their high prediction accuracies.
In conclusion, our findings suggested that the SETAR and LSTAR methods showed superiorities in tracking the temporal patterns than the most commonly adopted SARIMA approach, moreover, they can undertake long-term forecasting, which can function as a useful tool in offering an advanced warning for the epidemiological characteristics of HRFS, and therefore formulating a long-term targeted prevention and control plans in response to this threat of HRFS. Additionally, China is still afflicted with the risk of HFRS outbreak under the present control and prevention strategies. Consequently, more effective control measures are warranted.
Data availability
All data were presented in our analytical results or please contact the first author or the corresponding author on reasonable request.
References
Tian, H. Y. et al. Changes in rodent abundance and weather conditions potentially drive hemorrhagic fever with renal syndrome outbreaks in Xi’an, China, 2005-2012. PLoS Neglected Tropical Diseases 9, e0003530, https://doi.org/10.1371/journal.pntd.0003530 (2015).
Li, L. et al. Hemorrhagic fever with renal syndrome in China: Mechanisms on two distinct annual peaks and control measures. International Journal of Biomathematics 11, 1850030, https://doi.org/10.1142/s1793524518500304 (2018).
Liu, Q., Liu, X., Jiang, B. & Yang, W. Forecasting incidence of hemorrhagic fever with renal syndrome in China using ARIMA model. BMC Infectious Diseases 11, 218, https://doi.org/10.1186/1471-2334-11-218 (2011).
Li, C. P. et al. Association between hemorrhagic fever with renal syndrome epidemic and climate factors in Heilongjiang Province, China. American Journal of Tropical Medicine and Hygiene 89, 1006–1012, https://doi.org/10.4269/ajtmh.12-0473 (2013).
Sun, L. & Zou, L. X. Spatiotemporal analysis and forecasting model of hemorrhagic fever with renal syndrome in mainland China. Epidemiology and Infection 146, 1680–1688, https://doi.org/10.1017/s0950268818002030 (2018).
Yan, L. et al. Landscape elements and Hantaan virus-related hemorrhagic fever with renal syndrome, People’s Republic of China. Emerging Infectious Diseases 13, 1301–1306, https://doi.org/10.3201/eid1309.061481 (2007).
Liu, X., Jiang, B., Gu, W. & Liu, Q. Temporal trend and climate factors of hemorrhagic fever with renal syndrome epidemic in Shenyang City, China. BMC Infectious Diseases 11, 331, https://doi.org/10.1186/1471-2334-11-331 (2011).
Zhang, S. et al. Epidemic characteristics of hemorrhagic fever with renal syndrome in China, 2006-2012. BMC Infectious Diseases 14, 384, https://doi.org/10.1186/1471-2334-14-384 (2014).
Liang, W. et al. Mapping the epidemic changes and risks of hemorrhagic fever with renal syndrome in Shaanxi Province, China, 2005-2016. Scientific Reports 8, 749, https://doi.org/10.1038/s41598-017-18819-4 (2018).
Fang, L. Q. et al. Spatiotemporal trends and climatic factors of hemorrhagic fever with renal syndrome epidemic in Shandong Province, China. PLoS Neglected Tropical Diseases 4, e789, https://doi.org/10.1371/journal.pntd.0000789 (2010).
Jiang, F. et al. Meteorological factors affect the epidemiology of hemorrhagic fever with renal syndrome via altering the breeding and hantavirus-carrying states of rodents and mites: a 9 years’ longitudinal study. Emerging Microbes & Infections 6, e104, https://doi.org/10.1038/emi.2017.92 (2017).
Wei, Y. et al. Meteorological factors and risk of hemorrhagic fever with renal syndrome in Guangzhou, southern China, 2006-2015. PLoS Neglected Tropical Diseases 12, e0006604, https://doi.org/10.1371/journal.pntd.0006604 (2018).
Zhang, W. Y. et al. Spatiotemporal transmission dynamics of hemorrhagic fever with renal syndrome in China, 2005-2012. PLoS Neglected Tropical Diseases 8, e3344, https://doi.org/10.1371/journal.pntd.0003344 (2014).
Zhang, Y. et al. Using big data to predict pertussis infections in Jinan city, China: a time series analysis. International Journal of Biometeorology 64, 95–104, https://doi.org/10.1007/s00484-019-01796-w (2019).
Wang, Y. et al. Temporal trends analysis of tuberculosis morbidity in mainland China from 1997 to 2025 using a new SARIMA-NARNNX hybrid model. BMJ Open 9, e024409, https://doi.org/10.1136/bmjopen-2018-024409 (2019).
Cong, J., Ren, M., Xie, S. & Wang, P. Predicting Seasonal Influenza Based on SARIMA Model, in Mainland China from 2005 to 2018. International Journal of Environmental Research and Public Health 16, https://doi.org/10.3390/ijerph16234760 (2019).
Zhang, X. et al. Time Series Modelling of Syphilis Incidence in China from 2005 to 2012. PloS One 11, e0149401, https://doi.org/10.1371/journal.pone.0149401 (2016).
Zhang, G. P., Patuwo, B. E. & Hu, M. Y. A simulation study of artificial neural networks for nonlinear time-series forecasting. Computers & Operations Research 28, 381–396 (2001).
Wu, W., An, S. Y., Guan, P., Huang, D. S. & Zhou, B. S. Time series analysis of human brucellosis in mainland China by using Elman and Jordan recurrent neural networks. BMC Infectious Diseases 19, 414, https://doi.org/10.1186/s12879-019-4028-x (2019).
Zhang, X., Zhang, T., Young, A. A. & Li, X. Applications and comparisons of four time series models in epidemiological surveillance data. PloS One 9, e88075, https://doi.org/10.1371/journal.pone.0088075 (2014).
Zheng, Y. L., Zhang, L. P., Zhang, X. L., Wang, K. & Zheng, Y. J. Forecast model analysis for the morbidity of tuberculosis in Xinjiang, China. PloS One 10, e0116832, https://doi.org/10.1371/journal.pone.0116832 (2015).
Wang, Y. et al. Temporal trends analysis of human brucellosis incidence in mainland China from 2004 to 2018. Scientific Reports 8, 15901, https://doi.org/10.1038/s41598-018-33165-9 (2018).
National Health Commission of the people’s Republic of China. Diagnostic criteria for epidemic hemorrhagic fever, http://www.nhc.gov.cn/wjw/s9491/200802/39043.shtml. Accessed on 2020 May 5.
Li, Z. et al. Application of a hybrid model in predicting the incidence of tuberculosis in a Chinese population. Infection and Drug Resistance 12, 1011–1020, https://doi.org/10.2147/idr.s190418 (2019).
Eviews Help: Discrete Threshold Regression, http://www.eviews.com/help/helpintro.html#page/content/tar-Discrete_Threshold_Regression.html. Accessed on 2019 December 14.
Feng, H. & Liu, J. A SETAR model for Canadian GDP: non-linearities and forecast comparisons. Applied Economics 35, 1957–1964.
Pérez-Rodríguez, J. V., Torra, S. & Andrada-Félix, J. STAR and ANN models: forecasting performance on the Spanish “Ibex-35” stock index. Journal of Empirical Finance 12, 1–39.
Boero. The performance of SETAR models: a regime conditional evaluation of point, interval and density forecasts. International Journal of Forecasting 20, 305–320.
tsDyn: Nonlinear Time Series Models with Regime Switching, http://cran.rapporter.net/web/packages/tsDyn/index.html. Accessed on 2019 December 14.
Gutierrez, A. N. Detecting Nonlinear Dynamics Using BDS Test and Surrogate Data in Financial Time Series. Journal of Mathematics and System Science 9, https://doi.org/10.17265/2159-5291/2019.02.002 (2019).
Pao, H. T. Forecasting energy consumption in Taiwan using hybrid nonlinear models. Energy 34, 1438–1446, https://doi.org/10.1016/j.energy.2009.04.026 (2009).
Liu, Q. et al. Forecasting the seasonality and trend of pulmonary tuberculosis in Jiangsu Province of China using advanced statistical time-series analyses. Infection and Drug Resistance 12, 2311–2322, https://doi.org/10.2147/idr.s207809 (2019).
Fu, T. et al. Development and comparison of forecast models of hand-foot-mouth disease with meteorological factors. Scientific Reports 9, 15691, https://doi.org/10.1038/s41598-019-52044-5 (2019).
Cauchemez, S. & Ferguson, N. M. Likelihood-based estimation of continuous-time epidemic models from time-series data: application to measles transmission in London. Journal of the Royal Society Interface 5, 885–897, https://doi.org/10.1098/rsif.2007.1292 (2008).
Becker, A. D. & Grenfell, B. T. tsiR: An R package for time-series Susceptible-Infected-Recovered models of epidemics. PloS One 12, e0185528, https://doi.org/10.1371/journal.pone.0185528 (2017).
Wu, J. T., Leung, K. & Leung, G. M. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet 395, 689–697, https://doi.org/10.1016/s0140-6736(20)30260-9 (2020).
Cheng, Y. H. et al. Assessing health burden risk and control effect on dengue fever infection in the southern region of Taiwan. Infection and Drug Resistance 11, 1423–1435, https://doi.org/10.2147/idr.s169820 (2018).
Kraemer, M. U. et al. Big city, small world: density, contact rates, and transmission of dengue across Pakistan. Journal of the Royal Society Interface 12, 20150468, https://doi.org/10.1098/rsif.2015.0468 (2015).
Rasmussen, D. A., Ratmann, O. & Koelle, K. Inference for nonlinear epidemiological models using genealogies and time series. PLoS Computational Biology 7, e1002136, https://doi.org/10.1371/journal.pcbi.1002136 (2011).
Xiao, Y., Tang, S. & Wu, J. Media impact switching surface during an infectious disease outbreak. Scientific Reports 5, 7838, https://doi.org/10.1038/srep07838 (2015).
Sun, G.-Q. & Zhang, Z.-K. Global stability for a sheep brucellosis model with immigration. Applied Mathematics and Computation 246, 336–345, https://doi.org/10.1016/j.amc.2014.08.028 (2014).
Joshi, Y. P., Kim, E. H. & Cheong, H. K. The influence of climatic factors on the development of hemorrhagic fever with renal syndrome and leptospirosis during the peak season in Korea: an ecologic study. BMC Infectious Diseases 17, 406, https://doi.org/10.1186/s12879-017-2506-6 (2017).
Xiang, J. et al. Impact of meteorological factors on hemorrhagic fever with renal syndrome in 19 cities in China, 2005-2014. Science of the Total Environment 636, 1249–1256, https://doi.org/10.1016/j.scitotenv.2018.04.407 (2018).
Zhang, Y. H. et al. The epidemic characteristics and changing trend of hemorrhagic fever with renal syndrome in Hubei Province, China. PloS One 9, e92700, https://doi.org/10.1371/journal.pone.0092700 (2014).
Xiao, H. et al. Atmospheric moisture variability and transmission of hemorrhagic fever with renal syndrome in Changsha City, Mainland China, 1991-2010. PLoS Neglected Tropical Diseases 7, e2260, https://doi.org/10.1371/journal.pntd.0002260 (2013).
Wang, L. et al. Hemorrhagic Fever with Renal Syndrome, Zibo City, China, 2006-2014. Emerging Infectious Diseases 22, 274–276, doi:10.3201eid/2202.151516 (2016).
Wu, H. et al. Spatial-temporal characteristics and the epidemiology of haemorrhagic fever with renal syndrome from 2007 to 2016 in Zhejiang Province, China. Scientific Reports 8, 10244, https://doi.org/10.1038/s41598-018-28610-8 (2018).
Ryou, J. et al. Prevalence of hantavirus infection in wild rodents from five provinces in Korea, 2007. Journal of Wildlife Diseases 47, 427–432, https://doi.org/10.7589/0090-3558-47.2.427 (2011).
Acknowledgements
We thank all people who collected the HFRS data. This project was supported by the MD Research Project of Xinxiang Medical University (XYBSKYZZ201912). The funders failed to take part in all the processes of the manuscript.
Author information
Authors and Affiliations
Contributions
Y.B.W., C.J.X. and S.Q.Y. conceived this work, and collected and analyzed the data. W.D.W., J.C.R., Y.C.L. and L.H.G. improved the paper. All authors agree to submit this manuscript. The funders did not participate in the process of this manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, Y., Xu, C., Wu, W. et al. Time series analysis of temporal trends in hemorrhagic fever with renal syndrome morbidity rate in China from 2005 to 2019. Sci Rep 10, 9609 (2020). https://doi.org/10.1038/s41598-020-66758-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-66758-4
- Springer Nature Limited
This article is cited by
-
Transcriptomic analysis reveals key molecular signatures across recovery phases of hemorrhagic fever with renal syndrome
BMC Medical Genomics (2024)
-
Spatial-temporal drivers and incidence heterogeneity of hemorrhagic fever with renal syndrome transmission in Shandong Province, China, 2016–2022
BMC Public Health (2024)
-
Trend analysis and prediction of gonorrhea in mainland China based on a hybrid time series model
BMC Infectious Diseases (2024)
-
Time-series forecasting through recurrent topology
Communications Engineering (2024)