Prediction of congenital heart disease for newborns: comparative analysis of Holt-Winters exponential smoothing and autoregressive integrated moving average models

Xu, Weize; Shao, Zehua; Lou, Hongliang; Qi, Jianchuan; Zhu, Jihua; Li, Die; Shu, Qiang

doi:10.1186/s12874-022-01719-1

Prediction of congenital heart disease for newborns: comparative analysis of Holt-Winters exponential smoothing and autoregressive integrated moving average models

Research
Open access
Published: 01 October 2022

Volume 22, article number 257, (2022)
Cite this article

Download PDF

You have full access to this open access article

BMC Medical Research Methodology Aims and scope Submit manuscript

Prediction of congenital heart disease for newborns: comparative analysis of Holt-Winters exponential smoothing and autoregressive integrated moving average models

Download PDF

Weize Xu¹^na1,
Zehua Shao²^na1,
Hongliang Lou³^na1,
Jianchuan Qi¹,
Jihua Zhu⁴,
Die Li¹ &
…
Qiang Shu¹

2989 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Objective

To describe the temporal trend of the number of new congenital heart disease (CHD) cases among newborns in Jinhua from 2019 to 2020 and explored an appropriate model to fit and forecast the tendency of CHD.

Methods

Data on CHD from 2019 to 2020 was collected from a health information system. We counted the number of newborns with CHD weekly and separately used the additive Holt-Winters ES method and ARIMA model to fit and predict the number of CHD for newborns in Jinhua. By comparing the mean square error, rooted mean square error and mean absolute percentage error of each approach, we evaluated the effects of different approaches for predicting the number of CHD in newborns.

Results

A total of 1135 newborns, including 601 baby girls and 534 baby boys, were admitted for CHD from HIS in Jinhua during the 2-year study period. The prevalence of CHD among newborns in Jinhua in 2019 was 0.96%. Atrial septal defect was diagnosed the most frequently among all newborns with CHD. The number of CHD cases among newborns remained stable in 2019 and 2020. There were fewer cases in spring and summer, while cases peaked in November and December. The ARIMA(2,1,1) model relatively offered advantages over the additive Holt-winters ES method in predicting the number of newborns with CHD, while the accuracy of ARIMA(2,1,1) was not very ideal.

Conclusions

The diagnosis of CHD is related to many risk factors, therefore, when using temporal models to fit and predict the data, we must consider such factors’ influence and try to incorporate them into the models.

View this article's peer review reports

Time series prediction of under-five mortality rates for Nigeria: comparative analysis of artificial neural networks, Holt-Winters exponential smoothing and autoregressive integrated moving average models

Article Open access 03 December 2020

Disease burden and attributable risk factors of neonatal disorders and their specific causes in China from 1990 to 2019 and its prediction to 2024

Article Open access 18 January 2023

Comparison of ARIMA, ETS, NNAR, TBATS and hybrid models to forecast the second wave of COVID-19 hospitalizations in Italy

Article Open access 04 August 2021

Introduction

Congenital heart disease (Congenital heart defect, CHD), one of the most common birth defects among perinatal infants, has caused great harm to health and life [1,2,3]. CHD includes a great number of types, such as holes inside the heart that make the blood unable to flow normally. In some cases, CHD could be detected at birth. And other times, these problems may not be discovered until after adulthood [4]. In 2015, 48.9 million people worldwide were reported to have CHD [5]. CHD is one of the leading causes of birth defect-related deaths, it resulted in more than 300,000 deaths in 2015 [6]. The incidence of CHD is usually higher in developing countries than in developed countries [7, 8]. The prevalence of CHD in Beijing was about 7.77 per 1000 births in 2016 and a total of 1851 newborns were diagnosed with critical CHD during 2010–2017 in Beijing, the prevalence was 10.43 per 10,000 [1, 9]. Previous studies have identified that genetic and environmental factors are risk factors for CHD [10]. However, there is no effective method to prevent CHD.

Time series are arranging the numeric value of statistical indicators in chronological order and forming corresponding sequences. When study on the time series of certain infectious diseases or disease events, the long-term trends, seasonal patterns, cyclic or rhythmic patterns of them allow for modelling and prediction of future outbreaks. For decades, the temporal models have been greatly developed and can be divided into deterministic models and stochastic models. Deterministic models are usually suitable for time series with typical variation characteristics. While the data of infectious diseases do not always have some typical variation characteristics, which makes the stochastic error terms produced by deterministic models cannot meet the conditions for randomness. Therefore, researchers usually choose stochastic models rather than deterministic models to perform the time-series analysis for disease events. Based on the temporal models, time-series analysis has been widely used in epidemiology to fit the data, such as influenza, malaria and so on. Spaeder et al. built a Box-Jenkins model using laboratory-confirmed H1N1 influenza incidence data in 2009 to forecast the H1N1 incidence during 2010–2011 [11]. The result showed that the 95% confidence intervals (95% CI) of the Box-Jenkins model were accurate to ±3.6 cases per 3-day period for their institution, which suggest this model may be a useful tool in forecasting the incidence of H1N1 influenza. Alegana et al. used a Bayesian Spatio-temporal conditional-autoregressive model to fit the malaria data in Afghanistan from 2006 to 2009 [12]. They found that the incidence of malaria usually peaked in August and November, this discovery would make a great contribution to the malaria case management in a local area. To determine the possible trend and seasonal pattern in hospitalizations for pulmonary embolism (PE) in Spain, Guijarro et al. used some different kinds of methods to generate a predictive time series model, which showed a linear increase and a seasonal pattern of PE incidence for hospitalizations [13].

We explored different approaches including the exponential smoothing method (ES) and autoregressive integrated moving average model (ARIMA) to fit the weekly cases of CHD among newborns in Jinhua, Zhejiang Province during 2019–2020, and then forecast the weekly cases of CHD among newborns for 3 months (12 weeks). We hypothesize a suitable temporal model which can provide a reference for the study of the epidemic trend of CHD among newborns in Jinhua, Zhejiang Province and help the government to take rational measures for disease prevention.

Methods

Study area and data on CHD

This study was conducted in Jinhua city, the fourth largest advanced economy region of Zhejiang Province, China. Jinhua is located in the middle of Zhejiang Province, with a total area of 10,942 km². According to the official population statistics, the permanent population of Jinhua is 7,050,683 in 2020.

We collected neonatal data from all hospitals in Jinhua from 2019 to 2020 through the health information system (HIS). Diagnosis and classification of CHD for newborns were performed by qualified physicians based on ultrasound results. Newborns with CHD were classified using a previous algorithm which classified CHD based on embryo-associated defect phenotypes [14, 15]. These defect phenotypes mainly included patent ductus arteriosus (PDA), atrial septal defect (ASD), ventricular septal defect (VSD) and patent foramen ovale (PFO). Other phenotypes were uniformly classified as other due to the small number of cases. Population data was collected from Jinhua Statistic Yearbook.

Statistical analysis

We counted the number of newborns with CHD weekly and separately used ES method and ARIMA model to fit and predict the number of CHD for newborns in Jinhua.

ES, which was put forward by Robert G. Brown, is a common method in production forecasting, also used for medium and short-term economic development trend forecasting. The basic principle of ES method is to give different weights to the observed values of the time-series data. Compared with the earlier data, the recent data will be given greater weight, by which it can better eliminate the influence of noise and get a more reasonable and reliable model. According to the counts of smoothing process and parameters, the ES method can be divided into the basic exponential smoothing method, double exponential smoothing method and triple exponential smoothing method [16,17,18]. The basic exponential smoothing method is to apply exponential smoothing only once for training data. The double exponential smoothing method, which applies exponential smoothing two times, is usually suitable for the time series with a linear trend. Compared with the basic exponential smoothing method and double exponential smoothing method, the triple exponential smoothing method, which applies exponential smoothing three times, incorporates the seasonal effects into the model. If we set α as the smoothing factor (0 < α < 1), then we can find that:

$${S}_t=a\times {y}_t+\left(1-a\right){S}_{t-1}$$

Where the smoothed statistic S_t is a simple weighted average of the current observation y_t and the previous smoothed statistic S_t − 1. Therefore, basic exponential smoothing method, double exponential smoothing method and triple exponential smoothing method can be expressed respectively as:

$${\displaystyle \begin{array}{c}{S}_t^{(1)}=a\times {y}_t+\left(1-a\right){S}_{t-1}^{(1)}\\ {}{S}_t^{(2)}=a\times {S}_t^{(1)}+\left(1-a\right){S}_{t-1}^{(2)}\\ {}{S}_t^{(3)}=a\times {S}_t^{(2)}+\left(1-a\right){S}_{t-1}^{(3)}\end{array}}$$

Double exponential smoothing model, also called linear prediction model, is given by the formulas as follow:

$${\displaystyle \begin{array}{c}{\hat{Y}}_{t+T}={a}_t+T\bullet {b}_t\\ {}{a}_t=2{S}_t^{(1)}-{S}_t^{(2)}\\ {}{b}_t=\frac{a}{1-a}\left({s}_t^{(1)}-{s}_t^{(2)}\right)\end{array}}$$

Where the original data sequence of observations is represented by y_t, beginning at time t = 0. We use a_t to represent the smoothed value for time t, and b_t is our best estimate of the trend at time t. The output of the algorithm is now written as ${\hat{Y}}_{t+T}$, an estimate of the value of x at time t + T for T > 0 based on the raw data up to time t, α is the data smoothing factor, 0 < α < 1.

Triple exponential smoothing model, with multiplicative seasonality, is given by the formulas as follow:

$${\displaystyle \begin{array}{c}{\hat{Y}}_{t+T}={a}_t+{b}_t\bullet T+{c}_t\bullet {T}^2\\ {}{a}_t=3{S}_t^{(1)}-2{S}_t^{(2)}-{S}_t^{(3)}\\ {}\begin{array}{c}{b}_t=\frac{a}{2{\left(1-a\right)}^2}\left[\left(6-5a\right){S}_t^{(1)}-2\left(5-4a\right){S}_t^{(2)}+\left(4-3a\right){S}_t^{(3)}\right]\\ {}{c}_t=\frac{a}{2{\left(1-a\right)}^2}\left[{S}_t^{(1)}-2{S}_t^{(2)}+{S}_t^{(3)}\right]\end{array}\end{array}}$$

Autoregressive integrated moving average model (ARIMA), also called Box-Jenkins model, is a classical modelling approach for non-stationary time series. Generally, the non-stationary time series need to be converted into stationary time series, then we can build ARIMA model based on the regression of hysteresis values and the previous random error terms. According to the stability of the original sequence and the parts contained in the regression, ARIMA model is usually divided into moving average process (MA), autoregressive process (AR), autoregressive moving average process (ARMA) and ARIMA process. The model is written as ARIMA(p, d, q) where p describes the AR part, d describes the integrated part, and q describes the MA part. The ARIMA model can be expressed as follows:

$$\varnothing \left(\mathrm{B}\right){\Delta }^d{Y}_t=\uptheta \left(\mathrm{B}\right){\varepsilon}_t$$

Where Y_t represents the response sequence, ε_t represents the random error at time t, ∅(B) = 1 − ∅₁B − ∅₂B² − … − ∅_PB^P represents the autoregressive operator, θ(B) = 1 − θ₁B − θ₂B² − … − θ_PB^P represents the moving average operator, and ∅(B)∆^dY_t represents the correlation among the different periodic points in the same periods. When P = D = Q and they all equal to 0, the model is a simple ARIMA model.

The last 3 months (12 weeks) of the dataset were divided as test sets to evaluate the accuracy of different time series models. We use Akaike’s information criterion (AIC) to evaluate the fitting effects of each approach. The prediction effects of the models are usually evaluated by the difference between the predicted value and the actual value, that is, the error. By comparing the mean square error (MSE), rooted mean square error (RMSE) and mean absolute percentage error (MAPE) of each approach, we can evaluate the effects of different approaches for predicting the number of CHD in newborns.

Time series analyses were performed using R 3.6.3 and the results with P ≤ 0.05 would be considered as significant.

Results

General characteristics

A total of 1135 newborns, including 601 baby girls and 534 baby boys, were admitted for CHD from HIS in Jinhua during the 2-year study period. The prevalence of CHD among newborns in Jinhua in 2019 was 0.96%. Overall, there were 10 newborns with CHD per week in Jinhua. The median number of newborns with CHD was higher among baby girls than which among baby boys (6.0 vs. 5.0). Up to 31 newborns in Jinhua were diagnosed with CHD in 1 week. ASD was diagnosed the most frequently among all newborns with CHD, accounting for 81.9% of all subjects. 81.6% of CHD baby boys were diagnosed with ASD, compared with 82.0% of CHD baby girls. PDA was the second most common phenotype among newborns with CHD, accounting for 64.3% of all subjects, and the constituent ratio for baby boys and baby girls were 63.7 and 62.4%, respectively (Table 1).

Table 1 Weekly frequency of diagnoses for congenital heart disease in newborns in Jinhua, China, 2017–2019

Full size table

Trends of CHD

Although the duration of this study was not long enough, it could still be seen that the epidemiology trend of CHD was cyclical (Fig. 1). Overall, the number of CHD cases among newborns remained stable in 2019 and 2020. There were fewer cases in spring and summer, while cases peaked in November and December. The trend of CHD was the same in both male and female newborns as in the total subjects, with no obvious difference.

Fitting results

We firstly used the additive Holt-winters ES method to fit the time series data of CHD in Jinhua. The fitting result was shown in Fig. 2. The additive ES model performed well in the early stage of fitting, however, it could no longer fit the training-set data well in the later stage. The results of parameter estimation showed that this time series had no obvious seasonality and the ES model could not fit the long-term trend well. The horizontal smoothing factor, seasonal smoothing factor and trend smoothing factor were all less than 0.001 and had no significance (P < 0.05).

Then we used ARIMA model to fit the training-set data. According to the observation of the original sequence and the result of Kwiatkowski-Phillips-Schmidt-Shin test, we can find that the time series of cases with CHD in Jinhua is non-stationary (KPSS Level = 0.834, P = 0.01). Therefore, we did a first order differencing to make it smooth. The results of auto-correlation and partial correlation after first order differencing were presented in Fig. 3.

The results showed that after differencing the sequence is randomly fluctuating with 0-centered, which suggested that this sequence is stable. Combining the information from auto-correlation figure and partial auto-correlation figure, we finally tried to establish ARIMA(2,1,1) model. The residual of ARIMA(2,1,1) model was shown in Fig. 4. Dickey-Fuller test was used to examine the stationary.

We used least squares to build the ARIMA(2,1,1) model for the differencing sequence, the results showed that the parameters of the first order moving average model was − 0.588, and the parameters of the first order auto regression model and the second auto regression model were − 0.133 and 0.156. respectively. The AIC is 557.48, and this ARIMA(2,1,1) model can be given as follow:

$$\Delta \log (x)=\frac{\left(1+0.588B\right)}{\left(1+0.133B-0.016{B}^2\right)}{\varepsilon}_t$$

The comparison of different models

We respectively used the additive ES model and ARIMA(2,1,1) model to forecast the weekly number of CHD cases among newborns for 12 weeks in Jinhua. Each approach’s MSE, MAPE and RMSE were calculated to compare the predictive effect (Table 2).

Table 2 The comparison of ES method and ARIMA model for the weekly new cases of CHD among newborns in Jinhua

Full size table

The results indicated that MSE, MAPE and RMSE of ARIMA(2,1,1) model were smaller than the additive Holt-winters ES method (MSE is 84.83, MAPE is 226.07 and RMSE is 9.21, respectively). We finally determine the most suitable predictive model for the study of new cases with CHD among newborns in Jinhua was ARIMA(2,1,1) model.

Discussion

In this study, we described the temporal trend of newborns with CHD in Jinhua, Zhejiang Province from 2019 to 2020 and separately used the additive Holt-winters ES method and ARIMA model to fit and forecast the weekly number of cases with CHD among newborns in Jinhua. Totally 1135 newborns with CHD were included in this study and there was an average of 10 newborns with CHD per week in Jinhua. ASD was the most common type of CHD, accounting for 81.9% of all subjects. The weekly number of new CHD cases among newborns had a distinct peak and a slump every year and the seasonality was not obvious. The ARIMA(2,1,1) model relatively offered advantages over the additive Holt-winters ES method in predicting the number of newborns with CHD, while the accuracy of ARIMA(2,1,1) was not very ideal.

CHD is one of the most common congenital anomalies and imposes a severe emotional and economic burden on children and their families. Previous studies have shown that a range of malformations, including coronary artery disease, also described in genomic rearrangement syndromes, are difficult to diagnose in newborns [19]. The etiology of CHD remains uncertain. Maternal exposure during pregnancy is strongly associated with CHD in infants [20,21,22]. Some studies suggest that infection during pregnancy (e.g., German measles), exposure to toxic substances, and folic acid deficiency may be risk factors for CHD [10, 23, 24]. Since CHD usually occurs during embryogenesis, it is difficult to detect by examination during this period. Some CHD can be diagnosed prenatally by fetal echocardiography, while some CHD is usually diagnosed shortly after birth or sometimes even many years later [25].

Before this study, few studies have described and analyzed the prevalence trend of CHD in newborns. We found that the annual number of CHD patients in newborns was low at the beginning of the year, then gradually increased and peaked at the end of the year. However, no significant seasonal trends were observed. This phenomenon may be related to certain social factors in China, such as the lowest number of newborns with CHD during the Lunar New Year holiday, that is, the number of CHD among newborns is limited by the hospital’s diagnostic capacity. Our study suggests that increasing hospital capacity during the holidays may enable more newborns with CHD to be diagnosed and treated promptly on time. More data are needed to confirm and further uncover other patterns of CHD incidence among newborns.

ARIMA model is the most common method for non-stationary time series analysis, it can integrate the trend factors, long-term factors and random errors from the original sequences and extract the deterministic information by transforming the non-stationary time series into stationary time series. ARIMA model is widely used in various fields and it works well on prediction. Omar et al. extract words from article titles and propose a novel hybrid neural network model based on ARIMA model to forecast sales [26]. In Sweden, researchers use it to estimate the association between cannabis and alcohol use among teenagers [27]. Cortes et al. also try to estimate the temporal patterns of dengue incidence in two Brazilian cities through ARIMA model [28]. Besides, ARIMA model is also one of the popular machine learning techniques, it can even be used to predict the receiving waters of sewage treatment plants [29]. We tried to use ARIMA(2,1,1) models to fit the original data of new cases with CHD in Jinhua, Zhejiang Province. The error between the predicted value and the original value was relatively small which meant using this model for prediction was feasible to some extent.

ES method is also a very common method, it is intuitive, highly adaptable and easy to operate. We can forecast the future data by giving different weights to the observed values. ES method can fit the long-term trends, cyclic fluctuation and stochastic fluctuation of time series sequences. Since some observations were zero, we did not use the multiplicative ES method, but only used the additive Holt-winters ES method to fit the data. The results showed that in the predicting process for the new CHD cases in Jinhua, the prediction effect of the additive Holt-winters ES method was not as good as the effect of ARIMA model. Generally, the predicted values of the additive Holt-winters ES method were lower than the original values. It illustrated that additive Holt-winters ES method cannot fit the time series sequences perfectly if the original sequences had a sudden fluctuation, and as Fig. 1 shown, the number of CHD cases had an obvious decline at the end of the year, which might influence the effect of this model.

Our study had some limitations. Firstly, our data was relatively single, which made it impossible to fully discuss the risk factors of CHD. Secondly, our study only included data from 2019 to 2020, and more observations are needed to refine our model. Finally, our modes were relatively simple, neural network method could be used to predict the number of cases in the future. Our study also had some advantages. Firstly, we described the temporal trend in CHD among newborns, which has rarely been addressed before. Secondly, our research data were from all hospitals in Jinhua, which was reliable and covers the region comprehensively. The onset of CHD and diagnosis of CHD are related to many risk factors, therefore, when using temporal models to fit and predict the data, we must consider such factors’ influence and try to incorporate them into the models.

Conclusions

In general, although ARIMA(2,1,1) offered advantages over the additive Holt-winters ES method in the prediction of the weekly new cases with CHD among newborns in Jinhua, Zhejiang Province, the accuracy of time series models in predicting new cases with CHD was still inadequate. More detailed information on cases should be collected and an improved time series model is necessary to predict the number of new cases with CHD among newborns in the future.

Availability of data and materials

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

CHD:: Congenital heart disease
95% CI:: 95% confidence interval
ES:: Exponential smoothing
ARIMA:: Autoregressive integrated moving average
HIS:: Health information system
PDA:: Patent ductus arteriosus
ASD:: Atrial septal defect
VSD:: Ventricular septal defect
PFO:: Patent foramen ovale
MA:: Moving average
AR:: Autoregressive
ARMA:: Autoregressive moving average
AIC:: Akaike’s information criterion
MSE:: Mean square error
RMSE:: Rooted mean square error
MAPE:: Mean absolute percentage error

References

Wang D, Jin L, Zhang J, Meng W, Ren A, Jin L. Maternal periconceptional folic acid supplementation and risk for fetal congenital heart defects. J Pediatr. 2021;240:72–8.
Article CAS Google Scholar
Dolk H, Loane M, Garne E, European Surveillance of Congenital Anomalies Working G. Congenital heart defects in Europe: prevalence and perinatal mortality, 2000 to 2005. Circulation. 2011;123(8):841–9.
Article Google Scholar
Collaborators GBDCoD. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980-2017: a systematic analysis for the global burden of disease study 2017. Lancet. 2018;392(10159):1736–88.
Article Google Scholar
Dolbec K, Mick NW. Congenital heart disease. Emerg Med Clin North Am. 2011;29(4):811–27 vii.
Article Google Scholar
Disease GBD, Injury I, Prevalence C. Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990-2015: a systematic analysis for the global burden of disease study 2015. Lancet. 2016;388(10053):1545–602.
Article Google Scholar
Mortality GBD, Causes of Death C. Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980-2015: a systematic analysis for the global burden of disease study 2015. Lancet. 2016;388(10053):1459–544.
Article Google Scholar
Wu W, He J, Shao X. Incidence and mortality trend of congenital heart disease at the global, regional, and national level, 1990-2017. Medicine (Baltimore). 2020;99(23):e20593.
Article Google Scholar
Liu Y, Chen S, Zuhlke L, Black GC, Choy MK, Li N, et al. Global birth prevalence of congenital heart defects 1970-2017: updated systematic review and meta-analysis of 260 studies. Int J Epidemiol. 2019;48(2):455–63.
Article Google Scholar
Zhang W, Xu HY, Zhang YC, Liu KB. Delayed diagnosis of critical congenital heart defects predicting risk factors and survival rate in newborns in Beijing: a retrospective study. J Int Med Res. 2021;49(7):3000605211028028.
Article CAS Google Scholar
Virani SS, Alonso A, Benjamin EJ, Bittencourt MS, Callaway CW, Carson AP, et al. Heart disease and stroke statistics-2020 update: a report from the American Heart Association. Circulation. 2020;141(9):e139–596.
Article Google Scholar
Spaeder MC, Stroud JR, Song X. Time-series model to predict impact of H1N1 influenza on a children's hospital. Epidemiol Infect. 2012;140(5):798–802.
Article CAS Google Scholar
Alegana VA, Wright JA, Nahzat SM, Butt W, Sediqi AW, Habib N, et al. Modelling the incidence of plasmodium vivax and plasmodium falciparum malaria in Afghanistan 2006-2009. PLoS One. 2014;9(7):e102304.
Article CAS Google Scholar
Guijarro R, Trujillo-Santos J, Bernal-Lopez MR, de Miguel-Diez J, Villalobos A, Salazar C, et al. Trend and seasonality in hospitalizations for pulmonary embolism: a time-series analysis. J Thromb Haemost. 2015;13(1):23–30.
Article CAS Google Scholar
Wu XX, Ge RX, Huang L, Tian FY, Chen YX, Wu LL, et al. Pregestational diabetes mediates the association between maternal obesity and the risk of congenital heart defects. J Diabetes Investig. 2021;13(2):367–74.
Article Google Scholar
Botto LD, Lin AE, Riehle-Colarusso T, Malik S, Correa A, National Birth Defects Prevention S. Seeking causes: classifying and evaluating congenital heart defects in etiologic studies. Birth Defects Res A Clin Mol Teratol. 2007;79(10):714–27.
Article CAS Google Scholar
Anggrainingsih R, Aprianto GR, Sihwi SW, editors. Time series forecasting using exponential smoothing to predict the number of website visitor of Sebelas Maret University. 2015 2nd International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE). 2015. pp. 14–19.
Bermúdez JD, Segura JV, Vercher E. Holt-Winters forecasting: an alternative formulation applied to UK air passenger data. J Appl Stat. 2007;34(9):1075–90.
Article Google Scholar
Winters PR. Forecasting sales by exponentially weighted moving averages. Manag Sci. 1960;6(3):324–42.
Article Google Scholar
Falsaperla R, Giacchi V, Aguglia MG, Mailo J, Longo MG, Natacci F, et al. Monogenic syndromes with congenital heart diseases in newborns (diagnostic clues for neonatologists): a critical analysis with systematic literature review. J Pediatr Genet. 2021;10(3):173–93.
Article CAS Google Scholar
Wiener SL, Wolfe DS. Links between maternal cardiovascular disease and the health of offspring. Can J Cardiol. 2021;37(12):2035–44.
Article Google Scholar
Bolin EH, Gokun Y, Romitti PA, Tinker SC, Summers AD, Roberson PK, et al. Maternal smoking and congenital heart defects, national birth defects prevention study, 1997–2011. J Pediatr. 2022;240:79–86.
Article Google Scholar
Yu X, Miao H, Zeng Q, Wu H, Chen Y, Guo P, et al. Associations between ambient heat exposure early in pregnancy and risk of congenital heart defects: a large population-based study. Environ Sci Pollut Res Int. 2022;29(5):7627–38.
Article Google Scholar
Liu S, Joseph KS, Luo W, Leon JA, Lisonkova S, Van den Hof M, et al. Effect of folic acid food fortification in Canada on congenital heart disease subtypes. Circulation. 2016;134(9):647–55.
Article CAS Google Scholar
Zhang S, Wang L, Yang T, Chen L, Zhao L, Wang T, et al. Parental alcohol consumption and the risk of congenital heart diseases in offspring: an updated systematic review and meta-analysis. Eur J Prev Cardiol. 2020;27(4):410–21.
Article Google Scholar
Brida M, Gatzoulis MA. Adult congenital heart disease: past, present and future. Acta Paediatr. 2019;108(10):1757–64.
Article Google Scholar
Omar H, Hoang VH, Liu DR. A hybrid neural network model for sales forecasting based on ARIMA and search popularity of article titles. Comput Intell Neurosci. 2016;2016:9656453.
Article Google Scholar
Gripe I, Danielsson AK, Ramstedt M. Are changes in drinking related to changes in cannabis use among Swedish adolescents? A time series analysis for the period 1989-2016. Addiction. 2018;113(9):1643–50.
Article Google Scholar
Cortes F, Turchi Martelli CM, Arraes de Alencar Ximenes R, Montarroyos UR, Siqueira Junior JB, Goncalves Cruz O, et al. Time series analysis of dengue surveillance data in two Brazilian cities. Acta Trop. 2018;182:190–7.
Article Google Scholar
Ansari M, Othman F, Abunama T, El-Shafie A. Analysing the accuracy of machine learning techniques to develop an integrated influent time series model: case study of a sewage treatment plant, Malaysia. Environ Sci Pollut Res Int. 2018;25(12):12139–49.
Article Google Scholar

Download references

Acknowledgements

We thank the nurses, clinicians, and management staffs of hospitals in Jinhua for their participation in and support for this research.

Funding

This study was supported in part by the Zhejiang Province Key Research and Development Project (2020C03120), and in part by the Key Project of the National Research Program of China (2019YFC0840702).

Author information

Weize Xu, Zehua Shao and Hongliang Lou contributed equally to this work.

Authors and Affiliations

Department of Cardiac Surgery, The Children’s Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, No. 3333 Binsheng Road, Binjiang District, Hangzhou, 310000, Zhejiang, China
Weize Xu, Jianchuan Qi, Die Li & Qiang Shu
Zhengzhou University People’s Hospital, Henan Provincial People’s Hospital, Zhengzhou, 450003, China
Zehua Shao
Jinhua Maternal and Child Health Care Hospital, Jinhua, 321000, China
Hongliang Lou
Department of Nursing, The Children’s Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Hangzhou, 310000, China
Jihua Zhu

Authors

Weize Xu
View author publications
You can also search for this author in PubMed Google Scholar
Zehua Shao
View author publications
You can also search for this author in PubMed Google Scholar
Hongliang Lou
View author publications
You can also search for this author in PubMed Google Scholar
Jianchuan Qi
View author publications
You can also search for this author in PubMed Google Scholar
Jihua Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Die Li
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Shu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Weize Xu: conceptualization, data curation, formal analysis, methodology, writing – original draft, and writing – review & editing. Zehua Shao and Hongliang Lou: data curation, conceptualization, formal analysis. Jianchuan Qi and Jihua Zhu: data curation. Die Li: data curation, formal analysis. Qiang Shu: supervision, funding acquisition. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Qiang Shu.

Ethics declarations

Ethics approval and consent to participate

All study protocols were approved by Ethics Committee of Zhejiang University School of Medicine the Chilren’s Hospital. All data used in this study were anonymized before its use. This study was performed in accordance with the Declaration of Helsinki. The study has been granted an exemption from requiring informed consent by Ethics Committee of Zhejiang University School of Medicine the Chilren’s Hospital. All methods were carried out in accordance with relevant guidelines and regulations in the declaration.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Xu, W., Shao, Z., Lou, H. et al. Prediction of congenital heart disease for newborns: comparative analysis of Holt-Winters exponential smoothing and autoregressive integrated moving average models. BMC Med Res Methodol 22, 257 (2022). https://doi.org/10.1186/s12874-022-01719-1

Download citation

Received: 22 March 2022
Accepted: 29 August 2022
Published: 01 October 2022
DOI: https://doi.org/10.1186/s12874-022-01719-1

Prediction of congenital heart disease for newborns: comparative analysis of Holt-Winters exponential smoothing and autoregressive integrated moving average models

Abstract

Objective

Methods

Results

Conclusions

Similar content being viewed by others

Time series prediction of under-five mortality rates for Nigeria: comparative analysis of artificial neural networks, Holt-Winters exponential smoothing and autoregressive integrated moving average models

Disease burden and attributable risk factors of neonatal disorders and their specific causes in China from 1990 to 2019 and its prediction to 2024

Comparison of ARIMA, ETS, NNAR, TBATS and hybrid models to forecast the second wave of COVID-19 hospitalizations in Italy

Introduction

Methods

Study area and data on CHD

Statistical analysis

Results

General characteristics

Trends of CHD

Fitting results

The comparison of different models

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation