Influenza-like illness prediction using a long short-term memory deep learning model with multiple open data sources

The influenza problem has always been an important global issue. It not only affects people’s health problems but is also an essential topic of governments and health care facilities. Early prediction and response is the most effective control method for flu epidemics. It can effectively predict the influenza-like illness morbidity, and provide reliable information to the relevant facilities. For social facilities, it is possible to strengthen epidemic prevention and care for highly sick groups. It can also be used as a reminder for the public. This study collects information on the influenza-like illness emergency department visits to the Taiwan Centers for Disease Control, and the PM2.5 open-source data from the Taiwan Environmental Protection Administration's air quality monitoring network. By using deep learning techniques, the relevance of short-term estimates and the outbreak calculation method can be determined. The techniques are published by the WHO to determine whether the influenza-like illness situation is still in a stage of reasonable control. Finally, historical data and future forecasted data are integrated on the web page for visual presentation, to show the actual regional air quality situation and influenza-like illness data and to predict whether there is an outbreak of influenza in the region.


Introduction
Human physiological disorders reflect an altered condition that interferes with or modifies the vital functions of various organs or body parts [1].Kim et al. [2] examined whether the severity of posttraumatic stress disorder (PTSD) symptoms and perceived functional impairment in firefighters with current possible PTSD are correlated with the use of mental health services.Gautam et al. [3] also studied the treatment of medical conditions in humans, and found the rate of predictive accuracy obtained by ant colonyoptimization (ACO) and neural network hybridization to be more promising than other individual or hybrid approaches.Based on their predicted human health implications, Van der Fels-Klerx et al. [4] thoroughly reviewed criteria for rating risks related to food safety and dietary hazards.The method to be used should be selected based on the criteria of the risk manager/assessor, the quality of data, and the method's characteristics.
Influenza is a direct and far-reaching problem.An uncontrolled flu epidemic can have a significant impact on all of society, for example, in the global H1N1 influenza outbreak around the world in 2009.According to data released by the World Health Organization, there were more than 1.3 million confirmed cases of H1N1 in the world, with a death toll of more than 14,000, presenting a significant challenge to the world's quarantine mechanism [5].For Taiwan, many prevention and control measures have been proposed for incidents of this type, such as strict border control, mobilization, and isolation of medical facilities, public transportation bases, and campuses in order to maximize anti-epidemic measures in public places.It can be said that the influenza epidemic has a profound and long-lasting impact on people's lives, the safety of their property, human resources, material resources, and socioeconomic stability.If compelling trend predictions for flu-like epidemics can be achieved, relevant units can be provided.More reaction time allows the public to take prevention or treatment measures earlier, which can have a positive impact on the control of the flu-like epidemic.There are many medical studies confirming the relationship between air quality index (AQI) and influenza-like illness (ILI) throughout the world.Increased rates of culture-negative pneumonia and influenza were associated with increased concentrations of particulate matter < 2.5 µm (PM 2.5 ) during the previous week, which persisted despite reductions in PM 2.5 from air quality policies and economic changes.Though unexplained, this temporal variation may reflect altered toxicity of different PM 2.5 mixtures or increased pathogen virulence [6][7][8][9][10][11][12][13][14][15].
Influenza-like illness (ILI) is defined as a medical condition that may be caused by influenza or other diseases such as parainfluenza viruses or adenoviruses with clinical manifestation in common with the influenza virus.According to the WHO surveillance case definition, ILI is defined as an acute respiratory infection with fever more than 38 °C and with cough symptoms, as well as morbidity within the last 10 days.This study will collect information on the rate of influenza-like emergency visits from the Department of Health and Welfare and the PM 2.5 open-source data from the Environmental Protection Department's Air Quality Monitoring Network [16].Deep learning techniques are used to predict the status of future influenza-like treatment and the effects of air pollution on the respiratory tract in inducing influenza-like symptoms, as well as the natural delay characteristics between a patient's inhalation of excessively high levels of contaminated gases and the onset of flu-like symptoms [17].This system predicts the rate of ILI after a short period based on the current level of air pollution.
In this paper, the long short-term memory network (LSTM) of the recurrent neural network (RNN) will be implemented to solve the time-series prediction case.Also, it 1 3 Influenza-like illness prediction using a long short-term… can address the shortcomings of the disappearance of the RNN gradient [18,19], so that LSTM can effectively reduce the weight of the current time.Therefore, the module can more effectively preserve the influence of special events before the distant future point.In combination with the Outbreaks algorithm published by the WHO, this algorithm can calculate the criteria of influenza-like outbreaks at various time points during the past 5 years, and can be used as supplementary data for training and subsequent data visualization.It may also function as a benchmark to aid decisions when presented.
Our experimental setup presents a visualization of the model based on historical data combined with the short-term forecast data predicted by the module.It allows the user to intuitively understand real-time air pollution data in a user-friendly system.The user can easily understand the current rate of influenza visits and the predicted rate in the short term.Hopefully, this influenza prediction platform can serve as a reference for administrative decision-making and as a basis for general public health measures, with the ultimate goal of effectively controlling and preventing influenza-like epidemics.The significance of our contribution lies in the ability to predict ILI based on the integration of the rate of influenza-like emergency visits and PM 2.5 data using LSTM.

Background review and related works
In this section, we discuss several components that are included in the approaching in this paper: RNN, LSTM, and the mean absolute percent error (MAPE).The following sections discuss each component in more detail.

Recurrent neural network (RNN)
RNN is a type of feed-forward neural network.It is simply a network type that returns the value output from the neural network back to the input [20][21][22].In this way, the output value of the last time point is transmitted back to the neural network, so that the weight calculation of each time point of the network model is related to the content of the previous time point, which means that the neural network is included.The concept of time, through such a mechanism, makes the neural network memory.Figure 1 describes recurrent neural network architecture.After each round of operation, the output of the web is retained up to the next transfer operation.Therefore, the time t will be t + 1 time points.The output is taken into consid- eration, so it has the characteristics of before and after memory.
The following is the formula for RNN: where x t is the vector of the input layer, h t is the vector of the hidden layer, y t is the vector of the output layer, W, U and b are the matrix or vector of the weight (1) parameter, and h and y are the activation functions.From this formula, it can be clearly observed that the output previous time unit is added to the calculation of y t−1 .

Long short-term memory network
LSTM is a deformation module constructed on the basis of RNN, but the mechanism of adding gate allows the LSTM to effectively save events before long-distance time [23].The weights of this method solve the defect in which the RNN is prone to gradient disappearance.Therefore, LSTM is more suitable for processing important events with longer intervals or delay time units in the time series than the RNN [24,25] (Fig. 2).
The following is the formula for LSTM: (3) Influenza-like illness prediction using a long short-term… where i t is the input gate used to determine whether the current input value is added to the memory state, and f t is the forget gate to determine whether the "forget" is previously stored.The memory state o t is the output gate that will be used to deter- mine the amount of memory output to the hidden layer, and also represents the degree of memory impact on the calculation.

MAPE
The mean absolute percent error (MAPE) as a percentage of the error reflects accuracy.Since the MAPE is a percentage, it can be easier to understand than the other figures on precision measurement.For example, if the MAPE is 5, the forecast is by 5% on average.
Nevertheless, a very large value of MAPE may sometimes be seen even though the model seems to suit the data well.Examine the plot to see if there are approximately 0 data values.Since MAPE divides the absolute error by the actual data, values close to 0 will inflate the MAPE significantly [26].

Related works
Yang et al. [27] proposed an analysis of air quality data and influenza-like illness (ILI) to determine the associations accurately and effectively.In their work, a novel integrated platform was implemented by building a cluster environment based on Hadoop and Spark.The experimental results showed the visualization of air quality and influenza-like illness data collected from 2016 to 2018 in Taichung, Taiwan.The association between air quality and influenza-like illness was also presented and discussed.
Liu et al. [28] focused on the association between invasive aspergillosis and ambient fine particulate air pollution.Two data sets were leveraged for this study: the National Health Insurance Research Database (NHIRD) was used to define invasive aspergillosis, while the Taiwan air quality index (AQI) monitoring network was used to profile the PM 2.5 concentration in Taiwan.The findings of this study suggest a positive association between PM 2.5 concentration and incidence of aspergillosis.
Lee et al. [29] demonstrated analysis and automated air pollution forecasting using RNN.A distributed computing environment was deployed based on RHadoop, which analyzed air pollution and presented a visualization using HBase from historical data.The short-term prediction of PM 2.5 was presented, and the prediction accuracy based on the MAPE was measured.
Tang et al. [30] performed an integrated data analysis for the period from November 15, 2010, to November 14, 2016, to quantify the connection between AQI, (8) meteorological variables, and risk of respiratory infection in China's Shaanxi province.Their assessment showed a statistically significant positive correlation between the number of instances of ILI and AQI, and with a time lag of 0-3 days, the risk of pulmonary disease gradually improved with enhanced AQI.
Liu et al. [31] applied RNNs of LSTM to predict influenza patterns.They were the first to use various novel data sources to forecast influenza patterns, including virological surveillance, the geographic spread of influenza, trends in Google, the environment, and air pollution.They also discovered that there are several environmental and climatic variables that have an important correlation with the frequency of ILI.
Zhang et al. [32] used four distinct LSTM multi-step prediction algorithms to predict influenza outbreaks in multiple steps.The results showed that the greatest precision was attained by applying various single-output predictions in a six-layer LSTM framework.The MAPE for the US ILI rates from 2 to 13 steps forward were all less than 15%, with an average of 12.930%.
Huang and Kuo [33] integrated and applied a convolutional neural network (CNN) and LSTM to the PM 2.5 prediction method for controlling and predicting PM 2.5 concentration.Four measurement indexes, including mean absolute error (MAE), root mean square error (RMSE), Pearson correlation coefficient, and index of agreement (IA), were applied to the experiments to compare the overall performance of each algorithm.The experimental results confirmed that the predictive accuracy of the proposed CNN-LSTM model (APNet) was the highest in comparison with other machine learning models.The feasibility and practicability of the CNN-LSTM model in predicting the PM 2.5 concentration was also checked.
Tsai et al. [34] suggested an approach for predicting the concentration of PM 2.5 with LSTM using RNN.They leveraged Keras, a high-level neural network application programming interface (API) written in Python and able to run on top of Ten-sorFlow, to construct a neural network and run RNN via TensorFlow with LSTM.Training data used in the network was collected from Taiwan's Environmental Protection Administration (EPA) from 2012 to 2016 and converted into 20-dimensional data at 66 stations across Taiwan.They conducted experiments to determine the importance of the predicted PM 2.5 concentration for the next 4 h.The results showed that the proposed method could effectively forecast the PM 2.5 value.
Qin et al. [35] introduced a new method for predicting pollutant concentration based on vast amounts of environmental data and deep learning techniques.The approach incorporated big data using two forms of deep networks.The method is based on a design that uses a convolutional neural network as the base layer, extracting input data features automatically.For the output layer, a long-term memory network was used to determine the time dependence of pollutants.With performance optimization, as a time series, the model was able to predict future concentrations of PM 2.5 .Eventually, the outcomes of the forecasts were correlated with numerical model tests.The model's applicability and benefits were also analyzed.The experimental results indicated that it increased the accuracy of predictions compared with traditional models.

3
Influenza-like illness prediction using a long short-term… 3 System design and architecture

TensorFlow
TensorFlow is an open-source code library that uses data flow graphs to express numerical operations.TensorFlow can help users easily construct models of machine learning and deep learning, reducing users' learning threshold [36].TensorFlow also supports distributed computing and can run well on a variety of platforms.It can be supported from a single CPU to a multi-GPU system.In addition, it supports multiple programming languages.Both Python and C++ can be used to write Ten-sorFlow programs.

Keras
Keras is a neural network library written entirely in Python and fully compatible with TensorFlow.Using Keras to write TensorFlow programs, building a neural network architecture will be simple and fast, allowing users to skip complex operations [18,37].The details of the complex module construction reduce the learning threshold in the field of machine learning.Keras supports most of the model solutions, supports multi-input and multi-output training, and can be used under the CPU and GPU system base.Its wide compatibility enables Keras to work on different operating systems such as Windows, Linux, and MacOS, and it can be used normally.

System design
Influenza is an acute respiratory disease that can easily lead to pneumonia and death.Early prediction and response to influenza epidemics is the most effective method of control.This paper intends to determine the relevance of PM 2.5 to influenza pneumonia death cases by using the government's open database to further build predictive modules through machine learning.The research method is to use the flu pandemic death database of the Department of Health and Welfare [38] and link to the EPA Air Quality Monitoring Network for PM 2.5 [39] open data to visualize the data in determining the correlation between PM 2.5 and influenza pneumonia death.The machine deep learning method is used for the prediction module of PM 2.5 causing death from influenza pneumonia, applied to the epidemic prevention policy.Figure 3 depicts the system design.• Step 1: National Health Insurance Research Database and Air Quality Monitoring Network The diagnostic data for Aspergillus infection was analyzed by Taichung Veteran General Hospital [28] with the National Health Insurance Research Database (NHIRD).AQI and PM 2.5 data obtained from the Environmental Protection Agency Air Quality Monitoring Network in Taiwan, combined with the above long-term big data, was used to prove the correlation between air pollution and respiratory diseases.• Step 2: Use of visual data to build a visualization system and monitoring platform Information on the rate of weekly emergency visits due to influenza-like illness, pneumonia, and enterovirus was gathered from the Department of Health and Welfare.This information was then combined to establish an associated visualization system and monitoring platform for air pollution and disease.• Step 3: Establishing a predictive model The 95% (confidence interval) upper limit was calculated as the prevalence threshold using the rate of weekly visits for the past 5 years.The LSTM model was used to predict the rate of visits in the next 4 weeks, and the threshold was used as an early warning.

System architecture
The architecture of the present study is shown in Fig. 4. The data set is collected from Taiwan Ministry of Health and Welfare.The data consist of influenza emergency department visits and the EPD data rate.PM 2.5 open air quality monitoring network data is stored in a MySQL database, and preliminary data cleaning is completed.The LSTM model is then built and trained through the TensorFlow library and using Keras' high-level tools, and the results of future short-term predictions are passed back to the MySQL repository.Finally, the data is visually presented on the web page by linking MySQL and PHP.
This study uses a linear method and a k-nearestneighbor (KNN) method, two methods for obtaining missing values.The linear compensation method is effective at filling missing values of fragmented time.However, when longterm continuous time values are missing, the linear complement method cannot accurately fill in missing values.Therefore, this study adopts two complementary methods of linear and KNN in parallel.If the missing value is fragmented, the linear complement method to is used fill in the missing value.Otherwise, if Influenza-like illness prediction using a long short-term… the value is missing for more than 6 consecutive hours, the KNN complement method is used to fill in the value.
On the topic of data cutting, this study uses a ratio of 6:2:2 to cut the original data into three parts: the training set, the validation set, and the testing set.This arrangement is adopted in the hope that a part of the data can be completely excluded After training, one can use the testing set as brand-new data to objectively verify the model, and to determine whether the training effectiveness has reached the same level as the training data.

Provided services
This cloud system provides real-time air quality index information and population statistics of each city with various kinds of graphs on the dashboard.This data can be exported to Excel for further analysis.In addition, more detail of the AQI data formula is as follows: • O 3 , 8 h: average of the last 8 h • O 3 : real-time • PM 2.5 : 0.5 × average of the last 12 h + 0.5 × the last 4 h • PM 10 : 0.5 × average of the last 12 h + 0.5 × the last 4 h • CO: average of the last 8 h • SO 2 : real-time • NO 2 : real-time Table 1 describes the normal value index of ozone (O 3 ), fine aerosol (PM 2.5 ), aerosol (PM 10 ), carbon monoxide (CO), sulfur dioxide (SO 2 ), and nitrogen dioxide (NO 2 ).

System implementation
• The software deployment server that we use for experiments comprises five servers: one name node server, two data node servers, one ELK stack server, and one web service server.More detail for each server is presented in Table 2.
• Cluster deployment has become a well-known architecture design in the science of big data, because it can be more efficient in data processing.In this study, we created a Cloudera Manager cluster server.

Hadoop or Spark memory configuration recommendation
This part describes the way to configure YARN and MapReduce memory allocation settings depending on the hardware component specifications.
When we determine the appropriate YARN and MapReduce memory configurations for a cluster node, we have to start with the available hardware resources.The following are the values of each node: • Random access memory (amount of memory) • Cores (number of CPU cores) • Disk storage (number of disks) Influenza-like illness prediction using a long short-term…

General Formula Container
If Hbase is included, the following formula is used: #Container = min(2 × Cores, 1.8 × Disks, (Total available RAM − Reserved system memory − Reserved HBase memory)/Minimum container size) = min(2 × 8, 1.Reserved HBase memory is the RAM needed by system processes and HBase processes.The recommendation of reserve memory is described in Table 3.
Minimum container size recommendations is described in Table 4 5.
This Cloudera Manager is used to build related package services like Hadoop, HDFS, MapReduce, Spark, HBase, and Hive.The website can be used to monitor CPU usage, disk I/O, network I/O, and HDFS status.

Data preprocessing
This study collected information on the rate of influenza-like emergency visits from the Ministry of Health and Welfare and the PM 2.5 Air Quality Monitoring Network.The total number in the six districts was more than 3600, collecting weekly averages from 2007 to 2019.The current data set is still in continuous collection in real time.This work uses 6:2:2 to divide the original data into three parts: training set, test set, and verification set, respectively.After the training is done, the test and verification sets are used as new data to verify the module in order to determine whether the training effect is correct.This method is used to Influenza-like illness prediction using a long short-term… avoid over-fitting on the data set and underestimating the module performance evaluation data.

The outbreak calculation method
The outbreak calculation is based on the Outbreaks calculation method published by the World Health Organization [40].Historical data of the past 5 years can be used to obtain the threshold of the influenza-like visit data for the current year.This means that the current influenza-like epidemic can be obtained through this calculation method.The criteria for judging and the data will be added to the data set of the training model, and at the same time, when the web page is visualized, it is the most direct criterion for the user to judge the current epidemic.The Outbreaks formula is as follows: where X is the average and Z is the confidence level coefficient.Taking this study as an example, when the confidence level is 95%, Z is 1.96, s is the standard deviation, and n is the sample size.

The LSTM model
In order to effectively predict influenza-like trends, this study will construct the LSTM model and use the air pollution index PM 2.5 and influenza-like visit rates.
Outbreaks calculates values as supplementary data to train the model.There is no correct answer to the neural network model design, which must be determined according to the researcher's hardware environment, data characteristics, data set size, and user response time requirements.If the number of neural layers or the number of neurons are blindly increased, it may make the training effect worse, and also waste a lot of time and computing efficiency.This research uses trial and error to find the best model architecture.The content includes the number of neural layers, the number of neurons, the design of the input and output layers, the activate function, and so on.The judgement criterion is of the highest accuracy.The MAPE evaluation index is used to select the architecture parameter with the lowest error.Therefore, after many experiments and adjustments, it was decided to set the membrane structure to four input layer neurons, four output layer neurons, 256 neurons in two layers of LSTM layers, and 128 hidden layers in two layers.Sixty-four neurons, that is, four future time units are predicted in four consecutive time units.
The input layer is set to four neurons according to trial and error results, which represents four data as an input for a single training.The output layer neurons are set to four according to the needs of the topic, which represents the prediction of the rate of ILI consultations in the next 4 weeks.The activate function of the output layer uses linear as an approximation prediction of the true value.The structure of ( 11) the input data set is the historical ILI consultation rate and PM 2.5 value for 4 weeks, and the data dimension is (2208, 4, 2).The output data is structured to predict the ILI consultation rate for the next 4 weeks.The data dimension is (2208,4).In order to avoid the problem of overfitting, the early stopping mechanism and the design of the dropout layer are added to the model.Early stopping monitors the error value of the training during the training, if it is under a certain time.If the error value does not continue to be effectively reduced, the training will be terminated early, and this method can be used to avoid over-learning.In addition to the addition of the dropout layer design, the dropout layer randomly stops the neurons of some hidden layers temporarily during training.Since the neurons that are paused during each training are randomly selected, it can be imagined.Each training is on a new model, and since the relevant neurons cannot appear at the same time each time, it is also possible to avoid the fact that there are joint features that are only retained when they appear with each other.

Model performance evaluation
After completion of the neural network module training, the model evaluation is carried out to understand the effectiveness of the model learning, which serves as the basis for the experiment and is also an important evaluation criterion for the model performance.In this study, the mean absolute percentage error (MAPE) was chosen as the criterion for the prediction performance of this module.The formula for the MAPE value is as follows:

Web visualization
This study presents data analysis results through web pages.In the operation and display of web pages, the system is constructed based on the current mainstream HTML5 + JavaScript + CSS3.Users can connect web pages with multiple devices.In addition, the concept of responsive web design (RWD) and a bootstrap framework are added to the interface design.In this case, the implementation of a bootstrap grid system interface is to make the system interface work at different resolutions.With this environment running on the device, including on smart mobile devices and tablets, a good user experience can be achieved.
1 3 Influenza-like illness prediction using a long short-term…

Experimental results
In this section, we describe our experimental results obtained from the designed methods.

Data collection
The data set is obtained from the weekly average influenza-like emergency visit rate information of the six major subdistricts of Taiwan.The PM 2.5 open data is collected from the Environmental Protection Department's air quality monitoring network with 77 air pollution stations.The data is organized into six district average weekly data sets that are consistent with the needs of the present study.The MySQL database stores these data set with a specific data format, as shown in Figs. 5 and 6.This study also organizes historical air pollution data into air quality index (AQI), which is set by the Taiwan Environmental Protection Agency.The data set consists of ozone ( O 3 ), fine aerosol (PM 2.5 ), suspended particulates (PM 10 ), carbon monox- ide (CO), sulfur dioxide ( SO 2 ), and nitrogen dioxide ( NO 2 ) in the air.The degree of health impact is converted into the sub-indicator values of different pollutants, and the maximum value is taken as the AQI value for the day.This study uses a full six-zone AQI value and ILI binding to influenza treatment.A line graph is displayed on a web page system, as shown in Fig. 7, for a user to clearly observe the realtime AQI value and the relationship of AQI and ILI value.From the graph, one can observe the association of AQI and ILI weekly status in six areas: Taipei, North, Central, South, Kaoping, and East.The maximum value index indicator of AQI also can be shown in the balloon symbol.

Training result
In this section, the training module process is presented to visualize the loss value data.It will help researchers to understand whether the module training process has smooth convergence, and whether there is an excessive situation or not.

The evaluation results
This study selected MAPE as the standard for model evaluation, and MAPE values of the six major divisions of the whole station according to the planning of this  7.According to the contents of the table, the evaluation results of the LSTM model have errors that are lower than those of the RNN model from the first week to the sixth week.This experiment confirms that LSTM has more advantages than RNN for the issue of ILI consultation rate prediction.

Outbreak calculation method
The Outbreaks value and influenza-like treatment rate value are visualized on the web system to determine whether the influenza-like disease control status is still in a safe state.Figures 12 and 13 present the ILI visit rate and Outbreaks value for 2017 and 2018, respectively.Influenza-like illness prediction using a long short-term… Fig. 11 The plotting of validation

Web visualization
Figure 14 presents the Outbreaks value in 2019, in which the middle region demonstrates the actual influenza-like visit rate; the next 4 weeks of influenzalike visits are presented in predicted values.Figure 15 shows the actual influenza-like visit rate in the six districts of the country and the predicted value of the next 4 weeks of influenza-like visits.This visualization allows users to know the future trend of influenza-like epidemics and whether it will occur or not.It can forecast the possibility of an ILI outbreak.Figure 16 demonstrates the real-time monitoring system per area.
Figure 17 shows the real-time ILI time-series data.The web visualization also has an additional feature to give the early warning.If the predicted visit rate is more than the threshold, then it is an early warning of influenza outbreak.Figure 18 describes the graph of an early warning system.Influenza-like illness prediction using a long short-term…

Conclusion and future work
This study collected data on air pollution and rates of doctor visitation due to ILI.After the air pollution and ILI treatment rate data were integrated with quality control and time units applied, the data from 2007 to the present were divided into weekly average values for six regions of and the Outbreaks algorithm was used to calculate the influenza-like epidemic.A threshold value was used as the auxiliary data for training and the decision basis for the user to read the data.It also solved the problem with the large differences between the influenza-like out-of-control standard in the outbreak period and the non-perfect period, which can easily cause misunderstanding.The flu-like epidemic provides a more comprehensive and accurate understanding.We generated a simple linear regression model to estimate the correlation coefficient between AQI and ILI in previous papers.According to the model, we observed a significant correlation between PM 2.5 and risk of ILI (p value = 0.04) after adjusting for confounders.Influenza-like illness prediction using a long short-term… This paper also implemented deep learning techniques to find correlations between data.The TensorFlow library and Keras tools were applied to construct neural network models for experiments.Long-term and short-term memory models (LSTM) were used to deal with time-series problems occurring in time-series data.The analysis of the influenza-like epidemic demonstrated very good performance based on the experiments.The analysis obtained a valid prediction of the rate of influenza-like visits in the next 4 weeks.The MAPE values for the module evaluation were all below 20, which is a valid result.The web page system platform was built as a visual presentation interface.The integration of the AQI and the ILI treatment data demonstrated an interactive relationship between the two trends.The historical data of influenza-like visits and the Outbreaks threshold were graphically presented to observe the past epidemic control status of ILI, the yearly Outbreaks threshold, and real-time ILI visit rate data plus forecasts for the next 4 weeks.It showed the estimation of future flu-like epidemics and whether there would be an outbreak epidemic.This information will provide government agencies, medical-related units, and general users a clear and concise data reference for epidemic trends.The concept of responsive web design (RDD) was added to the web system design, and therefore the system interface can be connected with devices of different resolution, including smart mobile devices and tablets, and all users can achieve a good user experience.
In the future, it is possible to predict influenza-like epidemic trends with shorter historical data without sacrificing too much accuracy.In other words, it is possible to more rapidly predict outbreaks, and so the system can aid in achieving a faster response.This system can be combined with meteorological data, in which the daily maximum temperature difference and humidity data are highly correlated with influenza induction.In addition, past outbreaks of influenza have often been accompanied by variants of the influenza virus.The causes of recent pandemics have been Fig. 18 Early warning of influenza outbreaks related to pigs, influenza, and avian flu mutations.Combining this data with data regarding poultry epidemics, such as rates of culling rate or diagnosis, as well as poultry transport flow data, might be helpful for early prediction of flu occurrence and the risk of spreading.Moreover, comparative experiments of similar algorithms can also be performed in the future.

Fig. 5 Fig. 6
Fig. 5 Schematic diagram of PM 2.5 data Figure 8 shows the change of the loss value of the LSTM modules in the six districts.The six-zone model shows good performance during the training process.The training and verification loss values are consistent.There is no indication of over-fitting, and a stable error value is reached at the specified number of training times.In addition, the training set, the test set, and the verification set are respectively implemented in the module to verify the prediction effect of the module.The central area data is taken as an example, and the actual values of the three data sets are respectively taken.The visualized graphs display the predicted values, as shown in Figs. 9, 10, and 11.The training effect of the observation module and the data set with the training can have the same prediction performance.

Fig. 8
Fig. 8 The change of the loss value of the six-zone model

Fig. 16 Fig. 17
Fig. 16 Real-time monitoring system per area Fig. 17Real-time ILI time-series data

Table 1
The concentration of pollutants and air quality index value

Table 6
MAPE value comparison

Table 7
Comparison of central area model evaluation