1 Introduction

In China, more than 200 water inrushes from Ordovician limestone have occurred in North China coalfields over the past 40 years, resulting in economic losses of more than 30 billion yuan and 1300 deaths [22, 25].Water inrush problems in deep mining become increasingly serious as the mining depth increases. Mine water inflow is one of basic technical conditions for coal mine development, which is the total amount of surface water, fissure water, karst water, old kiln water, and other water flowing into mine pit. Accurately forecast the water inflow of coal mines can help to prevent coal mine accidents and ensure the safety of coal mine production and efficient use of mine water [34, 40]. Therefore, scholars have gradually developed the prediction of mine water inflow from deterministic methods (such as hydrogeological analytical method, water balance method, numerical methods, etc.) to non-deterministic stochastic prediction (such as analog method, fuzzy mathematical model, gray system theory, etc.) [1, 24, 33, 41]. However, mine water inflow is related to many factors, such as strata pressure, mining disturbance, geologic structures, and water pressure, and these factors have obvious nonlinear characteristics [11, 16]. Moreover, with the increasing depth of coal mining, the hydrogeological conditions of mine water filling are becoming more and more complex, and the control factors of mine water inflow are increasing. Consequently, these prediction methods are usually time-consuming and less reliable.

Artificial neural network (ANN) model is able to learn the underlying relationship between input and output signals of a sequential process with no need to take explicit physical rules into onsideration [18]. So many scholars use ANN to predict mine water onflow [3], and these models have higher prediction accuracy. Generalized Regression Neural Network (GRNN) is a radial basis function network based on mathematical statistics [5]. It has the advantages of simple model structure, few parameters to adjust, strong approximation ability, fast learning speed, and high prediction accuracy, which has been widely used in many fields [5, 36]. Wang et al. [31] selected three main influencing factors of mine water inflow as the input neurons in GRNN to predict the mine water inflow. Using GRNN to predict the mine water inflow, the numbers of input layer neurons and their values are the key to determining the accuracy of the model, which are determined by the choice of the influencing factors of mine water inflow and their values. However, the data of many influencing factors of mine water inflow usually are difficult to collect and quantify, moreover, it is difficult to determine the key factors among all the influencing factors of mine water inflow.

Fortunately, chaos theory considers that there are a lot of information about related factors in the time series of mine water inflow, and the number and value of related factors can be obtained through the reconstructed phase space [19, 29]. Chaos theory is an effective tool for studying complex systems [13], which can restore the rich dynamic information hidden in a single variable of the system [27].

So, this study combined chaos theory with GRNN, using reconstructed phase space of mine water inflow time series to determine the number of GRNN input layer neurons and their values (reducing the subjectivity and limitations of the original GRNN), establishing Chaos-Generalized Regression Neural Network (Chaos-GRNN) model to achieve a more effective prediction model of mine water inflow. Additionally, this model is used to predict the mine water inflow for a long time (24 months) to verify practicality of the model.

The paper is organized as follows. Section 2 briefly introduces the basic theory behind reconstructed phase space, and GRNN, outlines the proposed Chaos-GRNN model approach. Section 3 describes the application of Chaos-GRNN model to predict mine water inflow. Section 4 discusses the rationality, the range of parameters of the Chaos-GRNN model, and the comparison with the prediction results of the Chaos-BPNN model. Section 5 lists the main conclusions.

2 Methodology

2.1 Reconstructed phase space

Takens’ time-delay embedding theorem (1981) paved the way for the analysis of chaotic time series. In the reconstructed m-dimensional space, the regular trajectory of the motive system can be restored [8, 10]. The key to reconstruct phase space is to determine the time delay τ and the embedding dimension m. In the paper, the embedding dimension m was determined by the Cao method [2], the time delay τ was determined by the autocorrelation function method [7, 38].Further details about chaos theory can be found in Karunasingha et al. [13] and Huang et al. [12].

  1. (1)

    Time delay τ

Let \(x_{i}\) denotes the observed mine water inflow, with \(\overline{x}\) denoting their means, and n denoting the number of data points considered, τ denoting the number of time moving. Autocorrelation functiont C(τ) can be expressed as

$$C(\tau ) = \frac{{\frac{1}{n - \tau }\sum\limits_{i = 1}^{n - \tau } {(x_{i} - \overline{x} )(x_{i + \tau } - \overline{x} )} }}{{\frac{1}{n}\sum\limits_{i = 1}^{n} {(x_{i} - \overline{x} )^{2} } }}$$
(1)

The curve draws with τ as the independent variable and C(τ) as the dependent variable. When C(τ) drops to (1–1/e) of the initial value, the corresponding τ is the time delay of reconstructing space.

  1. (2)

    Embedding dimension m

The distance change \(\alpha (i,m)\) of the nearest point in phase space under different embedding dimensions is calculated by formula (2) firstly; then the mean E(m) of \(\alpha (i,m)\) is calculated by formula (3), E1(m) is calculated by formula (4). Finally, to draw the curve between E1(m) and m. When the change of E1(m) gradually becomes stable, the m at the stable position is what is required.

$$\alpha (i,m) = \frac{{\left\| {Y_{\eta (n)} - Y_{n} } \right\|_{\infty }^{(m + 1)} }}{{\left\| {Y_{\eta (n)} - Y_{n} } \right\|_{\infty }^{m} }}$$
(2)

where \(\left\| {} \right\|_{\infty }^{(m + 1)}\) is the m + 1 dimensional space-norm; Yn is the closest vector.

$$E(m) = \frac{1}{n - m\tau }\sum\limits_{i = 1}^{n - m\tau } {\alpha (m,i)}$$
(3)
$$E_{1} (m) = \frac{E(m)}{{E(m + 1)}}$$
(4)
  1. (3)

    Reconstructed phase space

After τ and m are determined, phase space reconstruction is performed by the mine water inflow time series X = {x1, x2,…,xn}. Getting a m-dimensional vector sequence:

$$Y_{i} = \left\{ {x_{i} ,x_{i + \tau } , \ldots ,x_{{i + \left( {m - 1} \right)\tau }} } \right\}$$
(5)

where i = 1,2, …, M, M = n-(m-1) τ.

2.2 The maximum Lyapunov index λ max

The maximum Lyapunov exponent λmax not only is the identification parameter of the chaotic feature (if λmax > 0, then the system has chaotic characteristics), but also can determine the step size R (R = 1/λmax) of prediction model [30, 32].

After reconstructing the phase space, to calculate the distance (Li) between two adjacent points Y(j) and Y(i)in the m-dimensional space by the formula (6):

$$L_{i} = \min \left[ {\left\| {Y(j) - Y(i)} \right\|} \right],i,j = 1,2, \ldots ,M$$
(6)

Then, λmax can be obtained by the formula (7):

$$\lambda_{\max } = \frac{1}{M - 1}\frac{1}{\tau }\sum\limits_{i = 1}^{M - 1} {\ln \frac{{L_{i + 1} }}{{L_{i} }}}$$
(7)

The step size R of prediction model can be obtained by the formula (8):

$$R = 1/\lambda_{\max }$$
(8)

2.3 Generalized regression neural networks (GRNN)

Generalized regression neural networks (GRNN) were first proposed by Specht [26], which was a radial basis function network [5].

The function f (x, y) is defined as the joint probability density function (PDF) of two random variables x and y. When f (x, y) is known, the condition mean of y on a given x0 can be calculated by

$$\hat{y}(x_{0} ) = E(\left. y \right|x_{0} ) = \frac{{\int_{ - \infty }^{ + \infty } {yf(x_{0} ,y){\text{d}}y} }}{{\int_{ - \infty }^{ + \infty } {f(x_{0} ,y){\text{d}}y} }}$$
(9)

In fact, f (x, y) is usually unknown in many systems. GRNN uses a group of measured x and y values to estimate f (x, y) based on the consistent estimators proposed by Parzen [26, 36]. For a given measured sample data point {xi, yi|I = 1, 2,…,N}, these timated PDF can be expressed as

$$f(x_{0} ,y) = \frac{1}{{n(2\pi )^{{\frac{m + 1}{2}}} \sigma^{m + 1} }}\sum\limits_{i = 1}^{n} {e^{{ - d(x_{0} ,x_{i} )}} e^{{ - d(y_{0} ,y_{i} )}} }$$
(10)
$$d(x_{0} ,x_{i} ) = (x_{0} - x_{i} )^{T} (x_{0} - x_{i} )/2\sigma^{2}$$
(11)
$$d(y_{0} ,y_{i} ) = (x_{0} - x_{i} )^{2} /2\sigma^{2}$$
(12)

where n is the number of measured samples, m is the dimension of vector random variable x, and σ is the smooth parameter of the Gauss function (also called the spread factor).

By substituting the estimated PDF in formula (10, 11 and 12) into the condition mean in formula (9) and interchanging the order of integration and summation, the desired condition mean of y for a given x0 can be yielded:

$$\hat{y}(x_{0} ) = \frac{{\sum\limits_{i = 1}^{n} {y_{i} e^{{ - d(x_{0} ,x_{i} )}} } }}{{\sum\limits_{i = 1}^{n} {e^{{ - d(x_{0} ,x_{i} )}} } }}$$
(13)

GRNN is structurally composed of an input layer, a pattern layer, a summation layer, and an output layer (Fig. 1).

Fig. 1
figure 1

Structural diagram of GRNN

The training set consists of values of inputs x, each with a corresponding value of an output y. This regression method produces the estimated value of y, which minimizes the squared error [14].The network has the advantages of simple structure, few parameters, and fast prediction speed, however, there is also a disadvantage that the selection of input layer neurons is subjective.

The MATLAB software (R2015b) is utilized to implement analysis of chaotic characteristics and GRNN for mine water inflow estimation.

2.4 Chaos-GRNN model

Giving full play to the respective advantages of chaos theory and GRNN, and combining them organically to establish a Chaos-GRNN prediction model for mine water inflow (Fig. 2).

Fig. 2
figure 2

Structural diagram of Chaos-GRNN model

The steps to establish the Chaos-GRNN model are as follows:

First, by chaos theory to construct the sample set of GRNN. To calculate the time delay τ and the embedding dimension m and the maximum Lyapunov exponent λmax of the time series{x1, x2,…,xn}. By formula (5), the time series{x1, x2,…,xn}will transform into the sample set: Y = (Y1,Y2,…,YM)T, M = n − (m − 1) τ. And the sample set will be divided into the training samples and test samples.

Second, to determine input layer neuron sand pattern layer neurons of GRNN. m is the number of factors affecting the mine water inflow, which is equal the number of both input layer neurons and pattern layer neurons of the Chaos-GRNN model; the value of the input layer neuron is determined by formula (5): Yi = (xi, xi+τ,…, xi+(m − 1) τ).

Third, to optimize the smooth parameter of GRNN. Based on the neural network toolbox in MATLAB, the cross-validation method is used to optimize the optimal Spread parameter value of newgrnn() function to obtain the smoothing factor σ.

Fourth, to set up model and forecast water inflow. Combine the trained input layer, mode layer data and optimization parameters to construct an optimized Chaos-GRNN model and import the data to predict. The step size of Chaos-GRNN model is determined by formula (8).

Last, to evaluate the model. The model evaluation is based on four performance metrics described below. Let \(x_{o} (i)\) and \(x_{e} (i)\) denote, respectively, the estimated and observed mine water inflow, with \(\overline{{x_{o} }}\), \(\overline{{x_{e} }}\) denoting their means, and n denoting the number of data points considered.

  1. (1)

    The coefficient of correlation (R) can be expressed as

$$R = \frac{{\frac{1}{n}\sum\limits_{i = 1}^{n} {(x_{o} (i) - \overline{{x_{o} }} )(x_{e} (i) - \overline{{x_{e} }} )} }}{{\sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {(x_{o} (i) - \overline{{x_{o} }} )^{2} } } \cdot \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {(x_{e} (i) - \overline{{x_{e} }} )^{2} } } }}$$
(14)

where the closer the coefficient R is to1,the more perfect is the positive linear relationship.

  1. (2)

    The root mean squared error (RMSE) can be expressed as:

    $${\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{{i = 1}}^{n} {(x_{e} (i) - x_{o} (i))^{2} } }$$
    (15)
  1. (3)

    The mean absolute percentage error (MAPE) can be expressed as:

    $${\text{MAPE}} = \frac{1}{n}\sum\limits_{k = 1}^{n} {\left| {\left. {\frac{{x_{o} (i) - x_{e} (i)}}{{x_{o} (i)}}} \right|} \right.} \times 100$$
    (16)

The closer the values of RMSE or MAPE are to 0, the more reliable is the model performance.

  1. (4)

    Nash–Sutcliffe efficiency (NSE) can be expressed as:

$${\text{NSE}} = 1 - \frac{{\sum\limits_{i = 1}^{n} {(x_{o} (i) - x_{e} (i))^{2} } }}{{\sum\limits_{i = 1}^{n} {(x_{o} (i) - \overline{{x_{o} }} )^{2} } }}$$
(17)

The value of NSE is − ∞ to 1. NSE is close to 1, indicating high reliability of the model; NSE is close to 0, indicating that the model is generally reliable, but the process simulation error is large; NSE is far less than 0, indicating that the model is not credible.

There are many performance metrics to evaluate a model. In this study, the above four indices were chosen, because these four indices are most basic and used most widely [6].

3 Application and results

3.1 Sources of mine water inflow of the study sites

Pingdingshan No.12 Coal Mine is located in Pingdiangshan city of Henan province, China (Fig. 3).The study area, approximately 12.87 km2, is part of Pingdingshan Mining Area. So far, there have been 9 water inrush accidents in Pingdingshan No.12 Coal Mine.

Fig. 3
figure 3

Location of Pingdingshan No. 12 Coal Mine, China

At present, the normal mine water inflow is about 129 m3/h. The direct sources of mine water inflow are the water in the goaf of the early mining, the sandstone water in the roof of the coal seam, and the limestone water in the coal floor. The indirect sources of mine water inflow are the atmospheric precipitation and surface water.

3.2 Characteristics of water inflow

Monthly mine water inflow data are obtained from Pingdingshan No.12 Coal Mine between February 1976 and December 2015. Thus, the time series data set contains 479 points (Fig. 4).

Fig. 4
figure 4

Monthly mine water inflow curve of Pingdingshan No.12 Coal Mine

The data of mine water inflow from February 1976 to December 2013 (No.1–No.455) are used to reconstruct phase space of the mine water inflow time series. The data from July 2012 to December 2013 (No.438–No.455) are used to set up Chaos-GRNN model; the data from January 2014 to December 2015(No.456–No.479) are used to verify model.

3.3 Phase space reconstruction

The time delay of the water inflow time series, τ = 13 months, is determined by formula (1) (Fig. 5a); the embedding dimension m = 7, is obtained by formulas (2, 3 and 4) (Fig. 5b).Using formulas (6) and (7), the maximum Lyapunov exponent λmax is equal to 0.053.

Fig. 5
figure 5

a Relationship between time delay τ and autocorrelation coefficient C(τ); b The relationship between m and E1(m) of mine discharge series

The embedding dimension m = 7 indicates that there are 7 factors affecting the mine water inflow, so as to determine the number of input layer neurons in the Chaos-GRNN model is 7. λmax = 0.053 > 0, indicates that the mine water inflow time series has chaotic characteristics. R = 1/λmax = 18.86 ≈ 19, indicates that prediction step size of the Chaos-GRNN model is 19 steps.

The phase space of mine water inflow time series in Pingdingshan No. 12 Coal Mine is reconstructed to obtain a 7-dimensional vector space Y with a time delay of 13 months:

(18)

3.4 Water inflow forecast

Due to the test samples using the water inflow data from January 2014 to December 2015, so need to assume that the values of x456, , x479 are unknown. That is, in the formula (18), the last column number of Y378 ~ Y401 is unknown, so the training samples must be selected before Y378. Additionally, the step size of the Chaos-GRNN model is 19steps. So the detailed procedure to predict mine water inflow is shown as follows:

  • Step one: To forecast x456.

Selecting (Y360, Y361,…,Y377)T as the training sample in GRNN. Output layer output is \(\hat{Y}_{378}\), that is, the predicted value of Y378 is:

$$\hat{Y}_{378} = \left( {\hat{x}_{378} ,\hat{x}_{391} ,\hat{x}_{404} ,\hat{x}_{417} ,\hat{x}_{430} ,\hat{x}_{443} ,\hat{x}_{456} } \right)$$
(19)

In \(\hat{Y}_{378}\), \(\hat{x}_{456}\) is predictive value of x456.

  • Step two: To forecast x457.

Let \(Y_{378}^{\prime } = \left( {x_{378} ,x_{391} ,x_{404} ,x_{417} ,x_{430} ,x_{443} ,\hat{x}_{456} } \right)\) Selecting (Y361, Y362,…,Y377, \(Y_{378}^{\prime }\))T as the training sample in GRNN. Output layer output is \(\hat{Y}_{379}\), that is, the predicted value of Y379 is:

$$\hat{Y}_{379} = \left( {\hat{x}_{379} ,\hat{x}_{392} ,\hat{x}_{405} ,\hat{x}_{418} ,\hat{x}_{431} ,\hat{x}_{444} ,\hat{x}_{457} } \right)$$
(20)

In \(\hat{Y}_{379}\), \(\hat{x}_{457}\) is predictive value of x457.

  • Step three: To forecast x458.

Let \(Y^{\prime}_{379} = \left( {x_{379} ,x_{392} ,x_{405} ,x_{418} ,x_{431} ,x_{444} ,\hat{x}_{457} } \right)\) Selecting (Y362, Y363,…,Y377, \(Y_{378}^{\prime }\), \(Y_{379}^{\prime }\))T as the training sample in GRNN. Output layer output is \(\hat{Y}_{380}\), that is, the predicted value of Y380 is:

$$\hat{Y}_{380} = \left( {\hat{x}_{380} ,\hat{x}_{393} ,\hat{x}_{406} ,\hat{x}_{419} ,\hat{x}_{432} ,\hat{x}_{445} ,\hat{x}_{458} } \right)$$
(21)

In \(\hat{Y}_{380}\), \(\hat{x}_{458}\) is predictive value of x458.

  • Step four: Repeat the above steps to obtain the predicted values of x459,…,x479, namely, the verified values of mine water inflow from January 2014 to December 2015 in Pingdingshan No. 12 Mine.

In the prediction process, the initial value of the smoothing factor σ is set to 0.1, the step size is 0.1, and the final value is 2. Under MATLAB, the optimal smoothing factors for the 24 groups of models selected by the cross-validation method are from 0.1to 0.9 (Table 1 and Fig. 6).

Table 1 Chaos-GRNN model prediction results
Fig. 6
figure 6

Comparison between measured value and predicted value of mine water inflow

4 Discuss

4.1 Chaos Parameters of mine water inflow time series

Parameters like time delay τ, embedding dimension m and Maximum Lyapunov exponent λmax are very important for describing the chaotic characteristics and predicting of mine water inflow. In this paper, we have calculated the chaotic parameters of mine water inflow in some coal mines China (Table2).

Table 2 The chaotic parameters of mine water inflow in some coal mines China

In Table 2, we can see that:

  1. (1)

    All the maximum Lyapunov exponent λmax in different coal mines are greater 0. By chaos theory, λmax > 0 means that the mine water inflow time series has chaotic characteristics. Therefore, this result lays the foundation to use the Chaos-GRNN model for mine water inflow prediction.

  2. (2)

    τ, m, λmax of the mine water inflow time series are different, which reflects the diversity and complexity of factors related to mine water inflow. According to chaos theory, the embedding dimension m of mine water inflow time series is equal to the number of factors affecting the mine water inflow [15, 19, 29]. In Table 2, the value of m is between 4 and 8, which indicating that the influencing factors of mine water inflow in different coal mines are roughly 4–8. This result reflects the difficulty of accurately predicting the amount of mine water inflow. On the other hand, it also shows the advantage of chaos theory in mine water inflow prediction (influencing factors of mine water inflow and their values are quantified by Yi).

  3. (3)

    However, the value of Yi is still the amount of mine water inflow, not the value of each influencing factor of mine water inflow, so this quantification is not a complete quantification. This also led to the slow application of chaos theory.

ANN simulates the way the human brain nerves process information. It has powerful self-adaptation and self-learning functions for samples, and just has the ability to process this information. Therefore, the chaos theory and ANN are coupled to establish the Chaos-ANN model (Fig. 7), which provides a powerful tool for the effective prediction of mine water inflow.

Fig. 7
figure 7

Schematic representation of the proposed Chaos-ANN model (modified by Fig. 1, Mohammad [17])

4.2 Comparison of Chaos-BPNN and Chaos-GRNN prediction models

Artificial neural network (ANN), which owns powerful adaptive and self-learning functions for samples, can effectively obtain output from input neurons. Therefore, the chaos theory and ANN can be organically combined (the number and value of input neurons and predicted duration of ANN model can be determined by phase space reconstruction) to predict mine inflow.

Among the artificial neural networks, BP neural network (BPNN) is the most widely used now. So in order to compare with the prediction effect of the Chaos-GRNN model, we also establish a Chaos-BPNN model for predicting the water inflow in Pingdingshan No. 12 Coal Mine.

The Chaos-BPNN model consists of three parts: the input layer, the hidden layer, and the output layer, and the layers are connected by weights. Since the embedding dimension m = 7, the Chaos-BPNN model sets 7 input elements, and 1 output element. The number of hidden layer neurons is determined by empirical Equation [42] and trial and error method [23, 37]. The Chaos-BPNN model sets 7 hidden layer elements.

After normalizing the data, we conduct network training by MATLAB. The prediction results are shown in Fig. 8.

Fig. 8
figure 8

Comparison of prediction results of Chaos-GRNN model and Chaos-BPNN model

The models evaluation are based on R, RMSE, MAPE, NSE, and violin plot (Fig. 9). Using formulas (14, 15, 16 and 17), the values of these four indices are calculated (Table 3).

Fig. 9
figure 9

Violin plots of two predictive models

Table 3 Prediction results of Chaos-BPNN model

It can be seen from Table3: (1) The value of R of Chaos-GRNN model and Chaos-BPNN model is 0.755 and 0.546, respectively. The value of R of Chaos-GRNN model is more closer to 1. (2) The value of RMSE, MAPE, and NSE of Chaos-GRNN model is 7.63, 5.02%, and − 0.233, respectively. The values of these three indices are closer to 0 than those of Chaos-BPNN model.

It can be seen from Fig. 9: (1) The median of Chaos-GRNN model is close to the median of measured value, while the median of Chaos-BPNN model is obviously larger. (2) The discreteness of Chaos-BPNN model is much greater than that of Chaos-GRNN model. (3) The density function of Chaos-GRNN model is more concentrated than that of Chaos-BPNN model. Moreover, the maximum value of the density function of Chaos-GRNN model is more closer to the maximum value of the density function of the measured value.

By comparing the four indices (R, RMSE, MAPE, NSE) and violin plot of the two models, the forecast result of Chaos-GRNN model is better than that of Chaos-BPNN model.

5 Conclusions

Chaos theory can recover the rich dynamic information hidden in time series. By reconstructing water inflow time series in phase space, the multi-dimensional matrix could be obtained, with each column representing an influencing factor of water inflow and its value representing the change of the influencing factor with time. Therefore, a new prediction model of mine water inflow is established by combining ANN with chaos theory when lacking data on the influencing factors of water inflow.

The two models were evaluated by using four evaluation indices (R, RMSE, MAPE, NSE) and violin plot. It was found that both models can realize the long-term prediction of water inflow, and the prediction effectiveness of Chaos-GRNN model is better than that of Chaos-BPNN model.

However, whether all the time series of mine water inflow has chaotic characteristics, there is no clear answer yet. Therefore, before using the Chaos-GRNN model, it is necessary to verify the chaotic characteristics of the water inflow time series firstly. Once the water inflow time series does not have chaotic characteristics, other predict methods need to be used.