Introduction

Rainfall trend analysis is an active research area spanning environmental, agricultural, and engineering studies. Reviewing historical rainfall data that extends back several decades without discontinuities provides an opportunity for identifying trends that may point to a changing climate. The Intergovernmental Panel on Climate Change (IPCC 2014) advises that, since 1951, statistically significant increases in heavy rainfall events have occurred across more regions than have statistically significant decreases.

In Australia, the Commonwealth Scientific and Industrial Research Organisation (CSIRO) and the Bureau of Meteorology (BoM) released the State of the Climate report (CSIRO and BoM 2020), indicating that there has been an increase in the intensity of heavy rainfall events over the past 25 years. Based on that report, extreme rainfall events of up to and including 60-min duration have also increased in frequency by at least 10% across different regions of the country (CSIRO and BoM 2020, p. 8). Such short-duration events carry an added potential for flash flooding, placing communities at risk. Cashen (2011) quantified the effects of climate change superimposed on the Australian climate influences of the El Niño Southern Oscillation (ENSO) and the Indian Ocean Dipole (IOD). ENSO in its two phases (i.e., La Niña delivering an increase in rainfall, and El Niño delivering a reduction in rainfall), combined with a positive IOD (i.e., rainfall decrease) or negative IOD (i.e., rainfall increase), oscillates about the typical rainfall trend (Cashen 2011).

There is considerable rainfall variability across the Central West region of New South Wales, resulting from complex interactions between weather patterns, large-scale climate influences such as ENSO, the topography of the Blue Mountains and the Great Dividing Range, and the region's straight-line distance of roughly 80 km from the East Coast of NSW (Office of Environment and Heritage 2014). Much of the Central West experiences annual mean precipitation within the 400–800 mm range, in contrast to the far western plains, which receive only around 400 mm per year (Office of Environment and Heritage 2014).

To quantify trends in rainfall data and thereby support the prediction of future rainfall events, a selection of six local government areas (LGAs) within the Central West of NSW, Australia, was chosen for this research. The LGAs are Forbes, Lachlan, Bland, Parkes, Cowra, and Weddin, for which the historical rainfall datasets recorded by the BoM (2020) are publicly available. Some adjoining LGAs (such as Cabonne) are purposely excluded as their catchment characteristics are quite disparate, producing noticeable meteorological differences (for instance, Cabonne’s rainfall ranges within 800–1200 mm per year). The exclusion is also supported by the Central West and Orana—climate change snapshot (2014).

In 2020–2021, the first author studied and analysed (Lake 2021) the variation of monthly temperature and rainfall data using machine learning and a polynomial neural network (PNN), specifically the group method of data handling (GMDH), to investigate their relationship and correlation. To the authors’ knowledge, no other published studies have used GMDH within a regional Australian context for time series rainfall and temperature trend analysis. Accordingly, GMDH Shell version 3 (GMDH 2022), a machine learning package for the Windows 10 operating system, was utilised (the software does not permit the user to influence the regressor distribution or the external criterion). The modelling results illustrated a noticeable disparity between the accuracy of the temperature modelling and that of the rainfall modelling, as measured by factors such as the coefficient of determination. Cox et al. (2019) reported a similar outcome, describing rainfall data as intrinsically noisy, featuring a large scatter with no obvious trend. However, given the capabilities offered by such methods (e.g., Aghelpour and Varshavian 2020; Nguyen et al. 2019), the authors committed to expanding their approach and investigating the potential for improvement of GMDH.

To overcome the modelling issues and realise the benefits of the GMDH approach, this paper studies an established collection of methods that, when combined with GMDH, deliver modelling improvement. Many of these improvements to GMDH have not previously been utilised for rainfall modelling, but published results illustrate their potential if applied within the rainfall domain. Hence, this paper brings together methods of improvement that, to the authors’ knowledge, have not been presented collectively in a single study. Each section details the mathematical background and guidance for implementation in a rigorous and informative manner, utilising appendices to illustrate the mathematics. The composition of parametric GMDH is introduced together with the formation of combinatorial algorithms (COMBI) and multilayered iterative algorithms (MIA) (Madala and Ivakhnenko 1994). Least squares support vector machines (LSSVM), the application of unscented Kalman filters (UKF) for polynomial parameter determination, and three signal processing techniques are investigated for hybridisation with GMDH. Variations of GMDH are also discussed within the enhanced and fuzzy contexts before additional methods that may prove beneficial for modelling accuracy are suggested. It should be noted that within this paper the term standard GMDH applies to the COMBI and MIA algorithms, while parametric GMDH is a single-layer network of neurons.

Analysis of parametric GMDH

GMDH is a machine learning method, machine learning being a branch of artificial intelligence (Wakefield 2021). It was introduced in 1966 by Dr A. G. Ivakhnenko as an algorithm allowing the development of high-order regression-type polynomials (Farlow 1981). In designing GMDH, Ivakhnenko employed a heuristic and perceptron approach, the latter being a feature of artificial neural networks (ANN) (Anastasakis and Mort 2001). GMDH algorithms are grouped broadly into two categories: parametric and non-parametric. COMBI and MIA fall within the former category, where the input data are either exact or possess noise of low variance (Anastasakis and Mort 2001). These algorithms model the full range of “possible input variable combinations”, selecting the best model from the complete set of generated models according to the external criterion (Anastasakis and Mort 2001, p. 4). Ivakhnenko et al. (1983) define COMBI algorithms as a complete mathematical inductive method, as no potential model is passed over prior to consideration.

COMBI algorithms organise the models through gradual term increments from 1 to m, where m is the number of arguments, while the external criterion specifies the optimum solution among models exhibiting the same degree of complexity. The criterion produces a minimum value within the “plane of complexity vs selection criteria”, thereby corresponding to the non-physical optimum model. The primary distinction between the COMBI algorithm and the MIA is the number of layers. According to Anastasakis and Mort (2001), the structure of the multilayer algorithms is comparable to that of multilayer feedforward neural networks; the distinction lies in the number of layers and neurons, which are objectively allocated by the external criterion in compliance with the incompleteness theorem. The principal theory behind GMDH and the GMDH algorithms is framed by four heuristics (Ivakhnenko 1970).

1) Collect a dataset ideally representative of the object sought, for example, rainfall data.

2) Partition the dataset into two subsets: the first is deemed the ‘training’ set, from which the polynomial coefficients are determined; the second forms the ‘testing’ set, used with the external criterion to separate the embedded metadata into two divergent categories, helpful or unhelpful. The partitioning is undertaken automatically by the GMDH algorithmic process, requiring no user input. Test data usually comprise 33% of the total dataset, with the algorithm selecting where to draw the data from (Dorn et al. 2012).

3) Produce a set of elementary functions (often quadratic polynomials) of increasing complexity through an iterative process in which a range of different models are produced.

4) Apply the external criterion to select the optimised model. This procedure is based upon Gödel’s incompleteness theorem, which states that “under certain conditions in any language there exist true but unprovable statements” (Uspensky 1994, p. 241). The implication is that, for the model most representative of the system to be found, a comparative analysis against the external criterion is required. He et al. (2008) state that the data used within the training set and by the external criterion are mutually exclusive: the data not used within the training set for estimating the parameters and creating the model, namely the testing set, are used by the external criterion for evaluating and selecting the model of the best quality. He et al. (2008) emphasise the significance of optimised cooperation between the external criterion and the division of the dataset; the latter, though, is not an option with GMDH Shell 3. Readers should refer to He et al.’s work for a comprehensive analysis.
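The interplay of heuristics 1, 2, and 4 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not GMDH Shell's implementation: only the 33% random partitioning and the MSE-based regularity criterion are taken from the text, and the candidate-model interface (`candidate_fits`) is a hypothetical device for the example.

```python
import numpy as np

def external_criterion(model, X_test, y_test):
    """Regularity criterion: MSE of a candidate model on the testing set."""
    return np.mean((model(X_test) - y_test) ** 2)

def gmdh_select(X, y, candidate_fits, test_fraction=0.33, seed=0):
    """Minimal sketch of the heuristics: split the data, fit each candidate
    on the training subset, keep the one the external criterion prefers."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y))
    n_test = int(len(y) * test_fraction)
    test, train = idx[:n_test], idx[n_test:]
    best, best_err = None, np.inf
    for fit in candidate_fits:
        model = fit(X[train], y[train])                   # coefficients from training set
        err = external_criterion(model, X[test], y[test]) # judged on the testing set
        if err < best_err:
            best, best_err = model, err
    return best, best_err
```

In practice `candidate_fits` would produce models of increasing complexity (heuristic 3), with the loop above realising the comparative analysis that the incompleteness argument motivates.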

The foundational mathematical process that underpins GMDH theory is the Volterra functional series, represented in discrete analogue form as the Kolmogorov–Gabor polynomial, Eq. (1) (Anastasakis and Mort 2001). A nonlinear multivariate high-order polynomial can describe past and present time series (Gilbar 2002). This capability is distinct from a Taylor series, which can only describe a specific moment in time: the former captures a dynamic representation of time, while the latter is static

$$Y\left(x,t\right)={a}_0+\sum\nolimits_{i=1}^M{a}_i{x}_i+\sum\nolimits_{i=1}^M\sum\nolimits_{j=1}^M{a}_{ij}{x}_i{x}_j+\sum\nolimits_{i=1}^M\sum\nolimits_{j=1}^M\sum\nolimits_{k=1}^M{a}_{ijk}{x}_i{x}_j{x}_k+\dots$$
(1)

The model output response is designated Y, x = (x1, x2, x3, …, xm) is the vector of input variables, also referred to as regressors, where \({x}_m\in {\mathbb{R}}^{m_x}\), a = (a0, a1, a2, …, am) is the vector of coefficients or weights, and m is the number of regressors.

Müller et al. (1998) detail that the GMDH algorithm utilises an inductive approach framed by the self-organisation principle. The inductive approach is unbounded, with the regressors randomly shifted and activated, allowing the closest match to the dependent variables to be selected (Madala 1991). Self-organisation is, according to Green et al. (1988), a non-parametric process in the sense that no a priori model is assumed. The idea of a unique model of optimum complexity that can be determined through self-organisation forms the foundation of the inductive approach. GMDH delivers the output through the construction of analytic functions formed by quadratic polynomials in a feedforward network structure. The polynomial coefficients are derived from pairs of regressors through a regression technique based upon the ordinary least squares (OLS) method. A cascade of quadratic polynomials, commonly referred to as partial descriptions (PD), resides in each neuron (Madala and Ivakhnenko 1994). The quadratic polynomials are of the form

$$y={a}_0+{a}_1{x}_i+{a}_2{x}_j+{a}_3{x}_i^2+{a}_4{x}_j^2+{a}_5{x}_i{x}_j$$
(2)

Appendix A illustrates the mathematical functionality of the algorithmic process.
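As a concrete illustration of Eq. (2), the sketch below fits one partial description to a pair of regressors by OLS, as a GMDH layer would for every regressor pair. It is a minimal numpy example for exposition, not the GMDH Shell implementation.

```python
import numpy as np

def fit_partial_description(xi, xj, y):
    """Fit the quadratic partial description of Eq. (2),
    y = a0 + a1*xi + a2*xj + a3*xi^2 + a4*xj^2 + a5*xi*xj,
    by ordinary least squares; returns the coefficient vector a."""
    Z = np.column_stack([np.ones_like(xi), xi, xj, xi**2, xj**2, xi * xj])
    a, *_ = np.linalg.lstsq(Z, y, rcond=None)
    return a

def evaluate_pd(a, xi, xj):
    """Evaluate the fitted partial description for new regressor values."""
    return a[0] + a[1]*xi + a[2]*xj + a[3]*xi**2 + a[4]*xj**2 + a[5]*xi*xj

# A GMDH layer would repeat this for every regressor pair (xi, xj) and rank
# the resulting neurons by the external criterion on the testing set.
```

This also makes the OLS-bias limitation discussed below concrete: the coefficients `a` are whatever minimises the squared training residual, regardless of whether the error distribution justifies that choice.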

GMDH limitations

The unsuccessful rainfall modelling that spawned the writing of this paper is not reproduced here, as its inclusion is unnecessary. What is necessary is ascertaining why parametric GMDH was unsuccessful and what can be introduced to facilitate improvement when using GMDH to model phenomena such as rainfall. Limitations encompass a range of factors, including the exclusion of essential regressors, which introduces noise that impairs model performance (Anastasakis and Mort 2001). From studies undertaken by Green et al. (1988), the problem appears to be caused by collinearity: even with independent regressors within the input vector, the PDs formed within the first and subsequent iterations were not mutually independent. Green et al. (1988) further emphasised that this results in a selection of regressors being excluded. Overfitting can also be a problem, and when combined with multiple neuron layers, the resulting instability delivers poor-quality predictions (Green et al. 1988). Biased estimates of PD coefficients resulting from the application of OLS are an additional shortcoming (Anastasakis and Mort 2001). There is an assumption that the errors between observed and estimated output values follow a Gaussian distribution, justifying the use of linear regression for PD parameter determination. In reality, this assumption is frequently violated, meaning that OLS is not a suitable method (Anastasakis and Mort 2001). Standard GMDH will also fail with fuzzy input data, for which a suitably modified GMDH could be appropriate (Anastasakis and Mort 2001). The limitations listed may not seem considerable, but their implications can be significant for the functionality of standard GMDH. For this reason, modified versions of GMDH are explored below. These formats will be shown to deliver improvements over standard GMDH that are worth investigating further, particularly to test their suitability for rainfall modelling.

Hybridising GMDH

Accurate rainfall prediction is a challenge when using a singular model, so the introduction of a hybrid that combines two models has the potential to deliver performance exceeding the capabilities of each composing model (Parviz et al. 2021). One example for time series forecasting is hybridising least squares support vector machines (LSSVM) with GMDH. In research undertaken by Samsudin et al. (2011) for time series forecasting, the hybrid LSSVM-GMDH model delivered more accurate results due to its robust nature and ability to model nonlinear data. Support vector machines (SVM), like GMDH, fall under the umbrella of ML. They map the vector of regressors into a designated high-dimensional feature space Z via a nonlinear mapping procedure selected a priori. A hyperplane is constructed within this space that ensures the high generalisation ability of the network (Cortes and Vapnik 1995). The technique of support vector networks was originally developed for the limited case of separating training data without errors. As this ideal is often unattainable, SVMs were extended and came to be regarded as a new class of ML with comparable power and universality to neural networks.

Further development introduced the least-squares variant of the SVM, designated LSSVM (Cortes and Vapnik 1995). The difference is that the SVM employs inequality constraints, whereas the LSSVM uses equality constraints and implements a system of linear least squares as its loss function (Samsudin et al. 2010a). The LSSVM also offers good convergence combined with high precision, whereas the SVM requires quadratic programming solvers that are more challenging to use (Samsudin et al. 2010a). Analogous to GMDH, the LSSVM predictor detailed in Samsudin et al. (2010b) is trained by employing a set of historical time series regressors that deliver a single output as the goal. Appendix B details the mathematical approach of Samsudin et al. (2011). Figure 1 illustrates the structure of the SVM. Most real-world data are not linearly separable, meaning the hyperplane is not represented by a straight line. To overcome this problem, kernel functions are used to transform the nonlinear data for linear presentation at higher dimensions (Ampadu 2021).

The steps taken for hybridisation (Samsudin et al. 2011) are:

1) Separate the normalised data into training and testing sets.

2) Using the GMDH MIA, combinations of all input state variables (xi, xj) are generated, the total number of combinations being \({C}_2^M\). The regression polynomial is constructed with an approximation of the output given by equation (A9). The linear vector of coefficients A for each PD is determined by OLS.

3) The output of each neuron x is assessed against the external criterion, with the smallest MSE selected for the formation of an augmented input {x1, x2, …, xM, x}, with M = M + 1, for a neuron within the next hidden layer.

4) The outputs of the neurons within the hidden layers form the inputs {x1, x2, …, xM, x} for the LSSVM in the GMDH output layer. The minimal MSE from the LSSVM will be selected as the output model.

5) The minimal MSE obtained from the LSSVM for the test dataset extracted at each layer during the current iteration is compared against the minimal value from the previous iteration. If there is an improvement, steps 2 to 4 are repeated; otherwise, the iterations cease with the knowledge that the network is now complete. Determination of the final layer signifies that only the one node with the best performance is selected. When this occurs, the remaining nodes within the output layer are discarded, thus delivering the hybrid group least squares support vector machine (GLSSVM) model.
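The LSSVM at the heart of steps 4 and 5 can be sketched as the solution of a linear system, which is precisely what distinguishes it from the SVM's quadratic programme. The sketch below assumes an RBF kernel and illustrative hyperparameter values; in the GLSSVM, the rows of `X` would be the GMDH hidden-layer neuron outputs rather than raw regressors.

```python
import numpy as np

def rbf_kernel(A, B, sigma=1.0):
    """Gaussian (RBF) kernel matrix between row-vector sets A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2.0 * sigma**2))

def lssvm_fit(X, y, gamma=10.0, sigma=1.0):
    """Train an LSSVM regressor by solving its linear KKT system
    (equality constraints + squared-error loss):
        [[0, 1^T], [1, K + I/gamma]] [b; alpha] = [0; y]
    Returns a predictor f(Xn) = K(Xn, X) @ alpha + b."""
    n = len(y)
    K = rbf_kernel(X, X, sigma)
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = 1.0
    A[1:, 0] = 1.0
    A[1:, 1:] = K + np.eye(n) / gamma       # gamma regularises the fit
    sol = np.linalg.solve(A, np.concatenate([[0.0], y]))
    b, alpha = sol[0], sol[1:]
    return lambda Xn: rbf_kernel(Xn, X, sigma) @ alpha + b
```

Because training reduces to one linear solve, the layer-wise comparison of minimal MSE values in step 5 is inexpensive to repeat at each iteration.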

Fig. 1 Structure of the support vector machine

The Kalman filter and GMDH

In its original form, the Kalman filter (KF) algorithm employs a dynamics model that describes the system’s current status and its expected status at the following step, provided the system is linear and any process disturbance, combined with the error of measurement, is additive and Gaussian (Pasek and Kaniewski 2021). Most real-world systems, including rainfall processes, are nonlinear and, in the case of monthly rainfall data, non-Gaussian. To overcome this incompatibility, the extended Kalman filter (EKF) was developed to handle nonlinear functions, which are locally approximated with linear equations through Taylor expansion (Pasek and Kaniewski 2021). Monthly rainfall data are highly nonlinear, potentially introducing significant errors in linearisation, as only the first term of the Taylor series is utilised (Pasek and Kaniewski 2021). The EKF also requires the computation of the Jacobian matrix at each time step.

If the initial conditions are poorly known, or there are significant measurement errors, major errors in state vector estimation could potentially lead to divergence of the filter (Kraszewski and Czopik 2017). As a means of overcoming these problems, the unscented Kalman filter (UKF) was developed by Julier and Uhlmann (1997). To implement the UKF, a set of sigma points is chosen with a known mean and covariance. A nonlinear function is applied at each point, yielding a set of transformed points. The transformed point statistics can be calculated to provide an estimate of the nonlinearly transformed mean and covariance (Julier and Uhlmann 2004). In combining the UKF with GMDH, the UKF is utilised for parameter estimation for each GMDH neuron PD (Luzar et al. 2011). GMDH determines the parameters for each PD separately, and it is this procedure that allows the UKF to be utilised (Mrugalski 2013). The main advantage of applying UKF for estimating PD parameters is the generation of an asymptotically stable GMDH model (Mrugalski 2013).

The design of a Kalman filter is normally based upon an analytical model of a dynamic process or system \({x}_{k+1}=F\left({x}_k,p\right)\), with \({x}_k\) the unknown state vector for the dynamic process taken at time step k (Pan et al. 2020). F(.) is the dynamic model of the system, parameterised by the vector p.

\({y}_k=H\left({x}_k,p\right)\) is the model of the observable variables belonging to the dynamic system, with \({y}_k\) the observation vector composed of variables that can be measured (Pan et al. 2020). H(.) is the state-to-measurement mapping between \({x}_k\) and \({y}_k\).

Additive process noise affects the degree of accuracy of F and is modelled by Q, the process noise covariance matrix. R, the observation noise covariance matrix, models the uncertainty properties of system observations (Pan et al. 2020).

The UKF is initialised with an estimate of the unknown state variables \(\left({\hat{x}}_0\right)\), supplied with successive measurements \(\left({y}_k\right)\). The state variables \(\left({\hat{x}}_k\right)\) are then recursively estimated through covariance minimisation \({\hat{P}}_k=\mathit{\operatorname{cov}}\left({x}_k-{\hat{x}}_k\right)\). \({\hat{P}}_k\) is a diagonal matrix with elements representing the uncertainty estimate of \({\hat{x}}_k\) (Pan et al. 2020). Through the implementation of the UKF algorithm, the mean squared estimation errors are minimised, with an increase in the convergence rate in addition to improved stability against corruption of the experimental data by noise (Masoumnezhad et al. 2016). For a detailed mathematical explanation of the parameter estimation of GMDH dynamic neurons, refer to Mrugalski (2013). Figure 2 illustrates the integration of the UKF into GMDH for PD parameter estimation. No literature has been found to date where this approach has been applied to rainfall modelling; however, it has been used successfully in the identification of various dynamic systems (Luzar et al. 2011) and in the design of a robust fault detection system (Mrugalski 2013).
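For a single GMDH neuron whose PD coefficients form the unknown state, one UKF measurement update can be sketched as follows. This is a simplified illustration of the parameter-estimation idea (random-walk parameter dynamics, scalar measurements, symmetric sigma points), not Mrugalski's (2013) full formulation; the sigma-point scaling `kappa` and the noise covariances are illustrative assumptions.

```python
import numpy as np

def ukf_param_update(theta, P, z, h, Q, R, kappa=1.0):
    """One UKF step estimating static parameters theta (random-walk model):
    predict with additive Q, then correct with scalar measurement z = h(theta) + v."""
    n = len(theta)
    P = P + Q                                   # time update for a random-walk state
    S = np.linalg.cholesky((n + kappa) * P)     # matrix square root of scaled covariance
    sigmas = np.vstack([theta, theta + S.T, theta - S.T])   # 2n+1 sigma points
    w = np.full(2 * n + 1, 1.0 / (2.0 * (n + kappa)))
    w[0] = kappa / (n + kappa)                  # weights sum to one
    zs = np.array([h(s) for s in sigmas])       # propagate through measurement model
    z_hat = w @ zs
    Pzz = w @ (zs - z_hat) ** 2 + R             # predicted measurement variance
    Pxz = (w[:, None] * (sigmas - theta) * (zs - z_hat)[:, None]).sum(axis=0)
    K = Pxz / Pzz                               # Kalman gain (scalar measurement)
    theta = theta + K * (z - z_hat)
    P = P - np.outer(K, K) * Pzz
    return theta, P
```

For a PD, `h` would evaluate Eq. (2) at the current regressor pair; looping the update over the training samples recursively refines the neuron's coefficient estimates.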

Fig. 2 Unscented Kalman filter for determining GMDH neuron parameters

Signal processing approaches

In research undertaken by Moosavi et al. (2017), hybridising wavelet transform (WT) and wavelet packet transform (WPT) with GMDH (WGMDH and WPGMDH, respectively) delivered improved results over GMDH alone for runoff forecasting. Of these two, the WPT hybrid delivered the best results. Both the WT and WPT improve the performance of the GMDH modelling by decomposing the original dataset into components (Moosavi et al. 2017).

In a study undertaken by Moosavi (2019) predicting rainfall within the context of natural disasters, GMDH was hybridised with ensemble empirical mode decomposition (EEMD), and again with WT, and WPT forming WTGMDH and WPTGMDH, respectively. In all three cases, the rainfall modelling with GMDH improved once hybridised with these signal processing approaches. The WPTGMDH performed best, followed by EEMD-GMDH, then WTGMDH. All these hybrids outperformed the standard GMDH without signal processing assistance. In further research by Moosavi et al. (2021), the combination of GMDH with the same three separate signal processing approaches—EEMD, WT, and WPT—improved the performance of GMDH for groundwater level forecasting in all three cases.

Given these findings, it seems prudent to investigate further the potential for improving the modelling of monthly rainfall data with GMDH by pre-processing the data with these signal processing approaches. An example of the monthly rainfall modelling undertaken during 2020–2021 using GMDH is illustrated in Fig. 3, covering the town of Forbes from 1995 to 2021.

Fig. 3 GMDH model of monthly rainfall in Forbes NSW from 1995 to 2021

In Fig. 3, the BoM (2020) data profile is shown in grey, the GMDH model in blue, and the prediction extending 12 months into the future in red. The coefficient of determination returned a value of 0.246, and all rainfall modelling undertaken returned similarly unsatisfactory results. There is no clear trend within the actual data, nor can past events alone provide a reliable means of predicting future events. The focus of this empirical assessment is to consider whether pre-processing the rainfall data might improve the modelling outcome, thereby providing greater confidence in the rainfall prediction.

Ensemble empirical mode decomposition (EEMD)

Empirical mode decomposition (EMD) was introduced by Huang et al. (1996). It is applicable to both nonlinear and non-stationary time series, with Srikanthan et al. (2011) noting that the EMD method makes no assumption about the form of the time series prior to analysis. EMD can adaptively decompose any complex dataset into a series of intrinsic mode functions (IMF) (Huang et al. 1996). The IMFs are independent and, in combination with a residual, can effectively be summed to reform the original series (Srikanthan et al. 2011). EMD does, however, suffer from mode mixing, which occurs when a fluctuation at a given frequency separates across two IMFs; mode mixing often arises from intermittent signals (Wu and Huang 2009). The EMD algorithm employs a data-driven adaptive iterative approach, meaning that mode mixing is difficult to avoid without subjectively contemplating the likely form of any signal for extraction before undertaking analysis (Srikanthan et al. 2011).

A noise-assisted data analysis (NADA) has been proposed to overcome the mode mixing problem. The ensemble EMD (EEMD) defines the true IMF components “as the mean of an ensemble of trials, each consisting of the signal plus white noise of finite amplitude” (Wu and Huang 2009, p. 2). For a closer look at the EMD method and the introduction of white noise in the EEMD approach, refer to Appendix C for details on the process of decomposing the signal into IMFs. In utilising the EEMD, the added white noise series cancel out in the final mean of the corresponding IMFs (Wu and Huang 2009). The mean IMFs are positioned within the natural dyadic filter windows, thereby significantly reducing the potential for mode mixing while preserving the dyadic property (Wu and Huang 2009). The IMFs are more effective in isolating physical processes across a range of time scales owing to their mutual orthogonality (Molla et al. 2006). Applying EEMD to the GMDH rainfall modelling entails decomposing the historical rainfall signal into a set of IMFs and a residual prior to processing by GMDH. GMDH then implements supervised learning with the IMFs supplied as input data; this process, and the formation of a function to represent the data and predict future rainfall events, should in theory be improved. Figure 4a illustrates a flowchart of the EEMD-GMDH hybrid.
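The ensemble idea can be sketched directly: decompose many noise-added copies of the signal and average the resulting IMFs so the white noise cancels in the mean. The sketch below is heavily simplified (linear-interpolation envelopes instead of the cubic splines used in practice, a fixed number of sifting iterations, no stopping criterion), so it illustrates the EEMD mechanism rather than reproducing a production EMD implementation.

```python
import numpy as np

def _envelope(x, idx):
    # Linear-interpolation envelope through extrema (cubic splines are usual).
    t = np.arange(len(x))
    if len(idx) < 2:
        return np.full_like(x, x.mean())
    return np.interp(t, idx, x[idx])

def extract_imf(x, n_sift=10):
    """Extract one IMF by sifting: repeatedly subtract the mean of the
    upper and lower envelopes through the local extrema."""
    h = x.copy()
    for _ in range(n_sift):
        up = np.flatnonzero((h[1:-1] > h[:-2]) & (h[1:-1] > h[2:])) + 1
        lo = np.flatnonzero((h[1:-1] < h[:-2]) & (h[1:-1] < h[2:])) + 1
        m = 0.5 * (_envelope(h, up) + _envelope(h, lo))
        h = h - m
    return h

def eemd(x, n_imf=3, n_trials=50, noise_std=0.2, seed=0):
    """EEMD: average the IMFs of many noise-added copies so the white
    noise cancels in the ensemble mean (Wu and Huang 2009)."""
    rng = np.random.default_rng(seed)
    imfs = np.zeros((n_imf, len(x)))
    for _ in range(n_trials):
        r = x + noise_std * x.std() * rng.standard_normal(len(x))
        for k in range(n_imf):
            imf = extract_imf(r)
            imfs[k] += imf
            r = r - imf              # residual carries the slower modes
    return imfs / n_trials
```

In the hybrid of Fig. 4a, each averaged IMF (plus the residual) would be supplied to GMDH as an input series.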

Fig. 4 Flowcharts for a the hybridised EEMD-GMDH algorithm and b the hybridised WT-GMDH algorithm

Wavelet transform (WT)

Like EEMD, the WT provides a mechanism for signal decomposition. Wavelets are functions that meet a set of mathematical conditions and have the ability to represent data and other functions (Graps 1995). They can analyse non-stationary time series across a range of frequencies (Daubechies 1990). By decomposing a time series into time-frequency space, both the dominant modes of variability and how those modes vary in time can be determined. Wavelet transforms are available in continuous and discrete forms, and both types have been used in hydrometeorological modelling studies (Torrence and Compo 1998). The fundamental idea behind the WT is the ability to perform signal analysis at different resolutions, which gives rise to its most important feature, multiresolution decomposition (Lubis et al. 2017). Özger et al. (2012) employed the continuous WT (CWT) in their study of drought forecasting, while Alfa et al. (2019) utilised the discrete WT (DWT) in a hybrid with GMDH for their drought forecasting study. Moosavi et al. (2021) employed the DWT as a hybrid with GMDH in their research on groundwater level modelling. Appendix D provides a brief mathematical overview with reference to Addison (2018) and Lambers (2006).

It is important to recognise that the integral defining Wf(m, n) is taken over an unbounded interval, but it is effectively confined to a finite interval provided the mother wavelet has compact support; hence a numerical approximation is straightforward. Moosavi et al. (2021) detailed the importance of selecting the best WT structure: choosing the mother wavelet and the most effective level of decomposition. The decomposed data are then supplied to GMDH for processing, with the optimum model delivered. Figure 4b illustrates a flowchart featuring the integration of the WT with GMDH.
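A minimal discrete wavelet decomposition can be sketched with the Haar wavelet, the simplest mother wavelet with compact support. Practical WT-GMDH studies typically use smoother mother wavelets (e.g., the Daubechies family) via a wavelet library, so the sketch below is purely illustrative of the multiresolution idea.

```python
import numpy as np

def haar_dwt(x):
    """One level of the Haar DWT: split the series into a low-frequency
    approximation and a high-frequency detail (orthonormal filters)."""
    x = np.asarray(x, dtype=float)
    if len(x) % 2:                       # pad to even length
        x = np.append(x, x[-1])
    s = np.sqrt(2.0)
    approx = (x[0::2] + x[1::2]) / s     # low-pass: scaled pairwise sums
    detail = (x[0::2] - x[1::2]) / s     # high-pass: scaled pairwise differences
    return approx, detail

def haar_idwt(approx, detail):
    """Invert one Haar DWT level exactly (perfect reconstruction)."""
    s = np.sqrt(2.0)
    x = np.empty(2 * len(approx))
    x[0::2] = (approx + detail) / s
    x[1::2] = (approx - detail) / s
    return x

def wavedec(x, level):
    """Multilevel decomposition: recursively transform the approximation,
    then feed all resulting sub-series to GMDH as in the WT hybrids."""
    coeffs = []
    for _ in range(level):
        x, d = haar_dwt(x)
        coeffs.append(d)
    coeffs.append(x)
    return coeffs[::-1]                  # [approx_L, detail_L, ..., detail_1]
```

Choosing the mother wavelet and decomposition level, the design decisions Moosavi et al. (2021) highlight, corresponds here to swapping the filter pair and the `level` argument.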

Wavelet packet transform (WPT)

The WPT improves upon the WT by raising the degree of resolution of high-frequency signals, producing an enhanced high-frequency time-frequency localisation effect (Yan et al. 2021). The WPT prevents the loss of time-frequency information, the signal being placed within a domain where simultaneous analysis in both time and frequency can occur (Wickerhauser 1991; Gokhale and Khanduja 2010). Wavelet packet decomposition (WPD) involves passing the signal through a greater number of filters compared with the WT (Gokhale and Khanduja 2010). In general terms, a wavelet packet is a square-integrable function with a mean of zero and compact support in both time and frequency (Zha et al. 2008). A brief mathematical outline following Zha et al. (2008) is provided here. Wavelet packets are described by a collection of functions \({\left\{{\varphi}_k\right\}}_{k=0}^{\infty }\) obtained from:

$${\varphi}_{2k}(x)=\sqrt{2}\sum\nolimits_n{h}_k(n){\varphi}_k\left(2x-n\right)$$
(3)
$${\varphi}_{2k+1}(x)=\sqrt{2}\sum\nolimits_n{g}_k(n){\varphi}_k\left(2x-n\right)$$
(4)

noting that the discrete filters hk(n) and gk(n) are quadrature mirror filters (p. 405). The function φ0(x) is identifiable with the scaling function, and φ1(x) with the mother wavelet. The wavelet packet basis, owing to its orthonormality and completeness, guarantees retention of the original signal information. The inverse transform of the wavelet packets can be expressed as:

$${\varphi}_k(x)=\sum\nolimits_n\left[{h}_k(n){\varphi}_{2k}\left(x-2n\right)+{g}_k(n){\varphi}_{2k+1}\left(x-2n\right)\right]$$
(5)

The potentially significant feature of the WPT, specifically within this paper and the context of rainfall modelling, is its ability to recursively decompose high-frequency signal components (Zha et al. 2008). The WPT constructs a tree-structured multiband extension of the WT, which allows it to recursively divide the entire frequency band for noise detection and removal. By sifting from the signal those components that can be classed as noise, within the context of isolating the intense rainfall events, it is hypothesised that the hybridised WPTGMDH can produce models with an improved coefficient of determination. This could potentially highlight trends within the rainfall data with greater clarity and predictive significance. Figure 5 illustrates the hybrid WPTGMDH modelling process.
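The tree-structured decomposition can be sketched by applying the same filter pair to both the low- and high-frequency branches at every level. Haar filters again stand in for whichever quadrature mirror filter pair hk(n), gk(n) is actually chosen, so this is an illustrative sketch rather than a production transform.

```python
import numpy as np

def haar_split(x):
    """One quadrature-mirror split into low- and high-frequency halves."""
    x = np.asarray(x, dtype=float)
    if len(x) % 2:
        x = np.append(x, x[-1])
    s = np.sqrt(2.0)
    return (x[0::2] + x[1::2]) / s, (x[0::2] - x[1::2]) / s

def wpt(x, level):
    """Wavelet packet decomposition: unlike the DWT, *both* branches are
    split again at each level, giving a full binary tree whose leaves
    tile the entire frequency axis into 2**level sub-bands."""
    nodes = [np.asarray(x, dtype=float)]
    for _ in range(level):
        nxt = []
        for node in nodes:
            lo, hi = haar_split(node)
            nxt.extend([lo, hi])
        nodes = nxt
    return nodes        # 2**level leaf sub-bands, lowest frequencies first
```

In the hypothesised WPTGMDH workflow, sub-bands judged to be noise would be discarded (or zeroed before reconstruction) and the remainder supplied to GMDH.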

Fig. 5 Flowchart for hybridised WPT with GMDH

Improved GMDH algorithms

The MIA and COMBI algorithms share the same approach to determining the PD coefficients, that of OLS. This regression approach can produce coefficients with biased estimates, which is detrimental to model accuracy. Structurally, the MIA is superior to the COMBI algorithm by virtue of possessing multiple layers of neurons, which makes processing more time-efficient. One avenue for improving the performance of the MIA algorithm lies in the neuron pruning process (Buryan 2013). In the MIA, the pruning of neurons based upon the external criterion can result in the loss of neurons that may have been useful, and synthesising the neuron outputs within a given layer promotes ill-performing regression within the next layer (Buryan 2013). The network’s complexity can also exceed desirable levels owing to the quadratic polynomials that form the PDs when the vector of regressors is highly nonlinear (Buryan and Onwubolu 2011). To overcome these shortcomings, the enhanced MIA-GMDH (eGMDH, or eMIA-GMDH) algorithm was proposed, detailed by Buryan and Onwubolu (2011) from work undertaken by Buryan (2006).

Enhanced MIA-GMDH

The improvements encompass the following (Buryan and Onwubolu 2011):

  1. Within the original GMDH, Eq. (A2) fixes n = 2 for all layers; in eGMDH this applies only within layer one, and all subsequent layers can utilise different values of n.

  2. OLS for PD coefficient determination is not limited to quadratic polynomials; a further seven types modify Eq. (2):

    a. Harmonic based on the cosine function

    b. Radical

    c. Inverse polynomial

    d. Natural logarithm

    e. Exponential

    f. Arc tangent

    g. Rounded polynomial presenting integer coefficients

  3. The pruning of layers is semi-randomised: upon selecting a subset of the best-performing neurons in each layer, the balance is selected at random.

  4. Coefficient rounding and thresholding are implemented to stabilise the regression process by rejecting coefficients that lie above or below pre-set thresholds.

  5. Neuron inputs are not limited to pairs.
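Improvement (2) can be illustrated by fitting several alternative transfer-function forms by OLS and keeping the one with the lowest validation MSE. The candidate set below is an illustrative subset of the seven listed types, not the exact eGMDH formulation; each form remains linear in its coefficients, so OLS applies to all of them.

```python
import numpy as np

# Candidate transfer functions (an assumed subset of the eGMDH types):
# each maps two inputs to a design matrix that is linear in its coefficients.
CANDIDATES = {
    "quadratic": lambda u, v: np.column_stack(
        [np.ones_like(u), u, v, u * v, u**2, v**2]),
    "harmonic":  lambda u, v: np.column_stack(
        [np.ones_like(u), np.cos(u), np.cos(v)]),
    "log":       lambda u, v: np.column_stack(
        [np.ones_like(u), np.log(np.abs(u) + 1), np.log(np.abs(v) + 1)]),
    "arctan":    lambda u, v: np.column_stack(
        [np.ones_like(u), np.arctan(u), np.arctan(v)]),
}

def best_transfer(u, v, y, u_val, v_val, y_val):
    """Fit every candidate by OLS, score on hold-out data, return the winner."""
    scores = {}
    for name, basis in CANDIDATES.items():
        coef, *_ = np.linalg.lstsq(basis(u, v), y, rcond=None)
        scores[name] = np.mean((basis(u_val, v_val) @ coef - y_val) ** 2)
    return min(scores, key=scores.get), scores

rng = np.random.default_rng(1)
u, v = rng.normal(size=200), rng.normal(size=200)
y = np.cos(u) + np.cos(v)                       # a harmonic target
winner, scores = best_transfer(u[:150], v[:150], y[:150],
                               u[150:], v[150:], y[150:])
print(winner)
```

For a harmonic target the cosine-based PD wins outright, showing how widening the PD family lets the network match the character of the underlying signal.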

In experiments undertaken by Buryan (2013), the enhanced MIA-GMDH delivered improved results compared to the standard MIA-GMDH algorithm for both Mackey-Glass time series predictions and JPY/USD exchange rate predictions on daily and monthly timeframes. In a study by Onwubolu et al. (2007) covering weather forecasting in Fiji, the eGMDH delivered improved results compared to a PNN and an enhanced PNN (ePNN) for temperature modelling, but it was less successful than the alternatives for rainfall modelling. The study could not establish why this was the case.

Enhanced GMDH using modified Levenberg-Marquardt method

Pournasir et al. (2013) proposed an enhanced GMDH algorithm utilising a modified Levenberg-Marquardt (LM) method, which for the first time incorporated singular value decomposition (SVD) as an improved means of forming the initial guess. It is noted that the process of combining the SVD output into LM for the initial guess was not detailed in their paper. The LM algorithm facilitates swift convergence when solving nonlinear systems and singularity problems (Pournasir et al. 2013). Their experimental outcome illustrates that the enhanced GMDH using the LM algorithm outperformed standard GMDH, delivering results with high inventory control accuracy (Pournasir et al. 2013). The problem of least squares arises when fitting a parameterised mathematical model to a set of data points (within the focal context of this report, rainfall modelling) through minimisation of an objective function, which within the context of GMDH is the selection criterion, the MSE. Minimising the objective with respect to the parameters can be done quickly provided the solution fits a linear matrix equation (Gavin 2020). If the fit function is not linear in its parameters, however, the least squares problem requires an iterative algorithmic solution (Gavin 2020). The algorithmic process reduces the sum of the squares of the errors between the model function and the data points through a sequence of carefully selected model parameter updates (Gavin 2020). The LM algorithm merges two numerical minimisation procedures: the gradient descent (GD) method and the Gauss-Newton (GN) method. In the former, the sum of the squared errors is reduced by updating the parameters in the direction of steepest descent.
For the latter, the least squares function is assumed to be locally quadratic in the parameters, and the sum of the squared errors is reduced by finding the minimum of that quadratic (Gavin 2020). When the parameters are far from their optimal values, LM behaves akin to the GD method; when they are close, LM acts like the GN method (Gavin 2020). Appendix E details mathematically the integration of LM with GMDH.

The integration of LM with GMDH is presented in pseudocode (Transtrum et al. 2011; Mulashani et al. 2021):

  1. Input rainfall data into GMDH.

  2. Determine the initial information for the LM-GMDH structure: initial point x0, damping parameter λ, and the factors λup and λdown for damping term adjustment.

  3. Estimate the parameter weights.

  4. Evaluate the Jacobian and the residuals at the initial parameter values.

  5. Calculate the metric Γ = JTJ + λI, the objective function \(f(x)=\frac{1}{2}{\left|r(x)\right|}^2\), and the gradient ∇f(x) = JTr.

  6. Calculate the new residuals rnew at the location given by xnew = x − Γ−1∇f(x), then calculate the objective function at the new location, \(f{(x)}_{\mathrm{new}}=\frac{1}{2}{\left|r{(x)}_{\mathrm{new}}\right|}^2\).

  7. If f(x)new < f(x), accept the step: assign xnew to x, set r = rnew, and assign \(\lambda =\frac{\lambda }{\lambda_{\mathrm{down}}}\); else reject the step, retaining the old parameter guess x and residuals, and set λ = λλup.

  8. Convergence assessment: if there is convergence, return x as the best-fitting parameters; if there is no convergence but the step was accepted, calculate the Jacobian at the new parameter values and return to step 5.

  9. Assess the PDs within the current layer against the external criterion.

  10. Retain the best-performing neurons.

  11. If the current layer is the final layer, terminate; else return to step 2.
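Steps 2 to 8 above can be sketched in numpy as follows. The exponential curve-fitting problem at the end is a hypothetical example chosen for illustration, not a problem from the cited studies.

```python
import numpy as np

def levenberg_marquardt(residual, jacobian, x0, lam=1e-2,
                        lam_up=10.0, lam_down=10.0, tol=1e-8, max_iter=100):
    """Minimal sketch of the LM loop: damped Gauss-Newton whose damping
    parameter steers between GD (large lam) and GN (small lam)."""
    x = np.asarray(x0, dtype=float)
    r = residual(x)                                 # step 4: residuals
    J = jacobian(x)                                 # step 4: Jacobian
    f = 0.5 * r @ r                                 # objective f = |r|^2 / 2
    for _ in range(max_iter):
        gamma = J.T @ J + lam * np.eye(len(x))      # step 5: metric Γ
        grad = J.T @ r                              # step 5: gradient ∇f
        x_new = x - np.linalg.solve(gamma, grad)    # step 6: candidate step
        r_new = residual(x_new)
        f_new = 0.5 * r_new @ r_new
        if f_new < f:                               # step 7: accept, soften λ
            x, r, f = x_new, r_new, f_new
            lam /= lam_down
            J = jacobian(x)                         # step 8: new Jacobian
            if np.linalg.norm(J.T @ r) < tol:       # step 8: convergence
                break
        else:                                       # step 7: reject, harden λ
            lam *= lam_up
    return x

# Hypothetical fitting problem: recover (a, b) in y = a*exp(b*t) from clean data.
t = np.linspace(0.0, 1.0, 50)
y_obs = 2.0 * np.exp(1.5 * t)
res = lambda p: p[0] * np.exp(p[1] * t) - y_obs
jac = lambda p: np.column_stack([np.exp(p[1] * t), p[0] * t * np.exp(p[1] * t)])
print(np.round(levenberg_marquardt(res, jac, [1.0, 1.0]), 3))  # should recover ~ (2.0, 1.5)
```

Within LM-GMDH this loop would replace OLS for each PD, with steps 9 to 11 then applying the usual external-criterion pruning across the layer.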

Fuzzy GMDH

In non-fuzzy or standard GMDH, the residuals between the observed and estimated output values are assumed to be Gaussian, thereby allowing parameter estimation through linear regression (Anastasakis and Mort 2001). This assumption does not always hold, however, meaning OLS is not appropriate (Anastasakis and Mort 2001). Real-world systems adhere to Zadeh's (1980) principle of incompatibility, so the more appropriate modelling procedure and theory is fuzzy (Anastasakis and Mort 2001). In dealing with fuzzy phenomena and fuzzy data, Tanaka et al. (1989) suggest utilising possibility measures to describe the fuzzy system equation. Deviations between observed and model values are normally categorised as measurement errors within a standard regression model (Tanaka et al. 1989). Within the fuzzy context, the system parameters are considered responsible; correspondingly, the deviations are reflected in possibilistic linear systems (Tanaka et al. 1989). In fuzzy GMDH (FGMDH), the architecture remains unchanged, but the PDs contain fuzzy parameters, which require possibilistic linear regression to be found (Anastasakis and Mort 2001). A mathematical description, as detailed by Hayashi and Tanaka (1990), follows: a fuzzy number M with membership function μM : ℝ → [0, 1] satisfies:

$${M}_{\lambda }=\left\{x:{\mu}_M(x)\ge \lambda \right\}\to \mathrm{closed}\ \mathrm{interval}$$
(6)
$$\exists x\ \mathrm{such}\ \mathrm{that}\ {\mu}_M(x)=1$$
(7)
$${\mu}_M\left(\lambda {x}_1+\left(1-\lambda \right){x}_2\right)\ge {\mu}_M\left({x}_1\right)\bigwedge \kern0.5em {\mu}_M\left({x}_2\right)\ \mathrm{with}\ \lambda \in \left[0,1\right]$$
(8)
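Conditions (6) to (8) state that every λ-cut of M is a closed interval, that M is normal (some x attains full membership), and that μM is quasi-concave. A quick numerical sketch with a triangular membership function (an assumed, illustrative fuzzy number) checks the latter two conditions directly:

```python
import numpy as np

def tri(x, a=0.0, m=1.0, b=3.0):
    """Triangular membership function peaked at m: a simple fuzzy number."""
    x = np.asarray(x, dtype=float)
    return np.clip(np.minimum((x - a) / (m - a), (b - x) / (b - m)), 0.0, 1.0)

# Condition (7): normality -- some x attains membership 1.
assert tri(1.0) == 1.0

# Condition (8): quasi-concavity -- the membership of any convex combination
# is at least the minimum of the endpoint memberships.
rng = np.random.default_rng(2)
x1, x2 = rng.uniform(-1, 4, size=(2, 1000))
for lam in (0.0, 0.25, 0.5, 0.75, 1.0):
    mix = tri(lam * x1 + (1 - lam) * x2)
    assert np.all(mix >= np.minimum(tri(x1), tri(x2)) - 1e-12)

# Condition (6): each λ-cut {x : μ(x) ≥ λ} of the triangle is the closed
# interval [a + λ(m − a), b − λ(b − m)].
print("fuzzy-number conditions hold")
```

Possibilistic regression then estimates such fuzzy numbers as the PD parameters, rather than the crisp OLS coefficients of standard GMDH.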

An alternative to the linear or quadratic polynomials that form the PDs is orthogonal polynomials (Zaychenko and Zaychenko 2019). Their orthogonality precipitates faster coefficient determination than non-orthogonal polynomials, and the coefficients are not dependent upon the initial polynomial degree (Zaychenko and Zaychenko 2019). Chebyshev's polynomial approximation is a form of regression that minimises autocorrelation between the model response and the sampling locations (Nakajima 2006). Chebyshev orthogonal polynomials are especially well suited to equally spaced sample points (Nakajima 2006), which is the case with the BoM rainfall data. A mathematical representation of the general case of Chebyshev's orthogonal polynomials can be found in Zaychenko and Zayets (2001). PDs featuring trigonometric polynomials are also an option with FGMDH, with a mathematical outline provided in Zaychenko and Zaychenko's (2019) paper. The advantage that FGMDH offers over standard MIA-GMDH is that OLS is not required for determining the PD parameters. Fuzzy GMDH can be used with both crisp and fuzzy regressors, which extends its applicability for potential rainfall modelling. Zaychenko and Zaychenko (2019) reported highly accurate results when modelling financial processes with FGMDH. Shi et al. (2020) found that FGMDH identified regional economic bottlenecks within China with high recognition accuracy compared to standard GMDH. Panchal et al. (2014), in their study of rainfall-runoff modelling using fuzzy logic, returned a coefficient of determination of 0.988. They did not combine it with GMDH, but the result illustrates the gains that the fuzzy approach can achieve. No papers have been located to date that use FGMDH for rainfall modelling.
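The numerical advantage of an orthogonal basis on equally spaced samples can be illustrated with numpy's Chebyshev module; the signal, degrees, and sample count below are illustrative assumptions rather than the BoM data itself.

```python
import numpy as np
from numpy.polynomial import chebyshev as C

# Equally spaced sample points, as with the monthly BoM record.
x = np.linspace(-1, 1, 48)
deg = 12

mono = np.vander(x, deg + 1)       # monomial (raw polynomial) design matrix
cheb = C.chebvander(x, deg)        # Chebyshev design matrix

# On equispaced points the Chebyshev basis gives a far better-conditioned
# least-squares problem than raw polynomials, so coefficient determination
# is numerically more stable.
print(np.linalg.cond(mono) > 10 * np.linalg.cond(cheb))

# Fitting a smooth signal: degree-9 Chebyshev regression recovers sin(3x)
# to well within 1e-2 over the interval.
coef = C.chebfit(x, np.sin(3 * x), 9)
print(np.allclose(C.chebval(x, coef), np.sin(3 * x), atol=1e-2))
```

Within FGMDH, PDs built from such a basis would have their (fuzzy) coefficients determined by possibilistic regression rather than the `chebfit` least-squares call used here for illustration.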

Neuro-fuzzy GMDH

Neuro-fuzzy GMDH is derived from the application of Gaussian radial basis functions (GRBF) as PDs (Anastasakis and Mort 2001). Radial basis functions (RBF) are univariate functions that provide a mechanism for approximating multivariate functions through linear combinations of terms (Buhmann 2010). In considering the GRBF as fuzzy production rules, Mamdani (1976) explored a fuzzy reasoning rule: if x1 is Ak1 and x2 is Ak2, then the output is y = wk. Let the Gaussian membership function Akj of the kth fuzzy rule (k = 1, …, 4) in the domain of the jth input variable xj (j = 1, 2) be defined as:

$${A}_{kj}\left({x}_j\right)=\exp \left\{-\frac{{\left({x}_j-{a}_{kj}\right)}^2}{b_{kj}}\right\}$$
(9)

with parameters akj and bkj being given for each rule. Let wk be a real number from the conclusion of the kth rule suggesting model output y. The degree of compatibility from the proposition is:

$${\mu}_k=\prod_{j=1}^{2}{A}_{kj}\left({x}_j\right)$$
(10)

with the model output defined as

$$y=\sum\nolimits_{k=1}^4{\mu}_k{w}_k$$
(11)
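Equations (9) to (11) can be sketched directly in numpy; the four rule centres, widths, and conclusion weights below are hypothetical values chosen purely for illustration.

```python
import numpy as np

def grbf(x, a, b):
    """Eq. (9): Gaussian membership A_kj(x_j) = exp(-(x_j - a_kj)^2 / b_kj)."""
    return np.exp(-((x - a) ** 2) / b)

def fuzzy_model(x1, x2, a, b, w):
    """Eqs. (10)-(11): fire all K rules, weight conclusions by compatibility."""
    mu = grbf(x1, a[:, 0], b[:, 0]) * grbf(x2, a[:, 1], b[:, 1])  # eq. (10)
    return mu @ w                                                  # eq. (11)

# Hypothetical 4-rule model: centres a_kj, widths b_kj, conclusions w_k.
a = np.array([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
b = np.full((4, 2), 0.5)
w = np.array([0., 1., 1., 0.])          # XOR-like conclusions

# Near rule centre (0, 1) the second rule dominates, so y is close to w_2 = 1.
print(round(fuzzy_model(0.0, 1.0, a, b, w), 2))
```

Each such fuzzy reasoning unit plays the role of one PD in the NFGMDH network described next.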

The GRBF network is a neural network composed of three layers (Poggio and Girosi 1990). The neuro-fuzzy GMDH (NFGMDH) is formed by implementing this fuzzy reasoning model as the PDs (Mucciardi 1972, as cited in Nagasaka et al. 1995). NFGMDH follows the same paradigm as standard GMDH. With the GRBF as PDs, the inputs to the βth model in the ρth layer are the output variables of the (β − 1)th and βth models within the (ρ − 1)th layer (Nagasaka et al. 1995; Najafzadeh 2015). The mathematical expression for ascertaining yρβ is:

$${y}^{\rho \beta}=f\left({y}^{\rho -1,\beta -1},{y}^{\rho -1,\beta}\right)=\sum\nolimits_{k=1}^K{\mu}_k^{\rho \beta}{w}_k^{\rho \beta}$$
(12)
$${\mu}_k^{\rho \beta}=\exp \left\{-\frac{{\left({y}^{\rho -1,\beta -1}-{a}_{k,1}^{\rho \beta}\right)}^2}{b_{k,1}^{\rho \beta}}-\frac{{\left({y}^{\rho -1,\beta }-{a}_{k,2}^{\rho \beta}\right)}^2}{b_{k,2}^{\rho \beta}}\right\}$$
(13)

with \({\mu}_k^{\rho \beta}\) and \({w}_k^{\rho \beta}\) the kth Gaussian function and its associated weight parameter, respectively. Additionally, \({a}_k^{\rho \beta}\) and \({b}_k^{\rho \beta}\) are the Gaussian parameters for each input variable supplied by the βth model and ρth layer (Nagasaka et al. 1995; Najafzadeh 2015). The complete model output is:

$$y=\frac1M\sum\nolimits_{m=1}^My^{\rho\beta}$$
(14)

Through each iteration of the network model construction, the error parameter is:

$$E=\frac{{\left({y}^{\ast }-y\right)}^2}{2}$$
(15)

with y* the observed and y the predicted output value (Nagasaka et al. 1995; Najafzadeh 2015).

In terms of successful outcomes, Nagasaka et al. (1995) reported from their research in utilising NFGMDH for modelling grinding characteristics with GRBF PDs that it delivered an improved result when compared with standard GMDH. Yousefpour and Ahmadpour (2011), from their research into air pollution predictions with NFGMDH, found improved results when compared to those obtained from a multilayer perceptron (MLP) neural network. Miyagishi et al. (2010) found an improvement when compared to the Kalman filter for temperature prediction when the seasonal change was moderate. No papers were located where NFGMDH was used to model rainfall data.

Exogenous data

Exogenous data is additional data that is not required to run a model but can increase modelling accuracy. Such data was used in the research study undertaken by Moosavi (2019), investigating the impact of rainfall on natural disasters. GMDH was utilised with three signal processing approaches (EEMD, WT, and WPT) in addition to exogenous data. Two distinct datasets were used to predict rainfall, one without exogenous data and one with. The exogenous data were evaporation, minimum and maximum temperature, and humidity, all on a monthly timescale. Modelling rainfall with 1-to-4-month forecasts using GMDH alone delivered poor performance. Including the exogenous data improved the GMDH modelling, though overall performance was still poor. Hybridising GMDH with each of the signal processing techniques in turn improved the rainfall modelling, which improved further when the exogenous data were included. Exogenous data were also used by Mendoza et al. (2020) in their research study into rainfall in Ecuador. The modelling was undertaken within a dynamic harmonic regression framework where global climate signals formed the exogenous data. Their results were favourable, illustrating greater reliability and robustness against limitations within the data. In a study undertaken by Sarveswararao and Ravi (2020) covering ATM cash demand forecasting, GMDH was supplied with a dummy exogenous variable in the form of the day of the week. Supplied with this information, the GMDH modelling delivered an improvement in the symmetric mean absolute percentage error (SMAPE) compared to modelling without the dummy exogenous variable. From the evidence presented, the GMDH paradigm readily accommodates exogenous data, which improves model success.
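The effect reported in these studies can be mimicked with a small synthetic experiment comparing hold-out MSE with and without exogenous regressors. All series and coefficients here are fabricated for illustration and do not reproduce any cited result.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 240                                         # 20 years of monthly steps
month = np.arange(n)
humidity = 60 + 10 * np.sin(2 * np.pi * month / 12) + rng.normal(0, 2, n)
temp_max = 25 + 8 * np.sin(2 * np.pi * month / 12 + 0.5) + rng.normal(0, 1, n)
# Synthetic rainfall driven partly by the exogenous variables.
rain = 0.8 * humidity - 0.5 * temp_max + rng.normal(0, 3, n)

lag1 = np.roll(rain, 1)[1:]                     # endogenous lag-1 feature
y = rain[1:]

def ols_mse(X, y, split=180):
    """Fit OLS on the first `split` samples, return MSE on the remainder."""
    X = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(X[:split], y[:split], rcond=None)
    return np.mean((X[split:] @ coef - y[split:]) ** 2)

mse_endo = ols_mse(lag1[:, None], y)                         # lagged rain only
mse_exo = ols_mse(np.column_stack([lag1, humidity[1:], temp_max[1:]]), y)
print(mse_exo < mse_endo)
```

Because the synthetic rainfall is partly driven by humidity and temperature, supplying them as exogenous regressors reduces the hold-out error, echoing (in a toy setting) the improvements reported by Moosavi (2019) and Sarveswararao and Ravi (2020).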

Discussion

As a mechanism for enhancing the usability, and possibly the capacity, of standard GMDH to model rainfall data, this study considers alternative platforms for further investigation. Gaining a deeper understanding of the GMDH algorithm could allow for greater modelling success. Luzar and Witczak (2014) utilised MATLAB for implementing the GMDH algorithm; advantages of this approach include control over state vector dimensions and a choice of multiple selection and stopping criteria. Onwubolu (2014) details the implementation of GMDH in the C programming language, and Onwubolu (2016) describes the implementation of the GMDH paradigm in MATLAB. Dag and Yozgatligil (2016) investigated short-term forecasting with a GMDH R-package running within the R workspace. This option allows the implementation of different transfer functions, permitting selection of the one that delivers the smallest prediction MSE. The Hilbert-Huang transform (HHT) method, although not specifically within the scope of this paper, is worth mentioning as a combination of EMD and Hilbert spectral analysis (HSA) (Huang and Shen 2014). The HHT is potentially suitable for analysing nonlinear and non-stationary data (Huang and Shen 2014). Bowman and Lees (2013) discuss the HHT package available for the R programming language. As GMDH can also be implemented on the R platform, pre-processing the rainfall data with HHT is clearly feasible, and this paper suggests the combination as a viable option for further investigation.

Conclusion

The GMDH hierarchical structure of the COMBI and MIA algorithms has been discussed in this paper with an investigation of state variable distribution, their classification, and the synthesis of PDs. The limitations of OLS in determining PD coefficients, the inherent potential for biased estimates, the significance of fuzzy input data, and the integration of Gödel's incompleteness theorem point to the requirement for an external criterion. The methods for modelling improvement covered hybridising with LSSVM, which was shown to be successful for time series forecasting, although modelling of rainfall data appears not to have yet taken place. Implementing LSSVM requires normalised data, which is distinct from how data is provided to standard GMDH. This provides an opportunity to test the application of normalised data for both standard and hybridised GMDH, and allows for further research in utilising LSSVM with GMDH for rainfall modelling and forecasting. The integration of Kalman filters into the GMDH paradigm for forming PD parameters was detailed, with success in modelling dynamic systems and fault detection. This pairing has not yet been applied to rainfall modelling and forecasting, leaving an opportunity for further research in this area. Hybridising GMDH with each of three signal processing techniques (EEMD, WT, and WPT) delivered an improvement in rainfall prediction within the context of natural disasters, with WPTGMDH delivering the largest improvement. It would be interesting to see the outcome of further rainfall modelling using these signal processing techniques within the Australian context, particularly if it covered the same six LGAs in the first author's original study.

Enhanced MIA-GMDH has received exposure to rainfall modelling but delivered results deemed unsatisfactory, quite distinct from its success in temperature modelling. This paper recommends this as an ideal area for further research encompassing rainfall modelling through hybridisation with one or more of the applications already discussed. Hybridising the LM algorithm with GMDH for determining the PD parameters was very successful, delivering a high degree of accuracy in inventory control. This pairing has not yet been found to have been applied to rainfall modelling, providing an opportunity for further research in this area. It would be interesting to see what benefit the modelling offers for rainfall forecasting when the PD parameters are not confined to OLS determination. Fuzzy GMDH finds favour when modelling real-world systems, given their inherently fuzzy nature and resulting adherence to Zadeh's principle of incompatibility. The PD parameters within FGMDH require determination by possibilistic linear regression, or the PDs take the alternative form of orthogonal polynomials. FGMDH can model with both crisp and fuzzy input regressors, thereby widening its application base. Fuzzy logic, as distinct from FGMDH, was used with great success in Panchal et al.'s (2014) study of rainfall-runoff modelling, returning a coefficient of determination of 0.988. This paper highlights the potential for further research into rainfall modelling and forecasting with the application of FGMDH. NFGMDH, which utilises GRBFs as PDs, delivered improvements in previous studies modelling grinding characteristics and air pollution predictions when compared to standard GMDH and an MLP. No studies have been found where NFGMDH has been applied to rainfall modelling, but the opportunity exists for further research in this area.
The inclusion of exogenous data improved the modelling in each of the cited studies. This paper considers it a worthwhile proposition to include exogenous data in all the modelling avenues discussed, as the evidence suggests an improvement in the model will be achieved.