Introduction

REIT is the shorthand for real estate investment trust, which is an open-end trust run by a company that allows people to invest in real estate in a non-physical purchase. The US Congress built the first REIT and introduced four main regulations to protect investors and regulate REIT. First, REIT contains at least 75% of assets in real estate, government bonds, and cash. Second, they must have more than 50% of the total shareholding of more than five people combined. Third, more than 75% of income should come from renting or selling real estate, or from mortgage interest. Fourth, at least 95% of taxable income must be distributed as dividends each year (Irem et al. 2020; Brueggeman & Fisher 2022).

Based on their characteristics, REITs are classified into three types: equity-REIT (EREIT), mortgage-REIT (MREIT), and hybrid-REIT (HREIT). EREIT is a publicly traded company whose primary business is the acquisition, management, renovation, maintenance, and, in some cases, sale of real estate properties. MREITs issue and hold loan and other debt instruments backed by real estate, and their dividend yield is generally higher than MREIT. Meanwhile, HREIT combines the features and operations of EREIT and MREIT (Block 2012; Hansz et al. 2017).

REITs have become a popular financial product over the last half-century, and more than 40 countries have now established REIT markets to securitize their real estate assets. The global REIT market is now worth over $2 trillion. REITs have become a vital investment option for many investors today. Ott et al. (2005) indicated that REITs have become one of the most popular investment options for investors, and the market has reached maturity in many markets and regions outside the United States, such as Australia and Canada. Furthermore, because the majority of REIT earnings are distributed to investors, they are considered a more profitable financial product than common stock portfolios (Mori & Ziobrowski 2011). A new investment trend is to construct investment portfolios by integrating common stocks and REITs; such activities increase investment portfolios’ diversification and profitability (Anderson et al. 2015). REIT selection and return prediction, like stocks, are two important strategies for REIT investment that affect principal and investment returns and help investors optimize their portfolio returns (Lee & Pai 2010).

REIT prices are important in the investment process. Predicting the returns of various REIT products can assist in determining whether they should be added to the portfolio; additionally, it can assist in determining the best trading operations, such as buying, holding, or selling these REIT products (Cici et al. 2011). Researchers have attempted to develop and apply various traditional and machine learning (ML) approaches to improve the selection and price prediction of REIT products.

The fintech and ML approaches have become popular across diverse research areas, such as stock market analysis, users’ behavior identification, biometrics, customer service, cyber defense, bankruptcy prediction, and peer-to-peer network analysis (Livingston 2005; Maxwell et al. 2018; Kou et al. 2021a, b; Li et al. 2022). Conventional methods have a fundamental difference from the ML approach in that they are model-based and rely on a parametric model constructed based on domain knowledge. For example, the underlying process of the GARCH model is a plausible mathematical model. The data are then used to fit the parametric model to find the optimal parameter settings. By contrast, ML is purely data-driven and non-parametric. A learning algorithm (support vector machine (SVM), neural network, etc.) is selected to apply to the data to learn a function that best maps inputs to outputs. There is no requirement that this learned function be a close approximation of the true (but unknown) function underlying the data—in most cases, one simply chooses a convenient ML algorithm that is expressive enough and then uses data to learn an ML model that can best map inputs to outputs on all possible observations in the training set (Dixon et al. 2020).

Nevertheless, the ML method in REIT prediction remains relatively new. Existing studies mainly focused on using the ML approach to construct portfolios for REIT investment. This method can forecast REIT performance in the future and assist investors in making better investment decisions to optimize their REIT portfolio or control the proportion of REIT in multi-asset portfolios. For example, Li et al. (2017) studied the performance of the GMDH to predict the long-term return trend of REITs, whereas Loo (2019) used an artificial neural network (ANN) to estimate the Hong Kong REIT market’s future performance.

The Group Method of Data Handling (GMDH) is used in this study to forecast US REIT returns. Using the GMDH has several advantages. First, the GMDH relies less on the input because of its self-organization programming. Second, GMDH automatically determines the number of hidden layers, ensuring that the training process selects the most relevant variables (Li et al. 2017). In their study related to the electricity market, Yang et al. (2018) found that GMDH can process complex datasets in a highly effective manner in forecasting the daily electricity usage. Similarly, Srinivasan (2008) claimed that GMDH shows more precise prediction results than traditional regression models in short-to-medium-term energy consumption.

The following factors motivate our research. First, REIT investment has grown in popularity; thus, investigating the effectiveness of ML in REIT prediction is worthwhile, as it is one of the most popular data mining techniques in recent years. Second, when compared with traditional data analysis techniques, the application of ML in REIT investment is still in its early stages. Although many scholars have studied the implementation of ML in REIT selection in recent years, the accuracy of the ML technique in REIT prediction remains questionable (Li et al. 2017; Loo 2019). Thus, it is worthwhile to compare the accuracy of ML and traditional models. Third, when dealing with systemic financial risk, such as the one caused by COVID-19, REIT is an important asset. The outbreak of COVID-19 causes a setback for the REIT and the stock market, with both losing 30% of their market capitalization in a short time (Hui & Chan 2022). However, stock indexes, such as S&P 500 and Dow Jones Industrial Average, which took six months to complete a U-shaped recovery, it took only four months for the REIT markets to bounce back. REITs seem to have a stronger ability to hedge the sysmatic financial risk and thus are valuable in portfolio diversification.

We contribute to the literature by introducing a new method for predicting the US REIT market, despite the fact that a previous study applied GMDH to REIT in some non-US markets and showed that it is an effective tool for REIT prediction (Li et al. 2017). Nevertheless, the previous study’s results may not be representative because the GMDH algorithm was not applied to the US REIT, which constitutes 2/3 of the REIT market in the world. The US REIT is important to the global REIT market not only because of its market share, but also because it is the country with the largest variety of REITs. Furthermore, our study discusses in detail the impact of different kernel functions on the GMDH neural network for predicting the US REIT market. Our out-of-sample test yields day-by-day prediction results using various kernel functions. The discovery of significant differences in prediction accuracy between kernel functions for different prediction horizons is novel in the literature.

Furthermore, based solely on the GMDH model’s analysis, it is difficult to ascertain whether the GMDH model outperforms traditional models in predicting REIT returns (Li et al. 2017; Loo 2019). To ensure the robustness of the evidence, this study extends the analysis and compares the prediction of the GMDH model with that of the traditional GARCH model.

The remainder of the study is organized as follows: "Literature review: REIT prediction with traditional and ML methods" section reviews the literature and summarizes empirical studies conducted over the last two decades in both traditional and ML settings. "Model specifications and data" section presents the GMDH and GARCH models and discusses the variables and data used in the study. "Empirical results and analysis" section presents and analyzes the empirical results. “Discussion” section provides a discussion. Finally, “Conclusions” sectioon concludes the paper.

Literature review: REIT prediction with traditional and ML methods

Previous research has applied and developed many different traditional and ML methods to predict the REITs’ return and price. Traditional methods dominated the REIT price and return studies before 2007.

An earlier study notes that a simple linear regression model is difficult to obtain good out-of-sample testing results in the US REIT return prediction (Ling et al. 2000). Researchers use multifactor models to respond, and they find that REIT return is highly correlated with dividend, return history, price momentum, liquidity, and profitability factors (Chui et al. 2003; Cheng & Roulac 2007; Olanrele et al. 2014). Furthermore, Li & Lei (2011) found that the macroeconomy plays a significant role in determining REIT's performance; meanwhile, the REIT market’s performance has an impact on the macroeconomy. Using Fama–Macbeth approach, Shen (2021) determined that distressed REIT securities earn high future returns, whereas Shen et al. (2021) showed that low-beta REITs deliver a significantly higher risk-adjusted return than high-beta REITs. Swinkels (2023) noted that the real estate index has a significant impact on the value of real estate tokens, which are digital tokens similar to REITs.

The vector autoregression model is another traditional model commonly used in REIT research. According to Lu and So (2001), the REIT return is negatively related to the economy’s inflation rate. Ling and Naranjo (2006) found that new fund flow has a greater effect on REIT price than trading volume. Moreover, some researchers attempt to predict REIT returns using publicly available information; for example, Ling and Naranjo (2015) used publicly available information to predict REIT returns, but their results are unsatisfactory. Sirmans et al. (2006) reported that management changes cannot be used to predict REIT performance. According to Siew (2015), random events may cause a shift in the development trend of the Australian REIT market. Although many researchers have used traditional approaches to study REIT, they acknowledge that ML may be a more efficient method of research (Cheng & Roulac 2007; Siew 2015). Several researchers have also advocated for the use of ML and emerging algorithms in financial analysis (Kou et al., 2022).

Since 2007, ML has been widely used in REIT research. Lertwachara (2007) compared the return of a randomly chosen portfolio with that of an ANN-based portfolio; the ANN outperforms the randomly chosen portfolio by 26.05% higher return. Feng & Li (2014) created an REIT portfolio with 20% above market returns via an SVM. Wang et al. (2016) employed SVM and back-propagation neural network in the Singaporean REIT market; they report that ML approaches predict REIT performance more accurately than simple regression models, and that the selection of variables affects the simulation outcomes. According to Li et al. (2017), the GMDH neural network can accurately predict REIT performance in following countries and region: Australia, Hong Kong, Italy, and Turkey. Furthermore, Hausler et al. (2018) captured and analyzed investor sentiment via the SVM; they reported that, in addition to financial and environmental factors, news-based sentiment has a significant impact on the returns of REITs and security markets.

The review finds that ML methods can improve portfolio selection as well as REIT performance prediction. The return of portfolios constructed using ML approaches is generally higher than the return of portfolios constructed using traditional methods for the construction of REIT portfolios (Le 2006; Lertwachara 2007; Feng & Li 2014; Loo 2020). In terms of REIT return prediction, the accuracy of results obtained from ML models is higher than that obtained from non-ML methods (Wang et al. 2016; Li et al. 2017; Loo 2019; Antunes 2021). In summary, this review suggests that it is still a relatively new area of research to use ML techniques to predict REIT returns. Most researchers continue to focus on using ML to build portfolios for REIT investment.

However, the review of the literature reveals some challenges in implementing ML approaches in REIT prediction. First, the GMDH neural network can predict the return of the REIT in four countries and region in a timely and accurate manner (Li et al. 2017). However, the study overlooked the US REIT market, which is the world’s largest. The US Congress created the world’s first REIT in the 1960s and established REIT-related regulations. Other countries and regions have established REITs in accordance with US regulations. The US REIT market capitalization reached $1,352.41 billion at the end of 2021, accounting for 67.4% of the global market. In contrast, the total capitalization of REITs in these four markets is only $124.59 billion, accounting for only 6.21% of the global market (EPRA 2022). The REIT market in the United States is more iconic and representative of the global REIT trend. As a result, the GMDH neural network should be assessed on the US REIT market. Second, previous research has not addressed the impact of the sample size used in ML on the accuracy of REIT prediction results. Time-series model research shows that REIT prediction results are related to regression analysis datasets. In this case, the study seeks to investigate whether this effect exists in ML approaches.

Based on the above reviews and discussions, we developed the following two testable hypotheses:

Hypothesis 1.

The GMDH neural network predicts the US REIT market more efficiently and accurately than the GARCH model.

Hypothesis 2.

GMDH neural network can predict both short- and long-term returns in the US REIT market.

Model specifications and data

This section describes how GMDH, an ANN-based technique, predicts US REIT prices. It goes over the definitions of the variables as well as the data sources used to estimate the model. The traditional model for predicting REIT returns is the generalized autoregressive conditional heteroscedasticity (GARCH). The results of GMDH and GARCH were compared to assess the relative effectiveness of these two techniques in predicting REIT returns.

The ANN model

Theoretical framework of the ANN model

The general expression of the output of a neuron j is given by:

$$\begin{aligned} & y_{j} = f\left( {net_{j} } \right) \\ & net_{j} = \mathop \sum \limits_{n} w_{jn} x_{n} + w_{j0} = {\varvec{w}}_{{\varvec{j}}} {\varvec{x}} + b \\ \end{aligned}$$
(1)

where \({\varvec{x}}\) is a vector of inputs to the jth neuron, b the bias term, yj is the output of the jth neuron, net the network and w the weights. Figure 1 shows the fundamental structure of a neuron.

Fig. 1
figure 1

Basic structure of a neuron

The feedforward structure, which includes the input layer, multiple hidden layers, and the output layer, is the most common type of ANN. Each layer’s neuron receives signals from the previous layer and generates signals for the next layer. The first and last layers are referred to as the input and output layers, respectively, while the other intermediate layers are referred to as the hidden layers. The neurons in two adjacent layers are completely linked (Hornik et al. 1989; Svozil et al. 1997; Liu & Chen 2020).

The training process starts the ANN with random weights and iteratively updates the weights using the back-propagation (BP) algorithm. The BP algorithm propagates the error computed at the output during training back through the layers, adjusting the weight of each neuron accordingly until the network converges to a local optimum. Following the completion of the training process, the network’s performance is evaluated using the testing dataset. ANNs have been demonstrated to be effective algorithms and are widely used in complex nonlinear function mapping, image processing, and pattern recognition (Svozil et al. 1997).

Our study employs GMDH, as proposed by Li et al. (2017), to forecast US REIT returns. GMDH has a number of advantages that make it appealing for REIT research. First, the self-organization and control characteristics of GMDH ensure that the output has few effects from the input. Second, because GMDH algorithms do not require many assumptions, they are relatively easy to develop. Third, the GMDH method enables each layer to obtain the best structure in the training process to find the most relevant variables and eliminate irrelevant variables automatically. Fourth, GMDH can automatically determine the number of network layers and the neurons contained within them (Li et al. 2017).

The structure of the GMDH neural network is depicted in Fig. 2. The GMDH network is made up of a series of identical modules that are linked together. The computing process of one module, for example, the process from \({\mathrm{x}}_{\mathrm{i}}^{\left(k\right)}\) to \({\mathrm{y}}_{\mathrm{i}}^{(k)}\) can be explained as follows. The \({\mathrm{x}}_{\mathrm{i}}^{\left(\mathrm{k}\right)}\) is a series of input variables and \({\mathrm{y}}_{\mathrm{i}}^{(k)}\) is a series of the output of the k-th module. From \({\mathrm{x}}_{\mathrm{i}}^{\left(k\right)}\) to \({\mathrm{y}}_{\mathrm{i}}^{(k)}\), a kernel function is selected to connect the input variables and the outputs. In general, the kernel function can be expressed as \(y=f({x}_{1},{x}_{2},\dots {x}_{i}, \dots , {x}_{j}, \dots , {x}_{n})\). In this equation, \({x}_{i}\) and \({x}_{j}\) indicate the input variables, and n represents the number of variables. The three kernel functions used are as follows: a linear function, a linear covariance function, and a quadratic function. With various models, these kernel functions can resolve underfitting and overfitting issues. The most commonly used kernel function for GMDH is shown in Eq. (4). Here, w represents the weights assigned to these variables during the process of obtaining outputs from the input layer to the output layer.

Fig. 2
figure 2

Structure of the GMDH model

$$\mathrm{y}={w}_{0}+{w}_{1}{x}_{i}+{w}_{2}{x}_{j}$$
(2)
$$\mathrm{y}={w}_{0}+{w}_{1}{x}_{i}+{w}_{2}{x}_{j}+{w}_{3}{x}_{i}{x}_{j}$$
(3)
$$y=G({x}_{i}, {x}_{j})={w}_{0}+{w}_{1}{x}_{i}+{w}_{2}{x}_{j}+{w}_{3}{x}_{i}{x}_{j}+{w}_{4}{x}_{i}^{2}+{w}_{5}{x}_{j}^{2}$$
(4)

The results are labeled as G in Fig. 2. To select the appropriate variables from the intermediate outputs to feed to the next module, a filtering procedure is used. When the appropriate variables are chosen, the next module’s calculation repeats the operations of the previous module until the final outputs are obtained.

Variables used in the ANN model

Several variables are directly inputted into the training process, allowing the GMDH method to select the appropriate variables. Table 1 defines the variables used in the training test. This empirical analysis takes into account four types of variables: volatility, trend, momentum, and return rates. These variables are, in general, standard technical indicators used in the stock market to forecast stock prices. They were chosen to forecast the return on the REIT index in the US market. “Appendix 1” contains detailed definitions and explanations of these variables.

Table 1 List and definitions of the variables used in GMDH neural network

Computation pipeline

The ML method’s computation pipeline is divided into six steps: data collection, technical indicator measurements, training, test, accuracy evaluation, and discussion. Each step is depicted in Fig. 3.

Fig. 3
figure 3

Computation pipeline of machine learning process

GARCH model

The GARCH model was developed around 40 years ago (Bollerslev 1986). Previous studies compared the GARCH model’s performance in predicting stock returns and volatility with that of other traditional financial models, such as the Capital Asset Pricing Model (CAPM) and the Stochastic Volatility Model. Prior studies have demonstrated the outperformance of GARCH models over other models (Ng 1991; Fung et al. 2014; Maneemaroj et al. 2021). In addition, the GARCH model supports a significantly positive relationship between risk and return in several markets (Darrat et al. 2011). Therefore, this study employs the GMDH model to forecast US REIT and stock market returns and compares its performance with that of the traditional financial data analysis model, GARCH. The general form of the GARCH model is as follows (Lee & Pai 2010; Cho & Elshahat 2011; Zhou & Kang 2011):

$${\mathrm{r}}_{\mathrm{t}}=\mathrm{ln}\left({p}_{t}\right)-\mathrm{ln}({p}_{t-1})$$
(5a)
$${\mathrm{r}}_{\mathrm{t}}=\mu +\alpha {r}_{t-1}+{\varepsilon }_{t}$$
(5b)
$${\upsigma }_{\mathrm{t}}^{2}=\omega +\beta {\varepsilon }_{t-1}^{2}+\gamma {\sigma }_{t-1}^{2}$$
(5c)

The expression (5a) shows how the dependent variable is measured (Lee & Pai 2010; Zhou & Kang 2011). The \({p}_{t}\) and \({p}_{t-1}\) are the prices of the REIT at t and t-1, respectively. \({r}_{t}\) indicates REIT return at time t, and ln the logarithm value of price. The daily log return used in the GMDH neural network can be used directly in the estimation process.

The expression (5b) shows how the previous instance’s information affects the REIT return. In this expression, \({r}_{t-1}\) indicates the return value in the previous instance, and \({\varepsilon }_{t}\) represents how the returns are innovated; it depends on the return variance and a distribution function. Equation (6a) presents the expression of \({\varepsilon }_{t}\) (Lee & Pai 2010). According to the expressions shown in Eq. (6), the value of \({\varepsilon }_{t}\) depends on the variance value and the traditional distribution value at the instance t (Lee & Pai, REIT volatility prediction for skew-GED distribution of the GARCH model, 2010; Zhou & Kang 2011). Here, the density function of the error term \({z}_{t}\) is a standard normal distribution, shown in Eq. (6b) (Lee & Pai 2010).

$${\varepsilon }_{t}={\upsigma }_{\mathrm{t}}{z}_{t}$$
(6a)
$$\mathrm{f}\left({\mathrm{z}}_{\mathrm{t}}\right)=\frac{1}{\sqrt{2\pi }}{e}^{-\frac{{z}_{t}^{2}}{2}}$$
(6b)

Equation (5c) shows the calculation of the variance in the prediction process (Lee & Pai 2010; Zhou & Kang 2011). This equation shows that the variance at time t depends on its previous value and the innovation value in the last instance.

The other variables shown in Eq. (5), including \(\mu\), \(\alpha\), \(\beta\), \(\gamma\), and \(\omega\), are the coefficients obtained from the estimation process. They can be constants (Lee & Pai, REIT volatility prediction for skew-GED distribution of the GARCH model, 2010), or they can be changed to meet proper distribution function (Cho & Elshahat 2011); this is one way to improve the GARCH model’s performance and accuracy. In this empirical study, no distribution functions are used to measure these coefficients, and their values are obtained directly from regression estimation.

When the REIT’s return is predicted by Eq. (5b), its price at time t can be recalculated by transforming the Eq. (5b).

$${p}_{t}={p}_{t-1}{e}^{{r}_{t}}$$
(7)

Data sources

Our daily data come from DataStream and Bloomberg. The US REIT index includes 135 of EREIT, MREIT, and HREIT stocks. These REITs invest in the following industries: residential, industries, retail, health care, offices, hotel & resort, and mortgages. Our variables, such as momentum, volatility, and trend, are calculated from S&P500. The total number of daily observations used in this study was over 2 million.

The sample period for the US REIT daily data is from October 6, 2016, to July 30, 2021. The US REIT index includes 135 of EREIT, MREIT, and HREIT stocks. These REITs invest in the following industries: residential, industries, retail, health care, offices, hotel & resort, and mortgages. The S&P500 is used to calculate our variables, such as momentum, volatility, and trend. The number of training and test samples is controlled to investigate GMDH’s accuracy under different scenarios. The training cases considered are 30, 100, 200, 300, and 600 observations, with testing samples of 30 and 60 observations. A sample is an observation at a specific time point in this context. To compare the performance of ML and traditional approaches, we estimate the GARCH model with the same number of observations.

Empirical results and analysis

The three kernel functions mentioned above are tested when using the GMDH model to predict REIT returns. Meanwhile, cases of various training and testing samples are being tested. The accuracy of these cases’ results is then compared to determine which case produces the best prediction results. In addition, the GMDH model’s results are compared with the GARCH model’s results to determine which model produces more accurate REIT prediction results.

Estimates of the GMDH model

Linear function

In this empirical study, 17 cases were tested. Table 2 contains information about these cases, such as the number of training samples, the number of testing samples, the kernel functions used, and the input variables.

Table 2 Details of the machine learning cases tested in this study

The sample period for these cases with 30 testing samples is from May 13, 2021, to June 30, 2021. The sample period for these cases with 60 testing samples is from March 31, 2021, to June 30, 2021. These cases’ training periods are the transaction days preceding their testing period. The training period of the case linear-30/30, for example, is the 30 transaction days preceding the testing period.

Figure 4 depicts a comparison of actual returns and cases with 30 testing samples when the kernel function is the linear function. Except for the case linear-30/30, the other cases can generally depict the development trend of US REIT returns during the testing period, but the prediction results are almost always lower than the actual returns.

Fig. 4
figure 4

Actual versus predicted returns [Case A: 30 testing samples under linear function]

Table 4 presents the results that compare the gaps between the actual and predicted returns of these cases under the three kernel functions. Table 3 shows that the gaps between actual and predicted returns are much smaller for linear-100/30 and linear-200/30 than for the other cases. However, the gaps between actual and predicted returns for linear-30/30 are the largest of the five cases. Thus, the linear-100/30 and linear-200/30 prediction results outperform the other cases.

Table 3 Gaps between actual and predicted returns under the linear function [The presented value = predicted returns—actual returns]
Table 4 MSE obtained by other studies in predicting returns of stocks and REITs

The mean squared error (MSE) is used to determine the predictability of the results. It computes the mean squared value of the difference between predicted and actual values, as shown below (Pai & Lin, 2005; Wang & Bovik 2009):

$$\mathrm{MSE}=\frac{{\sum }_{i=1,n}{\left({y}_{i}^{real}-{y}_{i}^{predicted}\right)}^{2}}{n}$$
(8)

A lower MSE indicates that the prediction is more accurate. However, preliminary research does not suggest that there are fixed criteria for determining what value of MSE is acceptable when the MSE is used to assess prediction accuracy. Table 4 summarizes the findings from studies on stock prediction and REIT prediction to determine the acceptable level of MSE. When different models were used to predict stock and REIT returns, different studies obtained different MSE values.

Compared with the MSE values of the five cases under the linear function shown in Table 3, case linear-200/30 has the lowest value, 0.70, while case linear-100/30 has a very close MSE value, 0.72. These results show that the test accuracy in the two cases is higher than in the other three cases, which is consistent with the analysis of the gaps between actual and predicted returns. Furthermore, when the MSE values of the five cases are compared with the previous works, the linear function’s accuracy is similar to the findings of Kogan et al. (2009). However, this study can accept prediction results when the MSE is less than 1.45 because the MSE value is still small. In other words, except for linear-30/30, the results of all cases can be accepted.

Linear covariance function

When the linear covariance function is used as the kernel function in the GMDH model, four cases with the same linear function condition setting are studied. Table 5 describes the details of these cases, which are labeled linear-cov-30/30, linear-cov-100/30, linear-cov-200/30, and linear-cov-300/30. Figure 6 compares the prediction results under the linear covariance function with the actual returns. This graph shows that, with the exception of case linear-cov-30/30, the prediction results are close to the actual results for all cases under the linear covariance function. The results in Fig. 5 and Table 5 show that the prediction results of the other three tests under the linear covariance function are acceptable.

Table 5 Gaps between actual and predicted returns under the linear covariance function
Table 6 Gaps between actual and predicted returns under the quadratic function
Table 7 Gaps between actual and long-term predicted returns [Under linear and linear covariance and quadratic functions]
Fig. 5
figure 5

Actual versus predicted returns [Case B: 30 testing samples under linear covariance function]

Table 5 shows that with an MSE value of 0.69, the gap between the actual and predicted returns for case linear-cov-200/30 is much smaller than the gaps for the other cases under the linear covariance function. Cases linear-cov-100/30 and linear-cov-300/30 have acceptable gaps of around 1%, with MSE values of 1.05 and 1.08, respectively.

Quadratic function

This study also tests cases where the quadratic function is implemented as the kernel function. In total, six cases are tested: quadratic-30/30, quadratic-100/30, quadratic-200/30, quadratic-300/30, quadratic-600/30, and quadratic-300/60. Table 6 shows the results of these cases. All cases, with the exception of quadratic-300/60, test 30 samples. Figure 6 compares the actual returns in these cases with the predicted results.

Fig. 6
figure 6

Actual versus predicted returns [Case C: 30 testing samples under the quadratic function]

Figure 6 shows that three cases—quadratic-30/30, quadratic-100/30, and quadratic-200/30—have invalid results due to outliers in their predicted returns and the absence of a similar trend as the actual return. Meanwhile, the predicted returns of quadratic-300/30 and quadratic-600/30 are close to the actual returns. As a result, the three cases, quadratic-30/30, quadratic-100/30, and quadratic-200/30, are deemed invalid, and no further investigation is conducted for them.

Table 6 shows the differences between the actual and predicted returns for quadratic-300/30 and quadratic-600/30. Compared with the gaps between actual and predicted quadratic-300/30 results, the gaps between actual and predicted quadratic-600/30 results are much smaller. The respective MSE values for the two cases are 4.35 and 0.62. Thus, only the predicted quadratic-600/30 is acceptable when the quadratic function is used as the kernel function.

Long-term predictions

The above cases have predicted 30 days’ returns of the US REIT. In general, a 30-day period is relatively short, and when the training samples meet the proper conditions, the GMDH model effectively predicts the short-term returns of REIT under the three kernel functions. Under these circumstances, it is worthwhile to determine whether the GMDH model is also capable of predicting REIT long-term returns. Hence, the effectiveness of the GMDH model in predicting 60-day returns was tested in this study, with 300 training samples. The testing period runs from March 31, 2021 to June 30, 2021, with 300 training samples coming before that. In this study, three more cases are tested: linear-300/60, linear-cov-300/60, and quadratic-300/60. Figure 7 depicts a comparison of the three cases’ actual returns and prediction results. Directly, the results of the case quadratic-300/60 are unacceptable because the magnitude of this model’s prediction results is not at the same level as the actual returns.

Fig. 7
figure 7

Actual versus long-term predicted returns [Under linear, linear covariance and quadratic functions]

Meanwhile, the other two cases can show the trend of actual returns, but there are still significant gaps between the prediction results and the actual returns. The gaps between the two cases and the actual returns, as well as their MSE values, are calculated to determine whether the two cases should be accepted. Table 7 shows the differences between the actual returns and linear-300/60 and linear-cov-300/60.

The results in Table 7 show that the gaps between actual returns and linear-cov-300/60 are much smaller than those between actual returns and linear-300/60. The MSE values for linear-cov-300/60 and linear-300/60 are 0.82 and 1.83, respectively, indicating that linear-cov-300/60 is more accurate in predicting long-term returns than linear-300/60.

The following points can be summarized based on an analysis of the results of these tests with different kernel functions. First, the linear kernel function is robust to training model use in that the prediction is very stable and can achieve stable returns for both long-term and short-term goals. Second, the linear covariance kernel function is robust to the use of the training model. When compared with the linear kernel, the linear covariance kernel function fits the data better and predicts more accurate results, but it is less stable, which requires further consideration (e.g., long-term goals). Third, the quadratic kernel function is difficult to train, necessitating a large number of samples (300+) to facilitate learning. A trained quadratic kernel is excellent for short-term goals but fails miserably for long-term goals.

The preceding analysis shows that kernel functions play an important role in determining the outcomes of the GMDH approach in predicting REIT returns. Scholars find that, aside from the GMDH model, kernel functions impact the outcomes of other ML approaches, such as the SVM model (Yuan et al. 2010). Yuan et al. (2010) have implemented the SVM model to test the internet traffic classification, and they investigated the effectiveness of four kernel functions, including RBF (radial basis function), poly, linear, and sigmoid, and discovered that RBF has the highest accuracy, followed by poly. This study, like Yuan et al. (2010), finds that different kernel functions are accurate in different cases. When the training sample is 100 and the testing sample is 30, the linear function is more accurate than the other two functions.

For the following pairs of training and testing samples, the linear covariance function is more accurate than the other two functions: 200/30, 300/30, and 300/60. The quadratic function is more accurate than the other two functions when the training sample is 600, and the testing sample is 30. Therefore, when implementing the GMDH model to forecast REIT returns, it is important to control the kernel functions in order to improve the accuracy of this ML method.

Estimates of the GARCH model

The GARCH model is also used to predict REIT returns. To compare the results of the GARCH model and the GMDH model, the numbers of observations used in the GARCH model and the numbers of predicted returns were controlled, and then six scenarios were tested, including Scenarios G-300/60, G-600/30, G-300/30, G-200/30, G-100/30, and G-30/30. These scenarios are explained as follows: Scenario G-600/30 means that the 300 samples are used in the GARCH model to estimate coefficients in Eqs. (5a), (5b), and (5c), and 60 returns are predicted. The meanings of the other scenarios are similar to Scenario G-600/30, but the number of samples used in the simulation and prediction differ. Table 8 depicts the specifics of these scenarios. Meanwhile, Table 9 displays the simulation results for these scenarios using the GARCH model.

Table 8 Descriptions of the scenarios by the GARCH model
Table 9 Coefficients of these scenarios based on the GARCH model

GMDH versus GARCH predictions: a comparison

The MSEs of the two approaches were shown in Table 10. Given enough training samples, the GMDH neural network outperforms the GARCH model in terms of accuracy. Moreover, the GMDH model always outperforms the GARCH model in terms of accuracy when the kernel function is linear. When the number of training samples is greater than 100, the accuracy of the linear covariance function is greater than that of the GARCH model. When the kernel function is the quadratic function, its accuracy is higher than the GARCH model only for one case with 600 training samples, quadratic-600/30. Unlike the GARCH model, whose accuracy increases as the number of samples increases, the GMDH model’s accuracy is much more negatively affected by a lack of training samples.

Table 10 MSE values of the two approaches

When compared with a traditional GARCH-based approach, ML-based GMDH outperforms the GARCH model significantly. In other words, GMDH can predict more accurate results with less data and obtain results faster than GARCH models.

Discussion

The results shown above support the first hypothesis that the GMDH neural network outperforms the GARCH model in predicting the US REIT market. The GMDH neural network outperforms GARCH in predicting the US REIT for two reasons. First, GMDH’s self-organization can sort the effects of all possible model variables to find the best solution. Second, GDMH can analyze the structure of nonlinear and complex systems and avoid overfitting through using appropriate kernel functions.

Table 10 lists the comparison results of the two approaches for different number of training and testing samples. Our findings are consistent with Li et al. (2017) in that the GMDH neural network and ANN have better performance in predicting stocks and REITs than other traditional models, such as CAPM and double exponential smooth models. Furthermore, our results also lend support to the second hypothesis that GMDH can predict the US REIT market in both short- and long-term. Tables 3, 5, and 6 illustrate the short-term prediction results, while Table 7 shows the long-term prediction results of GMDH using various kernel functions. Given enough training data, our empirical results suggest that the GMDH approach can accurately predict REIT prices. Finally, we find that the GMDH produces better prediction results by avoiding model underfitting and overfitting issues through the use of different kernel functions. For example, using a linear kernel function can avoid the overfitting with simple data, while the quadratic function can better fit complex data.

Previous research applies two ML approaches to predict four REIT markets—Australia, Hong Kong, Italy, and Turkey—illustrating that the GMDH neural network stands out from the simple BP neural network (Li et al. 2017). However, the world’s largest REIT market, the US, has been omitted. This study uses the GMDH neural network to forecast short- and long-term return forecasts for the US REIT market, as well as daily return forecasts. Furthermore, it demonstrates how the kernel function influences GMDH prediction results, which has previously been overlooked.

Conclusions

In recent years, ML has been widely used in the study of finance and economics, and it has also been practically applied to the field of finance. For example, JP Morgan devoted 280 pages of its investor report to reporting on how JP Morgan applies ML, deep learning, and neural networks to quantitative investment and algorithmic trading. Our study employs the ML-based GMDH model to forecast REIT returns in the United States and assesses the method’s practical utility. The primary contribution of this study is the application of a GMDH neural network to forecast the trend of the US REIT market. The accuracy of the GMDH neural network is compared with that of a traditional approach, the GARCH model, in an empirical study. Several points can be identified by comparing the results of the two models. First, GMDH outperforms GARCH in predicting REIT returns. The ML method is used to test 17 cases while controlling the training samples, testing samples, and kernel functions. The test results reveal that the ML approach’s accuracy is high in most cases, except when the training sample size is 30. Except for the quadratic kernel function, the linear and linear covariance kernel functions can be easily trained to obtain stable prediction results. Compared with the GARCH method’s accuracy, the MSE values of most of these cases for the GMDH method are lower, indicating that the ML approach can easily obtain more accurate prediction results. However, the accuracy of the GMDH method is influenced by the kernel functions used when the training and testing samples are changed. Hence, when using the GMDH method to forecast REIT returns, the kernel function must be adjusted when changing the training and testing samples. Based on the results of this study, the GMDH method produces more accurate predictions of REIT returns in the US market, even with small sample sizes. When investors decide to enter the REIT market, they can use the GMDH method to predict the short-term trend of the US REIT market based on the previous year’s historical data.

Although this study found that the ML approach is more significant than traditional methods—by comparing the accuracy of the GMDH neural network with that of the GARCH model in an empirical study—this research still has some limitations. First, although the GMDH neural network is an excellent data analysis algorithm, it is a model that financial analysts will find more difficult to develop and implement. Second, the US REIT index was used in this empirical analysis. Although the US REIT market capitalization accounts for 61% of the global market, comparison results based on the US REIT index cannot accurately represent the overall situation of all REIT stocks. Therefore, further analysis is needed to improve the representativeness of the findings.

The following research directions can be considered for future studies. First, more ML and non-ML approaches can be tested and compared to determine the efficiency and effectiveness of ML approaches in predicting REIT returns. In the future, researchers can compare the efficiency of other ML and deep learning methods, such as SVM and Random Forest, in predicting REIT returns with the results of the GMDH method. Second, out-of-sample studies can collect data on REITs from multiple markets, as opposed to only the US market. Third, future research can explore which ML approaches can be used to effectively build REIT portfolios to increase returns.