1 Introduction

In the literature, fuzzy, intuitionistic, and picture sets are used for different purposes. In recent years, forecasting, image segmentation, multiple criteria decision-making and pattern recognition problems have been solved by using intuitionistic and picture fuzzy sets. Hung et al. (2008), Hwang et al. (2012), Boran et al. (2014), and Chen et al. (2016) used intuitionistic fuzzy sets for solving pattern recognition problems. Chen (2014), Kumar and Chen (2021), Kumar and Chen (2022), Chai et al. (2023), Priyanka and Kalia (2023), and Kumar and Tyagi (2023) used intuitionistic fuzzy sets for multiple criteria decision-making. He and Wang (2023) used picture fuzzy sets for energy analysis with sentiment analysis. Haktanır and Kahraman (2023) used picture fuzzy sets for a defender-challenger comparison application. Fuzzy set theory plays an important role in the development of inference systems. Intuitionistic and picture fuzzy sets developed in recent years have paved the way for fuzzy inference systems, especially for solving forecasting problems. Another area where intuitionistic and picture fuzzy sets have started to be used is the solution of forecasting problems.

Joshi and Kumar (2012) proposed a method using intuitionistic fuzzy sets for fuzzy time series forecasting. Hung and Lin (2013) proposed a forecasting method for long-term business cycle forecasting based on intuitionistic fuzzy sets and a support vector regression approach. Gangwar and Kumar (2014) proposed a forecasting method based on both probabilistic and intuitionistic fuzzy sets. An intuitionistic fuzzy time series forecasting model based on intuitionistic fuzzy reasoning was proposed by Wang et al. (2016). In Fan et al. (2016), an intuitionistic fuzzy time series method based on similarity measures is proposed. A forecasting method is proposed for solving the high-order intuitionistic fuzzy time series forecasting model by Gautam and Singh (2018). Fan et al. (2020), Chen et al. (2021), Pant and Kumar (2022), Wang et al. (2023), and Dixit and Jain (2023) are other recent studies of forecasting methods using intuitionistic fuzzy sets. One of the most well-known fuzzy inference systems is the fuzzy regression functions approach, which in recent years has been able to produce successful results in solving the forecasting problem.

The fuzzy regression functions approach was first proposed in Türksen (2008). In the fuzzy regression functions approach, the inputs of the system are fuzzified by the fuzzy c-means method first proposed by Bezdek et al. (1984). The memberships obtained from fuzzy sets are used in the construction of the models used in the fuzzy inference system. In the fuzzy regression functions approach in Celikyilmaz and Turksen (2009), the multiple linear regression method is used to construct fuzzy functions. In Atanassov (1986), intuitionistic fuzzy sets, which are a more generalized form of fuzzy functions and also utilize the degree of hesitation, are proposed. With intuitionistic fuzzy sets, it is possible to create an inference system just like fuzzy sets. The intuitionistic fuzzy regression functions approach using intuitionistic fuzzy sets was first introduced to the literature by Bas et al. (2021). In the intuitionistic fuzzy regression functions approach proposed by Bas et al. (2021), the fuzzification process is performed by the intuitionistic fuzzy c-means method proposed in Chaira (2011) instead of fuzzy c-means.

The fuzzy regression functions approach was first generalized as picture fuzzy regression functions for picture fuzzy sets by Bas et al. (2020). Picture fuzzy sets were defined for the first time in Cường (2014) and in this study, the basic operations of picture fuzzy sets are presented. In Bas et al. (2020), picture fuzzy sets are used instead of fuzzy sets in the fuzzy regression functions approach. In the picture fuzzy regression functions method, the picture fuzzy clustering method proposed by Thong and Son (2016) is used to obtain positive, negative, and neutral memberships of picture fuzzy sets. According to Bas et al. (2020), three different outputs of the system are formed according to positive, negative, and neutral memberships. In combining these outputs, an optimization process based on a genetic algorithm was carried out. The number of forecasting methods utilizing picture fuzzy sets is quite limited in the literature. Some researchers have used picture fuzzy sets to obtain forecasts from picture fuzzy inference systems.

Thong and Son (2015a) used picture fuzzy clustering and picture fuzzy rule interpolation methods for forecasting. Thong and Son (2015b) proposed hybrid forecasting methods based on picture fuzzy clustering. Son et al. (2017) proposed a picture fuzzy inference system and made an application for control theory. Fuzzy regression function approaches continue to be developed in the literature. In Kizilaslan et al. (2020), an intuitionistic fuzzy function approach is improved using ridge regression. In Tak (2020a, 2020b) and Egrioglu et al. (2022), recurrent fuzzy and intuitionistic fuzzy regression function approaches are proposed. In Bas (2022), the fuzzy regression functions approach is made robust by using M estimators. In Tak and Inan (2022), the fuzzy regression functions method was improved by using elastic net regularisation. The meta-fuzzy functions method is proposed by Tak (2018) and Tak (2021) to realize a type of forecasting combination that exploits fuzzy function approaches. An approach of fuzzy regression functions based on probabilistic fuzzy sets was proposed in Tak (2020a, 2020b) and Tak (2022).

A picture fuzzy regression function approach works based on the picture fuzzy clustering method. Therefore, it contains more information than fuzzy regression and intuitionistic fuzzy regression function approaches. One reason for this is that the picture fuzzy set is a generalization of both fuzzy and intuitionistic fuzzy sets. Since the fuzzy and intuitionistic fuzzy regression function approaches are inevitably affected by outliers in the time series, the picture fuzzy regression function approach is also affected by outliers. In this paper, an approach to a picture fuzzy regression function that can be utilized even in the presence of outliers in a time series is proposed. In a picture fuzzy regression function approach, the parameter estimation is done by ordinary least squares, while in the proposed method, the parameter estimation is done by robust regression with Bisquare, Cauchy, Fair, Huber, Logistic, Talwar, and Welsch functions.

Thus, with the proposed method, a picture fuzzy regression method is proposed that can be less affected when there are outliers in the time series. The forecasting performance of the proposed method is evaluated on the London and Spain time series. The analysis is performed for both cases where these time series contain both original and outliers. Other parts of the paper are as follows. Section 2 summarizes the picture fuzzy clustering method. Section 3 gives the steps of the proposed method. Section 4 presents the results of the application on real-life data with tables and graphs to evaluate the performance of the proposed method. Section 5 is on the conclusions and discussion of the study.

2 Picture Fuzzy Clustering Method

Since the fuzzy clustering method (FCM) method works only on membership values, it is inadequate when real life problems are evaluated on positive, negative, neutral and refusal decisions. From this point of view, the picture fuzzy clustering (PFC) method, which is a clustering method that has both positive, negative, neutral and refusal membership values, is used as a generalisation of the FCM method, which has many limitations. PFC method proposed by Thong and Son (2016) is based on updating the positive membership values, neutral membership values, and refusal membership values and therefore the negative membership values in the structure of the picture fuzzy sets during iterations. The flow chart of PFC method is given in Fig. 1.

Fig. 1
figure 1

The flow chart of PFC method

Besides, in order to make the algorithm more understandable, the algorithm of PFC method is given step by step with Algorithm 1.

Algorithm 1. The steps of PFC method.

Step 1 The initial values are generated.

The number of observations \(\left( n \right)\), the number of picture fuzzy sets \(\left( c \right)\), \(m\) is the fuzziness index and ε is an error value determined by the researcher; firstly, positive membership values \(\left( {u_{kj} { };\left( {k{ } = { }1,{ }2, \ldots ,{ }n{ };{ }j{ } = { }1,{ }2, \ldots ,c} \right)} \right)\), neutral membership values \(\left( {\eta_{kj} ;\left( {k{ } = { }1,{ }2, \ldots ,{ }n{ };{ }j{ } = { }1,{ }2, \ldots ,c} \right)} \right)\) and refusal membership values \(\left( {\xi_{kj} ;\left( {k{ } = { }1,{ }2, \ldots ,{ }n{ };{ }j{ } = { }1,{ }2, \ldots ,c} \right)} \right)\) are randomly generated and stored in matrices \(U,{ }{\rm N},{\rm K}\) respectively.

Step 2 The cluster centres \(\left( {V_{j} } \right)\) are updated at each iteration using Eq. (1). According to the updated cluster centres, updated positive membership values \((u_{kj}^{*} )\), updated neutral membership values \(\left( {\eta_{kj}^{*} } \right)\) and updated refusal membership values \(\left( {\xi_{kj}^{*} } \right)\) are obtained by Eqs. (2)-(4) and stored in the matrices \(U^{*} ,{ }{\rm N}^{*} ,{\rm K}^{*}\) respectively.

$$V_{j} = \frac{{\mathop \sum \nolimits_{k = 1}^{n} \left( {u_{kj} \left( {2 - \xi_{kj} } \right)} \right)^{m} X_{k} }}{{\mathop \sum \nolimits_{k = 1}^{n} \left( {u_{kj} \left( {2 - \xi_{kj} } \right)} \right)^{m} }},\left( {{ }j = 1,2, \ldots ,c} \right)$$
(1)
$$u_{kj}^{*} = \frac{1}{{\mathop \sum \nolimits_{i = 1}^{c} \left( {2 - \xi_{ki} } \right)\left( {\frac{{X_{k} - V_{j} }}{{X_{k} - V_{i} }}} \right)^{{\frac{2}{m - 1}}} }},\left( {k = 1,2, \ldots ,n;j = 1,2, \ldots ,c} \right)$$
(2)
$$\eta_{kj}^{*} = \frac{{e^{{ - \xi_{kj} }} }}{{\mathop \sum \nolimits_{i = 1}^{c} e^{{ - \xi_{ki} }} }}\left( {1 - \frac{1}{c}\mathop \sum \limits_{i = 1}^{c} \xi_{ki} } \right),\left( {k = 1,2, \ldots ,n;j = 1,2, \ldots ,c} \right)$$
(3)
$$\xi_{kj}^{*} = 1 - \left( {u_{kj} + \eta_{kj} } \right) - \left( {1 - \left( {u_{kj} + \eta_{kj} } \right)^{\alpha } } \right)^{{\frac{1}{\alpha }}} ,\left( {k = 1,2, \ldots ,n;j = 1,2, \ldots ,c} \right)$$
(4)

Step 3 The stopping condition is checked.

If the condition \(\left\| {U^{*} - U} \right\| + \left\| {{\rm N}^{*} - {\text{\rm N}}} \right\| + \left\| {{\rm K}^{*} - {\text{\rm K}}} \right\| < \varepsilon\) is met or the maximum number of iterations is reached, the process is terminated, otherwise the process is continued by returning to Step 2.

3 The Proposed Method

Fuzzy regression function approaches can exhibit superior performance in solving the forecasting problem. The picture fuzzy regression function approach, introduced by Bas et al. (30), is a method with superior forecasting performance based on the dynamic combination of more linear models, which provides a different treatment of uncertainty compared to classical fuzzy regression function approaches. In the proposed method, diversity is provided by using each of the methods based on Bisquare, Cauchy, Fair, Huber, Logistic, Talwar, and Welsch weight functions as M estimators. The flow chart for the proposed method is given in Fig. 2.

Fig. 2
figure 2

The flow chart for the proposed method

The steps of the proposed robust picture fuzzy regression function approach (R-PFRF) method are introduced in Algorithm 2.

Algorithm 2. The steps of R-PFRF method.

Step 1 Determine the parameters of the proposed method.

The parameters of the R-PFRF method are determined by the researcher by the applied data. These parameters are the number of lags \({ }\left( p \right)\), the length of the training set \(\left( {ntrain} \right)\), the length of the test set \(\left( {ntest} \right)\), the number of picture fuzzy sets (\(c)\).

Step 2 Applied PFC method.

According to the number of picture fuzzy sets given in Step 1, the PFC method is applied using PFC. The PFC method is applied for the data set (DX) consisting of the inputs and targets of the training data. The DX data set consists of \(p\) lagged variables and simultaneous variables of the time series. The time series is \(\left\{ {x_{1} ,x_{2} , \ldots ,x_{n} } \right\}\), the training time series is \(\left\{ {x_{1} ,x_{2} , \ldots ,x_{n - ntest} } \right\}\), \(ntarin = n - ntest\) and the test time series is \(\left\{ {x_{n - ntest + 1} ,x_{n - ntest + 2} , \ldots ,x_{n} } \right\}\). The DX matrix given in Eq. (5) is formed from the lagged and simultaneous variables of the training time series. By applying the PFC method, positive, neutral, negative, and refusal membership values are obtained for each picture fuzzy set.

$$DX = \left[ {\begin{array}{*{20}c} {x_{1} } & {x_{2} } & \cdots & {x_{p} } & {x_{p + 1} } \\ {x_{2} } & {x_{3} } & \cdots & {x_{p + 1} } & {x_{p + 2} } \\ \vdots & \vdots & \cdots & \vdots & \vdots \\ {x_{ntrain - p} } & {x_{ntrain - p + 1} } & \cdots & {x_{ntrain - 1} } & {x_{ntrain} } \\ \end{array} } \right]$$
(5)

Step 3 Obtain picture fuzzy functions and the forecasts for each picture fuzzy set.

For each picture fuzzy set, picture fuzzy functions are created by applying robust regression for the training set. If \(u\) is positive, \(\eta\) is neutral and \(\nu\) is a negative membership value, the matrix of explanatory variables is given in Eq. (6).

$$X^{\left( i \right)} = \left[ {\begin{array}{*{20}l} 1 \hfill & {u_{i1} } \hfill & {u_{i1}^{2} } \hfill & {{\text{exp}}\left( {u_{i1}^{2} } \right)} \hfill & {\eta_{i1} } \hfill & {\eta_{i1}^{2} } \hfill & {{\text{exp}}\left( {\eta_{i1}^{2} } \right)} \hfill & {\nu_{i1} } \hfill & {\nu_{i1}^{2} } \hfill & {{\text{exp}}\left( {\nu_{i1}^{2} } \right)} \hfill & {x_{1} } \hfill & {x_{2} } \hfill & \ldots \hfill & {x_{p} } \hfill \\ 1 \hfill & {u_{i2} } \hfill & {u_{i2}^{2} } \hfill & {{\text{exp}}\left( {u_{i2}^{2} } \right)} \hfill & {\eta_{i2} } \hfill & {\eta_{i2}^{2} } \hfill & {{\text{exp}}\left( {\eta_{i2}^{2} } \right)} \hfill & {\nu_{i2} } \hfill & {\nu_{i2}^{2} } \hfill & {{\text{exp}}\left( {\nu_{i2}^{2} } \right)} \hfill & {x_{2} } \hfill & {x_{3} } \hfill & \ldots \hfill & {x_{p + 1} } \hfill \\ \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill \\ 1 \hfill & {u_{i,n1} } \hfill & {u_{i,n1}^{2} } \hfill & {{\text{exp}}\left( {u_{i,n1}^{2} } \right)} \hfill & {\eta_{i,n1} } \hfill & {\eta_{i,n1}^{2} } \hfill & {{\text{exp}}\left( {\eta_{i,n1}^{2} } \right)} \hfill & {\nu_{i,n1} } \hfill & {\nu_{i,n1}^{2} } \hfill & {{\text{exp}}\left( {\nu_{i,n1}^{2} } \right)} \hfill & {x_{n1} } \hfill & {x_{n1 + 1} } \hfill & \ldots \hfill & {x_{n1 + p - 1} } \hfill \\ \end{array} } \right]$$
(6)

Response vector \(\left( Y \right)\) is constituted in Eq. (7) and the parameter estimations are calculated by using Eq. (8).

$$Y^{\prime } = \left[ {\begin{array}{*{20}c} {x_{p + 1} } & {x_{p + 2} } & \ldots & {x_{ntrain} } \\ \end{array} } \right]$$
(7)
$$\hat{\beta }^{\left( i \right)} = (X^{\left( i \right)}{\prime} W_{f} X^{\left( i \right)} )^{ - 1} X^{\left( i \right)}{\prime} W_{f} Y,i{ } = { }1,{ }2, \ldots c$$
(8)

The forecasts for each picture fuzzy set are calculated as in Eq. (9).

$$\hat{Y}^{\left( i \right)} = X^{\left( i \right)} \hat{\beta }^{\left( i \right)} { };{ }i{ } = { }1,{ }2, \ldots c$$
(9)

In Eq. (8), \(W_{f}\) is any of Bisquare, Cauchy, Fair, Huber, Logistic, Talwar, and Welsch weight functions. Weight functions are a nonlinear transformation of adjusted residuals (\(r\)) calculated as a function of the residuals from classical multiple regression (\(e_{i}\)) and are generally based on the idea of suppressing or masking very large and very small residuals. Adjusted residual is calculated by Eq. (10). In robust regression based on M-estimators, the aim is to minimize the least squares calculated with adjusted residuals instead of ordinary least squares.

$$r{ } = { }\frac{e}{{tune \times \frac{{median\left( {\left| {e_{i} - \overset{\lower0.5em\hbox{$\smash{\scriptscriptstyle\smile}$}}{e}_{i} } \right|} \right)}}{0.6745} \times sqrt\left( {1 - h} \right)}}$$
(10)

The tune parameter value varies according to the M-estimator applied and the tune parameter value used for each method is presented in Table 1. \(h\) is the vector of leverage values from an ordinary least-squares fit.

Table 1 Tune parameter values for different weight functions

Formulas of Bisquare, Cauchy, Fair, Huber, Logistic, Talwar, and Welsch weight functions are given in Eqs. (11), (12), (13), (14), (15), (16), and (17).

$$w_{Bisquare} = \left\{ {\begin{array}{*{20}c} {\left( {1 - r^{2} } \right)^{2} } & {\left| r \right| < 1} \\ 0 & {\left| r \right| \ge 1} \\ \end{array} } \right.$$
(11)
$$w_{Cauchy} = \frac{1}{{\left( {1 + r^{2} } \right)}}$$
(12)
$$w_{Fair} = \frac{1}{{\left( {1 + \left| r \right|} \right)}}$$
(13)
$$w_{Huber} = \frac{1}{{max\left( {1, \left| r \right|} \right)}}$$
(14)
$$w_{Logistic} = \frac{tanh\left( r \right)}{r}$$
(15)
$$w_{Talwar} = \left\{ {\begin{array}{*{20}c} 1 & {\left| r \right| < 1} \\ 0 & {\left| r \right| \ge 1} \\ \end{array} } \right.$$
(16)
$$w_{Welsch} = exp\left( { - r^{2} } \right)$$
(17)

Step 4 Calculate the forecasts for each membership type.

For the training data, forecasts are calculated separately for each membership type as given in Eqs. (18) to (20).

$$\hat{Y}_{j}^{positive} = \frac{{\mathop \sum \nolimits_{i = 1}^{c} \mu_{ij} \hat{Y}_{j}^{\left( i \right)} }}{{\mathop \sum \nolimits_{i = 1}^{c} \mu_{ij} }} j = 1,2, \ldots ,n$$
(18)
$$\hat{Y}_{j}^{neutral} = \frac{{\mathop \sum \nolimits_{i = 1}^{c} \eta_{ij} \hat{Y}_{j}^{\left( i \right)} }}{{\mathop \sum \nolimits_{i = 1}^{c} \eta_{ij} }} j = 1,2, \ldots ,n$$
(19)
$$\hat{Y}_{j}^{negative} = \frac{{\mathop \sum \nolimits_{i = 1}^{c} \nu_{ij} \hat{Y}_{j}^{\left( i \right)} }}{{\mathop \sum \nolimits_{i = 1}^{c} \nu_{ij} }} j = 1,2, \ldots ,n$$
(20)

Step 5 Obtain optimal weight values for each membership type.

The forecasts of the R-PFRF method for the training data are the weighted average of the forecasts obtained according to positive, neutral, and negative memberships as given in Eq. (21).

$$\hat{Y}_{j}^{R - PFRF} = w_{1} \hat{Y}_{j}^{positive} + w_{2} \hat{Y}_{j}^{neutral} + w_{3} \hat{Y}_{j}^{negative}$$
(21)

In determining the weights, an optimization based on a genetic algorithm is performed for the training data. For optimization, the genetic algorithm is applied under the condition that the weights are between 0 and 1 and the sum is one. The applied optimization problem can be represented as in Eq. (22).

$$\begin{array}{*{20}c} {\mathop {min}\limits_{{w_{1} , w_{2} , w_{3} }} \mathop \sum \limits_{j = p + 1}^{ntrain} \left( {w_{1} \hat{Y}_{j}^{positive} + w_{2} \hat{Y}_{j}^{neutral} + w_{3} \hat{Y}_{j}^{negative} - Y_{j} } \right)^{2} } \\ {Subject to:\mathop \sum \limits_{i = 1}^{3} w_{i} = 1;0 \le w_{i} \le 1, i = 1,2,3} \\ \end{array}$$
(22)

Step 6 Calculate the forecasts for the test data.

Forecasts are calculated for the test data. For this purpose, the matrix of explanatory variables given in Eq. (6) is reconstructed for the test set with the membership values calculated according to the cluster centers obtained for the training data. Forecasts for the test data are obtained using Eqs. (8), (9), (18), (19), (20) and (23) respectively.

$$\hat{Y}_{j}^{R - PFRF} = \hat{w}_{1} \hat{Y}_{j}^{positive} + \hat{w}_{2} \hat{Y}_{j}^{neutral} + \hat{w}_{3} \hat{Y}_{j}^{negative}$$
(23)

Besides, a Pseudo-code of the R-PFRF method is given in Algorithm 3. Moreover, the codes of the R-PFRF method are uploaded to GitHub by name “https://github.com/erole1977/RPFF”.

figure a

4 Applications

To investigate the performance of the proposed method, applications are performed for a time series of Spain and London stock markets. The original time series are analyzed by the fuzzy time series network (FTS-N) proposed by Bas et al. (2020), the FRF method proposed by Turksen (2008), the IFRF method proposed by Egrioglu et al. (2022), and the PFRF method proposed by Bas et al. (2020) and compared with the proposed method in terms of forecasting performance. In addition, the superiority of the proposed method over other methods in terms of forecasting performance in case of outliers is investigated by artificially adding outlier observations to the original series. In the comparison of each method, the root of mean square error (RMSE) and mean absolute percentage error (MAPE) are used as evaluation criteria. The formulas of RMSE and MAPE criteria are given in Eqs. (24) and (25).

$${\text{RMSE}} = \sqrt {\frac{{\mathop \sum \nolimits_{{{\text{t}} = 1}}^{{{\text{ntest}}}} \left( {{\text{X}}_{{\text{t}}} - {\hat{\text{X}}}_{{\text{t}}} } \right)^{2} }}{{{\text{ntest}}}}}$$
(24)
$${\text{MAPE}} = \frac{1}{{{\text{ntest}}}}\mathop \sum \limits_{{{\text{t}} = 1}}^{{{\text{ntest}}}} \left| {\frac{{{\text{X}}_{{\text{t}}} - {\hat{\text{X}}}_{{\text{t}}} }}{{{\text{X}}_{{\text{t}}} }}} \right|$$
(25)

In Eqs. (24) and (25); \(ntest,\) \({\text{X}}_{{\text{t}}} { }\) and \(\hat{X}_{t}\) shows the length of the test set, the observed value, and the forecasting value, respectively. In the analysis process, the data set is primarily divided into training, validation, and test sets. The validation and test set lengths are determined as 30 in the analysis of the Spanish stock market time series. The validation and test set lengths are determined as 45 in the analysis of the London stock market time series. The number of inputs is tried between one and five and the number of fuzzy clusters is tried between two and five for each of these methods. The optimal number of inputs and fuzzy clusters giving the lowest RMSE value is determined for each method over the validation set. The forecasting performances of the methods are compared on the test data for the best hyperparameters.

4.1 Comparison of the Original Time Series

In this first part of the analysis phase, the closing prices of the Spanish stock market (IBEX) are analyzed. While analyzing the IBEX time series, five-time series with 250 observations are randomly generated within the entire time series between the years 2015–2022. The start and end dates of these five randomly generated IBEX time series are given in Table 2. The graphs of each IBEX time series are also given in Fig. 3.

Table 2 The randomly generated IBEX time series
Fig. 3
figure 3

The graphs of each IBEX time series

The number of optimal input and fuzzy clusters for each method of Series 1–5 are given in Table 3. These optimal numbers of input and fuzzy clusters are used for the calculations on the test set. Considering that each method will give different results for different starting values, various statistics are calculated for each method. For this purpose, each method is run ten times, and minimum, maximum, standard deviation, and mean statistics for each method are calculated for both RMSE and MAPE criteria.

Table 3 The number of optimal input and fuzzy clusters for each method of Series 1–5

Some statistics for RMSE and MAPE values obtained from methods other than the proposed R-PFRF methods for the original Series 1 to 5 are given in Table 4. These RMSE and MAPE values are calculated over the test set of the original Series 1 to 5. Besides, some statistics for RMSE and MAPE values are given in Table 5 for the proposed R-PFRF methods.

Table 4 Some statistics for RMSE and MAPE results of some methods other than R-PFRF methods for the test set of the original time Series 1–5
Table 5 Some statistics for RMSE and MAPE results of each R-PFRF method for the test set of original time Series 1–5

For Series 1, 4, and 5, each R-PFRF method has better results than the other compared methods in Table 5 to the mean and minimum RMSE and MAPE statistics. For Series 2, the best method is the R-PFRF-CAUCHY method according to both RMSE and MAPE criteria. Again, better RMSE and MAPE results are obtained with a robust method than other methods. For Series 3, the best method according to both RMSE and MAPE statistics is the R-PFRF-WELSCH method. As a result, it can be said that the best results are obtained from the R-PFRF methods in the analysis of the original IBEX time series.

The graphs of the forecasts obtained from the best method obtained as a result of the analysis for the original Series 1 to 5 together with the test set are given in Fig. 4. The best method for each time series is determined by considering the mean statistics. Figure 4 shows that the forecasts obtained by the proposed method are highly consistent with the test sets of each original IBEX time series.

Fig. 4
figure 4

Forecast graphs for Series 1 and 5 in the original state

Second and lastly, the closing prices of the London Stock Exchange (FTSE) are analyzed. Again, for the FTSE time series, five different series with 250 observations are randomly obtained between the years 2015–2022. The start and end dates of these five randomly generated FTSE time series are given in Table 6. The graphs of each FTSE time series are also given in Fig. 5.

Table 6 The randomly generated FTSE time series
Fig. 5
figure 5

The graphs of each FTSE time series

Table 7 shows the optimal hyperparameter values for Series 6 to 10, while Table 8 shows the statistics for the RMSE and MAPE results for the methods other than the proposed method. Table 9 shows some statistics for RMSE and MAPE results of each R-PFRF method for the test set of original time Series 6–10.

Table 7 The optimal input and fuzzy cluster numbers for each method of Series 6–10
Table 8 Some statistics for RMSE and MAPE results of some methods other than R-PFRF methods for the test set of the original time Series 6–10
Table 9 Some statistics for RMSE and MAPE results of each R-PFRF method for the test set of original time Series 6–10

When the results in Tables 8 and 9 are analyzed, it is seen that the proposed method produces more successful forecasting results for the FTSE time series than all other methods in the analyses for the original time series.

The graphs of the forecasts obtained from the best method obtained as a result of the analysis for the original Series 6 to 10 together with the test set are given in Fig. 6. The best method for each time series is determined by considering the mean statistics. Figure 6 shows that the forecasts obtained by the proposed method are highly consistent with the test sets of each original FTSE time series.

Fig. 6
figure 6

Forecast graphs for Series 6 and 10 in original state

4.2 Analysis of Time Series with Outlier Cases

In the second part of the application studies, outlier data was artificially generated by adding an observation 10 times the maximum observation value to the training data. Table 10 shows the optimal hyperparameter values obtained for all methods in the IBEX time series. Tables 11 and 12 summarise the forecasting results obtained from the application of alternative methods and the proposed method, respectively.

Table 10 The optimal input and fuzzy cluster numbers of each method for Series 1–5 in outlier case
Table 11 Some statistics for RMSE and MAPE results of some methods other than R-PFRF methods for the test set of the time Series 1–5 in the outlier case
Table 12 Some statistics for RMSE and MAPE results of each R-PFRF method for the test set of time Series 1–5 in the outlier case

The forecasts obtained from the best method for the Series from 1 to 5, which have outliers along with the test set, are shown in Fig. 7. The best method for each time series is determined by considering the mean statistics. Figure 7 shows that the forecasts obtained with the proposed method are highly consistent with the test sets of each IBEX time series with outliers.

Fig. 7
figure 7

Forecast graphs for Series 1 and 5 with outliers

In Table 13, the RMSE and MAPE values and the percentage variation of these RMSE and MAPE values of some methods other than R-PFRF methods are given over the mean statistics for the case where Series 1–5 contains original and outlier values.

Table 13 The percentage variation of RMSE and MAPE values of some methods other than R-PFRF methods for Series 1–5 over the mean statistics

According to Table 13, the result of the RMSE value obtained by the FTS-N method of Series 1 in the case of the outlier has increased by 990% compared to the original case. The result of the RMSE value obtained by the FRF method of Series 3 in the case of the outlier has increased by 928% compared to the original case. The result of the RMSE value obtained by the PFRF method of Series 5 in the case of the outlier has increased by 2405% compared to the original case. Besides, the result of the MAPE value obtained by the PFRF method of Series 5 in the case of outlier has increased by 2868% compared to the original case.

Table 14 shows how R-PFRF methods are not affected by outliers. In Table 14, the RMSE and MAPE values and the percentage variation of these RMSE and MAPE values of R-PFRF methods are given over the mean statistics for the case where Series 1–5 contains original and outlier values.

Table 14 The percentage variation of RMSE and MAPE values of R-PFRF methods for Series 1–5 over the mean statistics

As can be seen from Table 14, lower RMSE and MAPE values can be calculated with any proposed R-PFRF methods. The result of the RMSE value obtained by the R-PFRF- FAIR method of Series 1 in the case of the outlier has just increased by 2.94% compared to the original case. The result of the MAPE value obtained by the R-PFRF-WELSCH method of Series 4 in the case of the outlier has decreased by 2.05% compared to the original case. The result of the MAPE value obtained by the R-PFRF-CAUCHY method of Series 5 in the case of the outlier has decreased by 0.90% compared to the original case. The result of the RMSE value obtained by the R-PFRF- TALWAR method of Series 3 in the case of the outlier has decreased by 1.02% compared to the original case.

Finally, the analysis results obtained from all methods for Series 6–10 are given for the case where these series contain outliers. In Table 15, the optimal number of input and fuzzy clusters of each method for Series 6–10 in the outlier case. Some statistics of RMSE and MAPE results obtained from alternative methods and the R-PFRF method belonging to Series 6–10 in outlier cases are given in Tables 16 and 17 respectively.

Table 15 The optimal input and fuzzy cluster numbers of each method for Series 6–10 in outlier case
Table 16 Some statistics for RMSE and MAPE results of some methods other than R-PFRF methods for the test set of the time Series 6–10 in outlier case
Table 17 Some statistics for RMSE and MAPE results of each R-PFRF method for the test set of time Series 6–10 in outlier case

The forecasts obtained from the best method for the Series from 6 to 10, which have outliers along with the test set, are shown in Fig. 8. The best method for each time series is determined by considering the mean statistics. Figure 8 shows that the forecasts obtained with the proposed method are highly consistent with the test sets of each FTSE time series with outliers.

Fig. 8
figure 8

Forecast graphs for Series 6 and 10 with outliers

In Table 18, the RMSE and MAPE values and the percentage variation of these RMSE and MAPE values of some methods other than R-PFRF methods are given over the mean statistics for the case where Series 6–10 contains original and outlier values.

Table 18 The percentage variation of RMSE and MAPE values of some methods other than R-PFRF methods for Series 6–10 over the mean statistics

According to Table 18, the result of the RMSE value obtained by the FTS-N method of Series 1 in the case of the outlier has increased by 1317% compared to the original case. The result of the RMSE value obtained by the FRF method of Series 4 in the case of the outlier has increased by 1559% compared to the original case. The result of the RMSE value obtained by the PFRF method of Series 5 in the case of the outlier has increased by 2356% compared to the original case. Besides, the result of the MAPE value obtained by the PFRF method of Series 5 in the case of outlier has increased by 3099% compared to the original case. Table 19 shows how R-PFRF methods are not affected by outliers. In Table 19, the RMSE and MAPE values and the percentage variation of these RMSE and MAPE values of R-PFRF methods are given over the mean statistics for the case where Series 6–10 contains original and outlier values.

Table 19 The percentage variation of RMSE and MAPE values of R-PFRF methods for Series 6–10 over the mean statistics

As can be seen from Table 19, lower RMSE and MAPE values can be calculated with any proposed R-PFRF methods. The result of the RMSE value obtained by the R-PFRF- LOGISTIC method of Series 1 in the case of the outlier has just increased by 1.50% compared to the original case. The result of the MAPE value obtained by the R-PFRF- WELSCH method of Series 2 in the case of the outlier has decreased by 1.31% compared to the original case. The result of the MAPE value obtained by the R-PFRF-FAIR method of Series 3 in the case of the outlier has just increased by 2.19% compared to the original case. The result of the RMSE value obtained by the R-PFRF-BISQUARE method of Series 4 in the case of the outlier has decreased by 1.14% compared to the original case.

5 Conclusions and Discussion

In this study, a new forecasting method based on picture fuzzy sets that is less affected by outliers is proposed. Since the proposed forecasting method is based on picture fuzzy sets, it can handle different dimensions of uncertainty. For this reason, a data augmentation process, which can be considered as some nonlinear transformations of the original data, can be operated thanks to different membership values such as positive, negative, and neutral. In addition, since the proposed method is a kind of fuzzy function approach, it leads to a kind of forecast combination method in which more than one linear model is combined with weights that vary from observation to observation. In addition, this forecast combination is a dynamic approach in which weights can be determined automatically through memberships. Since the proposed method uses robust regression instead of the OLS method in parameter estimation, the problem of being affected by outliers encountered by existing approaches is avoided.

When the performance of the proposed method is investigated on real-life stock market time series, it is observed that it is quite successful compared to its current competitors in the literature and can provide useful forecasting results for the closing futures of the Spanish and London stock markets. In particular, it is clear from the application results that the performance of the proposed method is not affected by artificially added outliers. In addition, in the proposed method, the selection of different weight functions does not have much effect on the results, and the selection of any weight function among the weight functions used in this study will be sufficient to obtain successful forecasting performance. In future studies, it is planned to improve the performance of the R-PFRF method using robust artificial neural networks.