1 Introduction

Accurate estimation of reference evapotranspiration (ETRef) is one of the priorities for estimating the water requirement of agricultural products. The complexity of the evapotranspiration process and its dependence on meteorological variables, lack of access to all meteorological data, and the lack of generalizability of a model for different climates have made it difficult to accurately estimate this variable. Meanwhile, the development of simple models based on multivariate linear regression due to the simplicity and the ability to select different models according to the available meteorological variables can help in estimating the appropriate ETRef in different climates. In general, the estimation of ETRef can be done through direct and indirect methods and using mathematical models (Jing et al. 2019). Indirect methods for estimating ETRef began with the development of empirical equations such as Penman-Monteith (PM) (Monteith 1965). Based on the PM method, FAO56 published a standard method for estimating ETRef. The PM method presented in FAO56 has been used by many researchers as the base method in calculating ETRef (Güçlü et al. 2017; Saggi and Jain 2019; Shiri et al. 2019).

In recent years, however, artificial intelligence-based methods such as the neural networks (Kisi et al. 2015; Gavili et al. 2018), the support vector machine (Tabari et al. 2013), the extreme learning machine (Abdullah et al. 2015), decision tree (Huang et al. 2019; Raza et al. 2020), and hybrid methods (Ehteram et al. 2019; Shiri et al. 2020; Tikhamarine et al. 2019; Zhu et al. 2020; Kim et al. 2014; Sanikhani et al. 2019; Mehdizadeh et al. 2017) have had many applications in estimating ETRef, but among them, multivariate linear regression method has been compared with other empirical equations and soft computing, validated by many researchers (Reis et al. 2019; Kisi and Heddam 2019; Mattar and Alazba 2019; Tabari et al. 2012).

The appropriate method for estimating ETRef in each region depends on climatic conditions, required data, and related costs (Sharafi et al. 2016). Accordingly, Unes et al. (2020) predicted daily ETRef based on climatic conditions using empirical equations (Hargreaves-Samani, Ritchie, Turc, and Penman FAO 56), multilinear regression (MLR), and different data mining techniques (M5T, ANFIS, SVM). According to their results, Turc empirical formula (radiation-based) is found better than other empirical equations and the highest correlation coefficient is calculated for ANFIS, and the minimum errors are calculated for radial basis function SVM. Also, Chen et al. (2020) estimate daily ETRef based on limited meteorological data using three deep learning methods, two classical machine learning methods, and seven empirical equations. Their results show that, when temperature-based features were available, the deep learning models performed markedly better than temperature-based empirical models, and when radiation-based or humidity-based features were available, all of the proposed deep and classical learning machine models outperformed radiation-based or humidity-based empirical equations beyond the study areas.

Yirga (2019) estimated ETRef by multiple linear regression. The results of the MLR model could estimate and predict ETRef in the Megecha basin and can be used in similar parts of data-sparse regions. Karbasi (2018) investigated the Gaussian process regression (GPR) and Wavelet-GPR models to forecast multi-step ahead daily ETRef at the synoptic station of Zanjan (Iran). The results of the Wavelet-GPR model showed that the performance of the model during the warmer season is better than its performance throughout the year.

dos Santos Farias et al. (2020) evaluated the performance of machine learning techniques and stepwise multiple linear regression method to estimate daily ETRef with limited weather data in a Brazilian agricultural frontier. Their results showed that machine learning methods are robust in estimating ETRef, even in the absence of some variables. On the other hand, the use of artificial intelligence models in estimating ETRef with high accuracy has become prevalent in recent years, but the complexity of these models makes their application difficult for different regions. For this purpose, the present paper uses multivariate linear regression models with different data input to develop a simple comprehensive model with the minimum data required to estimate the ETRef in different climates of Iran. Although in most of the previous researches, the accuracy of the results has been investigated based on meteorological stations (points), in this research, the accuracy of multivariate linear regression models has been investigated based on different climates in Iran. This innovation extends the results of this paper to similar climates around the world, and also reduces the uncertainty of results as a result of fluctuations in climate variables on a point-by-point basis.

2 Materials and methods

2.1 Study area

Iran is located in the northern hemisphere between two longitudes of eastern 44° and 64° and two latitudes of northern 40° and 25°. Meteorological data of different synoptic stations, including different climates, are collected and analyzed. Some records of data input were incomplete, or not available for some stations; therefore, only stations with long statistical period length remained. Accordingly, in the present study, meteorological data of 33 synoptic stations in Iran over a statistical period of 30 years (1990–2019) are used. Figure 1 shows the location of the studied stations along with climatic classification based on the Köppen method (Köppen 1931) and according to the values of air temperature and precipitation. According to the study of Sharafi and Karim (2020) (the Köppen climate classification), 7 stations (Bandar Abbas, Abadan, Ahwaz, Bushehr, Bam, and Zabol), 7 stations (Zahedan, Semnan, Sabzevar, Kerman, Isfahan, Shahroud and Torbat-e Heydarieh), 13 stations (Shiraz, Gorgan, Tehran, Khorramabad, Bandar Anzali, Birjand, Rasht, Kermanshah, Mashhad, Arak, Qazvin, Sanandaj, and Tabriz), and 6 stations (Shahrekord, Khoy, Saqez, Urmia, Hamedan, and Zanjan) are located in the arid desert (Bw), semiarid (Bs), humid with very mild winters (C), and humid with severe winters (D) zones, respectively (Fig. 1).

Fig. 1
figure 1

Location and classification of studied stations based on the Köppen method

To calculate ETRef values using empirical equations and regression models, the monthly data, parameters of minimum temperature (Tmin), maximum temperature (Tmax), relative humidity (RH), wind speed (u), solar radiation (Rs), extraterrestrial radiation (Ra), and vapor pressure deficit (VPD) were used in different stations. The statistical characteristics of these parameters are presented for different climates separately in Table 1.

Table 1 Statistical characteristics of meteorological variables

According to the results in Table 1, the average monthly ETRef values decreased from 123.3 to 90.5 mm month−1, respectively, from arid desert (Bw) to humid climate with severe winters (D). Also, the highest and lowest coefficient of variation (CV%) of ETRef changes in Bw and D climates was reported to be equal to 39.2% and 53.8%, respectively. The coefficients of variation of minimum and maximum temperatures similar to ETRef increased from Bw to D climates (Table 1).

R, rainfall; RH, relative humidity; Tmin, minimum temperature; Tmax, maximum temperature; u, wind speed; Rs, solar radiation; Ra, extraterrestrial radiation; ETRef, reference evapotranspiration by Penman-Monteith (Allen et al. 1998); Min, minimum; Max, maximum; Ave, average; SD, standard deviation; CV (%), coefficient of variation; Kur, kurtosis; Sk, skewness

In Fig. 2, the Pearson correlation coefficient values between ETRef and other meteorological variables (average, maximum and minimum temperature, vapor pressure deficient, wind speed, relative humidity, solar radiation, and extraterrestrial radiation) are shown for 33 study stations based on Köppen method. Pearson correlation coefficient values of + 1 and − 1 show the highest correlation with a direct and inverse relationship between the dependent variable (ETRef) and independent variable (meteorological variables), respectively. Figure 2 shows that ETRef were directly related to all meteorological variables. The only correlation coefficient between ETRef and relative humidity was negative, indicating an inverse relationship between ETRef and RH. Accordingly, in all 33 stations studied, there was the highest correlation coefficient of ETRef with average, maximum, and minimum temperature and VPD. Also, the correlation coefficient between ETRef and wind speed and relative humidity had the lowest values in most of the studied stations. ETRef in C and D climates (Fig. 2 c and d) compared to Bw and Bs climates (Fig. 2 a and b) show more correlation with solar radiation (Rs) and extraterrestrial radiation (Ra) variable. On the other hand, based on the results of Fig. 2 a and d, there were high and low changes in the correlation coefficient values between ETRef and meteorological variables in stations with Bw and D climates, respectively.

Fig. 2
figure 2

Radar diagram of the Pearson correlation coefficient between ETRef and meteorological variable based on the Köppen method

2.2 ETRef estimation models

2.2.1 Empirical equations

In the present paper, the FAO56 Penman-Monteith (ETRef -PMF56) method was used as the target ETRef values to compare with the results of empirical equations and MLR models, which can be calculated as:

$$ {\mathrm{ET}}_{\mathrm{Ref}}=\frac{0.408\Delta \left({R}_{n-}G\right)+\gamma \frac{900}{T_a+273}{u}_2\left({e}_s-{e}_a\right)}{\Delta +\gamma \left(1+0.34{u}_2\right)} $$
(1)

For more information on this method, you can refer to FAO56 (Allen et al. 1998). The developed empirical equations can be categorized into the temperature-based, radiation-based, and mass transfer-based according to the data used (Liu et al. 2017; Zhang et al. 2018). In this research, the empirical equations of Hargreaves-Samani (temperature-based), Irmak (radiation-based), and Dalton (mass transfer-based) are used based on Eqs. (2) (Hargreaves-Samani), (3) (Irmak), and (4) (Dalton), respectively, to estimate ETRef (Hargreaves and Samani 1985; Irmak et al. 2003; Dalton 1802).

$$ {\mathrm{ET}}_{\mathrm{Ref}}=a{R}_a\left({T}_a+b\right)\Delta {T}^{0.5} $$
(2)
$$ {\mathrm{ET}}_{\mathrm{Ref}}=a{R}_s\left(b{T}_a+c\right) $$
(3)
$$ {\mathrm{ET}}_{\mathrm{Ref}}=\left(a+{bu}_2\right) VPD $$
(4)

In these equations, ETRef, reference evapotranspiration (mm month−1); Δ, the slope of the saturation vapor pressure function (kPa °C); γ, psychometric constant (kPa °C); Rn, net radiation (MJ m−2 day−1); G, soil heat flux density (MJ m−2 day−1); u, average wind speed at 2 m height (m s−1); es, saturation vapor pressure (kPa); ea, actual vapor pressure; VPD, vapor pressure deficit; α, 1.26; λ, latent heat of the evaporation (MJ kg−1); Ra, extraterrestrial radiation (mm month−1); Rs, monthly solar radiation (MJ m−2 month−1); RH, relative humidity (%); Ta, average air temperature (°C); Tmax, maximum air temperature (°C); Tmin, minimum air temperature (°C); the values of a, b, and c are empirical coefficients.

2.2.2 Multivariate linear regression models

Multivariate linear regression is a method to model the relationship between several independent variables (average, maximum and minimum temperature, vapor pressure deficient, wind speed, relative humidity, solar radiation, and extraterrestrial radiation) with a dependent variable (ETRef). This method, based on the minimum of mean square error, performs the empirical coefficients of the linear relationship between the dependent variable and the independent variables in such a way that the linear model data has the best fit with the target data. In general, the form of multivariate linear regression equations is the following expression 5.

$$ \hat{Y}={b}_0+{b}_1{X}_1+{b}_2{X}_2+{b}_3{X}_3+\dots +{b}_m{X}_m $$
(5)

In this expression, \( \hat{Y} \) is a dependent variable, b0 to bm are empirical coefficients and X1 to Xm are independent variables. Also, the accuracy of the results of multivariate linear regression models extremely depends on the number and type of input variables to the model. In the present study, 6 multivariate linear regression models presented in Table 2 have been developed. Meteorological variables are referred separately for each empirical equations and multivariate linear regression models. On the other hand, MLR1, MLR2, and HS models are defined as temperature-based, MLR3, and IR as radiation-based, MLR4, MLR5, and DT as mass transfer-based, and MLR6 as combination models, respectively (Table 2).

Table 2 Characteristics of multivariate linear regression models for ETRef estimation

2.3 Evaluation criteria

In this study, 5 statistical criteria include the following: RMSE, SI, MAE, NSE, and R2 were used to compare the results of empirical equations and MLR with ETRef -PMF56 based on Eqs. (6) to (10).

$$ \mathrm{Root}\ \mathrm{Mean}\ \mathrm{Square}\ \mathrm{Error}\ \left(\mathrm{RMSE}\right)\kern0.5em \mathrm{RMSE}=\sqrt{\frac{1}{N}{\sum}_{i=1}^N{\left({ET}_{Ref_i}^{\operatorname{mod} el}-{ET}_{Ref_i}^{PMF56}\right)}^2} $$
(6)
$$ \mathrm{Scatter}\ \mathrm{Index}\ \left(\mathrm{SI}\right)\kern0.5em \mathrm{SI}=\frac{RMSE}{{\overline{ET}}_{Ref}^{PMF56}} $$
(7)
$$ \mathrm{Mean}\ \mathrm{Absolute}\ \mathrm{Error}\ \left(\mathrm{MAE}\right)\kern0.5em \mathrm{MAE}=\frac{1}{N}{\sum}_N^{i=1}\left|{ET}_{Ref_i}^{\operatorname{mod} el}-{ET}_{Ref_i}^{PMF56}\right| $$
(8)
$$ \mathrm{Nash}-\mathrm{Sutcliffe}\ \mathrm{Efficiency}\ \left(\mathrm{NSE}\right)\kern0.5em \mathrm{NSE}=1-\left[\frac{\sum \limits_N^{i=1}\ {\left({ET}_{Ref_i}^{PMF56}-{ET}_{Ref_i}^{\operatorname{mod} el}\right)}^2}{\sum \limits_{i=1}^N{\left({ET}_{Ref_i}^{PMF56}-{\overline{ET}}_{Ref}^{PMF56}\right)}^2}\right] $$
(9)
$$ \mathrm{Coefficient}\ \mathrm{of}\ \mathrm{determination}\ \left(\mathrm{R}2\right)\kern0.5em {R}^2={\left[\frac{\sum \limits_N^{i=1}\left({ET}_{Ref_i}^{PMF56}-{\overline{ET}}_{Ref}^{PMF56}\right)\left({ET}_{Ref_i}^{\operatorname{mod} el}-{\overline{ET}}_{Ref}^{\operatorname{mod} el}\right)}{\sqrt{\left[\sum \limits_{i=1}^N{\left({ET}_{Ref_i}^{PMF56}-{\overline{ET}}_{Ref}^{PMF56}\right)}^2\right]\left[\sum \limits_{i=1}^N{\left({ET}_{Ref_i}^{\operatorname{mod} el}-{\overline{ET}}_{Ref}^{\operatorname{mod} el}\right)}^2\right]}}\right]}^2 $$
(10)

In Eqs. (6) to (10), \( {ET}_{Ref_i}^{PMF56} \) and \( {ET}_{Ref_i}^{\operatorname{mod} el} \) are the ETRef based on PMF56 and modeled ETRef, \( {\overline{ET}}_{Ref}^{PMF56} \) and \( {\overline{ET}}_{Ref}^{\operatorname{mod} el} \) are the mean values of ETRef based on PMF56 and modeled ETRef, and N is the number of data set (360 months).

The perfect value for MAE, RMSE, SI, and MAE indices, except NSE and R2, is zero, and for NSE and R2 is unity. According to Li et al. (2013), the range of SI for the accuracy of the models as:

$$ IF\left\{\begin{array}{c} SI<0.1\mathrm{Excellent}\\ {}0.1< SI<0.2\mathrm{Good}\\ {}\begin{array}{c}0.2< SI<0.3\mathrm{Fair}\\ {} SI>0.3\mathrm{Poor}\end{array}\end{array}\right\} $$

3 Results and discussion

In this paper, a mathematical equation was developed for each of the 6 predefined multivariate linear regression models (MLR1–6) to more accurately estimate ETRef for each of the four climates of Iran. The developed equations illustrated in Table 3 are based on various independent variables (meteorological parameters) and dependent variables (ETRef). Thus, MLR1 and MLR5 models were developed with two variables, MLR2 model with three variables, MLR3 and MLR4 models with four variables, and MLR6 model with the combination variables, i.e., 2 to 7 variables (Table 3). The MLR6 model has been developed as the best selection from 7 independent variables to the estimation of minimum error between \( {ET}_{Ref}^{PMF56} \) and \( {ET}_{Ref}^{model} \). For example, according to Eq. (32), \( {ET}_{Ref}^{model} \) values have a linear relationship with the three variables Tmin, u, and VPD in Bw climate (Table 3).

Table 3 Equations derived for multivariate linear regression models in different climates

Kiafar et al. (2017) conducted the comparison of gene expression programming (GEP) models and empirical models led to the development of new equations for each climatic region. Their research showed that the models developed for very dry (Bw) and humid (C and D) climates had more accurate results. Also, based on their results, models that used the more climatic variables had a more accurate estimate of ETRef. Other researchers have reported similar results in the same climates (Traore et al. 2010; Ozkan et al. 2011; Huo et al. 2012).

To evaluate the relationship between ETRef and meteorological variables, a t test with a 95% confidence level (α = 0.05) was used. The significance of the multivariate regression linear relationship between ETRef and each independent variable will be significant at the 95% level if the t stat value is greater than 1.96 or less than − 1.96 (Mattar and Alazba 2019). Figure 3 shows the t stat parameter values for all stations studied by climates classified according to the Köppen method (Fig. 3).

Fig. 3
figure 3

t statistic of independent variables in different climates

According to the results of Fig. 3, in Bw climate, the two variables Ra and Rs had no significant linear relationship with ETRef in none of the stations of this climate at a 95% confidence level. Also in this climate, changes in the Tmin variable had a significant relationship with ETRef only in three stations of Bandar Abbas, Bushehr, and Yazd, and the relationship of this variable with ETRef was not significant in four stations of Abadan, Ahvaz, Bam, and Zabol at 95% confidence level. Also, in Bs climate (Fig. 3b), there was only a significant linear relationship between the three variables Tmin, u, and VPD at 95% confidence level with ETRef in most stations of this climate. For other variables including intercept width, Tmax, Ra, Rs, and RH, there was a significant linear relationship with ETRef at a 95% level in two out of seven stations in this climate (Fig. 3b).

Figure 3c shows that there is a significant linear relationship between ETRef and most meteorological variables in station C climates. In other words, there is a significant linear relationship between ETRef and most meteorological parameters in this climate at a 95% level (Fig. 3c). There is no significant linear relationship between ETRef values and Ra, Rs, and RH variables in the most stations with D climate (Fig. 3d). Based on the results of Fig. 3, there is a significant linear relationship between ETRef and VPD, u, and Tmin variables in 32, 28, and 25 stations of the 33 studied stations (α = 0.05%), respectively. Also, the least significant linear relationship between ETRef and Ra and Rs variables was observed in 8 and 10 stations of all studied stations, respectively (Fig. 3). According to the results of Shiri et al. (2013) and Khanmohammadi et al. (2018), models in which the RH variable was used showed better results in C and D climates, because the effect of RH on the ETRef is greater in these climates. Also, based on the results of Yassin et al. (2016), the Irmak model (radiation-based) was more accurate in Bw and Bs climates. RMSE values for the Irmak model in arid climates were 25.78% lower than humid climates.

Table 4 shows the average values of RMSE and R2 criteria in stations based on climates classified according to the Köppen method for empirical equations and MLR models. Also, the highest and lowest accuracy of the models in all climates was related to the MLR6 model and the empirical Hargreaves-Samani (HS) equation, respectively. The Irmak method had the best results in all climates in comparison with the other empirical equations. After the MLR6 model, which was estimated based on the linear relationship between ETRef and all climatic variables, the MLR4 model (based on the minimum and maximum temperature, relative humidity, and wind speed) was more accurate in Bw and Bs climates. In C and D climates, the MLR3 model (based on minimum, maximum temperatures, and solar and extraterrestrial radiations) showed the best estimates of ETRef between radiation-based models. This can be justified due to the higher correlation of ETRef with radiation variables in C and D climates (Fig. 2 c and d).

Table 4 RMSE and R2 criteria for different models in different climates

Based on the results of this study, a significant linear relationship between ETRef and VPD, u, and Tmin variables were reported in 32, 28, and 25 stations, but this linear relationship was only significant in three variables Tmin, u, and VPD with ETRef in most semiarid climate stations (Bs). However, the linear relationship between ETRef and most meteorological variables in C climate was significant.

Furthermore, according to the results of Table 4, results were more accurate of the simplest regression model (i.e., MLR1) in estimating the ETRef compared to all empirical equations. Also, in terms of RMSE criteria, the lowest error values were reported for C climate and with changes in the range of 13.3–22.0 mm month−1. On the other hand, the highest values of R2 for this climate were around 0.88–0.96 (Table 4).

The results of Shiri et al. (2012) showed that the HS method has less accurate in estimating daily reference evapotranspiration in the north of Spain in comparison with the Priestley-Taylor empirical method, intelligent gene expression programming (GEP), and adaptive neuro-fuzzy inference system (ANFIS) methods. Their results corroborate the results of the present study due to the lower accuracy of the HS method in all four climates of Iran.

Figure 4 shows the mean of the estimated ETRef values for the best empirical method (Irmak) and the best regression model (MLR6) in different climates with two criteria NSE and MAE (Fig. 4). Accordingly, NSE values in all climates for the MLR6 model were 0.97, which indicates the high accuracy of this model in estimating ETRef at all stations in different climates of the country.

Fig. 4
figure 4

Comparison of estimated ETRef values between Irmak and MLR6 models

The results also showed that the highest and lowest accuracy of the models in all climates was related to the MLR6 model and the empirical Hargreaves-Samani (HS) equation, respectively. Of the three empirical equations, the Irmak method showed the best results in all climates. After the MLR6 regression model, which was estimated based on the linear relationship between ETRef and all climatic variables, in Bw and Bs climates, the MLR4 model in climates C and D, the MLR3 model showed the best estimates of ETRef. Another noteworthy point was the high accuracy of the simplest regression model (i.e., MLR1) in estimating the ETRef compared to all empirical equations.

Also, Fig. 4 shows that in terms of NSE and MAE criteria, the Irmak empirical method is more accurate in arid and semiarid climates (Bw and Bs) than in humid climates (C and D).

According to the value of the SI criteria, the performance of a model can be divided into four levels: excellent to poor. Figure 5 shows the SI values at different stations for all empirical equations and the MLR models. Also, the MLR6 model had excellent estimates in 10 stations including Bushehr and Yazd stations (Bw); Zahedan, Kerman, and Isfahan (Bs); Shiraz, Gorgan, Birjand, and Qazvin (C); and Zanjan (D) (SI < 0.1) which in this regard had the highest accuracy in different climates. Then, MLR3 model in two stations in Bs climate (Zahedan and Kerman), in three stations in C climate (Gorgan, Birjand, and Qazvin), and one station in D climate (Zanjan) had the best estimate of ETRef. In other words, the HS equation in four stations of Ahwaz, Isfahan, Birjand, and Rasht had the lowest SI values and ETRef estimation using this method in these stations was poor (SI > 0.3). In general, Fig. 5 shows that except for the HS equation, the accuracy of estimation of other methods in most of the studied stations concerning SI values was good (0.1 < SI < 0.2).

Fig. 5
figure 5

SI criteria values using different empirical equations and MLR models in the studied stations

Figure 5 shows that the MLR6 model has the best estimation of ETRef in almost of the studied station located on different climates. However, in a few stations such as Arak, a model other than MLR6 has had better results in ETRef estimation based on the scatter index. According to Fig. 5, the Irmak method has been more accurate to estimate ETRef among the three studied empirical equations in most stations located in different climates. According to Unes et al. (2020) results, radiation-based empirical equations (Turc and Irmak) were found better than other empirical equations, the same with our results about Irmak empirical equation (radiation-based).

4 Conclusion

Based on the results of the study, the MLR6 model was developed as the best multivariate linear regression model with a minimum error value between ETRef -PMF56, and ETRef. The results of the model showed a significant linear relationship between ETRef as a dependent variable and other meteorological variables as independent variables, because of the MLR6 model. Also, the model had excellent estimates in 10 stations including Bushehr and Yazd stations (Bw), Zahedan, Kerman, and Isfahan (Bs); Shiraz, Gorgan, Birjand, and Qazvin (C); and Zanjan (D) (SI < 0.1), which in this regard had the highest accuracy of estimation in different climates. Therefore, the development of multivariate linear regression models provided the necessary preconditions for evaluating the function of the empirical equations of ETRef in the study.

Generally, the MLR6 shows efficient results under various climatic conditions. However, intensive data is required for this method and in developing countries; numerous climatic data are not available readily for all the weather stations. Also, the data always lack reliable quality. Certainly, there is a need to develop some approaches that can estimate ETRef precisely with available limited climatic data. The study demonstrated that all MLR models (even MLR1 with the Tmin and Tmax as input data) gave reliable results to estimation of ETRef in different climates.