Skip to main content
Log in

The effect of data quality on model performance with application to daily evaporation estimation

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

The model performance is usually influenced by the quality of the data used in model construction. If the model performance is less affected by data quality, the resulting estimates will be more reliable. In this paper, the variation in model performance due to different data quality is explored in a field-scale application. Hence, two models, the proposed support vector machine (SVM) based model and the Stephen and Stewart (SS) model, are employed for daily estimation of evaporation at an experiment station. Five scenarios corresponding to different data qualities are designed to evaluate the effect of data quality on model performance. Additionally, the most effective meteorological variables influencing evaporation are obtained by a systematic input determination process. These most effective meteorological variables are used as inputs to the SVM-based model. The results show that the model performance decreases as the data quality decreases (i.e. the percentage of missing data increases). However, the estimation accuracy of SVM-based models is still better than that of the SS model. Moreover, the variation of model performance of the SVM-based model is smaller than that of the SS model. That is, the negative impact of different data quality is effectively decreased by using the SVM-based model instead of the SS model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Aksoy H, Dahamsheh A (2009) Artificial neural network models for forecasting monthly precipitation in Jordan. Stoch Environ Res Risk Assess (SERRA) 23:917–931

    Article  Google Scholar 

  • Allen RG, Pereira LS, Raes D, Smith M (1998) Crop evapotranspiration, irrigation and drainage Paper No. 56. Food and Agriculture Organization: Rome, Italy

  • ASCE Task Committee on Application of Artificial Neural Networks in Hydrology (2000a) Artificial Neural Networks in hydrology, I: preliminary concepts. J Hydrol Eng 5(2):115–123

    Article  Google Scholar 

  • ASCE Task Committee on Application of Artificial Neural Networks in Hydrology (2000b) Artificial Neural Networks in hydrology, II: hydrological applications. J Hydrol Eng ASCE 5(2):124–137

    Article  Google Scholar 

  • Chen H, Xiang TT, Zhou X, Xu CY (2012) Impacts of climate change on the Qingjiang Watershed’s runoff change trend in China. Stoch Environ Res Risk Assess (SERRA) 26:847–858

    Article  Google Scholar 

  • Chetan M, Sudheer KP (2006) A hybrid linear-neural model for river flow forecasting. Water Resour Res 42(4):W04402. doi:10.1029/2005WR004072

    Article  Google Scholar 

  • Cristianini N, Shaw-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, New York

    Book  Google Scholar 

  • Guven A, Kisi O (2011) Daily pan evaporation modeling using linear genetic programming technique. Irrig Sci 29:135–145

    Article  Google Scholar 

  • Hosseinzadeh Talaee P, Heydari M, Fathi P, Marofi S, Tabari H (2012) Numerical model and computational intelligence approaches for estimating flow through Rockfill Dam. J Hydrol Eng 17(4):528–536

    Article  Google Scholar 

  • Izadifar Z, Elshorbagy A (2010) Prediction of hourly actual evapotranspiration using neural networks, genetic programming, and statistical models. Hydrol Process 24(23):3413–3425

    Article  Google Scholar 

  • Jahanbani H, El-Shafie AH (2011) Application of artificial neural network in estimating monthly time series reference evapotranspiration with minimum and maximum temperatures. Paddy Water Environ 9:207–220

    Article  Google Scholar 

  • Jeong DI, St-Hilaire A, Ouarda TBMJ, Gachon P (2012) Comparison of transfer functions in statistical downscaling models for daily temperature and precipitation over Canada. Stoch Environ Res Risk Assess (SERRA) 26:633–653

    Article  Google Scholar 

  • Kashani MH, Dinpashoh Y (2012) Evaluation of efficiency of different estimation methods for missing climatological data. Stoch Environ Res Risk Assess (SERRA) 26:59–71

    Article  Google Scholar 

  • Keskin ME, Terzi O (2006) Artificial neural network models of daily pan evaporation. J Hydrol Eng 11(1):65–70

    Article  Google Scholar 

  • Kisi O (2008) The potential of different ANN techniques in evapotranspiration modelling. Hydrol Process 22(14):2449–2460

    Article  Google Scholar 

  • Kisi O (2009a) Modeling monthly evaporation using two different neural computing techniques. Irrig Sci 27:417–430

    Article  Google Scholar 

  • Kisi O (2009b) Daily pan evaporation modelling using multi-layer perceptrons and radial basis neural networks. Hydrol Process 23:213–223

    Article  Google Scholar 

  • Kisi O, Cimen M (2009) Evapotranspiration modelling using support vector machines. Hydrol Sci J 55(8):1442–1450

    Google Scholar 

  • Kumar M, Bandyopadhyay A, Raghuwanshi NS, Singh R (2008) Comparative study of conventional and artificial neural network-based ETo estimation models. Irrig Sci 26:531–545

    Article  Google Scholar 

  • Kuo SF, Chen FW, Liao PY, Liu CW (2011) A comparative study on the estimation of evapotranspiration using backpropagation neural network: Penman–Monteith method versus pan evaporation method. Paddy Water Environ 9:413–424

    Article  Google Scholar 

  • Landeras G, Ortiz-Barredo A, Lopez JJ (2008) Comparison of artificial neural network models and empirical and semi-empirical equations for daily reference evapotranspiration estimation in the Basque Country (Northern Spain). Agric Water Manag 95(5):553–565

    Article  Google Scholar 

  • Lin GF, Chen LH (2004) A non-linear rainfall-runoff model using radial basis function network. J Hydrol 289(1–4):1–8

    Article  Google Scholar 

  • Lin JY, Cheng CT, Chau KW (2006) Using support vector machines for long-term discharge prediction. Hydrol Sci J 51(4):599–612

    Article  Google Scholar 

  • Lin GF, Chen GR, Wu MC, Chou YC (2009a) Effective forecasting of hourly typhoon rainfall using support vector machines. Water Resour Res 45(8):W08440. doi:10.1029/2009WR007911

    Article  Google Scholar 

  • Lin GF, Chen GR, Huang PY, Chou YC (2009b) Support vector machine-based models for hourly reservoir inflow forecasting during typhoon-warning periods. J Hydrol 372(1–4):17–29

    Article  Google Scholar 

  • Lin GF, Chen GR, Huang PY (2010) Effective typhoon characteristics and their effects on hourly reservoir inflow forecasting. Adv Water Resour 33(8):887–898

    Article  Google Scholar 

  • Maity R, Bhagwat PP, Bhatnagar A (2010) Potential of support vector regression for prediction of monthly streamflow using endogenous property. Hydrol Process 24(7):917–923

    Article  Google Scholar 

  • Moghaddamnia A, Ghafari Gousheh M, Piri J, Amin S, Han D (2009) Evaporation estimation using artificial neural networks and adaptive neuro-fuzzy inference system techniques. Adv Water Resour 32(1):88–97

    Article  CAS  Google Scholar 

  • Rahimikhoob A (2009) Estimating daily pan evaporation using artificial neural network in a semi-arid environment. Theor Appl Climatol 98(1–2):101–105

    Article  Google Scholar 

  • Shiri J, Kisi O (2011) Application of artificial intelligence to estimate daily pan evaporation using available and estimated climatic data in the Khozestan Province (South Western Iran). J Irrig Drainage Eng ASCE 7:412–425

    Article  Google Scholar 

  • Shirsath PB, Singh AK (2010) A comparative study of daily pan evaporation estimation using ANN, regression and climate based models. Water Resour Manag 24:1571–1581

    Article  Google Scholar 

  • Stephen JC, Stewart EH (1963) A comparison of procedures for computing evaporation and evapotranspiration. Publication 62. International Association of Scientific Hydrology, International Union of Geodynamics and Geophysics, Berkeley, CA, pp 123–133

  • Tabari H, Marofi S, Sabziparvar AA (2010) Estimation of daily pan evaporation using artificial neural network and multivariate non-linear regression. Irrig Sci 28:399–406

    Article  Google Scholar 

  • Tabari H, Hosseinzadeh Talaee P, Abghari H (2012a) Utility of coactive neuro-fuzzy inference system for pan evaporation modeling in comparison with multilayer perceptron. Meteorol Atmos Phys 116:147–154

    Article  Google Scholar 

  • Tabari H, Kisi O, Ezani A, Hosseinzadeh Talaee P (2012b) SVM, ANFIS, regression and climate based models for reference evapotranspiration modeling using limited climatic data in a semi-arid highland environment. J Hydrol 444:78–89

    Article  Google Scholar 

  • Tan SBK, Shuy EB, Chua LHC (2007) Modelling hourly and daily open-water evaporation rates in areas with an equatorial climate. Hydrol Process 21(4):486–499

    Article  Google Scholar 

  • Vapnik VN (1995) The nature of statistical learning theory. Springer, New York

    Book  Google Scholar 

  • Vapnik VN (1998) Statistical learning theory. Wiley, New York

    Google Scholar 

  • Wang WC, Chau KW, Cheng CT, Qiu L (2009) A comparison of performance of several artificial intelligence methods for forecasting monthly discharge time series. J Hydrol 374(3–4):294–306

    Article  Google Scholar 

  • Wu CL, Chau KW, Li YS (2008) River stage prediction based on a distributed support vector regression. J Hydrol 358(1–2):96–111

    Article  Google Scholar 

  • Xu ZX, Li JY (2002) Short-term inflow forecasting using an artificial neural network model. Hydrol Process 16(12):2423–2439

    Article  Google Scholar 

  • Yu XY, Liong SY (2007) Forecasting of hydrologic time series with ridge regression in feature space. J Hydrol 332(3–4):290–302

    Article  Google Scholar 

  • Zanetti SS, Sousa EF, Oliveira VPS, Almeida FT, Bernardo S (2007) Estimating evapotranspiration using artificial neural network and minimum climatological data. J Hydrol Eng 133(2):83–89

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gwo-Fong Lin.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, MC., Lin, GF. & Lin, HY. The effect of data quality on model performance with application to daily evaporation estimation. Stoch Environ Res Risk Assess 27, 1661–1671 (2013). https://doi.org/10.1007/s00477-013-0703-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-013-0703-4

Keywords

Navigation