Skip to main content
Log in

Comparison of OLS, ANN, KTRL, KTRL2, RLOC, and MOVE as Record-Extension Techniques for Water Quality Variables

  • Published:
Water, Air, & Soil Pollution Aims and scope Submit manuscript

Abstract

In this study, nine record extension techniques were explored: ordinary least squares (OLS), maintenance of variance extension techniques (MOVE1, MOVE2, MOVE3, and MOVE4), Kendall–Theil robust line (KTRL), artificial neural network (ANN), and two recently developed techniques (RLOC and KTRL2). The first technique is the robust line of organic correlation (RLOC), which is a modified version of MOVE1 with the advantage of being robust in the presence of outliers and/or deviation from normality. The second technique is a modified version of the KTRL (KTRL2) that has the advantage of being able to maintain the variance in the extended records. Water quality data from the Nile Delta monitoring network in Egypt were used to conduct an empirical experiment. The nine record extension techniques were used to extend the Chloride records using Electric Conductivity as a predictor. A comparison was carried out between the nine techniques to assess their ability to provide extended records that preserve different statistical characteristics of the observed records. Results showed that the RLOC and KTRL2 are better than other techniques in preserving the characteristics of the entire distribution. However, the ANN and KTRL techniques are superior in estimating individual water quality records. The RLOC or KTRL2 techniques are recommended for extending records of discontinued water quality variables in the Nile Delta, while the ANN and KTRL techniques are recommended for the substitution of missing values. In addition, a Monte Carlo experiment was conducted to assess the impact of the presence of outliers on the performance of the MOVE techniques as well as the KTRL2 and RLOC. Results of the Monte Carlo experiment showed that, in the presence of outliers, the KTRL2 and RLOC techniques outperform the MOVE techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  • Albek, E. (2003). Estimation of point and diffuse contaminant loads to streams by non-parametric regression analysis of monitoring data. Water, Air, and Soil Pollution, 147, 229–243.

    Article  CAS  Google Scholar 

  • ASCE Task Committee on Application of Artificial Neural Networks in Hydrology. (2000). Artificial neural networks in hydrology (II): Hydrologic applications. Journal of Hydrologic Engineering, 5, 124–137.

    Article  Google Scholar 

  • Berryman, D., Bobée, B., Cluis, D., & Haemmerli, J. (1988). Nonparametric tests for trend detection in water quality time series. Water Resources Bulletin, 24(3), 545–556.

    Article  CAS  Google Scholar 

  • Burney, S.M., Jilani, T.A., Ardil, C. (2004). A comparison of first and second order training algorithms for artificial neural networks. Paper presented at International Conference on Computational Intelligence, Int. Comput. Intell. Soc., Istanbul.

  • Conover, W. L. (1980). Practical nonparametric statistics (2nd ed.). New York: John Wiley and Sons. 493 p.

    Google Scholar 

  • Coulibaly, P., & Anctil, F. (1999). Real time short term natural waters inflow forecasting using recurrent neural networks. in Proceedings of International Joint Conference on Neural Networks, 1999. IJCNN ’99, vol. 6, 3802– 3805, IEEE Press, Piscataway, N. J.

  • Coulibaly, P., Anctil, F., & Bobée, B. (2000). Daily reservoir inflow forecasting using artificial neural networks with stopped training approach. Journal of Hydrology, 230, 244–257.

    Article  Google Scholar 

  • Déry, S. J., Mlynowski, T. J., Hernandez-Henriquez, M. A., & Straneo, F. (2011). Interannual variability and interdecadal trends in Hudson Bay streamflow. Journal of Marine System, 88, 341–351.

    Article  Google Scholar 

  • Draper, N. R., & Smith, H. (1966). Applied regression analysis (p. 736). New York: John Wiley.

    Google Scholar 

  • Granato, G.E. (2006). Kendall–Theil Robust Line (KTRLine—version 1), a visual basic program for calculating and graphing robust nonparametric estimates of linear-regression coefficients between two continuous variables: Techniques and methods of the U.S. Geological Survey, Book 4, chap. A7, 31 p.

  • Gutiérrez-Estrada, J. C., de Pedro-Sanz, E., López-Luque, R., & Pulido-Calvo, I. (2004). Comparison between traditional methods and artificial neural networks for ammonia concentration forecasting in an eel (Anguilla anguilla L.) intensive rearing system. Aquacultural Engineering, 31, 183–203.

    Article  Google Scholar 

  • Hagan, M. T., & Menhaj, M. (1994). Training feedforward networks with the Marquardt algorithm. IEEE Transactions on Neural Networks, 5(6), 989–993. doi:10.1109/72.329697.

    Article  CAS  Google Scholar 

  • Halfon, E. (1985). Regression method in ecotoxicology: A better formulation using the geometric mean functional regression. Environmental Science and Technology, 19, 747–749.

    Article  CAS  Google Scholar 

  • Harmancioglu, N.B. & Yevjevich, V. (1986). Transfer of information among water quality variables of the Potomac River, Phase III: Transferable and transferred information. Report to D.C. Water Resources Research Center of the University of the District of Columbia, Washington, DC, 81 p.

  • Harmancioglu, N. B., & Yevjevich, V. (1987). Transfer of hydrologic information among river points. Journal of Hydrology, 91, 103–118.

    Article  Google Scholar 

  • Harmancioglu, N. B., Fistikoglu, O., Ozkul, S. D., Singh, V. P., & Alpaslan, M. N. (1999). Water quality monitoring network design. Dordrecht: Kluwer Academic Publishers. 290 p.

    Book  Google Scholar 

  • Helsel, D. R., & Hirsch, R. M. (2002). Statistical methods in water resources. Amsterdam: Elsevier Science Publishers. 522 p.

    Google Scholar 

  • Hirsch, R. M. (1982). A comparison of four streamflow record extension techniques. Water Resources Research, 18(4), 1081–1088.

    Article  Google Scholar 

  • Hirsch, R. M., Alexander, R., & Smith, R. A. (1991). Selection of methods for the detection and estimation of trends in water quality. Water Resources Research, 27, 803–813.

    Article  Google Scholar 

  • Huang, W., & Foo, S. (2002). Neural network modeling of salinity variation in Apalachicola River. Water Research, 36, 356–362.

    Article  CAS  Google Scholar 

  • Jia, Y., & Culver, T. B. (2006). Bootstrapped artificial neural networks for synthetic flow generation with a small data sample. Journal of Hydrology, 2006(331), 580–590.

    Article  Google Scholar 

  • Khalil, B., & Adamowski, J. (2012). Record extension for short-gauged water quality parameters using a newly proposed robust version of the line of organic correlation technique. Hydrology and Earth System Sciences, 16, 2253–2266.

    Article  Google Scholar 

  • Khalil, B., & Ouarda, T. B. M. J. (2009). Statistical approaches used to assess and redesign surface water quality monitoring networks. Journal of Environmental Monitoring, 11, 1915–1929.

    Article  CAS  Google Scholar 

  • Khalil, B., Awadallah, A.G., Karaman, H., El-Sayed, A. (2007). Application of artificial neural networks for the prediction of water quality variables in Nile Delta. Proceedings, the NAWQAM Conference, February, 2007, Sharm-Elsheikh, Egypt.

  • Khalil, B., Ouarda, T. B. M. J., St-Hilaire, A., & Chebana, F. (2010). A statistical approach of the rationalization of water quality indicators in surface water quality monitoring networks. Journal of Hydrology, 386, 173–185.

    Article  CAS  Google Scholar 

  • Khalil, B., Ouarda, T. B. M. J., & St-Hilaire, A. (2011a). A statistical approach for the assessment and redesign of the Nile Delta drainage system water quality monitoring locations. Journal of Environmental Monitoring, 13, 2190–2205.

    Article  CAS  Google Scholar 

  • Khalil, B., Ouarda, T. B. M. J., & St-Hilaire, A. (2011b). Estimation of water quality characteristics at ungauged sites using artificial neural networks and canonical correlation analysis. Journal of Hydrology, 405, 277–287.

    Article  CAS  Google Scholar 

  • Khalil, B., Ouarda, T. B. M. J., & St-Hilaire, A. (2012). Comparison of record-extension techniques for water quality variables. Water Resources Management. doi:10.1007/s11269-012-0143-9.

    Google Scholar 

  • Koch, R. W., & Smillie, G. M. (1986). Bias in hydrologic prediction using log-transformed regression models. Water Resources Bulletin, 22(5), 717–723.

    Article  Google Scholar 

  • Koutsoyiannis, D. & Langousis, A. (2011). In: P. Wilderer & S. Uhlenbrook (eds) Precipitation, treatise on water science, 2, 27–28, Academic Press, Oxford.

  • Kruskal, W. H. (1953). On the uniqueness of the line of organic correlation. Biometrics, 9, 47–58.

    Article  Google Scholar 

  • Lettenmaier, D. P. (1988). Multivariate nonparametric tests for trend in water quality, AWRA. Water Resources Bulletin, 24(3), 505–512.

    Article  CAS  Google Scholar 

  • Maier, H. R., & Dandy, G. C. (1996). The use of artificial neural network for the prediction of water quality parameters. Water Resources Research, 32, 1013–1022.

    Article  Google Scholar 

  • Maier, H. R., & Dandy, G. C. (2001). Neural network based modelling of environmental variables: a systematic approach. Mathematical and Computer Modelling, 33, 669–682.

    Article  Google Scholar 

  • Matalas, N.C., & Jacobs, B., (1964). A correlation procedure for augmenting hydrologic data. US Geological Survey Professional Paper 434-E, pp. E1–E7.

  • Mishra, A. K., & Desai, V. R. (2006). Drought forecasting using feed forward recursive neural network. Ecological Modeling, 198, 127–138.

    Article  Google Scholar 

  • Moog, D. B., & Whiting, P. J. (1999). Streamflow record extension using power transformations and application to sediment transport. Water Resources Research, 35(1), 243–254.

    Article  Google Scholar 

  • Morrison, M.A. & Bonta, J.V. (2008). Development of duration-curve based methods for quantifying variability and change in watershed hydrology and water quality. United States Environmental Protection Agency, EPA/600/R-08/065.

  • Nevitt, J. & Tam, H.P. (1998). A comparison of robust and nonparametric estimators under the simple linear regression model: multiple linear regression viewpoints. 25, 54–69.

  • Newman, M. C. (1993). Regression analysis of log-transformed data: Statistical bias and its correction. Environmental Toxicology and Chemistry, 12, 1129–1133.

    Article  CAS  Google Scholar 

  • Olson, O., Gassmann, M., Wegerich, K., & Bauer, M. (2010). Identification of the effective water availability from streamflows in the Zerafshan river basin, Central Asia. Journal of Hydrology, 390, 190–197.

    Article  Google Scholar 

  • Raziei, T., Saghafian, B., Paulo, A. A., Pereira, L. S., & Bordi, I. (2009). Spatial patterns and temporal variability of drought in western Iran. Water Resources Management, 23, 439–455.

    Article  Google Scholar 

  • Raziei, T., Bordi, I., & Pereira, L. S. (2011). An application of GPCC and NCEP/NCAR datasets for draught variability analysis in Iran. Water Resources Management, 25, 1075–1086.

    Article  Google Scholar 

  • Robinson, R. B., Wood, M. S., Smoot, J. L., & Moore, S. E. (2004). Parametric modelling of water quality and sampling strategy in a high-altitude Appalachian stream. Journal of Hydrology, 287, 62–73.

    Article  CAS  Google Scholar 

  • Rousseeuw, P. J., & Leroy, A. M. (1987). Robust regression and outlier detection (p. 352). New York: Wiley.

    Book  Google Scholar 

  • Rumelhart, D. E., & McClelland, J. L. (1986). Parallel distributed processing: Explorations in the microstructure of cognition, foundations (Vol. 1). Cambridge: MIT Press.

    Google Scholar 

  • Ryu, J. H., Svoboda, M. D., Lenters, J. D., Tadesse, T., & Knutson, C. L. (2010). Potential extents for ENSO-driven hydrologic drought forecasts in the United States. Climatic Change, 101, 575–597.

    Article  CAS  Google Scholar 

  • Sanders, T. G., Ward, R. C., Loftis, J. C., Steele, T. D., Adrian, D. D., & Yevjevich, V. (1983). Design of networks for monitoring water quality. Littleton, Colorado: Water Resources Publications. 328 p.

    Google Scholar 

  • Sandhu, N., & Finch, R. (1996). Emulation of DWRDSM using artificial neural networks and estimation of Sacramento River flow from salinity. North Am. Water and Environment Conference, Proceeding, ASCE, New York, pp. 4335–4340.

  • Serinaldi, F., Grimaldi, S., Abdolhosseini, M., Corona, P., & Cimini, D. (2012). Testing copula regression against benchmark models for point and interval estimation of tree wood volume in beech stands. European Journal of Forest Research, 13(5), 1313–1326.

  • Shu, C., & Burn, D. H. (2004). Artificial neural network ensembles and their application in pooled flood frequency analysis. Water Resources Research, 40, W09301. doi:10.1029/2003WR002816.

    Article  Google Scholar 

  • Shu, C., & Ouarda, T. B. M. J. (2007). Flood frequency analysis at ungauged sites using artificial neural networks in canonical correlation analysis physiographic space. Water Resources Research, 43, W07438. doi:10.1029/2006WR005142.

    Article  Google Scholar 

  • Theil, H. (1950). A rank-invariant method of linear and polynomial regression analysis, 1, 2, and 3: Ned. Akad. Wentsch Proc., 53, 386–392, 521–525, and 1397–1412.

  • Vogel, R. M., & Stedinger, J. R. (1985). Minimum variance streamflow record augmentation procedures. Water Resources Research, 21(5), 715–723.

    Article  Google Scholar 

  • Yevjevich, V. & Harmancioglu, N.B. (1985). Modeling water quality variables of Potomac River at the entrance to its estuary, Phase II (correlation of water quality variables within the framework of structural analysis). Report to D.C. Water Resources Research Center of the University of the District of Columbia, Washington, DC, 59p.

  • Zhang, S. P., Watanabe, H., & Yamada, R. (1994). Prediction of daily water demand by neural networks. In K. W. Hipel (Ed.), Stochastic and statistical methods in hydrology and environmental engineering, vol. 3 (pp. 217–227). New York: Springer.

    Chapter  Google Scholar 

Download references

Acknowledgments

The authors are grateful to Prof. Shaden Abdel-Gawad, Chairperson of the National Water Research Center of Egypt, for providing the data used in this paper. Financial support provided by “Le Fonds de recherche du Québec—Nature et technologies” as well as an NSERC Discovery Grant is acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bahaa Khalil.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khalil, B., Adamowski, J. Comparison of OLS, ANN, KTRL, KTRL2, RLOC, and MOVE as Record-Extension Techniques for Water Quality Variables. Water Air Soil Pollut 225, 1966 (2014). https://doi.org/10.1007/s11270-014-1966-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11270-014-1966-1

Keywords

Navigation