Abstract
In this study, nine record extension techniques were explored: ordinary least squares (OLS), maintenance of variance extension techniques (MOVE1, MOVE2, MOVE3, and MOVE4), Kendall–Theil robust line (KTRL), artificial neural network (ANN), and two recently developed techniques (RLOC and KTRL2). The first technique is the robust line of organic correlation (RLOC), which is a modified version of MOVE1 with the advantage of being robust in the presence of outliers and/or deviation from normality. The second technique is a modified version of the KTRL (KTRL2) that has the advantage of being able to maintain the variance in the extended records. Water quality data from the Nile Delta monitoring network in Egypt were used to conduct an empirical experiment. The nine record extension techniques were used to extend the Chloride records using Electric Conductivity as a predictor. A comparison was carried out between the nine techniques to assess their ability to provide extended records that preserve different statistical characteristics of the observed records. Results showed that the RLOC and KTRL2 are better than other techniques in preserving the characteristics of the entire distribution. However, the ANN and KTRL techniques are superior in estimating individual water quality records. The RLOC or KTRL2 techniques are recommended for extending records of discontinued water quality variables in the Nile Delta, while the ANN and KTRL techniques are recommended for the substitution of missing values. In addition, a Monte Carlo experiment was conducted to assess the impact of the presence of outliers on the performance of the MOVE techniques as well as the KTRL2 and RLOC. Results of the Monte Carlo experiment showed that, in the presence of outliers, the KTRL2 and RLOC techniques outperform the MOVE techniques.
Similar content being viewed by others
References
Albek, E. (2003). Estimation of point and diffuse contaminant loads to streams by non-parametric regression analysis of monitoring data. Water, Air, and Soil Pollution, 147, 229–243.
ASCE Task Committee on Application of Artificial Neural Networks in Hydrology. (2000). Artificial neural networks in hydrology (II): Hydrologic applications. Journal of Hydrologic Engineering, 5, 124–137.
Berryman, D., Bobée, B., Cluis, D., & Haemmerli, J. (1988). Nonparametric tests for trend detection in water quality time series. Water Resources Bulletin, 24(3), 545–556.
Burney, S.M., Jilani, T.A., Ardil, C. (2004). A comparison of first and second order training algorithms for artificial neural networks. Paper presented at International Conference on Computational Intelligence, Int. Comput. Intell. Soc., Istanbul.
Conover, W. L. (1980). Practical nonparametric statistics (2nd ed.). New York: John Wiley and Sons. 493 p.
Coulibaly, P., & Anctil, F. (1999). Real time short term natural waters inflow forecasting using recurrent neural networks. in Proceedings of International Joint Conference on Neural Networks, 1999. IJCNN ’99, vol. 6, 3802– 3805, IEEE Press, Piscataway, N. J.
Coulibaly, P., Anctil, F., & Bobée, B. (2000). Daily reservoir inflow forecasting using artificial neural networks with stopped training approach. Journal of Hydrology, 230, 244–257.
Déry, S. J., Mlynowski, T. J., Hernandez-Henriquez, M. A., & Straneo, F. (2011). Interannual variability and interdecadal trends in Hudson Bay streamflow. Journal of Marine System, 88, 341–351.
Draper, N. R., & Smith, H. (1966). Applied regression analysis (p. 736). New York: John Wiley.
Granato, G.E. (2006). Kendall–Theil Robust Line (KTRLine—version 1), a visual basic program for calculating and graphing robust nonparametric estimates of linear-regression coefficients between two continuous variables: Techniques and methods of the U.S. Geological Survey, Book 4, chap. A7, 31 p.
Gutiérrez-Estrada, J. C., de Pedro-Sanz, E., López-Luque, R., & Pulido-Calvo, I. (2004). Comparison between traditional methods and artificial neural networks for ammonia concentration forecasting in an eel (Anguilla anguilla L.) intensive rearing system. Aquacultural Engineering, 31, 183–203.
Hagan, M. T., & Menhaj, M. (1994). Training feedforward networks with the Marquardt algorithm. IEEE Transactions on Neural Networks, 5(6), 989–993. doi:10.1109/72.329697.
Halfon, E. (1985). Regression method in ecotoxicology: A better formulation using the geometric mean functional regression. Environmental Science and Technology, 19, 747–749.
Harmancioglu, N.B. & Yevjevich, V. (1986). Transfer of information among water quality variables of the Potomac River, Phase III: Transferable and transferred information. Report to D.C. Water Resources Research Center of the University of the District of Columbia, Washington, DC, 81 p.
Harmancioglu, N. B., & Yevjevich, V. (1987). Transfer of hydrologic information among river points. Journal of Hydrology, 91, 103–118.
Harmancioglu, N. B., Fistikoglu, O., Ozkul, S. D., Singh, V. P., & Alpaslan, M. N. (1999). Water quality monitoring network design. Dordrecht: Kluwer Academic Publishers. 290 p.
Helsel, D. R., & Hirsch, R. M. (2002). Statistical methods in water resources. Amsterdam: Elsevier Science Publishers. 522 p.
Hirsch, R. M. (1982). A comparison of four streamflow record extension techniques. Water Resources Research, 18(4), 1081–1088.
Hirsch, R. M., Alexander, R., & Smith, R. A. (1991). Selection of methods for the detection and estimation of trends in water quality. Water Resources Research, 27, 803–813.
Huang, W., & Foo, S. (2002). Neural network modeling of salinity variation in Apalachicola River. Water Research, 36, 356–362.
Jia, Y., & Culver, T. B. (2006). Bootstrapped artificial neural networks for synthetic flow generation with a small data sample. Journal of Hydrology, 2006(331), 580–590.
Khalil, B., & Adamowski, J. (2012). Record extension for short-gauged water quality parameters using a newly proposed robust version of the line of organic correlation technique. Hydrology and Earth System Sciences, 16, 2253–2266.
Khalil, B., & Ouarda, T. B. M. J. (2009). Statistical approaches used to assess and redesign surface water quality monitoring networks. Journal of Environmental Monitoring, 11, 1915–1929.
Khalil, B., Awadallah, A.G., Karaman, H., El-Sayed, A. (2007). Application of artificial neural networks for the prediction of water quality variables in Nile Delta. Proceedings, the NAWQAM Conference, February, 2007, Sharm-Elsheikh, Egypt.
Khalil, B., Ouarda, T. B. M. J., St-Hilaire, A., & Chebana, F. (2010). A statistical approach of the rationalization of water quality indicators in surface water quality monitoring networks. Journal of Hydrology, 386, 173–185.
Khalil, B., Ouarda, T. B. M. J., & St-Hilaire, A. (2011a). A statistical approach for the assessment and redesign of the Nile Delta drainage system water quality monitoring locations. Journal of Environmental Monitoring, 13, 2190–2205.
Khalil, B., Ouarda, T. B. M. J., & St-Hilaire, A. (2011b). Estimation of water quality characteristics at ungauged sites using artificial neural networks and canonical correlation analysis. Journal of Hydrology, 405, 277–287.
Khalil, B., Ouarda, T. B. M. J., & St-Hilaire, A. (2012). Comparison of record-extension techniques for water quality variables. Water Resources Management. doi:10.1007/s11269-012-0143-9.
Koch, R. W., & Smillie, G. M. (1986). Bias in hydrologic prediction using log-transformed regression models. Water Resources Bulletin, 22(5), 717–723.
Koutsoyiannis, D. & Langousis, A. (2011). In: P. Wilderer & S. Uhlenbrook (eds) Precipitation, treatise on water science, 2, 27–28, Academic Press, Oxford.
Kruskal, W. H. (1953). On the uniqueness of the line of organic correlation. Biometrics, 9, 47–58.
Lettenmaier, D. P. (1988). Multivariate nonparametric tests for trend in water quality, AWRA. Water Resources Bulletin, 24(3), 505–512.
Maier, H. R., & Dandy, G. C. (1996). The use of artificial neural network for the prediction of water quality parameters. Water Resources Research, 32, 1013–1022.
Maier, H. R., & Dandy, G. C. (2001). Neural network based modelling of environmental variables: a systematic approach. Mathematical and Computer Modelling, 33, 669–682.
Matalas, N.C., & Jacobs, B., (1964). A correlation procedure for augmenting hydrologic data. US Geological Survey Professional Paper 434-E, pp. E1–E7.
Mishra, A. K., & Desai, V. R. (2006). Drought forecasting using feed forward recursive neural network. Ecological Modeling, 198, 127–138.
Moog, D. B., & Whiting, P. J. (1999). Streamflow record extension using power transformations and application to sediment transport. Water Resources Research, 35(1), 243–254.
Morrison, M.A. & Bonta, J.V. (2008). Development of duration-curve based methods for quantifying variability and change in watershed hydrology and water quality. United States Environmental Protection Agency, EPA/600/R-08/065.
Nevitt, J. & Tam, H.P. (1998). A comparison of robust and nonparametric estimators under the simple linear regression model: multiple linear regression viewpoints. 25, 54–69.
Newman, M. C. (1993). Regression analysis of log-transformed data: Statistical bias and its correction. Environmental Toxicology and Chemistry, 12, 1129–1133.
Olson, O., Gassmann, M., Wegerich, K., & Bauer, M. (2010). Identification of the effective water availability from streamflows in the Zerafshan river basin, Central Asia. Journal of Hydrology, 390, 190–197.
Raziei, T., Saghafian, B., Paulo, A. A., Pereira, L. S., & Bordi, I. (2009). Spatial patterns and temporal variability of drought in western Iran. Water Resources Management, 23, 439–455.
Raziei, T., Bordi, I., & Pereira, L. S. (2011). An application of GPCC and NCEP/NCAR datasets for draught variability analysis in Iran. Water Resources Management, 25, 1075–1086.
Robinson, R. B., Wood, M. S., Smoot, J. L., & Moore, S. E. (2004). Parametric modelling of water quality and sampling strategy in a high-altitude Appalachian stream. Journal of Hydrology, 287, 62–73.
Rousseeuw, P. J., & Leroy, A. M. (1987). Robust regression and outlier detection (p. 352). New York: Wiley.
Rumelhart, D. E., & McClelland, J. L. (1986). Parallel distributed processing: Explorations in the microstructure of cognition, foundations (Vol. 1). Cambridge: MIT Press.
Ryu, J. H., Svoboda, M. D., Lenters, J. D., Tadesse, T., & Knutson, C. L. (2010). Potential extents for ENSO-driven hydrologic drought forecasts in the United States. Climatic Change, 101, 575–597.
Sanders, T. G., Ward, R. C., Loftis, J. C., Steele, T. D., Adrian, D. D., & Yevjevich, V. (1983). Design of networks for monitoring water quality. Littleton, Colorado: Water Resources Publications. 328 p.
Sandhu, N., & Finch, R. (1996). Emulation of DWRDSM using artificial neural networks and estimation of Sacramento River flow from salinity. North Am. Water and Environment Conference, Proceeding, ASCE, New York, pp. 4335–4340.
Serinaldi, F., Grimaldi, S., Abdolhosseini, M., Corona, P., & Cimini, D. (2012). Testing copula regression against benchmark models for point and interval estimation of tree wood volume in beech stands. European Journal of Forest Research, 13(5), 1313–1326.
Shu, C., & Burn, D. H. (2004). Artificial neural network ensembles and their application in pooled flood frequency analysis. Water Resources Research, 40, W09301. doi:10.1029/2003WR002816.
Shu, C., & Ouarda, T. B. M. J. (2007). Flood frequency analysis at ungauged sites using artificial neural networks in canonical correlation analysis physiographic space. Water Resources Research, 43, W07438. doi:10.1029/2006WR005142.
Theil, H. (1950). A rank-invariant method of linear and polynomial regression analysis, 1, 2, and 3: Ned. Akad. Wentsch Proc., 53, 386–392, 521–525, and 1397–1412.
Vogel, R. M., & Stedinger, J. R. (1985). Minimum variance streamflow record augmentation procedures. Water Resources Research, 21(5), 715–723.
Yevjevich, V. & Harmancioglu, N.B. (1985). Modeling water quality variables of Potomac River at the entrance to its estuary, Phase II (correlation of water quality variables within the framework of structural analysis). Report to D.C. Water Resources Research Center of the University of the District of Columbia, Washington, DC, 59p.
Zhang, S. P., Watanabe, H., & Yamada, R. (1994). Prediction of daily water demand by neural networks. In K. W. Hipel (Ed.), Stochastic and statistical methods in hydrology and environmental engineering, vol. 3 (pp. 217–227). New York: Springer.
Acknowledgments
The authors are grateful to Prof. Shaden Abdel-Gawad, Chairperson of the National Water Research Center of Egypt, for providing the data used in this paper. Financial support provided by “Le Fonds de recherche du Québec—Nature et technologies” as well as an NSERC Discovery Grant is acknowledged.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Khalil, B., Adamowski, J. Comparison of OLS, ANN, KTRL, KTRL2, RLOC, and MOVE as Record-Extension Techniques for Water Quality Variables. Water Air Soil Pollut 225, 1966 (2014). https://doi.org/10.1007/s11270-014-1966-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11270-014-1966-1