Validating a continental-scale groundwater diffuse pollution model using regional datasets
- 242 Downloads
In this study, we assess the validity of an African-scale groundwater pollution model for nitrates. In a previous study, we identified a statistical continental-scale groundwater pollution model for nitrate. The model was identified using a pan-African meta-analysis of available nitrate groundwater pollution studies. The model was implemented in both Random Forest (RF) and multiple regression formats. For both approaches, we collected as predictors a comprehensive GIS database of 13 spatial attributes, related to land use, soil type, hydrogeology, topography, climatology, region typology, nitrogen fertiliser application rate, and population density. In this paper, we validate the continental-scale model of groundwater contamination by using a nitrate measurement dataset from three African countries. We discuss the issue of data availability, and quality and scale issues, as challenges in validation. Notwithstanding that the modelling procedure exhibited very good success using a continental-scale dataset (e.g. R2 = 0.97 in the RF format using a cross-validation approach), the continental-scale model could not be used without recalibration to predict nitrate pollution at the country scale using regional data. In addition, when recalibrating the model using country-scale datasets, the order of model exploratory factors changes. This suggests that the structure and the parameters of a statistical spatially distributed groundwater degradation model for the African continent are strongly scale dependent.
KeywordsGroundwater nitrate Random Forest (RF) Validation Scale issue Country Africa
This study was carried out within the framework of a doctoral research programme, and has been supported by the Islamic Development Bank (IDB) under the Merit Scholarship Programme (MSP) for theses and the ‘Fonds Spécial de Recherche’ (FSR) of the Université Catholique de Louvain. Several people from across the world helped with data acquisition, namely T. Gleeson (McGill University), N. Moosdorf (Hamburg University), and M. Cissé (DGPRE/Senegal).
- Aljazzar, T. H., (2010). Adjustment of DRASTIC Vulnerability Index to Assess Groundwater Vulnerability for Nitrate Pollution Using the Advection-Diffusion Cell. Von der Fakultät für Georessourcen und Materialtechnik der Rheinisch-Westfälischen Technischen Hochschule Aachen Ph.D. thesis. 146pp.Google Scholar
- Ateawung, J. N. (2010). A GIS based water balance study of Africa. Master of physical land resources, Universiteit Gent Vrije Universiteit Brussel Belgium.55ppGoogle Scholar
- Bartram, J. and Ballance, R. [Eds] (1996). Water quality monitoring: a practical guide to the design and implementation of freshwater quality studies and monitoring programmes. Chapman and Hall, London. http://www.who.int/water_sanitation_health/resourcesquality/waterqualmonitor.pdf (Accessed online April 25th,2017).
- Bauder JW, Sinclair KN, Lund RE (1993) Physiographic and land use characteristics associated with nitrate nitrogen-nitrogen in Montana groundwater. J Environ Qual 22(2):255–262. https://doi.org/10.2134/jeq1993.00472425002200020004x Google Scholar
- Boy-Roura, M. (2013). Nitrate groundwater pollution and aquifer vulnerability: the case of the Osana region. PhD thesis. Universitat de Girona. 143ppGoogle Scholar
- Breiman, L., (2001a). Random forests. Mach. Learn. 45, 5–32. Doi: https://doi.org/10.1023/A:1010933404324. (https://link.springer.com/content/pdf/10.1023%2FA%3A1010933404324.pdf. Acccesed online June, 21st 2016).
- Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth International Group, Belmont, CaliforniaGoogle Scholar
- Chapman, D. (1996). Water quality assessments—a guide to use of biota, sediments, and water in environmental monitoring—second edition. 1996, 651 pages published on behalf of WHO by F & FN Spon. http://www.who.int/water_sanitation_health/resourcesquality/watqualassess.pdf. (accessed online March18th 2017).
- De’ath G, Fabricius KE (2000) Classification and regression trees: a powerful yet simple technique for ecological data analysis. Ecology 81(11):3178–3192. https://doi.org/10.1890/0012-9658(2000)081 [3178:CARTAP]2.0.CO;2 Google Scholar
- Donigan, A.S., Jr., and Rao, P.S.C. (1986). Examples models testing studies in vadose zone modelling of organic pollutants. Edited by S.C. Hem and S.LM Melancon, PP103–131, Lewis Publ., Chelsea, MI.Google Scholar
- El-Sadek, A. A. M. (2002). Engineering approach to water quantity and quality modelling at field and catchment scale. Ph.D. thesis. Katholieke Universiteit Leuven.251pp.Google Scholar
- Evans JS, Murphy MA, Holden ZA, Cushman SA (2011) Modelling species distribution and change using the random forest. In: Drew CA, Wiersma YF, Huettmann F (eds) Predictive species and habitat modeling in landscape ecology. Springer, New York, pp 139–159. https://doi.org/10.1007/978-1-4419-7390-0_8 Google Scholar
- Foster, S.; Garduño,H., Kemper, L., Tuinhof, A., Nanni, M., Dumars, C. (2003). Groundwater quality protection defining strategy and setting priorities. Briefing note 8.6pp. http://documents.worldbank.org/curated/en/434861468166483398/pdf/301000PAPER0BN8.pdf. Accessed online march 6th, 2017).
- Gross, E. L. (2008). Ground water susceptibility to elevated nitrate concentrations in South Middleton Township, Cumberland County, Pennsylvania. Master of Science. Shippensburg University. 117pp. http://www.ship.edu/uploadedfiles/ship/geo-ess/graduate/theses/gross_thesis_080505.pdf; accessed online July 6th, 2015).
- Gurdak JJ (2014) Groundwater vulnerability handbook of engineering hydrology. CRC Press, Taylor & Francis Group 2014:33Google Scholar
- Haller, L., McCarthy, P., O'Brien, T., Riehle, J. and Stuhldreher, T. (2013). Nitrate pollution of groundwater. 2014: alpha water systems INC.Google Scholar
- Hastie T, Tibshirani R, Friedman J (2008) The elements of statistical learning, 2nd edn. Springer. isbn:0-387-95284-5Google Scholar
- Heidema AG, Boer JMA, Nagelkerke N, Mariman ECM, van der, A.D.L., Feskens, E.J.M. (2006) The challenge for genetic epidemiologists: how to analyze large numbers of SNPs in relation to complex diseases. BMC Genet 7(1):23. https://doi.org/10.1186/1471-2156-7-23
- Heuvelink GBM, Pebesma EJ (1999) Spatial aggregation and soil process modelling. Geoderma 89: 47–65. https://doi.org/10.1016/S0016-7061(98)00077-9
- Li X, Zhai T, Jiao Y, Wang G (2015) Using Bayesian hierarchical models and random forest algorithm for habitat use studies: a case of nest site selection of the crested ibis at regional scales. PeerJ PrePrints 3:e871v1. https://doi.org/10.7287/peerj.preprints.871v1
- Liaw, A., Wiener, M., (2002). Classification and regression by random forest. Vol. 2/3, December 2002. http://www.bios.unc.edu/~dzeng/BIOS740/randomforest.pdf (accessed online April, 16th 2017).
- MacDonald, A. (2010). Groundwater, health, and livelihoods in Africa. British Geological Survey © NERC 2010 Earthwise 26, 2pp. ORAL PRESENTATION. http://nora.nerc.ac.uk/17329/1/29-30%5B1%5D.pdf (Accessed online January 28th 2016).
- MacDonald, A., M., R. Taylor, G., and H. Bonsor, C. (2013). (Eds.) Groundwater in Africa—is there sufficient water to support the intensification of agriculture from “Land Grabs”." Hand book of land and water grabs in Africa. pp 376–383Google Scholar
- MacDonald A, Davies J, Calow R (2008) African hydrogeology and rural water supply, Applied groundwater studies in Africa. IAH selected papers on hydrogeology, volume 13 (ed. by S. M. A. Adelana & a. M. MacDonald). CRC Press/Balkema, Leiden, The NetherlandsGoogle Scholar
- MacDonald AM, Davies J (2000) A brief review of groundwater for rural water supply in sub-Saharan Africa, British Geological Survey, technical report WC/00/33. Overseas Geology Series, BGS, Nottingham, UKGoogle Scholar
- Margat, J. (2010). Ressources et utilisation des eaux souterraines en Afrique. Managing Shared Aquifer Resources in Africa, Third International Conférence Tripoli 25–27 may 2008. International Hydrological Programme, Division of Water Sciences, IHP-VII Series on groundwater No.1, UNESCO, pp 26–34Google Scholar
- Mulla DJ, Addiscott TM (1999) Validation approaches for field-, basin-, and regional-scale water quality models. Assessment of non-point source pollution in the vadose zone:63–78. https://doi.org/10.1029/GM108p0063
- National Research Council (NRC), (1993). Ground water vulnerability assessment: Predictive relative contamination potential under conditions of uncertainty. National Academy Press, Washington D.C., pp. 224. ISBN: 978–0–309-04799-9Google Scholar
- Ouedraogo, I., Defourny, P., and Vanclooster, M.(2016a). Modeling groundwater nitrate concentrations at the African scale using random forest regression techniques. Accepted April 24th to review in the special issue on groundwater in sub-Saharan Africa for Hydrogeological Journal (HJ) (in progress, book expected in December 2017).Google Scholar
- Pearson S (2015) Identifying groundwater vulnerability from nitrate contamination: comparison of the DRASTIC model and environment Canterbury’s method. Lincoln University, Degree of Master of Applied Science (Environmental Management), 58 ppGoogle Scholar
- Postnote (2011). Water Adaptation in Africa. Number 373 April 2011. http://www.parliament.uk/documents/post/postpn_373-Water-Adapatation-in-Africa.pdf (Accessed online January 26th, 2016)
- Prasad AM, Iverson LR, Liaw A (2006) Newer classification and regression tree techniques: bagging and random forests for ecological prediction. Ecosystems (N.Y.), 9(2): 181–199. https://doi.org/10.1007/s10021-005-0054-1
- Refsgaard JC, Thorsen M, Jensen JB, Kleeschulte S, Hansen S (1999) Large scale modelling of groundwater contamination from nitrate leaching. J Hydrol 221(3):117–140. https://doi.org/10.1016/S0022-1694(99)00081-5
- Refsgaard, J.C., and Butts, M.B. (1999). Determination of grid scale parameters in catchment modelling by upscaling local scale parameters. Proceeding of the Int. workshop on modelling transport process in soils. EurAEng’s IG on soil and water, Leuven, Belgium, 24-26 Nov., 650-665Google Scholar
- Rodriguez-Galiano V, Mendes MP, Garcia-Soldado MJ, Chica-Olmo M, Ribeiro L (2014) Predictive modeling of groundwater nitrate pollution using random Forest and multisource variables related to intrinsic and specific vulnerability: a case study in an agricultural setting (southern Spain). Sci Total Environ 476-477:189–206. https://doi.org/10.1016/j.scitotenv.2014.01.001 Google Scholar
- Royal Society of Chemistry (RSC) (2010) Africa’s water quality. http://www.rsc.org/images/RSC_PACN_Water_Report_tcm18-176914.pdf Last accessed August 2016
- Spalding RF, Exner ME (1993) Occurrence of nitrate in groundwater- a review. J Environ Qual 22(392–402). https://doi.org/10.2134/jeq1993.00472425002200030002x
- Strebel, O., Duynisveld, W. H. M., and Böttcher, J. (1989). Nitrate pollution of groundwater in Western Europe, Agric. Ecosyst. Environ. 26, 189–214. doi.org/10.1016/0167-8809(89)90013-3
- UNEP (United Nations Environment Programme). (2010). Africa Water Atlas. Nairobi, UNEP, Division of Early Warning and Assessment (DEWA). http://na.unep.net/atlas/ africaWater/book.php.
- UNEP/DEWA, (2014). Sanitation and Groundwater Protection –a UNEP Perspective UNEP/DEWA, http://www.bgr.bund.de/EN/Themen/Wasser/Veranstaltungen/symp_sanitat-gwprotect/present_mmayi_pdf.pdf?__blob=publicationFile&v=2. 18pp (Accessed online August 14th 2014).
- Ward MH, deKok TM, Levallois P, Brender J, Gulis G, Nolan BT, VanDerslice J (2005) Workgroup report: drinking-water nitrate and health—recent findings and research needs. Environ Health Perspect 113(11):1607–1614. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1310926
- WHO (1992). GEMS/WATER Operational Guide. Third edition. World Health Organization, Geneva. 121pp. http://apps.mwho.int/iris/bitstream/10665/62446/1/GEMS_W_92.1_(part1).pdf. (Accessed online March 18th 2017)
- Yee TW, Mitchell ND (1991) Generalized additive models in plant ecology. Journal of vegetation science, 2(5), 587-602. ISO 690. https://doi.org/10.2307/3236170
- Zhao C, Liu C, Xia J, Zhang Y, Yu Q, Eamus D (2012) Recognition of key regions for restoration of phytoplankton communities in the Huai River basin, China. J Hydrol 420:292–300. https://doi.org/10.1016/j.jhydrol.2011.12.016