Abstract
Human activity has increased the amount of carbon dioxide and other greenhouse gases emitted into the atmosphere, causing climate change. As a result, rising temperatures have wide-ranging consequences on water management. This study proposes downscaling daily temperature based on a modified Classification and Regression Technique with an ensemble machine learning (EML) approach at the Woodstock station in the Upper Thames River basin. The GCM Canadian Earth System Model (CanESM5) from Coupled Model Intercomparison Project-6 is used. The CanESM5 model simulated variables are used as predictors and observed baseline daily temperature as predictands. The Regression-based single machine learning (Support Vector, Tree-based and Gaussian Process Regression) and EML based statistical downscaling are applied and compared. The variable temperature states are determined using Gaussian Mixture Model clustering, and the Light Gradient Boosting Model (LightGBM) is used to classify future temperature states. Results showed that applying the EML boosted the performance by 2–25% compared to single models. The temperature states for the two projected climate scenarios (SSP126 and SSP585) were simulated by selected best-performing single and EML model combinations for the near (2026–2050) and far future (2076–2100). The findings demonstrate that the future projected temperatures may rise 1–3 °C for both scenarios and are less volatile than the observed baseline temperature. Overall, the study indicates that the ensemble approach-based downscaling combining several single models have considerably improved the performance and was more reliable.
Similar content being viewed by others
References
Abba SI, Nourani V, Elkiran G (2019) Multi-parametric modeling of water treatment plant using AI-based non-linear ensemble. J Water Supply Res Technol AQUA 68:547–561
Abbasian M, Moghim S, Abrishamchi A (2019) Performance of the general circulation models in simulating temperature and precipitation over Iran. Theor Appl Climatol 135:1465–1483
Akhter J, Das L, Meher JK, Deb A (2019) Evaluation of different large-scale predictor-based statistical downscaling models in simulating zone-wise monsoon precipitation over India. Int J Climatol 39:465–482
Barzegar R, Moghaddam AA, Deo R, Fijani E, Tziritis E (2018) Mapping groundwater contamination risk of multiple aquifers using multimodel ensemble of machine learning algorithms. Sci Total Environ 621:697–712
Bennett FR (2019) Gradient boosting machine assisted approximate Bayesian computation for uncertainty analysis of rainfall-runoff model parameters. In: 23rd International congress on modelling and simulation
Breiman L (1996) Bagging predictors. Mach Learn 24:123–140
Breiman L (2001) Random forests. Mach Learn 45:5–32
Breiman L, Friedman J, Olshen R, Stone CJ (1983) Classification and regression trees
Burger G, Murdock TQ, Werner AT, Sobie SR (2012) Downscaling extremes: an intercomparison of multiple statistical methods for present climate. J Clim 25(12):4366
Environment Canada (2011) National climate data and information archive. http://climate.weatheroffice.gc.ca/climateData/canada_e.html
Chattrairat K, Wongseree W, Leelasantitham A (2021) Comparisons of machine learning methods of statistical downscaling method: case studies of daily climate anomalies in Thailand. J Web Eng. https://doi.org/10.13052/jwe1540-9589.2057
Chen J, Zhang XJ (2021) Challenges and potential solutions in statistical downscaling of precipitation. Clim Change 165:63
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39:1–38
Desai S, Ouarda TBMJ (2021) Regional hydrological frequency analysis at ungauged sites with random forest regression. J Hydrol 594:125861
Dormann CF et al (2013) Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography 36:27–46
Duhan D, Pandey A (2015) Statistical downscaling of temperature using three techniques in the Tons River basin in Central India. Theoret Appl Climatol. https://doi.org/10.1007/s00704-014-1253-5
Eum H-I, Simonovic SP (2012) Assessment on variability of extreme climate events for the Upper Thames River basin in Canada. Hydrol Process 26:485–499
Eum HI, Gupta A, Dibike Y (2020) Effects of univariate and multivariate statistical downscaling methods on climatic and hydrologic indicators for Alberta, Canada. J Hydrol 588:125065
Fajardo J, Corcoran D, Roehrdanz PR, Hannah L, Marquet PA (2020) GCM compareR: a web application to assess differences and assist in the selection of general circulation models for climate change research. Methods Ecol Evol 11:656–663
Feurer M, Hutter F (2019) Hyperparameter optimization. In: Hutter F, Kotthoff L, Vanschoren J (eds) Automated machine learning. Springer, pp 3–33. https://doi.org/10.1007/978-3-030-05318-5_1
Freund Y, Schapire R (1996) Experiments with a new boosting algorithm. In: Proceedings of the thirteenth international conference on machine learning, pp. 148–156 Bari, Italy
Gao X, Shan C, Hu C, Niu Z, Liu Z (2019) An adaptive ensemble machine learning model for intrusion detection. IEEE Access 7:82512–82521
Ghosh S, Katkar S (2012) Modeling uncertainty resulting from multiple downscaling methods in assessing hydrological impacts of climate change. Water Resour Manag 26:3559–3579
Ghosh S, Mujumdar PP (2008) Statistical downscaling of GCM simulations to streamflow using relevance vector machine. Adv Water Resour 31:132–146
Gidden MJ et al (2019) Global emissions pathways under different socio-economic scenarios for use in CMIP6: A dataset of harmonized emissions trajectories through the end of the century. Geosci Model Dev 12:1443–1475
Goly A, Teegavarapu RSV, Mondal A (2014) Development and evaluation of statistical downscaling models for monthly precipitation. Earth Interact 18:1–28
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction
He X, Luo J, Li P, Zuo G, Xie J (2020) A hybrid model based on variational mode decomposition and gradient boosting regression tree for monthly runoff forecasting. Water Resour Manag 34(2):865–884
Hernanz A et al (2022a) Evaluation of statistical downscaling methods for climate change projections over Spain: present conditions with perfect predictors. Int J Climatol 42:762–776
Hernanz A, García-Valero JA, Domínguez M, Rodríguez-Camino E (2022b) A critical view on the suitability of machine learning techniques to downscale climate change projections: illustration for temperature with a toy experiment. Atmosp Sci Lett 23:e1087
Ho TK (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20:832–844
Hood S, Swayne DA (2010) Novel model calibration technique through application of machine learning association rules. In: International congress on environmental modelling and software
Jing W, Yang Y, Yue X, Zhao X (2016) A comparison of different regression algorithms for downscaling monthly satellite-based precipitation over North China. Remote Sens 8:835
Kannan S, Ghosh S (2010) Prediction of daily rainfall state in a river basin using statistical downscaling from GCM output. Springer
Kannan S, Ghosh S (2013) A non-parametric kernel regression model for downscaling multisite daily precipitation in the Mahanadi basin. Water Resour Res 49:1360–1385
Ke G et al (2017) LightGBM: a highly efficient gradient boosting decision tree. In: Advances in neural information processing systems vol. 30 (Curran Associates, Inc.)
Kim D et al (2019) Ensemble learning regression for estimating river discharges using satellite altimetry data: Central Congo River as a Testbed. Remote Sens Environ 221:741–755
Kim S, Alizamir M, Kim NW, Ozgur K (2020) Bayesian model averaging: a unique model enhancing forecasting accuracy for daily streamflow based on different antecedent time series. Sustainability 12(22):9720. https://doi.org/10.3390/su12229720
Krishan R, Sanjay J, Gnanaseelan C, Mujumdar M, Kulkarni A, Chakraborty S (2020) Assessment of climate change over the Indian region: a report of the ministry of earth sciences (MoES). Gov India. https://doi.org/10.1007/978-981-15-4327-2
Lange H, Sippel S (2020) Machine learning applications in hydrology. In: Levia DF et al (eds) Forest-water interactions. Springer, UK, pp 233–257. https://doi.org/10.1007/978-3-030-26086-6_10
Mandal S, Srivastav RK, Simonovic SP (2016) Use of beta regression for statistical downscaling of precipitation in the Campbell River basin British Columbia, Canada. J Hydrol 538:49–62
Mani M, Bandyopadhyay S, Chonabayashi S & Markandya A (2018) South Asia's Hotspots: the impact of temperature and precipitation changes on living standards. (World Bank Publications)
McLachlan GJ, Rathnayake S (2014) On the number of components in a Gaussian mixture model. Wires Data Min Knowl Discov 4:341–355
Mirzaei S, Vafakhah M, Pradhan B, Alavi SJ (2021) Flood susceptibility assessment using extreme gradient boosting (EGB), Iran. Earth Sci Inf 14(1):51–67
Mosavi A, Golshan M, Janizadeh S, Choubin B, Melesse AM, Dineva AA (2020) Ensemble models of GLM, FDA, MARS, and RF for flood and erosion susceptibility mapping: a priority assessment of sub-basins. Geocarto Int 37:2541
Naftaly U, Intrator N, Horn D (1997) Optimal ensemble averaging of neural networks. Netw Comput Neural Syst 8:283–296
Nagaraj M, Srivastav R (2022) Non-stationary modelling framework for regionalization of extreme precipitation using non-uniform lagged teleconnections over monsoon Asia. Stoch Environ Res Risk Assess. https://doi.org/10.1007/s00477-022-02211-4
Naghibi SA, Hashemi H, Berndtsson R, Lee S (2020) Application of extreme gradient boosting and parallel random forest algorithms for assessing groundwater spring potential using DEM-derived factors. J Hydrol 589:125197
Najafi MR, Moradkhani H, Wherry SA (2011) Statistical downscaling of precipitation using machine learning with optimal predictor selection. J Hydrol Eng 16:650–664
Navarro-Racines C, Tarapues J, Thornton P, Jarvis A, Ramirez-Villegas J (2020) High-resolution and biascorrected CMIP5 projections for climate change impact assessments. Sci Data 7:7
Nguyen PT, Ha DH, Nguyen HD, Van Phong T, Trinh PT, Al-Ansari N, Prakash I (2020) Improvement of credal decision trees using ensemble frameworks for groundwater potential modeling. Sustainability 12(7):2622
Ni L, Wang D, Wu J, Wang Y, Tao Y, Zhang J, Liu J (2020) Streamflow forecasting using extreme gradient boosting model coupled with Gaussian mixture model. J Hydrol 586:124901
Nourani V, Razzaghzadeh Z, Baghanam AH, Molajou A (2019) ANN-based statistical downscaling of climatic parameters using decision tree predictor screening method. Theor Appl Climatol 137:1729–1746
Pahlavan HA, Zahraie B, Nasseri M, Mahdipour Varnousfaderani A (2018) Improvement of multiple linear regression method for statistical downscaling of monthly precipitation. Int J Environ Sci Technol 15:1897–1912
Pasolli L, Melgani F, Blanzieri E (2010) Gaussian process regression for estimating chlorophyll concentration in subsurface waters from remote sensing data. IEEE Geosci Remote Sens Lett 7:464–468
Patel E, Kushwaha DS (2020) Clustering cloud workloads: K-Means vs Gaussian mixture model. Proc Comput Sci 171:158–167
Polikar R (2012) Ensemble learning. In: Zhang C, Ma Y (eds) Ensemble machine learning: methods and applications. Springer, pp 1–34. https://doi.org/10.1007/978-1-4419-9326-7_1
Raghav P et al (2020) Revamping extended range forecast of Indian summer monsoon. Clim Dyn 55:1–15
Rasmussen CE, Williams CKI (2006) Gaussian processes for machine learning. MIT Press
Rehana S (2019) River water temperature modelling under climate change using support vector regression. In: Singh SK, Dhanya CT (eds) Hydrology in a changing world: challenges in modeling. Springer, pp 171–183. https://doi.org/10.1007/978-3-030-02197-9_8
Robert Mcsweeney and Zeke Hausfather (2018) Q&A: How do climate models work? Carbon Brief. https://www.carbonbrief.org/qa-how-do-climate-models-work
Sachindra DA, Ahmed K, Rashid MdM, Shahid S, Perera BJC (2018) Statistical downscaling of precipitation using machine learning techniques. Atmos Res 212:240–258
Salvi KSK, Ghosh S (2013) High-resolution multisite daily rainfall projections in India with statistical downscaling for climate change impacts assessment. J Geophys Res Atmosp 118:3557–3578
Sarwar R, Irwin SE, King LM, Simonovic SP (2010) Assessment of climatic vulnerability in the Upper Thames River Basin: Downscaling with SDSM. 66
Schapire R (1990) The strength of weak learnability. Mach Learn 5(2):197–227
Scrucca L, Fop M, Murphy TB, Raftery AE (2016) mclust 5: clustering, classification and density estimation using gaussian finite mixture models. R J 8:289–317
Sharghi E, Nourani V, Behfar N (2018) Earthfill dam seepage analysis using ensemble artificial intelligence-based modeling. J Hydroinf 20:1071–1084
Shashikanth K, Ghosh S (2013) Fine resolution indian summer monsoon rainfall projection with statistical downscaling. 1, 5
Shepard D (1968) A two-dimensional interpolation function for irregularly spaced data. In: Proceedings of the 1968 23rd ACM national conference. Pp 517–524. https://doi.org/10.1145/800186.810616
Simonovic SP (2010) A new methodology for the assessment of climate change impacts on a watershed scale. Clim Change Water Resour 98:9
Smola AJ (1996) Regression estimation with support vector learning machines. Diplomarbeit, Technische Universit¨at M¨unchen
Song YH, Chung ES, Shahid S (2021) Spatiotemporal differences and uncertainties in projections of precipitation and temperature in South Korea from CMIP6 and CMIP5 general circulation models. Int J Climatol 41:5899–5919
Srivastav R, Simonovic S (2014) An analytical procedure for multi-site, multi-season streamflow generation using maximum entropy bootstrapping. Environ Model Softw 59:59–75
Sulca J, Vuille M, Timm OE, Dong B, Zubieta R (2021) Empirical-statistical downscaling of austral summer precipitation over South America, with a focus on the central Peruvian Andes and the equatorial amazon Basin. J Appl Meteorol Climatol 60:65–85
Sun AY, Wang D, Xu X (2014) Monthly streamflow forecasting using Gaussian Process regression. J Hydrol 511:72–81
Trzaska S, Schnarr E (2014) A review of downscaling methods for climate change projections. African and Latin American Resilience to Climate Change (ARCC)
Tabari H (2020) Climate change impact on flood and extreme precipitation increases with water availability. Sci Rep 10:13768
Vapnik V (1998) Statistical learning theory. Wiley, New York
Xu L, Chen N, Zhang X (2018) A comparison of large-scale climate signals and the North American Multi-Model Ensemble (NMME) for drought prediction in China. J Hydrol 557:378–390
Zhai J, Mondal SK, Fischer T, Wang Y, Su B, Huang J, Uddin MJ (2020) Future drought characteristics through a multimodel ensemble from CMIP6 over South Asia. Atmos Res 246:105111
Zhao K, Popescu S, Meng X, Pang Y, Agca M (2011) Characterizing Forest canopy structure with lidar composite metrics and machine learning. Remote Sens Environ 115:1978–1996
Zhou ZH (2009) Ensemble learning. In: Li SZ, Jain A (eds) Encyclopedia of biometrics. Springer, pp 270–273. https://doi.org/10.1007/978-0-387-73003-5_293
Zounemat-Kermani M, Batelaan O, Fadaee M, Hinkelmann R (2021) Ensemble machine learning paradigms in hydrology: a review. J Hydrol 598:126266
Funding
The authors have not disclosed any funding.
Author information
Authors and Affiliations
Contributions
AN and RS: conceptualization and framework. AN and NM: model computation and result analysis. AN, NM and RS: results compilation, review, and editing of the manuscript. RS: Supervisor.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Naitam, A., Meghana, N. & Srivastav, R. Multimodel classification and regression technique for the statistical downscaling of temperature. Stoch Environ Res Risk Assess 37, 3707–3729 (2023). https://doi.org/10.1007/s00477-023-02472-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00477-023-02472-7