Skip to main content

Advertisement

Log in

Multimodel classification and regression technique for the statistical downscaling of temperature

  • ORIGINAL PAPER
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

Human activity has increased the amount of carbon dioxide and other greenhouse gases emitted into the atmosphere, causing climate change. As a result, rising temperatures have wide-ranging consequences on water management. This study proposes downscaling daily temperature based on a modified Classification and Regression Technique with an ensemble machine learning (EML) approach at the Woodstock station in the Upper Thames River basin. The GCM Canadian Earth System Model (CanESM5) from Coupled Model Intercomparison Project-6 is used. The CanESM5 model simulated variables are used as predictors and observed baseline daily temperature as predictands. The Regression-based single machine learning (Support Vector, Tree-based and Gaussian Process Regression) and EML based statistical downscaling are applied and compared. The variable temperature states are determined using Gaussian Mixture Model clustering, and the Light Gradient Boosting Model (LightGBM) is used to classify future temperature states. Results showed that applying the EML boosted the performance by 2–25% compared to single models. The temperature states for the two projected climate scenarios (SSP126 and SSP585) were simulated by selected best-performing single and EML model combinations for the near (2026–2050) and far future (2076–2100). The findings demonstrate that the future projected temperatures may rise 1–3 °C for both scenarios and are less volatile than the observed baseline temperature. Overall, the study indicates that the ensemble approach-based downscaling combining several single models have considerably improved the performance and was more reliable.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  • Abba SI, Nourani V, Elkiran G (2019) Multi-parametric modeling of water treatment plant using AI-based non-linear ensemble. J Water Supply Res Technol AQUA 68:547–561

    Article  Google Scholar 

  • Abbasian M, Moghim S, Abrishamchi A (2019) Performance of the general circulation models in simulating temperature and precipitation over Iran. Theor Appl Climatol 135:1465–1483

    Article  Google Scholar 

  • Akhter J, Das L, Meher JK, Deb A (2019) Evaluation of different large-scale predictor-based statistical downscaling models in simulating zone-wise monsoon precipitation over India. Int J Climatol 39:465–482

    Article  Google Scholar 

  • Barzegar R, Moghaddam AA, Deo R, Fijani E, Tziritis E (2018) Mapping groundwater contamination risk of multiple aquifers using multimodel ensemble of machine learning algorithms. Sci Total Environ 621:697–712

    Article  CAS  Google Scholar 

  • Bennett FR (2019) Gradient boosting machine assisted approximate Bayesian computation for uncertainty analysis of rainfall-runoff model parameters. In: 23rd International congress on modelling and simulation

  • Breiman L (1996) Bagging predictors. Mach Learn 24:123–140

    Article  Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45:5–32

    Article  Google Scholar 

  • Breiman L, Friedman J, Olshen R, Stone CJ (1983) Classification and regression trees

  • Burger G, Murdock TQ, Werner AT, Sobie SR (2012) Downscaling extremes: an intercomparison of multiple statistical methods for present climate. J Clim 25(12):4366

    Article  Google Scholar 

  • Environment Canada (2011) National climate data and information archive. http://climate.weatheroffice.gc.ca/climateData/canada_e.html

  • Chattrairat K, Wongseree W, Leelasantitham A (2021) Comparisons of machine learning methods of statistical downscaling method: case studies of daily climate anomalies in Thailand. J Web Eng. https://doi.org/10.13052/jwe1540-9589.2057

    Article  Google Scholar 

  • Chen J, Zhang XJ (2021) Challenges and potential solutions in statistical downscaling of precipitation. Clim Change 165:63

    Article  Google Scholar 

  • Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39:1–38

    Google Scholar 

  • Desai S, Ouarda TBMJ (2021) Regional hydrological frequency analysis at ungauged sites with random forest regression. J Hydrol 594:125861

    Article  Google Scholar 

  • Dormann CF et al (2013) Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography 36:27–46

    Article  Google Scholar 

  • Duhan D, Pandey A (2015) Statistical downscaling of temperature using three techniques in the Tons River basin in Central India. Theoret Appl Climatol. https://doi.org/10.1007/s00704-014-1253-5

    Article  Google Scholar 

  • Eum H-I, Simonovic SP (2012) Assessment on variability of extreme climate events for the Upper Thames River basin in Canada. Hydrol Process 26:485–499

    Article  Google Scholar 

  • Eum HI, Gupta A, Dibike Y (2020) Effects of univariate and multivariate statistical downscaling methods on climatic and hydrologic indicators for Alberta, Canada. J Hydrol 588:125065

    Article  Google Scholar 

  • Fajardo J, Corcoran D, Roehrdanz PR, Hannah L, Marquet PA (2020) GCM compareR: a web application to assess differences and assist in the selection of general circulation models for climate change research. Methods Ecol Evol 11:656–663

    Article  Google Scholar 

  • Feurer M, Hutter F (2019) Hyperparameter optimization. In: Hutter F, Kotthoff L, Vanschoren J (eds) Automated machine learning. Springer, pp 3–33. https://doi.org/10.1007/978-3-030-05318-5_1

    Chapter  Google Scholar 

  • Freund Y, Schapire R (1996) Experiments with a new boosting algorithm. In: Proceedings of the thirteenth international conference on machine learning, pp. 148–156 Bari, Italy

  • Gao X, Shan C, Hu C, Niu Z, Liu Z (2019) An adaptive ensemble machine learning model for intrusion detection. IEEE Access 7:82512–82521

    Article  Google Scholar 

  • Ghosh S, Katkar S (2012) Modeling uncertainty resulting from multiple downscaling methods in assessing hydrological impacts of climate change. Water Resour Manag 26:3559–3579

    Article  Google Scholar 

  • Ghosh S, Mujumdar PP (2008) Statistical downscaling of GCM simulations to streamflow using relevance vector machine. Adv Water Resour 31:132–146

    Article  Google Scholar 

  • Gidden MJ et al (2019) Global emissions pathways under different socio-economic scenarios for use in CMIP6: A dataset of harmonized emissions trajectories through the end of the century. Geosci Model Dev 12:1443–1475

    Article  CAS  Google Scholar 

  • Goly A, Teegavarapu RSV, Mondal A (2014) Development and evaluation of statistical downscaling models for monthly precipitation. Earth Interact 18:1–28

    Article  Google Scholar 

  • Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction

  • He X, Luo J, Li P, Zuo G, Xie J (2020) A hybrid model based on variational mode decomposition and gradient boosting regression tree for monthly runoff forecasting. Water Resour Manag 34(2):865–884

    Article  Google Scholar 

  • Hernanz A et al (2022a) Evaluation of statistical downscaling methods for climate change projections over Spain: present conditions with perfect predictors. Int J Climatol 42:762–776

    Article  Google Scholar 

  • Hernanz A, García-Valero JA, Domínguez M, Rodríguez-Camino E (2022b) A critical view on the suitability of machine learning techniques to downscale climate change projections: illustration for temperature with a toy experiment. Atmosp Sci Lett 23:e1087

    Article  Google Scholar 

  • Ho TK (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20:832–844

    Article  Google Scholar 

  • Hood S, Swayne DA (2010) Novel model calibration technique through application of machine learning association rules. In: International congress on environmental modelling and software

  • Jing W, Yang Y, Yue X, Zhao X (2016) A comparison of different regression algorithms for downscaling monthly satellite-based precipitation over North China. Remote Sens 8:835

    Article  Google Scholar 

  • Kannan S, Ghosh S (2010) Prediction of daily rainfall state in a river basin using statistical downscaling from GCM output. Springer

    Google Scholar 

  • Kannan S, Ghosh S (2013) A non-parametric kernel regression model for downscaling multisite daily precipitation in the Mahanadi basin. Water Resour Res 49:1360–1385

    Article  Google Scholar 

  • Ke G et al (2017) LightGBM: a highly efficient gradient boosting decision tree. In: Advances in neural information processing systems vol. 30 (Curran Associates, Inc.)

  • Kim D et al (2019) Ensemble learning regression for estimating river discharges using satellite altimetry data: Central Congo River as a Testbed. Remote Sens Environ 221:741–755

    Article  Google Scholar 

  • Kim S, Alizamir M, Kim NW, Ozgur K (2020) Bayesian model averaging: a unique model enhancing forecasting accuracy for daily streamflow based on different antecedent time series. Sustainability 12(22):9720. https://doi.org/10.3390/su12229720

    Article  Google Scholar 

  • Krishan R, Sanjay J, Gnanaseelan C, Mujumdar M, Kulkarni A, Chakraborty S (2020) Assessment of climate change over the Indian region: a report of the ministry of earth sciences (MoES). Gov India. https://doi.org/10.1007/978-981-15-4327-2

    Article  Google Scholar 

  • Lange H, Sippel S (2020) Machine learning applications in hydrology. In: Levia DF et al (eds) Forest-water interactions. Springer, UK, pp 233–257. https://doi.org/10.1007/978-3-030-26086-6_10

    Chapter  Google Scholar 

  • Mandal S, Srivastav RK, Simonovic SP (2016) Use of beta regression for statistical downscaling of precipitation in the Campbell River basin British Columbia, Canada. J Hydrol 538:49–62

    Article  Google Scholar 

  • Mani M, Bandyopadhyay S, Chonabayashi S & Markandya A (2018) South Asia's Hotspots: the impact of temperature and precipitation changes on living standards. (World Bank Publications)

  • McLachlan GJ, Rathnayake S (2014) On the number of components in a Gaussian mixture model. Wires Data Min Knowl Discov 4:341–355

    Article  Google Scholar 

  • Mirzaei S, Vafakhah M, Pradhan B, Alavi SJ (2021) Flood susceptibility assessment using extreme gradient boosting (EGB), Iran. Earth Sci Inf 14(1):51–67

    Article  Google Scholar 

  • Mosavi A, Golshan M, Janizadeh S, Choubin B, Melesse AM, Dineva AA (2020) Ensemble models of GLM, FDA, MARS, and RF for flood and erosion susceptibility mapping: a priority assessment of sub-basins. Geocarto Int 37:2541

    Article  Google Scholar 

  • Naftaly U, Intrator N, Horn D (1997) Optimal ensemble averaging of neural networks. Netw Comput Neural Syst 8:283–296

    Article  Google Scholar 

  • Nagaraj M, Srivastav R (2022) Non-stationary modelling framework for regionalization of extreme precipitation using non-uniform lagged teleconnections over monsoon Asia. Stoch Environ Res Risk Assess. https://doi.org/10.1007/s00477-022-02211-4

    Article  Google Scholar 

  • Naghibi SA, Hashemi H, Berndtsson R, Lee S (2020) Application of extreme gradient boosting and parallel random forest algorithms for assessing groundwater spring potential using DEM-derived factors. J Hydrol 589:125197

    Article  Google Scholar 

  • Najafi MR, Moradkhani H, Wherry SA (2011) Statistical downscaling of precipitation using machine learning with optimal predictor selection. J Hydrol Eng 16:650–664

    Article  Google Scholar 

  • Navarro-Racines C, Tarapues J, Thornton P, Jarvis A, Ramirez-Villegas J (2020) High-resolution and biascorrected CMIP5 projections for climate change impact assessments. Sci Data 7:7

    Article  Google Scholar 

  • Nguyen PT, Ha DH, Nguyen HD, Van Phong T, Trinh PT, Al-Ansari N, Prakash I (2020) Improvement of credal decision trees using ensemble frameworks for groundwater potential modeling. Sustainability 12(7):2622

    Article  Google Scholar 

  • Ni L, Wang D, Wu J, Wang Y, Tao Y, Zhang J, Liu J (2020) Streamflow forecasting using extreme gradient boosting model coupled with Gaussian mixture model. J Hydrol 586:124901

    Article  Google Scholar 

  • Nourani V, Razzaghzadeh Z, Baghanam AH, Molajou A (2019) ANN-based statistical downscaling of climatic parameters using decision tree predictor screening method. Theor Appl Climatol 137:1729–1746

    Article  Google Scholar 

  • Pahlavan HA, Zahraie B, Nasseri M, Mahdipour Varnousfaderani A (2018) Improvement of multiple linear regression method for statistical downscaling of monthly precipitation. Int J Environ Sci Technol 15:1897–1912

    Article  Google Scholar 

  • Pasolli L, Melgani F, Blanzieri E (2010) Gaussian process regression for estimating chlorophyll concentration in subsurface waters from remote sensing data. IEEE Geosci Remote Sens Lett 7:464–468

    Article  Google Scholar 

  • Patel E, Kushwaha DS (2020) Clustering cloud workloads: K-Means vs Gaussian mixture model. Proc Comput Sci 171:158–167

    Article  Google Scholar 

  • Polikar R (2012) Ensemble learning. In: Zhang C, Ma Y (eds) Ensemble machine learning: methods and applications. Springer, pp 1–34. https://doi.org/10.1007/978-1-4419-9326-7_1

    Chapter  Google Scholar 

  • Raghav P et al (2020) Revamping extended range forecast of Indian summer monsoon. Clim Dyn 55:1–15

    Article  Google Scholar 

  • Rasmussen CE, Williams CKI (2006) Gaussian processes for machine learning. MIT Press

    Google Scholar 

  • Rehana S (2019) River water temperature modelling under climate change using support vector regression. In: Singh SK, Dhanya CT (eds) Hydrology in a changing world: challenges in modeling. Springer, pp 171–183. https://doi.org/10.1007/978-3-030-02197-9_8

    Chapter  Google Scholar 

  • Robert Mcsweeney and Zeke Hausfather (2018) Q&A: How do climate models work? Carbon Brief. https://www.carbonbrief.org/qa-how-do-climate-models-work

  • Sachindra DA, Ahmed K, Rashid MdM, Shahid S, Perera BJC (2018) Statistical downscaling of precipitation using machine learning techniques. Atmos Res 212:240–258

    Article  Google Scholar 

  • Salvi KSK, Ghosh S (2013) High-resolution multisite daily rainfall projections in India with statistical downscaling for climate change impacts assessment. J Geophys Res Atmosp 118:3557–3578

    Article  Google Scholar 

  • Sarwar R, Irwin SE, King LM, Simonovic SP (2010) Assessment of climatic vulnerability in the Upper Thames River Basin: Downscaling with SDSM. 66

  • Schapire R (1990) The strength of weak learnability. Mach Learn 5(2):197–227

    Article  Google Scholar 

  • Scrucca L, Fop M, Murphy TB, Raftery AE (2016) mclust 5: clustering, classification and density estimation using gaussian finite mixture models. R J 8:289–317

    Article  Google Scholar 

  • Sharghi E, Nourani V, Behfar N (2018) Earthfill dam seepage analysis using ensemble artificial intelligence-based modeling. J Hydroinf 20:1071–1084

    Article  Google Scholar 

  • Shashikanth K, Ghosh S (2013) Fine resolution indian summer monsoon rainfall projection with statistical downscaling. 1, 5

  • Shepard D (1968) A two-dimensional interpolation function for irregularly spaced data. In: Proceedings of the 1968 23rd ACM national conference. Pp 517–524. https://doi.org/10.1145/800186.810616

  • Simonovic SP (2010) A new methodology for the assessment of climate change impacts on a watershed scale. Clim Change Water Resour 98:9

    Google Scholar 

  • Smola AJ (1996) Regression estimation with support vector learning machines. Diplomarbeit, Technische Universit¨at M¨unchen

  • Song YH, Chung ES, Shahid S (2021) Spatiotemporal differences and uncertainties in projections of precipitation and temperature in South Korea from CMIP6 and CMIP5 general circulation models. Int J Climatol 41:5899–5919

    Article  Google Scholar 

  • Srivastav R, Simonovic S (2014) An analytical procedure for multi-site, multi-season streamflow generation using maximum entropy bootstrapping. Environ Model Softw 59:59–75

    Article  Google Scholar 

  • Sulca J, Vuille M, Timm OE, Dong B, Zubieta R (2021) Empirical-statistical downscaling of austral summer precipitation over South America, with a focus on the central Peruvian Andes and the equatorial amazon Basin. J Appl Meteorol Climatol 60:65–85

    Article  Google Scholar 

  • Sun AY, Wang D, Xu X (2014) Monthly streamflow forecasting using Gaussian Process regression. J Hydrol 511:72–81

    Article  Google Scholar 

  • Trzaska S, Schnarr E (2014) A review of downscaling methods for climate change projections. African and Latin American Resilience to Climate Change (ARCC)

  • Tabari H (2020) Climate change impact on flood and extreme precipitation increases with water availability. Sci Rep 10:13768

    Article  CAS  Google Scholar 

  • Vapnik V (1998) Statistical learning theory. Wiley, New York

    Google Scholar 

  • Xu L, Chen N, Zhang X (2018) A comparison of large-scale climate signals and the North American Multi-Model Ensemble (NMME) for drought prediction in China. J Hydrol 557:378–390

    Article  Google Scholar 

  • Zhai J, Mondal SK, Fischer T, Wang Y, Su B, Huang J, Uddin MJ (2020) Future drought characteristics through a multimodel ensemble from CMIP6 over South Asia. Atmos Res 246:105111

    Article  Google Scholar 

  • Zhao K, Popescu S, Meng X, Pang Y, Agca M (2011) Characterizing Forest canopy structure with lidar composite metrics and machine learning. Remote Sens Environ 115:1978–1996

    Article  Google Scholar 

  • Zhou ZH (2009) Ensemble learning. In: Li SZ, Jain A (eds) Encyclopedia of biometrics. Springer, pp 270–273. https://doi.org/10.1007/978-0-387-73003-5_293

    Chapter  Google Scholar 

  • Zounemat-Kermani M, Batelaan O, Fadaee M, Hinkelmann R (2021) Ensemble machine learning paradigms in hydrology: a review. J Hydrol 598:126266

    Article  Google Scholar 

Download references

Funding

The authors have not disclosed any funding.

Author information

Authors and Affiliations

Authors

Contributions

AN and RS: conceptualization and framework. AN and NM: model computation and result analysis. AN, NM and RS: results compilation, review, and editing of the manuscript. RS: Supervisor.

Corresponding author

Correspondence to Roshan Srivastav.

Ethics declarations

Conflict of interest

The authors declare they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Naitam, A., Meghana, N. & Srivastav, R. Multimodel classification and regression technique for the statistical downscaling of temperature. Stoch Environ Res Risk Assess 37, 3707–3729 (2023). https://doi.org/10.1007/s00477-023-02472-7

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-023-02472-7

Keywords

Navigation