A perceptible stacking ensemble model for air temperature prediction in a tropical climate zone

Mollick, Tajrian; Hashmi, Galib; Sabuj, Saifur Rahman

doi:10.1007/s44274-023-00014-0

A perceptible stacking ensemble model for air temperature prediction in a tropical climate zone

Research
Open access
Published: 28 September 2023

Volume 1, article number 15, (2023)
Cite this article

Download PDF

You have full access to this open access article

Discover Environment Aims and scope Submit manuscript

A perceptible stacking ensemble model for air temperature prediction in a tropical climate zone

Download PDF

Tajrian Mollick¹,
Galib Hashmi² &
Saifur Rahman Sabuj¹

682 Accesses
Explore all metrics

Abstract

Bangladesh is one of the world’s most susceptible countries to climate change. Global warming has significantly increased surface temperatures worldwide, including in Bangladesh. According to meteorological observations, the average temperature of the world has risen approximately 1.2 °C to 1.3 °C over the last century. Researchers and decision-makers have recently paid attention into the climate change studies. Climate models are used extensively throughout the nation in studies on global climate change to determine future estimates and uncertainties. This paper outlines a perceptible stacking ensemble learning model to estimate the temperature of a tropical region—Cox’s Bazar, Bangladesh. The next day’s temperature, maximum temperature, and minimum temperature are estimated based on the daily weather database collected from the weather station of Cox’s Bazar for a period of 20 years between 2001 and 2021. Five machine learning (ML) models, namely linear regression (LR), ridge, support vector regression (SVR), random forest (RF), and light gradient boosting machine (LGBM) are selected out of twelve ML models and combined to integrate the outputs of each model to attain the desired predictive performance. Different statistical schemes based on time-lag values play a significant role in the feature engineering stage. Evaluation metrics like mean absolute error (MAE), mean squared error (MSE), mean absolute percentage error (MAPE), and coefficient of determination (R²) are determined to compare the predictive performance of the models. The findings imply that the stacking approach presented in this paper prevails over the standalone models. Specifically, the study reached the highest attainable R² values (0.925, 0.736, and 0.965) for forecasting temperature, maximum temperature, and minimum temperature. The statistical test and trend analysis provide additional evidence of the excellent performance of the suggested model.

Assessment and prediction of regional climate based on a multimodel ensemble machine learning method

Article 27 April 2023

A Comparative Study of Regressors and Stacked Ensemble Model for Daily Temperature Forecasting: A Case Study of Senegal

Analyzing and forecasting climate variability in Nainital district, India using non-parametric methods and ensemble machine learning algorithms

Article 07 March 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Climate change is a major environmental threat affecting many countries globally. Food production, water availability, forest biodiversity, and livelihoods highly relate to it. According to projections made by the Intergovernmental Panel on Climate Change (IPCC), the effects of global warming on human society and the environment would vary throughout time and space [1, 2]. When considering the effects of climate change on Earth and its atmosphere, the temperature parameter is often considered to be the most important of all meteorological variables [1]. Over the last 100 years, the average air temperature near Earth's surface has risen by little under 1 °C. Global warming alters the climate of the planet and raises average temperatures all around the globe [2]. The Asian winter monsoon has diminished due to a decrease in snow cover at mid-to-high latitudes, which has raised temperatures along the East Asian coast. Asia would likely experience significantly rising trends in mean surface temperature (0.25 to 0.34 °C per decade under RCP4.5 and 0.42 to 0.6 °C per decade under RCP8.5) [2]. The World Meteorological Organization (WMO) and the United Nations Environment Program (UNEP) developed the Intergovernmental Panel on Climate Change (IPCC) to study the causes of climate change and global warming because it is widely believed that human activity is the main contributor (UNFCCC, 2005) [3]. The extraction of greenhouse gases from air conditioners, refrigerators, and other appliances, as well as the increase in atmospheric CO₂, combustion of fossil fuels, deforestation, and other factors, all contribute to global warming. Bangladesh was ranked second among Asian nations and sixth overall in the Global Risk Index 2011 for countries that are most susceptible to natural disasters due to climate change. Bangladesh contributes only 0.3% of the emissions that cause global warming due to its low energy use. But Bangladesh is one of the worst-affected countries by the effects of global warming due to its geographic characteristics. The distinctive features that set Bangladesh’s climate apart from that of other tropical regions are its high temperatures, abundant rainfall, and seasonal change [4]. Bangladesh has seen an average summer temperature of 27.5 °C over the past 30 years, which is somewhat higher than the summer average [2]. Due to the extreme poverty in this country, the problems brought on by climate change are exacerbated (ICDDR B, 2019). Action Aid’s study report identified Bangladesh as the sixth most susceptible nation to famine, hurricanes, and floods [3]. Forecasting air temperatures aids meteorologists in determining the possibility in any region of the country. Air temperature is also considered an important element in evapotranspiration, which is essential for managing water supplies and agricultural operations. Many decision-making industries, including energy, transportation, and tourism, rely on accurate air temperature forecasting. Therefore, the most important component of environmental research involving functional eco-environmental systems is precisely estimating air temperature [1]. Therefore, numerous scientists and researchers over the world are attempting several investigations and creating sophisticated mathematical models to anticipate the air temperature.

Numerical-based and machine learning (ML)-based techniques are the two primary categories of weather forecasting methods used today. Numerical-based weather prediction (NWP) models include erroneous assumptions, unclear physical parameterization, and physical correlations of parameters and mechanisms of atmospheric dynamics. The model output may need to be post-processed to improve the models’ effectiveness in practical applications. It raises the cost of calculation due to complicated mathematical formulas. However, ML-based techniques have gained popularity recently due to their lower processing costs and insensitivity to the multicollinearity of the input variables [5]. Hanoon et al. proposed different machine learning algorithms including Gradient Boosting Tree (GBT), Random forest (RF), Linear regression (LR), multi-layered perceptron neural network (MLP-NN), and radial basis function neural network (RBF-NN) for the prediction of air temperature [6]. The findings indicate that the MLP-NN exhibits commendable performance in forecasting daily temperature. Azamathulla presents artificial neural networks (ANNs) and gene expression programming (GEP) to predict the monthly atmospheric temperature in Tabuk, Saudi Arabia [7]. In previous research, individual ML models have typically been used to verify the predictions and demonstrate their superiority. A single forecast model is challenging to adapt to various weather parameters, even if it can increase forecast accuracy by modifying parameters and selecting features during the forecasting process. Numerous studies have demonstrated that developing ensemble and hybrid models by integrating multiple single forecast models can efficiently harness the benefits of various models and increase the precision and reliability of weather forecasts [8]. The daily temperatures in five cities throughout Belgium were predicted using a 2-layer spatiotemporal stacked LSTM model presented by Karevan et al. [9]. The results show that the spatiotemporal stacked LSTM outperformed stacked LSTM. Roy employs three deep neural networks: Multi-Layer Perceptron (MLP), Long Short Term Memory Network (LSTM), and a hybrid of Convolutional Neural Network (CNN) with LSTM [10]. Out of these models, the CNN + LSTM combination showcases the top performance, with LSTM closely trailing behind. Lee et al. utilized MLP, RNN, and CNN models to predict daily average, minimum, and maximum temperatures. They incorporated input features at a frequency greater than what had been employed in prior studies [11]. Notably, CNN, primarily utilized for processing satellite images rather than numeric weather data in temperature forecasting, surpasses the other models in performance. Mohammadi et al. developed some novel hybrid models combining autoregressive (AR), multi-layered perceptron (MLP), and autoregressive conditional heteroscedasticity (ARCH) to estimate minimum, maximum, and mean air temperatures in Northwestern Iran for both daily and monthly time scales. The research concludes that the hybrid MLP-AR models demonstrated the highest performance out of all the models tested [12]. Zhou employed a hybrid model [i.e., an artificial neural network hybridized with the powerful hetaeristic Honey Badger Algorithm (HBA-ANN)] for forecasting monthly temperatures in the hottest and coldest regions of the world [13]. Nketiah et al. employed RNNs to construct temperature forecast models for five Chinese cities, employing five distinct model configurations. They also implemented the Ridge Regularizer (L2) during the neural network training process to prevent both overfitting and underfitting. In addition, hyperparameters were fine-tuned using the Bayesian optimization method [14].

While some studies have successfully applied ensemble and hybrid models for temperature forecasting, there may still be untapped potential in exploring different combinations of models or ensemble techniques. Further research could focus on identifying more effective ways to integrate and leverage the strengths of different models. The studies mentioned focus on specific regions, such as Belgium and Iran, and specific global extremes. There is a research gap in understanding how these models perform in a wider range of geographical contexts, including regions with different climate characteristics or extreme weather conditions. While individual studies have proposed specific lag-based schemes, there is a gap in comprehensive comparisons across various statistical schemes based on input lags. Such a comparative analysis can provide insights into the relative performance of different approaches. Additionally, previous works have not explored different statistical tests or trend analyses to identify the most appropriate approach for a given context, which may potentially lead to unreliable results.

The majority of Bangladesh is unaffected by initiatives connected to climate change, and there has been relatively limited studies based on daily temperature forecasts undertaken in this nation. The ability to effectively train policymakers and employees for mitigation and adaptation actions depends on their understanding of the nature and scope of potential climatic changes in south-eastern Bangladesh. The study uses Cox’s Bazar, Bangladesh, a tropical climate case study, to forecast air temperature. The research area is distinct and located in the popular tourism eastern coastal region of the Bay of Bengal. The study applies an expansion of the well-known stacking model to complete the forecasting task. The model has been used to analyze a 20-year weather dataset that Bangladesh Meteorological Department (BMD) collected. In the current study, we proposed a perceptible stacking methodology to execute a hybrid scheme that combines the models—LR, Ridge, SVR, RF, and LGBM. They both have complimentary benefits and drawbacks which can be utilized in the stacking ensemble approach. The method chose a meta-learner and base-learners from 12 candidate models to create the stacking model's structure. By contrasting the stacking model with the individual models, the improvement in the performance of temperature forecast is demonstrated. We also compare three types of statistical schemes regarding historical time-series values of lagged days in the feature engineering stage. Also, statistical tests and trend analysis performed in this work ultimately enhanced the quality and reliability of our research findings.

2 Materials and methods

2.1 Methodology

A well-planned approach is essential for doing the investigation systematically. The elaborate framework used to conduct this study is shown in detail in Fig. 1. The methodology is mainly divided into two phases: Phase I: Preparing the data, and Phase II: Training the model.

2.1.1 Phase I: preparing the data

The phase contains gathering observed data (i.e., raw data collected form BMD), data formatting, data preprocessing, and train-test splitting. Initially, the observed weather data of 20 years (from January 1, 2001, to December 31, 2021) were obtained from BMD. Once the extraneous data had been removed, the data had been rearranged, and descriptive statistics had been calculated. In the data preprocessing stage, there are several steps i.e., missing value imputation, outlier handling, feature engineering and data normalization. After preprocessing, the dataset was divided into two sets: (i) Training set (80%) and (ii) Testing set (20%).

2.1.2 Phase II: training the model

The phase includes the stacking ensemble model set-up and model training along with out-of-fold cross-validation. The model is constructed with level-0 base-learners and level-1 meta-learner. Level-0 base learners were chosen form 11 candidate ML models based on a performance index.

Lastly, the performance of the predicted values of each model was evaluated. The forecasting results were compared with the test dataset of the target variable in terms of mean squared error (MSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and coefficient of determination (R²). We also performed statistical tests to distinguish the significance of the performance.

2.2 Background of machine learning models

2.2.1 Stacking ensemble learning model

An ensemble approach for ML, stacked generalization was first presented by Wolpert [15]. The stacking model allows a variety of efficient models to carry out classification or regression tasks and get predictions that outperform all individual models in the ensemble. Two or more level-0 models constitute the framework of a stacking model, jointly with a level-1 model, which incorporates the predictions of the base models. Level-0 Model (Base-learner) predictions are fitted to the training set of data. The level-1 Model (Meta-learner) gains knowledge about the best methods for incorporating the forecasts of the base models. The base-model predictions derived using data from out-of-sample are used to train the meta-learner [16].

2.2.2 Candidate machine learning models

Choosing base-learning and meta-learning combinations is a primary concern while designing stacking ensemble architecture. Stacking is suitable when several ML models have different learning skills and make distinct assumptions on the predictive modeling performance. The 12 candidate models are LR, lasso, ridge, ElasticNet, decision tree (DT), bagging regression (BR), RF, adaptive boosting (AdaBoost), LGBM, extreme gradient boosting (XGBoost), SVR, and k-nearest neighbor (KNN). Among them, all models are non-linear except LR. The meta-model is frequently straightforward, allowing for an easy interpretation of the basic model predictions. As a result, the meta-model is often a basic linear model. The base-models are selected from the remaining 11 models. Table 1 provides the characteristics of the five models selected for generating a stacking model.

Table 1 A brief overview of the key aspects of five different regression models

A perceptible stacking ensemble model for air temperature prediction in a tropical climate zone

Abstract

Similar content being viewed by others

Assessment and prediction of regional climate based on a multimodel ensemble machine learning method

A Comparative Study of Regressors and Stacked Ensemble Model for Daily Temperature Forecasting: A Case Study of Senegal

Analyzing and forecasting climate variability in Nainital district, India using non-parametric methods and ensemble machine learning algorithms

1 Introduction

2 Materials and methods

2.1 Methodology

2.1.1 Phase I: preparing the data

2.1.2 Phase II: training the model

2.2 Background of machine learning models

2.2.1 Stacking ensemble learning model

2.2.2 Candidate machine learning models

2.2.3 Repeated K-fold cross-validation

2.2.4 Grid-search cross-validation

2.3 Performance metrics

3 Experimental analysis

3.1 Study area and data acquisition

3.2 Data handling and pre-processing

3.2.1 Formatting

3.2.2 Checking

3.2.3 Pre-processing

3.2.3.1 Missing value handling

3.2.3.2 Outliers handling

3.2.3.3 Feature engineering

3.2.4 Normalization

4 Model set-up

4.1 Data partition

4.2 Stacking model construction

4.2.1 Model selection

4.2.2 Hyper-parameter optimization

4.2.3 Stacking model implementation

5 Result and discussion

5.1 Inter-comparison of model performances

5.2 Computation time analysis

5.3 Statistical test analysis

5.4 Trend analysis

6 Conclusion

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation