1 Introduction

The World Meteorological Organization (WMO) defines fog as a suspension of very small, usually microscopic, water or ice droplets that reduces horizontal visibility at the earth's surface to less than one kilometre [1, 2]. As per Indian national practice, however, fog is reported whenever the visibility is 1000 m or less and the relative humidity is 90% or more [3]. Transportation [4, 5] and aviation services at airports [6,7,8,9,10] are significantly impacted by the low visibility conditions brought on by the onset of fog. Fog limits visibility to below about 1000 m, making it challenging for pilots to locate the airport [11]. Instrument Landing System (ILS) technology has evolved continuously, but fog and poor visibility still pose a significant challenge to an airport's ability to function normally. When fog sets in, airports may have to close or curtail their operations, which can have serious financial repercussions [12,13,14]. Flight delays, cancellations, and diversions are common when dense fog (surface visibility ≤ 200 m) occurs, adding to the misery of passengers and increasing costs for airlines. In December 2017, 21 flights (the most in a single month) were diverted because of poor visibility at JPNI Airport, Patna.

In northern and northeastern India, especially in the Indo-Gangetic Plain (IGP), December, January, and February are foggy months [15, 16]. Recent fog research in India has found a worrying increase in fog occurrence, and in the associated pollution, during December, January, and February [17, 18]. Numerical Weather Prediction (NWP) models, which are used by scientists throughout the world to predict the weather, are also used to predict visibility [19,20,21]. Many studies have tested high-resolution NWP models for fog prediction [22]. Predicting fog is challenging because of its small spatial scale and our limited understanding of the atmospheric processes involved, which are so complicated that NWP models fail to capture many aspects of fog formation [21, 23]. Because low-visibility events are highly local, predicting them with numerical weather prediction is difficult. Fog formation is sensitive to small-scale changes in meteorological factors (wind or atmospheric stability) that many existing models do not resolve.

In this study, we focus on machine learning (ML) approaches for low-visibility prediction because of these limitations of NWP models. We review statistical and machine learning approaches for low-visibility prediction, which have shown promising results. In the 1980s, researchers proposed using linear regression to make fog predictions, although with limited success [24]. In the past decade, ML-based algorithms have been applied effectively to several fog prediction problems to address the constraints of conventional (e.g., linear) techniques. A multi-layer perceptron (MLP) approach was tested at Canberra International Airport in Australia using measured meteorological parameters [25]; the Australian Bureau of Meteorology's data were used to train and test the neural network model. A fuzzy logic-based fog prediction system was proposed and analyzed at Perth airport [26]. In [27], two different fog forecasting systems were combined by majority voting, and a decision-tree induction ML technique was suggested for fog event prediction in Dubai, improving on numerical model approaches. The performance of MLPs trained with backpropagation was examined for a fog event prediction problem in Brazil [28]. A Bayesian network was applied to fog prediction at Melbourne airport [29]; in this case, an 8-h prediction horizon was used and the network was trained with 34 years of data, yielding better results than the previous system at that airport. Different ML regression algorithms have been explored at Valladolid Airport in Spain [30]. ML- and ANN-based approaches have successfully predicted low visibility and fog-related extreme events; examples include support vector regression and Extreme Learning Machines (ELM) [31, 32], the Bayesian decision method [29, 33, 34], neural networks [25, 35,36,37,38,39], regression algorithms [40], and ordinal regression techniques [7]. Other studies examine the meteorological causes of fog [18, 41, 42]. Some studies apply a decision tree algorithm to the NWP model's output weather data to predict fog [43], while others use techniques based on synoptic weather observation data [44]. These studies focus on individual events. Data-driven AI has the potential to revolutionize weather forecasting, including location-specific fog prediction. ML prediction approaches are objective and independent of forecasters' subjectivity, and faster delivery of weather predictions also facilitates aircraft operations, which require precision and efficiency. Location-specific weather forecasting has not been studied extensively in India, yet customized services require reliable location-specific visibility predictions, and such forecasts may reduce weather-related costs. Predicting whether the formation or dissipation of fog will create low or reduced visibility is, thus, a critical issue with far-reaching repercussions for weather services, transportation, and aviation.

To that end, this study aims to:

  • Characterize fog events specific to JPNI Airport, located in the Indo-Gangetic Plain (IGP) region.

  • Evaluate the effectiveness of various ML approaches in predicting fog (surface visibility ≤ 1000 m) and dense fog (surface visibility ≤ 200 m), and propose a method for making precise predictions of both.

Therefore, the objectives of this research paper are highly accurate fog and dense fog prediction, together with the local characterization of fog events. The rest of the paper is organized as follows: the next section describes the fog events database specific to JPNI Airport, Patna, India, along with the study area. The methodology is discussed in Sect. 3. Section 4 presents comprehensive experimental results and comparisons of the different ML approaches considered for fog prediction. A discussion of the proposed ML approaches is given in Sect. 5. Section 6 closes the paper with conclusions and remarks on the research carried out.

2 Measurement site and data

We consider hourly synoptic weather observations for Jay Prakash Narayan International (JPNI) Airport, Patna (25.5947° N, 85.0908° E) (shown in Fig. 1), and its surrounding areas, which are prone to frequent low-visibility events caused by local features as well as fog patterns associated with western disturbances (WD). Patna Airport has an elevation of only 52 m on the alluvial plains of the Ganga basin. Its location on the southern bank of the Ganges is significant for fog formation, given the prevailing winds. From November through February, WDs often cause radiation fog and advection fog in this region [18, 45]. Extreme fog in this region frequently causes airport closures or significant delays, with far-reaching economic and social consequences. We therefore focused on one of the Indo-Gangetic Plain (IGP) airports that experiences fog very often (as indicated in Fig. 1).

Fig. 1

Geographical location of Jay Prakash Narayan International Airport, Patna: (a) India; (b) the capital city of Patna and its airport

In order to anticipate low visibility (fog or dense fog) in the following hours, we use the observations of the current hour (\(x_n\)) and of the previous hours (\(x_{n-1}\) to \(x_{n-7}\)) to predict the target value at the next hour (\(y_{n+1}\)) for a one-hour lead time, at (\(y_{n+2}\)) for a two-hour lead time, and so on. We trained machine learning models (algorithms) to predict fog (surface visibility ≤ 1000 m) and dense fog (surface visibility ≤ 200 m) using synoptic hourly meteorological parameters that represent the availability of moisture and its distribution at the surface and in the lower boundary layer, including dry bulb temperature, dew point temperature, wind speed, wind direction, relative humidity, and cloud amount. The selected input variables and the current value of the predictive variables (fog or dense fog) do not have a direct or linear relationship, indicating that non-linear prediction models are more appropriate for this dataset if reliable predictions are to be produced. However, it is important to remember that the linear correlation between variables is merely an indicative parameter and should be treated with caution, given the non-linear interactions between the meteorological variables that cause fog events.
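As an illustration of this setup, the sketch below shows one way the lagged predictor/target pairs could be assembled with pandas. The file name and column names (e.g., visibility_m, dry_bulb_temp) are placeholders for the actual JPNI dataset, not its real schema.

```python
import pandas as pd

# Hypothetical hourly synoptic file for JPNI Airport; names are illustrative only.
df = (pd.read_csv("jpni_patna_hourly.csv", parse_dates=["datetime"])
        .set_index("datetime")
        .sort_index())

predictors = ["dry_bulb_temp", "dew_point_temp", "wind_speed",
              "wind_direction", "relative_humidity", "cloud_amount"]
n_lags = 7        # use x_n and x_{n-1} ... x_{n-7}
lead = 1          # predict y_{n+1}; set lead = 2 for a two-hour lead time, etc.

# Lagged copies of every predictor: <name>_lag0 is the current hour x_n.
lagged = pd.concat(
    {f"{col}_lag{k}": df[col].shift(k) for col in predictors for k in range(n_lags + 1)},
    axis=1,
)

# Binary targets at hour n+lead: fog (<= 1000 m) and dense fog (<= 200 m).
vis_future = df["visibility_m"].shift(-lead)
lagged["fog"] = (vis_future <= 1000).astype(int)
lagged["dense_fog"] = (vis_future <= 200).astype(int)

# Keep rows with a complete lag window and a valid future visibility value.
data = lagged[vis_future.notna()].dropna()
```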

In this study, meteorological data from December, January, and February of the 2014–2015 to 2020–2021 winters (21 months) were used to train the models, and meteorological data from the 2021–2022 winter (3 months) were used to test the best-performing machine learning (ML) approaches (at level 0 and level 1). In the train and test datasets, the fog/no fog ratio is 140/493 and 43/47, respectively. Similarly, the dense fog/no dense fog ratio in the train and test datasets is 51/582 and 11/79, respectively.

3 Methodology

The fundamental principles underlying all of these methods can be reduced to the concept of a dataset \(\{(x_i, y_i)\}_{i=1}^{N}\), where \(x = (x_1, \ldots, x_d)\) is the vector of descriptive (input) variables and \(y\) is the corresponding response variable (label). The purpose of these estimation procedures is to reconstruct the unknown functional dependence \(x \xrightarrow{f} y\) with an estimate \(\hat{f}(x)\) such that the value of a loss function \(\Psi(y, f)\) is minimized:

$$\hat{f}(x) = y$$
(1)
$$\hat{f}(x) = \underset{f(x)}{\arg \min }\;\Psi \big(y, f(x)\big)$$
(2)

Restating the problem as minimizing the expected loss \(E_{y}\big(\Psi[y, f(x)]\big)\) over the distribution of the response variable, conditioned on the observed descriptive data x, gives the equivalent expression shown in Eq. (3).

$$\hat{f}(x) = \underset{f(x)}{\arg \min }\;E_{x}\Big[E_{y}\big(\Psi [y, f(x)]\big)\;\Big|\;x\Big]$$
(3)

The response y can come from a skewed distribution, and a variety of loss functions can accommodate this. In classification problems with a binary response (y = 0, 1), the binomial loss can readily be used. In Fig. 2, we present a block schematic of the entire process by which our proposed machine-learning prediction model operates; the detailed methodology is described in Algorithm 1 below. We implemented the described algorithms in Python through h2o and Anaconda.

Fig. 2

Process block diagram of the proposed machine learning prediction model

Algorithm 1 (figure a): Detailed methodology of the proposed ML prediction model
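As a minimal sketch of this implementation choice (not the study's exact configuration), the snippet below initializes h2o, encodes the binary fog label as a factor, and fits a GLM with a binomial family, i.e. the binomial loss discussed above. The file and column names are placeholders.

```python
import h2o
from h2o.estimators.glm import H2OGeneralizedLinearEstimator

h2o.init()

# Assumed pre-built frame of lagged predictors and binary fog targets (see Sect. 2).
hf = h2o.import_file("jpni_fog_training_frame.csv")
hf["fog"] = hf["fog"].asfactor()              # binary response (0/1)

predictors = [c for c in hf.columns if c not in ("fog", "dense_fog")]

# GLM with a binomial family, i.e. logistic regression minimizing the binomial loss.
glm = H2OGeneralizedLinearEstimator(family="binomial", lambda_search=True, seed=1)
glm.train(x=predictors, y="fog", training_frame=hf)
```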

More specifically, the novelty and significant contributions of this research paper are as follows:

  • We offer a thorough benchmark for the prediction of low-visibility events caused by fog (with intensity scale), taking into account a wide range of comparisons among relevant ML approaches.

  • This is one of the first attempts at a statistical characterization of fog (on an intensity scale) at JPNI Airport in Patna, India, located in the Indo-Gangetic Plain (IGP), one of the foggiest regions.

  • Assessment and optimal tuning of a wide range of relevant ML techniques (six in total) at level 0, and subsequent stacked ensembling of these models at level 1, provide robust ML approaches with optimum performance and high accuracy at different lead times (from 6 h down to 1 h).

  • This research paper comprehensively compares the usability of a wide range of machine learning classification algorithms, namely GBM, DRF, GLM, XRT, XGBoost, and a Feed Forward Neural Network (at level 0), and a Stacked Ensemble (at level 1). Finally, it proposes the ML technique with the best trade-off among three meta-algorithm approaches: bagging (which decreases variance), boosting (which decreases bias), and stacking (which improves predictive power). Beyond identifying the best individual prediction algorithm, the ensemble at level 1 is also a convenient way to combine the information that the level-0 techniques provide.

  • Based on the results of this study, we found that the proposed ML techniques achieved high statistical skill scores and performance metrics and that their output was stable.

3.1 Supervised ML methods

Ensemble ML methods aim to train a large number of relevant base ML models and then combine their predictions to boost the accuracy of the resulting model or to make it more robust and generalizable. Ensemble approaches, whose learning paradigm may be summed up as "better together," frequently outperform the predictions of other, more sophisticated machine learning methods, such as highly complex ANNs. The individual learners are the ensemble's building blocks. Although there is a wide variety of ensemble algorithms to choose from, most of them can be grouped into the categories of bagging, boosting, and stacking.

3.1.1 Random forest

For classification problems, Random Forest (RF) [46, 47] is one of the best-known bagging-like algorithms. Decision trees are used as the learners, and the training subsets are created with the bootstrap aggregating methodology, as in bagging, but with the added feature of dynamic tree topology. From a theoretical standpoint, this departs from the pure bagging paradigm because the trees in the ensemble (the forest) may differ in depth, topology, or input variables, dramatically increasing the diversity of the learners.
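For reference, a minimal sketch of how the two random-forest-style level-0 learners (DRF and XRT) are typically configured in h2o is given below, reusing the frame hf and the predictors list from the earlier sketch; the hyper-parameter values are illustrative rather than the tuned ones.

```python
from h2o.estimators.random_forest import H2ORandomForestEstimator

# Distributed Random Forest: bagged trees with random feature sub-sampling per split.
drf = H2ORandomForestEstimator(ntrees=300, max_depth=20, seed=1)

# Extremely Randomized Trees (XRT): the same estimator with randomized split thresholds.
xrt = H2ORandomForestEstimator(ntrees=300, max_depth=20,
                               histogram_type="Random", seed=1)

drf.train(x=predictors, y="fog", training_frame=hf)
xrt.train(x=predictors, y="fog", training_frame=hf)
```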

3.1.2 Bagging

Bootstrap aggregation, or bagging, is a more straightforward ensemble technique for training many learners and combining them into a consistent output. Learners are bagged together because their topology, number of input–output variables, and parameters are all the same. The most common learners used to build a bagging ensemble are decision trees with the same number of branches, the same training parameters, and the same input–output variables. Figure 3 depicts the majority-vote decision procedure (in classification) [48] used to generate the output of such an ensemble model; a minimal code sketch follows the figure.

Fig. 3

Presentation of bagging technique for classification problems
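As a generic sketch of the technique (not the exact configuration used in this study), a bagging ensemble of identical decision trees with majority voting can be written in a few lines with scikit-learn; the toy data below only stands in for the lagged meteorological features.

```python
import numpy as np
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier

# Toy data standing in for the lagged meteorological features (illustrative only).
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 10))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)

# Identical base trees, each fit on a bootstrap resample of the training data;
# the ensemble predicts by majority vote across the trees.
bagging = BaggingClassifier(DecisionTreeClassifier(max_depth=5),
                            n_estimators=100, bootstrap=True, random_state=0)
bagging.fit(X, y)
print(bagging.predict(X[:5]))
```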

3.1.3 Boosting

Boosting algorithms are a special kind of ensemble algorithm with their own way of incorporating new information: the learners are trained sequentially, with each new learner concentrating on the examples that the current ensemble handles poorly. They are effective classifiers [49]. The ensemble consists of a set of learners, each with its own set of learning parameters or input–output variables.
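A brief sketch of how the boosting learners used at level 0 (GBM and XGBoost) could be instantiated in h2o is shown below, again reusing hf and predictors from the earlier sketches; the hyper-parameters are illustrative, and the XGBoost estimator requires h2o's XGBoost backend.

```python
from h2o.estimators.gbm import H2OGradientBoostingEstimator
from h2o.estimators.xgboost import H2OXGBoostEstimator

# Gradient boosting: trees are grown sequentially, each one correcting its
# predecessors, with a small learning rate shrinking every step (reduces bias).
gbm = H2OGradientBoostingEstimator(ntrees=300, learn_rate=0.05, max_depth=5,
                                   distribution="bernoulli", seed=1)
xgb = H2OXGBoostEstimator(ntrees=300, learn_rate=0.05, max_depth=5, seed=1)

gbm.train(x=predictors, y="fog", training_frame=hf)
xgb.train(x=predictors, y="fog", training_frame=hf)
```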

3.1.4 Stacking

To create an ensemble, a stacked ensemble "stacks" the predictions of numerous base classifiers using a meta-learner. Stacking, also known as "super learning" [50] or "stacked regression" [47], refers to a family of algorithms in which a "meta-learner" is trained at a higher level to determine the best possible combination of the learners at the lower level. By combining the predictions of multiple base estimators, this ensemble approach boosts the robustness and generalizability of a single predictor. This study proposes a stacked ensemble model consisting of XGBoost, GLM, GBM, and XRT. With this method, the final prediction of the ensemble is made by training a meta-model at a higher level (level 1) on the cross-validated (out-of-fold) predictions of the base models at the lower level (level 0).
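A sketch of how such a two-level stack could be assembled in h2o is given below, reusing hf and predictors from the earlier sketches. Each level-0 learner is trained with the same cross-validation folds and keeps its out-of-fold predictions, and the level-1 meta-learner (a GLM here) is trained on those predictions; all hyper-parameter values are illustrative rather than the tuned ones.

```python
from h2o.estimators.gbm import H2OGradientBoostingEstimator
from h2o.estimators.random_forest import H2ORandomForestEstimator
from h2o.estimators.xgboost import H2OXGBoostEstimator
from h2o.estimators.glm import H2OGeneralizedLinearEstimator
from h2o.estimators.stackedensemble import H2OStackedEnsembleEstimator

# Shared cross-validation settings so the out-of-fold predictions line up.
common = dict(nfolds=5, fold_assignment="Modulo",
              keep_cross_validation_predictions=True, seed=1)

base_models = [
    H2OGradientBoostingEstimator(ntrees=300, learn_rate=0.05, **common),
    H2OXGBoostEstimator(ntrees=300, learn_rate=0.05, **common),
    H2ORandomForestEstimator(ntrees=300, histogram_type="Random", **common),  # XRT
    H2OGeneralizedLinearEstimator(family="binomial", **common),
]
for m in base_models:
    m.train(x=predictors, y="fog", training_frame=hf)

# Level-1 meta-learner trained on the level-0 out-of-fold predictions.
stack = H2OStackedEnsembleEstimator(base_models=base_models,
                                    metalearner_algorithm="glm", seed=1)
stack.train(x=predictors, y="fog", training_frame=hf)
```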

3.2 ANN-based models

In this section, we also briefly describe the ANN-based approaches that were investigated in this study.

3.2.1 ANN based learning

To classify and predict complex data sets, we use information-processing algorithms called neural networks. Starting from a set of features (input variables) with predetermined labels, feedforward neural networks are trained to make predictions (classification). The internal layers combine inputs and weights in a directed (training-based), non-linear fashion. Although there are several established algorithms for training such a system, the backpropagation technique used here is among the most effective. It adjusts the weights of each perceptron unit across the input layer, hidden layers, and output layer; in other words, the multi-layer feed-forward neural network is trained with stochastic gradient descent via backpropagation. Prediction accuracy can be improved further with more sophisticated techniques, including an adaptive learning rate, rate annealing, momentum training, dropout, standard L1 or L2 regularization, checkpointing, and grid search [51]. Such networks can also learn data features automatically [52].
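For reference, a feed-forward network with the regularization options mentioned above can be configured in h2o roughly as follows, reusing hf and predictors from the earlier sketches; the layer sizes, dropout ratios, and penalty strengths are illustrative rather than the tuned settings.

```python
from h2o.estimators.deeplearning import H2ODeepLearningEstimator

# Multi-layer feed-forward network trained by backpropagation with an
# adaptive learning rate, dropout, and L1/L2 regularization.
dl = H2ODeepLearningEstimator(hidden=[64, 64],
                              activation="RectifierWithDropout",
                              hidden_dropout_ratios=[0.2, 0.2],
                              input_dropout_ratio=0.1,
                              l1=1e-5, l2=1e-5,
                              adaptive_rate=True,
                              epochs=100, seed=1)
dl.train(x=predictors, y="fog", training_frame=hf)
```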

3.3 Statistical skill score

The statistical analysis of the skill scores was carried out using a contingency table (shown in Table 1) with the logic "if fog (surface visibility ≤ 1000 m) occurs, then 1; otherwise 0 (surface visibility greater than 1000 m)". The same applies to instances of dense fog (surface visibility ≤ 200 m). This evaluation considers the following statistical skill scores: accuracy, selectivity, and F1 score.

$$Accuracy = \frac{TP + TN}{TP + TN + FP + FN}$$
(4)
$$Specificity\;or\;Selectivity = \frac{TN}{TN + FP}$$
(5)
$$F_{1} = \frac{2}{\frac{1}{recall} + \frac{1}{precision}}$$
(6)
Table 1 Model contingency table for computation of forecast quality
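As a small illustrative helper (not part of the original pipeline), the skill scores of Eqs. (4)-(6) can be computed directly from the contingency-table counts:

```python
def skill_scores(tp, fp, fn, tn):
    """Accuracy, selectivity (specificity), hit rate (recall) and F1 from a 2x2 contingency table."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)        # Eq. (4)
    selectivity = tn / (tn + fp)                      # Eq. (5)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)                           # hit rate
    f1 = 2 / (1 / recall + 1 / precision)             # Eq. (6)
    return {"accuracy": accuracy, "selectivity": selectivity,
            "hit_rate": recall, "F1": f1}

# Example with purely illustrative counts (not results from this study):
print(skill_scores(tp=40, fp=7, fn=3, tn=40))
```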

3.4 Performance classification metrics

The outcomes of the machine-learning experiments on a testing data set are presented in order to assess the feasibility of using such algorithms to estimate low visibility. For the N samples of the testing data set, the predicted values are compared with the actual values, and the proposed algorithms are compared using the verification scores listed in Table 2.

Table 2 List of verification scores used in the study
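Once a model is trained (for instance the stacked ensemble sketched in Sect. 3.1.4), h2o exposes most of these verification scores directly from the test-set performance object. The snippet below is a sketch reusing the h2o session and the stack model from the earlier examples; the test file name is a placeholder.

```python
# Held-out test frame prepared in the same way as the training frame.
test = h2o.import_file("jpni_fog_test_frame.csv")
test["fog"] = test["fog"].asfactor()

perf = stack.model_performance(test_data=test)

print("AUC   :", perf.auc())
print("Gini  :", perf.gini())                    # Gini index = 2*AUC - 1
print("AUCPR :", perf.aucpr())
print("MPCE  :", perf.mean_per_class_error())
print(perf.confusion_matrix())
```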

4 Results

4.1 Statistical characterization of fog at JPNI Airport Patna

In this subsection, we offer a statistical description of low visibility/fog at JPNI Airport, Patna. Out of 1210 days during the study period 2014–2022 (November to March), 604 days had surface visibility below 1000 m, indicating fog, and 166 days had surface visibility below 200 m, indicating dense fog. This information was used to propose comprehensive and generalized state-of-the-art ML techniques with the best trade-off. Figure 4 displays the average number of low visibility and fog hours at JPNI Airport Patna from 2014 to 2022, while Fig. 5 displays the average number of days with low visibility and fog (with the intensity scale).

Fig. 4

Statistical analysis of the number of foggy hours for November to March for JPNI Airport Patna

Fig. 5

Statistical analysis of the number of foggy days for November to March for JPNI Airport Patna (a day is counted as foggy if visibility ≤ 1000 m occurs at least once during the day)

Fig. 6

Comparison of the performance of the ML approaches for the prediction of fog (surface visibility ≤ 1000 m) and dense fog (surface visibility ≤ 200 m) for a lead time of 02 h. Blue indicates the AUC and Gini index for the prediction of fog and of dense fog; green indicates the MPCE and AUCPR for the prediction of fog and of dense fog

In this analysis, low visibility owing to fog formation occurs most often in January, December, February, and November, and least often in March. Fog and low visibility last the longest in January and the shortest in March. Compared with December and February, January has more dense fog days and dense fog hours (surface visibility ≤ 200 m). In addition, February has a similar number of foggy days (surface visibility ≤ 1000 m) but fewer days in the other intensity classes. The standard deviations (given in Figs. 4, 5) indicate that advection fog (fog in the rear sector of a WD) occurs in January and sometimes in December, whereas radiation fog, which shows a larger standard deviation, occurs in November and February. The standard deviation of fog at JPNI Airport Patna is largest in November and February, and smallest in December and January. Figure 5 clearly demonstrates how the frequency of foggy days varies by month, with November and February showing the largest fluctuation and December and January the lowest. Table 3 displays the average number of hours of low visibility/fog (surface visibility ≤ 1000 m) and dense fog (surface visibility ≤ 200 m) for the study period.

Table 3 Mean no. of hours of Fog (Surface Visibility ≤ 1000 m) and Dense Fog (Surface Visibility ≤ 200 m) events (In hours) for the JPNI Airport, Patna

4.2 Prediction of fog (surface visibility ≤ 1000 m) by using proposed ML (machine learning) approaches

We investigate the classification outcomes for the prediction of fog (surface visibility ≤ 1000 m) and give a comprehensive evaluation of how well the different ML approaches performed on these datasets, and of which method proved most effective at lead times from six hours down to one hour. Seven machine-learning approaches are used to predict fog (surface visibility ≤ 1000 m). A trial-and-error process is used to fine-tune the six base models (GBM, XGBoost, GLM, XRT, DRF, and Feed Forward Neural Network) at level 0 for optimal performance. The suggested stacked ensemble of these level-0 models then outperforms the six ML approaches (GBM, XGBoost, GLM, XRT, DRF, and Feed Forward Neural Network) at the level-1 evaluation stage. Figure 6 compares all seven ML approaches for the prediction of fog (surface visibility ≤ 1000 m) and dense fog (surface visibility ≤ 200 m) for the 02-h lead time; the stacked ensemble (SE) outperforms the other base-model approaches, including the feed-forward neural network. A comparison of the performance metrics for lead times from 6 to 1 h likewise shows that the proposed SE approaches improve as the lead time decreases (presented in Fig. 7). Figure 8 presents the statistical skill scores of the proposed stacked ensemble approaches for lead times of 6 to 1 h for fog (surface visibility ≤ 1000 m) and dense fog (surface visibility ≤ 200 m). Empirically, the AUC improves as the lead time gets shorter, with the best values ranging between 0.8721 and 0.9231 for the prediction of fog.

Fig. 7

Performance metrics of the proposed machine learning algorithms for the prediction of fog (surface visibility ≤ 1000 m) and dense fog (surface visibility ≤ 200 m) for lead times of 6 to 1 h

Fig. 8

Statistical skill scores of the proposed machine learning algorithms for the prediction of fog (surface visibility ≤ 1000 m) and dense fog (surface visibility ≤ 200 m) for lead times of 6 to 1 h

Also, statistical skill scores such as the hit rate exceed 87%, selectivity exceeds 80%, and the F1 score is high, which signifies the usability of the proposed models for fog prediction specific to Patna airport.

4.3 Prediction of dense fog (surface visibility ≤ 200 m) by using ML (machine learning) approaches

Figure 6 provides a thorough examination of the various ML (machine learning) strategies for predicting dense fog (surface visibility ≤ 200 m) on the test dataset for a lead time of 02 h. The suggested stacked ensemble approach at level 1 outperforms the six base models at level 0, just as for the algorithms outlined in Sect. 4.2, and the results improve significantly compared with those of Sect. 4.2. The stacked ensemble at level 1 also surpasses the level-0 ML approaches (GBM, XGBoost, GLM, XRT, DRF, and Feed Forward Neural Network) in terms of performance metrics (shown in Fig. 7) and statistical skill scores (shown in Fig. 8) when predicting dense fog (surface visibility ≤ 200 m). For dense fog (i.e., surface visibility ≤ 200 m), all of the metrics and statistical skill scores show considerable improvement as the prediction lead time shortens from 6 to 1 h over the test dataset. Across the six-hour to one-hour lead times, the accuracy varies between 91.2 and 96.7%. This result is important for the efficient management of one of the busiest airports of the IGP (Indo-Gangetic Plain) region and for the timely scheduling of aircraft; it means that cost-effective operations could take place on exceptionally busy days for the airlines (i.e., foggy days). Empirically, the AUC improves as the lead time gets shorter, with the best values ranging between 0.9162 and 0.9856 for the prediction of dense fog. Also, statistical skill scores such as the hit rate exceed 91.8%, selectivity exceeds 80%, and the F1 score is high (from 0.9161 for the 6-h lead time to 0.9856 for the 1-h lead time), which signifies the usability of the proposed models for fog prediction specific to Patna airport.

5 Discussion

This research investigates the potential of ML (machine learning) approaches for regional or local low-visibility (fog/dense fog) prediction in a nowcasting context (with lead times of 06 h to 01 h). The suggested ensemble ML approaches at level 1 perform significantly better than the corresponding base models at level 0. Predictions of fog (surface visibility ≤ 1000 m) achieve an F1 score between 0.90 and 0.92 (for lead times of 06 h to 01 h), while F1 scores range from 0.95 (06-h lead time) to 0.98 (01-h lead time) for the prediction of dense fog (surface visibility ≤ 200 m). This result confirms the findings of [21, 53, 54], which found that tree-based algorithms perform exceptionally well [43, 44]. Moreover, compared with other forecasting problems linked to meteorological variables such as wind speed, temperature, and even precipitation, the prediction of low visibility/fog is often more challenging, owing to the extremely local character of fog events. Despite this challenge, any advance towards techniques that better explain the variance of low-visibility events is crucial for the characterization and accurate prediction of these events and for the application of such predictive models in fog modeling. Based on the results, this experimentation with competing ML algorithms led to the following conclusions.

  • As the lead time decreases, the AUC increases; its best values vary from 0.8721 to 0.9231 for the prediction of fog (visibility ≤ 1000 m) and from 0.9162 to 0.9856 for the prediction of dense fog (visibility ≤ 200 m) over lead times of 06 h to 01 h.

  • Accordingly, our proposed approaches can be viewed as instruments that harness the power of several relevant ML techniques to develop accurate predictions of local-scale phenomena such as fog/low visibility (with intensity scale).

6 Conclusion

Even with the advancement of numerical weather prediction models and the guidance of fog and low-visibility prediction models, fog/low-visibility prediction remains challenging. Accurately predicting fog requires a deep understanding of the complicated and chaotic atmospheric processes occurring in the boundary layer over very short time and space scales. The complexity of fog prediction is also linked to the inherent limitations in understanding the micro-scale factors that contribute to the initiation, intensification, uplift, and dissipation of fog in the area of interest. At the same time, accurate fog prediction is necessary for the smooth operation of aviation services. This article presents an alternative to traditional low-visibility and fog forecasting techniques based on historical data, allowing accurate and precise prediction of fog/low visibility (with an intensity scale) at specific locations. This study aims to fill the gaps in the regional outlook and to explore a variety of options. Therefore, the current research compares various ML algorithms for supervised classification, such as GBM, DRF, GLM, XRT, XGBoost, and ANN-based learning at level 0, which model the data in different ways, with a stacked ensembling of the level-0 algorithms at level 1, for the best prediction of fog (surface visibility ≤ 1000 m) and dense fog (surface visibility ≤ 200 m). The proposed algorithm (model) optimally balances the benefits of three ML meta-algorithms: bagging, boosting, and stacking are employed to improve the model's metrics and statistical skill scores, with bagging decreasing variance, boosting decreasing bias, and stacking increasing predictive power, while the most effective prediction algorithms among the base learners are identified. A nowcasting system tailored to JPNI Airport Patna has been developed using the stacked ensembling concept discussed here. The main benefit of this method is that it generates output that the final consumers (air traffic controllers, airline managers, and so on) can use directly to determine exactly what they need.