Abstract
The go-around is a safety-critical procedure in civil aviation that is rarely executed but is essential to avoid risky landings. Analyzing the factors that trigger go-around events can aid in identifying measures that could lower their frequency. This involves circumstances that could be deemed abnormal and intrinsically harmful. The study employed the Explainable Boosting Machine (EBM), a contemporary transparent model, to predict aircraft go-arounds and interpret different influential factors. The model proposed exhibits comparable accuracy to black-box models. The study utilized pilot reports and applied SMOTE-ENN to address the imbalance problem. The EBM model was trained with treated data in conjunction with Bayesian optimization. The EBM model’s performance was evaluated using holdout data and compared to binary logistic regression and decision tree models, as well as black-box models such as adaptive boosting, random forest, and extreme gradient boosting. The EBM model exhibited superior performance compared to other models in terms of precision (83.15%), recall (79.77%), geometric mean (77.29%), and Matthews’s correlation coefficient (0.453). The EBM algorithm enables the comprehensive interpretation of individual and pairwise factor interactions in predicting aircraft go-around outcomes from both global and local perspectives. This facilitates the assessment of the impact of different factors on go-around outcomes.
Similar content being viewed by others
References
Blajev T, Curtis W (2017) Go-around decision-making and execution project. Final Report to Flight Safety Foundation
Bosschieter TM, Xu Z, Lan H, Lengerich BJ, Nori H, Sitcov K, Souter V, Caruana R (2022) Using interpretable machine learning to predict maternal and fetal outcomes. arXiv preprint arXiv:2207.05322, DOI: https://doi.org/10.48550/arXiv.2207.05322
Casner SM (2010) Why don’t pilots submit more pilot weather reports (PIREPs)? The International Journal of Aviation Psychology 20(4): 347–374, DOI: https://doi.org/10.1080/10508414.2010.487015
Causse M, Dehais F, Péran P, Sabatini U, Pastor J (2013) The effects of emotion on pilot decision-making: A neuroergonomic approach to aviation safety. Transportation Research Part C: Emerging Technologies 33:272–281, DOI: https://doi.org/10.1016/j.trc.2012.04.005
Chan PW (2012) A significant wind shear event leading to aircraft diversion at the Hong Kong international airport. Meteorological Applications 19(1):10–16, DOI: https://doi.org/10.1002/met.242
Chan PW (2022) Case study of a special event of low-level windshear and turbulence at the Hong Kong International Airport. Atmospheric Science Letters e1143, DOI: https://doi.org/10.1002/asl.1143
Chan PW, Hon KK (2016) Observation and numerical simulation of terrain-induced windshear at the Hong Kong International Airport in a planetary boundary layer without temperature inversions. Advances in Meteorology, DOI: https://doi.org/10.1155/2016/1454513
Chan PW, Hon KK (2022) Observations and numerical simulations of sea breezes at Hong Kong International Airport. Weather, DOI: https://doi.org/10.1002/wea.4282
Chen F, Peng H, Chan PW, Huang Y, Hon KK (2022) Identification and analysis of terrain-induced low-level windshear at Hong Kong International Airport based on WRF–LES combining method. Meteorology and Atmospheric Physics 134(4):60, DOI: https://doi.org/10.1007/s00703-022-00899-1
Chen F, Peng H, Chan PW, Ma X, Zeng X (2020) Assessing the risk of windshear occurrence at HKIA using rare-event logistic regression. Meteorological Applications 27(6):e1962, DOI: https://doi.org/10.1002/met.1962
Chou CS, Tien A, Bateman H (2021) October. A machine learning application for predicting and alerting missed approaches for airport management. In 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC) 1–9, IEEE, DOI: https://doi.org/10.1109/DASC52595.2021.9594418
Couronné R, Probst P, Boulesteix AL (2018) Random forest versus logistic regression: A large-scale benchmark experiment. BMC Bioinformatics 19:1–14, DOI: https://doi.org/10.1186/s12859-018-2264-5
Dai L, Liu Y, Hansen M (2021) Modeling go-around occurrence using principal component logistic regression. Transportation Research Part C: Emerging Technologies 129:103262, DOI: https://doi.org/10.1016/j.trc.2021.103262
Dao DV, Ly HB, Vu HLT, Le TT, Pham BT (2020) Investigation and optimization of the C-ANN structure in predicting the compressive strength of foamed concrete. Materials 13(5):1072, DOI: https://doi.org/10.3390/ma13051072
Dehais F, Behrend J, Peysakhovich V, Causse M, Wickens CD (2017) Pilot flying and pilot monitoring’s aircraft state awareness during go-around execution in aviation: A behavioral and eye tracking study. The International Journal of Aerospace Psychology 27(1–2):15–28, DOI: https://doi.org/10.1080/10508414.2017.1366269
Demir S, Sahin EK (2022) An investigation of feature selection methods for soil liquefaction prediction based on tree-based ensemble algorithms using AdaBoost, gradient boosting, and XGBoost. Neural Computing and Applications 1–18, DOI: https://doi.org/10.1007/s00521-022-07856-4
Donavalli B, Mattingly SP, Massidda A (2017) Impact of Weather Factors on Go-Around Frequency (No. 17-03934)
Dong S, Khattak A, Ullah I, Zhou J, Hussain A (2022) Predicting and analyzing road traffic injury severity using boosting-based ensemble learning models with SHAPley Additive exPlanations. International Journal of Environmental Research and Public Health 19(5):2925, DOI: https://doi.org/10.3390/ijerph19052925
Duan P, Soleimani H, Ghazanfari A, Dehghani M (2020) Distributed energy management in smart grids based on cloud-fog layer architecture considering PHEVs. IEEE Transactions on Industry Applications, DOI: https://doi.org/10.1109/TIA.2020.3010899
Ekanayake IU, Meddage DPP, Rathnayake U (2022) A novel approach to explain the black-box nature of machine learning in compressive strength predictions of concrete using Shapley additive explanations (SHAP). Case Studies in Construction Materials 16:e01059, DOI: https://doi.org/10.1016/j.cscm.2022.e01059
ElShawi R, Sherif Y, Al-Mallah M, Sakr S (2021) Interpretability in healthcare: A comparative study of local machine learning interpretability techniques. Computational Intelligence 37(4):1633–1650, DOI: https://doi.org/10.1109/CBMS.2019.00065
Fitriyani NL, Syafrudin M, Alfian G, Yang CK, Rhee J, Ulyah SM (2022, June) Chronic disease prediction model using integration of DBSCAN, SMOTE-ENN, and random forest. In 2022 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS) 289–294, IEEE, DOI: https://doi.org/10.1109/ICETSIS55481.2022.9888806
Frazier PI (2018) A tutorial on Bayesian optimization. arXiv preprint arXiv:1807.02811, DOI: https://doi.org/10.48550/arXiv.1807.02811
Greener JG, Kandathil SM, Moffat L, Jones DT (2022) A guide to machine learning for biologists. Nature Reviews Molecular Cell Biology 23(1):40–55, DOI: https://doi.org/10.1038/s41580-021-00407-0
Han L, Yang G, Yang X, Song X, Xu B, Li Z, Wu J (2022) An explainable XGBoost model improved by SMOTE-ENN technique for maize lodging detection based on multi-source unmanned aerial vehicle images. Computers and Electronics in Agriculture 194:106804, DOI: https://doi.org/10.1016/j.compag.2022.106804
Hon KK (2020) Predicting low-level wind shear using 200-m-resolution NWP at the Hong Kong International Airport. Journal of Applied Meteorology and Climatology 59(2):193–206, DOI: https://doi.org/10.1175/JAMC-D-19-0186.1
ICAO (2017) World airport traffic forecast 2017–2040 Airport Council International. Airport Council International: Montréal, QC, USA
Isayev O, Oses C, Toher C, Gossett E, Curtarolo S, Tropsha A (2017) Universal fragment descriptors for predicting properties of inorganic crystals. Nature Communications 8(1):15679, DOI: https://doi.org/10.1038/ncomms15679
Jayasundara S, Indika A, Herath D (2022) Interpretable student performance prediction using explainable boosting machine for multi-class classification. 2022 2nd International Conference on Advanced Research in Computing (ICARC) 391–396. IEEE, DOI: https://doi.org/10.1109/ICARC54489.2022.9753867
Jou RC, Kuo CW, Tang ML (2013) A study of job stress and turnover tendency among air traffic controllers: The mediating effects of job satisfaction. Transportation Research Part E: Logistics and Transportation Review 57:95–104, DOI: https://doi.org/10.1016/j.tre.2013.01.009
Kennedy Q, Taylor JL, Reade G, Yesavage JA (2010) Age and expertise effects in aviation decision making and flight control in a flight simulator. Aviation, Space, and Environmental Medicine 81(5):489–497, DOI: https://doi.org/10.3357/asem.2684.2010
Khattak A, Almujibah H, Elamary A, Matara CM (2022) Interpretable dynamic ensemble selection approach for the prediction of road traffic injury severity: A case study of pakistan’s national highway N-5. Sustainability 14(19):12340, DOI: https://doi.org/10.3390/su141912340
Khattak A, Chan PW, Chen F, Peng H (2022) Prediction of aircraft go-around during wind shear using the dynamic ensemble selection framework and pilot reports. Atmosphere 13(12):2104, DOI: https://doi.org/10.3390/atmos13122104
Khattak A, Chan PW, Chen F, Peng H (2023) Prediction of a pilot’s invisible foe: The severe low-level wind shear. Atmosphere 14(1): 37, DOI: https://doi.org/10.3390/atmos14010037
Kim Y, Kim Y (2022) Explainable heat-related mortality with random forest and SHapley Additive exPlanations (SHAP) models. Sustainable Cities and Society 79:103677, DOI: https://doi.org/10.1016/j.scs.2022.103677
Kumarakulasinghe NB, Blomberg T, Liu J, Leao AS, Papapetrou P (2020, July) Evaluating local interpretable model-agnostic explanations on clinical machine learning classification models. In 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS) 7–12, IEEE, DOI: https://doi.org/10.1109/CBMS49503.2020.00009
Lee O, Shun CM (2003) Observation of sea breeze interactions at and near Hong Kong International Airport. Meteorological Applications 10(1):1–9, DOI: https://doi.org/10.1017/S1350482703005012
Lin M, Zhu X, Hua T, Tang X, Tu G, Chen X (2021) Detection of ionospheric scintillation based on xgboost model improved by SMOTE-ENN technique. Remote Sensing 13(13):2577, DOI: https://doi.org/10.3390/rs13132577
Liu G, Sun B (2023) Concrete compressive strength prediction using an explainable boosting machine model. Case Studies in Construction Materials e01845, DOI: https://doi.org/10.1016/j.cscm.2023.e01845
Liu H, Zhao Y, Zaporowska A, Skaf Z (2020) A machine learning-based clustering approach to diagnose multi-component degradation of aircraft fuel systems. Neural Computing and Applications 1–17, DOI: https://doi.org/10.1007/s00521-021-06531-4
Lou Y, Caruana R, Gehrke J, Hooker G (2013) Accurate intelligible models with pairwise interactions. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 623–631, DOI: https://doi.org/10.1145/2487575.2487579
Luengo J, Fernández A, García S, Herrera F (2011) Addressing data complexity for imbalanced data sets: Analysis of SMOTE-based oversampling and evolutionary undersampling. Soft Computing 15:1909–1936, DOI: https://doi.org/10.1007/s00500-010-0625-8
Maddock S, Cormode G, Wang T, Maple C, Jha S (2022) Federated boosted decision trees with differential privacy. In Proceedings of the 2022 ACM SIGSAC Conference on Computer and Communications Security 2249–2263, DOI: https://doi.org/10.1145/3548606.3560687
Mangalathu S, Hwang SH, Jeon JS (2020) Failure mode and effects analysis of RC members based on machine-learning-based SHapley Additive exPlanations (SHAP) approach. Engineering Structures 219:110927, DOI: https://doi.org/10.1016/j.engstruct.2020.110927
McCarthy J, Serafin R, Wilson J, Evans J, Kessinger C, Mahoney III WP (2022) Addressing the microburst threat to aviation: research-to-operations success story. Bulletin of the American Meteorological Society 103(12):E2845–E2861, DOI: https://doi.org/10.1175/BAMS-D-22-0038.1
Mohril RS, Solanki BS, Kulkarni MS, Lad BK (2022) XGBoost based residual life prediction in the presence of human error in maintenance. Neural Computing and Applications 1–15, DOI: https://doi.org/10.1007/s00521-022-07216-2
Muntasir Nishat M, Faisal F, Jahan Ratul I, Al-Monsur A, Ar-Rafi AM, Nasrullah SM, Reza Md T, Khan Md RH (2022) A comprehensive investigation of the performances of different machine learning classifiers with SMOTE-ENN oversampling technique and hyperparameter optimization for imbalanced heart failure dataset. Scientific Programming 2022, 1–17, DOI: https://doi.org/10.1155/2022/3649406
Nohara Y, Matsumoto K, Soejima H, Nakashima N (2022) Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Computer Methods and Programs in Biomedicine 214:106584, DOI: https://doi.org/10.1016/j.cmpb.2021.106584
Nori H, Jenkins S, Koch P, Caruana R (2019) Interpretml: A unified framework for machine learning interpretability. arXiv preprint arXiv:1909.09223, DOI: https://doi.org/10.48550/arXiv.1909.09223
Prats X, Puig V, Quevedo J, Nejjari F (2010) Multi-objective optimisation for aircraft departure trajectories minimising noise annoyance. Transportation Research Part C: Emerging Technologies 18(6):975–989, DOI: https://doi.org/10.1016/j.trc.2010.03.001
Qu Y, Deng X, Lin S, Han F, Chang HH, Ou Y, Nie Z, Mai J, Wang X, Gao X, Wu Y, Chen J, Zhuang J, Ryan I, Liu X (2022) Using innovative machine learning methods to screen and identify predictors of congenital heart diseases. Frontiers in Cardiovascular Medicine 8:2087, DOI: https://doi.org/10.3389/fcvm.2021.797002
Rai A (2020) Explainable AI: From black box to glass box. Journal of the Academy of Marketing Science 48:137–141, DOI: https://doi.org/10.1007/s11747-019-00710-5
Santos JI, Martín Ó, Ahedo V, de Tiedra P, Galán JM (2022) Glass-box modeling for quality assessment of resistance spot welding joints in industrial applications. The International Journal of Advanced Manufacturing Technology, 1–16, DOI: https://doi.org/10.1007/s00170-022-10444-4
Shortle J, Sherry L (2013) A model for investigating the interaction between go-arounds and runway throughput. In 2013 Aviation Technology, Integration, and Operations Conference 4235, DOI: https://doi.org/10.2514/6.2013-4235
Singh NP, Goh SK, Alam S (2020) Real-time unstable approach detection using sparse variational gaussian process. In 2020 International Conference on Artificial Intelligence and Data Analytics for Air Transportation (AIDA-AT) 1–10, IEEE, DOI: https://doi.org/10.1109/AIDA-AT48540.2020.9049174
Tse SM, Chan PW, Wong WK (2014) A case study of missed approach of aircraft due to tailwind associated with thunderstorms. Meteorological Applications 21(1):50–61, DOI: https://doi.org/10.1002/met.1296
Victoria AH, Maragatham G (2021) Automatic tuning of hyperparameters using Bayesian optimization. Evolving Systems 12:217–223, DOI: https://doi.org/10.1007/s12530-020-09345-2
Wang X, Yang YQ, Liu SH, Hong XY, Sun XF, Shi JH (2020) Comparing different venous thromboembolism risk assessment machine learning models in Chinese patients. Journal of Evaluation in Clinical Practice 26(1):26–34, DOI: https://doi.org/10.1111/jep.13324
Xiaolin L, Qingyuan W, Panicker RC, Cardiff B, John D (2022) Binary ECG classification using explainable boosting machines for IoT edge devices. In 2022 29th IEEE International Conference on Electronics, Circuits and Systems (ICECS) 1–4, IEEE, DOI: https://doi.org/10.1109/ICECS202256217.2022.9970834
Zaal P, Campbell A, Schroeder JA, Shah S (2019) Validation of proposed go-around criteria under various environmental conditions. In AIAA Aviation 2019 Forum 2993, DOI: https://doi.org/10.2514/6.2019-2993
Zhang S, Khattak A, Matara CM, Hussain A, Farooq A (2022) Hybrid feature selection-based machine learning Classification system for the prediction of injury severity in single and multiple-vehicle accidents. PLoS One 17(2):e0262941, DOI: https://doi.org/10.1371/journal.pone.0262941
Zorkeflee M, Mohamed Din A, Ku-Mahamud KR (2015) Fuzzy and smote resampling technique for imbalanced data sets, 638–643
Acknowledgments
This research was funded by the Research Fund for International Young Scientists of the National Natural Science Foundation of China (Grant No. 52250410351), the National Foreign Expert Project (Grant No. QN2022133001L), and the National Natural Science Foundation of China (U1733113). We acknowledge the Hong Kong Observatory of Hong Kong International Airport for providing pilot reports for this study.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Khattak, A., Chan, Pw., Chen, F. et al. Explainable Boosting Machine for Predicting Wind Shear-Induced Aircraft Go-around based on Pilot Reports. KSCE J Civ Eng 27, 4115–4129 (2023). https://doi.org/10.1007/s12205-023-0410-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12205-023-0410-8