Abstract
Utilizing Artificial Intelligence (AI) techniques to forecast, recognize, and classify financial crisis roots are important research challenges that have attracted the interest of researchers. Moreover, the Explainable Artificial Intelligence (XAI) concept enables AI techniques to interpret the results of processing and testing complex data patterns so that humans can find efficient ways to infer and interpret the logic behind classifying complex data patterns. This paper proposes a novel XAI model to automatically recognize financial crisis roots and interprets the features selection operation. Using a benchmark dataset, the proposed XAI model utilized the pigeon optimizer to optimize the feature selection operation, and then the Gradient Boosting classifier is utilized to recognize financial crisis roots based on the obtained reduct of the most important features. The practical results showed that the short-term interest rates feature is the most important feature by which financial crisis roots can be detected. Moreover, the classification results showed that the built-in Gradient Boosting classifier in the Pigeon Inspired Optimizer (PIO) algorithm achieved training and testing accuracy of 99% and 96.7%, respectively, in recognizing financial crisis roots, which is an efficient and better performance compared to the random forest classifier.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In the new global economy, financial crisis recognition and prediction have become a huge challenge for avoiding the bankruptcy of financial institutions and companies. Therefore, the early detection of financial crisis warning alerts is a great concern for financial policymakers to reduce the likelihood of bank failure and avoid rising risks [1]. Although the financial crisis has various dimensions and depends on plenty of financial and political variables, it can result in one or more of the following patterns [2], (1) currency crisis is a situation in which the national central bank has not to have enough foreign currency to maintain the country's fixed exchange rate. This leads to a rapid and sharp decline in the value of a country's currency, which harms the economy of that country, (2) sovereign crisis is a situation in which the country cannot pay its government debt or rescue indebted banks under its national financial control without the dependence of financial third parties like the International Monetary Fund (IMF) or the European Central Bank (ECB), (3) banking crisis is a situation in which many banks in a country are suffering from serious liquidity or financial solvency problems at the same time. The bankruptcy of one bank or a set of banks propagates this failure to other banks in the financial system. Therefore, the amount of non-performing loans is rapidly increasing, most of the banking system's capital is rapidly depleting, and (4) asset price correction (or market correction) crisis is a situation in which asset prices or financial market are decreased by 10–20% or greater. This correction can continue from days to months or even longer. Although market correction may be a damaging crisis in the short period, it can be positive for adapting overvalued asset prices and granting buying chances.
The causes of the financial crisis are diverse, forked, and unpredicted. Some of them are due to false execution of financial procedures such as excessive leverage, liquidity mismatches (lending long, lending short), false assessment of taxes and subsidies, and increased borrowing by banks and investors [3]. Moreover, financial crises can occur due to sudden global events as occurred with the COVID-19 pandemic [4]. Therefore, there is no standard definition of the financial crisis; however, it can be described as a sharp and sudden disturbance in some economic balances, followed by the collapse of several financial institutions, and its effects on others. The International Monetary Fund (IMF) revealed, in 2022, the global economic outlook and said that it will be bleaker than expected last year. While he referred to the significant deterioration in purchasing managers' surveys in recent months, as they indicated the weakness of most of the G-20 economies. Moreover, IMF wrote in a blog prepared for the G20 leaders' summit in Indonesia that the latest indicators "confirm that the outlook is bleaker", especially in Europe. He added that recent purchasing managers' indices measuring manufacturing and services activity show weakness in most of the G20 economies, with economic activity expected to contract amid high inflation. Moreover, it is expected that the exacerbation of the energy crisis in Europe will inflict severe damage on growth and will raise inflation, which, if it continues to rise, may lead to greater increases in interest rate policy and further tightening of global financial conditions. In addition, IMF explained that this, in turn, poses increased sovereign debt crisis risks for weak economies. He also added that the multiplicity of dangerous weather events will harm growth around the world [2, 3].
There has been an increasing interest in using computational intelligence and Artificial Intelligence (AI) in financial applications such as asset pricing and derivatives, stock market predictions, algorithmic trading, and credit risk assessment [5, 6]. Specifically, machine learning and deep learning technologies started getting many researchers' attention to develop a smarter model for recognizing, predicting, and classifying financial crisis roots [7] than classical methods that depend on complex calculations as in statistical and operations research approaches [8]. Compared to classical approaches, AI-based methods have two major strengths in recognizing financial crisis roots. The AI-based method can process and learn from unbalanced and random data. This means that the statistical distribution of the financial data is not necessary for AI-based methods to recognize financial crisis roots. Another strength of AI-based methods is that they can be categorized as nonlinear approaches. This advantage makes AI-based methods to be more accurate when testing complex financial data patterns [1]. In addition, a novel concept has emerged. It is associated with AI that enables machine learning and deep learning models to interpret the results of processing and testing complex data patterns so that humans can not only find solutions to complex data patterns problems but also understand and interpret the logic of the obtained solutions. This new concept is called Explainable AI (XAI), or Interpretable AI [9,10,11]. XAI can be a magic solution to the financial transparency challenge in financial services. The recent AI models can enhance many financial services' accuracy, speed, and effectiveness, such as predicting liquidity balances, assigning credit scores, and optimizing investment portfolios. However, the stakeholders (e.g., banks, investors, employees, customers, etc.) must understand the logic and reasoning behind applying these AI models to avoid financial bias. Therefore, XAI can answer this challenge [12]. Although XAI standards are still being built and the market for this technology is developing, current trends imply that Explainable AI will be a significant need for incoming economic AI applications. According to the Gartner website, by 2025, 30% of large companies and government contracts for purchasing AI applications will require explainable and ethical AI [13].
Moreover, several startup leaders in Explainable AI started to raise their investment in XAI dependence. For instance, Fiddler Labs recently allocated $14 million to assist in interpreting disparities in machine learning approaches and assure compliance with industry laws based on explainable AI techniques [14]. These reports indicate that financial organizations will have to provide evidence for all financial decisions to regulators and their customers, particularly if the results of an AI model are inaccurate. Consequently, XAI in institutions' financial work will prepare the way for the future of the global economy.
This paper proposes an Explainable Artificial Intelligence model to recognize the financial crisis roots in financial datasets. The main contribution of this study is:
-
A novel XAI model is proposed to recognize financial crisis roots based on Pigeon Optimization (PIO) and Gradient Boosting classifier.
-
The PIO algorithm is applied to optimize the financial crisis features selection step to obtain the best reduct of features that have the most impact on financial crisis recognition
-
Two learning models, Gradient Boosting (GB), and (Random Forest) are applied and compared with the proposed XAI model to classify and recognize financial crisis roots
-
The proposed model has been tested and validated in a benchmark financial crisis dataset. The experimental results of the proposed XAI model achieved training and testing accuracy of 99% and 96.7%, respectively, in recognizing financial crisis roots.
The remaining sections of this paper are organized as follows. Section 2 outlines the related work. Section 3 presents background about two major algorithms. Section 4 discusses the proposed XAI model for recognizing financial crisis roots. Section 5 presents the experimental results. Section 6 discusses the obtained findings. Section 7 concludes this work.
2 Related Work
The literature on Artificial Intelligence to predict financial crisis roots has highlighted several techniques and methodologies [1, 15,16,17]. Over the past decade, most research has emphasized machine learning techniques to solve financial crisis forecasting problems.
Aristeidis et al. [18] proposed a machine learning-based monitoring system model called "Early Warning Systems” (EWS) to forecast financial crises based on the structure and analysis of financial networks. The proposed models achieved an accuracy of 98% in predicting the financial crisis.
Sotirios et al. [19] studied a stock market crisis prediction based on statistical and deep learning approaches. The authors investigated the efficiency of six models, Support Vector Machines (SVM), Extreme Gradient Boosting, Classification Trees, Random Forests, Neural Networks, and Deep Neural Networks, to predict stock market crisis cases. The experimental results proved that there is clear evidence of cross-contagion and interdependence impacts among stock, bond, and currency markets. Moreover, the results proved that the Deep Neural Networks approach optimizes sorting accuracy while providing a powerful method to build a systemic, efficient, and risk-sensitive early warning routine better than the currently established ones.
An additional stock crisis prediction model has been proposed and applied in the Indian stock market [20]. The authors applied a novel prediction model for stock crisis based on a practical comparison between Deep Neural Network (DNN) regression method and Extreme Gradient Boosting (XGBoost) using a proposed Hybrid Feature Selection (HFS) technique. The experimental results proved the superiority of XGBoost performance regarding Mean Absolute Error (MAE), Mean Squared Error (MSE), and Root Mean Square Error (RMSE) in predicting Indian stock market crises.
EeroTölö in [21] investigated the efficiency of selected traditional neural network models against the Gated Recurrent Unit (RNN-GRU) and Long-Short Term Memory (RNN-LSTM) for predicting financial crisis roots. The selected models have been validated on the Jórda–Schularick–Taylor dataset, which involves the crisis dates and annual macroeconomic series of 17 countries over the period 1870−2016. The experimental results proved that (RNN-LSTM) and (RNN-GRU) are efficient models for forecasting financial crises compared to the benchmark logistic model and basic neural nets.
Godwin et al. [22] investigated the efficiency of using the Light Gradient Boosting Machine (LightGBM) model for predicting credit risks and crises. The experimental results clarified that LightGBM prediction performance outperformed other models such as SVM and logistic regression.
Zukerman Rustam and Glori Stephani Saragih [23] investigated the utilization of the Random Forest (RF) model for predicting the financial crisis of Turkish banks. The simulation results clarified that RF achieved a competitive result in forecasting bank crises compared with other machine learning algorithms in the literature.
On the other hand, forecasting financial intermediaries' financial crises is very important. Stacey and Vlado [24] investigated a novel solution to this problem using the Convolutional Neural Networks model. The experimental results clarified classification accuracy, 88.24% to financial intermediaries as distressed or not distressed.
Financial crises lead to bank failures and insolvency. Youness et al. [25] introduced an empirical study using a Deep Neural Network approach for bank failure forecasting. The proposed model has been validated on a dataset collected over 14 years from 2004 to 2018. It consists of 1100 FDIC-insured US commercial banks. The experimental results clarified the outperformance of the proposed Deep Neural Network model compared with other traditional machine learning models and statistical techniques in the Matthews Correlation Coefficient and F1-score.
Hong Hanh Le and Jean-Laurent Vivian [26] conducted a comparison study between classical statistical models (i.e., Logistic Regression and Discriminant Analysis) and machine learning models (Support Vector Machines, Artificial Neural Network, and k-Nearest Neighbors) in forecasting banks' insolvency cases. The authors verified the comparison study. A dataset consists of 3000 US banks. The practical result proved the efficiency of Artificial Neural Networks and k-nearest neighbor methods in predicting bank failure compared to the SVM and statistical approaches.
Nurul Alam et al. [27] introduced an efficient deep learning algorithm using a panel data structure to forecast corporate bankruptcy. The authors compared the proposed deep learning model with the known discrete hazard model (it has been widely used in panel data applications through finance literature). They validated the proposed model's performance on a dataset of 641,667 firm-month observations of North American listed companies between 2001 and 2018. The practical results clarified the outperformance of the proposed deep learning model with a prediction accuracy of 93.71%, compared with the discrete hazard model, which achieved low accuracy of 86.95%.
Al-Haschimi et al. [28] proposed a machine learning technique for measuring the overall financial risk in China based on textual data. Latent Dirichlet Allocation (LDA) algorithm has been utilized to identify major episodes of financial risks in China based on the textual data from many newspaper articles. A Structural Vector Autoregressive (SVAR) model, is then used to quantify the impact of rising financial risk on the Chinese and the global economy.
Bitetto et al. [29] used multiple versions of the principal component analysis (PCA) approach to develop a novel index that can be used to measure the soundness of a country’s financial system around the world based on machine learning. Testing results have shown that the proposed index makes robust forecasting about the soundness of financial systems for 119 countries, with very good results.
Medianovskyi et al. [30] proposed Explainable Artificial Intelligence (XAI) model for predicting financial disasters in Small and Medium Enterprises (SMEs). The authors applied Gradient Boosting (XGBoost and Catboost), Random Forests, Logistic Regression, and Artificial Neural Networks as regression models, then used Shapley’s Additive Explanations (SHAP) framework for modeling result interpretation and validation of financial crisis prediction. The experimental results have shown that the most accurate way to evaluate financial crisis for SME enterprises is by utilizing the Catboost modeling technique. Moreover, it was specified that SMEs that are younger than 10 years old and have a cash ratio below 20% are significantly more likely to face a financial crisis. The result also finds that ‘hazardous’ feature combinations have a higher (or lower) impact on financial crisis occurrence.
Swarm optimization approaches such as Ant Colony Optimization (ACO) [31], Particle Swarm Optimization [32], Grey Wolf Optimization (IGWO), and Fuzzy Neural Classifier (FNC) [33] can be used also to develop smart systems able to accurately predict the financial crisis and bankruptcy. Although many appreciated efforts have been paid to solve financial crisis prediction issues, few works have used the concept of Explainable Artificial Intelligence (XAI) in predicting financial crises. Moreover, more enhancement in prediction accuracy is required while applying novel machine learning models. These two challenges are what we investigate in this study.
3 Background
This section discusses two major techniques in which the proposed system is built, the Pigeon Swarm Optimizer, and the SHAP algorithm. The next two subsections discuss the purpose, and methodology of each algorithm.
3.1 Pigeon Swarm Optimizer
In 2014, Duan developed a new swarm intelligence optimization technique known as Pigeon Inspired Optimization (PIO). Numerous optimization issues have been effectively solved using population-based swarm intelligence systems. All bio-inspired optimization algorithms attempt to imitate natural ecosystem systems, which have substantially enhanced the practicality of modern optimization approaches and provided practical solutions to those complex combinatorial optimization problems [34]. In this paper, we will adapt the new PIO algorithm proposed by Duan and Qiao [35] that mimics the real navigation process of pigeons. The algorithm contains two main operators as defined as follows.
-
(1)
Map and compass operator: Pigeons form a map in their brains when they sense the field of the Earth using magnetic reception and look at the sun's height as a compass to adjust direction as they depend less on magnetic particles and the sun when they fly to their final destination.
-
(2)
Landmark operator: Pigeons rely on their nearby landmarks when they fly near their destination as they fly directly to the destination. If they are familiar with the landmark, they will fly directly to the destination while following the pigeon that knows the destination if they are far from the destination and unfamiliar. The number of pigeons was halved in each iteration, the first half of the population was categorized as the current population, the other half of the poor were abandoned, and a new population center was identified.
The main steps of PIO are described in Fig. 1. The first step is used to initialize the parameters of the PIO algorithm such as the initial random set of pigeons, compass operator R, the solution dimension space, number of iterations, and size of the population. Next, randomly initialize the position \({\mathrm{X}}_{\mathrm{i}}(\mathrm{t})\) for each pigeon. At the \(t\) iteration, the machine learning model is trained and tested using the selected features. Moreover, the model is evaluated using the fitness function. Next, the following changes are implemented to the position \({\mathrm{X}}_{\mathrm{i}}(\mathrm{t})\) and velocity \({\mathrm{V}}_{\mathrm{i}}\left(\mathrm{t}\right)\) of each pigeon in the group as depicted in Eqs. 1 and 2:
where \(R\) is the map factor and is the random number between 0 and 1, \(\mathrm{rand}()\) represents the uniform random number, and \({\mathrm{X}}_{\mathrm{g}}\) indicates that the optimal position of the whole group of pigeons at t time is attained by selecting the individuals with the maximum fitness value after comparing the location of the entire flock, which is analogous to the direction indicated by the compass. Following a predetermined number of iterations, the \({\mathrm{X}}_{\mathrm{i}}\) is then transferred to the landmark operator to continue the iteration process.
In the landmark operator, the pigeons are arranged based on their fitness value. The \({N}_{landmark}\left(t\right)\) represents the pigeon’s number at iteration t as shown in Eq. 3. During each iteration, half of the pigeons are discarded since they are located too far from the destination, the center of the left pigeons and the pigeons that are closer to the destination can fly there faster. \({X}_{c}\left(t\right)\) represents the center location of the pigeon and is defined as in Eq. 4.
where \(fitness\left({X}_{i}\left(t\right)\right)\) is the fitness value of each pigeon, \(fitness\left({X}_{i}\left(t\right)\right)\) is the objective function value corresponding to the given pigeon's position, and \({N}_{landmark}\left(t\right)\) is the current reduced population. The new position is defined as in Eq. 5. Finally, pigeons update their positions using the best center position of each iteration. Through these two parts of updates, pigeons will soon find the global best position in history.
3.2 Shapley Additive Explanations (SHAP) Algorithm
The National Institute of Standards (NIST) has proposed four principles to define Explainable Artificial Intelligence (XAI): (1) an AI system should supply proof, or reasoning for each of its outputs; (2) the explanations produced by an AI system should be comprehensible to its users; (3) the explanations should accurately depict the process utilized by the AI system to obtain the output; (4) the AI system should only function within the parameters for which it was designed, and should not provide an output when its confidence in the result is insufficient.
The SHAP values provide a high level of interpretability for a model. The SHAP is a Python-based library developed by Lundberg and Lee [36] to explain an ML model's results by evaluating the impact of each feature on the model's output. This technique assesses the relative importance of a particular local feature using the Shapley values [37, 38] from game theory. The SHAP values provide two great advantages (1) global interpretability: the SHAP values can show how much each predictor contributes to the target variable, either positively or negatively. This is similar to the plot of variable significance, but it can illustrate the positive or negative link between each variable and the target and (2) local interpretability; the SHAP values for each observation are unique. However, traditional variable significance algorithms do not provide each example's findings. Local interpretability can be used to compare the effects of different factors [39, 40]. The Shapley values (\({\mathrm{\varnothing }}_{\mathrm{i}}\)) for a particular feature i can be calculated by taking the average of the marginal contribution of feature i computed for all subsets S of N, excluding feature i [41], as shown in Eq. 7.
where \(S\) is a subset of features, f is the prediction model, \(N\) is the set of all features, and \(n\) is the total number of features. For a single observation, \(\mathrm{x}\), a linear function \(\mathrm{g}\) is used to explain the output of the prediction model and can be calculated from Eq. 8.
where \(\mathrm{x}\) is the instance being explained, \({\mathrm{x}}^{*}\) represents the input being simplified, \(\mathrm{M}\) is the number of simplified input attributes, \({\varnothing }_{0}\) is the initial value when none of the inputs are present.
4 The Proposed Financial Crisis Roots Recognition System
The proposed Explainable AI system comprises three main phases: (1) pre-processing phase, (2) feature selection based on the Pigeon Swarm Optimization algorithm, and XAI-based interpretation phase, and (3) financial crisis roots recognition based on machine learning and XAI phase. Figure 2 depicts the proposed system's overall architecture. In the next subsections, each of the three phases, including the actions required and the characteristics and features of each phase are explained in more detail.
4.1 The Pre-processing Phase
In machine learning-based problems, one of the essential stages in developing classification algorithms in data science is balancing the data. Without balancing train data, classification models will generally perform ineffectively. One of the common quick, easy, and effective ways to accomplish this important task is SMOTE, which stands for "Synthetic Minority Oversampling Technique." It creates new data based on the implications of old data. Instead of removing or copying existing data, new rows will be produced and labeled according to the original data. Table 1 shows the number of samples in both two classes before and after applying the SMOTE approach.
4.2 Feature Selection Based on the PIO Phase
The feature selection method reduces the number of columns by selecting some sets from the original data and removing inappropriate, ineffective, and repetitive variables. Selecting features is one of the complex tasks of a large data set. It reduces the number of features by picking particular subsets of the original variables. When working with a large dataset, one of the most challenging problems is determining which features are the most important. It has been shown that covariate selection effectively removes inappropriate and redundant features. Also, it has the potential to minimize the computation time and increase the classifier performance. Feature selection using the metaheuristic algorithm has been successfully applied to financial forecastings such as Particle Swarm Optimization (PSO) and Quantum-behaved PSO.
The Pigeon Inspired Optimization algorithm normally relies on a fitness function to perform feature selection and measure the validity of solutions in a given optimization problem. The used fitness function depends on the total number of features (TF), the performance of the classification model (accuracy), the reduct length, and three corresponding weights \({w}_{1}\), \({w}_{2},\) and \({w}_{3}\) to the significance of classification performance, the sum of the three weights equals 1. Equation 9 defines the fitness function, while Eq. 10 defines the reduct weight dependent on the importance of the feature within the reduct. In this paper, we are setting \({w}_{1}=0.70\), \({w}_{2}=0.22\), and \({w}_{3}=0.08.\)
Figure 3 illustrates the process of optimized feature selection with their explanation based on XAI interpretation. The steps for feature selection based on the PIO algorithm are: (1) Initialize parameters and define the fitness function. (2) Evaluate the fitness value of all Pigeons. (3) Each pigeon's velocity and position are continuously updated. (4) The global best position is modified with the pigeon's new position having the lowest fitness function value. (5) The stop condition is the last step in the PIO cycle. The parameters are tuned to achieve a higher level of accuracy with a lower error rate. (6) The best reducts are sorted and the top ranks are selected. (7) Classification model and XAI are used to provide interpretation for each reduct generated. (8) Finally, the proposed model produced the best reduct.
4.3 Financial Crisis Roots Recognition Based on the Machine Learning Phase
In this phase, three different machine learning models, namely Support Vector Machine (SVM), Random Forest (RF), and Gradient Boosting (GB), are used and compared to identify the roots of the financial crisis. These models are trained without utilizing the SMOTE approach, and their performance is measured using the confusion matrix and the relative performance metrics such as precision, F1-score, and accuracy. Similarly, the three models are trained and tested after applying the SMOTE approach with tenfold cross-validation to prevent overfitting and unbalanced data problems. Finally, the SHAP approach is applied as an XAI model to explain the results of the applied machine learning algorithms in recognizing financial crisis roots.
5 Experimental Results
This section evaluates the effectiveness of the proposed Explainable AI model in recognizing financial crisis roots based on the PIO algorithm using a benchmark dataset [42, 43]. The design of the proposed model is developed in Python. Whereas, testing the system's performance is carried out on a Google collab cloud equipped with a CPU running at 2.6 GHz and 32 GB of RAM. We set the number of initial values related to feature selection and optimization operations. For instance, the feature threshold is 0.1. The number of iterations in PIO is 100, and the number of pigeons in the Pigeon Swarm Algorithm equals half the number of features. The halting condition occurs when the number of iterations equals the maximum value specified previously [44, 45].
The performance of the proposed model was evaluated using accuracy, precision, recall and F1-score, as outlined in Eqs. 11–14.
where TP, TN, FN, and FP are truly positive, true negative, false negative, and false positive variables.
5.1 Dataset Description
The used dataset is the JSTdatasetR3 dataset [42, 43]. This dataset was generated with considerable financial support from organizations such as the Institute for New Economic Thinking, the German Federal Ministry of Education and Research, the Volkswagen Foundation, and the European Research Council [42]. Table 2 provides a summary of this data. The meaning of attributes in the dataset is shown in Table 3. The parameters considered in the dataset collection for qualitative analysis were collected from the characteristics of industrial risk, competitiveness, credibility, financial flexibility, management risk, and operating risk.
5.2 Feature Selection Results
The used dataset has 29 features of financial crisis for 147 samples collected from 17 countries. For solving the unbalancing problem of the used dataset, SMOTE algorithm has been applied, then 80% of the samples have been used for training and 20% have been used for testing. Table 4 shows the number of samples in both of the two classes (i.e., No-Financial Crisis, and Financial Crisis) before and after applying the SMOTE approach.
Table 5 presents the feature selection results of the suggested PIO method on the financial crisis dataset. Moreover, the relative importance of the most significant features is depicted in Fig. 4. F_13 has the highest importance. Moreover, the optimization results clarified that the reduct that consists of these features: { F_0, F_1, F_2, F_3, F_5, F_6, F_7, F_8, F_9, F_12, F_13, F_14, F_16, F_18, F_19, F_21, F_23, and F_24} has a high stability value of 0.936. This means that they are the most significant features that can accurately recognize financial crises. Moreover, the classification results based on the built-in Gradient Boosting classifier in the PIO algorithm using the mentioned reduct of features are depicted in the confusion matrix in Fig. 5. Table 6 summarizes training and testing results of the performance of the proposed PIO model as well as the weight of the selected reduct selected from the financial crisis dataset, which is the most effective in recognizing financial crisis roots.
For interpreting and understanding the impact of each feature in the set of optimal features [0, 1, 2, 3, 5, 6, 7, 8, 9, 12, 13, 14, 16, 18, 19, 21, 23, 24] on the financial crisis prediction results, an Explainable AI (XAI) technique called SHAP is used to plot the interpretations of those features. SHAP is a powerful mathematical tool for interpreting the output of machine learning algorithms. It is based on the principles of game theory and allows for the estimation of the contribution of each feature to the prediction results generated by a model. SHAP is a valuable tool for understanding how a model is making its predictions and can be used to explain the output of any machine learning algorithm. SHAP can identify the most important features and their influence on the model prediction. Figure 6 depicts the XAI results using the SHAP function. The horizontal axis indicates the feature's influence on prediction (low or high). The color represents the feature's significance (red means high significance, and blue low significance). It is clear from Fig. 6 that feature F13 (i.e., short-term interest rate) has the most influence on financial crisis prediction results.
5.3 Crisis Roots Recognition and Classification Results
The availability of consistent financial data that has a sufficient history to allow for standard analysis is frequently lacking. Therefore, monitoring the financial risk that is present in the global economy seems to be a difficult challenge. The "shadow banking" industry is growing, and technological firms are also offering payment and credit services, which means that both the source of risk and the rules governing finance and financial reporting are rapidly evolving.
Forecasting financial crises are important for financial firms, which attempt to reduce possible losses by estimating prospective risks and avoiding new credit applications when the default risk exceeds a predetermined acceptance level. Forecasting financial crises can help prevent potential losses. This method is also known as a "credit default classification procedure" because it determines whether a consumer is "non-default" or "default" when he pays off the loan. In other words, it classifies the customer's credit status. The precision of the model that has been proposed is necessary to evaluate the productiveness and profitability of a financial company.
In this work, we address the challenge of evaluating the general healthiness of financial institutions, thereby contributing to the identification of the roots of the financial crisis. The model trained and performed with the PIO Optimization Algorithm feature selection had been successful, with the objectives of minimization of features and maximization of performance. The suggested PIO method has two stages: the first is the selection of the best reduct, and the second is the classification of financial crisis data.
After obtaining the optimal reduct of features that have the most influence on financial crisis prediction, the Gradient Boosting classifier is compared with the random forest (RF) classifier in classifying and recognizing financial crisis roots to validate its efficiency. Table 7 summarizes the comparison results of the performance metrics of Gradient Boosting, and Random Forest (RF), on the dataset after applying the feature selection step. The Gradient Boosting model achieved better performance in recognizing financial crisis roots than Random Forest. This result is confirmed through the confusion matrix results of both of the two applied classifiers, the Gradient Boosting, and Random Forest as depicted in Fig. 7.
6 Discussion
Although many works have been proposed to solve financial crisis prediction issues [1, 15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33], very little work has used the concept of Explainable Artificial Intelligence (XAI) in predicting financial crises. The first question in this study sought to determine the most important features which have a big influence in predicting a financial crisis. To answer this question, the Pigeon Swarm Optimizer (PIO) algorithm has been applied to the used dataset [42] to optimize the feature selection operation. The optimization results clarified that the highlighted features in Table 8 { F_0, F_1, F_2, F_3, F_5, F_6, F_7, F_8, F_9, F_12, F_13, F_14, F_16, F_18, F_19, F_21, F_23, and F_24} are the most significant features that can accurately recognize financial crisis roots according to ten rounds of optimization using ten fitness values.
Moreover, the optimization results clarified that the short-term interest rates feature (i.e., F_13) is the most important feature by which financial crisis roots can be detected (as depicted previously in Fig. 4). Another important finding was that the classification results based on the built-in Gradient Boosting classifier in the PIO algorithm achieved training and testing accuracy of 99% and 96.7%, respectively, in recognizing financial crisis roots using the reduct of features: { F_0, F_1, F_2, F_3, F_5, F_6, F_7, F_8, F_9, F_12, F_13, F_14, F_16, F_18, F_19, F_21, F_23, and F_24}.
On the question of interpreting the features influence in classification results, Fig. 6 presented the influence interpretations of the optimized features on financial crisis classification. The used XAI model showed that the short-term interest rates feature (i.e., F_13) is the most important feature by which financial crisis roots can be detected (as depicted previously in Fig. 6).
To validate the efficiency of the proposed XAI model in recognizing financial crisis roots, the Random Forest (RF) classifier has been applied using the same reduct of optimized features. The obtained results confirmed the superiority of the proposed XAI model in recognizing financial crisis roots based on PIO, and Gradient Boosting algorithms as depicted in Fig. 8.
Table 9 summarizes the comparison results between the state-of-art models and the proposed XAI model. The classification performances metrics ( e.g., accuracy, precision, recall, and F1-score) clarify that the proposed XAI model has the best performance in recognizing and interpreting financial crisis classification results compared with the state-of-art models, NN [46], LASSO-CART [47], and Decision Trees [48], Extreme Gradient Boosting [49] models.
7 Conclusion
The purpose of the current study was to investigate the efficiency of a proposed XAI model to recognize financial crisis roots based on Pigeon Optimization and Gradient Boosting classifier. The investigation of feature selection optimization has shown that the short-term interest rates feature is the most important feature by which financial crisis roots can be detected. On the other hand, the following features ids: [0, 1, 2, 3, 5, 6, 7, 8, 9, 12, 13, 14, 16, 18, 19, 21, 23, and 24] are the most reduct by which financial crisis roots can be detected. Another important finding was the high value of training and testing accuracy in recognizing financial crisis roots. The classification results showed that the built-in Gradient Boosting classifier in the PIO algorithm achieved training and testing accuracy of 99% and 96.7%, respectively, in recognizing financial crisis roots, which is an efficient and better performance compared to the Random Forest classifier. A further study with more focus on other features that can influence financial crisis roots is needed. In addition, other optimizer techniques can be used to perform better efficient feature selection optimization than the PIO algorithm. Moreover, deep learning algorithms using various designs of neural networks can be used to enhance the classification performance in recognizing financial crisis roots. In addition, predicting banks' bankruptcy is also an important challenge for future research.
Data availability
The supporting data are the JSTdatasetR3 dataset which is available in the ref [36].
Change history
02 April 2024
This article has been retracted. Please see the Retraction Notice for more detail: https://doi.org/10.1007/s44196-024-00494-9
References
Demyanyk, Y., Hasan, I.: Financial crises and bank failures: A review of prediction methods. Omega 38(5), 315–324 (2010)
Lo Duca, M., Koban, A., Basten, M., Bengtsson, E., Klaus, B., Kusmierczyk, P., Lang, J.H., Detken, C., Peltonen, T.A.: A new database for financial crises in European countries: ECB/ESRB EU crises database. ECB occasional paper. 2017 Jul 31(194).
McKibbin, W.J., Stoeckel, A.: The global financial crisis: Causes and consequences. Asian Econ. Pap. 9(1), 54–86 (2010)
Chen, H.C., Yeh, C.W.: The global financial crisis and COVID-19: Industrial reactions. Financ. Res. Lett. 1(42), 101940 (2021)
Cavalcante, R.C., Brasileiro, R.C., Souza, V.L., Nobrega, J.P., Oliveira, A.L.: Computational intelligence and financial markets: A survey and future directions. Expert Syst. Appl. 15(55), 194–211 (2016)
Kumar, G., Jain, S., Singh, U.P.: Stock market forecasting using computational intelligence: A survey. Arch. Comput. Methods Eng. 28(3), 1069–1101 (2021)
Buchanan, B.G., Wright, D.: The impact of machine learning on UK financial services. Oxf. Rev. Econ. Policy 37(3), 537–563 (2021)
Birge, J.R., Linetsky, V.: Handbooks in operations research and management science: Financial engineering. Elsevier; 2007 Nov 16.
Samek, W., Müller, K.R.: Towards explainable artificial intelligence. InExplainable AI: interpreting, explaining and visualizing deep learning 2019 (pp. 5–22). Springer, Cham.
Došilović, F.K., Brčić, M., Hlupić, N.: Explainable artificial intelligence: a survey. In2018 41st International convention on information and communication technology, electronics and microelectronics (MIPRO) 2018 May 21 (pp. 0210–0215). IEEE.
Arrieta, A.B., Díaz-Rodríguez, N., Del Ser, J., Bennett, A., Tabik, S., Barbado, A., García, S., Gil-López, S., Molina, D., Benjamins, R., Chatila, R.: Explainable Artificial Intelligence (XAI): concepts, taxonomies, opportunities, and challenges toward responsible AI. Inf. Fusion. 1(58), 82–115 (2020)
Kute, D.V., Pradhan, B., Shukla, N., Alamri, A.: Deep learning and explainable artificial intelligence techniques applied for detecting money laundering–a critical review. IEEE Access. 2021 Jun 4.
Kasey, P.: Gartner Top 10 Strategic Technology Trends For 2020, [online], available at: https://www.gartner.com/smarterwithgartner/gartner-top-10-strategic-technology-trends-for-2020 (accessed 28/2/2022)
Fiddler, Fiddler Secures Strategic Investment from Amazon Alexa Fund to Accelerate AI Explainability, [online], available: https://www.fiddler.ai/press-releases/fiddler-secures-strategic-investment-from-amazon-alexa-fund-to-accelerate-ai-explainability (accessed 28/2/2022)
Liu, L.X., Liu, S., Sathye, M.: Predicting bank failures: a synthesis of literature and directions for future research. J. Risk Finan. Manag. 14(10), 474 (2021)
Lavagna, G., Patterson, H., Razmadze, R.: The wisdom of the crowd: using ensemble machine learning techniques as an early warning indicator for systemic banking crises. 2021, Master Thesis, [online], available: https://repositori.upf.edu/handle/10230/49160 (accessed 12/6/2022)
Vadlamudi, S.: The impacts of machine learning in financial crisis prediction. Asian Bus. Rev. 10(3), 171–176 (2020)
Samitas, A., Kampouris, E., Kenourgios, D.: Machine learning as an early warning system to predict the financial crisis. Int. Rev. Financ. Anal. 1(71), 101507 (2020)
Chatzis, S.P., Siakoulis, V., Petropoulos, A., Stavroulakis, E., Vlachogiannakis, N.: Forecasting stock market crisis events using deep and statistical machine learning techniques. Expert Syst. Appl. 1(112), 353–371 (2018)
Naik, N., Mohan, B.R.: Novel stock crisis prediction technique—a study on indian stock market. IEEE Access. 14(9), 86230–86242 (2021)
Tölö, E.: Predicting systemic financial crises with recurrent neural networks. J. Financ. Stab. 1(49), 100746 (2020)
Ponsam, J.G., Gracia, S.J., Geetha, G., Karpaselvi, S., Nimala, K.: Credit risk analysis using LightGBM and a comparative study of popular algorithms. In2021 4th International Conference on Computing and Communications Technologies (ICCCT) 2021 Dec 16 (pp. 634–641). IEEE.
Rustam, Z., Saragih, G.S.: Predicting bank financial failures using random forest. In2018 International Workshop on Big Data and Information Security (IWBIS) 2018 May 12 (pp. 81–86). IEEE.
Taylor, S., Keselj, V.: Predicting the distress of financial intermediaries using convolutional neural networks. In2021 IEEE 23rd Conference on Business Informatics (CBI) 2021 Sep 1 (Vol. 2, pp. 71–77). IEEE.
Abakarim, Y., Lahby, M., Attioui, A.: Bank failure prediction: A deep learning approach. In Proceedings of the 13th International Conference on Intelligent Systems: Theories and Applications 2020 Sep 23 (pp. 1–7).
Le, H.H., Viviani, J.L.: Predicting bank failure: an improvement by implementing a machine-learning approach to classical financial ratios. Res. Int. Bus. Financ. 1(44), 16–25 (2018)
Alam, N., Gao, J., Jones, S.: Corporate failure prediction: an evaluation of deep learning vs discrete hazard models. J. Int. Finan. Markets. Inst. Money 1(75), 101455 (2021)
Al-Haschimi, A., Apostolou, A., Azqueta-Gavaldon, A., Ricci, M.: Using machine learning to measure financial risk in China. SSRN Electron. J. (2023). https://doi.org/10.2139/ssrn.4338206
Bitetto, A., Cerchiello, P., Mertzanis, C.: Measuring financial soundness around the world: A machine learning approach. Int. Rev. Finan. Anal. 85, 102451 (2023). https://doi.org/10.1016/j.irfa.2022.102451
Medianovskyi, K., Malakauskas, A., Lakstutiene, A., Yahia, S.B.: Interpretable machine learning for SME financial distress prediction. Lecture notes in networks and systems, pp. 454–464, (2023). doi: https://doi.org/10.1007/978-3-031-25344-7_42.
Uthayakumar, J., Metawa, N., Shankar, K., Lakshmanaprabu, S.K.: Financial crisis prediction model using ant colony optimization. Int. J. Inf. Manag. 50, 538–556 (2020). https://doi.org/10.1016/j.ijinfomgt.2018.12.001
Maryam, M., Anggoro, D.A., Tika, M.F., Kusumawati, F.C.: An intelligent hybrid model using artificial neural networks and particle swarm optimization technique for financial crisis prediction. Pak. J. Stat. Oper. Res. 2, 1015–1025 (2022). https://doi.org/10.18187/pjsor.v18i4.3927
Sankhwar, S., Gupta, D., Ramya, K.C., Sheeba Rani, S., Shankar, K., Lakshmanaprabu, S.K.: Improved grey wolf optimization-based feature subset selection with fuzzy neural classifier for financial crisis prediction. Soft Comput. 24(1), 101–110 (2019). https://doi.org/10.1007/s00500-019-04323-6
Li, C., Duan, H.: Target detection approach for UAVs via improved pigeon-inspired optimization and edge potential function. Aerosp. Sci. Technol. 39, 352–360 (2014)
Duan, H., Qiao, P.: Pigeon-inspired optimization: a new swarm intelligence optimizer for air robot path planning. Int. J. Intell. Comput. Cybern. 4, 24–37 (2014)
Ahmad, B., Hamed, M.P., Mohsen, A.: Financial forecasting using ANFIS networks with quantum-behaved particle swarm optimization. Exp. Syst. Appl. 41(14), 6235–6250 (2014)
Maratkhan, A., Ilyassov, I., Aitzhanov, M., Demirci, M.F., Ozbayoglu, M.: Financial forecasting using deep learning with an optimized trading strategy. IEEE Congr. Evol. Comput. (CEC) 2019, 838–844 (2019). https://doi.org/10.1109/CEC.2019.8789932
Kim, Y., Kim, Y.: Explainable heat-related mortality with random forest and SHapley Additive exPlanations (SHAP) models. Sustain. Cities Soc. 79, 103677 (2022)
Lundberg, S., Lee, S.I.: A unified approach to interpreting model predictions. In: Luxburg, U., Guyon, I., Benigo, S., Wallach, H., Fergus R. (eds.) Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS). Curran Associates, NY, USA (2017), pp. 4768–4777.
Shapley, L.S.: Stochastic games. Proc. Natl. Acad. Sci. Unit. States Am. 39(10), 1095–1100 (1953)
Pabodha, K.K., Kannangara, M., Zhou, W., Ding, Z., Hong, Z.: Investigation of feature contribution to shield tunneling-induced settlement using Shapley additive explanations method. J. Rock Mech. Geotech. Eng. 2, 2 (2022)
Òscar, J., Moritz, S., Alan, M.T.: Macrofinancial history and the new business cycle facts. In: NBER Macroeconomics Annual 2016, volume 31, edited by Martin Eichenbaum and Jonathan A. Parker. Chicago: University of Chicago Press (2017). https://www.macrohistory.net/database/
Jordà, Ò., Knoll, K., Kuvshinov, D., Schularick, M., Taylor, A.M.: The rate of return on everything, 1870–2015. Quart. J. Econ. 134(3), 1225–1298 (2019)
Arqub, O.A., Zaer, A.-H.: Numerical solution of systems of second-order boundary value problems using continuous genetic algorithm. Inf. Sci. 279, 396–415 (2014). https://doi.org/10.1016/j.ins.2014.03.128
Abo-Hammour, Z., Abu Arqub, O., Momani, S., Shawagfeh, N.: Optimization solution of Troesch’s and Bratu’s problems of ordinary type using novel continuous genetic algorithm. Discr. Dyn. Nat. Soc. 2014, 1–15 (2014). https://doi.org/10.1155/2014/401696
Ranjan, A.: An optimization of machine learning approaches in the forecasting of global financial stability. https://doi.org/10.36227/techrxiv.21273351. (2022)
Chen, S., Zong-De, S.: Financial distress prediction using hybrid machine learning techniques. Asian J. Econ. Bus. Account. (2020). https://doi.org/10.9734/ajeba/2020/v16i230231
Aydin, N., Sahin, N., Deveci, M., Pamucar, D.: Prediction of financial distress of companies with artificial neural networks and decision trees models. Mach. Learn. Appl. 10, 100432 (2022). https://doi.org/10.1016/j.mlwa.2022.100432
Kim, L.T., Hoang, A.L., Thanh, H.N., Duc, T.N.: Explainable machine learning for financial distress prediction: evidence from Vietnam. Data 7(11), 160 (2022). https://doi.org/10.3390/data7110160
Acknowledgements
This work is acknowledged by the Scientific Research Group in Egypt (SRGE)
Funding
Open access funding provided by The Science, Technology & Innovation Funding Authority (STDF) in cooperation with The Egyptian Knowledge Bank (EKB).
Author information
Authors and Affiliations
Contributions
Conceptualization—MT and IG. Data curation—IG. Formal analysis—MT and IG. Investigation—MT. Methodology—MT and IG. Senior administration—AEH. Software—IG. Supervision—AEH. Validation—MT and AEH. Visualization—MT. Writing—original draft—MT. Writing—review and editing; all authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Competing Interests
There is no conflict of interests.
Ethical Approval
Not Applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s44196-024-00494-9
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Torky, M., Gad, I. & Hassanien, A.E. RETRACTED ARTICLE: Explainable AI Model for Recognizing Financial Crisis Roots Based on Pigeon Optimization and Gradient Boosting Model. Int J Comput Intell Syst 16, 50 (2023). https://doi.org/10.1007/s44196-023-00222-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s44196-023-00222-9