1 Introduction

The transition from traditional to modern banking with the implementation of the so-called “universal banking model”Footnote 1 allowed banks to expand, over the past three decades, their traditional and specialized business strategies to a wider range of products and services, including security/investment banking and wholesale-debt. Hence, the constellation of possible combinations of asset-liability choices at the banks disposal makes the classification of their business strategies a complex task. This ultimately limits the ability of regulators to tailor risk monitoring to the specific vulnerability of each strategy to distress.Footnote 2

Although there is no strong evidence to support the claim that universal banks are riskier than specialized ones (Benston 1994; Cornett et al. 2002; Dietrich and Vollmer 2012; Curi et al. 2015), some studies (e.g., Demirgüc-Kunt and Huizinga 2010; Lozano-Vivas and Pasiouras 2010) show a non-monotonic impact of non-traditional funding-investment models on risk. Starting from low level of non-deposit funding in combination with non-interest income activities, these non-traditional banking strategies provide risk diversification benefits, whereas further increases in the level of this mix amplify bank fragility.Footnote 3 Consider for example the impact of leverage growth, that is typically associated with a higher level of risk. On one side, leverage could improve resilience for deposit-funded banks as it will boost their stable funding. On the other side, it could also increase bank vulnerability if associated with a very specialized investment strategy that suffers from the lack of diversification. The universal model offers better diversification opportunities that usually facilitate resilience (Dietrich and Vollmer 2012), although sources of funding towards wholesale-debt tend to exacerbate distress during market crisis due to the low levels of stable funding.

This paper relates to the growing debate on the importance of banks business model classification and its impact on performance and risk (Hryckiewicz and Kozłowski 2017; Martin-Oliver et al. 2017; Lucas et al. 2018). We focus on the interplay between assets and liabilities to propose the detection of stylized banking business strategies, which are interpreted as the combinations of funding and investment choices that unravel the core drivers of their business models.

To classify banks into strategy groups according to their business features, we implement a clustering technique designed for high-dimensional and sparse data sets, like accounting-based data. Then, we test whether those competitive business strategies identified by the clustering detection are exposed to different risk drivers. Finally, we provide insights on how to accurately signal business strategy instability. We are interested in exposing the vulnerability of banking business strategies to the likelihood of distress, which can help regulators to design ex-ante solutions to restore resilience.

Using a large set of bank level indicators of business strategies from 2005 to 2014, we identify three dominant and very popular peer groups each corresponding to a combination of business strategies that were stable over the whole period under investigation. The first, labelled A, is characterized by a universal banking model with dominant wholesale funding and diversified loans investments. Group B is an hybrid model that combines wholesale funding from modern banking with specialized investments in commercial loans, common in traditional banking. Group C follows a traditional banking model focused on deposit funding and commercial loans investments. Two other groups, less popular than the three core ones, were also available prior to the financial crisis, named D and E. The former resembles group C with more long-term wholesale funding, whereas the latter is another hybrid model with wholesale funding and specialized investments in retail loans.Footnote 4

We then test whether the identified peer groups show different risk drivers during the recent global financial crisis. Due to the magnitude of the 2008 financial crisis, we construct a global distress events list by merging signals regarding bankruptcies and liquidations, defaults, distressed mergers and public bailouts. We then follow the literature (see e.g., Betz et al. 2014; Vazquez and Federico 2015) which examine the impact of these major events and regress them against financial ratios after controlling for macro and sectoral effects, but differently from previous works we apply a rare-event logit model to take into account the possibility of a small amount of distress events compared to the size of the sample.

Although all groups were prone to the risk of distress at the outbreak of the financial crisis, we find different risk drivers when controlling for strategy group membership. The size of the institution (see e.g., Wang 2015), measured in terms of total assets, is positively related to the likelihood of distress only in the wholesale-funded groups, suggesting a higher vulnerability of large institutions adopting strategies with a much lower proportion of stable funding than the deposit-oriented group. Higher capital over total assets, i.e. lower leverage, contributes to limit the likelihood of distress only for deposit-funded banks. Besides, in our sample these latter appear less capitalized on average than wholesale-funded banks, suggesting that recapitalizations had a relatively limited impact on those modern banks that, due to their risky exposures, were already holding comparatively higher levels of capital.

However, changes in leverage can directly affect ROA and ROE, with indirect consequences to bank resilience. When looking at those drivers individually, we observe that ROA impacted positively the risk of distress for deposit-funded groups, and negatively for the wholesale ones. This implies that the limited assets diversification strategies of the traditional deposit-funded institutions can force them to concentrate their investments on fewer products in the pursuit of higher assets returns, generating an overall riskier profile compared to the wholesale-funded groups. In contrast, results for ROE suggest that leverage may intensify stable funding in the case of deposit-funded groups, hence improving their resilience, while wholesale-funded groups would pay the price of more unstable non-deposit funding growth in the pursuit of higher ROE. The notable exception is liquidity, whose impact on the risk of distress is significantly negative and this result is consistent across all business models. This suggests a positive contribution of liquidity to resilience which appears exogenous to the strategy group choice (Tirole 2011). Overall, although the above evidence refers to the period of the global financial crisis, it lends support to our premises that a targeted policy intervention based on business strategy classification may facilitate the prediction of banks’ distress.

The paper is organized as follows. Section 2 presents the relevant literature on banks’ business models classification. Section 3 describes the data sources and main methodological issues, including the construction of our global distress event list and the empirical design. Section 4 presents the cluster approach, validation and the economic features of banks’ peer groups. Results are discussed in Sect. 5. Section 6 concludes.

2 Literature review

The worst crisis since the Great Depression started roughly two decades after deregulation and financial innovation allowed banks to strategically move from traditional to modern banking. This resulted, among other things, in banks being able to offer a wider range of products and services, hedge derivatives and securitise. Individual banks, however, adapted their strategies to different extent depending on a variety of factors, including their ownership structures, size and risk culture. This paper sets out to determine the resilience of alternative banks’ business models appropriately identified using a large global sample and an indirect classification approach. This is a particularly useful analysis for the identification of early warning signals and the prediction of bank distress. The present study contributes to two strands of literature: the first is on clustering techniques and their applications to the banking sector to identify dominant business strategies (Sect. 2.1); and the second one is on the relationship between banks’ business models and risk drivers (Sect. 2.2).

2.1 Selected review of clustering approaches to detect banks’ business strategies

In the literature it is possible to identify two main clustering approaches to derive peer banks groups: (i) a direct, qualitative, approach, provided by the institutions themselves and based for example on bank characteristics such as being listed, mutual and so onFootnote 5; and (ii) an indirect, quantitative, approach that relies on hierarchical clustering methods.Footnote 6 This latter classification is known as the “Ward’s method” and employs the Ward (1963)’s procedure to form and identify the distance between the clusters, and includes a stopping rule on the number of clusters based on Caliński and Harabasz (1974).

Literature based on the well-known Ward approach exploits balance sheet variables (Ayadi et al. 2011; Roengpitya et al. 2014; Martin-Oliver et al. 2017) to detect similarities and create homogeneous peer groups. These studies test several combinations of assets and liabilities items to characterize banking activities and use Euclidean distances to measure how similar financial institutions are with respect to these dimensions over which banks are supposed to have a direct influence. Income statements characteristics are usually excluded from peer group assessment and identification.Footnote 7 This is because they tend to reflect the interaction between the institution and the market, so they are not under a strong direct control of the institution itself. Those characteristics are, however, part of the CAMELS rating framework used by supervisors to capture banks’ overall risk condition.Footnote 8

The direct approach has been used in number of studies, such as Demirgüc-Kunt and Huizinga (2010), and more recently (Köhler 2015). These authors document evidence of a non-monotonic effect of non-traditional banking activities to risk employing a direct classification of listed versus unlisted banks. The narrow focus of these studies on the identification of banks’ business models motivates our goal to go beyond the usual distinction based on ownership structure and extend the sample to all global institutions for which relevant data are available.

An indirect classification of large EU banks is provided by e.g. Roengpitya et al. (2014) and Ayadi and De Groen (2015). The latter study finds clear distinctions between retail-oriented models, characterized by the traditional customer deposit-loan intermediation activity (with different levels of diversification among the groups they identify) and the wholesale and investment models. For these latter, the non-traditional banking activities show a heavy reliance on either interbank lending and funding (wholesale), or trading assets and derivatives (investment). In terms of overall risk profile, retail-oriented models show higher distance to default (measured by the z-score.)Footnote 9 Our classification resembles the models seen above, and provides additional contribution to the literature by extending the sample on a global basis and benefiting from a considerably larger sample coverage (see Sect. 4).

Using a narrow list of balance sheets variables, Roengpitya et al. (2014) implement a similar indirect approach on a sample of top international banks, along with some subjective judgmental elements to filter the final groups. They found three prevailing models adopted globally: retail-funded, wholesale-funded and trading. Retail and wholesale funded models are found to perform better during the period 2006–2014 than trading model, while retail banks show more volatility in their performance compared to wholesale institutions. The same cluster analysis is also performed by Martin-Oliver et al. (2017) to investigate the banking crisis in Spain across institutions with different business models, ownership and governance features.

Alternative approaches to business model classifications adopt time series clustering as proposed by Mergaerts and Vander Vennet (2016) and, more recently, Lucas et al. (2018).Footnote 10 Mergaerts and Vander Vennet (2016) propose a continuous type classification method to discover peer groups by employing a factor analysis. However, this approach only provides a description of the peer groups that better explain the variables set without a direct link to the individual institutions adopting that business model. This is an important limitation that would make any direct intervention by the regulators problematic. Lucas et al. (2018) identify six main dimensions characterizing banks’ business models: size, funding, activity, complexity, geography and ownership. Their advantage compared to cross-sectional setups lies on the ability to control for both cross-section and time varying properties of business models. However, their approach is limited to the dynamics of the clusters overtime with a static composition. This means that institutions are assumed to adopt the same business model over time. Although banks tend to have stable business models, the transition between models is an important indicator of fragility that needs to be considered for accurate risk assessment.

We employ the indirect approach using the Louvain method of community detection (Blondel et al. 2008) that preserves the individual bank classification features and is specifically designed for large, sparse and complex data sets. We also advance on the accuracy of model specifications, which is usually considered a strong point in factor analysis, by relying on a state-of-the-art set of individual bank characteristics on a large sample. Specifically, in this paper we focus on three dimensions to characterize the business strategy of a bank, namely: funding sources, investment activities and the complexity of the strategy. As shown in the methodological section, we also employ a very granular list of bank characteristics.

2.2 Selected literature on the determinants of bank distress

There is abundant literature on the balance sheet determinants of bank distress. However, most studies tend to underestimate the importance of defining appropriately business strategies in assessing this relationship. Among the most typical variables that are found to be significantly associated with distress in recent studies are: leverage and capital, liquidity and size.

With emphasis on regulatory capital requirements and liquidity buffers, Vazquez and Federico (2015) claim that high leveraged banks with weak structural liquidity were more likely to default during the 2008–2009 financial crisis. The authors’ list of distress events consider the entire set of mergers, while in our study we concentrate on a narrow definition of distressed mergers (Sect. 3.3.1), as also suggested by Betz et al. (2014). Similar findings are reported in Beltratti and Stultz (2012): banks with lower leverage, higher regulatory capital and deposits showed more resilience to distress compared with those high leveraged institutions financed on short-term money markets. Although their analysis covers the pre-crisis period and only a relatively small sample of banks, they provide a detailed discussion on the resiliance of banks’ peer groups based on the mix of asset-liability structures compatible with the universal banking model. Their contribution inspired the set-up for our signal models detailed in Sect. 3.3. The data sample constructed in our study is closely related to Vazquez and Federico (2015) in terms of number of institutions and countries’ coverage.

Betz et al. (2014) focus more on the predictability of banks’ vulnerability by designing an early warning model for the largest European banks. Using CAMELS rating system as main descriptive elements of banks’ risk drivers (see Footnote 8), they develop a signal model on the distress events during the financial crisis by combining both direct bank failures (bankruptcies, liquidations and defaults) with indirect events like state support of distressed institutions and mergers in distress. They show that CAMELS are good indicators of bank distress and that macro-financial imbalances and sectoral indices of vulnerability improve the performance of model predictability. Our study is closely related to Betz et al. (2014) in terms of the use of CAMELS variables as risk drivers and the enlarged set of distress events. Demyanyk and Hasan (2010) offer a detailed review of these models. In our analysis we enrich the data sets described above by considering a larger global sample of banks with a more comprehensive distress events list. Recent work by Mousavi and Ouenniche (2018) provide a useful comparison of corporate distress predictions models.

3 Data and methodology

3.1 Data

To carry out the classification of banks’ peer groups, we employ data for the immediate pre-crisis period (2005–2007) drawn from the international database Bankscope. We select this specific interval because of data coverageFootnote 11 and to ensure some homogeneity in terms of accounting practices among sampled banks, as the International Financial Reporting Standards (IFRS) became compulsory in Europe for all publicly listed institutions in 2005. In addition, for robustness we extend the classification analysis to 2014 (see Appendix E). To investigate the presence of similarities across institutions in different countries, we consider a global sample covering about 180 countries, of which more than a hundred have at least 10 institutions. Our classification procedure is intended to overcome some of the potential limits due to broad classifications. For this reason, we include all types of peer groups available in Bankscope, except for Central Banks and Specialized Governmental Credit Institutions.

The procedure used to identify peer groups of banks with similar business strategies relies on the characterization of financial institutions by means of balance sheet items. The choice to discard income statement variables is in line with previous literature (e.g., Ayadi et al. 2011; Roengpitya et al. 2014) and reflects the need to identify economic dimensions over which financial institutions can have a direct influence. In our study we exploit a detailed set of balance sheet items for both assets and liabilities, providing a very granular representation of banking activities. The universal model framework has been used to select the list of variables that better represents the main classification of traditional versus modern banking, specifically deposits versus wholesale-debt (from interbank to long term borrowing) on the liability side and traditional versus non-traditional investments (from retail and commercial loans to modern interbank lending and non-interest based investments) on the asset side. Table 1 reports summary statistics of the balance sheet variables used to compute the similarities across financial institutions. The business strategy group detection is done for each year from 2005 to 2014 separately. From the set of institutions to be classified each year, we exclude institutions for which we have less than 1/3 of the selected balance sheet items. To avoid distortions across different sized banks in our global sample, we express each measure as a percentage of the respective total assets.

Table 1 Balance sheet variables statistics Column Annual Average refers to the average across annual mean values from 2005 to 2014; column Std (Annual Average) shows the standard deviation of annual mean values; column Pool Average indicates the average when observations are pooled across the entire interval 2005–2014; column Pool Std stands for the standard deviation of pooled data; column Annual Average NAs exhibits the average number of missing values computed across annual mean values

In the second part of the empirical analysis, to gauge banks’ probability of distress we use a wide set of predictors retrieved from several sources. Following the extant literature (see e.g., Flannery 1998; González-Hermosillo 1999), bank-specific features can be expressed in terms of proxies for CAMELS variables, representing our theoretical framework for selecting risk drivers. This representation helps the supervisory authority to identify institutions that need attention. Individual banks’ data to construct the CAMELS rating on overall banks’ stability are extracted from Bankscope.

In addition, we consider a set of country specific controls for the financial sector as well as macroeconomic conditions (see e.g., Betz et al. 2014; Demirgüç-Kunt and Detragiache 2005; Vazquez and Federico 2015). Since our data set is characterized by a broad classification of financial institutions, we represent the financial sector using a range of variables from classic banking sector indicators (e.g., aggregate non-performing loans to gross loans) to market measures (e.g., the financial sector market returns). Data are retrieved from BIS, Datastream, OECD and World Bank. Table 2 provides the full list and the specific data sources. Table 10 in Appendix B shows that, with only a few exceptions, correlations among predictors do not exhibit relevant and significant relationships over the period 2005–2007. However, we present different specifications of the baseline model for risk assessment, where we basically test different subsets of predictors to overcome potential issues related to multicollinearity. Results are reported in Appendix F.

Table 2 Measures, data description and sources This table shows for each predictor its definition and the source from which we retrieved data

3.2 Clusters detection

As discussed in Sect. 2.1, a typical benchmark for the cluster analysis of banks’ business models is the Ward (1963)’s algorithm combined with the Pseudo-F Index (Caliński and Harabasz 1974) as the stopping rule applied to provide the number of clusters. The Pseudo-F Index is the ratio of between-cluster variance to within-cluster variance and is used as a metric to assess the quality of the clustering results and to discriminate among different specifications of the algorithm setup. In particular, the best configuration, and therefore the resulting number of clusters, is the one associated with the greatest value of the Pseudo-F Index.Footnote 12

In this work, we use a wider list of variables than previous studies and for a larger set of institutions globally distributed. Hence, we prefer to adopt a hierarchical clustering algorithm (namely, the Louvain method) that, although in line with the Ward algorithm, is designed to address the multidimensionality issues in a more appropriate and elegant way for complex and sparse data samples (Lancichinetti et al. 2011; Chakraborty et al. 2013).

In our study, the Louvain classification method relies on the similarities among institutions’ financial statement attributes, collected in a vector per bank and per year. To determine these similarities we compute a standard measure used in information retrieval (van Dongen and Enright 2012), namely the cosine similarity, which is typically applied for sparse and multidimensional data (Tan et al. 2006). Hence, we calculate the cosine of the angle between each pair of vectors in the inner product space and we divide it by the vectors’ L2 norms to make it bounded between \({-}\)1 and \({+}\) 1. Then, the hierarchical Louvain clustering method (Blondel et al. 2008) is applied to the pair-wise similarity matrix to find groups of institutions adopting similar business strategies. These groups are identified by maximizing the modularity quantity, which is a measure that quantifies the strength of division of a system into clusters of densely interconnected institutions that are only sparsely connected with the rest of the system (Newman and Girvan 2004). In our analysis, the resulting clusters represent peer groups of banks sharing similar business strategies. Details of the classification approach are given in Appendix C.

It should be noted that missing values represent a key issue in clustering algorithms whenever authors aim to accommodate the complete data set. The absence of a certain balance sheet item is itself a sign of a business feature and some authors (e.g., Ayadi and De Groen 2015) replace them with zeros. The method proposed in this study does not require ad-hoc assumptions on missing values as by construction the cosine similarity is computed on available data only, providing a more suitable and robust classification for complex and sparse databases like ours.

To test the suitability, or “validation”, of the Louvain method we compare in Sect. 4.1 the quality of these clusters with the direct classification provided by Bankscope and the clusters obtained from the Ward algorithm, being aware that it is a complex task with no easy solutions (Han et al. 2011).Footnote 13

3.3 Empirical approach for risk assessment

The prediction of banks’ distress during the recent financial crisis is performed by means of a logit model on the cross-sectional distribution of three types of variables prior to the global financial crisis. Specifically, our empirical model is:

$$\begin{aligned} Pr(Y_{i}=1|\mathbf{x }_{i}) = \Lambda (\mathbf{x }_{i}\beta ) \end{aligned}$$
(1)

where \(\Lambda \) is a logistic function and \(Y_{i}\) assumes value 1 if institution i has been under distressed conditions in the period from 2008 to 2010 and 0 otherwise. Vector x includes bank-specific measures, financial sector indicators and macro variables. This investigation setup is in line with other works in this literature which aim at disentangling the effects of banks features from the impacts of sectoral and macro controls in the prediction of the likelihood of distress during the recent financial crisis. Given the broad geographical coverage we devote attention in including controls able to map banking and financial sectors worldwide (see Appendix D).

The design of our work intends to identify specific banks features that have different impact to distress due to peer groups membership. Although banks may have changed their business strategies in the expectation of future distress, we believe that the extent of the crisis of 2008 was not properly embedded in banking practices before the effective outbreak of financial markets. However, to limit endogeneity issues, we follow similar investigation frameworks proposed in the literature (see e.g., Manzaneque et al. 2015; Vazquez and Federico 2015) and we exploit the pre-crisis averages (from 2005 to 2007) to compute values of the explanatory variables.Footnote 14

In addition, due to the presence of a few distress events, we decide to apply a rare event logistic regression to take into account the possibility of a small amount of cases of the rarer outcome. In our work, this aspect is particularly relevant when risk assessment models are estimated within peer groups. Hence, to reduce the small-sample bias in the maximum likelihood estimation, we apply the Firth’s Penalized-likelihood logistic regression (Firth 1993), which is a convenient approach to obtain finite and consistent estimates of the regression parameters when maximum likelihood procedure suffers from complete or quasi-complete separation. This bias preventive approach imposes priors on model coefficients to limit small sample bias in the log-likelihood regression \(\log L^{*}(\beta )=\log L(\beta )+A(\beta )\), where \(A(\beta )\) stands for the Jeffreys invariant prior \(1/2~\log \left[ det(I(\beta ))\right] \) and \(I(\cdot )\) is the Fisher information matrix. To remove the first-order bias of the ML estimates, Firth’s correction introduces a term that goes to zero as the sample size increases and counteracts the first-order term from the asymptotic expansion of the bias of the ML estimation. Ma et al. (2013) find, for instance, that Firth’s correction is more powerful than score test based meta-analysis and controls type I error well for both balanced and unbalanced studies.

Our risk of distress models rely on proxies for CAMELS dimensions with a wide coverage (see also Table 2 for more details). Among the possible balance sheet indicators, we focus on those that are well-represented over time and across different types of institutions. In particular, capital adequacy is measured by two ratios: equity to assets ratio (as measure of leverage) and the sum of equity plus subordinated borrowings over total assets (Capital Funding Ratio). Capital adequacy represents the level of bank capitalization and higher values stand for better solvency conditions, thus lower values are expected to increase the probability of bank distress. The asset quality is assessed through the return on assets (ROA); in principle, higher returns are negatively related to distressed conditions. The assessment of asset quality within the CAMELS framework usually includes the relationship between loans losses and reserves for impaired loans as an indicator of the quality of the assets side. Due to the relatively poor coverage in our sample for the period prior to the crisis, we rely on ROA in our main model and then carry out robustness checks for a sub-sample that employs bad loans over total loans. ROA has embedded in the numerator (net income) the contribution of loans losses, while the denominator (total assets) reflects the presence of the corresponding reserves. Appendix F reports our findings for our risk assessment models in which both the capital adequacy and the asset quality are proxied by more accurate, although less available, accounting-based dimensions. For capital adequacy we consider, in fact, the Tier 1 and the Total Capital Ratio, while for asset quality we use Impaired Loans to Gross Loans and Loan Loss Reserves to Impaired Loans. Notwithstanding the sample coverage decreases substantially for these models, we anticipate that the main findings on risk assessment are largely confirmed.

We measure the management quality by means of the return on equity (ROE) and the ratio of operating expenses over operating income (Cost to Income Ratio). The relationships between management quality and the probability of distress is expected to be negative, as better management practices should foster economic performances and institution resilience to distress. Net Interest Margin is utilized as a proxy for earnings and the expected sign of the relationship with distress is negative. In addition, the earnings dimension is approximated by the ratio of Interest Expenses to Total Liabilities and, in this case, the expected relationship is positive. Liquidity is measured by the ratio of liquid assets over customer and short-term funding (Liquid Assets to Short-Term Funding) and by the ratio of deposits and short-term funding over total funding (Deposits to Total Funding). Usually, institutions with better liquidity conditions are more likely to meet their financial obligations and thus are perceived as less risky (Tirole 2011). Finally, the sensitivity to market risk is measured by the share of securities to total assets (Total Securities to Total Assets). The relationship with the probability of distress is ambiguous since securities are a volatile source of income but at the same time can be more liquid than, for example, loans. This feature is particularly relevant for risk assessment during the recent crisis since the effects of fire sales, which represented a channel through which financial distress spread throughout the system, made some institutions more vulnerable. Summary statistics of the predictors’ values are shown in Table 3.

Table 3 Predictors summary statistics Values are computed as averages over the interval 2005–2007

Although referring to accounting-based information, CAMELS indicators used in the models for risk distress are not trivially related to the business strategies detected in Sect. 3.2. In fact, (i) CAMELS are not among the indicators used in the multi-dimensional clustering procedure, and (ii) although CAMELS may represent transformation of some indicators used as inputs for the clustering analysis, they are in general strongly representative of the reciprocal influence of both balance sheet and income statement variables, where the latter are excluded from the clustering procedure.

3.3.1 Distress events definitions

As described above, the dependent variable \(Y_i\) in Eq. (1) is a dichotomous variable that takes value 1 if institution i has been under distressed conditions in the period from 2008 to 2010 and 0 otherwise. Table 4 shows the distress events definitions and the corresponding sources that we used to construct this variable.

Table 4 Distress events definitions and sources The amount of distress events refers to the period from 2008 to 2010

Institutions may be under distressed conditions due to several reasons, although the recent financial crisis suggests that government bailouts and state aids had a role in the avoidance of systemic crisis and cascade of banks’ failures. Therefore, direct failures were quite rare and presenting estimates separately for different distress events would have made the econometric estimation not robust enough. For these reasons, we propose a comprehensive list of distress events which takes into account several definitions of bank distress (for an approach similar to ours see Betz et al. (2014) and Vazquez and Federico (2015), while Kick and Koetter (2007) distinguish between different types of distress).

Bankruptcy occurs if the net worth of the bank falls below a country-specific regulatory threshold, while liquidation concerns the sale of bank’s assets by the liquidator as per the guidelines of the country regulations and the distribution of the corresponding assets to claimants. These two distress events were quite rare during the recent financial crisis as governments interventions created a safe net to prevent cascade failures. We therefore introduce additional distress definitions to capture these interventions. Defaults occurs if the bank failed to repay interests or principal on its financial obligations beyond any grace period or if some of its instruments are replaced by other obligations at a diminished value as a consequence of a distressed exchange between counterparts. We rely on ratings from Moody’s and Standard and Poor’s to assess the presence of a default state. In particular, we merge both short-term and long-term ratings and only if the evaluation of the bank conditions is poor in both cases we consider that bank under a default situation. Moreover, forced mergers of distressed institutions have occurred during the crisis. We define an institution as part of a distressed merger if its coverage ratio in \(\textit{t}-1\) was negative. Coverage ratio is a typical indicator used to assess banks’ vulnerability conditions (see e.g., González-Hermosillo 1999) and is computed as the sum of equity plus reserves for non-performing loans minus total impaired loans over total assets.Footnote 15 In addition, we consider as distressed mergers also those cases where the institutions present a rating indicating a vulnerable state. Finally, we enrich the data set of distressed institutions by including the information of public bailouts (Laeven and Valencia 2010, 2012) and, for the US perimeter, we integrate Bankscope with bankruptcy information from the Federal Deposit Insurance Corporation.Footnote 16

The largest proportion of distress events (42%) occurred by the end of 2008, declining considerably by the end of 2009 (33%) and 2010 (25%). However, discriminating distress events by year within the crisis period of 2008–2010 can be misleading as reported, for instance, by Laeven and Valencia (2012).Footnote 17

4 Banks’ business strategies identification and distress events

4.1 Clusters validation

We rely on three main measures for clustering validation, namely the average silhouette width, the Pearson gamma and the average within/between ratio of the distances (see e.g., Halkidi et al. 2001; Han et al. 2011). Note that to clusterize the whole data set of institutions, assumptions aimed at filling missing values, which are of a substantial proportion on a sample size and geographic coverage of this magnitude, have to be considered. Table 5 provides performance measures of each of the three clustering methodologies under four main assumptions for filling missing values: [Z] that assigns zeros to all missing values; [A] that replaces each missing value with the average value across the whole sample for that variable; [M] similar to the average case but using the medians; [EM] that estimates missing values using a bootstrap procedure for multiple imputations which combines the Expected Maximization algorithm (Dempster et al. 1977) used to find the mode of the posterior distribution with a bootstrap approach to take draws from this posterior (Honaker and King 2010). Clustering validation measures are computed for the indirect methods and the direct classification (i.e., Bankscope institutions’ specializations) using the resulting clusters from the respective approaches and, to enhance comparability, filling missing values using these four criteria separately.

Table 5 Validation of clustering approaches In the table we provide clustering validation measures for the Louvain, Ward and direct (Bankscope) clustering methods

Results confirm that the direct classification is a poor indicator of business strategy assessment, whereas the indirect approaches (both the Louvain adopted in this study and the Ward algorithm) provide superior clustering estimates across different model configurations and validation measures.

Specifically, our evidence indicates that institutions with the same direct specialization may adopt quite different business strategies. Therefore, in line with our expectations, we infer that the direct classification does not provide a sufficiently informative indication of banks’ activities, particularly when considering cross-country comparisons. Both Louvain and Ward scores are much higher for the first two statistics and lower for the wb ratio than the direct one by far, supporting the adoption of indirect classifications as better methods to identify banking peer groups. Between the two indirect methods, we observe a very close and consistent performance across the different model configurations and validation measures. However, we favour the Louvain over the Ward method given the characteristics of our sample and for its advantages when dealing with sparse and complex data.

4.2 Identification of banks’ business strategies

The aggregate factors used to identify banking business strategies resulting from the cluster analysis are reported in Table 6. Each column represents a business model, where the three dominant or “core” groups that account for the largest number of institutions are labelled as: A, B and C. Groups A and C correspond to the two polar cases of wholesale-oriented/modern banks and deposit-oriented/traditional banks, respectively, while group B is a hybrid case in between A and C.Footnote 18 Institutions within these peer groups span well across countries and continents, supporting our empirical setup on a global coverage (details in Appendix D). In some cases, institutions have migrated from one group to the other over the period as discussed in Appendix E.

Table 6 Economic features of business strategy peer groups-pre crisis summary features We report average values for aggregated balance sheet variables standardized by total assets for groups A, B, C, D and E

Group A is is the most popular group in terms of number of institutions (over 3500) and is characterized by the largest proportion of wholesale funding, accounting for roughly 34% of total assets on average, and well diversified loan investments with moderate exposures to interbank activities. Close similarities are found with the “Wholesale-funded” model discovered in Roengpitya et al. (2014) and the “Wholesale” peer group presented in Ayadi et al. (2012).Footnote 19 Typical institutions in this group have size above average, comparatively low leverage and medium levels of net liquidity. Many US bank holding companiess (e.g., Citigroup) and the top investment banks (e.g., Goldman Sachs), and Russian commercial banks are classified in this group, followed by French (e.g., Société Générale) and Swiss commercial banks (e.g., Credit Swisse).

Group B is characterized by a moderate amount of wholesale funding and the largest exposure to commercial loans investments, which differentiate it from the well diversified group A. Institutions in this group tend to be smaller than those in group A; they also tend to be highly leveraged and with an appropriate maturity transformation. Their features are similar to the risky “Stakeholder” banks reported in DeYoung and Torna (2013). Group B represents a hybrid peer group similar to group A on the liability side and more traditional on the asset side (similar to group C). Most of the institutions classified in this model are German (Volksbank and Raiffeisenbank cooperative banks and Sparkasse savings banks) and US-based (mainly regional and state commercial banks). Italy is well represented too in this business model in 2007, with a relatively large number of cooperative along with some commercial and saving banks migrated from the model D discussed below.

Institutions belonging to the third core peer group C engage primarily on traditional customer deposits funding (the largest across models) and lending activities (mainly commercial loans). They are also the only interbank net lenders among the three core business strategies. In theory, group C provides the most stable funded business strategy due to the deposit-driven liabilities. Roengpitya et al. (2014) find a retail-funded model closely related to our group C.Footnote 20 High leverage, along with dominant commercial loan investments, makes this model very similar to the peer group B, although featuring a much more stable funding due to the large proportion of customer deposits to total funding. This is also the business strategy that provides the most dominant maturity transformation of all (the lowest net liquidity value), confirming the provision of traditional banking services. This feature supports the empirical evidence reported in Paligorova and Santos (2016) on the maturity transformation characteristics of wholesale versus deposit-oriented banks. This model is primarily represented by Japanese cooperative banks and US commercial and savings institutions (e.g., Washington Mutual).

Our classification approach also captures two other specialised groups that were prevalent in the first 2 years of our sample and then disappeared at the onset of the crisis. The first, labelled group D, accounts for about 1500 institutions, and is characterized by dominant long-term funding, commercial investments and a net interbank lending exposure (as in group C). Italian and Spanish co-operative and savings banks are the most represented institutions. Most of these institutions migrated to the B peer group in 2007 due to the close similarity with regards to their assets side, with few exceptions migrating to group C (see Appendix E). The second of these specialised business strategies, labelled group E, was characterised by a diversified funding combined with the largest exposure to retail loans investments. Those institutions were also highly leveraged compared with the other peer groups. This model was dominated by Swiss saving banks and US bank holdings. At the onset of the crisis most of them migrated to group A.

4.3 Distress events by business strategy

Table 7 illustrates the distribution of 2008–2010 distress events by peer groups.Footnote 21 Most distress events are found in group A, suggesting a fragile business strategy as also reported, for instance, in Ayadi et al. (2011), Demyanyk and Hasan (2010) and Roengpitya et al. (2014). However, generally distress events appear sufficiently well-spread across business strategies.Footnote 22

Table 7 Distribution of distress events for business strategy membership Column names refer to business strategy peer groups in year 2007; row names in bold stand for peer groups in year 2006; finally, row names in italics refer to peer groups in 2005

Our findings offer evidence of a size effect for the wholesale-oriented peer groups as distressed institutions are significantly larger in terms of total assets than their peers. For group A, the 64 banks under distress accounted for $10.5tn of total group assets in 2007, 1/6 of the total of their peers, and average sizes of more than $160bn, almost six times bigger than their peers. Similarly, the 28 distressed institutions in group B reached a total assets coverage of $2tn in 2007, 1/5 of the whole amount of the group with average sizes up to 15 times those of their peer group members.

In contrast, for group C the average size of distressed institutions during the period 2008–2010 is in line with their peers, with not consistent patterns to be spotted at the country level either. For the 50 distress institutions found for group C in 2005–2007, total assets were “just” $600bn compared to the $24.5tn of the whole group in 2007, with average sizes even below the average of their peers ($12.2bn for the distressed institutions compared with the group average of $16.8bn in 2007). This would suggest that relative size on a business strategy with very high stable funding does not matter for purposes of risk assessment. This effect is empirically tested in Sect. 5.

Table 8 Distress assessment The first three models refer to specifications within the corresponding group of predictors: CAMELS, Macro and Sector, respectively

5 Empirical results on the determinants of banks’ distress

5.1 Baseline model results (all banks)

Table 8 reports the results from our baseline models to assess the likelihood of distress during the global financial crisis. The first three specifications regress our dependent variable, i.e. banks’ distress events, on each group of predictors separately, i.e. the proxies for CAMELS, the macro and the sectoral dimensions.Footnote 23 The last two columns provide, respectively, the results for: the model that includes all variables and selected variables only. Focusing on the first set of results (column CAMELS) our evidence reveals that in most cases relationships with banks’ distress are as expected, with negative and significant effects with the variables Capital,Footnote 24 Cost to Income Ratio, Net Interest margins and our two Liquidity proxies; and positive and significant associations with ROE and Total Securities to Total Assets.

The impact of bank capital is mixed, as more equity over assets seems to be associated with lower risk-taking, as found in similar models proposed by, e.g., Betz et al. (2014) and Vazquez and Federico (2015). However, when considered together with subordinated debts in order to mimic the regulatory capital (the Capital Funding Ratio variable), the sign is positive, although the level of significance is low. There are various possible reasons for this mixed result, including potential moral hazard effects at play at high regulatory capital levels. A caveat of this interpretation might be based on the presence of either non-linear or thresholds effects (Estrella et al. 2000); the model may also suffer from some multicollinearity issue which motivates a more parsimonious model that will be described below.

ROE is a measure of profitability that may be due to higher risk taking, for example if the denominator is low, so riskier activities seem to have implied higher probability of distress. We anticipate that ROE could ultimately have a negative impact on distress depending on the banks’ strategies, a result which would support our intuition on the main hypotheses to discriminate risk drivers among peer groups (see Sect. 5.2).

Large liquid positions influenced negatively the risk of distress, as higher levels of liquidity can strengthen the solvency of the institution (see e.g., Khan et al. 2016). Banks usually follow assets and liabilities management (i.e., ALM) procedures to gauge liquidity, interest rates and currency mismatches of their exposures and to assess the buoyancy of funding and liquidity sources (DeYoung and Jang 2016). Failure to properly manage these sensitivities before the outbreak of financial markets could have undermined the quality of balance sheets and made banks more prone to distress.

The share of securities to total assets (Total security to Total Assets) resembles the sensitivity to market risk. The relationship with the probability of distress can be ambiguous since securities represent a volatile source of income, although at the same time these assets can be more liquid than, for example, loans. This aspect was particularly relevant during the recent crisis since the effects of fire-sales made some institutions more vulnerable. Our baseline models point to a marginal and positive effect, although not always statistically significant.

We stress here the importance of macro and sectoral indicators in the prediction of banks’ distress. Higher GDP per capita reduced the likelihood of being in distress, so better economic conditions seem to have fostered financial system resilience, whereas higher Inflation and FDI Outflows increased the likelihood of institution’s distress (Ostry et al. 2012). We note that the variable House Price had a positive and significant sign which probably outlines the role of the mortgage market whose collapse during the recent crisis heavily influenced financial stability (Cole and White 2012). Odd sign of Unemployment in the Macro model disappears once we introduce a more comprehensive list of predictors. Expected results arise when considering the sectoral analysis, with Market Index returns working in the same direction as GDP per capita, coherently with the high level of correlation among these variables as reported in Table 10 of Appendix B. Marginal negative effects appear for the Central Government Debt and even for the proportion of domestic financial sector NPLs to total loans (Banks NPLs to Gross Loan). Opposite contribution to institutions’ resilience was associated with sovereign debt yields (Gvt. Long-Term Yield) whose dynamics usually reflects the country risk appetite by investors. The presence of sovereign instruments in banks’ balance sheets often relates to regulatory requirements and the need to fulfill capital constraints based on the amount of risk-weighted assets.Footnote 25 Financial institutions are, in fact, usually exposed to sovereign debt instruments, especially domestic issuances, which influence banking practices and impact on the likelihood of being vulnerable to distress from shocks arising in the sovereign debt markets (see e.g., Laeven and Valencia 2012). Macro-economic and sectoral conditions thus play a role in predicting the likelihood of banks’ distress, supporting the inclusion of macro-prudential policies as complement to micro-prudential and bank level tools.

For completeness, we also run a specification with all predictors, even though we note that the large number of variables compared with the amount of observations and distress events does not allow for an accurate and proper econometric investigation. The fifth specification aims at circumscribing this issue by implementing a parsimonious model using half of the predictors. The selection of these variables is driven mainly by their coverage within the sample and the significance of the estimates within the single models, but still preserving the representation of all CAMELS dimensions as well as macro and sectoral control variables. Among the individual predictors in this enriched framework we notice that higher levels for both ROA and Deposits to Total Funding facilitated the resilience of the institutions; an opposite contribution emerges for ROE suggesting that increasing propensity to risk taking prior to the crisis predicted institutions to be in distress. At the macro level, we confirm the impact of the country business cycle approximated by the GDP per capita, while high Inflation and Unemployment levels deteriorated banks’ stability conditions. At the sectoral level, Market Index has the same negative sign as GDP per capita as presented above. The sectoral market index (Sector Index, i.e. market returns of the financial sector), however, was positively and significantly related to distress, capturing the effect of the financial sector bubble prior to the crisis. It is worth noting that the pseudo \(R^{2}\) of the single model regressions are low, while we reach higher values of explained variability in bank probability of distress with the inclusion of a more comprehensive list of predictors (see model Selected). Due to the level of correlations of ROA, ROE and Capital reported in Table 10, Appendix F validates our estimations by providing more in-depth variants of the Selected model. Specifically, we estimate the model by retaining only one of the aforementioned regressors. We also include interaction effects to capture potential effects beyond a simple linear relationship. Results are reported in Table 15.

5.2 Risk drivers by banks’ peer groups

The information about banks’ business strategies represents a valuable opportunity for regulators to investigate institutions’ characteristics and vulnerability to distress during the crisis. This would support targeted intervention and more accurate risk monitoring. In this Section we analyze each peer group separately by partitioning the sample according to their ‘stable’ membership over the interval 2005–2007. This setup allows us to disentangle the impact of being in a particular peer group in the pursuit of testing our main hypothesis. Hence, Table 9 focuses on a restricted case where we discard institutions that switched peer groups prior to the crisis and, therefore, are less representative for those business strategies. Since the number of observations and, in particular, distress events under each peer group is modest, we need to consider a Penalized-likelihood logistic regression with very few predictors. For this analysis we focus on the two core and polar groups A (modern banking) and C (traditional model).Footnote 26 We propose a simple framework with common CAMELS variables and two basic macro predictors (GDP per capita and Inflation) along with two sectoral variables which capture market dynamics (Government LT Yield and Market Index return). Estimates (both the signs and the magnitudes) for the entire set of no-switching institutions (All model) are similar to those discussed in the Selected model of Table 8, providing reassurance against potential endogeneity problems. This also supports the selection of these predictors for the reference model used to create specifications for each peer group. Note that although confined to specific peer groups membership, these configurations share the same setup of previous models to enhance comparability. In addition, we add the level of institutions’ total assets as predictor. As discussed in Sect. 4.3, size relative to their peers seems to matter for the event of distress during the recent financial crisis at least for the wholesale-oriented institutions.

Table 9 Distress assessment within business strategies Columns Group A and Group C refer to observations for institutions that do not change peer group and that belong to that specific business strategy indicated in the name of the column

Table 9 confirms that dominant assets size relative to their peer group members exacerbated the likelihood of distress only on group A, i.e. the most wholesale-oriented model, whereas it had no significant impact for the deposit-oriented group. This finding provides an alternative view on the “too-big-to-fail” problem that can be tailored to specific peer groups. Estimates indicate that groups A and C present quite different risk drivers in terms of CAMELS variables during the global financial crisis: ROE impacted positively on the likelihood of distress for group A institutions and negatively for C ones (although the coefficient magnitude is relatively small), while ROA exhibited an opposite pattern and Capital had a negative sign for group C institutions but did not show significant effects for those in group A. The only exception is on the proxy for liquidity that was negative and consistent across model specifications. This result provides supporting evidence on the debates that stress the impact of liquidity to the recent financial crisis (Tirole 2011) as independent of the adopted business strategy.

On the impact of Capital (and reciprocally on leverage), we recall that group C institutions were on average less capitalized (highly leveraged) compared to those in group A, and would have definitely benefited from a boost of capital to enhance stability (Berger and Bouwman 2013; Vazquez and Federico 2015). However, we would take a step further into the assessment of leverage via ROA and ROE. Advancing on the explanation on the effect of ROA, we compare the level of diversification on the assets side between traditional deposit-oriented and modern wholesale-oriented groups, more precisely the differences between a loan-focused versus a well diversified investment strategy. Institutions within the highly leveraged business strategy as those in group C could not fully exploit a wide spectrum of investment choices to boost their returns on assets (see Chiorazzo et al. 2008 for evidence in the EU). This was due to a limited range of instruments in their assets side (recall their investment strategy mainly focused on commercial/corporate loans), which probably impacted on their ability to select risk-adjusted profitable investments compared with group A institutions. Empirical evidence shows that deposit funded banks tended to extend more credit lines to firms on longer maturities (Paligorova and Santos 2016) due to their stable funding (Ivashina and Scharfstein 2010; Kashyap et al. 2002), which although promoting profitability would have exposed them to the corporate credit market that was affected by the sub-prime mortgage crisis.

On ROE, the opposite effect would be related to the structure of the liabilities side of the institutions. Group A was already characterized by high exposure to profitable and risky assets, and further investment decisions aimed to improve the level of ROE could have worsen the sustainability of their activities (Stiroh 2004) compared with those in group C. This effect is in line with the findings of Demirgüc-Kunt and Huizinga (2010) on the nonlinear effect of the growth of non-deposit funding and non-interest income investments to the prediction of distress. Group A institutions had higher exposures to interbank debts, and further investments were indeed likely to involve an increase of this type of leverage, which in turn deteriorated the resilience of this peer group and, eventually, exacerbated their risk of distress during the crisis (Vazquez and Federico 2015); Group C institutions on the other end were characterized by more stable funding, namely deposits, so the mix of funding that they used for investment purposes was less prone to suffer from financial market instability (Beltratti and Stultz 2012). The latter phenomenon can imply contagion dynamics that are related to interbank exposures through which contagion may actually propagate (Krause and Giansante 2012; Cont and Minca 2016). It is worth recalling that the dependence on interbank positions was a specific feature of the wholesale-oriented peer groups, while during the recent crisis customer deposits, dominant funding source in deposit-oriented institutions, were not particularly affected by bank runs triggered by the lack of confidence in banks’ quality (for discussions see e.g., Gorton 2010). Furthermore, the interconnectivity arisen from interbank exposures might have determined the need to redefine the bilateral positions during the outbreak of financial markets to compensate for the increasing perception of counterpart risk, thus resulting in more volatile balance sheet compositions. This might have also influenced the reallocation of investments on the assets side, due to constraints on funding sources which reciprocally affected fire-sales dynamics (Anand et al. 2013; Georg 2013).

Finally, we add a specification which includes all the wholesale-oriented institutions in the same group (last column is circumscribed to the merge between A and B groups only). Although the main discussion above compares the two extreme peer groups, namely A and C, which is in line with the current debates on business models (see Sect. 2), this further analysis helps validating comparisons between wholesale-oriented models and the deposit-oriented model, due to the inclusion in particular of group B institutions. All three specifications of wholesale models proposed are consistent and estimates reinforce the interpretation that modern wholesale-oriented and traditional deposit-oriented institutions presented different predictive drivers for the likelihood of distress during the recent financial crisis. Thus, our results support our main hypothesis by emphasizing different predictive features of institutions’ probability of distress under traditional or modern banking models.

6 Conclusions

This study investigates the impact of financial institutions business strategies to the prediction of distress during the 2008 financial crisis. Although empirical evidence suggests that wholesale-funded institutions are more prone to distress due to low levels of stable funding, the recent financial crisis exposed the fragility of all business strategies. We therefore focus our analysis on exploiting banks adopting similar business strategies (peer groups) to estimate the likelihood of distress during the recent financial crisis in order to promote banks classifications as a useful assessment procedure for targeted monitoring.

By partitioning the sample according to peer group memberships that were consistent over the period 2005–2007, we confirm that size (in terms of total assets), and therefore the issue of “too-big-to-fail”, had a significant predictive impact only among modern wholesale-oriented models. This is not the case within the traditional deposit-oriented groups, where size did not play any significant role. We also compare modern wholesale-oriented versus traditional deposit-oriented institutions noting opposite patterns for CAMELS risk drivers contributions in predicting the risk of distress: (i) a significant and negative sign for capital (positive for leverage) only for the less capitalized (highly leveraged) deposit-oriented group, thus supporting the idea that recapitalization might not improve resilience for those institutions adopting modern and universal banking practices that already presented heavy capital burden on their risky exposures; (ii) a negative impact of ROE for traditional (not well diversified) business models and positive for modern banking models which captured the impact of leverage based on a stable versus an unstable funding structure; (iii) opposite effects for ROA that reflected the impact of the pursuit of higher assets returns on restricted investment portfolios compared to well diversified ones.

This analysis is expected to provide useful recommendations to regulators, particularly in the context of the identification of early warning signals and the prediction of bank distress. For example, the recent Supervisory Review and Evaluation Process (SREP) put in place in Europe directly states peer group analysis as an important pillar for the sustainability of banking sector. By enriching the mainstream banks’ peer group classification, we provide evidence of how each model presents peculiar predictive drivers for the likelihood of distress at a global level, where the one-rule-fits-all approach for monitoring and risk assessment could be dramatically misleading compared to a targeted intervention. The exception we find is on the impact of liquidity that appeared exogenous to business strategy, supporting the new adopted Liquidity Coverage Ratio and Net Stable Funding Ratio introduced by Basel III that are meant to tackle this issue (BCBS 2011).

The case study offered by the recent crisis identifies peculiar financial statement items as drivers for the prediction of banking distress. These variables emerge as distinctive features of their business strategies. In line with our findings, future research could address the reasons behind peer groups instability and the role of liquidity that we show to be quite consistent across business strategies. Funding liquidity risk might be deeper analyzed by exploiting the sensitivity of peer groups to liquidity composition and non-performing loans exposures, including earlier years and quarterly releases to improve the estimation procedure. This would require a more parsimonious set of institutions’ features to overcome comparability issues. The vulnerability of banks could also be assessed by other indicators well-adopted in literature, such us the distance to default (z-score), SRISK and MES (Acharya et al. 2012, 2017; Brownlees and Engle 2016), \(\Delta \)COVAR (Adrian and Brunnermeier 2011), or DIP (Huang et al. 2009), which can be easily included in our framework, as well as more focused macro and sector indicators designed for specific distress events within the geographical versus peer group space.