1 Introduction

According to the statistics from the world bank (data.worldbank.org), the GDP growth rate in China experiences a consistent decline from 2010 to 2019. The figure decreases from 10.636 to 6.109, however, this slow-down in economic growth does not prevent the banking sector from playing an even more important role in allocating credits to different economic sectors. We can see from the data provided by the global economy, the proportion of banking sector in the country’s GDP keeps increasing consistently from 2010 up to 2017, and the percentage increases from 125.55% to the peak of 175.54% by the end of 2017.

This statistic is within expectations because, similar to other developing countries, China does not have a well-developed stock market, and the allocation of credits heavily relies on the banking sector. To be able to fulfill the function of the banking industry to serve the economy, solving the question of how to improve resource allocation in the banking production process will be very helpful to achieve this specific bank function. In other words, banks need to find an effective way to minimize the input investment or maximize their output production. The empirical research has been consistently making great efforts in achieving this goal through analyzing the efficiency level of various economic sectors. The widely used methods to estimate efficiency can be classified into two streams: one is the parametric stochastic frontier analysis, which has been applied to the non- banking sector (Charoenrat & Harvie, 2014; Lin, & Long, 2015; Wanke et al., 2020) as well as the banking sector (Fang et al., 2019; Tan & Floros, 2019; Tan et al., 2017). However, the parametric stochastic frontier approach suffers from several limitations: (1) it does not work very well with small samples; (2) it needs to specify a specific function form and different specifications will lead to variations of efficiency scores (Charnes et al., 1995). The second stream of the method is the non-parametric data envelopment analysis (DEA). This method has been applied widely to various sectors of the economy and various attempts have been made to make relevant developments based on the traditional model.Footnote 1 Compared to the parametric stochastic frontier analysis, the data envelopment analysis also may be criticized from a variety of perspectives: (1) the selection of inputs and outputs will affect the efficiency scores; (2) no statistical noise is assumed by DEA and the results from DEA are sensitive to measurement errors and extreme observations. However, the empirical studies show that using DEA to estimate efficiency can generate more robust results (Seiford & Thrall, 1990).

Data Envelopment Analysis (DEA) (Charnes et al., 1978) is a prominent technique for evaluating the relative efficiency of a set of entities called decision making units (DMUs) with homogeneous structures. Efficiency in DEA is assessed by measuring how far a DMU is from the best practices (or worst practices if an optimistic manner is followed). DEA uses an axiomatically defined production possibility set with an efficiency measure assigned to it. The efficiency measure estimates the distance between the under-evaluation DMU and the efficient frontier. A variety of efficiency measures have been developed in DEA. An efficiency measure should possess some properties to be suitable for performance assessment and ranking DMUs. Units invariance, monotonicity and translation invariance are some basic requirements of an efficiency measure (Cooper et al., 1999; Fare & Lovell, 1978). Units invariance means that estimating efficiency is independent of the units by which inputs and outputs are measured. Monotonicity implies that the efficiency measure strictly decreases in response to worsening inputs or outputs of an inefficient DMU. Another vital property of an efficiency measure is translation invariance. A DEA model is said to be translation invariant if translating the original input and/or output data values results in a new problem that has the same optimal solution for the envelopment form as the old one.

Efficiency in DEA is evaluated by either radial or non-radial projection. Radial measures proportionally increase outputs (output-oriented) or decrease inputs (input-oriented) to project a DMU onto the efficient frontier. In terms of non-radial projection, each of the inputs and/or outputs can change independently. Non-radial measures are of great importance in DEA because non-proportional expansions or contractions are possible in many applications implying that only non-radial benchmarking and comparison are sensible. It is worth noting that the radial models ignore the effect of non-radial slacks in the efficiency (Tone & Tsutsui, 2010). The DMUs cannot be identified as completely efficient when the efficiency is one, but the slacks are not zero. Furthermore, for some cases—such as banking efficiency evaluation—some factors (inputs/outputs) are partly substitutional and may not change proportionally. Therefore, radial models have some shortcomings in evaluating banking efficiency. In this paper, a DEA method is proposed which provides a non-radial efficiency assessment in the banking sector.

The evaluation of performance including efficiency analysis will not only provide a picture regarding how the inputs and output resources are allocated in the production process, but also relevant policies can be made specifically on how to better allocate the factors of production through increasing or decreasing certain types of products. However, this policy implication is mainly limited within the production process; all the other factors outside of the production process are ignored. Therefore, a growing number of empirical studies have been engaging in a second-stage regression analysis to investigate the determinants of performance. In the banking sector specifically, efforts have been made to use various econometric techniques to evaluate the determinants of bank performance, in particular in the area of efficiency determinants examination through Tobit regression (Defung et al., 2016), fractional logic regression (Tan & Anchor, 2017), and Bootstrapped truncated regression (Wanke et al., 2016), while a step further on this has been made in the empirical literature to investigate the inter-relationships between efficiency and other variables concerned through simultaneous equation modelling under either seemingly unrelated regression analysis (Altunbas et al., 2007), three-stage least square estimator (Konara et al., 2019) or Grainger-causality test (Fiordelisi et al., 2011). However, the existing literature concerning the investigation of efficiency determinants, in particular, the inter-relationship between efficiency and other variables concerned has two issues: (1) all the empirical studies only examined the efficiency with a very limited number of variables focusing on bank risk. All the other variables which are very important in banking operating, such as bank profit, traditional and non-traditional business, expense management and bank size are ignored. The inclusion of these would not only provide more robust results, but also provide more accurate and concrete policies; (2) although the existing modelling framework addressed the issue of endogeneity to a larger or smaller extent, this issue has not been addressed thoroughly, which will influence the robustness of the results.

The current study significantly contributes to the empirical literature in efficiency analysis as well as the inter-relationship between efficiency and other variables concerned in the following two ways: (1) we propose a new non-radial DEA model to estimate efficiency using a sample of Chinese banks as the sample; our newly proposed innovative DEA model will provide the most accurate results related to the efficiency level in the Chinese banking industry; (2) unlike the existing studies, in a second-stage analysis, we propose a Robust Endogenous Neural Network Analysis to investigate the inter-relationships between a comprehensive set of bank-specific variables, including bank efficiency, bank size, bank profitability, expense management, traditional banking business and non-traditional bank business. The proposed new modelling framework will not address the endogeneity issue in a much better way than the existing literature, but the inclusion of comprehensive bank-specific variables in the modelling framework will facilitate a more accurate policy making process.

Our results show that the Chinese banking industry experiences a consistent increase in the level of efficiency up to 2015, after which a slight decline in the efficiency level is observed, with slight volatility over the last three years. The Chinese banking efficiency ends up with an efficiency score of 0.746 by the end of 2018. Also, we find that state-owned commercial banks have the highest level of efficiency, while rural commercial banks are the least efficient banking group. We notice that foreign banks experience the highest volatility in the efficiency level over the examined period. The second-stage analysis shows that bank size exerts a positive influence on the development of non-traditional banking business and proactive expense management. Bank size and non-traditional businesses have a positive impact on efficiency levels, while bank profitability, traditional businesses and expenses management have a negative influence on bank efficiency.

The current paper is structured as below: Sect. 2 reviews the empirical studies of DEA applications in various sectors of the economy as well as the DEA applications in the banking sector in general and Chinese banking sector specifically; Section 3 presents the non-radial DEA model as well as the Robust Endogenous Neural Network Analysis (RENNA). Section 4 provides the results and discussion, while the conclusion is given in Sect. 5.

2 Literature review

The evaluation of performance in various economic sectors is very important for policymaking purposes and, consequently, it attracts attention from various parties, including the government officials and regulatory authorities as well as academic researchers. The performance assessment in the empirical literature can be generally divided into two groups, with the first one focusing on the use of accounting ratios and the second stream concentrating on the use of parametric or non-parametric operational research methods. In particular, the use of non-parametric data envelopment analysis has been gaining popularity in estimating efficiency in various economic sectors, including the high technology industry (An et al., 2020); the hotel sector (Assaf et al., 2010; Yin et al., 2020; Yu & Chen, 2020); insurance industry (Barros et al., 2010; Cummins et al., 2010; Eling & Jia, 2019; Eling & Luhnen, 2010): telecommunications industry (Bayraktar et al., 2012; Sahoo & Sahoo, 2020); transportation industry (Chang et al., 2013; Merkert & Hensher, 2011; Stefaniec et al., 2020; Wu & Goh, 2010; Yu, 2010); iron and steel industry (He et al., 2013; Lee et al., 2019); the tourism industry (Pestana et al., 2011; Song & Li, 2019); the agricultural sector (Picazo-Tadeo et al., 2011; Wang & Feng, 2020); the hospital sector (Rouyendegh et al., 2019); petroleum industry (Sueyoshi & Goto, 2012; Wang et al., 2019); the education sector (Aparicio et al., 2019; Thanassoulis et al., 2011); and the energy industry (Wang et al., 2013; Wanke et al., 2020; Zhou et al., 2012, 2013). Also, the DEA models have been applied to analyse the efficiency level in various business-related operational issues, including supply chain management (Azadi et al., 2015; Malesios et al., 2020; Mirhedayatian et al., 2014); sustainability and environmental performance (Allevi et al., 2019; Chen & Delmas, 2011; Lee & Saen, 2012); innovation system (Guan & Chen, 2012; Kalapouti et al., 2020; Zhong et al., 2011); and the manufacturing industry (Margaritis & Psillaki, 2010; Walheer & He, 2020). In particular, the study of Psillaki et al. (2010) evaluates the efficiency level of different industries in France, including textile and wood, as well as research and development.

The evaluation of efficiency in the banking sector has been evaluated by empirical research through the traditional DEA model and various developments. The traditional DEA model has been applied by Chortareas et al. (2012) to evaluate the efficiency in the European banking sector over the period 2000–2008. The study also engages in a second stage regression analysis to further investigate the impact of bank supervision and regulation on bank efficiency under the truncated regressions, generalized linear models as well as a fractional logic estimator. Slightly different from Chortareas et al. (2012), the efficiency level of the European banking industry is also assessed by Chortareas et al. (2013) under the traditional DEA model, while they further examine the impact of financial freedom on bank efficiency under the Simar and Wilson (2007) bootstrapped truncated regression analysis. Studies also attempt to evaluate bank efficiency in emerging markets, not solely in the European banking industry. Konara et al. (2019) using a sample of banks from eight emerging market economies to investigate the relationship between FDI and bank efficiency over the period 2009–2013. A traditional DEA model is used to derive various efficiency estimates, including technical, pure technical, scale, cost and revenue efficiencies, while the relationship test is facilitated by the three-stage least square estimator. Traditional DEA models are also used by Staub et al. (2010) to evaluate the efficiency level (i.e. cost, technical and allocative efficiencies) in the Brazilian banking industry. In the second-stage analysis, three different panel data specifications are used to analyze the determinants of bank efficiency, including a dynamic model, a Tobit model and an autoregressive model. The empirical studies have made efforts to build on the traditional DEA models and relevant advancements in the modelling framework have been proposed, such as the network DEA model (Kumar & Gulati, 2010; Liu et al., 2018, 2020; Paradi et al., 2011).

There are some empirical research studies analyzing the efficiency level in the Chinese banking industry specifically. Using a sample of Chinese commercial banks over the period 2003–2009, Tan and Floros (2013) evaluate the efficiency level under the traditional DEA models. Various performance indicators, including technical, pure technical and scale efficiencies as well as Malmquist productivity index, are examined and the authors further test the inter-relationship between risk, efficiency and capitalization using a three-stage least square estimator.

Taking a sample of Chinese commercial banks as the sample over the period 2002–2009, Chang et al. (2012) proposed an input slack-based productivity index, which possesses the advantages of being able to disaggregate the total productivity index into the productivity change of each input. The findings show that the productivity growth of Chinese banks is dominated by capital productivity, which is mainly derived from the technical progress in capital use.

Nine super-efficiency DEA models have been proposed and used by Avkiran (2011) to explore the association between bank efficiency and performance ratios. The results show that there is a weak relationship between the two. However, profit margin and return on assets are found to have a significant relationship with the efficiency estimates, with the former having an even stronger effect.

A new three-stage network DEA model is proposed by Fukuyama and Tan (2020) to evaluate three different types of efficiencies, including input efficiency, stability efficiency and output efficiency. The study considers the role of market power in loans and deposits in the production process and treats loan loss provisions as the good intermediate product in a network model. Besides, Tobit regression is used in the second stage to examine the impact of non-performing loans from three different geographical areas (Eastern, Central and Western) on different types of efficiencies, and the results are also cross-checked by fractional logit regression and Bootstrapped truncated regression. The findings show that a higher level of financial risk from the Central area has a significant and positive impact on input inefficiency, while the Western area has an opposite effect.

A dynamic two-stage DEA model considering the carryover characteristics of the non-performing loans is proposed by Zha et al. (2016) to analyze the efficiency level for a sample of Chinese banks. The proposed method benefits itself from the ability to identify the source of inefficiencies more effectively. The findings show that both the productivity stage and profitability stage need to improve the allocation of resources to further improve the overall level of efficiency in the Chinese banking industry. City commercial banks are found to have higher levels of pure technical efficiency than joint-stock commercial banks. Although the state-owned commercial banks have the highest level of pure technical efficiency, their overall efficiency level is lower than that of city commercial banks.

Instead of considering the carryover characteristics of the non-performing loans, Zhou et al. (2019) treat the non-performing loans as the undesirable output to develop a multi-period, multi-stage DEA model in which not only is a different variable considered as carryover product, but the share inputs are also introduced in the model. Using a sample of Chinese banks over the period 2014–2016 as the model application object, the findings show that there is a level of disparity in the level of bank efficiency and all the banks in the sample are found to be inefficient. It is interesting to note that for different types of banks, the source of inefficiency is derived from different stages.

The network DEA models and the traditional DEA models mainly use the secondary databases or the annual financial statement of the banks to collect data of the variables in the production processes. Matthews (2013) made the first attempt to use qualitative data in the evaluation of performance in a network DEA model in the Chinese bank efficiency evaluation. More specifically, questionnaires are given to the Chinese bank managers to collect information about risk management and risk practice, through which two metrics are generated. The metric related to risk organization is used as internal intermediate input and the one related to risk practice is treated as an external input. The findings suggest that there is no significant relationship between the risk metrics and the bank performance measure; however, including these measures in the model would allow bank performance to be explained in a better way than would excluding them.

One important and interesting area in the efficiency analysis using the DEA model is to choose the appropriate input and output variables in the production process. This issue has been addressed by Luo et al. (2012) by proposing a method based on the concept of cash value-added. It is defined by the method that if a variable is tested to be positively related to the bank’s cash flow, it will be treated as output, otherwise, it will be regarded as input. This method has been applied to a sample of Chinese banks.

3 Methodology and data

3.1 The multi-period aggregative efficiency method

Most of the DEA studies have dealt primarily with cross-sectional data and measured relative efficiencies in a single period, usually 1 year. Exceptions are window analysis, dynamic models, and Malmquist-type indices of productivity. Looking beyond the difference between their model details, we recognize that their common goal is to account for the changing patterns of efficiency performances over several periods. However, these approaches, while vital and practically useful, do not consider an aggregated measure of efficiency for multiple period production systems. Consider a simple example where three DMUs produce different amounts of two outputs and consume the same unit amounts of single input in two periods, \(t = 1, 2\). Using an ordinary DEA, we can obtain the efficiency ratings of each DMU for individual periods. Suppose that the first DMU is efficient for both periods, the second DMU is efficient for the period \(t = 1\) but inefficient for the period \(t = 2\), and the last DMU is inefficient for both periods. These efficiency ratings obtained for individual periods are needed for efficiency valuations but reflect partial performances of multi-period production units.

This brings into play an important question as to how we can achieve a multi-period aggregative efficiency, shortly referred to as MAE. To carry out a panel data analysis, we assume underlying production technologies in which all of a period's input is expected to go into producing the output for the same period. We do not consider a special production system where current input amounts might be used to produce future period outputs. Specifically, the MAE is measured in such a manner that a DMU's performance in a particular period is compared with the performance of all DMUs in the same period. The same way is taken in DEA to obtain the efficiency ratings in the above example, but each of these ratings signifies a single period efficiency.

3.2 The proposed DEA model

Suppose we have \(n\) DMUs (banks) and there are \(T\) periods, \(t = 1, 2, \ldots , T,\) and in each period,\(DMU_{j} , j = 1,2, \ldots , n,\) consume \( m \) inputs, \(x_{ij}^{t} , i = 1,2, \ldots ,m,\) to produce \(s\) outputs, \(y_{rj}^{t}\), \(r = 1, 2, \ldots , s.\) In order to compute the multi-period aggregative efficiency, shortly referred to as MAE in the context of time serial data, we propose the following DEA model, related to \(DMU_{p}\), \(p = 1, 2, \ldots , n\):

$$ \begin{aligned} & E_{p} = {\text{max }}\frac{{T - \frac{1}{mT}\mathop \sum \nolimits_{t = 1}^{T} \mathop \sum \nolimits_{i = 1}^{m} \frac{{s_{i}^{t - } }}{{x_{ip}^{t} }}}}{{T + \frac{1}{sT}\mathop \sum \nolimits_{t = 1}^{T} \mathop \sum \nolimits_{r = 1}^{s} \frac{{s_{r}^{t + } }}{{y_{rp}^{t} }}}} \\ & s.t.\quad \mathop \sum \limits_{j = 1}^{n} \lambda_{j}^{t} x_{ij}^{t} + s_{i}^{t - } = x_{ip}^{t} , \forall i, \forall t \\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j}^{t} y_{rj}^{t} - s_{r}^{t + } = y_{rp}^{t} ,\quad \forall r, \forall t \\ & \lambda_{j}^{t} \ge 0,\quad \forall j, \forall t \\ & s_{i}^{t - } \ge 0, \quad \forall i, \forall t \\ & s_{r}^{t + } \ge 0, \quad \forall r, \forall t \\ \end{aligned} $$
(1)

Definition 1

Let \({\text{E}}_{{\text{P}}}^{*}\) be the optimal value of objective function of the above model. Then, \({\text{DMU}}_{{\text{p}}}\) is called efficient if \({\text{E}}_{{\text{P}}}^{*} = 1.\)

This condition is equivalent to all slack variables \(s_{i}^{t - }\) and \(s_{r}^{t + }\) equal to zero for all periods, and it means no input excesses and no output shortfalls in any optimal solution (Esmaeilzadeh & Hadi-Vencheh, 2013). The proposed model is a fractional programming model and can be transformed into a linear programming (LP) model by Charnes and Cooper (1962) transformation. By this change, we have the following LP model:

3.3 Linear model

$$ \begin{aligned} & E_{p} = {\text{max }}MT - \frac{1}{mT}\mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{i = 1}^{m} \frac{{s_{i}^{{\prime}t - } }}{{x_{ip}^{t} }} \\ & s.t.\quad \mathop \sum \limits_{j = 1}^{n} \lambda_{j}^{{{\prime }t}} x_{ij}^{t} + s_{i}^{{\prime}t - } = Mx_{ip}^{t} , \quad \forall i, \forall t \\ & \mathop \sum \limits_{j = 1}^{n} \lambda_{j}^{{\prime} t} y_{rj}^{t} - s_{r}^{{\prime} t + } = My_{rp}^{t} , \quad \forall r, \forall t \\ & MT + \frac{1}{sT}\mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{r = 1}^{s} \frac{{s_{r}^{{{\prime }t + }} }}{{y_{rp}^{t} }} = 1 \\ & \lambda_{j}^{{\prime}t} \ge 0,\quad \forall j, \forall t \\ & s_{i}^{{\prime} - t} \ge 0,\quad \forall i, \forall t \\ & s_{r}^{{\prime}t + } \ge 0,\quad \forall r, \forall t \\ \end{aligned} $$
(2)

where \(M = \frac{1}{{T + \frac{1}{sT}\mathop \sum \nolimits_{t = 1}^{T} \mathop \sum \nolimits_{r = 1}^{s} \frac{{s_{r}^{t + } }}{{y_{rp}^{t} }}}}, \lambda_{j}^{{{\prime }t}} = M\lambda_{j}^{t} , s_{i}^{{\prime} - t} = Ms_{i}^{ - t} and s_{r}^{{\prime}t + } = Ms_{r}^{t + } \). In this case the MAE of \(DMU_{p}\) is \(E_{p}^{*} = MT - \frac{1}{mT}\mathop \sum \limits_{t = 1}^{T} \mathop \sum \limits_{i = 1}^{m} \frac{{s_{i}^{{\prime}t - *} }}{{x_{ip}^{t} }}\) and the efficiency of \(DMU_{p}\) at time period \(t, t = 1, 2, \ldots , T,\) equals to \( E_{p}^{t*} = \frac{{M - \frac{1}{m}\mathop \sum \nolimits_{i = 1}^{m} \frac{{s_{i}^{{\prime}t - *} }}{{x_{ip}^{t} }}}}{{M + \frac{1}{s}\mathop \sum \nolimits_{r = 1}^{s} \frac{{s_{r}^{{\prime}t + *} }}{{y_{rp}^{t} }}}} .\)

3.4 Robust endogenous neural network analysis (RENNA)

Artificial Neural Networks (ANNs) are computational algorithms that are based on the human thinking paradigm. ANNs are formed of processing units (neurons) that are weight connected. These connections motivate the estimation of non-linear models by using a training data set. Athanassopoulos and Curram (1996) give the first literature insight on combining ANNs and DEA for predicting efficiency levels. Other ANN applications in DEA can be found in existing literature (Wu et al., 2006). Examples can be found in Santin and Delgado (2004), Emrouznejad and Shale (2009), Misiunas et al. (2016), Shokrollahpour et al. (2016), Olanrewaju et al. (2012), Bashiri et al. (2013) and Modhej et al. (2017).

In this research, a specific focus is placed on the MLP network that has been extensively studied in forecasting applications (Mubiru & Banda, 2008). Within the ambit of an MLP, neurons are pooled in layers and only forward connections are allowed. These features provide a robust architecture capable of learning upon any kind of continuous nonlinear mapping. A typical MLP is represented in Fig. 1.

Fig. 1
figure 1

MLP framework

MLP constituents encompass neurons, weights, and transfer functions. An input \(x_{j}\) is transmitted via connections that multiply their respective strength by \(w_{ij}\) weights, yielding the product \(x_{j} w_{ij}\) used in the transfer function f to compute a specific output \(y_{i}\) given as \(y_{i = } f\left( {\mathop \sum \nolimits_{j = 1}^{n} x_{j} w_{ij} } \right)\). i is the neuron index in the hidden layer and j is the input index to the MPL. The modification of the weights of each connection observing some orderly fashion is known as training. During the training, an input is assigned to the network along with the desired output and the weights are adjusted so that the MLP catches-up with the desired output value.

As regards RENNA, the robustness analysis is structured in two steps. In the first step, MLP is employed on unveiling endogeneity among banking efficiency, size, profitability, expense management, and business type (traditional vs. non-traditional). Hence, this paper departs from previous research in the banking sector by using the MLP network structure to explore endogeneity between these variables in terms of the following linear models:

  • Model 1: Efficiency ~ f(Size, Profitability, Expense Management, Traditional Business, Non-traditional Business).

  • Model 2: Size ~ f(Efficiency, Profitability, Expense Management, Traditional Business, Non-traditional Business).

  • Model 3: Profitability ~ f(Size, Efficiency, Expense Management, Traditional Business, Non-traditional Business).

  • Model 4: Expense Management ~ f(Size, Profitability, Efficiency, Traditional Business, Non-traditional Business).

  • Model 5: Traditional Business ~ f(Size, Profitability, Expense Management, Efficiency, Non-traditional Business).

  • Model 6: Non-traditional Business ~ f(Size, Profitability, Expense Management, Traditional Business, Efficiency).

In fact, neural networks were employed here to the detriment of other approaches, such as linear regressions, due to its ability to unveil non-linear relationships between dependent variables and their predictors. This feature is particularly useful as long as the ordinary covariance matrix embeds in itself a partial measurement of all possible associative functions among variables, solely relying on linear correlation. As long as non-linear associative measures between variables are not captured by ordinary covariance matrices, neural networks were chosen as the cornerstone for analyzing the distributional profile of the residuals of each one of these six regressions, in the search for evidence that could support endogeneity claims based on the hidden non-linear association between efficiency levels and their predictors related to the Chinese banking industry.

Hence, the underlying idea of running these six models is to explore their respective residuals, obtained by considering each variable as the dependent one. In the first step of the analysis, once the joint variances of residuals are minimized by finding optimal weights for each model, it would be possible to affirm that the resulting weighted combination of predictive models yielded minimal feedback effects. Hence, on the second step of the analysis, it becomes possible to address issues of cause-and-effect relationships among the dependent variables of each model, by identifying the respective conditional distributions of residuals with higher weights in a second optimization round.

The results for the optimal MLP architecture are depicted in Fig. 2, where an exhaustive combination of neurons and layers were tested for the MSE error combination of models (1)–(6). Two layers and 12 neurons in each yielded a minimal combined MSE of 0.054. In other words, a layer can be regarded as a collection of neurons that, while performing the same computational steps, assures non-linear relationships among inputs and outputs to be properly captured. The fundamental computational steps performed by neurons are to (i) calculate the weighted sum of inputs and weights, (ii) map the forecasting bias and (iii) execute an activation function among inputs and outputs.

Fig. 2
figure 2

MLP optimal architecture

The relative importance of Models 1–6 in explaining the feedback process between contextual variables and efficiency levels in the Chinese banking industry, together with the endogenous nature of these variables, were explored, respectively, by the variances of each model and the covariances between models. Variances and covariances of the residuals (\(R_{i}\)) of these eight models are simultaneously minimized by a non-linear stochastic optimization problem, as presented in Model (3), where \(w_{i}\) stands for the weights – which range from 0 to 1—assigned, respectively, to the residual vectors of each one of the six models previously described. The values of w are optimized so that the variance (Var) and covariance (Covar) of the pooled residuals are minimal. Model (3) was solved through differential evolution (DE). DE is a research stream of genetic algorithms, also emulating natural selection and evolution. Readers should refer to Ardia et al (2011) and Mullen et al. (2011) for further details. Results are discussed in the next section.

$$ \begin{aligned} & \min \left[ {Var\left( {\mathop \sum \limits_{i = 1}^{6} w_{i} * R_{i} } \right) + \left( {2*\mathop \sum \limits_{i,j = 1}^{6} Covar (w_{i} *w_{j} * R_{i} *R_{j} ), i \ne j, j < i} \right)} \right] \\ & s.t. \quad \mathop \sum \limits_{i = 1}^{6} w_{i} = 1 \\ & 0 \le w_{i} \le 1 \forall i \\ \end{aligned} $$
(3)

Residuals of the MLP models were bootstrapped 100 times allowing the collection of a distributional profile of w for the most accurate prediction of efficiency scores and contextual variables.

Subsequently, in the second step, a full combinatorial set of conditional distributions of residuals (\(CR_{k}\)) was computed. The previous 100 bootstrapped replications for the individual residuals (\(R_{i}\)) of each model served as cornerstones for this computation, where C \(R_{k} \sim f(R_{i} /R_{j}\)) for all i and j, \(i \ne j\), and \(K = i*j - i = 6*6 - 6 = 30\). Similarly, DE was employed for a minimal Var-Covar model to diagnose whether conditional distributions of each residual pairs presented significant differences in terms of directions (in this case, weights assigned to \(f(R_{i} /R_{j}\)) variance would differ from those assigned to \(f(R_{j} /R_{i}\)) variance). Besides, higher-order joint variations of conditional residuals were explored by means of joint covariance weights so that additional hidden effects could be explored, as depicted in the model (4).

$$ \begin{aligned} & \min \left[ {Var\left( {\mathop \sum \limits_{k = 1}^{30} w_{k} * CR_{k} } \right) + \left( {2*\mathop \sum \limits_{k,l = 1}^{30} Covar (w_{k} *w_{l} * CR_{k} *CR_{l} ), k \ne l, k < l} \right)} \right] \\ & s.t. \quad \mathop \sum \limits_{k = 1}^{30} w_{k} = 1 \\ & 0 \le w_{k} \le 1 \forall k \\ \end{aligned} $$
(4)

In terms of the data we use in the current paper, we collect a sample of 39 Chinese commercial banks over the period 2010–2018. The data is collected from the Fitchconnect database. The sample of Chinese banks is further divided into five groups according to the ownership types. They are 15 city commercial banks, 2 rural commercial banks, 14 foreign banks, 4 state-owned commercial banks and 4 joint-stock commercial banks. Regarding the use of inputs and outputs in our first-stage efficiency analysis, three inputs are considered, including fixed assets, total deposits, and personnel expenses, while total securities and gross loans are treated as outputs. In the second-stage Robust Endogenous Neural Network Analysis, we include a series of bank-specific variables, including bank size (natural logarithm of total assets), bank profitability (return on assets), expense management (the sum of total interest expenses and non-interest expenses over total assets); traditional bank business (the ratio of interest income to total assets); and non-traditional bank business (the ratio of non-interest operating income to total assets). Table 1 shows the descriptive statistics of the variables in the current study.

Table 1 Descriptive statistics of the variables used in the current study

4 Analysis and discussion of results

Efficiency score results are depicted in Fig. 3, altogether with their joint-variation profile with contextual variables related to size, profitability, expense management, traditional business, and non-traditional business. The choice of our contextual variables is based on the following considerations. Bank size is supposed to influence the efficiency level because large banks benefit from the cost advantage gained from economies of scale and economies of scope (Fu and Sio, 2011). Girardone et al. (2004) find that profitable banks have a higher ability to control all aspects of bank costs, therefore we include profitability as one of the contextual variables. Expense management is mainly related to the management of interest expenses and non-interest expenses in the banking operation. Good control of these costs is supposed to have a positive influence on the efficiency level. It seems that the empirical literature has not focused on the examination regarding the impact of traditional business on bank efficiency, but various efforts have been made to evaluate the impact of income diversification on bank efficiency (Alhassan, 2015; Nguyen, 2018). We argue that higher proportions of traditional business indicate a lower degree of diversification and further influence the efficiency level. The empirical literature has attempted to investigate the influence of non-traditional activity on bank efficiency by incorporating this item in the first stage of efficiency analysis (Gulati & Kumar, 2011; Lozano-Vivas and Pasiouras, 2010). We include this variable as one of the contextual variables to see whether we will get different results by incorporating it in a second-stage analysis. One can observe in upper diagonal significant correlation relationships, marked with “*”, for most cases, which may possibly be an indicator of feedback processes among efficiency and different contextual variables, thus denoting possible endogeneity. The main diagonal shows the empirical distribution profile for Efficiency and Contextual Variables individually as well as the lower diagonal shows scatterplots relationship between variables in the given column and row.

Fig. 3
figure 3

Correlogram and joint variation profile between efficiency scores and contextual variables in Chinese banks

In order to further look at the efficiency level of the banks in the sample in a clearer manner, we generate the figure below (Fig. 4), which shows the efficiency level in the Chinese banking industry over the period 2010 to 2018. The findings show that the Chinese banking industry experiences stronger volatility in the efficiency scores over the period, while in general there is a consistently increasing trend for the efficiency level up to 2015, after which we can see that the efficiency level declines in 2016, and although a slight increase in the efficiency level in 2017 is observed compared to 2016, the efficiency score experiences another drop in 2018, compared to the previous year, reaching 0.746. Our results are in contrast with results from Fukuyama and Tan (2020). The difference is mainly attributed to the fact that not only do we use a different method in the efficiency analysis, but also the inputs and outputs used in the method are different.

Fig. 4
figure 4

Efficiency level in the Chinese banking industry

Not only do we focus on the general efficiency level of the Chinese banking industry as a whole, but we also look at the efficiency level of different ownership types of Chinese banks. The figure below (Fig. 5) shows the efficiency scores of different ownership types of Chinese banks over the examined period. We can see that the state-owed commercial banks have the highest level of efficiency, and in general, the rural commercial banks are the least efficient banking group. There is no clear comparison about the efficiency level among joint-stock commercial banks, city commercial banks and foreign banks, while it is observed that the foreign banks experience the strongest volatility in the efficiency level compared to all the other banking groups in the sample. Zhou et al. (2019), over the same period (2014–2016), report that city commercial banks have a higher efficiency than state-owned banks, while joint-stock banks have the lowest efficiency, the difference in our results can be mainly explained by the number of banks included in the sample, the method adopted as well as the period of investigation.

Fig. 5
figure 5

Efficiency level in the Chinese banking industry with different ownership types

It is even more interesting to look at the efficiency level of individual banks in the sample. The figure below (Fig. 6) shows the efficiency level of all the 39 banks in the sample over the examined period. Combining the number of the bank and the specific bank name in the “Appendix”, we can see that the top four best performers in the sample are Bank of Beijing, Bank of Communication, Bank of Guangzhou and HSBC (China), while the worst four bank performers include Metropolitan Bank (China), KEB Hana Bank (China), Bank of Yingkou and Sumitomo Mitsui Banking Corporation (China). We can see that both of these two extreme groups of banks include two ownership types (city commercial banks and foreign banks), Bank of communication is a state-owned commercial bank. This finding generates feasible policies for the banks, especially the ones with very low levels of efficiency to improve their performance by communicating and learning from the ones with the very best performance within the same ownership type. For other state-owned commercial banks, the bank of communication can be a good model to learn in terms of its resource allocation.

Fig. 6
figure 6

Efficiency level of individual banks in the sample

4.1 Marginal impact of efficiency scores

Notwithstanding, the relative importance of models (1)–(6) for achieving minimal residual variance is quite heterogeneous (cf. Fig. 7). In fact, business type and bank profitability account for more than 50% of total weight, while efficiency and bank size account for less than 20%. These results suggest that the eventual feedback process (endogeneity) among banking efficiency, size, and expense management is obfuscated by more profitable business types. Results for the relative importance of the interaction pairs in explaining overall residual variance are presented in Fig. 4 and also confirm this issue. Putting into perspective the results obtained for 100 bootstrap replications, while the efficiency scores present marginal relevance in comparison to other variables related to business models and banking profitability, the feedback between size, profitability, and efficiency is also negligible, which should denote market regulation. The economies of scale and scope achieved by large banks and the resulting improvement in the level of bank profitability are offset by the level of intense competition in the banking industry, therefore their impact on bank efficiency is not obvious.

Fig. 7
figure 7

Relative importance of models (1)–(6)

4.2 Low endogeneity levels

The joint feedback effect of all models pairs on overall residual variance is lower than15% (cf. Fig. 8), pretty low when compared to the maximal possible endogeneity effect, achievable when all six models would account individually for 16.67% of the total residual variation. Under the case of this equal weight, the maximal joint effect of 83.36% = 2*41.68% = 2*16.67%*16.67%*15, where 15 is the number of combinations taken two by two obtained from models (1)–(6).

Fig. 8
figure 8

Endogeneity weights for pairs of models (1)–(6)

4.3 Cause–effect relationships

In summary, efficiency in Chinese banks appears to be weakly endogenous with size and profitability due to imperfect, oligopolistic, and strongly regulated industry. Readers should observe that the median expected weight for each conditional distribution would be 0.033 (1/30), considering a balanced bi-directional relationship among variables. According to Table 2, efficiency is a consequence of bank size, bank profitability, and its business model; rather than acting as causation on such characteristics, as long as conditional residual distribution weights are significantly higher than 0.033 at p < 0.05. Bank size, on the other hand, indirectly impacts efficiency levels, by favoring expense management and non-traditional business models, thus signifying a scale barrier. The final cause-effect framework of Chinese banks is given in Fig. 9. Our results related to the impact of bank size on the efficiency level is in line with Tan and Floros (2013).

Table 2 Median weights for conditional residual distribution of [ROW] given that [COLUMN]
Fig. 9
figure 9

Cause-effect and endogeneity framework

4.4 Endogenous relationships

RENNA, however, allowed the confirmation of endogenous processes or feedbacks among business type, expense management, and bank profitability, thus suggesting that market segmentation and product differentiation drivers are mutually reinforced over time, once Chinese banks are big enough to start alternative banking venues. It is important to note that, while non-traditional business appears to present negative feedback with banking profitability at first sight, on the other hand, the non-traditional business can be regarded as a positive efficiency driver as long as it gears a more than proportional increase of outputs relative to inputs in Chinese banking operations. This positive impact is in contrast with the findings of Lozano-Vivas and Pasiouras (2010). We explain the difference by the fact that we focused on the Chinese banking industry and that we use a second-stage analysis for the investigation, while Lozano-Vivas and Pasiouras (2010) incorporate the non-traditional activities in the first-stage efficiency analysis. These new business models, however, are still less profitable than traditional ones because, compared to the traditional interest generating activities, the fee-based activities are more likely to generate revenues with higher levels of volatilities due to the fact that there is a weaker bank-customer relationship in the non-traditional banking businesses compared to the traditional ones, and customers are easier to switch to another financial provider. Besides, unlike the traditional loan businesses, the increase in the cost of additional loans will be only the interest expenses. In comparison, fee-based businesses will significantly increase the banks’ fixed cost (Li & Zhang, 2013). Due to the risk and cost considerations, the banks would be more conservative in engaging in this business category and that explains the low profitability from the non-traditional businesses.**

4.5 Relationship signs

Olden’s sensitivity analysis results are presented in Fig. 10 and reveal the signs of the relationships between variables, and their relative importance in terms of joint variation. Bank size exerts a positive impact on the development of non-traditional business and more proactive expense management. While the positive impacts of bank size and non-traditional business are transferred to efficiency levels, it is worth noting the countervailing negative forces played by expense management, bank profitability, and traditional business on efficiency levels. This implies that physical and monetary resources in Chinese banks have been parsimoniously used in generating more loans. The positive sign of trend square term suggests that this behaviour is increasing over time, possibly due to higher regulation of Chinese banks after the global financial crisis of 2007/2008. The negative influence of bank profitability on bank efficiency is in contrast with the findings of Tan and Floros (2018) in the Chinese banking industry. This difference is mainly attributed to the fact that we not only use a different dataset, but also the methods in the efficiency analysis as well as the second-stage analysis are different. The empirical literature does not focus on the investigation related to the impact of traditional banking business on bank efficiency; instead, few attempts have been made to examine the influence of income diversification on bank efficiency. Our findings from this perspective show that income diversification has a positive impact on bank efficiency, which is different from the findings of Alhassan (2015) in terms of the Ghanaian banking industry. Finally, the negative influence of expense management on the efficiency level is in line with the consideration that interest expenses and non-interest expenses are the main inputs in banking operation and large amounts of expenses (poor expense management) will reduce the efficiency level (Khankhoje & Sathye, 2008).

Fig. 10
figure 10

Results for Olden’s sensitivity analysis in terms of signs and relative importance for models (1)–(6)

5 Conclusion

Using a sample of 39 Chinese commercial banks over the period 2010–2018 with different ownership types, including state-owned commercial banks, joint-stock commercial banks, city commercial banks, rural commercial banks, and foreign banks, we analyze the efficiency level using a proposed non-radial DEA model. In the second stage, a Robust Endogenous Neural Network Analysis is proposed and used for the first time to examine the inter-relationships between efficiency, bank size, bank profitability, expense management, traditional banking business and non-traditional banking business.

We fill in a gap in the empirical literature by not only proposing an innovative non-radial DEA model, but also using a Robust Endogenous Neural Network Analysis with a comprehensive set of bank-specific variables which can thoroughly address the endogeneity issue and provide more accurate and concrete policy implications for the banking regulatory authority.

Our findings show that, although in general, the Chinese banking industry experiences strong volatility in the level of efficiency over the examined period, consistent growth in the efficiency level has been observed up to 2015, after which a slight decline and volatility has been noticed and the Chinese banking industry ends up with an efficiency score of over 0.7 by the end of 2018. Comparing different ownership types, the results suggest that state-owned commercial banks are the most efficient banking group, while the rural commercial banks have the lowest level of efficiency. It is further found that foreign banks experience the strongest volatility in the efficiency level. The second-stage analysis shows that bank size exerts a positive influence on the development of non-traditional banking business and proactive expense management, bank size and non-traditional businesses have a positive impact on efficiency levels, while bank profitability, traditional businesses and expenses management have a negative influence on bank efficiency.

The current study generates important policy implications to the Chinese banking regulatory authority: (1) Chinese banks should reduce the volume of deposit-loan business; (2) the Chinese banks are encouraged to expand non-traditional banking businesses, which would be facilitated by financial innovation; (3) Chinese banks should engage in an improved practice of expense management.

In terms of future research, our newly proposed innovative multi-period aggregative efficiency method can be applied to other banking sectors or non-banking sectors and, in particular, our method can be used for comparison with other alternative efficiency evaluation methods. In terms of our second-stage Robust Endogenous Neural Network Analysis, this advanced method can be applied to test the inter-relationships between efficiency, competition and stability in the banking sector and the results can be compared with other empirical banking studies addressing the same issue but under other econometric methods.