1 Introduction

The financial system in China is supported by four pillars including the banking industry, the insurance industry, the trust industry, and the securities industry. Among these, the banking industry has specific advantages over the other three because its operations involve both enterprises and government. More specifically, the banks are supported and protected by the government and they mainly engage in providing services to different types and sizes of enterprises, while some competitive Chinese banks have engaged in insurance investment and the finance department of these banks has gradually provided trust-related money management services to customers. We can forecast that in the near future, big Chinese commercial banks will become “all-round” financial institutions that provide a variety of different businesses covering the functions of trust companies, insurance companies, and securities companies. Previous researches on the Chinse banking industry, which is the focus of this study, are rather scarce and limited to discussing rankings under the traditional “black-box” productive approach (Asmild & Matthews, 2012; Avkiran, 2011; Avkiran & Morita, 2010; Gattoufi et al., 2014; Luo et al., 2012; Tan & Anchor, 2017; Tan & Floros, 2013 among others).

Therefore, this research significantly contributes to the empirical literature in banking by focusing on the banking industry of China in light its productive and competition structures. Precisely, a GMSS-DEA (General Multi-Stage Structure-Data Envelopment Analysis) model is proposed here to capture the productive structure in the Chinese banking industry in terms of the well-known production and intermediation approaches in banking. Our method is completely different from other efficiency studies and compared to the empirical Chinese efficiency studies (Dong et al., 2016; Du et al., 2018; among others). The GMSS-DEA model is capable of simultaneously handling income statement and balance sheet related variables in different production stages so that we can produce more accurate results for the efficiency levels. GMSS has the advantage of computing in a simultaneous way the efficiency level of the overall production stage and the internal stages. Furthermore, the assumption that exogenous inputs are not consumed, and exogenous outputs are not produced during the internal processes (Kao, 2014) have been relaxed in GMSS. Particularly in this research, different assets are considered as the exogenous outputs in the first stage (production approach in banking), while deposits from the central bank and other institutions are the exogenous inputs in the second stage (intermediation approach in banking). Analogously, equity is regarded as an exogenous input in the production approach, while cash and deposits at central banks and other institutions are treated as exogenous outputs in the intermediation approach, altogether with provisions and fee/interest expenses.

Additionally, an integration of these results with Multi-Layer Perceptron (MLP) models is adopted to unveil the endogeneity between banking efficiency, contextual variables, and major market competition metrics such as the Herfindahl–Hirschman index, the Panzar–Rosse H-statistic, and the Lerner-index. Hidden Markov Models (HMM) are used as a basis for bootstrapping such variables while preserving their endogeneity within the ambit of MLP models. We are also the pioneer study to use this method to address competition and efficiency relationship in the banking literature.

This study has two main innovative motivations. It decomposes the efficiency of Chinese banks with a multi-stage system structure that encompasses the production and intermediation approaches. Meanwhile, it adopts a novel integrated MLP/HMM model to stochastically unveil the feedback processes that may exist among these approaches, macro-economic variables, and competition structures. We have the following findings: (1) productive efficiency appears to be positively impacted by competition and market power; (2) credit risk analysis in older local banks that focus the province level, would possibly be the factor that jeopardizes the productive efficiency levels of the entire banking industry in China; (3) a perfect banking competition structure at the province level and a reduced market power of local banks are drivers of a sound banking system in China; (4) increased banking productivity is a direct consequence of the concentration of credit into fewer banks as a form of maintaining scale and reducing transaction costs.

2 Banking sector overview in China

The main purposes of banking reform in China since the 1970s are to improve profitability, productivity, and efficiency while reducing the level of market power and enhancing the banking stability for the multi-layer bank structure. Non-performing loans is still a historical issue. To solve this problem, a number of different measurements have been taken by the Chinese government to reduce the risk level of Chinese commercial banks, including non-performing loan write-offs (Bonin & Huang, 2001), establishment of the China Banking Regulatory Commission (CBRC) (Liang et al., 2013), and the introduction of strategic foreign investors (Wu et al., 2012).

Lower level competition is the second issue. There are hundreds of banking and non-banking financial institutions in China and CBRC statistics report that among the different ownership types of Chinese banks, state-owned banks still dominate the industry, though the proportion of assets held by this bank type has declined over recent years. Foreign banks were allowed to provide financial services in mainland China with some restriction at the beginning with the restrictions being completely removed by the end of 2006 (Hsiao et al., 2015). The level of competition is supposed to further increase due to the fact that private banks were allowed to operate in China and there are few private banks operating in China since 2015 (Lu, 2016).

The Chinese banking industry has encouraged banks to engage in Initial Public Offering (IPO), which does not only increase the source of funding for their operation, but it also provides more incentive for the Chinese banks to optimize their resource, improve their management, and further improve performance (Okazaki, 2017). There have been two large IPO listings in the Chinese banking industry. One was the Industrial and Commercial Bank of China listed on the Hong Kong and Shanghai stock exchange and the other was the Agricultural Bank of China listed on the Hong Kong and Shanghai stock exchange in 2011.

The Chinese banking industry is still facing some challenges and difficulties. First, credit risk, as reflected by the non-performing loan ratios, is still the issue. Second, the operation and in particular how to keep the customer and sustain a good relationship between bank and customer is another difficulty faced by Chinese commercial banks derived from interest rate liberalization. In addition, Chinese commercial banks also face competition from Internet giants including Alibaba and Tencent, both of which provide financial services to customers. The financial products they offer strongly affect commercial banks and they need to keep innovating in order to have a competitive position.

3 Literature review

Efficiency in the Chinese banking industry is dispersed in terms of scope, but there has been a growing number of research in the last decade revealing a growing importance in the study of this issue, mostly due to the emergence of China as a relevant global player.

Berger et al. (2009) used a stochastic frontier analysis to examine efficiency in Chinese banks during 1994–2013. Their findings suggest that the state-owned commercial banks are least efficient while the foreign banks are most efficient. The results indicate that minority foreign ownership can improve the efficiency of Chinese banks. Fu and Heffernan (2009) extended the work of Berger et al. (2009) by investigating the X-efficiency and scale efficiency during 1985–2002. The results reported that X-efficiency had been declining on average and that most banks operated under the optional scale. The results further reported that the X-efficiency of joint-stock commercial banks were improved by banking reforms with no evidence supporting the quite-life hypothesis that the higher level of market power reduces the efficiency level of Chinese commercial banks. A number of studies have used the stochastic frontier analysis to assess the efficiency level in the Chinese banking industry (Berger et al, 2010; Dong et al., 2016; Jiang et al., 2013; Sun et al., 2013).

Besides using stochastic frontier analysis, the second method is the non-parametric Data Envelopment Analysis. Tan and Floros (2013) used this method to examine efficiency and productivity during 2003–2009 and to further examine their interrelationships with risk and capital. Their findings suggest that risk and efficiency are significantly related, and risk and capital are significantly and negatively related with each other. Wang et el. (2014) extended the work of Tan and Floros (2013) by innovatively dividing the banking production process into two stages, namely a deposit producing stage and a profit earning stage. The results from this two-stage DEA show that the source of inefficiency of Chinese banks is derived from the first stage production process. In addition, the results report that the overall efficiency level has improved and the difference in the efficiency level of state-owned banks and joint-stock banks decreased over the period. Similar research has also been conducted by Matthews, (2013), An et al., (2015), Zha et al., (2016), Zhou et al., (2018), Du et al., (2018), Liu et al. (2018), Liu et al. (2018), Liu et al., (2019); among others.

As far as we could find, there are only two pieces of research to apply the GMSS-DEA to economic sectors for efficiency analysis. One of them assessed the productivity of 17 major Chinese ports over the period 2006–2015 (Wanke et al., 2018a, 2018b). Another piece of research analyzed the efficiency level in the Portuguese banking sector (Alves et al., 2020). Our study significantly extends these two papers, in particular the latter one, by controlling bank-specific risks, capital indicators, as well as ownership variables.

4 Data and methodology

4.1 The data

Analyses were performed based on data obtained from Fitch Connect and annual financial statements from 27 Commercial banks operating in China with data available from 2007 to 2017. Table 1 shows the descriptive variables of the inputs, outputs, and intermediate resources used in the GMSS-DEA. A detailed discussion on the variable type is given in the next two subsequent subsections, where the GMSS-DEA model is presented and the alternative productive approaches in banking are discussed. Nevertheless, as regards variable selection with respect to physical and monetary productive resources, the data availability and inputs, outputs, and intermediate variables found in previous studies (Martinez-Campillo et al., 2020; Wanke Azad et al., 2019; Wanke et al., 2019; Wanke, dos Henrique, et al., 2019) constitute the two major criteria used. Monetary inputs and outputs considered in this paper are given in current dollars adjusted to China´s yearly consumer price index. Besides, descriptive for the major contextual variables considered are also given.

Table 1 Descriptive statistics for the physical and monetary resources used in this research

According to Berger and Humphrey (1997), there are two distinct approaches to input and output selection in the banking sector, which vary mainly in their consideration of deposits as inputs or outputs, i.e. the production approach and the intermediation approach.

Under the production approach, banks are assumed to produce financial services for the customers. Hence the loans, deposits, and services associated with deposits are all considered as outputs of banks. Meanwhile, the capital and work required to carry out the transactions and processes are considered as inputs of banks.

On the contrary, under the intermediation approach, banks are regarded as financial intermediaries between the savings and investments of their customers. Therefore, deposits and the interest costs are classified as inputs. According to Kumar and Gulati (2014), the intermediation approach can be further subdivided into the following approaches: assets, cost, and value added.

The assets approach focuses on the bank’s role of intermediating between depositors and the bank assets. The inputs include deposits, labor, physical capital, and other liabilities while the outputs consist of the income-earning assets such as loans and securities.

Under the cost approach, the input/output classification is based on its contribution to the bank's revenue. The value-added approach differs from the previous ones since it considers inputs and outputs in terms of their contribution to the bank's added value, allowing for a non-mutually exclusive choice.

Berger and Humphrey (1997) argue that neither the production approach, nor the intermediation approach is perfect since they do not fully consider the dual role of institutions as transaction and intermediary providers of financial products. However, the authors consider the production approach to be the most adequate for analyzing the efficiency of bank branches, while the intermediation approach is the most adequate for evaluating the efficiency of banking institutions. Indeed, the difference between the two approaches stems from the role that deposits assume in each of them. In the production approach, deposits are considered as outputs, while in the intermediation approach, they are considered as inputs.

In the controversy regarding the role of deposits in the productive process of banks, some authors present alternatives in order to dispense with their use. With Avkiran (2009a, 2009b), for example, interest expenses are incorporated as inputs. Sealey and Lindley (1977) use an asset-oriented model considering only income-earning assets as outputs. Other authors consider the deposits simultaneously as inputs and outputs (Tortosa-Ausina, 2002), but most studies consider deposits as inputs (Fethi & Pasiouras, 2010).

In order to overcome the problem of classifying deposits as inputs or outputs, a new approach has emerged that corresponds with the current orientation towards profitability and focuses mainly on operating results (profit-oriented approach). This approach considers revenues such as interest received and non-financial income as outputs, while cost components such as personnel expenses and interest paid (Drake et al., 2006) are inputs with the aim of minimizing costs and maximizing the bank revenues.

In general, the inputs used are mainly fixed assets and personnel measured in absolute value or money, as seen for example in Isik and Hassan (2002), Maudos and Pastor (2003), Casu and Girardone (2004), Havrylchyk (2006), and Diallo (2018). Several authors also use the number of branches (Chen, 2001) and provisions and equity (Pasiouras, 2008a) as inputs. The most commonly used outputs are loans and income-earning assets such as by Casu and Molyneux (2003), Casu and Girardone (2004), and Tzeremes (2015). Other research includes non-financial income or off-balance sheet resources as outputs such as Isik and Hassan (2002), Sturm and Williams (2004), Havrylchyk (2006), Pasiouras (2008b), and Degl'Innocenti et al. (2017).

In this research, we propose reconciling the production and intermediation approaches within the ambit of the GMSS-DEA model, situating them as two consecutive stages of the productive process of Chinese banks. In stage 1, following the production approach, these banks were regarded as firms producing loans, deposits, and other assets with labor and capital. In contrast, in stage 2, under the intermediation approach, these banks were considered to be financial intermediaries with the role of transforming deposits and purchased funds into loans, income, and revenues. More specifically, deposits are assumed to be an output under the production approach, while they are regarded as an input under the intermediation approach. As a distinctive feature of this study, both approaches are followed in a complimentary fashion in the network productive structure of the Chinese banking sector, as depicted in Fig. 1.

Fig. 1
figure 1

GMSS-DEA model for the Chinese banking industry

Readers should note the role played by the exogenous inputs and outputs in both stages. Although they are not directly involved in the core activities of each stage, they may affect the efficiency levels of each stage as long as these exogenous variables can be considered as by-products in the case of outputs or auxiliary resources in the case of the inputs.

4.2 GMSS-DEA

DEA is a linear programming approach applied to compute the efficiency scores of DMUs or Decision Making Units based on efficiency frontiers that constitute a convex envelope on the data set. Efficiency score is a value that normally ranges between 0 and 1 with 0 corresponding to an inefficient unit and 1 to an efficient unit. An efficiency frontier corresponds to a set of best practices in which actual or virtual combination of DMUs can obtain a greater quantity of outputs when considering fixed inputs (output orientation) or can reduce the quantity of inputs used when considering fixed outputs (input orientation). Thus, an efficiency frontier is formed on the convex set of production possibilities by linearly combining efficient DMUs. Suppose \(s = 1 \ldots S\) productive DMUs consuming inputs \(x_{s}^{T} = (x_{s1} , \ldots ,x_{sm} )\) and generating outputs \(y_{s}^{T} = (y_{s1} , \ldots ,y_{sn} )\). Additionally suppose that \(\lambda^{T} = (\lambda_{1} , \ldots ,\lambda_{s} )\) is non-negative and \(e^{T} = (1, \ldots ,1) \in R^{S}\) is an unitary value vector (Wanke & Barros, 2016). The dual LP are given next. The virtual input and output sums of \(\sum\nolimits_{i = 1}^{m} {v_{i} x_{ij} }\) and \(\sum\nolimits_{r = 1}^{s} {u_{r} y_{rj} }\) consist of weighted resources and products by decision variables endogenously optimized when solving the linear problem. The respective constant-returns to scale known as CCR (Charnes et al., 1978), and variable returns to scale known as BCC (Banker et al., 1984), are given in models (1) to (3):

Frontier type

Input-oriented

Output-oriented

 

\(\begin{gathered} \max \sum\limits_{r = 1}^{s} {u_{r} y_{ro} + u_{o} } \hfill \\ s.t. \hfill \\ \sum\limits_{r = 1}^{s} {u_{r} y_{rj} } - \sum\limits_{i = 1}^{m} {v_{i} x_{ij} + u_{o} \le 0} \hfill \\ \sum\limits_{i = 1}^{m} {v_{i} x_{io} = 1} \hfill \\ u_{r} ,v_{i} \ge 0 \hfill \\ \end{gathered}\)(1)

\(\begin{gathered} \min \sum\limits_{i = 1}^{m} {v_{i} x_{io} + v_{o} } \hfill \\ s.t. \hfill \\ \sum\limits_{i = 1}^{m} {v_{i} x_{ij} } - \sum\limits_{r = 1}^{s} {u_{r} y_{rj} + v_{o} \ge 0} \hfill \\ \sum\limits_{r = 1}^{s} {u_{r} y_{ro} = 1} \hfill \\ u_{r} ,v_{i} \ge 0 \hfill \\ \end{gathered}\)(2)

CCR

\(u_{o} = 0\)

\(v_{o} = 0\)(3)

BCC

\(u_{o}\) free in sign

\(v_{o}\) free in sign

The major drawback of the DEA model is the “black box” representation of the internal processes of a given DMU. As a matter of fact, a DMU may be formed by diverse sub-substructures that contribute differently to overall levels of efficiency. Network DEA has been designed to handle this drawback. The first and more basic DEA network structure encompasses two distinct stages, which are connected in series and represent specific processes (or substructures) that cooperate for achieving maximal overall efficiency. Explaining it differently, DMUs are structured as a two-stage linear network where the products generated in the first process enter as resources into the subsequent stage (Golany et al., 2006). The two-stage structures are in fact a reduced case derived from a broader network structure composed by multiple stages (Fare, 1991; Fare & Grosskopf, 1996, 2000; Fare & Whittaker, 1995;). Here the general network structure departs from Kao (2014) and Wanke et al. (2018a, 2018b). The productive system is allowed to consume m inputs that are exogenous, \(i = 1,2, \ldots ,m|i \in I^{\left( p \right)}\) to deliver s outputs that are exogenous, \(r = 1,2, \ldots ,s|r \in O^{\left( p \right)}\). Besides, the g endogenous intermediate variables \(f = 1,2, \ldots ,g|f \in M^{\left( p \right)}\) link subsequent stages. The X exogenous resources, Y exogenous products, and Z intermediate endogenous variables are weighted, respectively, by \(v_{i}\), \(u_{r}\), and \(w_{f}\). The efficiency estimation of the whole system (\(\theta_{k}\)) for unit k, considering a total of \(n\) DMUs, given q individual stages, \(p = 1,2, \ldots ,q\), is defined as:

$$ \max \theta_{k} = \sum\limits_{r = 1}^{s} {u_{r} Y_{rk} } $$
(4)
$$ \begin{aligned} & subject to \\ & \sum\limits_{i = 1}^{m} {v_{i} \,X_{ik} = 1} \\ \end{aligned} $$
(5)
$$ \mathop \sum \limits_{r = 1}^{s} u_{r} Y_{rj} - \mathop \sum \limits_{i = 1}^{m} v_{i} X_{ij} \le 0,\;for\;all\;j $$
(6)
$$ \left( {\mathop \sum \limits_{r} u_{r} Y_{rj}^{\left( 1 \right)} + \mathop \sum \limits_{f} w_{f} Z_{fj}^{\left( 1 \right)} } \right) - \mathop \sum \limits_{i} v_{i} X_{ij}^{\left( 1 \right)} \le 0,\;\;p = 2, \ldots ,\left( {q - 1} \right);\;for\;all\;j $$
(7)
$$ \begin{aligned} \left( {\sum\limits_{r} {u_{r} Y_{rj}^{(p)} + \sum\limits_{f} {w_{f} Z_{fj}^{(1)} } } } \right) - \left( {\sum\limits_{i} {v_{i} X_{ij}^{(p)} + \sum\limits_{f} {w_{f} Z_{fj}^{(p - 1)} } } } \right) \le 0, \\ p = 2, \ldots ,\left( {q - 1} \right);\;\;for\;all\;j \\ \end{aligned} $$
(8)
$$ \mathop \sum \limits_{r} u_{r} Y_{rj}^{\left( q \right)} - \left( {\mathop \sum \limits_{i} v_{i} X_{ij}^{\left( q \right)} + \mathop \sum \limits_{f} w_{f} Z_{fj}^{{\left( {q - 1} \right)}} } \right) \le 0,\;\;p = 2, \ldots ,\left( {q - 1} \right);\;\;for\;all\;j $$
(9)
$$ u_{r} \ge \varepsilon ,\;\;for\;all\;r $$
(10)
$$ v_{i} \ge \varepsilon ,\;\;for\;all\;i $$
(11)
$$ w_{f} \ge \varepsilon ,\;\;for\;all\;f $$
(12)

The set \(\left( {u_{r}^{*} ,v_{i}^{*} ,w_{f}^{*} } \right)\) represents an optimal solution while \(s_{k}^{*}\) and \(s_{k}^{\left( p \right)*}\) refer to the constraint slacks and the index p denotes the subset of elements encompassed in a given stage. The efficiency of each stage (\(\theta_{k}^{\left( p \right)}\)) can be computed as follows:

$$ \theta_{k} = \left( {{{\mathop \sum \limits_{r = 1}^{s} u_{r}^{*} Y_{rk} } \mathord{\left/ {\vphantom {{\mathop \sum \limits_{r = 1}^{s} u_{r}^{*} Y_{rk} } {\mathop \sum \limits_{i = 1}^{m} v_{i}^{*} X_{ik} }}} \right. \kern-\nulldelimiterspace} {\mathop \sum \limits_{i = 1}^{m} v_{i}^{*} X_{ik} }}} \right) = \mathop \sum \limits_{r = 1}^{s} u_{r}^{*} Y_{rk} = 1 - s_{k}^{*} $$
(13)
$$ \theta_{k}^{\left( 1 \right)} = {{\left( { \mathop \sum \limits_{r} u_{r}^{*} Y_{rk}^{\left( 1 \right)} + \mathop \sum \limits_{f} w_{f}^{*} Z_{fk}^{\left( 1 \right)} } \right)} \mathord{\left/ {\vphantom {{\left( { \mathop \sum \limits_{r} u_{r}^{*} Y_{rk}^{\left( 1 \right)} + \mathop \sum \limits_{f} w_{f}^{*} Z_{fk}^{\left( 1 \right)} } \right)} {\left( { \mathop \sum \limits_{i} v_{i}^{*} X_{ik}^{\left( 1 \right)} } \right)}}} \right. \kern-\nulldelimiterspace} {\left( { \mathop \sum \limits_{i} v_{i}^{*} X_{ik}^{\left( 1 \right)} } \right)}} $$
(14)
$$ \theta_{k}^{\left( p \right)} = {{\left( { \mathop \sum \limits_{r} u_{r}^{*} Y_{rk}^{\left( p \right)} + \mathop \sum \limits_{f} w_{f}^{*} Z_{fk}^{\left( p \right)} } \right)} \mathord{\left/ {\vphantom {{\left( { \mathop \sum \limits_{r} u_{r}^{*} Y_{rk}^{\left( p \right)} + \mathop \sum \limits_{f} w_{f}^{*} Z_{fk}^{\left( p \right)} } \right)} {\left( { \mathop \sum \limits_{i} v_{i}^{*} X_{ik}^{\left( p \right)} + \mathop \sum \limits_{f } w_{f}^{*} Z_{fk}^{{\left( {p - 1} \right)}} } \right)}}} \right. \kern-\nulldelimiterspace} {\left( { \mathop \sum \limits_{i} v_{i}^{*} X_{ik}^{\left( p \right)} + \mathop \sum \limits_{f } w_{f}^{*} Z_{fk}^{{\left( {p - 1} \right)}} } \right)}},\;\;p = 2, \ldots ,q - 1 $$
(15)
$$ \theta_{k}^{\left( q \right)} = {{\left( { \mathop \sum \limits_{r} u_{r}^{*} Y_{rk}^{\left( q \right)} } \right)} \mathord{\left/ {\vphantom {{\left( { \mathop \sum \limits_{r} u_{r}^{*} Y_{rk}^{\left( q \right)} } \right)} {\left( { \mathop \sum \limits_{i} v_{i}^{*} X_{ik}^{\left( q \right)} + \mathop \sum \limits_{f } w_{f}^{*} Z_{fk}^{{\left( {q - 1} \right)}} } \right)}}} \right. \kern-\nulldelimiterspace} {\left( { \mathop \sum \limits_{i} v_{i}^{*} X_{ik}^{\left( q \right)} + \mathop \sum \limits_{f } w_{f}^{*} Z_{fk}^{{\left( {q - 1} \right)}} } \right)}} $$
(16)

Finally, the system slacks are defined by the summation of the individual slacks calculated for each stage \(s_{k}^{*} = \mathop \sum \nolimits_{p = 1}^{q} s_{k}^{\left( p \right)*}\). This implies that system efficiency depends directly on the efficiency of the individual stages, therefore assuring that efficiency estimation bias regarding the single-stage “black box” DEA are eliminated.

4.2.1 Banking efficiency approach

In accordance to the seminal paper of Berger and Humphrey (1997), two alternative approaches for selecting inputs and outputs in the banking sector exist, which mainly vary in the consideration of deposits as resources or products: the production and the intermediation approaches, respectively.

This research proposes to reconcile the intermediation and production approaches within the ambit of the GMSS-DEA structure, placing them as two consecutive stages of the productive process of Chinese banks. In stage 1 (production approach), banks are considered to produce loans, deposits, and other assets while using capital and labor. On the other hand, in stage 2 (intermediation approach), banks are treated as financial intermediaries that convert deposits and purchase funds into income and revenues. Precisely, deposits are considered as a product within the production approach and as a productive resource within the intermediation one. As a distinctive feature of this paper, both approaches are viewed as complementary in the productive network structure of the banking industry in China, as is depicted in Fig. 1. Readers should note the role played by the exogenous inputs and outputs in both stages. Although they are not directly involved in the core activities of each stage, they may affect the efficiency levels of each stage as long as these exogenous variables can be considered as by-products when considering the outputs of stage 1 or auxiliary resources when considering the inputs of stage 2.

The aggregate results obtained using the GMSS-DEA structure for the Chinese banks are presented in Fig. 2. It depicts the distributions for the overall system and the two individual stages: production approach (stage 1) and intermediation approach (stage 2). Banks in China seem to be less efficient in converting physical and human resources and equity into deposits, loans, and assets than in converting deposits and loans into several types of income. This scenario may suggest that competition level of Chinese banks in generating assets based on physical and monetary resources is low and that they tend to operate in a quasi-monopolistic fashion at the province level, although it is a highly fragmented industry at the country level focused on the front-office of the banking operation (intermediation approach). These competitiveness issues are further explored in the next sections.

Fig. 2
figure 2

Boxplots of the aggregated efficiency results obtained under the GMSS-DEA model

4.3 Measures of banking competition

Since one key aim of this paper, as reflected in its title, is to investigate the endogeneity between competition and efficiency, so obviously the measurement of competition is essential. This section provides three different measurements of competition widely adopted in the banking literature. Our main focuses are: (1) estimate and present the results of bank competition accompanied by relevant discussion; (2) use the results to examine its relationship with efficiency.

4.3.1 Herfindahl–Hirschman index

The Herfindahl–Hirschman Index (HHI or sometimes HHI-score) computes the scale of the firm with respect to a given sector. In this research, two alternative measures of the HHI are computed to assess the concentration of the Chinese banks: HHI as a function of total deposits and HHI as a function of the total credit. The distributions of these indexes are depicted in Fig. 3. Results indicate that the banking industry in China is extremely fragmented over the examined period of 2007–2017 both in terms of deposits and credits. The fact that deposits are slightly less concentrated than credits may be reflected in the fact that efficiency is higher in the intermediation approach when compared to that in the production approach. This is a very interesting finding. It shows that commercial banks in China are better in keeping relationships with borrowers rather than depositors. This can be further traced back to the issue of corruption in such an industry. Different borrowers tried to bribe bank managers in order to get loans, which further promoted the relationship between the bank and borrowers. A sustainable relationship between banks and borrowers further contributes to a more concentrated credit market. In addition, this is also related to the issue that different banks normally engage in providing credits to different types of businesses. For example, state-owned banks usually provide loans to big state enterprises in comparison to city commercial banks that usually concede loans to city level enterprises and small and medium sized enterprises, which will be served by joint-stock commercial banks.

Fig. 3
figure 3

Boxplots of the HHIs computed for the deposits and credits in the Chinese banking sector

4.3.2 Panzar and Rosse (1987) H-statistic

The Panzar and Rosse H-statistic is computed via the following reduced revenue equation form for panel data of Chinese banks. The respective Ordinary Least Squares considering fixed bank-specific effects and time dummies is given in Eq. (17). Logarithmics were taken for all variables. Subscripts i and t respectively refer to bank i at time t. Hausman test was conducted to confirm the choice of the fixed effects model (Casu & Girardone, 2006; Claessens & Laeven, 2004).

$$ \ln (TR_{it} ) = \alpha + \beta_{1} \ln (PL,_{it} ) + \beta_{2} \ln (PF,_{IT} ) + \beta_{3} \ln (PC,_{it} ) + \gamma_{1} \ln (EAR_{it} ) + \gamma_{2} \ln (STA_{it} ) + \gamma_{3} \ln (LAR_{it} ) + \varepsilon_{it} $$
(17)

In this research, TRit is calculated by the following quotient: [Gross Revenue/Total Assets] while PL, PF,it, and PC,it are, respectively:

  • Cost of labor given by the quotient between personnel expenses and total assets [Personnel Expenses/Total Assets]it.

  • Cost of funds computed via the quotient between interest expenses and total deposits [Interest Expenses/Total Deposits]it.

  • Cost of fixed capital calculated using the quotient between other operating and administrative expenses to total assets [Overhead Costs/Total Assets]it.

EARit is the quotient between total equity and total assets, which indicates the capitalization level of the bank [Equity Capital/Total Assets]it.; STAit is total assets, which captures bank size [Total Assets]it; and finally LARit is the quotient between total loans and total assets proxying the portfolio mix of the bank [Total Loans/Total Assets]it. The H-statistics is computed by the sum of the coefficients for the input prices, which are β1, β2, and β3 in Eq. (17). If H is equal to 1, there is perfect competition, but if it lies between 0 and 1, then there is monopolistic competition. However, if H is less than 0, then there is monopoly.

4.3.3 Lerner (1934) index

Different from the H-statistic as explained above, the Lerner index makes it possible to measure the degree of market power for a specific bank on a yearly basis. It can be computed using the relative difference between price and marginal cost. The Lerner index usually ranges from 0 to 1 with higher figures indicating higher levels of market power and lower competition levels, while lower figures underline that there is a lower market-power level and a higher degree of competition (Fare et al., 2015; Fungacova et al., 2013; Tan & Floros, 2014; Tan, 2016; Tan et al., 2017; Liu et al., 2018; Liu et al., 2018; among others). In some scenarios, the value of the Lerner index can be negative. We use Ordinary Least Square to estimate the Lerner index while controlling for bank fixed effects and time dummies. The specification can be expressed as below:

$$ \ln Cost_{it} = \beta_{0} + \beta_{1} \ln Q_{it} + \beta_{2} /2 \, (\ln Q_{it} )^{2} + \sum \gamma_{kt} \ln P_{k,it} + \sum \varphi_{k} \ln Q_{it} \ln P_{k,it} + \sum \sum \delta_{kj} \ln P_{k,it} \ln P_{j,it} + \varepsilon_{it} $$
(18)

where ln represents natural logarithm, cost represents the total cost, and Q stands for bank output. Here the sum of interest income, non-interest income, and net income are the proxy for the bank output. P stands for the three input prices, which are the same as the ones used in the previous section. Again, subscripts t and i read similarly. Equation 18 is differentiated to derive the marginal cost with respect to the output Q as below:

$$ MC_{TA,it} = Cost_{it} /Q_{it} [\beta_{1} + \beta_{2} \ln Q_{it} + \sum \varphi_{k} \ln P_{k,it} ] $$
(19)

Therefore, the Lerner index is given by:

$$ Lerner_{it} = (P_{TA,it} - MC_{TA,it} )/P_{TA,it} $$
(20)

where P stands for price that is measured by the quotient between total revenue and total assets. MC stands for marginal cost, and TA is total assets. In some scenarios when the Lerner index is negative, the price level is lower than the marginal cost. This can be explained by the Chinese banking industry with a special characteristic of higher level of government subsidy. Negative Lerner index is good for the Chinese banking industry on the one hand due to the fact that higher marginal cost will deter entry, which is good for the improvement of profit of the existing banks. On the other hand, higher marginal cost than the price level indicates that the banks are suffering losses in the short run. Results for the OLS regression for the H-statistic and Lerner index are displayed in Table 2. In terms of H-statistic, the findings suggest, based upon the summation of coefficients β1, β2, and β3 (0.34), that the Chinese banking industry is not operating in a monopolistic competition when taken in aggregate. Results for the OLS regression for the Lerner Index also indicate that the market power of Chinese banks is low, thus confirming that these banks operate under a very fragmented fashion and are not so selective with regards to customer loans. The distribution of the weighted H-statistic per bank i and time t and Lerner index is depicted in Fig. 4. This weighted H-statistic allows an individual assessment for banks suggesting that there are a few institutions operating in virtual monopolies, possibly larger banks that operate in niche segments or present dominant positions at the province level. This is associated to the structure of the banking sector in China. As discussed before, Commercial banks that are state-owned usually provide loan services to state and large companies, while city level government normally focuses on providing services to the enterprises at the city level.

Table 2 OLS regression results for the H-Statistic and the Lerner Index
Fig. 4
figure 4

Boxplot of the Lerner Index for each bank i at time t

4.4 Multi-layer perceptron–Hidden Markov Model (MLP-HMM) approach

Artificial Neural Networks (ANNs) are computational algorithms based on the human thinking paradigm. ANNs are formed of processing units (neurons) that are weight connected. These connections motivate the estimation of non-linear models by using a training data set. Athanassopoulos and Curram (1996) is the first literature insight on combining ANNs and DEA for predicting efficiency levels. Other ANN applications in DEA can be found in Santin and Delgado (2004); Wu et al. (2006); Emrouznejad and Shale (2009); Misiunas et al. (2016); Shokrollahpour et al. (2016); Olanrewaju et al. (2012); Bashiri et al. (2013); Modhej et al. (2017).

In this research, a specific focus is placed on the MLP network that has been extensively studied in forecasting applications (Mubiru & Banda, 2008). Within the ambit of an MLP, neurons are pooled in layers and just forward connections are allowed. These features provide a robust architecture capable of learning upon any kind of continuous nonlinear mapping. A typical MLP is represented in Fig. 5.

Fig. 5
figure 5

MLP framework

MLP constituents encompass neurons, weights, and transfer functions. An input \(x_{j}\) is transmitted via connections that multiplies its respective strength by \(w_{ij}\) weights, yielding the product \(x_{j} w_{ij}\) used in the transfer function f to compute a specific output \(y_{i}\) given as \(y_{i = } f\left( {\mathop \sum \nolimits_{j = 1}^{n} x_{j} w_{ij} } \right)\). i is the neuron index in the hidden layer and j is the input index in the MPL. The modification of the weights of each connection observing some orderly fashion is known as training. During the training, an input is assigned to the network along with the desired output and the weights are adjusted so that the MLP catches up with the desired output value.

Here, the focus is on unveiling endogeneity between efficiency and competition in Chinese banks by means of MLP, taking contextual variables as the control ones. This paper departs from previous research in the banking sector by using an MLP network structure to explore endogeneity between these variables in terms of the following models:

  • Model 1: Production Efficiency = Intermediation Efficiency + HHc + HHd + H-Stat + Lemer + Contextual Variables.

  • Model 2: Intermediation Efficiency = Production Efficiency + HHc + HHd + H-Stat + Lemer + Contextual Variables.

  • Model 3: HHc = Production Efficiency + Intermediation Efficiency + HHd + H-Stat + Lemer + Contextual Variables.

  • Model 4: HHd = Production Efficiency + Intermediation Efficiency + HHc + H-Stat + Lemer + Contextual Variables.

  • Model 5: H-Stat = Production Efficiency + Intermediation Efficiency + HHc + HHd + Lemer + Contextual Variables.

  • Model 6: Lerner = Production Efficiency + Intermediation Efficiency + HHc + HHd + H-Stat + Contextual Variables.

The relative importance of each model in explaining the competition and the efficiency levels in the Chinese banking industry, besides the endogenous nature of these variables, were explored, respectively, by the variances of each model and the covariances between models. Variances and covariances of the residuals (\(R_{i}\)) of these six models are simultaneously minimized by a non-linear stochastic optimization problem, as presented in Eq. (21), where \(w_{i}\) stands for the weights, which range from 0 to 1, assigned respectively to the residual vectors of each one of the six models previously described. The values of w are optimized so that the variance (Var) and covariance (Covar) of the pooled residuals is minimal. Model (21) was solved by means of differential evolution (DE). DE is a research stream of genetic algorithms also emulating the natural selection and evolution. Readers should refer to Ardia et al. (2011) and Mullen et al. (2011) for further details. Results are discussed in the next section.

$$ \begin{aligned} \min \left[ {Var\left( {\mathop \sum \limits_{i = 1}^{6} w_{i} * R_{i} } \right) + \left( {\mathop \sum \limits_{i,j = 1}^{6} Co{\text{var}} (w_{i} *w_{j} * R_{i} *R_{j} ), i \ne j, j i} \right)} \right] \\ subject\,to\;\;\mathop \sum \limits_{i = 1}^{6} w_{i} = 1 \\ 0 \le w_{i} \le 1\, \forall i \\ \end{aligned} $$
(21)

Residuals of the MLP models were bootstrapped 200 times based on HMM. The HMM bootstrapping allowed the collection of a distributional profile of w for the most accurate prediction of the network efficiency scores and competition indexes. The stochastic HMM used in this research enables the assessment of endogeneity between efficiency and competitiveness variables by means of the respective transition probabilities for each state. Consider that there are j observations for each bank X at time t that are given as {Xtj: t = 1, …, T; j = 1,…, J}. Also assume that these random vectors are mutually independent. The HMM is structured upon choosing a proper distributional assumption of the random vectors Xtj at each one of the m states of the HMM. Therefore, transition probabilities should be determined for t = 1, 2, …, T, i = 1, 2, …, m, and for all relevant xtj, like in Wanke, dos Henrique et al. (2019), Wanke et al. (2019), Wanke et al. (2019)). Here, the stochastic HMM is modelled as a multinomial distribution for each bank j at each state i over time t. The likelihood function is given as (Zucchini et al., 2016):

$$ {\text{P}}\left( {{\text{X}}_{{{\text{tj}}}} } \right) = {\text{diag}}\left( {{\text{P}}_{1} \left( {{\text{X}}_{{{\text{tj}}}} } \right), \ldots ,{\text{ P}}_{{\text{m}}} \left( {{\text{X}}_{{{\text{tj}}}} } \right)} \right) $$
(22)

Equation (22) holds for continuous-valued series, where \({\text{P}}_{{\text{m}}} \left( {{\text{X}}_{{{\text{tj}}}} } \right)\) is the probability that observation of bank j on time t belongs to state m. The following definitions are now observed:

T = {1…OBS}, where OBS is the number of observations of the sample for each bank.

J = {production approach efficiency (pae), intermediation approach efficiency (iae), HHI for credit (hhic), HHI for deposits (hhid), weighted H-statistic (whs), Lerner Index (li), Age (age), State-Owned Bank (sob), Listed Bank (lb), Impaired Loans ratio (ilr), Total Capital Ratio (tcr), Top 10 Customer Loan Ratio (ttclr)}.

M = {m1….m4096}, each stage m is defined upon the quantile combinations (above median, below median) for each individual vector of observations, as displayed in Table 3.

Table 3 HMM states

5 Results and discussion

While alternative DEA models have made a great contribution to better apprehend the productive network of the banking industry over time, the methods proposed in the NEIO research stream achieved relevant findings for mapping its competitive behavior. Still, these different research streams have not yet been cross-checked against each other in terms of temporal dependence and mutual feedback (endogeneity). This cross-checking is deemed relevant because hidden feedbacks between the competitive structure and the productive efficiency may devise important policy and strategic implications for the Chinese banks.

Overall efficiency and sub-efficiencies of the sample are given in Appendix B. The endogenous nature of this dataset is presented in Figs. 6 and 7. It shows that banking efficiency levels are correlated to contextual variables and to competition structure metrics. Also, the time dependent nature between these relationships is noteworthy.

Fig. 6
figure 6

Endogeneity between banking efficiency, macro-economic, and competition structure variables. (scatterplots)

Fig. 7
figure 7

Time dependence between banking efficiency, contextual, and competition structure variables. (correlograms)

Figures 8, 9 and Tables 4 and 5 report the results regarding the fifteen most probable states and their respective transition matrices. It is noteworthy that, although 120 out of the 4096 individual states showed non-zero probabilities, only 178 out of the 14,400 (120^2) transition matrix possible combinations presented non-zero probabilities. These initial findings indicate that when efficiency, competition structure, and contextual variables are modelled in terms of boostrapped HMM, only a small fraction of all possible transitions actually exist. This feature may be explained by the presence of strong feedback mechanisms between efficiency and competition structure in parallel with time-series auto correlation within the observations of each bank, which also limits the probability of abrupt transitions between two disparate states.

Fig. 8
figure 8

Pareto plot for the 15 most probable states

Fig. 9
figure 9

Heat map for the 15 most probable state transitions (blank cells represent zero probability)

Table 4 Descriptive statistics for the 15 most probable states (*)
Table 5 Descriptive statistics of the most probable state transitions (*)

Figures 10 and 11 report on, respectively, the variance/covariance minimization results using differential evolution for weighted Models 1 to 6 based on 100 data series replications generated with HMM. Residuals from Models 1 to 6 were obtained solving MLP networks for each model for each HMM realization. Model 1, where productive efficiency is the dependent variable, presents isolate as of the highest importance in terms of the minimization of variance/covariance of the residuals. The weight of production efficiency is higher than the sum of the weights of all other models combined and this result suggests that converting physical and human resources into loan and deposits is the key for understanding efficiency and competitiveness in Chinese banks. Besides, productive efficiency is the key for understanding endogeneity as long as it is linked with intermediation efficiency (as expected), the level of concentration of credit and deposits (HHc and HHd), the banking industry competition type (H-Stat), and the market power of each bank (Lerner Index). All other endogenous relations are negligible, as reported in Fig. 11. As regards the competition structure variables, productive efficiency appears to be positively impacted by competition and market power in the sense that market concentration and quasi-monopolistic operation at the province level may favor the conversion of physical and human resources into deposits and loans. There is, however, a trade-off between production and intermediation efficiency levels, which may be explained by risk analysis in credit concession and other provisions for default loans. According to Fig. 12, as regards the contextual variables, the most relevant feedbacks related to productive efficiency occur in order of the impaired loans ratio, age, local bank, top ten customer loans ratio, and total capital ratio. With the exception of tcr, all remaining contextual variables presented a negative feedback, on average, with productive efficiency levels (cf. Table 6). These results suggest that credit risk analysis in older local banks with focused actuation at the province level can possibly be the factor that jeopardizes the productive efficiency levels of the entire banking industry in China. However, at this stage we cannot be assured that definitely it is the credit risk analysis in older local banks that jeopardizes the productive efficiency because the NPL ratio is primarily the metric of ex-post credit risk or the materialized credit risk and not necessarily the outcome of the bank's credit scoring, but may be the result of a deteriorating economic activity in the macro-environment. Productive efficiency of Chinese banks positively impacts, and it is impacted by, these variables, which suggests that a movement towards a perfect banking competition structure at the province level and reduced market power of local banks are drivers of a sound banking system in China. Although less important, HHc presents a positive feedback with production efficiency, thus suggesting that increased banking productivity is a direct consequence of the concentration of credit into fewer banks as a form of maintaining scale and reducing transaction costs.

Fig. 10
figure 10

Relative weight of each model in terms of variance minimization

Fig. 11
figure 11

Relative weights between models in terms of covariance minimization (endogenity)

Fig. 12
figure 12

Relative weights of each explanatory variable within each model

Table 6 Sensitivity analyses on the signs of the relationships (percent of positive signs found in bootstrap interactions)

We can see from the table that the production approach efficiency feeds back in a stronger manner on the intermediation approach efficiency rather than the other way around. This can be explained by the fact that the production approach efficiency mainly measures efficiency using relevant variables related to physical units as reflected in Table 1, i.e. the inputs used include number of employees and equity capital and the output variables are fixed assets, liquid assets, and total assets. In comparison, the intermediation approach efficiency mainly measures the level of efficiency using variables related to currency units such as using interest and non-interest expenses as inputs and interest and non-interest income as the outputs. This finding reflects the fact that in the Chinese banking industry, fixed assets, liquid assets, and total assets are strongly related to the interest and non-interest income of Chinese commercial banks, whereas larger amount of interest and non-interest income is not necessarily used to increase the volume of fixed assets and liquid assets.

We can further notice from Table 6 that the production approach efficiency for state-owned banks is 20, which is significantly different and smaller than the intermediate approach efficiency, which is 66.67. As discussed above, the intermediate approach efficiency focuses on the currency unit, whereas the production approach efficiency concentrates more on physical units. These results show that Chinese state-owned banks are more concerned about “money” related issues in the production process and do not pay enough attention to physical issues, although some of which are very important for the bank stability, such as liquid assets and equity capital. Both state-owned banks and local banks focus on producing monetary units related to outputs rather than physical units. However, local banks pay more attention on generating physical units of outputs. This is because small local banks are more concerned about their safety rather than income.

This finding is very interesting and important for the Chinese banking industry to make relevant policies. Obviously, there is still a room for the Chinese banking industry to increase the level of efficiency through allocating resources in a more optimal way in the production process, but compared to the state-owned banks, the local banks should pay more attention to using the resources to maximize the monetary unit production. It is recommended that both state-owned banks and local banks make more effort to focus on generating important physical units such as capital and liquid assets, which will further promote stability in the Chinese banking industry.

6 Robustness check

First of all, customer loans were incorporated in the NDEA model as an intermediate variable, altogether with SME loans and total loans. Results for the Kullback–Leibler (KL) divergence between previous scores (w/o customer loans) and new scores (w/ customer loans) were found to be negligible: Overall = 7.916239e−07; Stage 1 = 0.006994322; Stage 2 = 0.00944464. Therefore, it can be posited that the impact of customer loans on score differences is minimal for the Chinese banking productive process, which also can be confirmed by visual inspection on Fig. 13.

Fig. 13
figure 13

Robustness analysis scatter plot

Subsequently, the other two ratios of liquidity and year-to-year increase in provision were incorporated as additional contextual variables to be used in the neural network models. Analogously, the results for Stage 1 (the most relevant for explaining endogeneity and total variance reduction) and its interactions with other competition structure variables revealed substantially small values for their respective Mean Squared Error (MSE), thus supporting that the previous results still hold in terms of isotonicity despite the addition of these two new variables. In fact, MSE for Stage 1 weight between both analyses was 0.0861, while as for each one of the five Stage 1 interactions, the MSEs were respectively 0.00092, 0.00196, 0.0029, 0.0045, and 0.00156.

Table 7 Sensitivity analyses on the signs of the relationships (percent of positive signs found in bootstrap interactions) now including risk related variables

7 Sensitivity to risk analysis

An additional sensitivity analysis was performed repeating all steps described so far, but now including two risk-related variables: insolvency risk and capital risk. The insolvency risk is measured by the widely used Z-score (Tan & Floros, 2013), which is calculated using the steps as follows: (1) calculate the ratio of equity capital to total assets; (2) add up the return on assets with (1); (3) use (2) to divide by the standard deviation of return on assets. Higher values of Z-score indicates a lower level of insolvency risk, while lower values indicate a higher level of risk. In terms of the measurement of capital risk, we use the ratio of equity capital to total assets. The literature has used this measurement to reflect the level of capital adequacy (Altunbas et al., 2007; Fiordelisi et al., 2011). Higher values of this ratio indicate a lower level of capital risk, lower values indicate a higher level of capital risk. Table 7 reports on the changes verified in Table 6 results. One can easily verify that the most impacted relationships were those related to H-Statistics, Local Bank, and Total Capital Ratio, for which signs presented reversion with respect to efficiency scores. These results may indicate that local Chinese banks may be operating more leveraged and exposed to financial risks which may be affecting the banking efficiency levels of this particular segment. In fact, this segment is highly concentrated in the hand of local banks with low governance levels. Our results are in line with the findings of Sun et al. (2013) who report that higher levels credit risk of city commercial banks is mainly attributed to the fact that city commercial banks are established with the purpose of supporting regional economic growth and development without assessing the level of credit risk in a strict way. Lower level of governance existed in this specific bank ownership type is mainly reflected by the absolute ownership control from the local government, which impedes the effective and efficient decision making process through the involvement of different shareholders. This coincides with the finding of Sun et al. (2013) showing that strategic investors are supposed to significantly improve the efficiency of Chinese city commercial banks.

8 Conclusions and direction for future research

This paper explored efficiency in Chinese banks using a novel two-stage DEA approach to capture the impact of endogenous and exogenous variables. A specific stochastic HMM and neural network analysis was also developed in this analysis to be able to reduce the fitting bias with this analysis having the advantage of improving the accuracy of the model, in particular when the competition structure and contextual variables are included. Our model significantly contributes to the banking literature as well as to the literature on the operational research in efficiency analysis.

Our findings suggest that the local action of older banks can possibly be the factor that sets the performance threshold in the Chinese banking industry. Regarding concentration indexes, their overall effect on productive efficiency was positive, so we could argue that in the period under analysis that banking concentration was positively related with efficiency. Further research should be directed to use an alternative competition indicator (Boone indicator) to measure the level of competition in the Chinese banking industry. In addition, our method can be applied to other countries to see whether the result will hold.