1 Introduction

The nonparametric data envelopment analysis (DEA) was introduced by Charnes et al. (CCR) in 1978 as a tool for efficiency and productivity measurement of the decision-making units. The namely DEA CCR model assumes constant returns to scale, while Banker et al. (BCC) in 1984 further extended the CCR model, the so-called DEA BCC model can decompose the overall efficiency into pure technical efficiency and scale efficiency by assuming variable returns to scale. The DEA models have been widely applied across different economic sectors including the banking industry (Fukuyama and Tan, 2022b); education industry (Mozaffari et al. 2022); hospitality industry (Tan and Despotis, 2021); Transportation industry (Omrani et al. 2022a); healthcare industry (Omrani et al. 2022b); energy industry (Chu et al. 2021); among others. Both the traditional DEA and various model advancements have been used by previous studies in the estimation of efficiency in the real estate industry, including the traditional DEA CCR and BBC models (Li et al. 2019; Nicholson and Stevens, 2021); DEA Malmquist approach (Chen et al. 2020); Hybrid DEA and Malmquist total factor productivity index (Jin et al. 2015); DEA super-efficiency model (Zheng et al. 2011); Slack-based DEA model (Yang and Fang, 2020); Dynamic network DEA (Yang et al. 2019); Slack-based super-efficiency model (Wang et al. 2020); hierarchy slacks-based DEA (Mills et al. 2021), and combination of grey methodology and DEA (Wang et al. 2021).

According to Tan (2016a), market power mainly measures the extent to which a specific firm can set the price level of its products above the marginal cost. Besides the traditional concentration ratio and Herfindahl–Hirschman index, the Lerner index has been widely applied in the empirical literature to measure the level of competition/market power (Aghion et al. 2015; Ariss, 2010; Leroy and Lucotte, 2017; Tan et al. 2021). The traditional Lerner index benefits from the advantage over the Panzar-Roose H statistic (Tan, 2016b) by being able to generate the level of market power for each decision-making unit rather than the level of competition/market power on an annual basis for the whole industry. However, it still suffers from the disadvantage of its inability to derive the level of market power for different markets/products. Very limited attempts have been made to investigate market power in the real estate industry.Footnote 1 Building on the traditional Lerner index, attempts have been made to develop this index in different ways, including (1) the use of a semiparametric smooth coefficient model (Diallo, 2015); the main advantage of this method lies in the fact that it can increase the flexibility of the functional form in estimating the Lerner index, however, it suffers from the same disadvantages as the traditional Lerner index that it does not attempt to address the market power in a multi-product context; (2) Scale-elasticity based marginal cost (Tsionas et al. 2018). Instead of estimating the cost functions, the study proposed a nonlinear system of simultaneous equations, through the estimation of which, the level of cost efficiency and scale elasticity can be derived at the same time, the latter of which was then used to derive the Lerner index. In addition, the study considered a multi-product context in estimating the Lerner index. The main limitation of the study lies in the fact that its method allows for non-profit maximizing DMUs across different years of the period. It is an important piece of information that can be used by the firm’s management. (3) Fukuyama and Tan (2022a) used the nonparametric DEA for the estimation of the Lerner index. The main limitation of Fukuyama and Tan (2022a) lies in the fact that it only considered a single-output empirical scenario although this study contributes to the literature by applying the nonparametric DEA in estimating the multi-output marginal costs. Finally, Shaffer and Spierdijk (2020) proposed to use the non-homothetic generalized Leontief (NHT-GL) cost function to estimate Lerner indices and applied it to the banking sector. Different from the previous estimations of Lerner indices, this measurement discussed the estimation of Lerner indices in a multi-product context and the proposed translog cost function can reduce the number of cases for negative Lerner indices, however, it still cannot completely remove the negative values of Lerner indices, which is not in accordance with the definition and properties of the Lerner index. This indicates that the estimation still suffers from limitations. The proposal and solution are needed to provide an estimation that would be able to completely remove the negativity issue of Lerner indices, and at the same time, the method disallows non-profit maximizing.

We propose a DEA approach for estimating the Lerner index, which uses a modified nonparametric cost function. It is well-known that the DEA-based cost function not only suffers from the existing multiple optima (with zero marginal costs for some products), but also some observed product prices can be less than the estimated marginal costs in practice. In the latter case, the Lerner index becomes negative and may not be acceptable from the monopoly and/or oligopoly theoretic perspective. However, it may be difficult to obtain practically useful prices which are directly comparable to the estimated shadow (multipliers) prices, although the observed prices are supposedly unitless. We usually construct the firm-specific prices according to the same price construction procedure for all firms (for example by dividing expenditure by the relevant input quantity). Hence, we develop an optimization method and handle the issue of the possible negative Lerner index by controlling for the multiplier values, so that the estimates do not exceed the observed input prices. Moreover, our DEA framework has at least two more desirable characteristics: (a) the Lerner index estimation is carried out without assuming any specific functional structure, which is generally unknown, and (b) the estimation procedure disallows non-profit maximizing behaviour.

The real estate enterprises took a large amount of bank loans and played a key role in determining the level of financial stability in China. A lower level of market power for those small and medium-sized real estate enterprises will induce them to undertake a higher level of risk-taking behavior, which would increase the probability of bank loan default and further result in financial instability. Therefore, improving the level of market power in the Chinese real estate enterprises would be the priority for the government. Before finding ways to improve the level of market power, we need to clearly understand the level of market power in the Chinese real estate industry. This would purely rely on an accurate and comprehensive analysis of the degree of market power. The current study not only contributes to the estimation of market power in the real estate industry in general and the Chinese real estate industry specifically from the empirical perspective, but our model provides significant contributions from the operational research perspective.

2 Notation and methodology.

2.1 Lerner index in a multi-product setting: preliminaries

To start with, let \({\mathbf{x}} = \left( {x_{1} , \ldots ,x_{N} } \right)^{\text{T}} \in {\mathbb{R}}_{ + }^{N}\) and \({\mathbf{y}} = \left( {y_{1} , \ldots ,y_{M} } \right)^{\text{T}} \in {\mathbb{R}}_{ + }^{M}\) represent non-negative column vectors of N different inputs and M different products, respectively, where superscript \(\text{T}\) indicates the transposing operator. The production possibility set (PPS) is denoted as

$$\begin{array}{*{20}c} {PPS = \left\{ {\left( {\begin{array}{*{20}c} {\mathbf{x}} \\ {\mathbf{y}} \\ \end{array} } \right)\left| {\left( {\begin{array}{*{20}c} {\mathbf{x}} \\ {\mathbf{y}} \\ \end{array} } \right)} \right.{\text{ is a technologically feasible input}}/{\text{product vector}}} \right\}.} \\ \end{array}$$
(1)

With respect to (1), the standard or unrestricted cost function is denoted as

$$\begin{array}{*{20}c} {\overline{C}\left( {{\mathbf{y}},{\mathbf{w}}} \right) = \mathop {\min }\limits_{{\mathbf{x}}} \left\{ {{\mathbf{w}}^{\text{T}} {\mathbf{x}}\left| {\left( {\begin{array}{*{20}c} {\mathbf{x}} \\ {\mathbf{y}} \\ \end{array} } \right)} \right. \in PPS} \right\}} \\ \end{array}$$
(2)

where \({\mathbf{w}} = \left( {w_{1} , \ldots ,w_{N} } \right)^{\text{T}} \in {\mathbb{R}}_{ + + }^{N}\) is a positive column vector of N input prices and the inner product \(\overline{c} = { }{\mathbf{w}}^{\text{T}} {\mathbf{x}} = w_{1} x_{1} + \cdots + w_{N} x_{N}\) is total input cost. Let

$$\begin{array}{*{20}c} { \overline{MC}_{m} \left( {{\mathbf{y}},{\mathbf{w}}} \right) = \frac{{\partial \overline{C}\left( {{\mathbf{y}},{\mathbf{w}}} \right)}}{{\partial y_{m} }} } \\ \end{array}$$
(3)

be the marginal cost of product m, and let \(p_{m}\) be the price of product m. Then the Lerner sub-index for product m is written as

$$\begin{array}{*{20}c} {\overline{L}_{m} = \frac{{P_{m} - \overline{MC}_{m} \left( {{\mathbf{y}},{\mathbf{w}}} \right)}}{{P_{m} }}, m = 1, \ldots ,M} \\ \end{array}$$
(4)

If \(\overline{L}_{m}\) = 0 with \(P_{m} = \overline{MC}_{m} \left( {{\mathbf{y}},{\mathbf{w}}} \right)\), then the decision-making unit (DMU) has no market power. Under profit maximization, \(\overline{L}_{m}^{{}}\) takes the value between zero and one. To show this, consider the following profit maximization problem:

$$\begin{array}{*{20}c} {\mathop {\max }\limits_{{{\mathbf{x}},{\mathbf{y}}}} \left\{ {{\mathbf{p}}\left( {\mathbf{y}} \right)^{\text{T}} {\mathbf{y}} - {\mathbf{w}}^{\text{T}} {\mathbf{x}}{|}\left( {{\mathbf{x}},{\mathbf{y}}} \right) \in PPS} \right\} = \mathop {\max }\limits_{{\mathbf{y}}} \left\{ {{\mathbf{p}}\left( {\mathbf{y}} \right)^{\text{T}} {\mathbf{y}} - \overline{C}\left( {{\mathbf{y}},{\mathbf{w}}} \right)} \right\}} \\ \end{array}$$
(5)

where the product price vector \({\mathbf{p}}\left( {\mathbf{y}} \right)\) depends upon \({\mathbf{y}}\). The first-order conditions of the right-hand side of (5) lead to

$$\begin{array}{*{20}c} {\frac{{p_{m}^{{}} - \overline{MC}_{m}^{{}} \left( {{\mathbf{y}},{\mathbf{w}}} \right){ }}}{{p_{m}^{{}} }} = - \frac{1}{{\varepsilon_{m}^{{}} \left( {\mathbf{y}} \right)}}, \,\,\,\, m = 1, \ldots ,M.} \\ \end{array}$$
(6)

where \(\varepsilon_{m} \left( {\mathbf{y}} \right)\) is the inverse of partial elasticity of demand with respect to product m, i.e., \(\varepsilon_{m} \left( {\mathbf{y}} \right) = \left( {\frac{{\partial p_{m} \left( {\mathbf{y}} \right)}}{{\partial y_{m} }}\frac{{y_{m} { }}}{{p_{m} \left( {\mathbf{y}} \right)}}} \right)^{ - 1} \le 0\). For a single product case, the right-hand side of (5) becomes the single product inverse demand function. Equation (6) and non-positivity of \(\varepsilon_{m} \left( {\mathbf{y}} \right)\) indicates the standard Lerner sub-index \(\overline{L}_{m} \in \left[ {0,1} \right]\) and hence, the profit maximizing DMU will not operate at the region where the marginal cost is less than zero or \(\varepsilon_{m} \left( {\mathbf{y}} \right)\) is greater than zero, indicating that \(\overline{L}_{m}\) cannot be greater than one. If \(0 < \overline{L}_{m} < 1\), then the DMU has some market power (the larger \(\overline{L}_{m}\) is, the higher the market power).

Let \(R_{m} = p_{m} y_{m}\) be the revenue from product m and let \(R = \mathop \sum \limits_{{m^{^{\prime}} = 1}}^{M} p_{{m^{^{\prime}} }} y_{{m^{^{\prime}} }} = \mathop \sum \limits_{{m^{^{\prime}} = 1}}^{M} R_{{m^{^{\prime}} }}\) be the total revenue. Utilizing these notations, we define the revenue weight for product m as \(W_{m} = \frac{{R_{m} }}{{\mathop \sum \nolimits_{{m^{^{\prime}} = 1}}^{M} R_{{m^{^{\prime}} }} }}\). We aggregate the standard Lerner sub-indexes (4) by using revenue shares as the weight. We denote a standard aggregate Lerner index, \(\overline{AL}\) as:

$$\begin{array}{*{20}c} {\overline{AL} = \mathop \sum \limits_{m = 1}^{M} W_{m} \overline{L}_{m} } \\ \end{array}$$
(7)

where \(\sum\nolimits_{m = 1}^{M} {W_{m} } = {1}\). The standard aggregate Lerner index (7) is not only less than or equal to one, but also greater than or equal to zero under profit maximization, like Lerner sub-indexes (4).

Now we consider a nonparametric DEA implementation of the standard cost function. Suppose that there are J DMUs, each \(DMU_{j}^{{}}\) of which consumes a non-negative input vector \({\mathbf{x}}_{j}^{{}} = \left( {x_{1j}^{{}} , \ldots ,x_{Nj}^{{}} } \right)^{\text{T}} \in {\mathbb{R}}_{ + }^{N}\) to produce a non-negative product vector \({\mathbf{y}}_{j}^{{}} = \left( {y_{1j}^{{}} , \ldots ,y_{Mj}^{{}} } \right)^{\text{T}} \in {\mathbb{R}}_{ + }^{M}\). Let \({{\varvec{\uplambda}}} = \left( {\lambda_{1}^{{}} , \ldots ,\lambda_{J}^{{}} } \right)^{\text{T}} \in {\mathbb{R}}_{ + }^{J}\) be a non-negative vector of intensities in linear programming and let \(0\) be an appropriate dimensional column vector with all elements being zero. A DEA production possibility set is constructed as

$$\begin{array}{*{20}c} {\left\{ {\left( {\begin{array}{*{20}c} {\mathbf{x}} \\ {\mathbf{y}} \\ \end{array} } \right){|}\mathop \sum \limits_{j = 1}^{J} {\mathbf{x}}_{j}^{{}} \lambda_{j}^{{}} \le {\mathbf{x}},{ }\mathop \sum \limits_{j = 1}^{J} {\mathbf{y}}_{j}^{{}} \lambda_{j}^{{}} \ge {\mathbf{y}},{ }\mathop \sum \limits_{j = 1}^{J} \lambda_{j}^{{}} = 1,{ }{\mathbf{x}} \ge {\mathbf{0}},{ }{\mathbf{y}} \ge {\mathbf{0}},{ }{{\varvec{\uplambda}}} \ge {\mathbf{0}}} \right\},} \\ \end{array}$$
(8)

where \(\sum\nolimits_{j = 1}^{J} {\lambda_{j} } = 1\) allows for variable returns to scale. Using (8), the envelopment form of the DEA cost function (Färe et al. 1994) is constructed by

$$\begin{array}{*{20}c} {\overline{C}\left( {{\mathbf{y}},{\mathbf{w}}} \right) = \mathop {{\text{min}}}\limits_{{{\mathbf{x}},{ }{{\varvec{\uplambda}}}}} \left\{ {{\mathbf{w}}^{\text{T}} {\mathbf{x}}{|}\mathop \sum \limits_{j = 1}^{J} {\mathbf{x}}_{j} \lambda_{j} \le {\mathbf{x}},{ }\mathop \sum \limits_{j = 1}^{J} {\mathbf{y}}_{j} \lambda_{j} \ge {\mathbf{y}},{ }\mathop \sum \limits_{j = 1}^{J} \lambda_{j} = 1,\;\;\;\;{\mathbf{x}} \ge {\mathbf{0}},\;\;\;\;{{\varvec{\uplambda}}} \ge {\mathbf{0}}} \right\},} \\ \end{array}$$
(9)

The multiplier representation of (9) is written as

$$\mathop {\max }\limits_{{{\mathbf{v}},{\mathbf{u}},\omega }} \begin{array}{*{20}c} {\left\{ {{\mathbf{u}}^{{\text{T}}} {\mathbf{y}} + \omega | - {\mathbf{v}}^{{\text{T}}} {\mathbf{x}}_{j} + {\mathbf{u}}^{{\text{T}}} {\mathbf{y}}_{j} + \omega \le 0~~\left( {\forall j} \right),~~{\mathbf{v}} \le {\mathbf{w}},~~{\mathbf{v}} \ge {\mathbf{0}},~~{\mathbf{u}} \ge {\mathbf{0}},~~~\omega :\,\,{\text{free}}} \right\}} \\ \end{array}$$
(10)

Using the standard cost function, we consider the standard indirect output possibility correspondence, which is a mapping \(\overline{IP} :{ }{\mathbb{R}}_{ + }^{N} { } \to { }\overline{IP} ({\mathbf{w}}/{\overline{c}}) \subseteq {\mathbb{R}}_{ + }^{M} ,\) where

$$\begin{array}{*{20}l} {\overline{IP} ({\mathbf{w}}/\overline{c}) = \left\{ {{\mathbf{y}} \in {\mathbb{R}}_{ + }^{M} \left| {\overline{C}\left( {{\mathbf{y}},{\mathbf{w}}/\overline{c}} \right) \le 1} \right.{ }} \right\}} \\ \end{array}$$
(11)

where \(\overline{C}\left( {{\mathbf{y}},{\mathbf{w}}/\overline{c}} \right) \le 1 \Leftrightarrow \overline{C}\left( {{\mathbf{y}},{\mathbf{w}}} \right) \le \overline{c}\) from the definition of the standard cost function. Therefore, \(\overline{C}\left( {{\mathbf{y}},{\mathbf{w}}/\overline{c}} \right) = \overline{C}\left( {{\mathbf{y}},{\mathbf{w}}} \right)/\overline{c}\) is the standard cost efficiency. For a detailed account of (11), see Shephard (1970), which was further developed by Färe et al. (1994), and Färe and Primont (2006). For some applications of (11), see Grosskopf et al. (1999), Fukuyama and Weber (2005) and Fukuyama and Tan (2021).

As stated earlier, when the multiplier form of the standard cost function is used to estimate marginal costs, two issues should be considered: the unsatisfied profit maximization conditions and alternate solutions with zero product multiplier values. We develop an optimization method for dealing with these issues in the following two sub-sections.

2.2 Empirical procedure for coping with profit maximization

To avoid any possible unsatisfaction of profit maximization conditions, we implement the following multiplier restrictions:

$$\begin{array}{*{20}c} {{\mathbf{u}} \le {\mathbf{p}}.} \\ \end{array}$$
(12)

It is usually necessary to adapt the multipliers (shadow prices) because the prices of various products are preferably unitless for comparison purposes, but it is sometimes difficult to find such prices in empirical analysis. Appending (12) in the multiplier form of (10), we obtain the following adapted cost function, which implements the price data to satisfy the profit maximization conditions, as

$$C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right) = \mathop {\max }\limits_{{{\mathbf{v}},{\mathbf{u}},~\omega }} \left\{ {{\mathbf{u}}^{{\text{T}}} {\mathbf{y}} + \omega \left| {\begin{array}{*{20}c} { - {\mathbf{v}}^{{\text{T}}} {\mathbf{x}}_{j} + {\mathbf{u}}^{{\text{T}}} {\mathbf{y}}_{j} + \omega \le 0~~\left( {\forall j} \right),~~{\mathbf{v}} \le {\mathbf{w}},~~} \\ {{\mathbf{u}} \le {\mathbf{p}},~~{\mathbf{v}} \ge {\mathbf{0}},~~{\mathbf{u}} \ge {\mathbf{0}},~~~\omega :free} \\ \end{array} } \right.} \right\}.$$
(13)

The corresponding envelopment form is written as

$$C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right) = \mathop {{\text{min}}}\limits_{{{\mathbf{x}},{\mathbf{d}},{{\varvec{\uplambda}}} }} \left\{ {{\mathbf{w}}^{{\text{T}}} {\mathbf{x}} + {\mathbf{p}}^{{\text{T}}} {\mathbf{d}}\left| {\begin{array}{*{20}c} {\sum\limits_{{j = 1}}^{J} {{\mathbf{x}}_{j} } \lambda _{j} \le {\mathbf{x}},\sum\limits_{{j = 1}}^{J} {{\mathbf{y}}_{j} } \lambda _{j} \ge {\mathbf{y}} - {\mathbf{d}},} \\ {{\mathbf{d}} \ge {\mathbf{0}},~~\sum\limits_{{j = 1}}^{J} {\lambda _{j} } = 1,~~{\mathbf{x}} \ge {\mathbf{0}},{ }~~{{\varvec{\uplambda}}} \ge {\mathbf{0}}} \\ \end{array} } \right.} \right\}.$$
(14)

The difference between \({ }C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right)\) and \(\overline{C}\left( {{\mathbf{y}},{\mathbf{w}}} \right)\) is that the inner product \({\mathbf{p}}^{\text{T}} {\mathbf{d}}\) and the variable \({\mathbf{d}}\) appear in the objective function and the product constraints, respectively, for the adapted cost function case, but do not for the standard cost function. We refer to the sum, \({\mathbf{w}}^{\text{T}} {\mathbf{x}} + {\mathbf{p}}^{\text{T}} {\mathbf{d}}\), as the cost which is an “adapted” cost due to the constraint (12). When a firm is adapted cost efficient, we have \(c = {\mathbf{w}}^{\text{T}} {\mathbf{x}} + {\mathbf{p}}^{\text{T}} {\mathbf{d}}\). Otherwise, \(c > {\mathbf{w}}^{\text{T}} {\mathbf{x}} + {\mathbf{p}}^{\text{T}} {\mathbf{d}}\).

At the optimum, the optimized objection functions of (13) and (14) are the same, i.e.,

$$C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right) = {\mathbf{u}}^{{\text{*T}}} {\mathbf{y}} + \omega = {\mathbf{w}}^{\text{T}} {\mathbf{x}}^{*} + {\mathbf{p}}^{\text{T}} {\mathbf{d}}^{*}$$

where asterisk * shows the optimality of (13) and (15). Since the observed input cost c > 0 is greater than or equal to the minimum adapted cost, we have \(c \ge {\mathbf{u}}^{{\text{*T}}} {\mathbf{y}} + \omega = {\mathbf{w}}^{\text{T}} {\mathbf{x}}^{*} + {\mathbf{p}}^{\text{T}} {\mathbf{d}}^{*} \Rightarrow 0 \ge {\mathbf{w}}^{\text{T}} {\mathbf{x}}^{*} + {\mathbf{p}}^{\text{T}} {\mathbf{d}}^{*} - c.\)

Using this result, we formulate the Lagrange function for (13) as follows:

$$\begin{array}{*{20}c} \begin{aligned} {\mathcal{L}}\left( {{\mathbf{v}},{\mathbf{u}},\omega ,{ }{\mathbf{t}},{\mathbf{e}},{\mathbf{d}},a} \right) & = {\mathbf{u}}^{\text{T}} {\mathbf{y}} + \omega - \mathop \sum \limits_{j = 1}^{J} t_{j} \left( { - {\mathbf{v}}^{\text{T}} {\mathbf{x}}_{j} + {\mathbf{u}}^{\text{T}} {\mathbf{y}}_{j} + \omega } \right) \\ & \,\,\,\, - \mathop \sum \limits_{n = 1}^{N} e_{n} \left( {v_{n} { } - w_{n} } \right){ } - \mathop \sum \limits_{m = 1}^{M} d_{m} \left( {u_{m} { } - p_{m} } \right) - a\left( {c{ } - {\mathbf{w}}^{\text{T}} {\mathbf{x}} - {\mathbf{p}}^{\text{T}} {\mathbf{d}}} \right) \\ \end{aligned} \\ \end{array}$$
(15)

Hence, we can write

$$\begin{array}{*{20}c} \begin{aligned} {\mathcal{L}}\left( {{\mathbf{v}}^{*} ,{\mathbf{u}}^{*} ,\omega^{*} ,{ }{\mathbf{t}}^{*} ,{\mathbf{e}}^{*} ,{\mathbf{d}}^{*} ,a^{*}} \right) & = {\mathbf{u}}^{{\text{*T}}} {\mathbf{y}} + \omega - \mathop \sum \limits_{j = 1}^{J} t_{j}^{*} \left( { - {\mathbf{v}}^{{\text{*T}}} {\mathbf{x}}_{j} + {\mathbf{u}}^{{\text{*T}}} {\mathbf{y}}_{j} + \omega^{*} } \right){ } \\ & \,\,\,\,\, - \mathop \sum \limits_{n = 1}^{N} e_{n}^{*} \left( {v_{n}^{*} { } - w_{n} } \right){ } - \mathop \sum \limits_{m = 1}^{M} d_{m}^{*} \left( {u_{m}^{*} { } - p_{m} } \right){ } - a^{*} \left( {c{ } - {\mathbf{w}}^{\text{T}} {\mathbf{x}}^{*} - {\mathbf{p}}^{\text{T}} {\mathbf{d}}^{*} } \right) \\ \end{aligned} \\ \end{array}$$
(16)

Since the optimal objective function value of (13) is equal to \(C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right)\) in (14), we have \(C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right) = {\mathcal{L}}^{*} \left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right) = {\mathcal{L}}\left( {{\mathbf{v}}^{*} ,{\mathbf{u}}^{*} ,\omega^{*} ,{ }{\mathbf{t}}^{*} ,{\mathbf{e}}^{*} ,{\mathbf{d}}^{*} ,a^{*} } \right)\). Applying the envelop theorem, we obtain

$$\begin{array}{*{20}c} { MC_{m} \left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right) = \frac{{\partial C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right)}}{{\partial y_{m} }} = u_{m}^{*} } \\ \end{array}$$
(17)

which is the marginal cost for product m for (13) and indicates that the evaluated DMU’s marginal cost with respect to product m can be obtained by \(u_{m}^{*} \ge 0\) (part of the solution to (13)). Similar to the standard cost function case, the Lerner sub-indexes, \(L_{m} { },\) as well as the aggregate Lerner index, \(AL\), for the product-price restricted cost function case are obtained as follows:

$$\begin{array}{*{20}c} {AL = \mathop \sum \limits_{m = 1}^{M} W_{m} L_{m},\, L_{m} = \frac{{P_{m} - MC_{m} \left( {{\mathbf{y}},{\mathbf{w}};{\mathbf{p}}} \right)}}{{P_{m} }},\,\,\,\, m = 1, \ldots ,M} \\ \end{array}$$
(18)

Note that Fukuyama and Tan (2022a) reported a linkage between the standard cost function and marginal cost and provided an illustration within a single product setting. The weak complementary slackness conditions state

$$\left\{ {\begin{array}{*{20}l} {\lambda_{j}^{*} \left( {{\mathbf{v}}^{{\text{*T}}} {\mathbf{x}}_{j} - {\mathbf{u}}^{{\text{*T}}} {\mathbf{y}}_{j} - \omega^{\text{*}} } \right) = 0,} \hfill & {j = 1, \ldots ,J} \hfill \\ {x_{n}^{*} \left( {w_{n} - v_{n}^{*} { }} \right) = 0,} \hfill & {{ }n = 1, \ldots N} \hfill \\ {d_{m}^{*} \left( {p_{m} - u_{m}^{*} { }} \right) = 0,} \hfill & {m = 1, \ldots M} \hfill \\ {v_{n}^{*} \left( {x_{n}^{*} - \mathop \sum \limits_{j = 1}^{J} x_{nj} \lambda_{j}^{*} } \right) = 0,{ }} \hfill & {n = 1, \ldots N} \hfill \\ {u_{m}^{*} \left( {\mathop \sum \limits_{j = 1}^{J} y_{mj} \lambda_{j}^{*} - y_{m} } \right) = 0,{ }} \hfill & {m = 1, \ldots M} \hfill \\ {a^{\text{*}} \left( {c{ } - {\mathbf{w}}^{\text{T}} {\mathbf{z}}^{\text{*}} - {\mathbf{p}}^{\text{T}} {\mathbf{d}}^{\text{*}} } \right) = 0{ }} \hfill & {} \hfill \\ \end{array} } \right\}$$
(19)

If strong complementary slackness is assumed, then the last two sets of constraints in (19) yield the following results:

$$\begin{array}{*{20}c} {\left\{ {\begin{array}{*{20}c} {u_{m}^{*} > 0 \Leftrightarrow \mathop \sum \limits_{j = 1}^{J} y_{mj} \lambda_{j}^{*} - y_{m} = 0} \\ {u_{m}^{*} = 0 \Leftrightarrow \mathop \sum \limits_{j = 1}^{J} y_{mj} \lambda_{j}^{*} - y_{m} > 0} \\ \end{array} } \right\}} \\ \end{array}$$
(20)

which indicates that we can identify whether the DMU has market power in the m-th product market. The vector inequality \({ }{\mathbf{u}} \le {\mathbf{p}}\) in (12), derived from the assumption of profit maximization, indicates that each product multiplier does not exceed the respective observed product price. Note that this vector inequality leads to nonnegativity of the aggregate Lerner index as well as the Lerner sub-indexes in (17).

2.3 Empirical approach for implementing positive Lerner index

In order to handle the issue of multiple optima with zero values of product multipliers as well as the product price adaptations, we extend the indirect output possibility correspondence, which is a mapping \(\overline{IP} :{ }{\mathbb{R}}_{ + + }^{N} { } \to \overline{IP} \left( {{\mathbf{w}}/\bar{c}} \right) \subseteq {\mathbb{R}}_{ + }^{M}\), where

$$\overline{{IP}} \left( {{\mathbf{w}}/\bar{c}} \right) = \left\{ {{\mathbf{y}} \in \mathbb{R}_{ + }^{M} |\bar{C}\left( {{\mathbf{y}},{\mathbf{w}}/\bar{c}} \right) \le 1} \right\}{\text{ }} = \left\{ {{\mathbf{y}} \in \mathbb{R}_{ + }^{M} \left| {\begin{array}{*{20}c} {\sum\limits_{{j = 1}}^{J} {{\mathbf{x}}_{j} } \lambda _{j} \le {\mathbf{x}},\sum\limits_{{j = 1}}^{J} {{\mathbf{y}}_{j} } \lambda _{j} \ge {\mathbf{x}},} \\ {\left( {{\mathbf{w}}/\bar{c}} \right)^{{\text{T}}} {\mathbf{x}} \le 1}, \\ {\sum\limits_{{j = 1}}^{J} {\lambda _{j} } = 1, ~~{\mathbf{x}} \ge {\mathbf{0}},~~\lambda \ge {\mathbf{0}}} \\ \end{array} } \right.} \right\}$$
(21)

into the adapted indirect output possibility correspondence, which is a mapping \(IP:\mathbb{R}_{{ + + }}^{N} \times \mathbb{R}_{{ + + }}^{M} \to IP\left( {w/c,p/c} \right) \subseteq \mathbb{R}_{ + }^{M} ,\) where

$$\begin{aligned} IP\left( {{\mathbf{w}}/c,{\mathbf{p}}/c{ }} \right) & = \left\{ {{\mathbf{y}} \in {\mathbb{R}}_{ + }^{M} {| }C\left( {{\mathbf{y}},{\mathbf{w}}/c,{\mathbf{p}}/c} \right) \le 1{ }} \right\} \\ & = \left\{ {{\mathbf{y}} \in {\mathbb{R}}_{ + }^{M} \left| {\begin{array}{*{20}c} {\mathop \sum \limits_{j = 1}^{J} {\mathbf{x}}_{j} \lambda_{j} \le {\mathbf{x}}, \mathop \sum \limits_{j = 1}^{J} {\mathbf{y}}_{j} \lambda_{j} \ge {\mathbf{y}} - {\mathbf{d}}, } \\ { \left( {{\mathbf{w}}/c} \right)^{\text{T}} {\mathbf{x}} + \left( {{\mathbf{p}}/c} \right)^{\text{T}} {\mathbf{d}} \le 1, \mathop \sum \limits_{j = 1}^{J} \lambda_{j} = 1,} \\ {{\mathbf{d}} \ge {\mathbf{0}}, {\mathbf{x}} \ge {\mathbf{0}}, \lambda \ge {\mathbf{0}}} \\ \end{array} } \right.} \right\} \\ \end{aligned}$$
(22)

where \(c\left( { \ge {\mathbf{wx}}^{{}} + {\mathbf{py}}^{{}} } \right)\) is a prespecified positive value as stated above. The main difference between (21) and (22) is that the vector \({\mathbf{d}}\) appears in (22) but does not in (21). From the constraints in (21) and (22), we have \(\mathop \sum \limits_{j = 1}^{J} {\mathbf{y}}_{j} \lambda_{j} \ge {\mathbf{y}} \ge {\mathbf{y}} - {\mathbf{d}}\) and \(\left[ {{\mathbf{wx}} \le \overline{c}{ } \Rightarrow { }{\mathbf{wx}} + {\mathbf{pd}} \le \overline{c} + {\mathbf{pd}} \le c} \right]\), yielding \(\overline{IP} \left( {{\mathbf{w}}/c{ }} \right) \subseteq IP\left( {{\mathbf{w}}/c,{\mathbf{p}}/c{ }} \right)\). Hence, \(C\left( {{\mathbf{y}},{\mathbf{w}}/c,{\mathbf{p}}/c} \right) \le \overline{C}\left( {{\mathbf{y}},{\mathbf{w}}/c} \right)\). Note that the adapted unit cost function or the adapted cost efficiency is used as the indirect technology or adapted indirect output possibility set.

Now we are ready to establish the following two propositions.

Proposition 1

The adapted DEA-based indirect technology, \(IP\left( {{\mathbf{w}}/c,{\mathbf{p}}/c} \right)\) is a convex set, and \(C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right)\) is quasi-convex in \({\mathbf{y}}\).

Proposition 1 establishes the convexity of the adapted indirect output possibility set, which is equivalent to the quasi-convexity of the adapted cost function.Footnote 2 The proof of Proposition 1 is given in "Appendix A1".

Proposition 2

Let \(\partial_{o} \left( {IP\left( {{\mathbf{w}}/c,{\mathbf{p}}/c{ }} \right)} \right)\) be the set of all adapted cost-efficient outputs in \(IP\left( {{\mathbf{w}}/c,{\mathbf{p}}/c{ }} \right)\), which dominate the output vector \({\mathbf{y}}_{o}\) of \({\text{DMU}}_{o}\). Then, we have: \({\mathbf{y}} \in \partial_{o} \left( {IP\left( {{\mathbf{w}}/c,{\mathbf{p}}/c{ }} \right)} \right)\) if and only if

$$\exists \lambda_{k}^{{}} \ge 0,\,t_{k}^{{}} \ge 0,\, b_{k}^{{}} \in \left\{ {0,1} \right\}, \, \left( {k \in CE} \right), \, {\mathbf{x}} \ge {\mathbf{0}},\, a \ge 0, \, {\mathbf{v}} \ge {\mathbf{0}}, \, {\mathbf{u}} \ge {\mathbf{0}}, \,{\mathbf{s}}^{x} \ge {\mathbf{0}}, \,{\mathbf{s}}^{y} \ge {\mathbf{0}},\,\omega {\text{ free in sign}}$$

such that

$$\begin{aligned} \sum\limits_{k \in CE} {x_{nk} \lambda_{k} = x_{n} - s_{n}^{x} ,\;\;\;\; n=1,...,N}, {\mathbf{u}} \le {\mathbf{p}}, \\ y_{m} = \sum\limits_{k \in CE} {y_{mk} \lambda_{j} = y_{mo} - d_{m}^{{}} + s_{m}^{y} , m=1,...,M}, \\ \left( {{\mathbf{w}}/c} \right)^{\text{T}} {\mathbf{x}} + \left( {{\mathbf{p}}/c} \right)^{\text{T}} {\mathbf{d}} \le 1, \\ {\mathbf{w}}^{\text{T}} {\mathbf{x}} + {\mathbf{p}}^{\text{T}} {\mathbf{d}} = {\mathbf{u}}^{\text{T}} {\mathbf{y}}_{o} + \omega \\ 1 & = \sum\limits_{k \in CE} {\lambda_{k} }~, \\ & \,\,\,\, - \sum\limits_{n \in N} {\nu_{n} x_{nk} } + \sum\limits_{m \in M} {u_{m} y_{mk} + \omega + t_{k} = 0, k \in CE}, \\ & \,\,\,\, - v_{n} + \left( {\frac{{w_{n} }}{c}} \right)a \ge 0, n=1,...,N, \\ & \,\,\,\, - u_{m} + \left( {\frac{{p_{m} }}{c}} \right)a \ge 0, m=1,...,M, \\ v_{n} & \ge \frac{1}{{2N g_{n}^{x} }}, n=1,...,N, \\ u_{m} & \ge \frac{1}{{2Mg_{m}^{y} }}, m=1,...,M, \\ \begin{array}{*{20}c} {\lambda_{k} \le {\mathbb{M}}b_{k} ,~~ t_{k} \le {\mathbb{M}}\left( {1 - b_{k} } \right), k \in CE} \\ \end{array} \\ \end{aligned}$$

where \({\mathbb{M}}\) is a large positive quantity.

Proposition 2 can be established by adapting Aparicio et al. (2007) and Zhu et al. (2018). Proposition 2 does not guarantee the satisfaction of profit maximization conditions. In the empirical section, we reported the number of the DMUs which fail to satisfy these conditions.

Now we obtain product multipliers according to the following two-step procedure.

Step 1:

Identify the adapted cost-efficient \({\text{DMUs}}\) based upon (14) or (13). The cost efficiency of the assessed \(DMU_{o}\), denoted \(CEff_{o}\), is estimated as

$$\begin{array}{*{20}c} {CEff_{o} = \frac{{C\left( {{\mathbf{y}} ,{\mathbf{w}}} \right)}}{{\overline{c}_{o} }}} \\ \end{array}.$$
(23)

which represents the ratio of minimum adapted cost to the observed adapted cost. The scalar \(\bar{c}_{o} = {\mathbf{w}}^{\text{T}} {\mathbf{x}}_{o} \) is \(DMU_{o}\)’s observed total cost. The cost efficiency of \(DMU_{o}\) refers to the ability to make use of fewer resources to achieve the same product levels. If \(CEff_{o}\) = 1, i.e., \(C\left( {{\mathbf{y}}_{o} ,{\mathbf{w}}} \right) = \overline{c}_{o}\), then the \(DMU_{o}\) is cost-efficient. If \(0 < CEff_{o} < 1\), i.e.,\({ }C\left( {{\mathbf{y}}_{o} ,{\mathbf{w}}} \right) < \overline{c}_{o}\), then the \(DMU_{o}\) is cost inefficient.

Let \(C{\rm E}\) be an index set of adapted cost-efficient \({\text{DMUs}}\) (observations) with the adapted cost-efficiency score being a value of one and let \({\mathbb{M}}\) be some positive large number. Using these notations, we introduce Step 2 as follows:

Step 2:

Solve the following mixed integer linear programming problem, which minimizes the cost-based slacks (mdCBS) relative to the restricted output possibility set.

$$mdCBS\left( {{\mathbf{y}}_{o} ,{\mathbf{w}}/c_{o} ,{\mathbf{p}}/c_{o} } \right) = {\text{min}}\left\{ {\begin{array}{*{20}c} {\frac{1}{2 N}\mathop \sum \limits_{n = 1}^{N} \frac{{s_{n}^{x} }}{{g_{n}^{x} }} + } \\ {\frac{1}{2M}\mathop \sum \limits_{m = 1}^{M} \frac{{s_{m}^{y} }}{{g_{m}^{y} }}} \\ \end{array} \left| {\begin{array}{*{20}c} {\mathop \sum \limits_{k \in CE} {\mathbf{x}}_{k} \lambda_{k} = {\mathbf{x}} - {\mathbf{s}}^{x} ,{ }{\mathbf{x}} \ge {\mathbf{0}},~~{\mathbf{s}}^{x} \ge {\mathbf{0}}}, \\ {\mathop \sum \limits_{k \in CE} {\mathbf{y}}_{k} \lambda_{k} = {\mathbf{y}}_{o} - {\mathbf{d}} + {\mathbf{s}}^{y} ,~~{\mathbf{d}} \ge {\mathbf{0}},~~{\mathbf{s}}^{y} \ge {\mathbf{0}}}, \\ {\left( {{\mathbf{w}}/c_{o} } \right)^{\text{T}} {\mathbf{x}} + \left( {{\mathbf{p}}/c_{o} } \right)^{\text{T}} {\mathbf{d}} \le 1}, \\ {\mathbf{w}}^{\text{T}} {\mathbf{x}} + {\mathbf{p}}^{\text{T}} {\mathbf{d}} = {\mathbf{u}}^{\text{T}} {\mathbf{y}}_{o} + \omega\\ {\mathop \sum \limits_{k \in CE} \lambda_{k} = 1,~{{\varvec{\uplambda}}} \ge {\mathbf{0}}}, \\ { - {\mathbf{v}}^{\text{T}} {\mathbf{x}}_{k} + {\mathbf{u}}^{\text{T}} {\mathbf{y}}_{k} + \omega + t_{k} = 0,~k \in CE}, \\ {\left( {{\mathbf{w}}/c_{o} } \right)a \ge {\mathbf{v}},{ }{\mathbf{v}} \ge {\mathbf{0}},} \\ {\left( {{\mathbf{p}}/c_{o} } \right)a \ge {\mathbf{u}},{ }{\mathbf{u}} \ge {\mathbf{0}},} \\ {v_{n} \ge \frac{1}{{{ }2N g_{n}^{x} { }}}{ },~~n=1,...,N} \\ {u_{m} \ge \frac{1}{{{ }2Mg_{m}^{y} { }}}{ },~~m=1,..., M} \\ {\lambda_{k} \le {\mathbb{M}}b_{k} ,~~t_{k} \le {\mathbb{M}}(1 - b_{k} ),~~k \in CE} \\ {b_{k} :{\text{binary variable for }}k \in CE} \\{\mathbf{u}} \le {\mathbf{p}}\\ \end{array} } \right.} \right\}$$
(24)

where \(g_{n}^{x}\) and \(g_{m}^{y}\) represent, respectively, the n-th input and the m-th product directions that are at the discretion of the researcher or DMUs, and \({\mathbb{M}}\) is a large number. In Eq. (24), the input target and the product target vectors at the adapted cost frontier are obtained as \(\mathop \sum \limits_{j \in E} {\mathbf{x}}_{j} \lambda_{j}^{*}\) and \(\mathop \sum \limits_{j \in E} {\mathbf{y}}_{j} \lambda_{j}^{*}\), respectively. In their efficiency measurement, Fukuyama and Weber (2009) employed the objective function such as that of (21), that is,

$${ }\frac{1}{2N }\mathop \sum \limits_{n = 1}^{N} \frac{{s_{n}^{x} }}{{g_{n}^{x} }} + \frac{1}{2M }\mathop \sum \limits_{m = 1}^{M} \frac{{s_{m}^{y} }}{{g_{m}^{y} }}{ }$$

A pair of (20) and (24) form an adapted cost function extension of the two-step procedures of Aparicio et al. (2007) and Zhu et al. (2018). Since \(C\left( {{\mathbf{y}},{\mathbf{w}}/c,{\mathbf{p}}/c} \right) \le 1\) \(\Leftrightarrow { }C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right) \le c\) from the definition of the adapted cost function, we have

$$\begin{array}{*{20}c} {\frac{{\partial C\left( {{\mathbf{y}},{\mathbf{w}}/c_{o} ,{\mathbf{p}}/c_{o} } \right)}}{{\partial y_{m} { }}} = u_{m}^{*} \left( {{\mathbf{y}},{\mathbf{w}}/c_{o} ,{\mathbf{p}}/c_{o} } \right) ~~ and ~~ \frac{{\partial C\left( {{\mathbf{y}}_{o} ,{\mathbf{w}},{\mathbf{p}}} \right)}}{{\partial y_{m} { }}} \times \frac{1}{{c_{o} }} = u_{m}^{*} \left( {{\mathbf{y}}_{o} ,{\mathbf{w}},{\mathbf{p}}} \right) \times \frac{1}{{c_{o} }}} \\ \end{array} ,$$
(25)

hence

$$\begin{array}{*{20}c} {u_{m}^{*} = u_{m}^{*} \left( {{\mathbf{y}}_{o} ,{\mathbf{w}},{\mathbf{p}}} \right) = u_{m}^{{**}} \left( {{\mathbf{y}},{\mathbf{w}}/c_{o} ,{\mathbf{p}}/c_{o} } \right) \times c_{o} } \\ \end{array}$$
(26)

where \(u_{m}^{{**}} { }\left( {{\mathbf{y}},{\mathbf{w}}/c_{o} ,{\mathbf{p}}/c_{o} } \right)\) are the estimates from (24).

Our two-step procedure for estimating the adapted cost function constructs the adapted cost-efficient frontier implicitly (or indirectly) and then provides product multipliers \({\mathbf{u}}^{*} { }\), hence the Lerner sub-indexes and the multi-product index in (17).

Remark 1

Since (24) possesses all constraints associated to the multiplier and envelopment forms of the adjusted cost function \({ }C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right)\), the set of all constraints in (24) ensures that the projection point of the inputs and products is on the adapted cost-efficient frontier \(\partial_{o}^{{}} \left( {IP\left( {{\mathbf{w}}/c,{\mathbf{p}}/c{ }} \right)} \right)\). Consequently, the multipliers associated with products are all positive due to the optimization problem (24).

Remark 2

For the fixed levels of \({\mathbf{w}},\) as well as \({\mathbf{p}},\) the adapted cost-efficient frontier is constructed by spanning the adapted cost-efficient DMUs’ input and product vectors, while keeping \(C\left( {{\mathbf{y}},{\mathbf{w}},{\mathbf{p}}} \right) = c\).

3 Data

The real estate sector is connected to many other industries such as steel, glass, wood, and home appliances. It is a leading industry that drives economic growth. For every one-unit increase in the value-added of the real estate sector in China, the total driving impact on each industry is 1.416 units (Wang and Liu 2004). According to the data provided by the National Bureau of Statistics in China, the real estate industry’s added value continues to increase as a proportion of GDP, from 4.61 in 2008 to 6.6% in 2018. However, the real estate companies are facing increasing inventory pressure and financial pressure. The real estate industry's development model that relies on real estate development to obtain high profits is unsustainable, and it is in urgent need of transformation and upgrading to achieve sustainable development. The housing issue is not only an economic issue, but also an important livelihood issue, a common concern of the government, enterprises, and the people. As a leading industry, how the real estate industry can reasonably use natural resources and the ecological environment while achieving its own healthy and sustainable development and still provide the basis for the development of other industries is the focus of academic research.

The continuous improvement of the market system and the shift of land policy from free allocation to bidding, auction and listing have made the institutional barriers to entry continue to weaken, and more and more companies have begun to enter the real estate market. A growing number of listed companies have diversified into the real estate field. In 1998, the real estate marketization reforms and a period of rapid development had come for the real estate enterprises. In 1986, the total number of real estate companies nationwide was only 1991. In 1998, it increased to 24,378, and by 2018 it had increased to 97,938. It is worth mentioning that the real estate market was dominated by state-owned enterprises and collective enterprises during the early stage of the market development. The market share occupied by state-owned enterprises was 32.6% in 1998. Because of the increase in the participation of private enterprises derived from the deepening of the real estate reform, the market share of state-owned enterprises experienced a continuous decline. In 2018, the proportion of state-owned enterprises was only 0.8%.

We collected the data of 31 provinces/cities in China from the National Bureau of Statistics in China over the period 2002–2018. The 31 provinces/cities are further divided into three different geographical areas (Eastern area, Central area and Western area). According to the classification provided by the National Bureau of Statistics of China, the Eastern area includes Beijing, Tianjin, Hebei, Shanghai, Jiangsu, Zhejiang, Fujian, Shandong, Guangdong, Hainan, Liaoning, and Heilongjiang; the Central area includes Shanxi, Anhui, Jiangxi, Henan, Hubei, and Hunan, and the Western area includes Inner Mongolia, Guangxi, Chongqing, Sichuan, Guizhou, Yunnan, Tibet, Shan Xi, Gansu, Ningxia, and Xinjiang.

In order to estimate different types of Lerner indices, we need inputs, input prices, products, and product prices according to the models described in the methodology section. This study considers two inputs, which are land areas purchased by the real estate development enterprises, and real estate enterprises’ completed housing areas. We consider the land areas purchased by the real estate development enterprises as one input due to the fact that they can be regarded as the raw materials, based on which the real estate enterprises will build the houses, however, the completed housing area is different from the land areas purchased due to the fact that the former will be much smaller than the latter and the difference is attributed to the extent to which the enterprises can fully use the raw materials. Therefore, these two different inputs reflect different aspects of company operation and management. The two related input prices are expenses for purchasing one square meter of land area, and real estate development enterprises’ completed housing costs. There are two products included in the current study: sales area for commodity housing and residential commodity houses’ sales area; the related two product prices are per square meter revenue of commodity housing as well as per square meter sales revenue of residential commodity houses. Different types of outputs (houses) will be generated by the real estate enterprises including commodity housing, residential commodity housing, villa and high-end apartment, commodity house for offices, houses for business use, and other commodity houses. Based on the statistics provided by the National Bureau of Statistics in China, the commodity houses and residential houses are the most two important types of outputs (houses) generated by the real estate enterprises, as reflected by the volumes of sales areas and the volumes of sales. Since the adapted cost c is implied by adapting the standard cost function with (12), we calculate it as the sum of the observed cost and observed revenue, i.e., \(c_{o} = {\mathbf{w}}^{\text{T}} {\mathbf{x}}_{o} + {\mathbf{p}}^{\text{T}} {\mathbf{y}}_{o}\).

Table 1 shows the descriptive statistics of the variables used in the analysis. The table shows that there is a small difference in the level of product prices. In comparison, there are big differences in terms of inputs and products, while per square meter revenue of houses sold has quite small variations.

Table 1 Descriptive statistics of the variables used in the analysis

4 Results

Table 2 gives the cost-efficient DMUs according to the standard and adapted restricted cost functions. Columns 2 and 3 represent two types of cost-efficient cities/provinces: one is based on the adapted cost function and the other is on the standard cost function. The table indicates that all the adapted cost-efficient cities/provinces are standard cost-efficient. Columns 4 and 5 confirms that all cities are maximizing their profits as our proposed framework suggests. In terms of the commodity housing market, different cities across different years are not maximizing their profits. More specifically, in 2005, there are two cities: Shanghai and Beijing. Jiangsu and Shandong are not maximizing their profits in 2007 and 2008, respectively. In 2009, Shanghai and Sichuan are not maximizing their profits. There are two cities in 2010 that are not maximizing their profits, including Sichuan and Shan Xi. Shanxi, Sichuan, Hunan, and Henan are not maximizing their profits in 2011. Tibet is not maximizing its profits in 2012, 2013, and 2016, and, finally, Shanxi is not maximizing its profit in 2014. In terms of the residential housing market, we only find the cities that are not maximizing their profits in 2002, 2003, 2004, 2015, and 2018. In 2002, they are Qinghai, Hunan, Jiangxi, Shanxi, Tibet, and Inner Mongolia. The first four cities are not profit-maximizing in 2003 as well. Shan Xi, Tibet, and Henan are not profit-maximizing in 2003, 2015, and 2018, respectively.

Table 2 Cost efficient provinces/cities and number of non-profit-maximizing DMUs

Before presenting the overall aggregate Lerner index and the Lerner sub-indices for the sample and the ones based on geographical classifications, we provide the results of the price levels and marginal costs of three different provinces/cities to further show and confirm our contributions to the literature by providing positive marginal costs (multipliers) in a multi-product context. Table 3 provides the empirical result for three provinces/cities including Xinjiang, Jiangxi, and Beijing. Xinjiang is in the Western area, Jiangxi is in the central area, and Beijing is in the Eastern area. To illustrate, consider Xinjiang in 2009. The product multipliers satisfy the constraints (12), i.e., p1 = 0.2604 > u1 = 0.07544, and p2 = 0.24661 > u2 = 0.00019. The Lerner sub-indexes of product markets 1 and 2 are 0.7103 and 0.9992, respectively. For the same year, in terms of Jiangxi, p1 = 0.26428 > u1 = 0.10749, and p2 = 0.25168 > u2 = 0.00012. The Lerner sub-indices of product markets 1 and 2 are 0.5933 and 0.9995, respectively. Finally, for Beijing in 2009, p1 = 1.37990 > u1 = 0.29978, and p2 = 1.32243 > u2 = 0.00013. The Lerner sub-indices of product markets 1 and 2 are 0.7827 and 0.9999, respectively.

Table 3 Multiplier estimates of three DMUs in 2009

To estimate the multipliers of products in (21), we set the direction vector as the observations. We first look at the Lerner indices over the examined period of 2002–2018, the results of which are reported in Fig. 1. The figure shows that, in a general trend, the Lerner sub-index 2 is the highest, followed by the aggregate Lerner index and the Lerner sub-index 1. This result means that the Chinese real estate enterprises compete much more strongly in commercial commodity housing than that in residential housing. The higher level of competition in the commercial commodity housing can be explained by the fact that 1) there are many real estate enterprises focusing on commercial commodity housing and most of the enterprises have been listed on stock exchange. The stock market listing of these enterprises not only help them optimize the resource and further improve the corporate governance, the resulted more transparency to the public monitoring and their commitment to key stakeholders will induce more efforts to improve their operations and management. Therefore, they intend to compete more strongly to obtain higher profits not only to provide returns to the existing shareholders, but also attract more potential investors; 2) the real estate enterprises providing commercial commodity housing can be divided into two groups, one belongs to domestic enterprises and the other mainly includes the enterprises with investment outside of mainland China, such as Taiwan, Macau and Hong Kong. The enterprises do not only compete with other ones within their own category, but also compete with the ones from the other category. This further promote the level of competition in this real estate segment. Based on the statistics from Forward-The Economist, the concentration in the commercial commodity housing is quite low. It shows that the 10-enteprise concentration ratio is 10.35% in 2020, the 30-enterprises and 50-enterprise concentration ratios in the same year are 16.56% and 19.31%, respectively. These ratios are even higher than the ones in 2019.

Fig. 1
figure 1

Lerner sub-index 1, Lerner sub-index 2 and revenue-share weighted aggregate Lerner index

Fig. 2
figure 2

Lerner sub-index 1, Lerner sub-index 2 and revenue-share weighted aggregate Lerner index in the Eastern area

Fig. 3
figure 3

Lerner sub-index 1, Lerner sub-index 2 and revenue-share weighted aggregate Lerner index in the Central area

Fig. 4
figure 4

Lerner sub-index 1, Lerner sub-index 2 and revenue-share weighted aggregate Lerner index in the Western area

Fig. 5
figure 5

Lerner sub-index 1 among different areas of China

Fig. 6
figure 6

Lerner sub-index 2 among different areas of China

Fig. 7
figure 7

Revenue-share weighted aggregate Lerner index among different areas of China

Fig. 8
figure 8

Lerner indices of all the cities in China

Not only do we consider the market power of the sample, but also, we examine it for three different geographical areas (Eastern area, Central area, and Western area) in China, and Figs. 2, 3, and 4 report the results. We can see that all these three figures show a similar trend in general with the Lerner sub-index 2 being the highest, followed by the aggregate Lerner index and the Lerner sub-index 1. Starting from Fig. 2, we observe that the Lerner sub-index 2 experiences higher volatility compared to the Lerner sub-index 1. This indicates that the level of market power in residential commodity houses varies to a larger extent across the examined period, while there is relatively more stable market power in the commercial commodity housing market. This is also the case for Fig. 3 and Fig. 4. The highest volatility in the residential commodity houses across all three different geographical areas in China can be explained by the fact that as the economic development and growth took place continuously over the examined period, the peoples’ increase in disposable income increases their demand on changing the living conditions. This pushed up the house price and further induce the real estate enterprises to compete strongly in this market segment. Because of high profitability of this market segment, it attracted more enterprise to enter into this market segment or the existing real estate enterprises to increase the share of their residential commodity housing in their overall portfolio. Meanwhile, because of higher competition, some of the enterprises were unfortunately dropped out from the market. Both of these two different scenarios contribute to the volatility of market power over this period.

Further to the previous analysis we provided, we look at these three different Lerner indices across different geographical areas in a separate manner. Figures 5, 6, and 7 show the findings. Figure 5 shows that there is no clear comparison in terms of the level of market power among the Eastern, Central, and Western areas when using the sales area for commodity housing as the product; however, we notice that in general, the western area has the highest level of market power up to 2014. Not only do we observe the highest level of market power in the western area in general, but also we find that the market power in the western area experiences the highest volatility. This highest volatility can be attributed to the policy of “developing the western area” in China. As one of the key industries in the economic sector, the real estate industry actively participated and contributed to the development of this area by gradually increasing their investment in this area. This results in an increase in the level of competition. Because of the co-existence of opportunities and threats, the former is derived from preferential policies granted to the real estate enterprises, which facilitate their increase in profitability; while the latter is mainly attributed to the higher level of competition, which drives out the ones with lower level of competitiveness. Because of entry and exist of the real estate enterprise in this area, the market power experiences the highest level of volatility. In terms of the residential commodity housing, we can not see a clear comparison among the levels of market power in these three areas, while different from Fig. 5, we find that the central area experiences the highest volatility in the residential commodity housing. Considering the results from both Fig. 5 and Fig. 6, we deduce that there is a relatively quite stable market power in the commercial commodity housing in the central area (commodity housing includes both commercial commodity housing and residential commodity housing). Finally, Fig. 7 shows that the western area, in general, has the highest aggregate Lerner indices over the period and has the highest volatility.

Finally, we look at the level of market power for each of the 31 cities in the sample over the period 2002–2018, the findings of which are reported in Fig. 8. In terms of Lerner sub-index 1, we find that Tibet has the highest market power, while the market power in Shandong is the lowest. When looking at the residential housing market, the Lerner sub-index 2 shows that Tibet has the highest market power, while Sichuan has the lowest market power. Finally, regarding the aggregate Lerner index, the findings show that Tibet has the highest market power and Shandong has the lowest market power.

5 Conclusion

The current paper provides new knowledge and significantly extends the study of Fukuyama and Tan (2022a) by presenting a multi-product Lerner index, which is a revenue-share-weighted index consisting of Lerner sub-indexes within a minimum-distance DEA framework. In order to estimate the DEA-based multi-product index, two issues were considered: one is the implementation of the profit maximization conditions, in which the multi-product Lerner index, as well as the sub-indices, take a value between zero and one, and the other is the issue of having several zero multiplier values. To deal with this circumstance, exploiting Shephard’s (1970) indirect output possibility set (see also Färe et al. 1994) and introducing the adapted cost function, we have developed a minimum-distance cost function estimation procedure, in which the adapted cost function is estimated indirectly. In addition, we further build on the work of Fukuyama and Tan (2022a) by addressing the issue of negative values of the Lerner index by proposing a method to provide non-negative marginal costs for all the decision-making units. We applied our innovative approach to the Chinese real estate industry. The result shows that the Chinese real estate industry has higher market power in the residential commodity housing market than that in the commodity housing market. This is also the case for different geographical areas in China. Overall, for both of these two different markets, the level of market power experiences a level of volatility. We finally identify the market power level on a province/city level.

Our results provide important policy implications: (1) in the commercial commodity housing market, the real estate companies should find ways to increase their market power. This would improve the financial stability because higher competition results in higher risk levels. The real estate companies take out a large amount of bank loans, the higher risk increases the probability of loan default and leads to financial instability; (2) across the three geographical areas in China, efforts should be made not only to increase the level of market power in a more consistent and sustainable manner; (3) from a city-level perspective, the small-sized real estate enterprises should further increase their market power to avoid their unfortunate drop-out derived from higher levels of competition; (4) regarding increasing the level of market power, the policy aims to potentially improve the financial stability in the banking industry by reducing the risk-taking behavior of real estate firms. However, a higher level of market power does not necessarily lead to a positive outcome because it does not provide incentives to company managers for performance improvement, the government should consider both these two effects on the real estate sector, and the well-coordinated policies would balance the development of these two industries.

Future research can further develop the existing model by dividing the production process into stages and considering the undesirable product in the production process. Finally, data on different components of non-residential commodity houses can be collected to check the robustness of our results.