1 Introduction

Cost function estimation is one of the most popular approaches to analyzing a firm’s productivity and efficiency, particularly in public service provision. There are innumerable examples of cost function applications (see e.g., the number of references in Blank 2000). A cost function is a dual representation of a production technology. Under the assumption of cost-minimizing behavior at given service levels and resource prices, it provides a mathematical relationship between actual costs on the one hand, and exogenous service levels and resource prices on the other. However, it is questionable whether the cost function approach provides a sufficiently realistic representation of actual economic behavior and the economic context in which it occurs. In many public service industries, firms are constrained by a (cost) budget rather than by service levels. There is also sizeable body of literature suggesting that public service firms have motives other than minimizing costs or maximizing profits.

Cost-constrained technologies can be represented by a cost indirect output distance (CIOD) function. This cost indirect output distance function represents the largest factor by which service levels can be improved without the cost involved exceeding a given budget. It is a multiple output version of the well-known production function. The CIOD has only been applied in limited empirical research (see e.g., Grosskopf et al. 1997; Hayes et al. 1998; Blank and Merkies 2004). Since the CIOD requires the same type of data as a cost function, it is not clear why this approach has not been applied more widely.

Since the late 1980s, a number of authors have proposed non-cost minimizing behavior, input preference and bureaucratic models (Eakin and Kniesner 1988; Dor et al. 1997; Eakin 1993; Atkinson and Cornwell 1994; Kumbhakar 1997; Maietta 2000; Atkinson and Primont 2002). They all suggest using a shadow cost function, expressed in terms of exogenous service levels and shadow resource prices. Shadow resource prices (internal to the firm) may differ from actual prices. It is assumed that firms minimize (shadow) cost on the basis of these shadow prices. Shadow cost and shadow resource prices are expressed in terms of actual resource prices and actual cost shares and can be estimated. In fact, the goal of the model is the set of (shadow) resource prices that best fits the actual cost and cost shares under the minimizing-cost assumption. Based on the principle of shadow pricing, Rodrigues-Alvarez and Lovell (2004) derive a system consisting of an input distance function and corresponding resource demand equations. They estimate the structural under- or over-utilization (expense preference) by estimating an extra intercept in each of the resource demand equations.

Although the CIOD satisfies the cost constraint rather than service-level constraints, it still fails to take the input preference behavior into account. The shadow cost function (SCF), on the other hand, deals with the input preference behavior but fails when it comes to cost constraint. The approach proposed by Rodrigues-Alvarez and Lovell (2004) has another limitation. Their approach breaks the direct relationship between the input distance function and the resource demand equations. The result of this is that any relationship between the under- and over-utilization of resource quantities and firm size, relative input prices and fixed capital is ignored. Potential structural misallocations are assumed to be constant across firms and independent of the characteristics of a firm.

In this paper, I suggest combining the cost-constrained technology approach and the shadow-cost approach by deriving a Shadow Cost Indirect Output Distance Function (SCIOD). This paper will derive an exact relationship between services delivered, actual cost, actual cost shares and actual resource prices. This approach also provides separate measures for technical and allocative inefficiencies in a straightforward manner. In the limited number of studies relating to the CIOD approach, this decomposition has been neglected.

In Sect. 2 I will discuss some of the theoretical issues relating to the CIOD and the SCIOD. Mathematical expressions for the SCIOD and the corresponding resource-demand equations are derived. The expressions for the ray output elasticity and marginal cost are also presented. In Sect. 3, a complete system for the SCIOD based on a translog function is derived, consisting of the indirect output distance function and corresponding resource-demand equations. Section 4 includes some comments on the estimation procedure. The conclusions follow in Sect. 5.

2 Theoretical issues

Non-optimal behavior can be modeled by assuming that the firm is virtually observing a set of input (or output) prices that differs from actual prices. Observed input or output quantities are optimal in relation to these virtually observed prices. These virtual prices are referred to as shadow prices. The aim is to establish these shadow prices, theoretically and empirically.

Before deriving the SCIOD mathematically, the approach is illustrated by Fig. 1. Figure 1 represents two indirect output sets. An indirect output set IP(w/C) is defined as the outer envelope of the direct output sets P(x) for which the cost of the inputs does not exceed a fixed amount of cost C, i.e., wx ≤ C (Färe and Primont 1995, p. 83). As relative input prices change, the indirect output set will also change. This alternative vector of input prices is denoted by w*, the corresponding cost by C* and the resulting indirect output set by IP(w*/C*).

Fig. 1
figure 1

The cost indirect output set and the shadow cost indirect output set

Firm B is observed and produces \( (y_{1}^{\text{B}} ,y_{2}^{\text{B}} ) \). According to the actual input prices and cost, the production of both products of firm B could be expanded by the factor (OB″/OB). The boundary of the indirect production set functions as the reference set for each observation. At shadow input prices w* (and corresponding cost C*) the reference is determined by the shadow cost indirect production set, in which case the expansion factor is reduced to (OB′/OB). The latter ratio is interpreted as technical inefficiency. The ratio (OB″/OB′), which is the result of a reallocation of inputs due to a change in relative prices (i.e., the shadow input prices), can be interpreted as allocative inefficiency. Shadow input prices are established by searching for the vector of (shadow) input prices which minimizes the expansion factor (OB′/OB).

A cost-constrained output technology is defined as the set of all services that can be produced at a cost not exceeding C. The cost indirect output distance function is defined as the maximal expansion (or the minimal contraction of the reciprocal) of services delivered that can be produced by a combination of resources which satisfies the cost constraint (see Balk 1998, p. 141 and Färe and Primont 1995):

$$ ID_{\text{o}} (y,w/C) = \inf \{ \theta > 0:(y/\theta ) \in P(x),wx \le C\} $$
(1)

where ID o = cost indirect output distance; y = vector of services delivered; w = vector of resource prices; x = vector of resources; C = cost; P(x) = production possibility set; θ is a scalar.

It should be noted that inefficiency derived from the indirect output distance is also due to a non-optimal allocation of resources. The inefficiency thus includes technical and allocative inefficiency.

The optimal resource demand equations corresponding to the cost indirect output distance function are derived from Roy’s identity (see Färe and Primont 1995, p. 92):

$$ S_{j} = {{\frac{{w_{j} x_{j} }}{C} = \frac{{\partial \ln ID_{o} \left( {y,\frac{w}{C}} \right)}}{{\partial \ln \left( {\frac{{w_{j} }}{C}} \right)}}} \mathord{\left/ {\vphantom {{\frac{{w_{j} x_{j} }}{C} = \frac{{\partial \ln ID_{o} \left( {y,\frac{w}{C}} \right)}}{{\partial \ln \left( {\frac{{w_{j} }}{C}} \right)}}} {\sum\limits_{n} {\left[ {\frac{{\partial \ln ID_{o} \left( {y,\frac{w}{C}} \right)}}{{\partial \ln \left( {\frac{{w_{n} }}{C}} \right)}}} \right]} }}} \right. \kern-\nulldelimiterspace} {\sum\limits_{n} {\left[ {\frac{{\partial \ln ID_{o} \left( {y,\frac{w}{C}} \right)}}{{\partial \ln \left( {\frac{{w_{n} }}{C}} \right)}}} \right]} }} $$
(2)

where S j  = cost share resource j (j = 1,…,N); w j  = jth resource price (j = 1,…,N).

It should be noted that only under non-constant returns to scale technologies the indirect output distance approach has an added value compared to a cost function approach. As shown by Färe and Primont (1995, p. 83), the indirect output distance function equals the cost function if and only if the technology exhibits constant returns to scale.

Along the lines of the original shadow cost function approach, as discussed by Kumbhakar (1997), I assume that w* is a vector of shadow resource prices that corresponds to output maximizing resource quantities x. Note that here it is assumed that actual resource quantities correspond with the optimal values. Shadow cost C* is the corresponding cost at shadow resource prices w*. Therefore, the vector of shadow resource prices w* is the solution to the following system of equations:

$$ ID_{\text{o}} \left (y,\frac{{w}^{\ast}}{{C}^{\ast}}\right) = 1 $$
(3)
$$ S_{j}^{\ast} = s_{j} \left (y,\frac{{w}^{\ast}}{{C}^{\ast}}\right) $$
(4)

where, \( S_{j}^{\ast} \) = shadow cost share resource j (j = 1,…,N); \( w_{j}^{\ast} \) = shadow resource price j (j = 1,…,N); C* = shadow cost; and s j (.) (j = 1,…,N) are the corresponding equations derived from Roy’s identity (see Eq. 2). Shadow cost and shadow cost shares are deducted from actual resource quantities and shadow resource prices.

It is assumed that the cost indirect output distance function is a continuous twice differentiable function that satisfies all the duality requirements, such as non-decreasing and quasi concave in w*/C* and non-decreasing, convex and homogeneous of degree one in y. It is further assumed that (firm-specific) shadow resource prices are proportional to actual resource prices:

$$ w_{j}^{\ast} = k_{j} \cdot w_{j} $$
(5)

where, k j  = distortion factor of resource j.

Note that k j can vary between firms. For reasons of simplicity, we do not denote the firm by a separate index. Since we assume that the vector of actual resource quantities x are the optimal quantities at shadow resource price w*, we calculate the algebraic relationship between actual and shadow cost as follows:

$$ C^{\ast} = \sum\limits_{j} {w_{j}^{\ast} x_{j} = \sum\limits_{j} {k_{j} w_{j} x_{j} } } \left( {y,\frac{{w^{\ast} }}{{C^{\ast} }}} \right) = \sum\limits_{j} {k_{j} \frac{{w_{j} x_{j} \left( {y,\frac{{w^{\ast} }}{{C^{\ast} }}} \right)}}{C}} \cdot C = C \cdot \sum\limits_{j} {k_{j} S_{j} } $$
(6)

For computational ease we define:

$$ G_{j} = \left( {\frac{{k_{j} }}{{\sum\limits_{j} {k_{j} S_{j} } }}} \right) $$
(7)

For reasons of convenience, we also derive an expression for \( \ln \left( {\frac{{w_{j}^{\ast} }}{{C^{\ast} }}} \right) \) by substituting Eqs. 5 and 7:

$$ \ln \left( {\frac{{w_{j}^{\ast} }}{{C^{\ast} }}} \right) = \ln \left( {\frac{{w_{j} }}{C}} \right) + \ln (k_{j} ) - \ln\left (\sum\limits_{j} {k_{j} S_{j} }\right) = \ln \left( {\frac{{w_{j} }}{C}} \right) + \ln \left( {G_{j} } \right) $$
(8)

Substituting shadow resource prices and shadow cost into Eq. 2 yields:

$$ S_{j}^{\ast} {{ = \frac{{\partial \ln ID_{\text{o}} \left( {y,\frac{{w^{\ast} }}{{C^{\ast} }}} \right)}}{{\partial \ln \left( {\frac{{w_{j}^{\ast} }}{{C^{\ast} }}} \right)}}} \mathord{\left/ {\vphantom {{ = \frac{{\partial \ln ID_{\text{o}} \left( {y,\frac{{w^{\ast} }}{{C^{\ast} }}} \right)}}{{\partial \ln \left( {\frac{{w_{j}^{\ast} }}{{C^{\ast} }}} \right)}}} {\sum\limits_{n} {\frac{{\partial \ln ID_{\text{o}} \left( {y,\frac{{w^{\ast} }}{{C^{\ast} }}} \right)}}{{\partial \ln \left( {\frac{{w_{n}^{\ast} }}{{C^{\ast} }}} \right)}} \cdot } }}} \right. \kern-\nulldelimiterspace} {\sum\limits_{n} {\frac{{\partial \ln ID_{\text{o}} \left( {y,\frac{{w^{\ast} }}{{C^{\ast} }}} \right)}}{{\partial \ln \left( {\frac{{w_{n}^{\ast} }}{{C^{\ast} }}} \right)}} \cdot } }} $$
(9)

From this, we derive the actual cost shares by substituting (5) in (9) and using the expression (7):

$$ S_{j} = \frac{{w_{j} x_{j} }}{C} = \frac{{\left( {\frac{{w_{j} }}{{w_{j}^{\ast} }}} \right) \cdot w_{j}^{\ast} \cdot x_{j} }}{{C^{\ast} \cdot \left( {\frac{1}{{\sum\limits_{j} {k_{j} S_{j} } }}} \right)}} = \left( {\frac{{\sum\limits_{j} {k_{j} S_{j} } }}{{k_{j} }}} \right) \cdot S_{j}^{\ast} = \frac{{S_{j}^{\ast} }}{{G_{j} }} $$
(10)

Equations 4 and 10 form an alternative system of the cost indirect output distance function based on shadow resource prices and shadow cost. In Sect. 3, we will derive a complete system of equations from a translog specification.

3 Translog function

One of the most popular functions in analyzing cost and output distance functions is the translog function (see Christensen et al. 1973). The translog function consists of single and second order terms of all variables in logarithms. In the case of the SCIOD this yields:

$$ \begin{gathered} \ln ID_{\text{o}} (y,\frac{{w^{\ast} }}{{C^{\ast} }}) = \left\{ {a_{0} + \sum\limits_{m} {b_{m} \ln \left( {y_{m} } \right) + \frac{1}{2}\sum\limits_{m} {\sum\limits_{{m}^{\prime}} {b_{{mm}^{\prime}} } \ln \left( {y_{m} } \right)\ln \left( {y_{{m}^{\prime}} } \right) + \sum\limits_{n} {c_{n} \ln \left( {\frac{{w_{n}^{\ast} }}{{C^{\ast} }}} \right)} } } } \right. + \hfill \\ \frac{1}{2}\sum\limits_{n} {\sum\limits_{{n}^{\prime}}{c_{{n}{{n}^{\prime}}} \ln \left( {\frac{{w_{n}^{\ast} }}{{C^{\ast} }}} \right)\ln \left( {\frac{{w_{{n}^{\prime}}^{\ast} }}{{C^{\ast} }}} \right)} } + \left. {\sum\limits_{m} {\sum\limits_{n} {e_{mn} \ln \left( {y_{m} } \right)\ln \left( {\frac{{w_{n}^{\ast} }}{{C^{\ast} }}} \right)} } } \right\} \hfill \\ \end{gathered} $$
(11)

Substituting Eqs. 5, 6, and 8 into Eq. 11 yields:

$$ 0 = \ln TL \left(y,\frac{w}{C}\right) + \ln AE $$
(12)

with:

$$ \begin{gathered} \ln TL\left(y,\frac{w}{C}\right) = \left\{ {a_{0} + \sum\limits_{m} {b_{m} \ln \left( {y_{m} } \right) + \frac{1}{2}\sum\limits_{m} {\sum\limits_{{{m}^{\prime}}} {b_{{mm}^{\prime}} } \ln \left( {y_{m} } \right)\ln \left( {y_{{{m}^{\prime}}} } \right) + \sum\limits_{n} {c_{n} \ln \left( {\frac{{w_{n} }}{C}} \right)} } } } \right. + \hfill \\ \frac{1}{2}\sum\limits_{n} {\sum\limits_{{{n}^{\prime}}} {c_{{{nn}^{\prime}}} \ln \left( {\frac{{w_{n} }}{C}} \right)\ln \left( {\frac{{w_{{{n}^{\prime}}} }}{C}} \right)} } + \left. {\sum\limits_{m} {\sum\limits_{n} {e_{mm} \ln \left( {y_{m} } \right)\ln \left( {\frac{{w_{n} }}{C}} \right)} } } \right\} \hfill \\ \end{gathered} $$
(13)

and:

$$ \begin{gathered} \ln AE = \sum\limits_{n} {c_{n} \ln \left( {G_{n} } \right) + \frac{1}{2}\sum\limits_{n} {\sum\limits_{{n}^{\prime}} {c_{{n}{{n}^{\prime}}} \ln \left( {G_{n} } \right)\ln \left( {G_{{n}^{\prime}} } \right) + } } } \frac{1}{2}\sum\limits_{n} {\sum\limits_{{n}^{\prime}} {c_{{n}{{n}^{\prime}}} \ln \left( {\frac{{w_{n} }}{C}} \right)\ln \left( {G_{{n}^{\prime}} } \right)} } + \hfill \\ \frac{1}{2}\sum\limits_{n} {\sum\limits_{{n}^{\prime}} {c_{{n}{{n}^{\prime}}} \ln \left( {G_{n} } \right)\ln \left( {\frac{{w_{{n}^{\prime}} }}{C}} \right)} } + \sum\limits_{m} {\sum\limits_{n} {e_{mn} } \ln \left( {y_{m} } \right)\ln \left( {G_{n} } \right)} \hfill \\ \end{gathered} $$
(14)

where, TL(.) = translog function; lnAE = allocative efficiency component.

Note that at the frontier ID o(y, w*/C*) = 1 and that the ratios of shadow resource prices and shadow cost (w n */C*) in (11) are substituted by the ratio of actual resource prices and actual cost (w n /C) and G n . The terms which include G n are consolidated in lnAE (14). lnAE represents the percentage of potential increase in services delivered due to allocative inefficiency. Allocative efficiency thus depends on the distortion factors k j , actual cost shares and services delivered.

Equation 11 has now analytically been separated in two components. One part consists of the relationship between actual services, resource prices and cost and the second part refers to the impact of using shadow resource prices and shadow cost. The first part reflects the frontier; the second part reflects the deviations from the optimal resources at actual resource prices (allocative inefficiency). A possible third part may reflect the distance to the frontier by radial expansion (technical efficiency component) and can be added to Eq. 12 and be derived econometrically (see Sect. 4).

By applying (10) to (11), cost share equations can be derived. For resource j the cost share equals:

$$ S_{j} = \left(\frac{S_{j}^{\ast}}{G_{j}}\right)= \left({\frac{1}{{G_{j}}}}\right)\times \frac{{c_{j} + \sum\nolimits_{n} {c_{jn} }\left[{\ln \left({\frac{{w_{n} }}{C}} \right)+\ln (G_{n} )} \right]+\sum\nolimits_{m} {e_{mj}} \ln (y_{m} )}} {{\sum\nolimits_{k} {[c_{k} + \sum\nolimits_{n} {c_{kn} } \left[ {\ln \left( {\frac{{w_{n} }}{C}} \right) + \ln (G_{n} )} \right] + \sum\nolimits_{m} {e_{mk} } \ln (y_{m} )]} }} $$
(15)

where, y m  = service m (m = 1,…,M); w n  = resource price n (n = 1, … ,N); a 0 , b m , b mm′ , c n , c nn′ , e mn parameters to be estimated.

A number of additional restrictions are required before we can estimate this model. Since the translog function is continuous and twice differentiable with respect to y and w, symmetry automatically holds;

$$ b_{{mm}^{\prime}} = b_{{{m}^{\prime}}{m}} ; \quad c_{{n}{{n}^{\prime}}}= c_{{{n}^{\prime}}{n}} $$

Since the SCIOD is linear homogenous in y, the following restrictions also have to be imposed on the parameters:

$$ \sum\limits_{m} {b_{m} } = 1;\quad \quad \sum\limits_{m} {b_{{mm}^{\prime}} } = 0\;({m}^{\prime} = 1, \ldots ,M);\quad \quad \sum\limits_{m} {e_{mn} = 0\;(n = 1, \ldots ,N)} $$

Economies of scale can be derived from the output ray elasticity, defined by (see Färe and Zieschang 1991):

$$ roe(.) = \sum\limits_{n} {\frac{{\partial \ln ID_{\text{o}} (.)}}{{\partial \ln (w_{n} /C)}}} $$
(16)

where, \( roe(.) \) = function for ray output elasticity.

Marginal cost of service y m equals (see Färe and Zieschang 1991):

$$ mc(y_{m} ) = \left( {\frac{C}{roe(.)}} \right) \cdot \frac{{\partial \ln ID_{\text{o}} (.)}}{{\partial \ln (y_{m} )}} $$
(17)

where, \( mc(y_{m} ) \) = function for marginal cost service m.

4 Estimation

One of the advantages of the proposed approach is that it removes the allocative efficiency component from the error term of the cost share equations. However, estimation of the above system is far from straightforward. There is a trade-off with the many non-linearities in the model through the distortion factors k ij . Since the allocative component in the model is modeled explicitly, allocative inefficiency can be derived from the estimated parameters and the observed values for services delivered, resource prices and cost. A key element in this Eq. 14 is G j depending on the distortion factor k j . The parameter k j may reflect firm-specific input preferences, but also simply systematic preferences. For firm-specific preferences, k j (or better k ij ) varies between firms and can only be estimated by using panel data or by assuming that k j depends on other firm specific managerial characteristics (see e.g., Reinhard and Thijssen 2000). For systematic preferences, k j is constant across firms and can be estimated simply. The technical efficiency component may be regarded as a part of the error term and can be derived from the residuals. Depending on the structure of the available data, a number of methods are available (see e.g., Kumbhakar and Lovell 2000). For instance, in the case of panel data and a limited number of firms, a fixed-effect approach is one possible way to proceed. However, there are a number of alternative specifications of the error term.

Since we are only interested in the effect of relative resource prices, one of the distortion factors has to be set to one. Another possibility is to normalize the distortion factors in such way that shadow cost equals actual cost. At first sight, this seems to be an attractive way to proceed (see Balk 1997), but, as Maietta (2002) points out, outcomes become sensitive to the choice of the input price as numéraire. Nevertheless, both approaches are equivalent (see Kumbhakar and Karagiannis 2004); a trade-off needs to be made between the simplicity of the model and the sensitivity of the numéraire choice, as mentioned. The same problem occurs in applying the SCIOD.

In traditional cost equation system estimation, cost shares only appear as endogenous variables on the left-hand side of the cost-share equations and exogenous variables as service levels and resource prices on the right-hand side. Estimation is carried out in a relatively straightforward way by applying Zellner’s Seemingly Unrelated Regression (Zellner 1962). Since cost shares here are endogenous variables that appear on both sides (note S j ’s are included in G j ’s) of the Eqs. 14 and 15 in the system, it is no longer appropriate to apply SUR. The system should therefore be estimated by using minimum distance estimation techniques, such as the General Method of Moments, Full Information Maximum Likelihood and Iterative Nonlinear 3SLS. In general, these techniques are much more demanding from a computational point of view and can only be applied under the assumption of the same distortion factor for all firms. Some of these techniques may even fail to converge due to problematic starting values and so forth. Due to the singularity of the model, one of the share equations should be dropped from the model. An alternative approach is based on Bayesian inference. For a detailed discussion of a Bayesian application to a shadow cost function, I recommend reading Kumbhakar and Tsionas (2005). Another alternative is based on an iterative procedure. This procedure consecutively consists of setting the distortion factors at a fixed value, estimating the other parameters of the model and simulating the optimal values for the distortion parameters. This procedure is repeated until some convergence criterion is met. Furthermore, it is obvious that the estimation of shadow prices depends to a large extent on sufficient variation in resource prices, which may not be the case in many empirical applications.

5 Conclusions

In many public service industries, firms are constrained by a cost budget and characterized by non-maximizing output behavior. None of the popular empirical models described in literature, such as the cost function model, is suited to analyzing the production structure of these types of service industries. In this paper, an alternative model, referred to as a Shadow Cost Indirect Output Distance Function, is presented that is more suitable. The model requires the same data as a cost function model. It provides an explicit expression for allocative efficiency that also depends on other exogenous variables in the model. Estimation of the system, however, is rather difficult to conduct. Although some promising techniques are at hand, much more empirical experience is needed in this area.