Flexible mixture modelling of stochastic frontiers

Griffin, J. E.; Steel, M. F. J.

doi:10.1007/s11123-007-0064-4

Flexible mixture modelling of stochastic frontiers

Published: 13 November 2007

Volume 29, pages 33–50, (2008)
Cite this article

Journal of Productivity Analysis Aims and scope Submit manuscript

J. E. Griffin¹ &
M. F. J. Steel¹

392 Accesses
18 Citations
Explore all metrics

Abstract

This paper introduces new and flexible classes of inefficiency distributions for stochastic frontier models. We consider both generalized gamma distributions and mixtures of generalized gamma distributions. These classes cover many interesting cases and accommodate both positively and negatively skewed composed error distributions. Bayesian methods allow for useful inference with carefully chosen prior distributions. We recommend a two-component mixture model where a sensible amount of structure is imposed through the prior to distinguish the components, which are given an economic interpretation. This setting allows for efficiencies to depend on firm characteristics, through the probability of belonging to either component. Issues of label-switching and separate identification of both the measurement and inefficiency errors are also examined. Inference methods through MCMC with partial centring are outlined and used to analyse both simulated and real data. An illustration using hospital cost data is discussed in some detail.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Article Open access 05 May 2021

Violating the normality assumption may be the lesser of two evils

Article Open access 07 May 2021

General diagnostic tests for cross-sectional dependence in panels

Article 20 May 2020

Notes

Predictive results are obtained by integrating out the parameters (of the sampling model) using the posterior. In the case of Fig. 1, this leads to the distribution of efficiencies for an unobserved firm in the industry, given all the information in the data.
Actually, from the point of view of statistical identification, they could appear in both places, although this would perhaps be hard to justify from an economic point of view (with the possible exception of time variables).
Actually, the data do not contain an explicit price for materials, so we assume that material prices are constant across hospitals but not over time. The latter dynamics is then approximated by a quadratic time trend.
This is the ratio of clinical personnel to average daily census, transformed to a dummy variable.
These Bayes factors are, again, computed through the Savage-Dickey density ratio, as explained in Sect. 6.

References

Aigner D, Lovell CAK, Schmidt P (1977) Formulation and estimation of stochastic frontier production function models. J Econometrics 6:21–37
Article Google Scholar
Albert JH, Chib S (1993) Bayesian analysis of binary and polychotomous response data. J Am Stat Assoc 88:669–679
Article Google Scholar
Battese GE, Coelli TJ (1992) Frontier production functions, technical efficiency and panel data: with application to paddy farmers in India. J Prod Anal 3:153–169
Article Google Scholar
Bernardo JM, Smith AFM (1994) Bayesian theory. Wiley, Chichester
Google Scholar
Box GEP, Tiao GC (1973) Bayesian inference in statistical analysis. Addison-Wesley, Reading
Google Scholar
Carree MA (2002) Technological inefficiency and the skewness of the error component in stochastic frontier analysis. Econ Lett 77:101–107
Article Google Scholar
Celeux G, Hurn M, Robert CP (2000) Computational and inferential difficulties with mixture posterior distributions. J Am Stat Assoc 95:957–970
Article Google Scholar
Cornwell C, Schmidt P, Sickles R (1990) Production frontiers with cross-sectional and time-series variation in efficiency levels. J Econometrics 46:185–200
Article Google Scholar
Fernández C, Koop G, Steel MFJ (2002) Multiple-output production with undesirable outputs: an application to nitrogen surplus in agriculture. J Am Stat Assoc 97:432–442
Article Google Scholar
Fernández C, Osiewalski J, Steel MFJ (1997) On the use of panel data in stochastic frontier models with improper priors. J Econometrics 79:169–193
Article Google Scholar
Gelfand AE, Sahu SK, Carlin BP (1995) Efficient parameterizations for normal linear mixed models. Biometrika 82:479–488
Article Google Scholar
Gilks WR, Wild P (1992) Adaptive rejection sampling for Gibbs sampling. J R Stat Soc C 34:198–200
Google Scholar
Greene WH (1990) A gamma-distributed stochastic frontier model. J Econometrics 46:141–163
Article Google Scholar
Greene WH (2003) Simulated likelihood estimation of the normal-gamma stochastic frontier function. J Prod Anal 19:179–190
Article Google Scholar
Griffin JE, Steel MFJ (2004) Semiparametric Bayesian inference for stochastic frontier models. J Econometrics 123:121–152
Article Google Scholar
Hall P, Simar L (2002) Estimating a changepoint, boundary or frontier in the presence of observation error. J Am Stat Assoc 97:523–534
Article Google Scholar
Johnson NL, Kotz S, Balakrishnan N (1994) Continuous univariate distributions. vol 1, 2nd edn. Wiley, New York
Google Scholar
Koop G, Osiewalski J, Steel MFJ (1997) Bayesian efficiency analysis through individual effects: Hospital cost frontiers. J Econometrics 76:77–105
Article Google Scholar
Koop G, Steel MFJ, Osiewalski J (1995) Posterior analysis of stochastic frontiers models using Gibbs sampling. Computational Stat 10:353–373
Google Scholar
Kozumi H, Zhang XY (2005) Bayesian and non-Bayesian analysis of gamma stochastic frontier models by Markov chain Monte Carlo methods. Comput Stat 20:575–593
Article Google Scholar
Kumbhakar SC (1990) Production frontiers, panel data and time-varying technical inefficiency. J Econometrics 46:201–212
Article Google Scholar
Kumbhakar SC, Ghosh S, McGuckin JT (1991) A generalized production frontier approach for estimating determinants of inefficiency in US dairy farms. J Bus Econ Stat 9:279–286
Article Google Scholar
Meeusen W, van den Broeck J (1977) Efficiency estimation from Cobb-Douglas production functions with composed errors. Int Econ Rev 8:435–444
Article Google Scholar
Papaspiliopoulos O, Roberts GO, Sköld M (2003) Non-centered parameterisations for hierarchical models and data augmentation. Bernardo JM, Bayarri MJ, Berger JO, Dawid AP, Heckerman D, Smith AFM, West M (eds) Bayesian statistics, vol 7. Clarendon Press, Oxford, pp 307–326
Google Scholar
Reinhard S, Lovell CAK, Thijssen G (1999) Econometric application of technical and environmental efficiency: an application to dutch dairy farms. Am J Agr Econ 81:44–60
Article Google Scholar
Richardson S, Green PJ (1997) On Bayesian analysis of mixtures with an unknown number of components (with discussion). J Roy Stat Soc B 59:731–792
Article Google Scholar
Ritter C, Simar L (1997) Pitfalls of Normal-Gamma stochastic frontier models. J Prod Anal 8:167–182
Article Google Scholar
Stacy EW (1962) A generalization of the gamma distribution. Ann Math Stat 33:1187–1192
Google Scholar
Stephens M (2000) Dealing with label switching in mixture models. J R Stat Soc B 62:795–809
Article Google Scholar
Stevenson RE (1980) Likelihood functions for generalized stochastic frontier estimation. J Econometrics 13:57–66
Article Google Scholar
Spiegelhalter D, Grigg O, Kinsman R, Threasure T (2003) Risk-adjusted sequential probability ratio tests: applications to Bristol, Shipman and adult cardiac surgery. Int J Qual Health Care 15:7–13
Article Google Scholar
Tsionas EG (2000) Full likelihood inference in Normal-gamma stochastic frontier models. J Productivity Anal 13:183–205
Article Google Scholar
van den Broeck J, Koop G, Osiewalski J, Steel MFJ (1994) Stochastic frontier models: a Bayesian perspective. J Econometrics 61:273–303
Article Google Scholar
Verdinelli I, Wasserman L (1995) Computing Bayes factors using a generalization of the Savage-Dickey density ratio. J Am Stat Assoc 90:614–618
Article Google Scholar

Download references

Acknowledgements

Jim Griffin acknowledges research support from The Nuffield Foundation grant NUF-NAL/00728. We thank three Referees for constructive comments.

Author information

Authors and Affiliations

Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK
J. E. Griffin & M. F. J. Steel

Authors

J. E. Griffin
View author publications
You can also search for this author in PubMed Google Scholar
M. F. J. Steel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. F. J. Steel.

Appendix: Some Details on the MCMC Sampler

1.1 A.1 Drawing the inefficiencies

The full conditional for the inefficiencies is, of course, different from the one in the existing literature. The firm-specific inefficiencies u _i will be independent given all the other parameters and the observations, with density function:

$$ p(u_i\vert \hbox{rest}) \propto u_i^{c\phi-1} \exp\left\{-\left[\frac{1}{2\sigma^2}(T_i u_i^2 + 2 \mu_i u_i) + \lambda u_i^c\right]\right\}, $$

(10)

where $\mu_i=T_i\alpha+\beta^{\prime} X_i^{\prime} \iota-y_i^{\prime} \iota$ for a cost frontier and its negative for a production frontier, if we define $y_i=(y_{i1},{\ldots},y_{i{T_i}})^\prime,\;X_i=(x_{i1},{\ldots},x_{i{T_i}})^\prime$ and $\iota$ is a T _i-dimensional vector of ones. We can easily show that the conditional in (10) is log-concave if

$$ u^c > \frac{1-c\phi}{\lambda c(c-1)}, $$

which is always satisfied if cϕ > 1 and c > 1. For this parameter combination, we use adaptive rejection sampling (see Gilks and Wild 1992) to sample directly from (10). In the other cases, we use random walk Metropolis-Hastings with a lognormal candidate generator. Random walk Metropolis-Hastings is also used to generate drawings for the parameters ϕ and c while λ can simply be drawn from a Gamma conditional. We use the reparameterization from (ϕ,c) to (ψ,c), where ψ = ϕc, since it leads to better mixing properties of the sampler.

1.2 A.2 Centring

As indicated in Sect. 6, we use a sampler which randomly mixes updates from the centred and the uncentred parameterizations. This basic idea is called “partial centring” in Papaspiliopoulos et al. (2003), who show that this generally leads to more robust sampling algorithms. We choose a centred update with probability 1/4, as we only need to “recentre” α once in a while. In addition, we found that centring works best if we integrate out λ while updating α (i.e. we effectively draw α and λ jointly). Thus, in the centred parameterization we use a random walk Metropolis-Hastings step to sample from the following conditional for α < min_i{z _i}:

$$ p(\alpha\vert z, \beta, \sigma^2, c, \phi, y_1,\ldots,y_n)\propto \prod_{i=1}^n(z_i-\alpha)^{c\phi-1} \left[\sum_{i=1}^n(z_i-\alpha)^c +(-\ln r^\star)^c\right]^{-\phi(n+1)}, $$

(11)

which can be shown to be log-concave whenever cϕ > 1 and c > 1. We use adaptive rejection sampling in the latter case and a random walk Metropolis-Hastings step otherwise.

1.3 A.3 The mixture model

In the case of the mixture inefficiency distribution, as described in Sect. 3.1, we extend the MCMC sampler of the basic case by augmenting with an indicator variable s _i,i = 1,…,n which can take the values 0 or 1 and assigns firm i to one of the two efficiency groups (i.e. one of the two inefficiency components). For the mixture model the sampler will, thus, generate a chain on (α,β,σ²,w,θ,u,s), where s = (s ₁,…,s _n). Inference about the parameters of each component (c _j, ϕ_j, λ_j), j = 1,2 now only depends on those firms for which s _i = j − 1. The full conditional distribution for w will be

$$ w\vert \hbox{rest}\sim\hbox{Be}\left(w_0+n-\sum_{i=1}^n s_i,w_1+\sum_{i=1}^n s_i\right), $$

and the values of s _i are updated through

$$ \hbox{P}(s_i=0\vert \hbox{rest})=\frac{w p_{GG}(u_i\vert c_1,\phi_1,\lambda_1)}{w p_{GG}(u_i\vert c_1,\phi_1,\lambda_1) + (1-w) p_{GG}(u_i\vert c_2,\phi_2,\lambda_2)}. $$

For the decomposition case, where we have included a prior probability of w = 0, the full conditional distribution of w has the same form as above except when $\sum_{i=1}^n s_i=n$. In this case w is zero with probability $q^{\star}$ and w∼Be(w ₀, w ₁ + n) otherwise where

$$ q^{\star}=\frac{q}{q+(1-q) \frac{\Gamma(w_0+w_1)\Gamma(w_1+n)}{\Gamma(w_1)\Gamma(w_0+w_1+n)}}. $$

Here we only considered switching to the one-component model when $\sum_{i=1}^n s_i=n$, which worked well. However, in applications with very large n it might be more efficient to also consider jumps between the models when $\sum_{i=1}^n s_i$ is relatively close to n.

In case we allow for the probability of being in the efficient group to depend on firm characteristics, we can use simple Gibbs sampling after data augmentation as in Albert and Chib (1993) to update the probit regression coefficients.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Griffin, J.E., Steel, M.F.J. Flexible mixture modelling of stochastic frontiers. J Prod Anal 29, 33–50 (2008). https://doi.org/10.1007/s11123-007-0064-4

Download citation

Published: 13 November 2007
Issue Date: February 2008
DOI: https://doi.org/10.1007/s11123-007-0064-4

Keywords

JEL Classifications

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Flexible mixture modelling of stochastic frontiers

Abstract

Access this article

Similar content being viewed by others

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Violating the normality assumption may be the lesser of two evils

General diagnostic tests for cross-sectional dependence in panels

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendix: Some Details on the MCMC Sampler

1.1 A.1 Drawing the inefficiencies

1.2 A.2 Centring

1.3 A.3 The mixture model

Rights and permissions

About this article

Cite this article

Keywords

JEL Classifications

Navigation

Flexible mixture modelling of stochastic frontiers

Abstract

Access this article

Similar content being viewed by others

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Violating the normality assumption may be the lesser of two evils

General diagnostic tests for cross-sectional dependence in panels

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendix: Some Details on the MCMC Sampler

Appendix: Some Details on the MCMC Sampler

1.1 A.1 Drawing the inefficiencies

1.2 A.2 Centring

1.3 A.3 The mixture model

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classifications

Search

Navigation