A note on functional form specification in random coefficients stochastic frontier models

Skevas, Ioannis

doi:10.1007/s11123-023-00700-4

A note on functional form specification in random coefficients stochastic frontier models

Open access
Published: 16 September 2023

Volume 61, pages 43–46, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Productivity Analysis Aims and scope Submit manuscript

A note on functional form specification in random coefficients stochastic frontier models

Download PDF

Ioannis Skevas ORCID: orcid.org/0000-0003-4595-4396¹

1502 Accesses
3 Altmetric
Explore all metrics

Abstract

This study presents a random coefficients stochastic frontier model that can accommodate the flexible translog functional form without being computationally demanding and thus time consuming to estimate. This is achieved by restricting the second-order frontier parameters to be common to all firms. For comparison, random coefficients stochastic frontier models with Cobb–Douglas, semi-translog and translog specifications with all parameters being firm-specific are estimated. The models are applied to an unbalanced panel of German dairy farms, and Bayesian techniques are used for the estimation. The results suggest that the time needed for the sampler to complete in the proposed model reduces dramatically as opposed to a translog model where all parameters are firm-specific. The elasticities exhibit some differences, depending on the choice of functional form, whilst the efficiency scores are less affected. Bayes factors suggest that the proposed model fits the data best.

A Hierarchical Stochastic Frontier Model for Efficiency Measurement Under Technology Heterogeneity

Article 05 October 2018

Estimating efficiency effects in a panel data stochastic frontier model

Article 21 November 2019

Stochastic frontier models with correlated effects

Article 01 June 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Efficiency measurement using the classical stochastic frontier model requires, among others, the choice of a functional form. In this model, this decision does not constitute a dilemma as the translog is a standard choice thanks to its flexibility. However, choosing a functional form is a more cumbersome task when technology heterogeneity is assumed and a random coefficients stochastic frontier model (Kalirajan and Obwona 1994) is used. This is because this model allows for firm-specific parameters in order to form individual frontiers, and the standard translog specification requires the estimation of a large number of parameters, which is computationally demanding and thus very time consuming. This is due to the precision (inverse variance) matrix of the distribution of the random parameters, whose elements are a quadratic function of the number of independent variables. Hence, related studies have relied on more parsimonious functional forms such as the Cobb–Douglas or the semi-translog (Tsionas (2002); Emvalomatis (2012); Skevas (2023)).

The present study argues that it is not necessary to use parsimonious functional forms when the random coefficients stochastic frontier model is employed. The reasoning is the following: A typical practice when a translog specification is employed is to normalize the right-hand-side variables by their means so that the first-order terms are interpreted as elasticities. This diminishes the importance of the second-order terms. Furthermore, the second-order terms in a translog specification tend to be insignificant. Therefore, this study specifies a translog functional form with firm-specific first-order terms, whilst restricting the second-order terms to be the same across firms. This approach still results in firm-specific frontiers whilst using the typical translog specification and not having to estimate a large number of parameters, as the elements of the precision matrix of the distribution of the random parameters is a quadratic function of (only) the number of first-order terms. This approach serves as a remedy to the arbitrary choices of parsimonious functional forms made by related studies when estimating a random coefficients stochastic frontier model.

2 Modeling approach & estimation

Let i denote firms and t denote time. A random coefficients stochastic frontier model that restricts some parameters to be common to all firms is:

$$y_{it} = {{{\mathbf{x}}}}_{it}^\prime {\boldsymbol{\beta}} _i + {{{\mathbf{z}}}}_{it}^\prime {\boldsymbol{\gamma}} + v_{it} - u_{it}$$

(1)

where y_it is the value of the dependent variable, x_it is a K × 1 vector that stores the values of K independent variables associated with firm-specific coefficients, β_i is a K × 1 vector of parameters, z_it is an L × 1 vector that stores the values of L independent variables associated with coefficients common to all firms, γ is an L × 1 vector of parameters, v_it is a two-sided error term, and u_it is a one-sided non-negative error term that captures inefficiency.

The one-sided non-negative inefficiency component u_it is assumed to follow a Half-Normal distribution, whilst the two-sided error term v_it and the random parameters β_i are assumed to follow a Normal distribution:

$$v_{it} \sim {{{\mathcal{N}}}}\left( {0,1/\tau } \right),\,u_{it} \sim {{{\mathcal{N}}}}^ + \left( {0,1/\phi } \right),\,{\boldsymbol{\beta}} _i \sim {{{\mathcal{N}}}}\left( {\overline {\boldsymbol{\beta}} ,{\mathbf{\Omega}}^{ - 1}} \right)$$

(2)

where τ and ϕ are precision parameters to be estimated, $\overline {\boldsymbol{\beta}}$ is a K × 1 vector of parameters that represents the mean of the β_is, and Ω is a K × K precision matrix for the distribution of the β_is. Regarding the functional form, the proposal of this study is to use the typical translog, with the first-order terms along with a constant term to be included in the x_it vector, and all the remaining second-order terms to be included in the z_it vector.

The model is estimated using Bayesian techniques. The data likelihood, denoted as $p\left( {\left. {\left\{ {y_{it}} \right\},\left\{ {{\boldsymbol{\beta}} _i} \right\},\left\{ {u_{it}} \right\}} \right|\left\{ {{{{\mathbf{x}}}}_{it}} \right\},\left\{ {{{{\mathbf{z}}}}_{it}} \right\},\overline {\boldsymbol{\beta}} ,{\boldsymbol{\gamma}} ,{\mathbf{\Omega}},\phi ,\tau } \right)$, is the product of two Normal distributions for the error term v_it and for the random parameters β_i, and one Half-Normal distribution for the inefficiency component u_it. The priors, denoted as $p\left( {\overline {\boldsymbol{\beta}} } \right),\,p\left( {\boldsymbol{\gamma}} \right),\,p\left( {\mathbf{\Omega}} \right),p\left( \phi \right)$ and p(τ), consist of Multivariate Normal distributions for $\overline {\boldsymbol{\beta}}$ and γ with 0 prior means and identity prior precision matrices multiplied by 0.001, a Wishart distribution for the precision matrix Ω with the degrees of freedom parameter being equal to 45 and the prior scale matrix being an identity matrix, and Gamma distributions for τ and ϕ with the shape and rate parameters being equal to 0.001 for τ, and 7 and 0.5 for ϕ, respectively. The model’s posterior, denoted as $\pi \left( {\overline {\boldsymbol{\beta}} ,{\boldsymbol{\gamma}} ,\,{\mathbf{\Omega}},\phi ,\tau ,\left\{ {{\boldsymbol{\beta}} _i} \right\},\left\{ {u_{it}} \right\}\left| {\left\{ {y_{it}} \right\},\left\{ {{{{\mathbf{x}}}}_{it}} \right\},\left\{ {{{{\mathbf{z}}}}_{it}} \right\}} \right.} \right)$, is propotional to the product of the data likelihood and the parameters’ prior distributions.

This study also estimates the typical random coefficients stochastic frontier model that restricts ${{{\mathbf{z}}}}_{it}^\prime {\boldsymbol{\gamma}} = 0$ in Eq. (1). This model is represented as:

$$y_{it} = {{{\mathbf{x}}}}_{it}^\prime {\boldsymbol{\beta}} _i + v_{it} - u_{it}$$

(3)

In the empirical application, this model is estimated using a Cobb–Douglas, a semi-translog, and a fully translog specification. The parameterization of priors is the same in all models. Regarding Ω, the imposed Wishart distribution integrates to unity only if the degrees of freedom parameter is greater or equal to the number of independent variables associated with firm-specific coefficients. Hence, in the fully translog specification of Eq. (3) this parameter is set equal to the total number of independent variables from the application that follows (i.e., 45). However, setting the degrees of freedom parameter equal to the total number of independent variables in the remaining models would result in a lower value, as they contain fewer firm-specific coefficients, which would in turn result in less diffuse priors since the variance of the elements of Ω is multiplied by the degrees of freedom parameter. Therefore, in all models the degrees of freedom parameter is set equal to 45 so that prior diffusion is similar across all models. Bayes factors are used to compare the models, with the marginal density of the data being obtained using the Chib and Jeliazkov (2001) approximation.

3 Dataset & specification

The models are applied to an unbalanced panel of German dairy farms that specialize in milk production and are observed between 1999 and 2009. The dataset consists of 1691 farms and 13,384 observations. Two outputs are distinguished: cow’s milk and milk products (y₁), and other products (y₂). Six inputs are specified: capital (K), labor (L), land (A), intermediate inputs (I), animals (S), and purchased feed (F). Table 1 presents summary statistics of the specified variables.

Table 1 Summary statistics of specified variables

Full size table

An output distance function is used. Using the linear homogeneity property, choosing y₁ as the normalizing output, taking logs, and making rearrangements yields the following estimable form of the translog distance function:

$$\begin{array}{l} { - \log y_{1,it}}\, = \,{\beta _{0,i} + \mathop {\sum}\limits_{m = 1}^m {\beta _{m,i}\log x_{m,it} + \beta _{p,i}\log \left( {\frac{{y_2,it}}{{y_1,it}}} \right) + \beta _{t,i}t} } \\ \qquad\qquad\quad +\, {\frac{1}{2}\,\mathop {\sum}\limits_{m = 1}^M {\mathop {\sum}\limits_r^R {\gamma _{mr}\log x_{m,it}\log x_{r,it}} + \frac{1}{2}\,\gamma _p\,\log \left( {\frac{{y_{2,it}}}{{y_{1,it}}}} \right)} } \\ \qquad\qquad\quad +\, { \frac{1}{2}\,\gamma _{mp}\mathop {\sum}\limits_{m = 1}^M {\log x_{m,it}\log \left( {\frac{{y_{2,it}}}{{y_{1,it}}}} \right)} + \gamma _tt^2 + \mathop {\sum}\limits_{m = 1}^M {\gamma _{mt}logx_{m,it}t} } \\ \qquad\qquad\quad +\,{\gamma _{pt}\,\log \left( {\frac{{y_{2,it}}}{{y_{1,it}}}} \right)t + v_{it} + u_{it}} \end{array}$$

(4)

Only the first-order terms and the constant term have firm-specific coefficients, whilst the parameters of all second-order terms are constant across firms. This model is called “Translog 2”. Setting all γ parameters equal to zero yields the “Cobb–Douglas” model, whilst keeping all γ parameters and assigning them an “i” subscript gives the “Translog 1” model. Finally, setting γ_mr, γ_p, γ_mp, and γ_t equal to zero, and assigning γ_mt and γ_pt an “i” subscript yields the “Semi-Translog” model. Before estimation, the data are normalized by their geometric means, so that the first-order terms are interpreted as distance elasticities.

4 Results

The results are based on 60,000 draws from the posterior. Table 2 presents the time elapsed for the completion of the markov chain monte carlo (MCMC) sampler in each model. The MCMC sampler in the “Translog 1” model took 46 hours to complete. However, in the “Translog 2” model that this study proposes the MCMC sampler took only 3 hours to complete. This result highlights that, with the same flexible translog functional form, specifying fewer random parameters reduces the time needed for the sampler to complete dramatically, thus freeing the researcher from having to deviate from the typical translog functional form.

Table 2 Time elapsed for the completion of the MCMC sampler in each model

Full size table

Table 3 presents the parameter estimates of only the first-order terms from all models. The full set of results is presented in Table A1 in the Appendix along with monte carlo standard errors. The distance elasticity of output suggests that a 1% increase in other output leads to a 0.084, 0.090, 0.118, and 0.115% increase in the distance function in the “Cobb–Douglas”, “Semi-Translog”, “Translog 1”, and “Translog 2” models, respectively. Obviously, the choice of the functional form matters, as the output distance elasticity is deflated when parsimonious functional forms are used. Small differences in the distance elasticities of inputs are also observed.

Table 3 Parameter estimates of first-order terms from all models

Full size table

Figure 1 presents boxplots of the efficiency scores for all four estimated models. In all models average efficiency is around 91–92%. Focusing on the two translog specifications, the reported similarities in the estimated efficiency scores can be perceived as a good sign, because they highlight that the move to a more parsimonious random coefficients translog specification does not change the efficiency scores. Finally, Table 4 presents the marginal log-likelihoods, the prior and the posterior model probabilities of the four models. The model proposed in this study (e.g., “Translog 2”) is clearly favored by the data against the remaining models.

Table 4 Marginal log-likelihoods, prior and posterior model probabilities

Full size table

5 Conclusions

Although the typical practice in random coefficients stochastic frontier models is to specify parsimonious functional forms so that estimation becomes less computationally intensive and thus less time consuming, the present study argues that researchers should not abandon the use of the typical translog functional form. This can be possible by specifying a translog random coefficients stochastic frontier model with fewer firm-specific parameters. Such a model, with firm-specific first-order terms and common to all firms second-order terms is presented and estimated in this study. For comparison, Cobb–Douglas, semi-translog and translog random coefficients stochastic frontier models with all parameters being firm-specific are also estimated.

The results show that the proposed translog model is significantly less time consuming than the translog model in which all parameters are firm-specific. The reported elasticities exhibit some differences, whilst the efficiency scores are similar across the four models. Finally, formal model comparisons suggest that the proposed translog model outperforms all the remaining models. The fact that the efficiency scores do not differ between the translog models is a desirable outcome, as it shows that the choice of a more parsimonious translog model does not affect the efficiency estimates. Furthermore, the proposed method can be more useful in applications where multiple inputs are used to produce multiple outputs, as is the case in dairy farming. However, in applications where a single output is produced using few inputs, the translog model with all parameters being firm-specific can be preferable. Finally, for future productivity growth studies, specifying the proposed translog distance function implies that the geometric mean of Malmquist output indices results in a Tornqvist index (see Coelli et al. 2005). Exchanging the time dimension with the firm dimension can form the basis for comparing productivity growth between firms.

Data availability

The data used in this study are not publicly available due to a confidentiality agreement signed between the author and the data provider (EU-FADN - DG AGRI).

References

Chib S, Jeliazkov I (2001) Marginal likelihood from the Metropolis–Hastings output. J Am Stat Assoc 96(453):270–281
Article MathSciNet Google Scholar
Coelli TJ, Rao DSP, O’Donnell CJ, Battese GE (2005) An introduction to efficiency and productivity analysis, 2nd edn. Springer Science + Business Media, NY
Google Scholar
Emvalomatis G (2012) Productivity growth in German dairy farming using a flexible modelling approach. J Agric Econ 63(1):83–101
Article Google Scholar
Kalirajan KP, Obwona MB (1994) Frontier production function: the stochastic coefficients approach. Oxf Bull Econ Stat 56(1):87–96
Article Google Scholar
Skevas I (2023) A novel modeling framework for quantifying spatial spillovers on total factor productivity growth and its components. Am J Agric Econ 105(4):1221–1247
Article Google Scholar
Tsionas EG (2002) Stochastic frontier models with random coefficients. J Appl Econ 17(2):127–147
Article Google Scholar

Download references

Funding

Open access funding provided by HEAL-Link Greece.

Author information

Authors and Affiliations

Department of International & European Studies, University of Macedonia, Thessaloniki, Greece
Ioannis Skevas

Authors

Ioannis Skevas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ioannis Skevas.

Ethics declarations

Conflict of interest

The author declares no competing interests.

Supplementary information

Appendix

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Skevas, I. A note on functional form specification in random coefficients stochastic frontier models. J Prod Anal 61, 43–46 (2024). https://doi.org/10.1007/s11123-023-00700-4

Download citation

Accepted: 21 August 2023
Published: 16 September 2023
Issue Date: February 2024
DOI: https://doi.org/10.1007/s11123-023-00700-4

Keywords

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A note on functional form specification in random coefficients stochastic frontier models

Abstract

Similar content being viewed by others

A Hierarchical Stochastic Frontier Model for Efficiency Measurement Under Technology Heterogeneity

Estimating efficiency effects in a panel data stochastic frontier model

Stochastic frontier models with correlated effects

1 Introduction

2 Modeling approach & estimation

3 Dataset & specification

4 Results

5 Conclusions

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Supplementary information

Appendix

Rights and permissions

About this article

Cite this article

Keywords

JEL Classification

Navigation

A note on functional form specification in random coefficients stochastic frontier models

Abstract

Similar content being viewed by others

A Hierarchical Stochastic Frontier Model for Efficiency Measurement Under Technology Heterogeneity

Estimating efficiency effects in a panel data stochastic frontier model

Stochastic frontier models with correlated effects

1 Introduction

2 Modeling approach & estimation

3 Dataset & specification

4 Results

5 Conclusions

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Supplementary information

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation