Skip to main content
Log in

A stochastic frontier model with structural breaks in efficiency and technology

  • Published:
Empirical Economics Aims and scope Submit manuscript

Abstract

We have developed a stochastic frontier model with appropriate priors to estimate the locations and number of structural breaks for both production efficiency and technology, which experience different regime changes. We assume different units could have unknown common break dates. Although panel data with large cross-sectional size can help identify the break locations, it could render posterior simulation very inefficient. Hence, care must be taken to avoid such problems. We apply our method to study the world production over the period of 1960–2007 and find that the data support structural breaks in technology rather than in efficiency. For most countries under study, the most important source of growth is capital accumulation. The technology adopted by different countries shows signs of convergence. Changes of technology usually happen after economic crises to compensate for negative capital growth. Alternative modelling approach and priors are used for robustness check.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Notes

  1. We assume both the intercept and slopes could be time varying.

  2. Note that the mode of \(\exp (\gamma )\) is \(\exp (\mu -\exp (\phi ))\), which can approach 0 when \(\mu \) and \(\phi \) are adjusted appropriately. Hence, log normal density can mimic the downward-sloping shape of half normal and exponential densities. However, log normal density does not have to be downward sloping with the peak close to 0.

  3. That is, the posterior probabilities of the true number of regimes and the true break point locations will tend to one asymptotically with the increase of cross-sectional sample size. We rely on artificial data simulations to check our model. It appears that both the number of in-sample break points and their locations can be well identified with fairly large \(N\) (above 100). Such results are available upon request from the author.

  4. If the change point takes place at period \(T\), it implies there is only one regime with no breaks inside the sample.

  5. Note that \(Pr(S_T=1,S_{T-1}=1,\dots ,S_1=1)=\frac{1}{T}\ne 0\).

  6. If the parameter values could change at every period, the choice of prior implies \(Pr(S_{1}=1,\dots ,S_{t}=t)=\frac{1}{T^{t-1}}\).

  7. See Fig. 3. When \(D=16\), the number of sample periods in our application such effect is very obvious. However, when \(D=64\), the prior accumulation becomes very small.

  8. It is gamma distribution for their case.

  9. Inefficiency factor is the inverse of the relative numerical efficiency measure of Geweke (1992). It implies how many simulated draws are required to obtain one effective draw for posterior inference. The closer the IF is to 1, the more the efficient is the posterior simulation.

  10. These numbers are from prior simulation.

  11. The results are available upon request from the author.

  12. Note that in general the posterior point estimates are between the prior and the maximum likelihood estimates.

  13. See Table 4.

  14. We have collected the data of 109 countries with all the variables available over 1960–2007 from Penn World Table, based on which the per capita income ranking is calculated.

  15. Note that we average our data every 3 years. It is possible to convert the growth numbers into annual terms. For example, the overall average growth of the two groups is 13 %. The annual average growth would be 4.16 % (i.e. \(\root 3 \of {1.13}-1\)).

  16. The break point location is estimated similarly by the frequency of a particular point being a break in the simulation.

  17. Note that the efficiency estimates are relative to different production frontiers for the two groups of economies.

  18. Cyprus is the only exception in our sample with technical progress higher than capital growth.

  19. We would like to thank an anonymous referee for suggesting us estimate this model.

  20. The exact estimation results are available upon request from the author.

  21. We have also found that technology regimes in both models are not sensitive to \(D_\mathrm{max}\).

  22. For example, if both period 1 and 2 belong to regime 1, i.e.\(S^a_1=S^a_2=1\), then \(\zeta _{i1}=(x_{i1},x_{i2})'.\)

  23. Since the posterior estimate of \(\sigma ^2_\epsilon \) is consistent with the data separated into two regimes, replacing the posterior draw of \(\sigma ^2_\epsilon \) with its true value does not affect asymptotic analysis.

References

  • Aigner D, Lovell K, Schmidt P (1977) Formulation and estimation of stochastic frontier production function models. J Econom 6:21–37

    Article  Google Scholar 

  • Bai J (2010) Common breaks in means and variances for panel data. J Econom 157(1):78–92

    Article  Google Scholar 

  • Bos J, Economidou C, Koetter M, Kolari J (2010) Do all countries grow alike? J Dev Econ 91:113–127

    Article  Google Scholar 

  • Carter C, Kohn R (1994) On gibbs sampling for state space models. Biometrika 81(3):541–553

    Article  Google Scholar 

  • Chib S (1998) Estimation and comparison of multiple change-point models. J Econom 86:221–241

    Article  Google Scholar 

  • Chib S, Carlin BP (1999) On mcmc sampling in hierarchical longitudinal models. Stat Comput 9(1):17–26

    Article  Google Scholar 

  • Feng Q, Kao C, Lazarova S (2010) Estimation and identification of change points in panel models. Tech. rep., Nanyang Technological University

  • Gerlach R, Carter C, Kohn R (2000) Efficient bayesian inference for dynamic mixture models. J Am Stat Assoc 95:819–828

    Article  Google Scholar 

  • Geweke J (1992) Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments. In: Bernardo J, Berger J, Dawid A, Smith A (eds) Bayesian statistics. Oxford University Press, Oxford, pp 169–193

    Google Scholar 

  • Heston A, Summers R, Aten B (2009) Penn world table version 6.3. Center for International Comparisons of Production, Income and Prices at the University of Pennsylvania

  • Koop G, Potter S (2009) Prior elicitation in multiple change-point models. Int Econ Rev 50(3):751–771

    Article  Google Scholar 

  • Koop G, Osiewalski J, Steel MF (2000) Modeling the sources of output growth in a panel of countries. J Bus Econ Stat 18(3):284–299

    Google Scholar 

  • Koopman SJ (1997) Exact initial Kalman filtering and smoothing for nonstationary time series models. J Am Stat Assoc 92(440):1630–1638

    Article  Google Scholar 

  • Koopman SJ, Durbin J (2002) A simple and efficient simulation smoother for state space time series analysis. Biometrika 89(3):603–615

    Article  Google Scholar 

  • Kumar S, Russell R (2002) Technological change, techological catch-up, and capital deepening: relative contributions to growth and covergence. Am Econ Rev 92:527–548

    Article  Google Scholar 

  • Meeusen W, van den Broeck J (1977) Efficiency estimation from Cobb-Douglas production functions with composed error. Int Econ Rev 18:435–444

    Article  Google Scholar 

  • Ritter C, Simar L (1997) Pitfalls of normal-gamma stochastic frontier models. J Product Anal 8:167–182

    Article  Google Scholar 

  • Tsionas EG (2006) Inference in dynamic stochastic frontier models. J Appl Econom 21:669–676

    Article  Google Scholar 

  • Tsionas EG, Kumbhakar SC (2004) Markov switching stochastic frontier model. Econom J 7(2):398–425

    Article  Google Scholar 

  • Tsionas EG, Tran KC (2007) Bayesian inference in threshold stochastic frontier models. Tech. rep., Athens University of Economics and Business

  • van den Broeck J, Koop J, Steel MF (1994) Stochastic fronier models a bayesian perspective. J Econom 61:273–303

    Article  Google Scholar 

Download references

Acknowledgments

The previous versions of the paper have been presented at the 10th World Congress of the Econometric Society, the Rimini Conference in Economics and Finance and the Economics Workshop at Cardiff Business School. The author would like to thank the participants and their comments. Computational assistance by Huw Lynes and Christopher Bording in the Merlin system is gratefully acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guangjie Li.

Appendices

Appendix 1: Features of the prior in (4)

It is possible to model the breaks in terms of the locations of change points. Denote the \(m\)th break point as \(\tau _m=\lbrace t:S_{t+1}=m+1,S_t=m\rbrace \) for \(m=1,2,\dots ,M-1\). Suppose \(D\) only takes integer values. The unrestricted uniform prior suggested by Koop and Potter (2009) can have the following form:

$$\begin{aligned} \begin{aligned}&Pr(\tau _1)=\frac{1}{D}\qquad \text {for } \tau _1=1,2,\dots ,D,\\&Pr(\tau _m|\tau _{m-1})=\frac{1}{D}\qquad \text {for } \tau _m=\tau _{m-1}+1,\tau _{m-1}+2,\dots ,\tau _{m-1}+D. \end{aligned} \end{aligned}$$
(20)

The prior in (20) implies \(Pr(\tau _m|\tau _{m-1})\) is uniform over the support, which is as large as \(\tau _1\)’s.

In what follows, we use \(\widetilde{Pr}(\tau _m|\tau _{m-1})\) to denote the probability derived from our prior in (4).

$$\begin{aligned} \begin{aligned}&\widetilde{Pr}(\tau _m=t|\tau _{m-1}=t-j)= Pr(S_{t+1}=m+1,S_t=m|S_{t-j}=m-1,S_{t-j+1}=m),\\&=Pr(S_{t+1}=m+1|S_t=m,S_{t-j}=m-1,S_{t-j+1}=m)Pr(S_{t}\\&=m|S_{t-j}=m-1,S_{t-j+1}=m),\\&=Pr(S_{t+1}=m+1|S_t=m)Pr(S_{t}=m|S_{t-j+1}=m),\\&=\frac{1}{D-(t-j)+m-1}=\frac{1}{D-\tau _{m-1}+m-1}\quad \text {for }j=1,2,\dots ,t-m+1. \end{aligned} \end{aligned}$$
(21)

In the derivation above, we have used the fact that \(S_t\) is a Markov process. The last but one equality sign results since

$$\begin{aligned} \begin{aligned} Pr(S_{t}&=m|S_{t-j+1}=m)\\&=Pr(S_{t}=m|S_{t-1}=m)Pr(S_{t-1}=m|S_{t-2}=m)\\&\cdots Pr(S_{t-j+2}=m|S_{t-j+1}=m)\\&=\left( 1-\frac{1}{D-(t-1-m)}\right) \left( 1-\frac{1}{D-(t-2-m)}\right) \\&\cdots \left( 1-\frac{1}{D-(t-j+1-m)}\right) \\&=\frac{D-(t-m)}{D-(t-j)+m-1}. \end{aligned} \end{aligned}$$
(22)

Comparing \(\widetilde{Pr}(\tau _m|\tau _{m-1})\) and \(Pr(\tau _m|\tau _{m-1})\), we can see that the prior in (4) implies the conditional probability of the break location is uniform over \(\tau _m=\tau _{m-1}+1,\dots , D+m-1\), but has smaller support than \(\tau _1\) when \(\tau _{m-1}>m-1\). Therefore, \(\widetilde{Pr}(\tau _m|\tau _{m-1})\) gets bigger when \(\tau _{m-1}\) gets closer to \(D\), which also implies the points near the end of the sample are more possible break points than the points before as in Fig. 3. Hence, the prior in (4) is in fact similar to the restricted uniform prior in Koop and Potter (2009), though the uniform prior does not necessarily imply \(S_t\) follows a Markov process.

Appendix 2: Simulate draws of \(\gamma \), \(S^\gamma \) and \(D^\gamma \)

To draw \(\gamma \), we numerically invert its cumulative posterior distribution function. The posterior kernel of \(\gamma _{iS_t=j}\) (\(\gamma _{ij}\)) takes the following form:

$$\begin{aligned} \begin{aligned}&p(\gamma _{ij}|S^\gamma ,\mu ,\phi ,a,S^a,\sigma ^2_\epsilon ,Y)\propto \\&exp\Biggl \lbrace -\frac{d_j}{2\sigma ^2_\epsilon } \biggl [\exp (\gamma _{ij})-\frac{1}{d_j}\underset{t\in \lbrace t|S_t=j\rbrace }{\sum }\left( x_{it}'a_{S_t^a}-y_{it}\right) \biggl ]^2 \Biggl \rbrace \exp \left[ -\frac{(\gamma _{ij}-\mu _j)^2}{2\exp (\phi _j)}\right] . \end{aligned} \end{aligned}$$
(23)

To simulate a draw of \(S^\gamma \), we follow the algorithm by Chib (1998). Once we have a draw of \(S^\gamma \), we could simulate \(D^\gamma \), whose posterior kernel is

$$\begin{aligned} p(D^\gamma |S^\gamma ,Y)\propto I(T\le D^\gamma \le D_\mathrm{max})\prod _{t=2}^TPr(S^\gamma _{t}|S^\gamma _{t-1},D^\gamma ). \end{aligned}$$
(24)

We first find the integrating constant of the posterior kernel over \([T,D_\mathrm{max}]\) and then draw \(D^\gamma \) by numerically inverting its cumulative distribution function. The same method can be used to draw regime duration for technology.

Appendix 3: Simulate draws of \(S^a\) and \(a\)

Define

$$\begin{aligned} z^a_{it}=y_{it}+exp(\gamma _{iS_t^\gamma }). \end{aligned}$$
(25)

Stack up all the \(z^a_{it}\) in regime \(j\) of \(a\) into a vector (i.e. for \(t\) with \(S^a_t=j\)) to obtain \(z^a_{ij}\). Similarly, we concatenate all the \(x_{it}\) in regime \(j\) to form \(\zeta _{ij}\).Footnote 22 Denote \(z^a_j=(z^{a\prime }_{1j},z^{a\prime }_{1j},\dots ,z^{a\prime }_{Nj})'\) and \(\varXi _j=(\zeta _{1j}',\zeta _{1j}',\dots ,\zeta _{Nj}')'\). Then we can have the following state space model:

$$\begin{aligned} z^a_j&= \varXi _j a_j+e^a_j,\qquad e^a_j\sim N(0,\sigma ^2_\epsilon I_{d^a_j N\times d^a_j N}),\end{aligned}$$
(26)
$$\begin{aligned} a_{j+1}&= a_j+u^a_{j},\qquad u^a_{j}\sim N(0,\sigma ^2_aI_{k_a\times k_a}), \end{aligned}$$
(27)

where \(d^a_j\) is the duration of the \(j\)th regime, \(j=1,\dots ,m_a\) and \(m_a\) is the number of in-sample regimes for \(a\). We assume the cross-sectional size (\(N\)) is large and \(\varXi _j\) has full column rank. The Kalman filter takes the following form:

$$\begin{aligned} \begin{aligned}&R_j=\varXi _j(V_{j-1}+\sigma ^2_a I)\varXi _j'+\sigma ^2_\epsilon I, J_j=(V_{j-1}+\sigma ^2_aI)\varXi _j',\\&M_j=J_jR_j^{-1},C_j=J_jR_j^{-1}J^{'}_j,V_j=V_{j-1}+\sigma ^2_aI-C_j,\\&m_j=m_{j-1}+M_j(z^a_j-\varXi _j m_{j-1}),j=2,3\dots ,m_a. \end{aligned} \end{aligned}$$
(28)

Due to the diffuse prior for \(a_1\), we initialize the filter by \(V_1=\sigma ^2_\epsilon (\varXi _1'\varXi _1)^{-1}\) and \(m_1=(\varXi _1'\varXi _1)^{-1}\varXi _1'z^a_1\) (e.g. Koopman 1997). We can then use MH algorithm to draw \(\sigma ^2_a\) from the following posterior,

$$\begin{aligned} p(\sigma ^2_a|z^a,S^a,\sigma ^2_\epsilon )&\propto p_{IG}(\sigma ^2_a)\prod _{j=1}^{m_a-1}|R_{j+1}|^{-\frac{1}{2}}\exp \left[ -\frac{1}{2}\sum _{j=1}^{m_a-1}(z^{a}_{j+1}\right. \nonumber \\&\left. -\varXi _{j+1} m_{j})'R_{j+1}^{-1}(z^{a}_{j+1}-\varXi _{j+1} m_{j})\right] , \end{aligned}$$
(29)

To draw \(a|z^a\) (smoothing), we use the algorithm by Koopman and Durbin (2002), while we use the algorithm by Gerlach et al. (2000). To draw \(S^a\) and let \(S^a\) be unconditional on \(a\). Define \(z^{*a}_{it}=y_{it}+exp(\gamma _{iS_t^\gamma })\), \(z^{*a}_{t}=(z^{*a}_{1t},z^{*a}_{2t},\dots ,z^{*a}_{Nt})'\) and \(\varXi ^*_{t}=(x_{1t},x_{2t},\dots ,x_{Nt})'\). We can then write down the following state space model:

$$\begin{aligned} \begin{aligned} z^{*a}_{t}&=\varXi ^{*}_ta_t+e^{*a}_t,\qquad e^{*a}_t\sim N(0,\sigma ^2_\epsilon I_N)\\ a_{t+1}&=a_{t}+K_tu_{t},\qquad u_{t}\sim i.i.d.N(0,\sigma ^2_a I). \end{aligned} \end{aligned}$$
(30)

where \(K_t=S_{t+1}-S_{t}\) takes the value of either 1 or 0 indicating whether period \(t\) is a breakpoint or not. We draw the posterior of \(K^a=(K_1,\dots ,K_{T-1})'\) and then transform \(K^a\) into \(S^a=(S_1,\dots ,S_T)'\). We still use the prior in (4) and represent it as (15). The algorithm by Gerlach et al. (2000) is a Gibbs sampler, which draws \(K_t|K_{s\ne t},Y\), and the details are as follows.

Algorithm 6.1

  1. 1.

    Given the initial draw of \(K^a\), we run through the following backward recursion, where the notations are just confined to this section.

    $$\begin{aligned} \begin{aligned} N_{t+1}^{-1}&=\left( \sigma ^2_aK_{t}\varXi ^{*}_{t+1} \varXi ^{*\prime }_{t+1}+\sigma ^2_\epsilon I_N\right) ^{-1}\\&=\frac{1}{\sigma ^2_\epsilon }\left[ I_N- \varXi ^{*}_{t+1}\left( \frac{\sigma ^2_\epsilon }{\sigma ^2_aK_{t}}I +\varXi ^{*\prime }_{t+1}\varXi ^{*}_{t+1}\right) ^{-1} \varXi ^{*\prime }_{t+1}\right] ,\\ B_{t+1}&=\sigma ^2_aK_{t}\varXi ^{*\prime }_{t+1}N^{-1}_{t+1}, A_{t+1}=I_{k_a}-B_{t+1}\varXi ^{*}_{t+1},\\ Z_{t+1}&=\sigma ^a_a K_{t}A_{t+1}A_{t+1}'+\sigma ^2_\epsilon B_{t+1}B_{t+1}',\\ D_{t+1}&=Z_{t+1}-Z_{t+1}(\varOmega _{t+1}^{-1}+Z_{t+1})^{-1}Z_{t+1}\\&=Z_{t+1}^{\frac{1}{2}}\left( I+Z_{t+1}^{\frac{1}{2}}\varOmega _{t+1}Z_{t+1}^ {\frac{1}{2}}\right) ^{-1}Z_{t+1}^{\frac{1}{2}},\\ \varOmega _t&=A_{t+1}'(\varOmega _{t+1}-\varOmega _{t+1}D_{t+1} \varOmega _{t+1})A_{t+1} +\varXi ^{*\prime }_{t+1}N_{t+1}^{-1}\varXi ^{*}_{t+1},\\ \mu _{t}&=A_{t+1}'(I-\varOmega _{t+1}D_{t+1}) (\mu _{t+1}-\varOmega _{t+1}B_{t+1}z_{t+1}^{*a})\\&\quad \ +\varXi ^{*\prime }_{t+1}N_{t+1}^{-1}z_{t+1}^{*a},t=T-1,\dots ,1, \end{aligned} \end{aligned}$$
    (31)

    where \(\varOmega _T=0_{k_a\times k_a}\), \((\varOmega _{T}^{-1}+Z_{T})^{-1}=0\) and \(\mu _T=0_{k_a\times 1}\).

  2. 2.

    For \(K_t=0,1\),

    1. (a)

      run the Kalman filter in (28) by replacing the subscript \(j\) by \(t\) (\(t=2,\dots ,T\)), \(\sigma ^2_a\) by \(K_{t-1}\sigma ^2_{a}\), \(\varXi _j\) by \(\varXi ^*_t\) and \(z^a_j\) by \(z^{*a}_t\).

    2. (b)

      Calculate the posterior conditionals:

      $$\begin{aligned}&\!\!Pr(K_t|z^{*a},K_{s\ne t})\propto p(z_{t+1}^{*a}|z_{1:t}^{*a},K_{1:t})p(z_{(t+2):T}^ {*a}|z_{1:(t+1)}^{*a},K^a)Pr(K_t|K_{s\ne t})\nonumber \\&\!\!p(z_{t+1}^{*a}|z_{1:t}^{*a},K_{1:t})\propto |R_{t+1}|^{-\frac{1}{2}}\exp \left[ -\frac{1}{2}(z^{*a}_{t+1}-\varXi ^{*}_{t+1} m_{t}) 'R_{t+1}^{-1}(z^{*a}_{t+1}\right. \nonumber \\&\!\!\left. -\varXi ^{*}_{t+1} m_{t})\right] \nonumber \\&\!\!p(z_{(t+2):T}^{*a}|z_{1:(t+1)}^{*a},K^a)\propto |\varOmega _{t+1}V_{t+1}+I|^{-\frac{1}{2}} \exp \Biggl \lbrace -\frac{1}{2}\biggl [m_{t+1}'\varOmega _{t+1} m_{t+1}\nonumber \\&\!\!-2\mu _{t+1}' m_{t+1}-(\mu _{t+1}-\varOmega _{t+1} m_{t+1})'\nonumber \\&\!\!\quad \left( V_{t+1}-V_{t+1}(\varOmega _{t+1}^{-1} +V_{t+1})^{-1}V_{t+1}\right) \nonumber \\&\!\!\quad (\mu _{t+1} -\varOmega _{t+1}m_{t+1})\biggr ]\Biggr \rbrace \end{aligned}$$
      (32)

      where \(Pr(K_t|K_{s\ne t})\) is obtained from the prior and \((\varOmega _{T}^{-1}+V_{T})^{-1}=0\).

    3. (c)

      Draw \(K_t|z^{*a},K_{s\ne t}\) based on the results from (32) and use the values of \(m_{t+1}\) and \(V_{t+1}\) accordingly for the next iteration.

It is similar to draw \(\phi \) and \(\mu \), though we need to use the algorithm by Carter and Kohn (1994) to simulate draws from partially Gaussian state space model.

Appendix 4: A case when Chib’s algorithm is inconsistent

Consider the following model, which is a simplified version of (1) without the exogenous regressors and the inefficiency terms:

$$\begin{aligned} y_{it}=a_{S_t}+\epsilon _{it},\qquad \epsilon _{it}\sim i.i.d.N(0,\sigma ^2_\epsilon ). \end{aligned}$$
(33)

Suppose \(T=2\) and set the initial draw of \((S_1,S_2)\) to be \((1,2)\). We can then draw \(a_1\) and \(a_2\) from the Gibbs sampler below,

$$\begin{aligned} \begin{aligned}&a_1|y_1,a_2,\sim N\left( \frac{\sigma ^2_a\frac{\sum _{i=1}^Ny_{i1}}{N} +a_2\frac{\sigma ^2_\epsilon }{N}}{\sigma ^2_a +\frac{\sigma ^2_\epsilon }{N}},\frac{\sigma ^2_\epsilon \sigma ^2_a}{N\sigma ^2_a +\sigma ^2_\epsilon }\right) ,\\&a_2|y_2,a_1,\sim N\left( \frac{\sigma ^2_a\frac{\sum _{i=1}^Ny_{i2}}{N} +a_1\frac{\sigma ^2_\epsilon }{N}}{\sigma ^2_a +\frac{\sigma ^2_\epsilon }{N}},\frac{\sigma ^2_\epsilon \sigma ^2_a}{N\sigma ^2_a+\sigma ^2_\epsilon }\right) . \end{aligned} \end{aligned}$$
(34)

Now if we draw \(S_1\) and \(S_2\) conditional on the draws of \(a_1\) and \(a_2\) using Chib’s algorithm, we can obtain

$$\begin{aligned} \begin{aligned} \frac{Pr(S_2=2|a_2,y_2)}{Pr(S_2=1|a_1,y_2)}&=\frac{p(y_2|a_2) \left[ Pr(S_{2}=2|S_{1}=1)Pr(S_{1}=1)\right] }{p(y_2|a_1)\left[ Pr(S_{2}=1|S_{1}=1)Pr(S_{1}=1)\right] },\\&=\exp \left[ \frac{\sum _{i=1}^N(y_{i2}-a_1)^2-\sum _{i=1}^N (y_{i2}-a_2)^2}{2\sigma ^2_\epsilon }\right] , \end{aligned} \end{aligned}$$
(35)

where \(Pr(S_{1}=1)=1\) and \(Pr(S_{2}=2|S_{1}=1)=Pr(S_{2}=1|S_{1}=1)=\frac{1}{2}\) if we set \(D=M=T=2\) in (4). If \(N\) goes to infinity, the posterior draws of \(a_1\) and \(a_2\) in (34) will have the properties of \(a_1-\bar{y}_1\overset{p}{\rightarrow }0\) and \(a_2-\bar{y}_2\overset{p}{\rightarrow }0\) where \(\bar{y}_1=\frac{\sum _{i=1}^Ny_{i1}}{N}\) and \(\bar{y}_2=\frac{\sum _{i=1}^Ny_{i2}}{N}\). We can therefore replace \(a_1\) and \(a_2\) by \(\bar{y}_1\) and \(\bar{y}_2\), respectively, for asymptotic analysis. Denote \(a_1^*\) and \(a_2^*\) as the true values of \(a_1\) and \(a_2\), respectively. We can have \(\bar{y}_1=a_1^*+\frac{\sum _{i=1}^N\epsilon _{i1}}{N}=a_1^*+\bar{\epsilon }_1\) and \(\bar{y}_2=a_2^*+\bar{\epsilon }_2\). Substituting these terms into the right-hand side of (35), we can obtain

$$\begin{aligned} \begin{aligned}&\frac{Pr(S_2=2|a_2,y_2)}{Pr(S_2=1|a_1,y_2)} -\exp \left[ \frac{N(\bar{y}_{1}-\bar{y}_2)^2}{2\sigma ^2_\epsilon }\right] \overset{p}{\rightarrow }0,\\ \text {or }&\frac{Pr(S_2=2|a_2,y_2)}{Pr(S_2=1|a_1,y_2)}-\exp \left\{ \frac{N\left( a_1^*-a_2^*+\bar{\epsilon }_{2}-\bar{\epsilon }_1\right) ^2}{2\sigma ^2_\epsilon }\right\} \overset{p}{\rightarrow }0. \end{aligned} \end{aligned}$$
(36)

Note that if \(a_1^*-a_2^*\ne 0\), i.e. there are indeed two regimes, \(\frac{Pr(S_2=2|a_2,y_2)}{Pr(S_2=1|a_1,y_{2_{}})}\) will go to infinity with \(N\) and the Chib’s algorithm will give consistent inference. But if there is only one regime, \(\frac{Pr(S_2=2|a_2,y_2)}{Pr(S_2=1|a_1,y_2)}\) will be asymptotically equivalent to \(\exp \left[ \frac{N (\bar{\epsilon }_{2}-\bar{\epsilon }_1)^2}{2\sigma ^2_\epsilon }\right] =O_p(1)\). Note that due to (33) \(\frac{N(\bar{\epsilon }_{2}-\bar{\epsilon }_1)^2}{2\sigma ^2_\epsilon }\) follows a chi-squared distribution with 1 degree of freedom.Footnote 23 Hence, \(\frac{Pr(S_2=2|a_2,y_2)}{Pr(S_2=1|a_1,y_2)}\) is greater than 1 and \(S_2=2\) will be preferred. The inference result is hence inconsistent.

Using the algorithm by Gerlach et al. (2000) described in the previous section based on (28) and (32) to analyse the same example, we can obtain

$$\begin{aligned} \frac{Pr(K_1=1|y,\sigma ^2_\epsilon ,\sigma ^2_a)}{Pr(K_1=0|y,\sigma ^2_\epsilon ,\sigma ^2_a)}&= \sqrt{\frac{\sigma ^2_\epsilon }{\sigma ^2_\epsilon +N\sigma ^2_a} \frac{2N}{\frac{N\sigma ^2_\epsilon }{\sigma ^2_\epsilon +N\sigma ^2_a}+N}}\nonumber \\&\exp \left[ \frac{1}{2\sigma ^2_\epsilon }\left( \sum _{i=1}^Ny_{i2} -N\bar{y}_1\right) ^2\left( \frac{1}{\frac{N\sigma ^2_\epsilon }{\sigma ^2_\epsilon +N\sigma ^2_a}+N} -\frac{1}{2N}\right) \right] ,\nonumber \\&= \sqrt{\frac{2\sigma ^2_\epsilon }{2\sigma ^2_\epsilon +N\sigma ^2_a}} \exp \left[ \frac{N(a_2^*-a_1^*+\bar{\epsilon }_2-\bar{\epsilon }_1)^2 \sigma ^2_a}{4\sigma ^2_\epsilon (\frac{2\sigma ^2_\epsilon }{N}+\sigma ^2_a)} \right] . \end{aligned}$$
(37)

It is not hard to observe that if there exists a change point, i.e. \(a_2^*-a_1^*\ne 0\), \(\frac{Pr(K_1=1|y,\sigma ^2_\epsilon ,\sigma ^2_a)}{Pr(K_1=0|y,\sigma ^2_\epsilon ,\sigma ^2_a)}\) will tend to infinity; if there is no change point, \(\frac{Pr(K_1=1|y,\sigma ^2_\epsilon ,\sigma ^2_a)}{Pr(K_1=0|y,\sigma ^2_\epsilon ,\sigma ^2_a)}\) will tend to 0. Therefore, the inference of the break point is consistent.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, G. A stochastic frontier model with structural breaks in efficiency and technology. Empir Econ 49, 131–159 (2015). https://doi.org/10.1007/s00181-014-0852-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00181-014-0852-4

Keywords

JEL Classification

Navigation