Skip to main content

Advertisement

Log in

Dynamic panel data estimation of an integrated Grossman and Becker–Murphy model of health and addiction

  • Published:
Empirical Economics Aims and scope Submit manuscript

Abstract

We propose a dynamic panel data approach to estimate a model that integrates the Becker–Murphy theory of rational addiction with the Grossman model of health investment. We define an individual’s lifetime smoking consumption and investments in health capital as simultaneous choices within a single optimisation problem. We show that this can be estimated using GMM system estimation of two stand-alone single fourth-order difference equations of health capital and smoking. These preserve roots and fundamental dynamics of the original system of four interrelated first-order equations. Monte Carlo simulations confirm that this reduced-form dynamic estimation also produces very similar estimates to the ones of the initial system of equations. We argue that, in the presence of long panel data, this approach may provide a feasible alternative for the estimation of a complex life-cycle model of human capital.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. Note, however, that most studies are not conclusive in this respect and often produce implausible estimates of discount rates. However, see Gruber and Koszegi (2001) for a discussion of potential dynamic inconsistencies in preferences with respect to smoking. Note that our paper is concerned with embedding the Grossman model within the B–M framework and is not explicitly concerned with estimating discount rates.

  2. Baltagi and other authors have also extensively explored the application of panel data methods on both tobacco and alcohol consumption, e.g. Baltagi (2007), Baltagi and Geishecker (2006) and Baltagi and Griffin (2001).

  3. B–M models often include a wealth equation (e.g. Becker and Murphy 1988). This is omitted here as it is not essential to our narrative.

  4. In this version of the model, where borrowing is not permitted, \(C_t\) and \(S_t\) are tied together by the budget constraint (3), which allows us to substitute \(C_t\) out of the problem.

  5. This assumption of linearity does not impact on the qualitative solution to the problem.

  6. Some studies have used biological markers which are used to reflect the addictive stock (Adda and Cornaglia 2006, 2010).

  7. More generally M can represent any good which is beneficial for health but yields no direct utility. We follow the standard approach in the literature and denote this as medical care.

  8. The arguments of utility, for example, \(U_C \left( t\right) \), denote the time period to which utility refers.

  9. This is because the B–M problem is, as noted above, an inter-temporal optimisation problem, which is typically set up as an optimal control problem. The solution equations to an optimal control problem are necessary conditions for optimising the present value of the stream of future utilities, which will arise from future consumption decisions, taking account of how current consumption affects future addiction.

  10. See “Appendix” for an illustration of which exogenous variables from the original system end up in each equation.

  11. It might be argued that this assumption is unrealistic. However, relaxing the assumption requires either the identification of external instruments for \(W_i\), or relying on instruments internal to the model. The latter might consist of transformations of \(X_{it}, \ldots , X_{it-3}\) where a subset of these are assumed to be uncorrelated with the unobserved individual-specific effect in the spirit of Hausman- and Taylor-type estimators (Hausman and Taylor 1981). Alternatively, differences in the lags of the dependent variable can be used as instruments. It is not surprising, however, that such instruments are weak when used in this context. Estimates of the lagged dependent variables, \(S_{it-1},\ldots ,S_{it-4}\) and \(H_{it-1}, \ldots , H_{it-4}\), do not change dramatically for models estimated without the vector of time-invariant regressors.

  12. In addition to the MA(3) structure, the error terms will also include cross-error elements. Note also that the equation-specific disturbance terms in (8) and (9) may contain the disturbance terms from the original system of four equations. See “Appendix” for an illustration of these issues.

  13. The corresponding matrix of coefficients defining the data-generating process (DGP) for the MC exercise is:

    $$\begin{aligned} \begin{bmatrix} Y_{1t} \\ Y_{2t} \end{bmatrix}= & {} \begin{bmatrix} 100\\ 300 \end{bmatrix} +\begin{bmatrix} 0.9&-0.4\\ .10&.95 \end{bmatrix} \begin{bmatrix} Y_{1t-1} \\ Y_{2t-1} \end{bmatrix} +\begin{bmatrix} 0.4&0\\ 0&0.6 \end{bmatrix} \begin{bmatrix} X_{1t} \\ X_{2t} \end{bmatrix} +\begin{bmatrix} \varepsilon _{1t} \\ \varepsilon _{2t} \end{bmatrix}\\ \begin{bmatrix} X_{1t} \\ X_{2t} \end{bmatrix}= & {} \begin{bmatrix} 0.6&0\\ 0&0.4 \end{bmatrix} \begin{bmatrix} X_{1t-1} \\ X_{2t-1} \end{bmatrix} +\begin{bmatrix} \epsilon _{1t} \\ \epsilon _{2t} \end{bmatrix} \end{aligned}$$

    The matrix multiplying the vector of lagged Y’s contains the \(\phi \)’s, while the matrix multiplying the vector of X’s includes the \(\theta \)’s. For simplicity, we made this matrix of Y’s diagonal such that \(\theta _{12}=\theta _{21}\)= 0.

  14. To run the experiment, we use the MC routine within the PcNaive module of the PcGive econometrics package. For the first equation, the initial values for the lagged Y’s were 10,000 for \(Y_{t-1}\) and 9000 for \(Y_{t-2}\). For the second equation, the initial values for the lagged Y’s were also 10,000 for \(Y_{t-1}\) and 9000 for \(Y_{t-2}\). The X’s were drawn from a normal distribution with mean 0 and variance 100 for both equations. Since we have a dynamically stable and unique equilibrium, regardless of what initial values we chose, the system would converge on that equilibrium. We simply chose these values to be sure we were starting the process a long way away from the equilibrium.

  15. We have also explored extending the simulation exercise to a system of four first-order difference equations, results are available upon request.

  16. Note that further lagged differences of the dependent variable are redundant when combined with instruments for the first-differenced equation (see Blundell and Bond 1998).

  17. We have explored specifications where lags of smoking and health enter the same equation as well as the joint estimation of a system of two second-order difference equations, each including lags of health and smoking. The former were estimated by employing a similar GMM approach and overall did not appear to produce reliable estimates if compared to their OLS within fixed effects counterparts. (Estimates are available upon request.) The latter was estimated via GMM three-stage least squares (3SLS) and failed to achieve convergence.

  18. Before 1999 (wave 9), Scottish individuals were only sampled if they resided South of the Caledonian Canal.

  19. For further details on the BHPS sample structure, see Lynn (2006).

  20. The lag structure imposed by serial correlation in the error term determines the exact number of lags required to construct valid instruments.

  21. Additional and updated information on the SF-36 and its related literature are available on the SF-36 community web page (http://www.sf-36.org).

  22. As specified in Brazier et al. (2002), to build the SF-6D (where 6D stands for six dimensions) they have excluded general health items and collapsed the two dimensions of role limitations due to physical and emotional problems into a single role limitations dimension.

  23. These cover problems related to arms, legs or hands, sight, hearing, skin conditions, chest/breathing, heart/blood pressure, stomach or digestion, diabetes, anxiety/depression, alcohol or drug use, epilepsy, migraine.

  24. Due to a change in wording and response categories in the SAH question at wave 9, we collapse the original five-category self-assessed variable (SAH) to a four-category measure. In waves 1–8 and 10–18, respondents are asked: Compared to people of your own age, would you say your health over the last 12 months on the whole has been: excellent, good, fair, poor or very poor?, whereas in wave 9, the question and possible answers are: In general, would you say your health is: excellent, very good, good, fair, poor?. Creating a SAH variable with four health categories (excellent, good or very good, fair, poor or very poor) allows common support over the two versions of the question.

  25. We have also employed alternative specifications to compute predicted SF-6D values for all individuals in the sample such as linear fixed effects models. We have also estimated versions of these models (pooled OLS and linear fixed effects models) with lagged values of all regressors (health variables) to ease potential problems related to endogeneity. As predicted SF-6D scores and results from the main dynamic empirical models of health capital and smoking do not appear to differ across these specifications, we use results from the simple pooled OLS models to maximise the number of observations.

  26. Other employment consists of looking after the family, maternity leave, government training, student or other jobs.

  27. The set of regional dummy variables contains little variation across the waves, and accordingly these are categorised as the region in which a respondent was observed to reside the longest. The regions cover England (South East, South West, London, Midlands, Yorkshire, North West, North East), Scotland, Wales and Northern Ireland. The South East is taken as the baseline.

  28. It is plausible that the inclusion of higher- order dynamics via four lags in our models may absorb any remaining serial correlation in the disturbances.

  29. \(S_{it-2}, \ldots , S_{it1}\) are potential instruments for \(\Delta S_{it-1},\ldots , \Delta S_{it-4}\). Similarly, for \(\Delta H_{it-1}, \ldots , \Delta H_{it-4}\).

  30. These results are available on request.

  31. The year dummies, which are not reported, indicate a decreasing trend in smoking across the waves for men but not for women and no discernible trends for health.

  32. For example, it has been reported elsewhere that Black-Caribbean and Other South Asian women have a far greater prevalence of regular smoking (at levels slightly lower than white ethnic groups) compared to Bangladeshi, Indian, Pakistani , Chinese and other Black minority ethnic groups (Mellward and Karlson 2011). There is evidence of geographical variation, particularly for women where areas associated with decreased health (compared to the baseline of South East of England) are also associated with increased smoking.

  33. Of the remaining variables, the lack of significance may well be a consequence of a lack of within-individual variation. Alternatively, it may indicate that these variables do not have a role in shifting the optimal trajectory for an individual. If these are variables which do not change often and which have relatively small impacts on the position of the individual’s optimal trajectory, it is not surprising that, conditional on the intrinsic dynamics, they do not appear important.

References

  • Adams P, Hurd MD, McFadden D, Merrill A, Ribeiro T (2003) Healthy, wealthy, and wise? Tests for direct causal paths between health and socioeconomic status. J Econom 112:3–56

    Article  Google Scholar 

  • Adda J, Cornaglia F (2006) Taxes, cigarette consumption, and smoking intensity. Am Econ Rev 96(4):1013–1028

    Article  Google Scholar 

  • Adda J, Cornaglia F (2010) The effect of bans and taxes on passive smoking. Am Econ J Appl Econ 2(1):1–32

    Article  Google Scholar 

  • Adda J, Lechene V (2013) Health selection and the effect of smoking on mortality. Scand J Econ 115(3):902–931

    Article  Google Scholar 

  • Arcidiacono P (2007) Living rationally under the volcano? An empirical analysis of heavy drinking and smoking. Int Econ Rev 48(1):37–65

    Article  Google Scholar 

  • Arellano M, Bond S (1991) Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations. Rev Econ Stud 58(2):277–297

    Article  Google Scholar 

  • Balia S, Jones AM (2008) Mortality, lifestyle and socio-economic status. J Health Econ 27(1):1–26

    Article  Google Scholar 

  • Baltagi BH (2007) On the use of panel data methods to estimate rational addiction models. Scot J Polit Econ 54(1):1–18

    Article  Google Scholar 

  • Baltagi BH, Geishecker I (2006) Rational alcohol addiction: evidence from the Russian Longitudinal Monitoring Survey. Health Econ 15(9):893–914

    Article  Google Scholar 

  • Baltagi BH, Griffin JM (2001) The econometrics of rational addiction: the case of cigarettes. J Bus Econ Stat 19(4):449–454

    Article  Google Scholar 

  • Baltagi BH, Griffin JM (2002) Rational addiction to alcohol: panel data analysis of liquor consumption. Health Econ 11(6):485–491

    Article  Google Scholar 

  • Baltagi BH, Levin D (1986) Estimating dynamic demand for cigarettes using panel data: the effects of bootlegging, taxation and advertising reconsidered. Rev Econ Stat 68(1):148–155

    Article  Google Scholar 

  • Becker GS (1965) A theory of the allocation of time. Econ J 75(299):493–517

    Article  Google Scholar 

  • Becker GS, Murphy KM (1988) A theory of rational addiction. J Polit Econ 96(4):675–700

    Article  Google Scholar 

  • Becker GS, Woytinsky WS (1993) Human capital and the personal distribution of income: an analytical approach. In: Becker GS (ed) 1967. Institute of Public Administration, University of Michigan, Ann Arbor. Human Capital (3rd edn). University of Chicago Press, pp 102–158

  • Becker GS, Grossman M, Murphy KM (1994) An empirical analysis of cigarette addiction. Am Econ Rev 84(3):396–418

    Google Scholar 

  • Blundell R, Bond S (1998) Initial conditions and moment restrictions in dynamic panel-data models. J Econom 87(1):115–143

    Article  Google Scholar 

  • Blundell R, Bond S, Windmeijer F (2001) Estimation in dynamic panel-data models: improving the performance of the standard GMM estimator. In: Baltagi B, Fomby TB, Carter Hill R (eds) Nonstationary panels, cointegrating panels and dynamic panels, advances in econometrics, volumn 15. Elsevier, New York, pp 53–92

    Google Scholar 

  • Brazier J, Roberts J, Deverill M (2002) The estimation of a preference-based measure of health from the SF-36. J Health Econ 21(2):271–292

    Article  Google Scholar 

  • Chaloupka F (1991) Rational addictive behavior and cigarette smoking. J Polit Econ 99(4):722–742

    Article  Google Scholar 

  • Christelis D, Sanz-de-Galdeano A (2009) Smoking persistence across countries: an analysis using semi-parametric dynamic panel data models with sensitivity. Discussion Paper No. 4336, Institute for the Study of Labour (IZA)

  • Contoyannis P, Jones AM, Rice N (2004) The dynamics of health in the British Household Panel Survey. J Appl Econom 19:473–503

    Article  Google Scholar 

  • Darden M (2017) Smoking, expectations, and health: a dynamic stochastic model of lifetime smoking behaviour. J Polit Econ 125:1465–1522

    Article  Google Scholar 

  • Doll R, Peto R, Boreham J, Sutherland I (2004) Mortality in relation to smoking: 50 years’ observations on male British doctors. Br Med J. https://doi.org/10.1136/bmj.38142.554479.AE

    Article  Google Scholar 

  • Ehrlich I, Chuma H (1990) A model of the demand for longevity and the value of life extension. J Polit Econ 98(4):761–782

    Article  Google Scholar 

  • Ferguson B, Lim GC (2003) Dynamic economic models in discrete time: theory and empirical applications. Routledge, New York

    Book  Google Scholar 

  • Forster M (2001) The meaning of death: some simulations of a model of healthy and unhealthy consumption. J Health Econ 20(4):613–638

    Article  Google Scholar 

  • Frijters P, Haisken-DeNew JP, Shields MA (2005) The causal effect of incoem on health: evidence from the German reunification. J Health Econ 24(5):997–1017

    Article  Google Scholar 

  • Galama TS, van Kippersluis H (2015) A theory of education and health. CESR-Schaeffer working Paper No 2015-001

  • Gilleskie D (1998) A dynamic stochastic model of medical care use and work absence. Econometrica 66(1):1–45

    Article  Google Scholar 

  • Goldberg D, Williams P (1988) A users guide to the general health questionnaire. NFER-Nelson, Windsor

    Google Scholar 

  • Grossman M (1972) On the concept of health capital and the demand for health. J Polit Econ 80(2):223–255

    Article  Google Scholar 

  • Grossman M (1993) The economic analysis of addictive behavior. In: Hilton ME, Bloss G (eds) Economics and the prevention of alcohol-related problems. NIAAA Research Monograph No. 25, NIH Pub. No. 93–3513. National Institute on Alcohol Abuse and Alcoholism, Bethesda, MD, pp 91–123

  • Grossman M (2000) Chapter 7: the human capital model. In: Culyer AJ, Newhouse JP (eds) Handbook of health economics, vol 1. Elsevier, North Holland

    Google Scholar 

  • Grossman M, Chaloupka FJ (1998) The demand for cocaine by young adults: a rational addiction approach. J Health Econ 17(4):427–474

    Article  Google Scholar 

  • Grossman M, Chaloupka FJ, Sirtalan I (1998) An empirical analysis of alcohol addiction: results from the monitoring the future panels. Econ Inq 36(1):39–48

    Article  Google Scholar 

  • Gruber J, Koszegi B (2001) Is addiction rational? Theory and evidence. Q J Econ 116(4):1261–1303

    Article  Google Scholar 

  • Hai R, Heckman J (2014) A dynamic model of health, education and wealth with credit constraints and rational addiction. http://www.bus.miami.edu/_assets/files/faculty-and-research/conferences-and-seminars/economics-seminars/falls-2014/papers_Hai.pdf

  • Hausman J, Taylor W (1981) Panel data and unobserved individual effects. Econometrica 49(6):1377–1398

    Article  Google Scholar 

  • Jones AM, Labeaga JM (2003) Individual heterogeneity and censoring in panel data estimates of tobacco expenditure. J Appl Econom 18(2):157–177

    Article  Google Scholar 

  • Keeler TE, Hu T-W, Barnett PG, Manning WG (1993) Taxation, regulation, and addiction: a demand function for cigarettes based on time-series evidence. J Health Econ 12(1):1–18

    Article  Google Scholar 

  • Labeaga JM (1999) A double-hurdle rational addiction model with heterogeneity: estimating the demand for tobacco. J Econom 93(1):49–72

    Article  Google Scholar 

  • Leonard D, Long NV (1992) Optimal control theory and static optimization in economics. Cambridge University Press, Cambridge, pp 307–310

  • Lindström M (2010) Social capital, economics conditions, marital status and daily smoking: a population-based study. Public Health 124(2):71–77

    Article  Google Scholar 

  • Lynn P (ed) (2006) Quality profile: British Household Panel Survey Version 2.0: Waves 1 to 13: 1991–2003. Institute for Social and Economic Research, University of Essex, Colchester

  • Meer J, Miller D, Rosen H (2003) Exploring the health-wealth nexus. J Health Econ 22:713–730

    Article  Google Scholar 

  • Mellward D, Karlson S (2011) Tobacco use among minority ethnic populations and cessation interventions. A race equality foundation briefing paper, London

  • Mincer J (1974) Schooling, experience and earnings. Columbia University Press for the National Bureau of Economic Research, New York

    Google Scholar 

  • Nickell S (1981) Biases in dynamic models with fixed effects. Econometrica 49(6):1417–1426

    Article  Google Scholar 

  • Pitchford JD (1977) Two state variable problems. In: Pictchford JD, Turnovsky SJ (eds) Applications of control theory to economic analysis. North-Holland, Amsterdam, pp 127–154

    Google Scholar 

  • Saffer H, Chaloupka F (1999) The demand for illicit drugs. Econ Inq 37(3):401–411

    Article  Google Scholar 

  • Wagstaff A (1986) The demands for health: some new empirical evidence. J Health Econ 5:195–233

    Article  Google Scholar 

  • Ware JE, Snow KK, Kolinski M, Gandeck B (1993) SF-36 Health survey manual and interpretation guide. The Health Institute, New England Medical Centre, Boston, MA

    Google Scholar 

  • Waters TM, Sloan FA (1995) Why do people drink? Tests of the rational addiction model. Appl Econ 27(8):727–736

    Article  Google Scholar 

  • Windmeijer F (2005) A finite sample correction for the variance of linear efficient two-step GMM estimators. J Econom 126:25–52

    Article  Google Scholar 

  • Zweifel P (2012) The Grossman model after 40 years. Eur J Health Econ 13(6):677–682

    Article  Google Scholar 

Download references

Acknowledgements

The authors gratefully acknowledge funding from the Economic and Social Research Council (ESRC) under Grant Reference RES-060-25-0045. We are grateful to Victor Aguirregabiria, Jonneke Bohlaar, Brenda Gannon, Eleonora Fichera, Aloysius Siow, Matt Sutton, and participants of the 8th International Health Economics Association (iHEA) Congress in Toronto, the 9th European Conference on Health Economics (ECHE) in Zurich, the 2012 Applied Health Econometrics Symposium at the University of Leeds, the 2012 Netspar Conference on Health and Inequality in the Life-cycle in Rotterdam as well as participants of seminars at Universities of Barcelona, Leeds, Newcastle, Toronto and York for discussions and helpful comments. Data from the British Household Panel Survey were supplied by the Economic and Social Research Council Data Archive. Neither the original collectors of the data nor the Archive bear any responsibility for the analysis or interpretations presented in this manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Eugenio Zucchelli.

Appendices

Appendix A

Consider the system of two second-order difference equations (10) for \(Y_1\) and \(Y_2\) as follows:

$$\begin{aligned} Y_{1t}&= \phi _{10} + \phi _{11} Y_{1t-1} + \phi _{12} Y_{2t-1} + \theta _{11} X_{1t} + \theta _{12} X_{2t} + \varepsilon _{1t} \\ Y_{2t}&= \phi _{20} + \phi _{21} Y_{1t-1} + \phi _{22} Y_{2t-1} + \theta _{21} X_{1t} + \theta _{22} X_{2t} + \varepsilon _{2t} \end{aligned}$$

Abstracting from the constant terms without loss of generality, the above can be written in matrix form as,

$$\begin{aligned} \begin{bmatrix} Y_{1t} \\ Y_{2t} \end{bmatrix} =\begin{bmatrix} \phi _{11}&\phi _{12} \\ \phi _{21}&\phi _{22} \end{bmatrix} \begin{bmatrix} LY_{1t} \\ LY_{2t} \end{bmatrix} +\begin{bmatrix} \theta _{11}&\theta _{12} \\ \theta _{21}&\theta _{22} \end{bmatrix} \begin{bmatrix} X_{1t} \\ X_{2t} \end{bmatrix} +\begin{bmatrix} \epsilon _{1t} \\ \epsilon _{2t} \end{bmatrix} \end{aligned}$$
(18)

where L is the lag operator. Since L is multiplicative, Eq. (18) can be expressed as,

$$\begin{aligned} \begin{bmatrix} Y_{1t} \\ Y_{2t} \end{bmatrix} =\begin{bmatrix} L\phi _{11}&L\phi _{12} \\ L\phi _{21}&L\phi _{22} \end{bmatrix} \begin{bmatrix} Y_{1t} \\ Y_{2t} \end{bmatrix} +\begin{bmatrix} \theta _{11}&\theta _{12} \\ \theta _{21}&\theta _{22} \end{bmatrix} \begin{bmatrix} X_{1t} \\ X_{2t} \end{bmatrix} +\begin{bmatrix} \epsilon _{1t} \\ \epsilon _{2t} \end{bmatrix} \end{aligned}$$

Combining the vectors for \(Y_t\) gives,

$$\begin{aligned} \begin{bmatrix} 1-L\phi _{11}&-L\phi _{12} \\ -L\phi _{21}&1-L\phi _{22} \end{bmatrix} \begin{bmatrix} Y_{1t} \\ Y_{2t} \end{bmatrix} =\begin{bmatrix} \theta _{11}&\theta _{12} \\ \theta _{21}&\theta _{22} \end{bmatrix} \begin{bmatrix} X_{1t} \\ X_{2t} \end{bmatrix} +\begin{bmatrix} \epsilon _{1t} \\ \epsilon _{2t} \end{bmatrix} \end{aligned}$$

This can be rearranged such that,

$$\begin{aligned} \begin{bmatrix} Y_{1t} \\ Y_{2t} \end{bmatrix} =\begin{bmatrix} 1-L\phi _{11}&-L\phi _{12} \\ -L\phi _{21}&1-L\phi _{22} \end{bmatrix}^{-1} \begin{bmatrix} \theta _{11}&\theta _{12} \\ \theta _{21}&\theta _{22} \end{bmatrix} \begin{bmatrix} X_{1t} \\ X_{2t} \end{bmatrix} +\begin{bmatrix} 1-L\phi _{11}&-L\phi _{12} \\ -L\phi _{21}&1-L\phi _{22} \end{bmatrix}^{-1} \begin{bmatrix} \epsilon _{1t} \\ \epsilon _{2t} \end{bmatrix} \end{aligned}$$
(19)

The inverse matrix which appears twice on the RHS of Eq. (19) can be written:

$$\begin{aligned} \begin{bmatrix} 1-L\phi _{11}&-L\phi _{12} \\ -L\phi _{21}&1-L\phi _{22} \end{bmatrix}^{-1} =\frac{1}{DetA} \begin{bmatrix} 1-L\phi _{22}&L\phi _{12} \\ L\phi _{21}&1-L\phi _{11} \end{bmatrix} \end{aligned}$$
(20)

where DetA is the determinant of the matrix being inverted:

$$\begin{aligned} DetA = [1-L\phi _{11}] [1-L\phi _{22}] - L^2 \phi _{12} \phi _{21} = 1 - [\phi _{11} + \phi _{22}]L + [\phi _{11}\phi _{22} - \phi _{12}\phi _{21}]L^2 \end{aligned}$$
(21)

Using Eq. (20) allows us to write,

$$\begin{aligned} DetA\begin{bmatrix} Y_{1t} \\ Y_{2t} \end{bmatrix} =\begin{bmatrix} 1-L\phi _{22}&L\phi _{12} \\ L\phi _{21}&1-L\phi _{11} \end{bmatrix} \begin{bmatrix} \theta _{11}&\theta _{12} \\ \theta _{21}&\theta _{22} \end{bmatrix} \begin{bmatrix} X_{1t} \\ X_{2t} \end{bmatrix} +\begin{bmatrix} 1-L\phi _{22}&L\phi _{12} \\ L\phi _{21}&1-L\phi _{11} \end{bmatrix} \begin{bmatrix} \epsilon _{1t} \\ \epsilon _{2t} \end{bmatrix} \end{aligned}$$
(22)

recalling that DetA is a scalar—the determinant of the matrix to be inverted.

Multiplying out the first two matrices on the RHS, such that

$$\begin{aligned} \begin{bmatrix} 1-L\phi _{22}&\quad L\phi _{12} \\ L\phi _{21}&\quad 1-L\phi _{11} \end{bmatrix} \begin{bmatrix} \theta _{11}&\quad \theta _{12} \\ \theta _{21}&\quad \theta _{22} \end{bmatrix} =\begin{bmatrix} [1-L\phi _{22}] \theta _{11} + L\phi _{12}\theta _{21}&\quad [1-L\phi _{22}] \theta _{12} + L\phi _{12}\theta _{22} \\ \theta _{11} L\phi _{21} + [1-L\phi _{11}] \theta _{21}&\quad L\phi _{21} \theta _{12} + [1-L\phi _{11} ]\theta _{22} \end{bmatrix} \end{aligned}$$
(23)

Substituting Eq. (23) into Eq. (22) gives,

$$\begin{aligned} DetA\begin{bmatrix} Y_{1t} \\ Y_{2t} \end{bmatrix}= & {} \begin{bmatrix} [1-L\phi _{22}] \theta _{11} + L\phi _{12}\theta _{21}&\quad [1-L\phi _{22}] \theta _{12} + L\phi _{12}\theta _{22} \\ \theta _{11} L\phi _{21} + [1-L\phi _{11}] \theta _{21}&\quad L\phi _{21} \theta _{12} + [1-L\phi _{11} ]\theta _{22} \end{bmatrix} \begin{bmatrix} X_{1t} \\ X_{2t} \end{bmatrix}\nonumber \\&+\begin{bmatrix} 1-L\phi _{22}&L\phi _{12} \\ L\phi _{21}&1-L\phi _{11} \end{bmatrix} \begin{bmatrix} \epsilon _{1t} \\ \epsilon _{2t} \end{bmatrix} \end{aligned}$$
(24)

Now take the top row of Eq. (24), which is the expression for \(Y_{1t}\) where we are assuming that \(Y_{1}\) is the observable variable. That will be

$$\begin{aligned} DetA Y_{1t}= & {} \theta _{11} X_{1t} + [\phi _{12} \theta _{21} - \phi _{22}\theta _{11}] LX_{1t} + \theta _{12}X_{2t} + [\phi _{12}\theta _{22} - \phi _{22} \theta _{12}] LX_{2t} \\&+\,[\epsilon _{1t} - \phi _{22} L \epsilon _{1t} + \phi _{12} L \epsilon _{2t}] \end{aligned}$$

Substituting for DetA from Eq. (21),

$$\begin{aligned}&\left[ 1 - \left[ \phi _{11} + \phi _{22} \right] L + \left[ \phi _{11}\phi _{22} - \phi _{12} \phi _{21} \right] L^2 \right] Y_{1t} = \phi _{11} X_{1t} + \left[ \phi _{12}\theta _{21} - \phi _{22} \theta _{11} \right] LX_{1t} \nonumber \\&\quad +\, \theta _{12} X_{2t} + \left[ \phi _{12} \theta _{22} - \phi _{22} \theta _{12} \right] LX_{2t} + \left[ \epsilon _{1t} - \phi _{22} L \epsilon _{1t} + \phi _{12} L \epsilon _{2t} \right] \end{aligned}$$
(25)

Applying the lag operator gives,

$$\begin{aligned}&Y_{1t} - \left[ \phi _{11} + \phi _{22} \right] Y_{1t-1} + \left[ \phi _{11}\phi _{22} - \phi _{12} \phi _{21} \right] Y_{1t-2} = \phi _{11} X_{1t}\\&\quad +\, \left[ \phi _{12}\theta _{21} - \phi _{22} \theta _{11} \right] X_{1t-1} \\&\quad +\, \theta _{12} X_{2t} + \left[ \phi _{12} \theta _{22} - \phi _{22} \theta _{12} \right] X_{2t-1} + \left[ \epsilon _{1t} - \phi _{22} \epsilon _{1t-1} + \phi _{12} \epsilon _{2t-1} \right] \end{aligned}$$

Isolating \(Y_{1t}\) on the left-hand side gives,

$$\begin{aligned} Y_{1t}= & {} \left[ \phi _{11} + \phi _{22} \right] Y_{1t-1} - \left[ \phi _{11}\phi _{22} - \phi _{12} \phi _{21} \right] Y_{1t-2}\nonumber \\&+\, \phi _{11} X_{1t} + \left[ \phi _{12}\theta _{21} - \phi _{22} \theta _{11} \right] X_{1t-1} \nonumber \\&+\, \theta _{12} X_{2t} + \left[ \phi _{12} \theta _{22} - \phi _{22} \theta _{12} \right] X_{2t-1} + \left[ \epsilon _{1t} - \phi _{22} \epsilon _{1t-1} + \phi _{12} \epsilon _{2t-1} \right] \nonumber \\ \end{aligned}$$
(26)

The purpose of the transformation using the lag operators is to leave us with, on the RHS, lagged values of the observable Y, values of the exogenous variables, and a disturbance term which involves (in the 2 equation case) both \(\epsilon _{1}\) and \(\epsilon _{2}\) values and, by its involvement of both current and lagged \(\epsilon \)’s, requiring us to use an estimating method which allows for a serially correlated residual term.

The structure in Eq. (26) also has the appeal that we can see how the roots of the system are derived from the coefficients on the two lags of \(Y_1\) and, by comparing it with the original matrix form, we can see that the roots of Eq. (26) will be the same as the roots of the original system of two FODEs, justifying our argument that the dynamics of the system are retained by the transformation.

Appendix B

Instrument set for single fourth-order equation in first-differenced form

$$\begin{aligned} \left( \begin{array}{ccccccccccccc} S_{i1} &{} 0 &{} 0 &{} \ldots &{} \ldots &{} 0 &{} \ldots &{} \ldots &{} 0 &{} \Delta X_{i6} &{} \Delta X_{i5} &{} \Delta X_{i4} &{} \Delta X_{i3}\\ 0 &{} S_{i1} &{} S_{i2} &{} \ldots &{} \ldots &{} 0 &{} \ldots &{} \ldots &{} 0 &{} \Delta X_{i7} &{} \Delta X_{i6} &{} \Delta X_{i5} &{} \Delta X_{i4} \\ \vdots &{} \vdots &{} \vdots &{} \ddots &{} &{} \vdots &{} \ddots &{} &{} \vdots &{} \vdots &{} \vdots &{} \vdots &{} \vdots \\ \vdots &{} \vdots &{} \vdots &{} &{} \ddots &{} \vdots &{} &{} \ddots &{} \vdots &{} \vdots &{} \vdots &{} \vdots &{} \vdots \\ 0 &{} 0 &{} 0 &{} \ldots &{} \ldots &{} S_{i1} &{} \ldots &{} \dots &{} S_{iT-5} &{} \Delta X_{iT} &{} \Delta X_{iT-1} &{} \Delta X_{iT-2} &{} \Delta X_{iT-3} \\ \end{array} \right) \end{aligned}$$

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jones, A.M., Laporte, A., Rice, N. et al. Dynamic panel data estimation of an integrated Grossman and Becker–Murphy model of health and addiction. Empir Econ 56, 703–733 (2019). https://doi.org/10.1007/s00181-017-1367-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00181-017-1367-6

Keywords

JEL classification

Navigation