Skip to main content
Log in

Consumer cohorts and purchases of nonalcoholic beverages

  • Published:
Empirical Economics Aims and scope Submit manuscript

Abstract

Age, period and cohort (APC) variables are included in a demand system that is used to estimate Norwegian purchases of nonalcoholic beverages. To take account of censoring, a two-step method is used. In the first step, the probabilities of purchasing milk, carbonated soft drinks and other soft drinks are estimated by probit models. The APC variables are highly significant. Older cohorts have higher probabilities of purchasing milk and lower probabilities of purchasing carbonated soft drinks than younger cohorts. In the second step, the probability density functions and the cumulative density function are used to correct for censoring. In the corrected demand system, there are positive cohort and negative age effects for milk. These effects suggest that the replacement of older by younger cohorts, in an increasingly older population, will result in reduced per capita purchases of milk. For carbonated soft drinks, there are no cohort or negative age effects, while there are positive age but no cohort effects for other soft drinks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. A quadratic AIDS model that allows for nonlinear Engel curves was also considered. However, the focus of this article is on the inclusion of APC variables in a demand system. A linear version of the AIDS model was used to facilitate the estimation of an already complex demand system.

  2. Attanasio (1998), Chen et al. (2001), Propper et al. (2001) and Mori and Clason (2004) followed Deaton (1985, 1997) and averaged the variable of interest over all individuals in a certain year within the cohort.

  3. Demographic variables could also have been included in the system. However, the inclusion of demographic variables resulted in serious multicolinearity problems.

  4. In this method, one equation models the decision to purchase or not, and a second equation models the expected purchase conditional on a positive purchase. Let \(y_{it}\) be the purchased quantity of good \(i, \mathbf{x}_{\mathbf{it}}\) the vector of covariates in the second-step equation and \(\mathbf{z}_{\mathbf{it}}\) the vector of covariates in the first-step equation. There is a positive purchase if \(\mathbf{{z}^{\prime }}_{\mathbf{it}} {\varvec{\uppsi }}_\mathbf{i} +v_{it} >0,\) where \(v_{it}\) is an error term. The expected mean purchase conditional on a positive purchase is \(E(y_{it} |\mathbf{x}_{\mathbf{it}} ,\mathbf{z}_{\mathbf{it}} ;v_{it} >-\mathbf{{z}^{\prime }}_{\mathbf{it}} {\varvec{\uppsi }}_\mathbf{i} )=f(\mathbf{x}_{\mathbf{it}} ,{\varvec{\beta }}_\mathbf{i} )+\delta _i {\phi (\mathbf{{z}^{\prime }}_{\mathbf{it}} {\varvec{\uppsi }}_\mathbf{i} )}/{\Phi (\mathbf{{z}^{\prime }}_{\mathbf{it}} {\varvec{\uppsi } }_\mathbf{i} )}\), where \(\phi (\cdot )\) is the probability density function and \(\Phi (\cdot )\) is the cumulative distribution function of the standard normal distribution. The expected purchase conditional on nonpurchase is zero, or \(E(y_{it} |\mathbf{x}_{\mathbf{it}} ,\mathbf{z}_{\mathbf{it}} ;v_{it} \le -\mathbf{{z}^{\prime }}_{\mathbf{it}} {\varvec{\uppsi }}_\mathbf{i} )=0,\) and we have the unconditional (on zero/nonzero purchase) mean purchase \(E(y_{it} |\mathbf{x}_{\mathbf{it}} ,\mathbf{z}_{\mathbf{it}} )=\Phi (\mathbf{{z}^{\prime }}_{\mathbf{it}} {\varvec{\uppsi }}_\mathbf{i} )f(\mathbf{x}_{\mathbf{it}} ,{\varvec{\beta }}_\mathbf{i} )+\delta _i \phi (\mathbf{{z}^{\prime }}_{\mathbf{it}} {\varvec{\uppsi }}_\mathbf{i} ).\)

  5. The first step is modeled by univariate probit models, which imply that the decision to purchase one beverage is assumed to be independent of the decisions to purchase other goods. To take account of correlations between the purchase decisions, a potentially better way would be to estimate a multivariate probit model. However, this model is computationally difficult because of the numerical integration of the multivariate normal. The computational difficulty increases because of the bootstrap used in the estimation. Puhani (2000) recommended estimating the model in two steps rather than using maximum likelihood procedures when there are many variables, because of potentially strong colinearity. When using the two-step procedure, the uncertainty in the first step has to be provided for using the bootstrap.

  6. We follow the cohorts for 16 years. As pointed out by one referee, it may be of interest to follow the cohorts over a longer period. However, 16 years is a longer period than used in some other studies. For example, Aristei et al. (2008) used data over the 1997–2002 period.

  7. An alternative specification is a conditional demand system that only includes nonalcoholic beverages. Given available data a two-stage model, with a second-stage demand system for nonalcoholic beverages, could have been estimated. Using the formulas provided in Carpentier and Guyomard (2001), the demand elasticities of the two stages could have been used to calculate elasticities that could have been compared with the demand elasticities in our model. However, this comparison is not pursued in this article. For a related discussion, see Alston et al. (2000), who discussed asymmetric versus conditional demand systems without being able to conclude which specification is the preferred one.

  8. As pointed out by one referee, an alternative price measure could be price indices. We tried to estimate the model using monthly consumer price indices for the different beverage groups. The estimation broke down, probably because of a high degree of multicolinearity among the prices. One possible explanation for this multicolinearity is that the consumer price indices do not take account of regional variability.

  9. Unit prices are constructed by dividing expenditure by quantity for positive purchases. As pointed out by one referee, these unit prices will be downward biased if nonpurchase decisions are caused by too high prices. In our case, the effect of including (quality-corrected) unit prices were small in the probit equations used to model the purchase versus nonpurchase decisions, so any potential bias in the estimated demand functions is likely to be small. Furthermore, Cox and Wohlgenant (1986) used a cross section for 1 year. To avoid outliers, they deleted observations with prices more than five standard deviations from the average (about 2 % of the observations). We have a pooled dataset for 16 years. To avoid outliers, we replace the values below the 0.01 quantile, in each year, with the 0.01 quantile and values above the 0.99 quantile, in each year, with the 0.99 quantile.

  10. These variables are similar to the variables used in Cox and Wohlgenant (1986), Park and Capps (1997) and Kuchler et al. (2005) to filter out the effects of quality differences. Our hypothesized effects include: larger families purchase less convenience products than smaller families, high-income households purchase more expensive brands than low-income households and older households choose different brands than younger households. In addition, there may be regional or seasonal differences: for example, related to transportation costs.

  11. We also tried to include prices in the probit equations. Because of identification problems, the estimation of the second step did not converge and prices were excluded from the first step. However, the choice of first-step variables had only minor effects on the second-step estimates. When we excluded the cohort variables and instead included the price variables, the percentage of correct predictions did not change for any beverage. Furthermore, the second-step elasticities changed only marginally; the own-price elasticity changed from \(-\)0.59 to \(-\)0.56 for milk, from \(-\)1.28 to \(-\)1.21 for carbonated soft drinks, from \(-\)0.91 to \(-\)0.94 for other soft drinks and it did not change for other goods.

  12. The percentage of correct predictions is, however, a questionable measure of the goodness of fit. For example, the naive predictor that every household purchases milk is very good according to this criterion.

  13. For example, when we dropped the equation for other soft drinks instead of the equation for nondurables and services in the estimation, the own-price elasticity of milk changed from \(-\)0.59 to \(-\)0.61, the own-price elasticity of carbonated soft drinks changed from \(-\)1.28 to \(-\)1.31, the own-price elasticity for other soft drinks changed from \(-\)0.91 to \(-\)0.95 and the own-price elasticity for other nondurables and services changed from \(-\)1.02 to \(-\)1.01.

  14. Figure 1 shows the mean annual purchase in the different age/cohort groups, while the data used in the estimation are for household purchases in a 2-week period. The insignificant cohort effects may be caused by household heterogeneity in the data or, as pointed out by a referee, older cohorts may purchase soft drinks for other people like their grandchildren.

  15. As pointed out by one referee, reduced milk purchases is also seen among younger cohorts suggesting that there also may be some changes in taste among consumers.

  16. The differences in numerical values between the two studies are not surprising given the differences between them. Gustavsen and Rickertsen (2003) did not include APC variables, used aggregate time-series data, had a different commodity specification and used a different sample period.

  17. A simultaneous CI for the system can be constructed by a Bonferroni correction. Let \(C_{i}\) denote a confidence statement for coefficient \(i\). Then, the probability that all the CIs simultaneously are true is \(P(\text{ all} C_i \text{ true})=1-P(\text{ at} \text{ least} \text{ one} C_i \text{ false})\ge 1-\sum \limits _{i=1}^m {P(C_i \text{ false})} =1-\sum \limits _{i=1}^m {\left( {1-P(C_i \text{ true})} \right) } =1-(\alpha _1 +\alpha _2 +\cdot \cdot \cdot +\alpha _m ),\) where \(m\) is the number of estimated coefficients and \(\alpha \) is the confidence level. In our case, the simultaneous CI is a 70 % CI.

References

  • Alston J, Chalfant J, Piggott N (2000) The incidence of the costs and benefits of generic advertising. Am J Agric Econ 82:665–671

    Article  Google Scholar 

  • Aristei D, Perali F, Pieroni L (2008) Cohort, age and time effects in alcohol consumption by Italian households: a double-hurdle approach. Empir Econ 35(1):29–61

    Article  Google Scholar 

  • Asche F, Wessells C (1997) On price indices in the almost ideal demand system. Am J Agric Econ 79:1182–1185

    Article  Google Scholar 

  • Attanasio O (1998) Cohort analysis of saving behavior by U.S. households. J Hum Resour 33:575–609

    Article  Google Scholar 

  • Bawa S (2005) The role of the consumption of beverages in the obesity epidemic. J R Soc Promot Health 125:124–128

    Article  Google Scholar 

  • Brownstone D, Valletta R (2001) The bootstrap and multiple imputations: harness increased computing power for improved statistical tests. J Econ Perspectives 15(4):129–141

    Article  Google Scholar 

  • Carpentier A, Guyomard H (2001) Unconditional elasticities in two-stage demand systems: an approximate solution. Am J Agric Econ 83:222–229

    Article  Google Scholar 

  • Chen R, Wong K, Lee H (2001) Age, period and cohort effects on life insurance purchases in the US. J Risk Insur 68(2):303–328

    Article  Google Scholar 

  • Cox T, Wohlgenant M (1986) Prices and quality effects in cross-sectional demand analysis. Am J Agric Econ 68:909–919

    Article  Google Scholar 

  • Deaton A (1985) Panel data from time series of cross-sections. J Econom 30:109–126

    Article  Google Scholar 

  • Deaton A (1997) The analysis of household surveys: a microeconometric approach to development policy. The Johns Hopkins University Press, Baltimore

    Book  Google Scholar 

  • Deaton A, Muellbauer J (1980) An almost ideal demand system. Am Econ Rev 70:312–326

    Google Scholar 

  • Deaton A, Paxson C (1994) Intertemporal choice and inequality. J Political Econ 102:437–467

    Article  Google Scholar 

  • Deaton A, Paxson C (2000) Growth and saving among individuals and households. Rev Econ Stat 82:212–225

    Article  Google Scholar 

  • Departementene (2007) Oppskrift for et sunnere kosthold (A recipe for a healthier diet). http://www.regjeringen.no/upload/kilde/hod/prm/2007/0006/ddd/pdfv/304657-kosthold.pdf. Accessed 5 July 2007

  • Glenn N (1977) Cohort analysis. Sage Publications, London

    Google Scholar 

  • Gustavsen G, Rickertsen K (2003) Forecasting ability of theory-constrained two stage demand systems. Eur Rev Agric Econ 30:539–558

    Article  Google Scholar 

  • Jonas A, Roosen J (2008) Demand for milk labels in Germany: organic milk, conventional brands, and retail labels. Agribus 24:192–206

    Article  Google Scholar 

  • Kjærnes U (1993) A sacred cow: the case of milk in Norwegian nutrition policy. In: Kjærnes U, Holm L, Ekström M, Fürst E, Prättälä R (eds) Regulating markets, regulating people: on food and nutrition policy. Novus Publishers, Oslo, pp 91–106

    Google Scholar 

  • Kuchler F, Tegene A, Harris J (2005) Taxing snack foods: manipulating diet quality or financing information programs? Rev Agric Econ 27:4–20

    Article  Google Scholar 

  • Mori H, Clason D (2004) A cohort approach for predicting future eating habits: the case of at-home consumption of fresh fish and meat in an aging Japanese society. Int Food Agribus Manag Rev 7(1):22–41

    Google Scholar 

  • Mori H, Clason D, Lillywhite J (2006) Estimating price and income elasticities in the presence of age-cohort effects. Agribus 22:201–217

    Article  Google Scholar 

  • Norwegian Agricultural Economics Research Institute (2011) Utsyn over norsk landbruk (Overview of Norwegian agriculture). Oslo

  • Park J, Capps O (1997) Demand for prepared meals by U.S. households. Am J Agric Econ 79:814–824

    Article  Google Scholar 

  • Propper C, Rees H, Green K (2001) The demand for private medical insurance in the UK: a cohort analysis. Econ J 111:180–200

    Article  Google Scholar 

  • Puhani P (2000) The Heckman correction for sample selection and its critique. J. Econ Surv 14(1):53–68

    Article  Google Scholar 

  • Shonkwiler J, Yen S (1999) Two-step estimation of a censored system of equations. Am J Agric Econ 81:972–982

    Article  Google Scholar 

  • StataCorp (2007) Stata statistical software: release 10. Stata Press, College Station

  • Statistics Norway (1996) Survey of consumer expenditure 1992–1994. Official Statistics of Norway, Oslo

  • Statistics Norway (2011) Statbank Norway. http://statbank.ssb.no/statistikkbanken/Default_FR.asp?PXSid=0&nvl=true&PLanguage=0&tilside=selectvarval/define.asp&Tabellid=04886. Accessed 10 January 2011

  • Stewart H, Blisard N (2008) Are younger cohorts demanding less fresh vegetables? Rev Agric Econ 30(1):43–59

    Article  Google Scholar 

  • Tauchmann H (2005) Efficiency of two-step estimators for censored systems of equations: Shonkwiler and Yen reconsidered. Appl Econ 37:367–374

    Article  Google Scholar 

  • Yang Y (2009) Age, period, cohort effects. In: Carr D (ed) Encyclopedia of the life course of human development. Gale Publishing, New York, pp 6–10

    Google Scholar 

Download references

Acknowledgments

The Research Council of Norway, grants 173388/I10 and 190306/I10, provided financial support for this research. The authors thank Rudy Nayga, two anonymous referees, and seminar participants at the 2009 IAAE conference and at the ERS, USDA, Washington DC for useful comments to earlier versions of this article.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kyrre Rickertsen.

Appendices

Appendix 1

See Table 5.

Table 5 Parameter estimates, step 1

Appendix 2: The Monte Carlo experiment

The exclusion of cohort variables may lead to biased parameter estimates when the data-generating process includes cohort effects. To investigate the bias of omitting cohort variables, we performed a Monte Carlo analysis.

First, system (5) was assumed to be the data generating process. The residuals were assumed to follow a multivariate normal distribution \(MN(0, \Sigma )\), where \({\Sigma }\) is the estimated covariance matrix, and 1,000 residuals were drawn for each equation from this distribution. Second, we used the estimated parameters, as reported in Table 2, the observations of the independent variables and the drawn residuals to construct 1,000 observations of the expenditure shares in system (5). Some of the constructed shares were negative and they were set to zero. Third, we estimated system (5) for each of our 1,000 drawings without the APC variables but including a (log of) trend and a (log of) age variable.

The median values of the estimated coefficients in the Monte Carlo analysis and the 99 % confidence intervals (CIs) for the corresponding coefficients in Eq. (5) are shown in Table 6.Footnote 17 The coefficients associated with the intercepts, expenditure terms and the correction parameter \(\phi \) are always outside the CIs. In the carbonated soft drink equation, several of the price parameters are also outside the CIs. Estimated coefficients outside the CIs may be explained in several ways. First, excluding the cohort variables may result in biased effects of other variables suggesting the importance of cohort variables. Second, the log-linear specification of the age and period variables may be wrong.

The median values of the estimated coefficients of the year and age variables and their associated \(t \)values are reported at the bottom of Table 6. These variables are significant with the exception of the trend variable in the carbonated soft drinks equation and the age variable in the other soft drinks equation. However, compared with the model including APC variables, the interpretation of the effects changes in the milk equation. The negative effect of age changes to a positive effect, suggesting that older people purchase more milk than younger people. Furthermore, the positive cohort effect, which suggests that older cohorts consume more milk than younger cohorts, is replaced by a strong negative trend, which suggests that there is a negative trend affecting all age groups simultaneously.

Table 6 Monte Carlo results

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gustavsen, G.W., Rickertsen, K. Consumer cohorts and purchases of nonalcoholic beverages. Empir Econ 46, 427–449 (2014). https://doi.org/10.1007/s00181-013-0688-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00181-013-0688-3

Keywords

JEL Classification

Navigation