Skip to main content
Log in

Multi-category purchase incidences with marketing cross effects

  • Original Paper
  • Published:
Review of Managerial Science Aims and scope Submit manuscript

Abstract

We focus on cross effects of marketing variables and cross category dependences for multi-category decisions which households take during a shopping trip to a retail store. A cross effect is defined as the effect which a marketing variable used for a certain product category exerts on purchases of another category. Using Dirichlet process mixture models with multivariate probit components we analyze purchase incidences of 24,047 shopping visits of a random sample of 1500 households. Independent variables of these models encompass marketing variables for 25 product categories and household attributes. We discuss differences between the two best performing models, a full model which includes both cross effects and cross category dependences, and a related restricted model which ignores cross effects. We obtain several high and significant differences with respect to category constants and cross category dependences between these two models. We also present explanations for the larger (in absolute terms) cross effects of features or displays. We demonstrate that by ignoring cross effects management runs the risk to obtain in many product categories too optimistic forecasts of sales revenue changes due to promotions. In contrast to previous related work suggesting not to use promotions which are not tailored to individual households in any of the investigated categories, we obtain support for such promotions in at least 48 % of the 25 product categories. In addition, based on the full model we demonstrate that often different categories are appropriate for promotions which are targeted at household clusters.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Albert J, Chib S (1993) Bayesian analysis of binary and polychotomous response data. J Am Stat Assoc 88:669–679

    Article  Google Scholar 

  • Allenby G, Lenk PJ (1995) Reassessing brand loyalty, price sensitivity, and merchandising effects on consumer brand choice. J Bus Econ Stat 13:281–290

    Google Scholar 

  • Ansari A, Mela CF (2003) E-customization. J Market Res 40:131–145

    Article  Google Scholar 

  • Banerjee S, Yandell BS, Yi NJ (2008) Bayesian quantitative trait loci mapping for multiple traits. Genetics 4:2275–2289

    Article  Google Scholar 

  • Barnard J, McCulloch R, Meng XL (2000) Modeling covariance matrices in terms of standard deviations and correlations with application to shrinkage. Stat Sin 10:1281–1311

    Google Scholar 

  • Berry MJA, Linoff GS (2004) Data mining techniques for marketing, sales, and customer relationship management, 2nd edn. Wiley, Indianapolis, IN

    Google Scholar 

  • Betancourt R, Gautschi D (1990) Demand complementarities, household production, and retail assortments. Market Sci 9:146–161

    Article  Google Scholar 

  • Bodapati AV, Srinivasan V (2006) The impact of feature advertising on customer store choice. Research Paper 1935, Graduate School of Business, Stanford

  • Boztuğ Y, Hildebrandt L (2008) Modeling joint purchases with a multivariate MNL approach. Schmalenbach Bus Rev 60:400–422

    Google Scholar 

  • Boztuğ Y, Reutterer T (2008) A combined approach for segment-specific market basket analysis. Eur J Oper Res 187:294–312

    Article  Google Scholar 

  • Boztuğ Y, Silberhorn N (2006) Modellierungsansätze in der Warenkorbanalyse im Überblick. J Betr Wirtsch 56:105–128

    Google Scholar 

  • Bronnenberg BJ, Kruger MW, Mela CF (2008) The IRI marketing data set. Market Sci 27:745–748

    Article  Google Scholar 

  • Chib S, Greenberg E (1998) Bayesian analysis of multivariate probit models. Biom 85:347–361

    Google Scholar 

  • Chib S, Seetharaman PB, Strijnev A (2002) Analysis of multi-category purchase incidence decisions using IRI market basket data. In: Franses PH, Montgomery AL (eds) Econometric models in marketing. JAI, Amsterdam, pp 57–92

    Google Scholar 

  • Corstjens M, Doyle P (1981) A model for optimizing retail space allocations. Manag Sci 27:822–833

    Article  Google Scholar 

  • Dippold K, Hruschka H (2013) A model of heterogeneous multicategory choice for market basket analysis. Rev Market Sci 11:1–31

    Article  Google Scholar 

  • Duvvuri SD, Ansari V, Gupta S (2007) Consumers’ price sensitivities across complementary categories. Manag Sci 53:1933–1945

    Article  Google Scholar 

  • Fox EJ, Hoch SJ (2005) Cherry-picking. J Market 69:46–62

    Article  Google Scholar 

  • Gelfand AE (1996) Model determination using sampling-based methods. In: Gilks WR, Richardson S, Spiegelhalter DJ (eds) Markov chain Monte Carlo in practice. Chapman & Hall, Boca Raton, pp 145–161

    Google Scholar 

  • Greene WH (2003) Econometric analysis, 6th edn. Prentice Hall, Upper Saddle River, NJ

    Google Scholar 

  • Ibrahim JG, Kleinman KP (1998) Semiparametric Bayesian methods for random effects models. In: Dey D, Müller P, Sinha D (eds) Practical nonparametric and semiparametric Bayesian statistics. Springer, New York, pp 89–114

    Chapter  Google Scholar 

  • Kruger MW, Pagni D (2008) IRI academic data set description. Information Resources Incorporated, Chicago

    Google Scholar 

  • Liu X, Daniels MJ (2006) A new algorithm for simulating a correlation matrix based on parameter expansion and reparameterization. J Comp Graph Stat 15:897–914

    Article  Google Scholar 

  • Manchanda P, Ansari A, Gupta S (1999) The “Shopping Basket”: a model for multi-category purchase incidence decisions. Market Sci 18:95–114

    Article  Google Scholar 

  • Neal RM (2000) Markov chain sampling methods for Dirichlet process mixture models. J Comp Graph Stat 9:249–65

    Google Scholar 

  • Neslin S (2002) Sales promotion. In: Weitz BA, Wensley R (eds) Handbook of marketing. Sage, Thousand Oaks, CA, pp 310–338

    Chapter  Google Scholar 

  • Russell GJ, Petersen A (2000) Analysis of cross category dependence in market basket selection. J Retail 76:369–392

    Article  Google Scholar 

  • Seetharaman PB, Siddhartha C, Ainslie A, Boatwright P, Chan T, Gupta S, Mehta N, Rao V, Strijnev A (2005) Models of multi-category choice behavior. Market Lett 16:239–254

    Article  Google Scholar 

  • Shocker A, Bayus B, Kim N (2004) Product complements and substitutes in the real world: the relevance of other products. J Market 68:28–40

    Article  Google Scholar 

  • Song I, Chintagunta PK (2007) A discrete-continuous model for multi-category purchase behavior of households. J Market Res 44:595–612

    Article  Google Scholar 

  • Tellis G (1988) Advertising exposure, loyalty, and brand purchase: a two-stage model of choice. J Market Res 25:134–144

    Article  Google Scholar 

  • Zellner A (1971) An introduction to Bayesian inference in econometrics. Wiley, New York

    Google Scholar 

Download references

Acknowledgments

I thank two anonymous reviewers for their helpful suggestions to improve this paper, especially with respect to interpretation of estimated parameters and elaboration of managerial implications. I am indebted to the German Research Foundation DFG for financial support under grant HR 11/4-1.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Harald Hruschka.

Appendices

Appendix 1: Overview of MCMC Simulation

Clusters are indexed as \(c = 1, \ldots , C\) with C as the number of clusters currently formed. I and \(I_{-i,c}\) denote the number of households and the number of households of cluster c without household i, respectively. Following existing practice we set \(\alpha =1\) (Neal 2000). One MCMC iteration consists of the following steps which correspond to algorithm 7 of Neal (2000):

  1. 1.

    For each cluster:

    1. (a)

      Draw new samples of coefficients from their respective prior distributions (see “Appendix 2”) and calculate the corresponding likelihood.

    2. (b)

      Decide whether to form a new segment \(c_i^*\) with probability

      $$\begin{aligned} min \left[ 1,\frac{\alpha L_i^*}{(I-1) L_i}\right] \end{aligned}$$
  2. 2.

    For each singleton:

    1. (a)

      Decide on allocating the singleton to one of the clusters with probability proportional to

      $$\begin{aligned} \frac{I_{-i,c}}{I-1} \end{aligned}$$
    2. (b)

      Calculate the corresponding likelihood and place the singleton into the new cluster \(c_i^*\) with probability proportional to

      $$\begin{aligned} min \left[ 1,\frac{(I-1) L_i^*}{\alpha L_i} \right] \end{aligned}$$
  3. 3.

    Decide on allocating each customer belonging to a cluster to one of the other clusters with probabilities proportional to

    $$\begin{aligned} g \frac{I_{-i,c}}{I-1} L_i \end{aligned}$$

    where g serves as normalizing constant.

  4. 4.

    Estimate parameters for each cluster as described in “Appendix 2”.

On the level of stochastic utilities the multivariate probit model corresponds to a seemingly unrelated regression model (Chib and Greenberg 1998). That is why the likelihoods of steps 1–3 are based on latent residuals of the multivariate probit model which are defined as \(e_{jit} = U_{jit} - V_{jit}\) and depend on parameter values assumed. The residual vector of household i for basket t is \(e_{it} = (e_{1it}, e_{2i1t},\ldots , e_{Jit})^{\prime}\). (JJ) matrix \(A_{i}\) of residual cross products across all baskets of household i is given by:

$$\begin{aligned} A_{i} = \sum _{t=1}^{T_i} e_{it} e_{it}^{\prime} \end{aligned}$$
(3)

Finally we obtain as likelihood \(L_i\) for household i (see, e.g., the analysis of the seemingly unrelated regression model in Zellner 1971 or Greene 2003):

$$\begin{aligned} L_i = |\varSigma |^{-T_i/2} \exp {\left( -\frac{1}{2} tr (A_{i} \varSigma ^{-1}) \right) } \end{aligned}$$
(4)

Appendix 2: Clusterwise parameter sampling

Estimation steps described here concern all households which in the current iteration are assigned to segment c, i.e., all i \(\in\) c. Note that we suppress the suffix c in this appendix to simplify notation.

Estimation of the multivariate probit model is based on the Bayesian analysis of Zellner (1971) for the seemingly unrelated regression model and the sampling of latent variables due to Albert and Chib (1993) as well as Chib and Greenberg (1998). Sampling of the correlation matrix of latent residuals in steps 3–5 draws upon Liu and Daniels (2006). Sampling of individual coefficients works similar to the approach developed by Banerjee et al. (2008) for the seemingly unrelated regression model.

Stochastic utilities are multivariate normally distributed with mean vector \(V_{it}\) containing deterministic utilities \(V_{1it}, \ldots V_{Jit},\) and covariance matrix \(\varSigma\). Therefore the conditional distribution of the stochastic utility of category j given the remaining \(J-1\) categories is univariate normal with conditional mean \(V_{jit|-j}\) and conditional variance \(\sigma _{-j}\) as parameters:

$$\begin{aligned} V_{jit|-j} = V_{jit} + \sigma _{s -k} \varSigma _{s -k}^{-1} (V_{-kit} - \mu _{-jit}), \quad \sigma _{-j} = \sigma _{j,j} - \sigma _{-j} \varSigma _{-j}^{-1} \sigma _{-j}^{\prime} \end{aligned}$$
(5)

\(V_{-kit}\) corresponds to vector \(V_{it}\) without element k. \(\sigma _{-j}\) denotes the vector containing the residual covariances of category j with the other \(J-1\) categories. \(\varSigma _{-j}\) corresponds to \(\varSigma\) without row and column j.

\({\mathcal {N}}(V, \sigma )\) denotes the univariate normal distribution with expectation \(\mu\) and variance \(\sigma\). \({\mathcal {TN}}^{+}(V, \sigma )\) and \({\mathcal {TN}}^{-}(V, \sigma )\) denote univariate normal distributions truncated to positive and negative values, respectively. \({\mathcal {IW}} (\nu ,S)\) denotes the inverse Wishart distribution with degrees of freedom \(\nu\) and symmetric, nonsingular matrix S.

Coefficients are collected in a (P, J) matrix \(\beta\) with \(P=1+S+2J\) denoting the number of coefficients, i.e., a constant, S coefficients for sociodemographic variables, and 2J coefficients if own effects and cross effects of marketing variables as given in Eq. (1). T symbolizes the total number of baskets of all the households currently assigned to the cluster. (TP) matrix X holds the P predictors including the constant, (TJ) matrix V the stochastic utilities of all baskets and categories.

Initially all coefficients are set to zero and \(\varSigma\) to a diagonal correlation matrix. Each coefficient has the prior \(\mathcal {N}(0,\underline{v})\) with \(\underline{v}= 4\). Estimation steps are as follows:

  1. 1.

    For all baskets \(t=1,\ldots ,T\) and categories \(j=1,\ldots ,J\)

    $$\begin{aligned} U_{ji} \sim \left\{ \begin{array}{ll} \mathcal {TN}^{+}(V_{jit|-j}, \sigma _{-k}) &\quad \text{ if } \quad y_{jit}=1 \\ {\mathcal {TN}}^{-}(V_{jit|-j}, \sigma _{-k}) &\quad \text{ else } \end{array} \right. \end{aligned}$$
  2. 2.

    Compute matrix of cross products of latent residuals \(A=\sum _i A_{i}\)

  3. 3.

    Sample covariance matrix \(\varSigma\) from \(\mathcal {IW} (T-J-1,A)\)

  4. 4.

    Compute proposal correlation matrix \(\tilde{R} = D \varSigma D\) where D is a (JJ) diagonal matrix with elements \(d_{j,j} = \sigma _{k,k}^{-1/2}\) (\(\sigma _{k,k}\) denotes the element of \(\varSigma\) in row and column k)

  5. 5.

    Accept \(\tilde{R}\) with probability \(min(1, \exp {(0.5 (J+1) (\log {|\tilde{R}|}-\log {|\varSigma _{0}|})}))\) with \(\varSigma _{0}\) as correlation matrix of the previous iteration

  6. 6.

    Set \(\varSigma = R\) and compute its inverse \(H=\varSigma ^{-1}\)

  7. 7.

    Sample coefficients \(\beta _{pj}\) for predictors \(p = 1,\ldots ,P\) and categories \(j =1, \ldots ,J\):

    $$\begin{aligned} \beta _{pj} \sim \mathcal {N} (m,v) \quad \text{ with } \quad v = 1/(1/\underline{v} + h_{jj} x_p' x_p) \quad \text{ and } \quad m = h_j' (V - X \beta _0)' \, x_p \, v \\ \end{aligned}$$

    \(h_{jj}\) is the element in row j and column j of H. \(h_j\) and \(x_p\) denote columns j and p of H and X, respectively. \(\beta _0\) equals \(\beta\) except for a zero value in row p and column j.

Appendix: 3 Significant differences of category constants and coefficients

The following table contains all the differences of category constants and coefficients between the full and the restricted model whose respective 95 % probability intervals only include either positive or negative values (Table 11).

Table 11 Significant differences of category constants and coefficients

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hruschka, H. Multi-category purchase incidences with marketing cross effects. Rev Manag Sci 11, 443–469 (2017). https://doi.org/10.1007/s11846-016-0193-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11846-016-0193-0

Keywords

JEL Classification

Navigation