1 Introduction

A regressor is endogenous if it is correlated with the unobserved residual of a model. Ignoring endogeneity may lead to a bias, according to which the estimated coefficient of the regressor differs from its true value. Wooldridge (2010) distinguishes four sources of endogeneity, namely variable omission, measurement error, selection, and simultaneity.

These sources generate bias under the following conditions:

  • the omitted variable is related to both the regressor and the dependent variable.

  • the dependent variable also affects the regressor (simultaneity.

  • the error in measuring the regressor is correlated with the dependent variable;

  • the selection into a sample is not random, or the assignment of a treatment (e.g., the assignment of an ad to households) is not random.

Here we deal with the omitted variable bias that arises if firms set marketing variables considering factors (demand shocks) that researchers do not observe. Such unobserved factors are not included in the model, they are incorporated into the structural error of the model. Biased estimates generated this way prevent the inference of valid causal effects, e.g., changes of sales or choice probabilities produced by changes of marketing variables (Papies et al. 2017). The magnitude and direction of the endogeneity bias depend on the way managers react to unobserved demand shocks. The sign of a bias corresponds to the sign of the correlation between the respective marketing variable and the excluded factor (Papies et al. 2017). Therefore, a positive (negative) bias of a marketing variable’s coefficient implies that the marketing variable and the unobserved demand shock are positively (negatively) correlated, i.e., that managers raise (decrease) the marketing variable in the case of a positive shock in demand.

The most widespread approach correcting for endogeneity uses observed instrumental variables. Ideally, an instrumental variable should be correlated with the marketing variable and should be uncorrelated with the error term of the respective model. Another approach adds a supply-side model explaining observed realizations of the endogenous variables. Typically, the supply-side model is based on either prior knowledge or a theory of the decision-making process of a firm. Obviously, this approach is prone to misspecification bias due to erroneous assumptions on decision-making. Instrument-free alternatives such as the higher moments approach (Lewbel 1997), the heteroskedastic errors approach (Lewbel 2012), the latent instrument variable approach (Ebbes et al. 2005), and the Gaussian copula approach (Park and Gupta 2012) do not require observed instrumental variables or knowledge of the decision-making process.

For sales response and brand choice models, we can find several papers that deal with endogeneity [examples are Andrews and Ebbes 2014; Villas-Boas and Winer 1999; Petrin and Train 1999] by using instrumental variables or adding a supply-side model (Besanko et al. 1998). Publications using instrument-free methods focus on sales response data (Ebbes et al. 2009; Hruschka and Gerhardt 2012; Park and Gupta 2012; Elshiewy and Boztuğ 2018; Atefi et al. 2018; Keller et al. 2019; Yang et al. 2022). Datta et al. (2015) consider endogeneity of marketing variables in a binomial choice model by an instrument-free approach. Whereas the publications mentioned so far are restricted to one brand or one product category, Datta et al. (2017) include an instrument-free approach to deal with endogeneity in a market share model for 25 categories.

Multicategory choice models allow pick-any choices, i.e., households may purchase multiple product categories at the same occasion. Multinomial choice models, on the other hand, are appropriate only if households choose exactly one category at each purchase occasion. The two dominant functional forms of multicategory choice models are the multivariate logit (Russell and Petersen 2000; Boztuğ and Hildebrandt 2008; Boztuğ and Reutterer 2008; Dippold and Hruschka 2013; Aurier and Mejia 2014; Hruschka 2021; Richards et al. 2018; Solnet et al. 2016) and the multivariate probit (Chib et al. 2002; Duvvuri et al. 2007; Manchanda et al. 1999; Hruschka 2017; Aurier and Mejia 2014; Xia et al. 2019).

In contrast to single category models, which may lead to biased conclusions on the effects on consumer behavior (Seetharaman et al. 2005), multicategory models like the multivariate logit and the multivariate probit model allow for interactions between purchases of different categories. A positive interaction exists if the purchase of category A increases the purchase probability of another category B. For example, the purchase of snacks could increase the purchase probability of beverages. In a negative interaction, on the other hand, the purchase of category A decreases the purchase probability of another category B (e.g., the purchase of cold cereal could decrease the purchase probability of beer).

Such interactions turn out to be especially important for promotion, assortment, and store layout decisions. Considering interactions is a prerequisite evaluating the effects of promotions across a retailer’s assortment (Russell and Petersen 2000). By using information on interactions, management avoids the error of eliminating low-profit categories though the latter have strong positive interactions with other, high-profit categories (Boztuğ and Silberhorn 2006). Placing categories with positive interactions close to each other in store layouts increases total sales (Boztuğ and Silberhorn 2006).

Research on multicategory choice models, as a rule, leaves out endogeneity concerns. We are aware of only one exception, Richards et al. (2018) who tackle endogeneity by adding a supply-side model to a multivariate logit model for choices in four categories. Let us put forward two possible reasons for this research gap with respect to multicategory choice models.

Maybe some researchers think that the endogeneity of marketing variables is a lesser problem at the category level. Such a consideration can be justified if the following two conditions are both valid. One condition requires that a firm’s managers, who respond to a shock at the category level, do not set marketing variables of most brands that belong to the category in the same way. According to the other condition, endogeneity only occurs for low share brands or becomes low because random shocks affect marketing variables of brands in different directions with appropriate positive and negative correlations. We doubt that these two conditions apply in general.

Empirical evidence for the endogeneity of marketing variables at the category level is rare, with the exception of Park and Gupta (2012) and Richards et al. (2018), who analyze category sales and purchase incidence data, respectively. The demand surge of toilet paper and pasta during the first weeks of the COVID-19 crisis in Germany offers an example, as retail managers reduced sales promotion activities in these categories (Consumer Index 2020). Such circumstances lead to negative omitted variable biases, because positive demand shocks are negatively correlated with sales promotion activities.

Another reason for this research gap may be the fact that the number of endogenous marketing variables is much higher compared to sales response or brand choice models. As a rule, about seven endogenous marketing variables are investigated in brand choice models. The usual number of endogenous marketing variables in sales response models is even lower. Quite contrary, we analyze 29 endogenous marketing variables in our empirical study. A high number of marketing variables makes it difficult to either find appropriate instrumental variables or develop a supply-side model. That is why we turn to the instrument-free Gaussian copula-correction method of Park and Gupta (2012) that under certain assumptions reproduces the correlation between marketing variables and the error term.

For models without intercept Park and Gupta (2012) demonstrate their method’s robustness if the distribution of structural errors is mispecified. On the other hand, it is well known that the majority of models in marketing encompass intercepts. For such models the recent simulations of Becker et al. (2022) show that the Gaussian copula-correction method is sensitive to deviations of the structural error terms from normality and the Gaussian correlation structure. According to the results of Becker et al. (2022), normality of the error terms can be reliably assessed based on regression residuals. Nonetheless, we have to emphasize that the Gaussian correlation structure of the error term with any endogeneous variable remains as untestable assumption.

In the next section, we present the homogeneous multivariate logit model. We also show how the endogeneity of marketing variables can be taken into account, based on a two-step Gaussian copula approach developed for the individual-level mixed logit model. Subsequently, we explain our estimation approach. We also describe how we investigate the difference of the effects of marketing variables between a model with coefficients corrected by the copula approach and another model with uncorrected coefficients. The empirical part of our paper starts by characterizing the data set. The following section compares to the performance of a related model that excludes interactions between purchases of product categories. We then discuss coefficients of interactions, of category-specific marketing variables and of their copula correction terms. Assuming a Gaussian correlation structure the significance of the latter group of coefficients indicates that category-specific marketing variables are endogenous. In this case, the sign of a copula correction coefficient shows whether marketing variables are subject to positive or negative omitted variable biases. We also examine to what extent the testable requirements of the Gaussian copula correction method are fulfilled. Afterwards, we discuss managerial implications. In the concluding section, we summarize the main results, mention further application areas, and discuss the limitations of our approach together with corresponding model extensions.

2 Models

In this section, we present the homogeneous multivariate (MVL) model. Then we show how endogeneity of marketing variables can be considered by the Gaussian copula approach and explain our estimation approach.

J column vector \(y_{mt}\) denotes market basket t of household m and consists of binary purchase indicators (J symbolizes the number of product categories). If household m purchases category j at purchase occasion t, the respective element \(y_{jmt}\) equals one. Vector \(x_{mt}\) consists of regressors relevant for the market basket t of household m. In our study, these regressors consist of category loyalties and one category-specific marketing variable.

We compute the loyalty of household m for category j in market basket t in analogy to exponentially smoothed brand loyalties (Guadagni and Little 1983):

$$\begin{aligned} loy_{jmt} = \alpha \, y_{jmt-1} + (1-\alpha ) \, loy_{jmt-1} \end{aligned}$$
(1)

\(0\le \alpha \le 1\) denotes the smoothing constant. The binary purchase incidence \(y_{jmt-1}\) equals one, if household m purchases category j at the previous purchase occasion \(t-1\). The current category loyalty depends on the previous purchase incidence \(y_{jmt-1}\) and the previous loyalty \(loy_{jmt-1}\). In a manner similar to the brand loyalty of Guadagni and Little (1983) we set initial values \(loy_{jm0}\) equal to the relative purchase frequency of the respective category j across all households and shopping visits (\(t=1\) denotes the first shopping visit). The lower the smoothing constant \(\alpha\) is, the less the loyalty variable reflects fluctuating purchases.

2.1 Homogeneous multivariate logit model

In the homogeneous MVL model, each coefficient is constant across households. Extending the expression for the homogeneous MVL model without regressors (also known as auto-logistic model) given in Besag (1972) we define the probability of market basket \(y_{mt}\) conditional on regressors \(x_{mt}\) as follows:

$$\begin{aligned}{} & {} \exp (y_{mt}' a + x_{mt}' B \, y_{mt} + 1/2 \, y_{tm}' V y_{mt}) /C \nonumber \\{} & {} \quad \text{ with } \quad C = \sum _{\upsilon \in \{0,1\}^J} \exp (\upsilon ' a + x_{mt}' B\, \upsilon + 1/2 \, \upsilon ' V \upsilon ) \end{aligned}$$
(2)

Expression (2) shows that computation of this probability requires division by the so-called normalization constant C that is obtained by summing over all possible market baskets defined by different binary vectors \(\upsilon\). Coefficients contained in the (JJ) matrix V measure pairwise interactions between categories. As a pairwise interaction of a category with itself does not make sense, all diagonal elements of V are zero. Off-diagonal elements are symmetric, i.e., \(V_{j1,j2} = V_{j2,j1}\). Column vector a consists of J constants. The (KJ) matrix B holds the effect of K regressors on purchase probabilities. The homogeneous MVL model has been applied to market basket data by Russell and Petersen (2000) building upon earlier publications in statistics (Cox 1972; Besag 1974).

For the homogeneous MVL model we can write the purchase probability of category j in market basket t of household m conditional on purchases of the other categories collected in vector \(y_{-jtm}\), the category-specific marketing variable \(mvar_{jt}\), and the category-specific loyalty \(loy_{jmt}\) as:

$$\begin{aligned}{} & {} P(y_{jmt}=1 \vert y_{-jmt}, x_{mt}) = \varphi (Z_{jmt}) \nonumber \\{} & {} \quad \text{ with }\quad Z_{jmt} = a_{j} + b_{j} mvar_{jt} + c_j loy_{jmt} + \sum _{l \ne j } V_ {j,l} \, y_{lmt} \end{aligned}$$
(3)

\(\varphi (Z)\) denotes the binomial logistic function \(1/(1+exp(-Z))\). \(Z_{jmt}\) can be interpreted as latent variable referring to category j and market basket t of household m.

Maximum likelihood estimation of the MVL model requires computation of the so-called normalization constant obtained by summing over all possible market baskets (see expression (2)) in each iteration. For the 29 categories in our study, we would have to deal with more than \(5.36 \times 10^{8}\) possible market baskets. Maximum pseudo-likelihood (MPL) estimation (Bel et al. 2018) offers a viable alternative maximizing the log pseudo-likelihood LPL across households, market baskets and categories:

$$\begin{aligned} LPL = \sum _{m=1}^M \sum _{t=1}^{T_m} \sum _{j=1}^J \log (\tilde{P}_{jmt}) \end{aligned}$$
(4)

\(T_m\) symbolizes the number of market baskets of household m, \(\tilde{P}_{jmt}\) the pseudo-probability of a (non) purchase of category j in market basket t of household m. Summing logarithmic pseudo-probabilities across product categories makes MPL estimation feasible as it replaces summing across all possible baskets, which would be necessary in maximum likelihood estimation. The pseudo-probability \(\tilde{P}_{jmt}\) can be written as:

$$\begin{aligned} \tilde{P}_{jmt} = P(y_{jmt}=1 \vert y_{-jmt}, x_{mt})^{y_{jmt}} \, (1- P(y_{jmt}=1 \vert y_{-jmt}, x_{mt})) ^{1-y_{jmt}} \end{aligned}$$
(5)

Expression (3) shows how to compute the conditional probability \(P(y_{jmt}=1 \vert y_{-jmt}, x_{mt})\) for the homogeneous MVL model. \(y_{jmt}\) denotes the binary purchase indicator, which is set to one if basket t of household m contains category j. One can see from Eq. (5) that its first part is relevant if category j is purchased and its second part if category j is not purchased. In a nutshell, LPL estimation looks at J different binomial logit models representing conditional probabilities.

2.2 Taking endogeneity into account

We treat the category-specific marketing variables as potentially endogenous regressors and consider the category loyalties to be exogenous. Our approach is based on the two-step estimation method that (Park and Gupta 2012) develop for the random coefficient multinomial logit model for individual level data. In a simulation generating data for the random coefficient multinomial logit model, Park and Gupta (2012) demonstrate that under certain assumptions this two-step estimation method provides unbiased coefficients. In contrast to other models dealt with in their simulations, Park and Gupta (2012) include brand constants (which are analogous to intercepts) in the random coefficient multinomial logit model. On the other hand, Park and Gupta (2012) do not investigate whether their method is robust with respect to violations of the assumed Gaussian correlation structure.

In accordance with Sect. 2.1 we replace the random coefficient multinomial logit model by J different random coefficient binomial logit models, one for each product category. To simplify matters, we speak of the random coefficient multivariate logit model (RMVL) in the following to denote this set of J random coefficient binomial logit models.

We specify the latent variable for category j as follows:

$$\begin{aligned} Z_{jmt} &= (\bar{a}_{j} + \bar{b}_{j} mvar_{jt} + \xi _{jt}) + a_{jm} + b_{jm} mvar_{jt} + c_{j} loy_{jt} \nonumber \\ & \quad +\sum _{l \ne j } V_ {j,l} \, y_{lmt} + \epsilon _{mjt} \nonumber \\= & {} \delta _{jt} + a_{jm} + b_{jm} mvar_{jt} + c_{j} loy_{jt} + \sum _{l \ne j } V_ {j,l} \, y_{lmt} + \epsilon _{mjt} \end{aligned}$$
(6)

\(\bar{a}_{j}\) and \(\bar{b}_{j}\) denote average coefficients, \(a_{jm}\) and \(b_{jm}\) their random deviations. Correlation of the structural error \(\xi _{jt}\) with the endogenous marketing variable leads to the endogeneity problem. \(\epsilon _{mjt}\) are Gumbel distributed errors.

In the first step we estimate random deviations \(a_{jm}\) and \(b_{jm}\) together with fixed loyalty coefficients \(c_j\), fixed interaction coefficients \(V_ {j,l}\), and fixed coefficients \(\delta _{jt}\). The latter are specific to the category j and week t. First step estimates of \(\delta _{jt}\) can be written as \(\hat{\delta }_{jt} = \delta _{jt} + \varsigma _{jt}\) with \(\varsigma _{jt}\) being asymptotically normal.

In the second step we regress these estimates on the marketing variable \(mvar_{jt}\) of category j and its Gaussian copula correction term \(mvar^{*}_{jt}\):

$$\begin{aligned} \hat{\delta }_{jt} = \bar{a}_{j} + \bar{b}_{j} mvar_{jt} + d_j mvar^{*}_{jt} \end{aligned}$$
(7)

The Gaussian copula directly correlates the error term of a structural equation with a non-normally distributed endogenous regressor. This way, the endogenous regressor is treated as a random variable from any (non-normal) marginal population distribution.

In the Gaussian copula approach computation of the correction term of an endogenous regressor starts with its empirical cumulative distribution function. The value of the empirical cumulative distribution function \(F(mvar_{j't'})\) of marketing variable for category j and basket \(t'\) is (Papadopoulos 2022):

$$\begin{aligned} F(mvar_{j't'}) = 1/\left( 1+ \sum _{m=1}^M T_m\right) \sum _{m=1}^M \sum _{t=1}^{T_m} I\{mvar_{jt} \le mvar_{j't'}\} \end{aligned}$$
(8)

The indicator function I returns one if the condition inside the parentheses is fulfilled, zero otherwise. Therefore \(F(mvar_{jm't'})\) equals the number of baskets in which the marketing variable for category j is less equal to \(mvar_{jm't'}\) divided by the total number of baskets \(\sum _{m=1}^M T_m\) plus one. Finally, the correction term \(mvar^{*}_{jm't'}\) results from inserting the value of the empirical cumulative distribution function into the inverse normal cumulative distribution function \(\Phi ^{-1}\):

$$\begin{aligned} mvar^{*}_{j't'} = \Phi ^{-1}(F(mvar_{j't'})) \end{aligned}$$
(9)

Park and Gupta (2012) demonstrate that multiple correction terms have to be included in the case of multiple endogenous regressors, i.e., one term for each regressor. Each of these terms is computed separately based on its empirical cumulative distribution function and the inverse normal cumulative distribution function according to Eqs. (8) and (9).

The copula approach requires that endogenous regressors are not (too) normal to make sure that variation due to the endogenous regressor and variation due to the structural error can be separated. Consistent estimates of average coefficients \(\bar{a}_{j}\) and \(\bar{b}_{j}\) by the second step regression (7) require non-normality of the endogenous variable as well as normality of error terms (Becker et al. 2022; Eckert and Hohberger 2022).

Based on extensive simulations, Becker et al. (2022) propose to assume nonnormality of a variable if its absolute skewness is greater than 0.8 and a nonnormality test provides a conservative (i.e., very low) p-value. Becker et al. (2022) also state that the error term’s normality can be checked using the residuals of the second stage regression without the copula correction term.

According to the results of Eckert and Hohberger (2022) significance of the copula correction term in the second stage regression is indicative of endogeneity only when the assumption a Gaussian correlation structure is met. In this case, a positive (negative) significant coefficient shows that the correlation of the respective marketing variable and the excluded variable is positive (negative). Consequently, a positive (negative) omitted variable bias results, as explained in Sect. 1.

The copula approach also implicitly assumes that all exogenous regressors are uncorrelated with the linear combination of copula transformations of endogenous regressors (Yang et al. 2022). In our case, these linear combinations correspond to \(d_j mvar^{*}_{jt}\). To tackle such correlations Yang et al. (2022) develop a two-stage copula endogeneity correction method that consists of adding residuals from regressing copula data for each endogenous regressor on copula data for the exogenous regressors. In a situation with noticeable correlations, the two-stage copula correction method should be used.

2.3 Estimation

We exclude the null basket for which all purchase indicators \(y_j\) equal zero in accordance with previous related publications (Russell and Petersen 2000; Boztuğ and Reutterer 2008; Kwak et al. 2015). This way, we model purchases conditional on the purchase of at least one category. Therefore, the number of possible market baskets is \(2^J-1\).

We estimate the RMVL model by maximum simulated pseudo-likelihood using Halton draws and normal mixing distributions for the random coefficients (Train 2003). The coefficients of the second state regression models are estimated by least squares. We do not base coefficients’ tests at this stage on conventional standard errors, which are incorrect for models with copula correction terms because the latter are estimated quantities. Instead, we compute the standard error of any coefficient as standard deviation of its estimates across 500 bootstrap samples (Papies et al. 2017). Each bootstrap sample has \(\sum _{m=1}^M T_m\) observations, which are drawn with replacement from the original market baskets.

3 Derivation of managerial implications

We investigate whether taking the endogeneity of marketing variables into account leads to different managerial implications. We consider the decision problem of setting a marketing variable. This decision depends on the effect a marketing variable has on purchases of the corresponding category itself, the so-called own effect, as well as on cross effects, i.e., the effects on purchases of other categories.

As our estimation approach provides pseudo-probabilities, we cannot directly determine purchase probabilities and have to resort to simulation. We generate simulated purchases by iterated Gibbs-sampling from the conditional distributions (Besag 2004). The computation times of Gibbs sampling for each observed market basket would be prohibitively high. Therefore, we cluster market baskets by K-means with the category loyalties of each basket \(loy_{jmt}, j=1,\ldots ,J\) as clustering variables.

K-means assigns each market basket to exactly one of C clusters. These assignments can be indicated by binary variables \(u_{cmt}\). \(u_{cmt}\) is one, if basket t of household m belongs to cluster c. The size \(s_c\) of each cluster c is the sum of its assignments, \(s_c = \sum _{m}^M \sum _{t}^{T_m} u_{cmt}\). The average loyalty in each cluster equals \(1/s_c \, \sum _{m}^M \sum _{t}^{T_m} u_{cmt} \, loy_{jmt}\). We obtain cluster-specific purchase probabilities as averages of simulated purchases, which we generate by iterated Gibbs-sampling from the following cluster-specific conditional distribution:

$$\begin{aligned}{} & {} P(y_{jc}=1 \vert y_{-jc}, x_{c}) = \varphi (Z_{jc}) \end{aligned}$$

Average cluster-specific loyalties are part of the regressors \(x_{c}\). \(Z_{jc}\) denotes the latent variable that is specified in analogy to expression (6). Finally, we compute the total purchase probability \(P(y_{j}=1)\) of each category j as weighted average of cluster-specific purchase probabilities \(1/(\sum _m^M T_m) \sum _c^C s_c P(y_{jc}=1 \vert y_{-jc}, x_{c})\).

In the first run of our simulation approach, we set the marketing variable of a category k to a low value of and estimate the total purchase probability of category j. Then we estimate the total purchase probability of category j by setting the marketing variable of category k to a high value. In both cases, we fix the marketing variables for categories other than category k at their observed average values. The difference of probability of the category for the marketing variable at the high value and the probability for the marketing variable at the low value gives the change of probability of category j that occurs with the given change of the marketing variable of category k:

$$\begin{aligned} \Delta P(y_{j}=1) = P(y_{j}=1 \vert mvar_{k}=high) - P(y_{j}=1 \vert mvar_{k}=low) \end{aligned}$$
(10)

Note that the purchase probabilities in Eq. (10) are only conditional on the marketing variable of category k, but not conditional on purchases of categories other than j.

In addition, we compute arc elasticities by dividing the relative change of purchase probability by the relative change of the marketing variable (Allen 1934):

$$\begin{aligned} \frac{\Delta P(y_{j}=1)}{(P(y_{j}=1 \vert \vert mvar_{k}=high) + P(y_{j}=1 \vert mvar_{k}=low))/2} \, \frac{(high + low)/2}{high - low} \end{aligned}$$
(11)

Probability changes and elasticities measure own effects if \(j=k\), they measure cross effects if \(j \ne k\).

4 Empirical study

4.1 Data

Our data refer to 24,047 shopping visits to one specific grocery store over a one-year period made by a random sample of 1500 households. For each shopping visit, we compose a market basket from the IRI data set (Bronnenberg et al. 2008). We represent a market basket by a binary vector whose elements indicate whether a household purchases each of 31 product categories (see Table 1). The average number of shopping visits per household amounts to 16.031, its standard deviation is 13.464. The average basket size (i.e., the number of purchased categories) is 3.852, and its standard deviation 2.654.

Table 1 Product categories and abbreviations

Table 2 shows relative marginal purchase frequencies for the 31 categories, and Table 3 the highest 20 pairwise relative frequencies. Milk is the category most frequently purchased. Carbonated beverages and milk are the two categories most frequently purchased together.

Table 2 Relative marginal frequencies
Table 3 Relative pairwise frequencies

The data include the category-specific marketing variables displays, features, and price reductions. These marketing variables Displays are set up in stores. Frequently found variants of displays are front-walls, end-caps, wings, or in-aisle gondolas (Neslin 2002). Products are typically featured by newspaper ads or inserts. These marketing variables are defined as weekly market share-weighted averages of UPC level variables in the respective category and are shares taking values between zero and one.

Displays, features, and price reductions are promotional activities intended to increase sales of a store (Little 1998). Therefore, we expect that average coefficients for these marketing variables are positive. However, estimation of the RMVL model with these marketing variables produces several negative average coefficients. Multicollinearity, reflected by high variance inflation factors for two marketing variables, is responsible for these negative coefficients. Specifically, we obtain variance inflation factors greater than 5.0 in 23 and 22 categories for features and price reductions, respectively [a threshold of 5.0 corresponding to a R-squared value of 0.80 is recommended by Hocking and Pendelton (1983)]. On the other hand, for displays such a high variance inflation factor occurs in one category only. These results are congruent with the statement of Blattberg and Neslin (1990) that “multicollinearity is endemic in promotion regression models”.

Negative average coefficients of features and price reductions run counter to theoretical expectations. They prevent both the comparison of uncorrected to copula-corrected coefficients and the derivation of plausible managerial implications. Therefore, we exclude features and price reductions from the model and only consider the marketing variable display as is least affected by multicollinearity. Table 4 shows the share of displays averaged across all market baskets. We see that carbonated beverages have the highest number of displays, whereas no displays occur for the two categories cigarettes and sugar substitutes. For this reason, we do not consider these two categories in the following analyses.

Table 4 also contains the average loyalty across all market baskets for each category for a smoothing constant \(\alpha =0.2\), which puts more weight on the loyalty of the previous shopping visit. This value of the smoothing constant leads to the best performing RMVL model with category loyalty according to a grid search over \([0.1, 0.2, 0.3, \ldots , 0.9]\). Given such a value, previous purchases are strongly smoothed. Milk attains the highest category loyalty.

Table 4 Average values of displays and category Loyalties

4.2 Estimation results

We compare the RMVL model defined in Sect. 2.2 to the related, less complex random coefficient independent logit model that excludes category interactions. We compute the log pseudo-likelihoods of these two models and determine AIC and BIC values to take their different number of parameters into account. Interactions turn out to be important because the RMVL is clearly superior with respect to both AIC and BIC values (see Table 5). That is why we only consider the RMVL in subsequent analyses.

Table 5 Evaluation of random coefficient logit models

Out of a total of 406 interaction terms, 190, i.e., about 47%, are significant. Of these 190 interaction terms, 188 are positive, and only two are negative (photo and yogurt, beer and cold cereal). Table 6 shows the 20 interaction coefficients with the highest t-values. In these interactions, both categories belong either to the non-food assortment (e.g., paper towels and toilet tissue, tooth brush and toothpaste, facial tissue and toilet tissue) or to the food assortment (e.g., carbonated beverages and salted snacks, mayonnaise and mustard/ketchup, cold cereal and yogurt).

Table 6 Pairwise interaction coefficients

In the following we present the significant average display coefficients of the RMVL, which arise in all but the two categories cigarettes and sugar substitutes. These significant coefficients are all positive, i.e., more displays increase the pseudo-probability of a purchase in the respective category (see Table 7).

Table 7 Significant average coefficients for displays

The second stage regressions provide nine significant coefficients of the copula correction term for displays (see Table 8). We also assess to what extent the testable requirements of the Gaussian copula correction mentioned in Sect. 2.2 are fulfilled. We obtain evidence for the non-normality of the display variables. In each of the 29 categories, displays attain absolute skewnesses greater than 0.8. For 27 categories, the Shapiro–Wilks tests of normality provide a p-value less than 0.10. We especially see very low p-values not greater than 0.0078, for the nine categories with significant copula correction terms.

We also investigate the residuals of the second stage regression without copula correction terms for these nine categories to assess the normality of error terms. Across these nine categories, the highest absolute skewness amounts to 0.373, and the lowest p-value of the Shapiro-Wilks test is 0.141. Therefore, we conclude that their error terms are normal. In one accepts the assumption of a Gaussian correlation structure, these results for the non-normality of displays and for the normality of error terms allow to infer endogeneity from significant coefficients of the correction terms.

Summing up we obtain evidence for the endogeneity of displays in about 31% of the 29 categories investigated. The majority of the nine significant coefficients of copula correction terms are positive. In other words, most of the nine display variables are positively correlated with the omitted variable. These correlations imply positive omitted variable biases, i.e., overestimation of display coefficients, if they are not corrected. Corrected display coefficients in Table 8) that are lower than their uncorrected counterparts reflect this fact.

One anonymous reviewer asked whether correlations of the endogenous variable display with one of the omitted marketing variables (feature, price reduction) might explain the sign of the omitted variable bias. As a rule, these correlations are positive in the investigated categories. For only two categories we obtain negative correlations with features, which are very low in absolute size. Therefore correlations with the two omitted marketing variables do not explain the signs of biases. Of course, the biases may be due to unknown factors that differ from the two omitted marketing variables.

Across all categories, the maximum correlation in absolute size of the linear combinations of the copula correction term with the exogenous regressor loyalty amounts to 0.061. In view of these low correlations, we do not apply the two-stage copula endogeneity correction method of Yang et al. (2022).

Table 8 Significant copula correction coefficients and display coefficients

4.3 Managerial implications

We now deal with the question of whether models M0 with uncorrected display coefficients and M1 with corrected display coefficients entail different managerial implications. We consider the number of purchases as managerial objective for displays in any category. As the number of purchases equals the sum of purchase probabilities across market baskets, we investigate how much probability changes and elasticities due to higher display activities differ between these models. Section 3 explains the computation of purchase probabilities, probability changes, and elasticities. Based on K-means for category loyalties, we choose six clusters. We alternatively set each category display variable to a high value of 0.25 and a low value of 0.05.

Table 9 shows the differences of probability changes and elasticities for purchases of the respective category of at least 0.01 in absolute size. We do not show differences for cross effects, i.e., effects on other categories, as these are all lower than 0.005 in absolute size. The signs of six significant copula correction term coefficients in Table 8 agree with the sign of the differences with respect to probability changes and elasticities in Table 9. Signs of the remaining three significant copula correction term coefficients agree with one of the two differences. To sum up, the differences between the two models are linked to the respective positive or negative omitted variable biases.

If the difference between models M0 and M1 is positive (negative) and managers rely on uncorrected display coefficients, they would choose a level of display activities in the respective category, which is too high (low). The majority of the differences in Table 9 are positive, i.e., display effects are often overestimated if uncorrected coefficients are used.

To assess the economic relevance of these differences we assume that a supermarket and a smaller convenience store attain 7000 and 13,000 weekly shopping visits, respectively. These numbers of weekly shopping visits are in accordance with published values (EHI 2017; Spar International 2020). In the case of a positive difference of the probability changes of 0.02, management would overestimate the number of weekly purchases in the displayed category by 140 and 260, respectively. Consequently, management would set display activities at a level that is too high. For a negative difference of − 0.02, management would underestimate the number of weekly purchases by the same amounts (140 and 260) and opt for a level of display activities that is too low.

Table 9 Probability changes and elasticities of displays

5 Conclusion

Under the assumption of a Gaussian correlation structure explained in Sect. 1, the two-step Gaussian copula approach indicates endogeneity of the marketing variable display in about 31% of the 29 categories investigated. The majority of display variables rated as endogenous are positively correlated with the omitted variable. Such positive correlations imply that ignoring endogeneity leads to overestimation of the coefficients of the respective display variables.

The own effects of increasing a promotion activity on purchases of the same category frequently differ between a model with display coefficients corrected by the Gaussian copula approach and the related model with uncorrected coefficients. On the other hand, both models agree on the size of the cross effects that display activities exert on purchases of other categories. In a similar manner to the results for model coefficients, the own effects are often overestimated if they rely on uncorrected display coefficients. Such overestimation tempts managers to set a level of display activities that is too high.

Future research efforts may investigate the endogeneity of marketing variables in multicategory choice models in other contexts than food retailing (e.g., consumer electronics, apparel). The fact that we only look at purchases constitutes a limitation of our study. One could also consider response variables such as store choice, purchase quantity, and brand choice. Of course, adding response variables would lead to more encompassing multicategory models, but also to higher model complexity.