1 Introduction

Recent technological advances and fierce competition have led many companies to expand the channel through which customers can purchase their products. In many product categories, customers have a broad range of channels to choose from, such as catalogs, call centers, Internet stores, apps for smartphones, and bricks-and-mortar stores.

Previous research has investigated what happens when customers start using more than a single channel (i.e., become multichannel). This research stream has identified two key consequences of multichannel usage. First, Shankar et al. [1] and Hitt and Frei [2] find that customers using Internet channel in addition to the traditional bricks-and-mortar channel are more loyal than customers who use a single channel. Sousa and Voss [3] explain the higher customer retention rates by increased coordination between channels: the coordination among channels increases customer satisfaction, which improves retention rates. Second, Thomas and Sullivan [4], Kumar and Rajkumar [5], Venkatesan et al. [6], Ansari et al. [7], and Kushwaha et al. [8] find that, on average, multichannel customers spend more than single-channel customers. This research stream, however, has measured customers’ spending using cross-section data and has not attempted to quantify the long-term effects of multichannel usage on household spending.

Blattberg et al. [9] point out that multichannel customers could buy more due to self-selection: a heavy user is more likely to be able to take advantage of the availability of several channels. In this argument, multichannel usage does not cause (or lead to) higher revenues; the heavy users self-select into using multiple channels. Thus, the direction of causality between increased overall spending and utilizing more than one channel is not clear.

This study empirically examines the long-term consequences of multichannel usage on consumers’ spending, while controlling for potential self-selection through a variety of statistical methods. To address the self-selection problem, we use five different panel data econometrics techniques (i.e., pooled OLS, fixed effects, first difference, lagged dependent variable, and Arellano-Bond generalized method of moments GMM, estimation). We also implement propensity score matching to create datasets consisting of matched pairs. Matching is a commonly used technique to cope with self-selection problems in social sciences [1013]. By artificially creating observationally equivalent control and treatment groups, propensity score matching enables us to compare single-channel and multichannel customers with similar observed characteristics.

In line with prior research, we find that multichannel customers spend more on average even when we control for the self-selection bias. However, we contribute to the literature by showing that this increased spending decays over time. Specifically, the difference between multi- and mono-channel customers’ spending disappears 3 years after mono-channel customers become multichannel customers. This result can be explained by novelty theory. According to this theory, customers derive value from learning new ways of doing things [14], but they return to their regular consumption patterns in the long run [1518]. Alternatively, this decay could be related to the type of the new sales channel that consumers begin to use. Blattberg et al. [9] posit that online channel usage in the long run leads customers to compare competitors’ products and prices and become more price-sensitive over time. Ansari et al. [7] provide empirical support for this argument.

The rest of the paper is organized as follows. We first describe relevant research and highlight our contributions. Next, we discuss the data and present the methodologies we use to analyze the data. We then present empirical results. We conclude with a discussion on managerial implications and directions for future research.

2 Related Literature and Contributions of the Paper

The main objective of our research is to examine how the spending patterns of multichannel customers evolve over time as compared to mono-channel customers. In this section, we review the pertinent literature and highlight our research contributions.

2.1 Impact of Multichannel Usage on Revenues

Most findings in the marketing literature suggest that multichannel customers purchase more than mono-channel customers (e.g., [4, 5, 7, 8, 19]). However, Thomas and Sullivan [4] point out that using any combination of two channels does not necessarily result in higher purchase volume than a single channel. For instance, consumers using catalog and online channels could buy (on average) less than customers using only a bricks-and-mortar store. However, these customers typically purchase more than only online or catalog channel customers.

Few scholars have examined the long-term consequences of multichannel usage on customers’ spending habits. Avery et al. [20] investigate the cannibalization and complementary effects of adding bricks-and-mortar stores to existing Internet and catalog channels. The authors find that opening retail stores reduces sales in the catalog channel in both short and long run. However, while new bricks-and-mortar stores cannibalize the sales in the online channel in the short term, they produce a complementary effect (i.e., increasing sales) in the long run. Pauwels and Neslin [21] find that catalog mailings enhance sales not only of the catalog channel but also of the online and retail store channels in both short and long run. All these researchers, however, report aggregate channel sales effects and do not distinguish between increases in sales stemming from newly acquired customers and those stemming from existing customers. In this paper, we use household-panel data to investigate the sales effect of multichannel behavior on existing customers over time.

2.2 Self-Selection

A heavy user is more likely to select a multichannel firm over a single-channel company. It is highly possible that customers using more than one sales channel are inherently different than customers who use a single channel. For example, Hitt and Frei [2] find that consumers who adopt online banking channel have always been more profitable. In contrast, Ansari et al. [7] find no differences in the spending of multichannel and mono-channel customers. This implies that self-selection problem may not be present in their case. As these results are contradictory, the literature can benefit from further work quantifying the relationship between customer spending and multichannel usage, while addressing the self-selection problem. In addition, while these papers have examined only the short-term effects of multichannel usage, we examine both short- and long-run impact of multichannel usage in this study.

2.3 Methodology

To address the self-selection problem, researchers typically use different panel data econometric techniques, instrumental variable estimation, or matching methods [22]. In this study, we estimate five different panel data econometrics models on “matched” data based on observed customer characteristics. To the best of our knowledge, this study is the first to combine a dynamic panel data model with the propensity score matching method.

Several papers in the social sciences rely on aggregate-level data (such as US Census data or demographics at zip code) to create matched pairs. These papers are criticized as some scholars (e.g., [23]) argue that zip code-level data do not provide sufficient information to construct functional matched pairs. Our household-level data enable us to create matched pairs based on both US Census and household-level data. We compare the estimation results using data generated by both matching approaches.

To summarize, we contribute to the marketing literature by examining both short- and long-term effects of multichannel usage on consumers’ spending. We minimize self-selection by using panel data econometric techniques and the propensity score matching method. That is, we run different panel data econometrics models on data consisting of matched pairs. Methodologically, our paper is the first study to use Arellano-Bond estimation in conjunction with propensity score matching. Further, we compare and contrast the estimation results obtained using matched pairs based on aggregate- and household level data.

3 Data

The data provider is a major retailer in the USA that sells durables and apparel in mature categories predominantly through catalog channel. The panel data are reported yearly at the household level, starting on January 1, 1997, and ending on December 31, 2001. The retailer introduced an online channel in 1996. However, the vast majority of customers (i.e., more than 99.9 %) did not try the Internet channel before 1997, when we begin to observe them.Footnote 1

The data include channel-specific sales amounts, marketing activities (catalogs and e-mails), and household-specific demographics. The catalog company also provides us with recency, frequency, and monetary value (RFM) measures for each household prior to the beginning of the data. We also create a set of dummy variables that capture each household’s unique multichannel shopping behavior. Specifically, year-of-multichannel ijt  = 1 if as of year t, customer i became multichannel j (where j = 1, …, 5) years ago; and year-of-multichannel ijt  = 0 otherwise. For instance, in the first year that household i starts using the online channel, first-year-of-multichannel i1t is 1, while other dummy variables, such as second-year-of-multichannel i2t and third-year-of-multichannel i3t are 0. In the next year, first-year-of-multichannel i1t is 0 and second-year-of-multichannel i2t is 1, indicating that the household is a second year multichannel user, and so forth.Footnote 2 Table 1 lists the variables we use and their operationalization.

Table 1 Operationalization of variables

The company has survey-based demographics data about their customers (Table 2 depicts households’ descriptive statistics across all the years we observe them). To add information regarding customers’ formal education level, we integrate data from the National Center for Environmental Health (NCEH) website into the primary dataset.Footnote 3 NCEH’s data provide the distribution of formal education levels for adults older than 25 years at zip code level. By calculating the percent of college-educated people within each zip code, we create a proxy for the probability that the household head has a college degree.

Table 2 Descriptive statistics

The main analysis focuses on customers who add the online channel to the incumbent catalog channel that they already use. Table 2 presents the sample means and standard deviations for the overall sample and multichannel and single-channel households. According to this table, multichannel users spend significantly more than mono-channel users. Moreover, multichannel customers are younger, are better educated, and have higher incomes and more children compared to single-channel users. In addition, multichannel households are more frequent buyers and are exposed to more marketing activities. This latter result suggests a potential endogeneity issue, which we address in our empirical analysis.

4 Empirical Analysis

In an ideal scenario, we would conduct an experiment to measure the long-term effects of multichannel usage on consumers’ total spending. This way, we would randomly assign a set of households to the treatment group (multichannel) and another set to the control group (mono-channel) and track their purchase behavior over time. Analyzing such experimental data would provide clean results that are free of bias stemming from the self-selection problem. Unfortunately, it is difficult to run such an experiment since it is impossible to force customers to use multiple channels. This explains why most empirical researchers rely on observational data to study multichannel issues. However, using observational data suffers from self-selection bias: the households who use multiple channels may be different from those who use a single channel from the beginning. For example, the multichannel households could differ in their usage level before becoming multichannel so that difference in purchase behavior cannot solely be attributed to being multichannel. This is quite possible in our data since customers with higher predispositions to become multichannel are more likely to be heavy users [9]. Econometrically, the self-selection bias induces a correlation between the customer decision to become multichannel and the model error term. Failure to control for this endogeneity problem will lead to biased empirical results.

There are several econometric approaches for addressing the self-selection problem, such as panel data econometrics, instrumental variable estimation, and matching methods [22]. In this section, we briefly review the panel data econometric models that we use to address the self-selection problem. We then discuss the propensity score matching method and explain how we implemented it in our study. As we do not have good instruments, we do not use instrumental variable estimation in our analysis.

4.1 Panel Data Econometrics Models

4.1.1 Pooled Ordinary Least Squares

We use pooled ordinary least squares (POLS) regression as a benchmark model. The strategy here is to include the variables that correlate with sales and the decision to become multichannel directly in the sales equation. As a result, these variables are not part of the error term. To reduce the self-selection bias in POLS, [24], chapter 3) suggest including customer demographics as independent variables. To control for period effects, we include year-specific dummy variables in the model, as we are not interested in the nonparametric relationship between multichannel usage and household spending. Let y it be household i’s total spending in year t in US dollars. Let year-of-multichannel ijt (j = 1,…, 5) be a dummy variable that indicates whether household i is multichannel for j years in year t. Specifically, these dummy variables indicate whether the household is a first-, second-, third-, fourth-, or fifth-year multichannel user at time t. For instance, “first-year-of-multichannel i1t ” is 1 when household i became multichannel user 1 year ago as of year t. In the following year, this variable becomes 0 and “second-year-of-multichannel i2t ” variable becomes 1.

Previous research has not studied the sales impact of multichannel usage over time (i.e., it has typically examined the effect of first-year-of-multichannel on sales). All the households in our sample use the catalog channel for purchasing. Thus, we consider the adoption of the new online channel as equivalent to becoming a multichannel household.

The POLS model is specified as follows:

$$ {y}_{it}=\gamma +{\displaystyle \sum_{j=1}^5{\rho}_j\mathrm{year}\hbox{-} \mathrm{of}\hbox{-} {\mathrm{multichannel}}_{ijt}} + {\displaystyle \sum_{m=1}^M{\beta}_m{x}_{mit}+{\lambda}_t+{\varepsilon}_{it}} $$
(1)

where γ is an intercept term and ρ j (j = 1, …, 5) are the parameters of interest that measure the long-term effects of multichannel usage on consumers’ total spending. The vector x it  = (x 1it , … x Mit )′ includes the M independent control variables that are listed in Table 1 and discussed in detail below. β = (β 1, …, β M )′ is a vector of parameters, and λ t denotes a set of period effects that capture common trends in all consumers’ total spending and ε it is an error term for capturing all other omitted factors. We assume that ε it is normally distributed with mean zero and variance σ 2 for all i and t.

We include the following control variables in our analysis: the firm’s marketing efforts, consumer demographics, and socioeconomic factors. Marketing activities have considerable impact on consumers’ spending [6, 7] and channel choice [6, 7]. Demographic and socioeconomic factors impact consumers’ spending as well propensity to become multichannel. Venkatesan et al. [6] and Ansari et al. [7] find that younger customers with high income tend to spend more. Such customers are also more likely to try new channels (37). As formal education is highly correlated with income, and influences consumers’ channel choice Mattilla et al. [25], we include our proxy for education level in our analysis. The number of children within a household directly influences the household’s disposable income, consumers’ spending [7], and a household’s inclination to become multichannel. Customer tenure has been shown to be associated with multichannel shopping Kumar and Rajkumar [5]; [4]), and therefore is included in our model.

Marketing variables are likely to be endogenous since companies target their marketing efforts to their best customers [9]. Conventionally, companies typically use basic statistical models, such as RFM models, to identify the best customers [9]. Following Ansari et al. [7] and Blattberg et al. ([9], p. 652), we control for such a potential endogeneity problem by including a RFM variable (i.e., cumulative purchase incidences) in our analysis. However, a single RFM variable may not control for additional factors influencing firm’s managers use to allocate marketing effort for a given customer. To control for such additional factors, Risselada et al. [26] recommend including average marketing variable per period per each household. This is because the mean marketing effort across all periods serves as a proxy for additional factors besides RFM-type variables affecting marketing efforts.

Even with the rich set of control variables we use, the POLS model does not fully control for the self-selection problem. Particularly, inclusion of demographic variables as covariates controls only for observed heterogeneity. Nonetheless, Angrist et al. [24] point out that the POLS model impressively minimizes self-selection and causality biases when customer demographics are included as explanatory variables. To control for unobserved household heterogeneity, we utilize a fixed-effects (FE) specification as our next model.

4.1.2 Fixed Effects

The FE model extends POLS by specifying a customer-specific effect to account for unobserved, time-invariant customer heterogeneity. This model can effectively address the self-selection problem if unobserved heterogeneity or omitted time-invariant household characteristics underlie the self-selection process. For instance, the households may have different levels of “ability,” which influences their spending levels and their propensity to become multichannel users. Obviously, if customer ability evolves over time, then the FE model will still suffer from endogeneity (i.e., correlation between the error component and the model covariates) due to omitted time-varying components.

The FE model is specified as follows:

$$ {y}_{it}=\gamma +{\displaystyle \sum_{j=1}^5{\rho}_j\mathrm{year}\hbox{-} \mathrm{of}\hbox{-} {\mathrm{multichannel}}_{ijt}}+{\displaystyle \sum_{m=1}^M{\beta}_m{x}_{mit}+{\lambda}_t+{\mu}_i+{\varepsilon}_{it}} $$
(2)

where μ i denotes the unobservable, time-invariant, household-specific fixed effect. The FE model has one main disadvantage: its specification cancels out all of the time-invariant household-specific characteristics, such as demographics and socioeconomic factors, from the analysis.

Another way of coping with unobserved, time-invariant customer heterogeneity is using first-difference specification, which we discuss in the next section.

4.1.3 First Difference

The first-difference (FD) model “differences out” the customer-specific effect by subtracting the (t − 1)th equation from the tth equation. Similar to the FE model, the FD model only controls for time-invariant, unobserved heterogeneity ([27, 28]). The FD model is specified as follows:

$$ {y}_{it}-{y}_{it-1}={\displaystyle \sum_{j=1}^5{\rho}_j\left(\mathrm{year}\hbox{-} \mathrm{of}\hbox{-} {\mathrm{multichannel}}_{ijt}-\mathrm{year}\hbox{-} \mathrm{of}\hbox{-} {\mathrm{multichannel}}_{ijt-1}\right)}+{\displaystyle \sum_{m=1}^M{\beta}_m\left({x}_{mit}-{x}_{mit-1}\right)+{\lambda}_t+\left({\varepsilon}_{it}-{\varepsilon}_{it-1}\right)} $$
(3)

The FD model has two disadvantages. First, similar to the FE model, it wipes out all of time-invariant household-specific characteristics. Second, the standard errors estimated by ordinary least squares method will be biased, as the error terms in the FD model are correlated across observations. Therefore, we use Huber-White sandwich estimators to remedy this problem.

4.1.4 Lagged Dependent Variable

Neither the FE nor the FD models account for time-varying, unobserved heterogeneity. To control for this potential source of bias, we use a lagged dependent variable (LDV) model as our fourth model. Unobserved heterogeneity (or ability) may change over time as a result of learning or maturation. The LDV model captures this changing unobserved heterogeneity by using the lagged dependent variable as a proxy. The LDV model specification is:

$$ {y}_{it}=\gamma +\upalpha {y}_{it-1}+{\displaystyle \sum_{j=1}^5{\rho}_j\mathrm{year}\hbox{-} \mathrm{of}\hbox{-} {\mathrm{multichannel}}_{ijt}}+{\displaystyle \sum_{m=1}^M{\beta}_m{x}_{mit}+{\uplambda}_t+{\upvarepsilon}_{it}} $$
(4)

where the lagged value y it − 1 is included in the model to capture persistence in a household’s spending and also to control for the self-selection in multichannel usage since multichannel users tend to be heavy users [2, 9]. The advantage of a LDV model is that it controls for both time-invariant and time-variant unobserved heterogeneity. However, such a control comes with a heavy price: the lagged dependent variable and the model error term are correlated, and consequently, the results will be biased. To address this, we use the Arellano-Bond generalized method of moments (A-B GMM) estimation as our final model.

4.1.5 Arellano-Bond GMM

This method (also referred as dynamic panel data estimation) combines the essence of the LDV and the FE models. This model accounts for the unobserved time-invariant household characteristics, does not require these latent characteristics to be uncorrelated with other covariates, and allows for dynamic structure (i.e., including the lagged dependent variable in the regression). This model is specified as:

$$ {y}_{it}=\gamma +\alpha {y}_{it-1}+{\displaystyle \sum_{j=1}^5{\rho}_j\mathrm{year}\hbox{-} \mathrm{of}\hbox{-} {\mathrm{multichannel}}_{ijt}}+{\displaystyle \sum_{m=1}^M{\beta}_m{x}_{mit}+{\lambda}_t+{\mu}_i+{\varepsilon}_{it}} $$
(5)

Note that y it − 1 is persistently correlated with the error structure in Eq. (5). In addition, because μ i appears in each time period, the model cannot be estimated through simple least square estimation procedures. Anderson and Hsiao [29] suggest first differencing the equation to remove fixed household effects and using lagged explanatory variables as instruments to create moments for estimation. Arellano and Bond [30] suggest using deeper lags (i.e., two or more periods) as instruments for GMM estimation to achieve more efficiency.

4.2 Matching Methods

The goal of matching methods is to mimic experimental designs by pairing treated and untreated customers who have comparable characteristics but not treatments [22]. That is, the objective of the matching methodology is to artificially create treatment and control groups. These methods rely on the assumption that the observed characteristics are informative enough that controlling for them is sufficient to remove any self-selection effect, referred to as the “conditional independence assumption.” A rich dataset on observed heterogeneity is required to meet the conditional independence assumption [24].

The most commonly used matching methods are (i) covariate matching (Avery et al. [46]; [2, 31]) and (ii) propensity score matching ([32]; Mithas et al. [33]; [13, 23, 28, 34]). The covariate matching pairs multichannel and mono-channel households based on observed household demographics and socioeconomic factors. This method, however, comes with inherent problems: utilizing many customer characteristics to find similar treated and untreated customers is tremendously challenging [1, 2, 31]. One way to cope with this problem is to reduce the dimensions of data by using the propensity score matching (PSM).

To create matched pairs, PSM uses the conditional probability that a customer with particular observed characteristics becomes a multichannel user. A logistic model estimates the propensity score for each household to become multichannel shopper. Because PSM reduces each consumer’s propensity to a single score, a matched pair with highly similar propensity scores may, in fact, have different household characteristics [22, 24]. However, Smith et al. [34] find that PSM performs vastly better than the covariate matching method. Therefore, we use PSM to pair multichannel and single-channel households.

Let MC i indicate if household i is a multichannel user by the last period of the data (MC i  = 1), or not (MC i  = 0). Following previous research, we use the following logistic regression model to estimate the propensity scores:

$$ P\left({\mathrm{MC}}_i=1\right)=1/\left(1+ \exp \left(\eta +{\displaystyle \sum_{j=1}^J{\delta}_j{z}_{ij}}\right)\right) $$
(6)

where z i  = (z i1, … z iJ )′ are J observed household characteristics (discussed below), η is an intercept, and δ = (δ 1, …, δ J )′ is a vector of regression parameters.

We perform two kinds of propensity score estimations: the first is based on household characteristics measured at the zip code level (i.e., US Census data on education, median age, median income, and average family size). The second uses household-level characteristics (i.e., head of household’s age, household income, and number of children in the household). As Heckman et al. [10] point out, only variables that are unaffected by the treatment should be included in the matching logistic model. Therefore, we do not include dynamic variables, such as spending, incidence decisions, or updated RFM-type variables, in this regression. In our data, the household-level characteristics were measured before the household became multichannel. Similarly, census-level characteristics are not likely to be affected by the observed households’ decision to become multichannel, as we observe only a small subset of households living in each zip code.

We use the logistic parameter estimates to predict the propensity score for each household and implement the nearest neighbor algorithm to create matched pairs.Footnote 4 There are several variants of the nearest neighbor algorithm, such as with or without replacement. We select the algorithm without replacement, where a household can be matched only once, and therefore, eliminate the risk of artificially giving more weight to some households in our analysis. Any unmatched household is dropped from the dataset and not included in the regressions. To ensure there are no ordering effects during the matching process, we randomize the order of data before matching.

4.3 Combining Panel Data Econometrics with Propensity Score Matching

Our rich data enable us to compare the results of matching based on aggregate- and household-level variables and address an ongoing debate in literature: several papers in the social sciences rely on aggregate-level data to create matched pairs. For example, Avery et al. [20] and Degeratu et al. [31] use US census data at zip code level to generate matched pairs. Gensler et al. [13, 23] and Heckman and Navarro-Lozano [22] criticize the use of aggregate-level data for matching purposes. They argue that aggregate-level data do not provide enough information to construct functional matched pairs. Fortunately, our data contain household-level demographics. By adding US Census variables to the main data, we create two scenarios of matching: matched pairs based on (i) US Census data (measured at the zip code level) and matched pairs based on (ii) household-level data. We compare the results obtained by analyzing the datasets created under these two scenarios.

In sum, we estimate the five different econometric specifications (POLS, FE, FD, LDV, and A-B GMM) on three different datasets: (i) no matching, (ii) matched pairs based on zip code-level demographics, and (iii) matched pairs based on household-level characteristics. Using panel data econometric techniques in conjunction with matching methods is relatively new, and to the best of our knowledge, our study is the first one combining a dynamic panel data econometrics model with the propensity score matching.

5 Empirical Results

Most of the econometrics specifications fit the data well. Adjusted R 2 statistics of POLS, LDV, and FE models are between 37 and 57 % across the three datasets.Footnote 5 Tables 3, 4, and 5 report the estimation results on the full data, matched pairs based on US Census variables, and matched pairs relying on household factors, respectively.

Table 3 Parameter estimates on full data (i.e., no matching)
Table 4 Parameter Estimates on Matched Pairs Based on U.S. Census Data
Table 5 Parameter estimates on matched pairs based on household level data

5.1 Results Based on Full Data

Table 3 depicts the results when the econometrics models are estimated on the full data. The results are reasonably consistent.

5.1.1 The Effect of Multichannel Usage

Across all models, multichannel customers spend much more than mono-channel customers in the first year they become multichannel. Across all models, the coefficients pertaining to the first year of multichannel usage are significant (p < 0.001) and vary from ρ 1 = 89.14 (POLS) to ρ 1 = 101.69 (FD). In their second year of being multichannel, customers still spend significantly more than mono-channel customers albeit with a lower magnitude. Across all models, the coefficients pertaining to the second year of multichannel usage are significant (p < 0.001) and vary from ρ 2 = 25.36 (FD) to ρ 2 = 68.15 (FE). On average, multichannel customers spend $94.97 ($43.45) more than mono-channel customers in their first (second) year of becoming multichannel. These results are consistent with results in the marketing literature that multichannel customers spend more on average than mono-channel customers ([4, 5, 7, 19]; Kushwaha and Shankar [36]).

In the long run (after year two), however, the difference between multichannel and mono-channel customers’ spending disappears. Starting from the third year of being multichannel, the spending levels of multichannel and mono-channel customers are not significantly different (p > 0.05).Footnote 6 This result provides evidence that multichannel customers revert to their regular spending patterns within a few years of becoming multichannel. In Section 6, we provide potential explanations for why multichannel customers regress to their previous spending pattern over time.

5.1.2 The Effect of Control Variables

The estimation results of the control variables are consistent with previous findings. Past purchase incidence (RFM), which we included to control for endogeneity of marketing efforts, is positively associated with spending, suggesting that more frequent buyers spend more. Its effect is significant in the POLS (p < 0.001) and A-B GMM (p < 0.01) models and ranges from β = 19.63 (POLS) to β = 37.94 (A-B GMM). However, its effect is insignificant (p > 0.05) for the FE, FD, and LDV models. This result is not surprising since y it − 1 can be a proxy for past purchase incidence. Across the different models, we find that younger customers spend more than older customers. This effect is significant (p < 0.01) across models and ranges from β = −0.66 (A-B GMM) to β = −0.34 (LDV). Similarly, the number of children in the household is negatively associated with customers’ spending. Except for the LDV model, this effect is significant (p < 0.05) and ranges from β = −4.96 (A-B GMM) to β = −2.52 (LDV). Education is positively and significantly (p < 0.05) correlated with spending, with effects ranging from β = 22.20 (LDV) to β = 24.08 (POLS). All these results support the previous findings in the literature [6, 7, 37]. Customer tenure and income are not significantly related (p > 0.05) to customer spending.

The number of catalogs received has a positive and significant (p < 0.001) impact on customers’ spending, with effects ranging from β = 2.82 (A-B GMM) to β = 13.60 (FD). Whether a customer subscribes to receive e-mails is also positively and significantly (p < 0.001) associated with spending. The effects range from β = 38.44 (FD) to β = 74.95 (A-B GMM). Interestingly, the number of e-mails received is negatively and significantly (p < 0.001) correlated with consumers’ spending. The effects range from β = −3.53 (LDV) to β = −1.54 (FD). This result suggests that the company may be over-e-mailing its customers. Pew Research Center [38], Morimoto and Chang [39], and Ellis-Chadwick and Doherty [40] conclude that consumers perceive a high number of commercial e-mails as exceedingly intrusive and irritating, as they are annoyed by the volume and repetitive nature of such e-mails. Finally, average marketing efforts, a variable that we included to control for endogeneity, is significant for the POLS, FD and LDV models (p < 0.05), ranging from β = 5.70 (LDV) to β = 6.98 (FD).

The state dependence variable shows a significant positive effect (β = 0.40, p < 0.001) for the LDV model, but its effect is negative (β = −0.24, p < 0.001) for the A-B GMM model. This suggests that after correcting for the auto-correlated errors and customer observed heterogeneity, a household’s current spending is negatively correlated with their previous spending; households that made large purchases last year are more likely to make small purchases in the current year. Finally, the dummy variables capturing the time trend indicate that the households in our dataset tend to decrease their spending over time.

5.2 Results Based on Propensity Score Matching Using US Census Data

This PSM method uses US Census data to create matched pairs, and the resulting data are approximately 44 % of the original, full data. Table 4 depicts the estimates of the same five models using these matched pairs. To the best of our knowledge, this study is the first to estimate a dynamic panel data model on matched data based on propensity scores.

5.2.1 The Effect of Multichannel Usage

The results in Table 4 are similar to those in Table 3. Except for the FE model, multichannel usage is significantly associated (p < 0.001) with increased overall spending in the first 2 years. Thereafter, the difference between multichannel and mono-channel customers is insignificant (p > 0.05). These results are consistent with our previous findings based on the full data. The FE model shows that multichannel customers spend significantly more (p < 0.001) than mono-channel customers for the first 3 years. Comparing the PSM results in Table 4 with the full data results in Table 3, we can see that the magnitude of the spending difference between multichannel and mono-channel customers is slightly lower under the PSM models. On average, under PSM (full data), the first year difference is $90.60 ($94.97) and the second year difference is $36.67 ($43.45).

5.2.2 The Effect of Control Variables

The only customer characteristics variable that is significant (p < 0.01) across all five models is the age of the household head, suggesting that younger customers spend more than older one. The effects are similar in magnitude to those obtained using the full data. They range from β = −0.90 (A-B GMM) to β = −0.54 (LDV). Unlike the full data results, all the other customer characteristics are insignificant (p < 0.05). (In the POLS models, past purchase incidence has a significant impact (p < 0.01) on spending and customer with longer tenure spend significantly (p < 0.05) less.)

For the marketing effort variables, the results are also quite similar to those using the full data. Catalogs have a significant positive impact (p < 0.001) on consumers’ spending with effects ranging from β = 6.50 (A-B GMM) to β = 7.34 (FE). E-mail subscription also has a significant positive impact (p < 0.001) with estimates ranging from β = 59.21 (FE) to β = 78.28 (A-B GMM). The effect of number of e-mails sent is still significantly negative (p < 0.001) with a magnitude of about −2 across all the five models.

As in the full data analysis, the state dependence variable shows a significant positive effect (β = 0.38, p < 0.001) for the LDV model, but its effect is negative (β = −0.20, p < 0.001) for the A-B GMM model. Except for the FD model, the time trend effects have similar decay in customers’ spending over time. The FD model, however, shows increased trend over time.

5.3 Results Based on Propensity Score Matching Using Household Characteristics

This PSM method matches households based on their stated demographic and socioeconomic factors rather than relying on zip code-level data. As such matching puts the highest restrictions to create matched pairs, the size of the resulting dataset is approximately 31 % the size of the original data. Table 5 presents the results obtained by the five econometric models applied to these data. Overall, the estimation results in Table 5 are quite consistent with those in Tables 4 and 3. This suggests that matching techniques using aggregate level data (such as US Census demographics) produce similar results to those of matching methods based on individual-level data.

5.3.1 The Effect of Multichannel Usage

The effect of multichannel usage is quite consistent and close in magnitude across the two matching methods. On average, across all the models, the boost in revenues in the first year of being multichannel is $91.27 and in the second year is $36.89 (all effects are significant, p < 0.001). By the third year, the difference between the multichannel and mono-channel customers’ spending is no longer significant (p > 0.05), with the exception of the FE model (ρ 3 = 39.33; p < 0.01).

5.3.2 The Effect of Control Variables

Except for few differences, the effects of the control variables are also similar across the two matching methods. Younger customers spend significantly more than older ones (p < 0.05). Number of catalogs received and e-mail subscription have a significant positive impact on spending (p < 0.00), whereas number of e-mails sent has a significant negative impact (p < 0.001). We also obtain similar persistence and time trend effects. A difference in the results stems from the POLS model, where the effects of number of children become significant (p < 0.05).

6 Discussion

Except for the FD model, the effects are similar in magnitude across all regressions. This is not surprising as the FD model measures first differences, whereas the other regressions examine the relation between the dependent variable and regressors.

Our key finding is that multichannel customers spend significantly more than mono-channel customers in the short run but revert to their regular spending pattern over time. One potential explanation for this result is the novelty effect. According to the novelty effect, when customers start using a new channel, they derive epistemic (novelty) value from trying and learning new things [14, 41]. This epistemic value results in excitement [42], positive attitudes towards the purchase [18], higher customer satisfaction [42], and amplified consumption patterns [15, 43]. Cantor [44] and Sheth et al. [16] find that customers who are motivated by novelty value often return to their regular consumption patterns after satisfying their need for change. For some products and services, reverting to regular consumption patterns may take a few years. For example, Howard and Crompton [17] find that the boost in attendance and revenues associated with a new stadium may last several years.

Another potential explanation for the similarity of multichannel and mono-channel customers in the long run lies in the consequences of using the online channel. Ansari et al. [7] find that when customers use the Internet channel over time, they tend to buy less frequently from the firm. This suggests that customers may start to compare competitors’ products and prices and become price-sensitive over time [9]. That is, as multichannel customers (who use an online channel) become more comfortable using the Internet, they start to buy from competitors.

7 Conclusion

This paper investigates the consequences of multichannel shopping on consumers’ spending. It validates previously established theories suggesting that multichannel customers on average spend more. However, we find that multichannel customers increase their overall spending initially and return to their regular spending pattern over time. This may be due to increased customer switching behavior stemming from the online channel use. It can also be explained by the novelty effect, whereby adopters of new technologies and products increase their consumption for a limited time and regress to their regular spending pattern in due course.

Our results are based on observational data which are affected by self-selection: heavy spenders and sophisticated customers are more likely to become multichannel. That is, the direction of causality between multichannel shopping and customers’ spending is not clear. To address this issue, we use different panel data econometric models and combine them with two propensity score matching methods. Our results are very consistent across all analyses. Further, we empirically demonstrate that using matched pairs based on aggregate- and household-level characteristics produces quite similar results. This finding suggests that matching techniques using aggregate level data (such as US Census demographics at zip code) produce reliable results and can control for self-selection bias. The findings also suggest that POLS estimates do a reasonable job of uncovering the key relations.

We validate previous findings that marketing activities, customer demographics, and socioeconomic factors influence customers’ overall spending. Catalogs increase households’ spending. E-mails, on the other hand, tell a different story: While e-mail subscriptions are positively correlated with spending, the number of e-mails received has a negative influence. The latter result implies that the overuse of e-mails creates irritation among retailer’s customers. We find that younger customers with college education tend to spend more. In addition, the number of children in a household is negatively associated with spending, suggesting that having more children decreases a household’s disposable income. Alternatively, these households are likely to be more price-sensitive and less loyal to the sponsoring firm.

Managerially, these findings indicate potential drawbacks of expanding channel structures for firms. Although it has been typically accepted that opening and maintaining new channels increases customers’ lifetime value by creating multichannel customers [13], our results suggest that managers should be aware that the increased revenues associated with multichannel usage may not be sustained in the long run.

This paper is not free of limitations: first, our paper analyzed the long-term effects of multichannel shopping on customer’s spending when a firm adds an online channel. Thus, it is possible that our results may not generalize when the firm adds a different type of channel. As preliminary evidence, we analyzed a similar longitudinal data (albeit much shorter) to examine whether our findings hold when the firm adds a bricks-and-mortar store. The results summarized in the Appendix indicate that our results are robust. Second, the data provider is a single firm that operates in mature markets. The trend showing a decline in spending might not arise to companies operating in new and growing industries. Third, as the firm sells durables and apparels, these results may not be generalizable to consumer goods that are frequently purchased. Finally, we do not separate consumers’ incidence and purchase amount decisions. Separating these decisions (see for example, [7]) can provide further insights. Nonetheless, our results suggest that marketers can benefit from focusing on multichannel shoppers’ behavior in the long run.