1 Introduction

Online retailing has experienced a huge boost in the context of the COVID-19 pandemic. Of course, the lockdowns enforced in many European countries immediately impacted shopping behavior due to the closure of “non-essential” retail stores. However, these lockdowns were temporary and cannot be considered solely responsible for the enormous increase in the relevance of online retailing. Several studies suggest that during the pandemic—independent of any lockdowns—voluntary behavioral changes such as reductions in shopping mobility and switching to e-shopping due to fear of coronavirus infection occurred (e.g., Chenarides et al. 2021; Goolsbee and Syverson 2021; Jacobsen and Jacobsen 2020; Jiao and Azimian 2021; Shamshiripour et al. 2021). In addition, in many European countries, the face mask mandate in retail stores, which many consumers found uncomfortable or categorically rejected, may have impacted shopping behavior (Bonial 2020; Knotek et al. 2020; Taylor and Asmundson 2021). In view of the above findings, one can assume that the pandemic situation has also influenced spatial shopping behavior, independently of forced business closures, particularly in favor of e-commerce. Research on shopping behavior has so far focused on either store choice in physical retailing or channel choice in multi-channel retailing, using either store choice or channel choice models (see Sect. 2). The studies mentioned above, on the other hand, refer to changes in (shopping) behavior in the COVID-19 pandemic, but without modeling spatial shopping behavior. Unfortunately, there have not yet been any model-based studies on spatial shopping behavior in the context of the COVID-19 pandemic. This study aims to fill this gap.

Based on an empirical, model-based store choice approach, the current study attempts to answer the following research questions: (1) What are the main drivers of spatial shopping behavior given the availability of both physical and online stores? (2) Have the determinants of spatial shopping behavior changed during the pandemic? (3) Have fear of infection and the obligation to wear a face mask in retail stores influenced channel choice in favor of online retailing? In order to answer these questions, Wieland’s (2021a, b, 2023) study design on micro-econometric modeling of spatial shopping behavior in a multi-channel context is here repeated for the purposes of comparison. The empirical analysis focuses on consumer electronics retailing and furniture retailing. Empirical data on shopping behavior was collected via a representative consumer survey conducted in three German regions. The econometric modeling approach follows on from Wieland’s (2021a, 2023) previous studies on consumer electronics and furniture shopping; it is based on the hurdle model, which is used frequently to answer questions about individual consumption decisions and with which it is possible to split human behavior into an “if” and a “how” decision. Since the model is designed for micro data, it is possible to include individual consumer characteristics of both an objective and subjective nature. The current model considers shopping transaction costs, store characteristics, and psychographic consumer characteristics (shopping attitudes). In order to investigate the influence of the pandemic on shopping decisions, attitudes including fear of infection and the perceived burden of masks were also surveyed and integrated into the model as explanatory variables.

The study approach demonstrates that online and physical retailers can be incorporated into a store choice model for spatial shopping behavior, which is originally derived from retail location theory and developed for physical retail locations alone. The fact that such a model can also be used for the multi-channel context represents a theoretical and methodological extension of the analysis concepts in retail geography. With respect to the comparison of shopping behavior before and during the pandemic, it can be shown that the determinants of shopping behavior have largely not changed. While channel choice is mainly explained by the age of the consumers, their attitudes and, in some cases, their place of residence, transaction costs and store features have a major influence on store choice. The pandemic situation—in the sense of fear of infection and mask aversion—has only a minimal impact on shopping behavior, if at all.

The paper is structured as follows. Section 2 describes the literature on spatial shopping behavior in the context of retail location theory, multi-channel shopping behavior, and voluntary behavior changes during the pandemic. Section 3 outlines the modeling approach, the explanatory variables, and their expected impact, as well as the data collection procedure. In Sect. 4, the empirical findings are presented in terms of descriptive results, intermediate results toward the psychographic consumer attributes, and model results for the three survey areas. Section 5 summarizes the main conclusions of this study and discusses some limitations of the study.

2 Literature review

2.1 Spatial shopping behavior in retail location theory

According to Brown (1993), there are four domains of retail location theory: (1) central place theory (and its extensions), (2) spatial interaction models, (3) theories and models of retail agglomeration, and (4) bid rent theory—whereby the first three focus on spatial shopping behavior. Central place theory (CPT) (Christaller 1933) and its successors focus spatial shopping behavior by emphasizing the respective roles of accessibility and (consumer) transport costs on the choice of a shopping location. According to CPT, consumer demand for a given good decreases with increase in transport costs (distance-dependent demand), and consumer sensitivity toward transport costs reduces with decrease in purchasing frequency of the desired good. CPT was first adopted worldwide in the 1960s and has been empirically tested and theoretically expanded over a period of several decades. Extensions of this theory aim to make it more dynamic (e.g., Eaton and Lipsey 1982; Ghosh 1986; Lange 1973), with some emphasizing the importance of multipurpose shopping in particular: An important argument emerging from these extensions is that retail locations with greater potential for multipurpose shopping are preferred by consumers, even if visiting them incurs higher transport costs, a factor which may be regarded as the explicit implementation of agglomeration effects (here: urbanization economies) within CPT (Eaton and Lipsey 1982; Ghosh 1986; Lange 1973). The CPT received new attention and further expansions of its theoretical ideas through the emergence of the so-called “New Economic Geography” (NEG). The basic assumptions regarding spatial consumer behavior were either retained or expanded by aspects such as multipurpose and/or comparison shopping (e.g., Fujita and Thisse 2002).

Independent from CPT, spatial interaction models (SIM) for retailing incorporating similar theoretical assumptions toward spatial consumer behavior (e.g., with respect to accessibility) have been developed (e.g., Reilly 1931; Huff 1962). Originating from deterministic models for two supply locations (e.g., Reilly 1931), Huff (1962) created a probabilistic store choice model based on microeconomic assumptions. This model is formalized mathematically and estimates the probability of a (shopping) decision based on the assumptions of utility maximization and imperfect information of consumers. Consumer utility of a shopping location is assumed to be impacted by (1) travel time and (2) the location’s respective assortment and store size. Travel time has an overproportionate negative influence on store choice due to the opportunity costs involved in traveling to shopping locations. Conversely, store size increases consumer utility of visiting a store because consumers decide for a shopping location based on imperfect information, whereby, the larger the store’s assortment, the more likely it is that a consumer will obtain the desired goods. However, as consumer search and decision costs increase with increase in assortment, a larger assortment is assumed to be affected by diminishing marginal utility. In the Huff model, the probability that a consumer chooses a shopping location is equal to the store’s utility relative to the sum of the utilities of all shopping locations.

The Huff model has been further developed in many ways, both theoretically and in terms of application (methodology). These include, among other things, the transformation into an econometric model with which empirical shopping behavior can be analyzed, namely the Multiplicative Competitive Interaction (MCI) model (Nakanishi and Cooper 1974). An important extension, for example, is the Competing Destinations Model by Fotheringham (1985), in which an additional variable that depicts the clustering of stores operationalizes the influence of (positive) agglomeration effects or competitive effects. Further developments concern, for example, the integration of (subjective) variables regarding the shops/chains concerned (e.g., Stanley and Sewall 1976) or model alternatives for depicting multipurpose trips (e.g., O'Kelly 1981). The Huff model also plays an important role in the applied context, especially in location planning of retail companies (Berman and Evans 2013) and in spatial planning regarding retail impact assessments (Khawaldah et al. 2012; Müller-Hagedorn 2020). Note that the Huff model is just one of many spatial interaction models, many of which are also used to model spatial shopping behavior (e.g., Nakaya et al. 2007).

The third theoretical strain mentioned above emphasizes the role of positive agglomeration effects in retailing due to the opportunity for multipurpose and comparison shopping. Hotelling’s microeconomic model (“principle of minimum differentiation”) is usually considered to be the first theory of this kind (although it fails to consider multipurpose or comparison shopping). Hotelling’s model describes a duopoly in a linear market, whereby suppliers relocate to maximize their profits. The best location for both suppliers is a cluster in the middle of the market, where each supplier serves one half of the market (Hotelling 1929). While all the aforementioned theories and models represent a deductive approach, Nelson (1958) has worked solely from an inductive perspective. Based on empirical observations on shopping behavior, Nelson formulated two theories on agglomeration effects in retailing. The “theory of cumulative attraction” relates to competing retailers selling different product variants (e.g., shoe stores). If such stores cluster together, this enables comparison shopping, and thus, generates more customer traffic when compared to the sum of all these retailers when located in solitary locations. However, stores from different sectors may increase their joint demand if they build a cluster provided that such stores are compatible with respect to multipurpose shopping (“rule of retail compatibility”). Clustering of (complementary or competitive) retail establishments has been incorporated into later monopolistic competition models and subsequent NEG models (e.g., Fujita and Thisse 2002).

The SIM and its successors can also be used directly empirically, provided real data on spatial shopping behavior is available. The assumptions of CPT and SIM with respect to assortment and accessibility were often examined empirically, using either econometric market area models such as the MCI model or discrete choice models for individual decisions. Here, the assumed impacts of, inter alia, assortment and travel time have been frequently confirmed (e.g., Baviera-Puig et al. 2016; Briesch et al. 2009; Orpana and Lampinen 2003; Popkowski Leszczyc et al. 2004; Suárez-Vega et al. 2015; Tihi and Oruc 2012; Wieland 2015, 2018). Positive agglomeration effects have only been examined in a few store choice studies with the related findings failing to demonstrate congruency and rather depending on the examined product types (Orpana and Lampinen 2003; Tihi and Oruc 2012; Wieland 2015). Importantly, this family of retail location theory and the associated empirical work is geared solely toward physical retailing.

2.2 Multi-channel shopping behavior

Research into multi-channel shopping behavior typically deals with consumer shopping channel choice (online vs. in-store) or online shopping frequency. A large branch of the literature dealing with multi-channel shopping behavior focuses on channel-specific shopping transaction costs. These costs include aspects such as travel time to physical stores, delivery charges and delivery time (waiting time) in online retailing, or search and information costs in both channels (Chintagunta et al. 2012). Several studies have shown empirically that decreasing travel time to physical shopping locations and increasing delivery charges and delivery time decrease the likelihood of online shopping (Chintagunta et al. 2012; Hsiao 2009; Marcucci et al. 2021; Marino et al. 2018; Schmid and Axhausen 2019). However, the informative value of such studies is limited since they are based either on survey experiments (Hsiao 2009; Marcucci et al. 2021; Schmid and Axhausen 2019) or on empirical data from individual companies (Chintagunta et al. 2012; Marino et al. 2018).

One explicit geographic perspective on online shopping deals with spatial differences in shopping channel choice based on two competing hypotheses. The innovation-diffusion hypothesis assumes that urban residents are more likely to buy online due to a greater openness to new technologies. In contrast, the efficiency hypothesis states that consumers in rural areas tend to buy more online because of the (assumed) lower accessibility to physical shopping locations (Cao et al. 2013). The first hypothesis has typically been confirmed in previous studies (Cao et al. 2013; Farag et al. 2006; Zhen et al. 2018); however, there are several more recent studies which no longer find a higher online affinity in (large) cities (Clarke et al. 2015; Beckers et al. 2018). In contrast, the efficiency hypothesis was frequently confirmed. Several studies have found that spatial proximity to competing stores decreases the likelihood of online shopping (Cao et al. 2013; Clarke et al. 2015; Dijst et al. 2008; Farag et al. 2006; Zhai et al. 2017; Zhen et al. 2018). However, these effects have been found to differ between region types (Cao et al. 2013) and product types (Farag et al. 2006; Zhen et al. 2018). Both the transaction cost perspective and the efficiency hypothesis relate to accessibility of physical stores. As accessibility and opportunity costs of traveling are important aspects in retail location theory, there is an obvious connecting point between store choice and channel choice studies. In principle, delivery fees and waiting time which occur in e-shopping might be regarded as the equivalent of the opportunity costs of in-store shopping trips.

Several studies investigating channel choice have also included psychographic consumer attributes, in particular attitudes, which are subjective perceptions of reality (here: shopping channels), regardless of whether these perceptions are correct or not (Dijst et al. 2008). Typically, latent variables are inferred from individual statement items which relate, for example, to the perceived risks of online purchases (e.g., with respect to credit card fraud or privacy) or to the pleasure of shopping in physical stores. The inferred variables then represent higher-level individual attitudes with respect to the shopping channels. Several studies have shown that a positive attitude toward e-commerce (e.g., low risk aversion) increases the likelihood of shopping online, whereas enjoyment of in-store shopping reduces online affinity (Dijst et al. 2008; Schmid and Axhausen 2019; Zhai et al. 2017).

Additionally, most studies include socio-demographic consumer attributes. One obvious tendency is that consumers of a younger age tend to buy more online than older consumers, which may be attributed to a younger consumer’s experience with information and communication technology (ICT). Furthermore, the likelihood of buying online was found to be higher for consumers with higher education and/or income, male consumers, and consumers in employment (Beckers et al. 2018; Clarke et al. 2015; Farag et al. 2006).

Recently, a few studies that integrate online retailing into a store choice framework have emerged. The model by Beckers et al. (2021) approaches behavior on an aggregated level (similar to the Huff model), and thus, fails to allow assessment of the impact of individual consumer characteristics. Suel and Polak (2017) investigated channel, store, and travel mode choice with respect to grocery retailing simultaneously. They found few significant socio-demographic effects on channel choice but did demonstrate a higher likelihood for higher social classes to visit expensive stores and shop online. With respect to store choice, store size increases and travel time (driving, public transport, or walking) reduces the likelihood of a store being chosen, which is in line with previous store choice studies. Using a store choice and expenditure model, Wieland (2021a, b, 2023) has combined the aforementioned approaches of channel and store choice. Results have shown that channel choice is mainly influenced by shopping attitudes, place of residence, and age of the consumers. Consumers with a “pro online” attitude, urban residents, and younger consumers were found to exhibit a higher likelihood for e-shopping. Store choice and expenditures were primarily explained by store features such as assortment size, cross-channel integration, and transaction costs (travel time, delivery charges, and delivery time).

2.3 Shopping behavior changes in the COVID-19 pandemic

The dramatic increase in market share for online retailers during the COVID-19 pandemic has occurred due to changes in individual shopping behavior. Of course, these changes may be a direct result of government-imposed restrictions, in particular, lockdowns including stay-at-home orders and closing of “non-essential” businesses. When physical retail stores are closed during a lockdown (as imposed in most Western countries), there is no opportunity for shopping there at all. However, changes in shopping behavior may also occur on a voluntary basis in the absence of lockdowns and may result from individual fear of infection (Goolsbee and Syverson 2021). From a psychological perspective, fear of infection may motivate people to change their behavior in terms of avoiding situations with risk of infection and in establishing preventive actions (Stangier et al. 2022).

For example, with respect to Germany, Stangier et al. (2022) investigated the impact of fear of infection and knowledge about the virus on (reported) behavioral changes. For the measurement of fear of infection, they utilized the Perceived Vulnerability to Disease (PVD) scale consisting of two latent variables, “perceived infectability” and “germ aversion,” both of which were inferred from a survey (Duncan et al. 2009). The results demonstrated that both perceived infectability and germ aversion significantly increased preventive behavior. In addition, germ aversion significantly reduced risk behavior, which corresponded to reducing physical contacts. However, the Stangier et al. study did not explicitly investigate shopping trips and therefore could not assess whether behavioral changes were actually implemented.

There are various studies on spatial mobility that show a significant decline in physical shopping trips in the context of the pandemic. With respect to the first wave of infections in spring 2020, Jacobsen and Jacobsen (2020) analyzed Google mobility data from the USA and found a substantial reduction in the second half of March 2020 with respect to visits to transit stations, retail and recreation facilities, grocery stores, and pharmacies. These changes were found to be of a similar magnitude in states with and without a formal lockdown. This reduction of visits to grocery stores and pharmacies—both of which were never closed—was approximately 16% and 15%, respectively. Goolsbee and Syverson (2021) investigated spatial consumer behavior in the USA based on mobile phone data from March to May 2020, including US counties with and without formal lockdowns. Their results showed a large decline in over-all mobility, of about 60%, with the estimated impact of formal lockdowns being found to be approximately 7% only. This decline impacted, inter alia, clothing, and do-it-yourself stores as well as grocery stores and pharmacies. The Jiao and Azimian (2021) study covered the second phase of the pandemic and employed a survey on mobility behavior, which was conducted in the absence of formal lockdowns in the USA (October 2020). This study found a significant reduction of trips to retail stores and for trips by public transport, with stronger effects in the group of people aged 35 or older.

In their survey from April to June 2020 in Chicago, Shamshiripour et al. (2021) showed that people’s tendency to buy groceries online grew substantially during the first wave of the pandemic, regardless of the formal lockdown. For several other major US cities, Chenarides et al. (2021) found the same trend to be occurring, with respondents’ main reasons for not shopping in-store being “scared of COVID-19” and “feeling unsafe.” During this same time frame, Rossetti et al. (2022) also conducted a stated choice experiment investigating grocery store choice under pandemic conditions. They examined the effects of infection control measures while shopping, and found, among other things, that consumers accept certain inconveniences (e.g., longer waiting time, queues in front of the store) if, in return, infection control measures (e.g., face masks, limitation of the number of customers) are in place. All in all, these studies suggest that, at least in the early phase of the pandemic, there were voluntary changes in behavior to reduce risk and that consumers’ online affinity subsequently increased as a result of fear of infection.

In addition, the face mask mandate in retail shops, which applied in many European countries from April 2020 to (mostly) April 2022, may have had an impact on shopping behavior. As studies have shown, a certain part of the population rejects face masks for ideological reasons or at least finds them uncomfortable (Knotek et al. 2020; Matusiak et al. 2020; Taylor and Asmundson 2021). It is therefore plausible to assume that people who are negatively affected by wearing a mask avoid situations in which a mask is mandatory. However, it has not yet been empirically investigated whether consumers are switching or have switched to e-shopping because of this. There are only indications that consumers are avoiding retail locations where masks are mandatory. In a German survey from October 2020 conducted by Bonial (2020), 40% of the respondents stated that they shop less frequently in city centers because of the mask mandate in retail stores. In any case, the mandate might have encouraged, at least some, consumers to buy online rather than in-store.

3 Modeling approach and data collection

3.1 Modeling approach

The econometric strategy in this study is based on the modeling approach outlined in Wieland (2021a, b, 2023), which incorporates an adaptation of the hurdle model (Mullahy 1986) applied to (spatial) shopping behavior. The hurdle model is a special kind of count data model which was designed for excess of zeros in the dependent variable. This is a two-part model. In the first part, a binary probability model determines whether there is a null or a nonzero outcome; the positive outcomes are treated in the second part of the model using a (truncated) count data model. Thus, although the model is designed on the basis of mathematical principles (heavily skewed dependent variables), it nevertheless implicitly represents a two-step decision process (“if”- and “how”-decision), which is why the model is often used to analyze individual demand in micro-econometric studies (Greene 2012; Cameron and Triverdi 2005). Similar to previous store choice and channel choice models, the underlying rationale for the current model is probabilistic choice behavior which assumes consumer utility maximization. The first model part (usually referred to as “participation equation”) deals with whether consumer choice for something (here: shopping at a given store) is performed and measures the probability of that event occurring. Mathematically, this is the probability that the dependent variable (here: a consumer’s sum of expenditure at a given store) is greater than zero. The second part of the model (usually referred to as “intensity equation” or sometimes “expenditure equation”) is employed only for observations greater than zero and deals with the issue of how something is performed (here: a consumer’s sum of expenditure at a given store which was chosen before) (Cameron and Triverdi 2005; Wieland 2021a, b, 2023).

The representation provided here follows that of Wieland (2021a, b, 2023). The dependent variable in the model equals the expenditures of consumer I at (physical or online) store j, denoted Sij hereafter. This variable represents both store and consumer attributes. The utility of store j (j = 1,…,J) for consumer i (i = 1,…,I) consists of an explained part (representative utility), Vij, and an unobserved part, the error term, εij:

$${U}_{ij}={V}_{ij}+{\varepsilon }_{ij}$$
(1)

The representative utility of store j for consumer i, Vij, is a combination of the utility of the channel and the utility of the individual store, with the first part reflecting the consumer's affinity for e-shopping and the second part comprising the most important attributes of the store. To emphasize this distinction, we will henceforth denote the total utility \({V}_{ij}^{P}\) , the store utility \({V}_{ij}^{S}\), and the channel utility \({V}_{ij}^{C}\):

$${V}_{ij}^{P}={V}_{ij}^{S}+{V}_{ij}^{C}$$
(2)

The part \({V}_{ij}^{C}\)  contains variables which are assumed to have a significant impact on channel choice, which includes socio-demographic, spatial, and attitudinal characteristics of consumers (see Sect. 2.2). The impact of a variable on channel choice is assessed using interaction terms incorporating the variable describing a consumer characteristic and a dummy variable (DOj) indicating whether store j is an online store (DOj = 1) or not (DOj = 0). Table 1, section “Independent variables A,” shows the independent variables considered relevant to channel choice. These are the variables used in the previous studies (Wieland 2021a, b, 2023) but supplemented by psychographic consumer characteristics related to the SARS-CoV-2 pandemic. The utility of the online channel for the individual consumer i, \({V}_{ij}^{C}\), is thus defined as:

$${V}_{ij}^{C}={\beta }_{1}D{O}_{j}+{\beta }_{2}D{25}_{i}+{\beta }_{3}D{65}_{i}+{\beta }_{4}D{m}_{i}+{\beta }_{5}D{E}_{i}+{\beta }_{6}D{L}_{i}+{\beta }_{7}P{O}_{i}+{\beta }_{8}F{I}_{i}+{\beta }_{9}G{A}_{i}+{\beta }_{10}M{A}_{i}+{\beta }_{11}\left(D{O}_{j}*D{25}_{i}\right)+{\beta }_{12}\left(D{O}_{j}*D{65}_{i}\right)+{\beta }_{13}\left(D{O}_{j}*D{m}_{i}\right)+{\beta }_{14}\left(D{O}_{j}*D{E}_{i}\right)+{\beta }_{15}\left(D{O}_{j}*D{L}_{i}\right)+{\beta }_{16}\left(D{O}_{j}*P{O}_{i}\right)+{\beta }_{17}\left(D{O}_{j}*F{I}_{i}\right)+{\beta }_{18}\left(D{O}_{j}*G{A}_{i}\right)+{\beta }_{19}\left(D{O}_{j}*M{A}_{i}\right)$$
(3)

where DOj denotes whether store j is an online store (dummy variable), D25i and D65i indicate whether consumer i is under 25 years old or at least 65 years old (dummy variables), Dmi indicates whether consumer i is male (dummy variable), DEi indicates whether consumer i is employed (dummy variable), DLi indicates whether consumer i lives in a large city (dummy variable), POi is the value of the latent attitude variable (“pro online” attitude) of consumer i, FIi and GAi are the values of consumer i's latent variables related to fear of infection (“perceived infectability” and “germ aversion”), MAi is the score for consumer i's rejection of face masks when shopping, β1, …, β19 are the regression coefficients to be estimated.

Table 1 Variables in the models

The coefficients of interest are those related to the interaction terms which include place of residence, the “pro online” attitude, and the pandemic-related variables. According to the innovation-diffusion hypothesis, urban residents exhibit a higher likelihood of buying online. The dummy variable DLi indicates whether consumer i lives in a large city, whereby “large city” is understood to mean a city with at least 100,000 inhabitants, which was taken from the official spatial classification in Germany (BBSR 2022). Thus, the coefficient β15 for the interaction term DOj*DLi is expected to be significant and positive. The variable POi represents the “pro online” attitude of consumer i, as used in Schmid and Axhausen (2019) and Wieland (2021a, b, 2023) and is inferred from 15 statements provided by surveyed consumers (see Sect. 3.2). A “pro online” attitude is expected to increase the likelihood of buying online, and thus, coefficient β16 of the interaction term DOj*POi is expected to be significant and positive.

To account for possible impacts of the pandemic on channel choice, three variables have been introduced into the channel choice part of the utility equation, the first two of which reflect individual fear of infection of consumer i (FIi: “perceived infectability,” GAi: “germ aversion”) and the third the rejection of face masks when shopping (MAi). The first two are inferred from the Perceived vulnerability to disease scale, as used by Stangier et al. (2022), and the last is simply a statement reflecting the perceived inconvenience of face masks in retail shops (see Sect. 3.2). If fear of infection, as defined here, impacts shopping behavior such that physical stores are avoided and online buying is preferred with the aim of reducing infection risk, the two coefficients β17 and β18, which relate to the interaction terms DOj*FIi and DOj*GAi, should be significant and positive (i.e., the higher the fear of infection, the more likely an online purchase). Similarly, if the face mask mandate in retail shops encouraged consumers to buy online instead of in-store, coefficient β19 of the interaction term DOj*MAi should be significant and positive.

In order to ensure that socio-demographic characteristics of consumers are not decisive, the recorded characteristics (age, gender, employment status) were included in the equation as control variables: The dummy variables D25i and D65i show whether the interviewed consumer i is under 25 years old or at least 65 years old. The variable Dmi indicates the gender of the consumer (1 = male, 0 = female), and DEi indicates whether consumer i is employed. For example, if β12 of the interaction term DOj*D65i is significant and negative, it means that people aged 65 and over buy significantly less online.

Store utility (\({V}_{ij}^{S}\)) includes a set of explanatory variables stemming from both store choice and multi-channel shopping behavior studies and describes the shopping alternatives and the corresponding shopping transaction costs (see Sections 2.1, 2.2). Table 1, section “Independent variables B,” lists the independent variables considered relevant to store choice. These are the variables used in previous studies (Wieland 2021a, b, 2023). Please note that the store utility equations for consumer electronics and furniture are not completely identical because of some industry-specific circumstances that need to be considered. For CE stores, the utility of store j for the individual consumer i, \({V}_{ij}^{S}\), is defined as (Wieland 2021a):

$${V}_{ij}^{S}={\gamma }_{0}+{\gamma }_{1}\mathrm{ln}{A}_{j}+{\gamma }_{2}{t}_{ij}+{\gamma }_{3}s{t}_{j}+{\gamma }_{4}s{c}_{j}+{\gamma }_{5}D{I}_{j}+{\gamma }_{6}D{C}_{j}+{\gamma }_{7}DS{M}_{j}+{\gamma }_{8} ln {C}_{j}+{\gamma }_{9}Dnf{r}_{j}+{\gamma }_{10} \left(\mathrm{ln}{A}_{j}*D{O}_{j}\right)+\sum_{g}^{G}{\delta }_{g}{D}_{{g}_{j}}\left[+\zeta \mathrm{ln}{S}_{i}\right]$$
(4)

where Aj denotes the number of articles of store j, tij is the travel time between consumer i and store j, stj and scj denote the delivery time and the delivery costs of store j, DIj and DCj indicate whether store j offers an integrated online shop or the BIPOS option (dummy variables), DSMj indicates whether store j is located in a shopping mall (dummy variable), Cj reflects store j's spatial proximity to its competitors, Dnfrj indicates whether store j is a non-full range provider (dummy variable), DOj denotes whether store j is an online store (dummy variable), Dgj indicates whether store j belongs to chain g, Si equals the all-over expenditures of consumer i, γ0, …, γ10, δg, and ζ are regression coefficients to be estimated.

For furniture stores, the utility of store j for the individual consumer i, \({V}_{ij}^{S}\), equals (Wieland 2023):

$${V}_{ij}^{S}={\gamma }_{0}+{\gamma }_{1}\mathrm{ln}{A}_{j}+{\gamma }_{2}{t}_{ij}+{\gamma }_{3}s{c}_{j}+{\gamma }_{4}Dsc{o}_{j}+{\gamma }_{5}Dsc{f}_{j}+ {\gamma }_{6}DC{C}_{j}+{\gamma }_{7} ln {C}_{j}+{\gamma }_{8} \left(\mathrm{ln}{A}_{j}*D{O}_{j}\right)+\sum_{g}^{G}{\delta }_{g}{D}_{{g}_{j}}\left[+\zeta \mathrm{ln}{S}_{i}\right]$$
(5)

where Aj denotes the number of articles of store j, tij is the travel time between consumer i and store j, scj denotes the delivery costs of store j, Dscoj and Dscfj indicate whether the delivery costs of store j depend on the order value or whether there is free delivery above a certain order value (dummy variables), DCCj indicates whether store j has an integrated online shop and offers BIPOS (dummy variable), Cj reflects store j's spatial proximity to its competitors, DOj denotes whether store j is an online store (dummy variable), Dgj indicates whether store j belongs to chain g, Si equals the all-over expenditures of consumer i, γ0, …, γ8, δg, and ζ are regression coefficients to be estimated.

According to the Huff model and several empirical store choice studies, assortment size of store j (Aj) is expected to have a positive but sublinear effect due to diminishing marginal utility of assortment. Thus, a significant coefficient γ1 between zero and one is expected. Because online stores regularly offer a considerably larger assortment than physical stores, it is tested whether there is a difference of the assortment impact between physical and online stores, assuming that this impact is lower for online stores, which means that coefficient of the interaction term ln Aj*DOj (γ10 in the consumer electronics model and γ8 in the furniture model) should be negative.

Three types of shopping transaction costs are included in the store utility function, namely travel time (tij; for physical stores), delivery time (stj), and delivery costs (scj; for online stores). According to the Huff model and other work from retail location theory, distance-dependent demand is assumed, and thus, the impact of travel time is expected to be significant and negative (coefficient of tij: γ2 < 0). This assumption is equivalent to the findings in channel choice studies, where a better accessibility of competing physical stores decreases the likelihood of shopping online. Similarly, it is expected that delivery costs and delivery time will reduce the choice probability of a store, which would mean that coefficients γ3 and γ4 of variables stj and scj in the consumer electronics model will be negative. However, there is a difference between consumer electronics and furniture stores: In the case of furniture online stores, the delivery time cannot be determined through research (because the delivery times between the individual products sometimes vary by several weeks or even months), which is why this variable (stj) cannot be included in the store utility equation for furniture stores. In return, two variables are included that refer to the different delivery policies of furniture online stores: the dummy variables Dscoj and Dscfj indicate whether delivery charges of online store j depend on the order value or whether they are free from a certain order value (which is offered by some furniture online retailers).

In addition, previous studies have, for the first time, examined whether omni-channel integration of retailers increases customer inflow and it was determined that with regard to consumer electronics and furniture retailers that this tends to be the case (Wieland 2021a, 2023). Thus, it is expected that retailers with an integrated online shop (see Sect. 3.3 for the definition) and which provide the “buy online, pick-up in store” (BIPOS) option will have a higher likelihood of consumer patronage. According to the previous studies, these two attributes of cross-channel integration are considered separately in the store utility function of the CE stores, using the dummy variables DIj and DCj. The corresponding coefficients γ5 and γ6 in Eq. (4) are expected to be significant and positive. With furniture multi-channel retailers, it is the case that almost every store that has an integrated online shop also offers the BIPOS option. The two dummy variables would show almost perfect collinearity, so the dummies for the furniture stores are combined, with the variable DCCj indicating whether store j has an integrated online shop and offers BIPOS (Wieland 2023). The corresponding coefficient γ6 in Eq. (5) is expected to be significant and positive.

To account for possible positive agglomeration effects, for consumer electronics stores, two variables reflect the potential for multipurpose shopping (dummy DSMj for locations in a shopping mall) and comparison shopping (Cj for spatial proximity to competitors), respectively. Note that the cluster variable Cj is adapted from Fotheringham's (1985) Competing Destinations Model. If both types of agglomeration economies hold, then coefficients γ7 and γ8 in Eq. (4) would be significantly positive. German furniture stores are very rarely located in shopping malls, and in this case not a single furniture store is located within a mall. It is therefore pointless to include this independent variable in the store utility equation for furniture stores (see Eq. (5)).

Possible influences concerning membership of a specific store chain (e.g., IKEA, Media Markt) are not part of the research interest of this study, but it makes sense to take them into account in the model. This is made possible through dummy variables for the relevant chains: Dummy variable Dgj indicates whether store j belongs to chain g, and the corresponding coefficient δg displays the (positive or negative) chain effect. Additionally, to control for differences in the individual expenditures, the expenditure equation of the hurdle model contains the all-over expenditures of consumer i, Si.

The first part of the hurdle model explains the choice of (online or physical) shopping alternative j, in particular, the probability that the expenditures of consumer i at store j is greater than zero (Sij > 0). This probability depends on the utility of shopping alternative j, \({V}_{ij}^{P}\) . The participation equation is formalized via a binary logit model:

$$Pr\left[{S}_{ij}>0|{V}_{ij}^{P}\right]=\mathrm{exp}\left({V}_{ij}^{P}\right)/\left(1+\mathrm{exp}\left({V}_{ij}^{P}\right)\right)$$
(6)

The second part of the hurdle model (intensity equation) deals with the amount of expenditure at the chosen stores (Sij for all Sij > 0). This part of the model is operationalized as a truncated Poisson distribution with a Poisson parameter of λij. The expected value depends on the store utility, \({V}_{ij}^{S}\):

$$E\left({S}_{ij},{S}_{ij}>0|{V}_{ij}^{S}\right)={\lambda }_{ij}/\left(1-{\mathrm{exp}\left(-{\lambda }_{ij}\right)}\right)$$
(7)

where

$$\mathrm{ln}{\lambda }_{ij}={V}_{ij}^{S}$$
(8)

The expected value of the store choice hurdle model (including both parts), E(Sij|Vij), is:

$$E\left({S}_{ij}|{V}_{ij}\right)=\left(Pr\left[{S}_{ij}>0|{V}_{ij}^{P}\right]\right)\left(E\left[{S}_{ij},{S}_{ij}>0|{V}_{ij}^{S}\right]\right)$$
(9)

The estimation of the hurdle model employs the maximum likelihood technique (Greene 2012). The significance level is set to 90% (p < 0.1). Model estimation was conducted in R (R Core Team 2021) using the package pscl (Zeileis et al. 2008).

3.2 Consumer survey

The empirical approach used in this study is analogous to previous research (Wieland 2021a, b, 2023). In order to answer the research questions, data on real-world shopping decisions as well as individual attitudes and socio-demographic characteristics was required. This data was collected in a self-administered postal consumer survey from September to December 2021 in three German planning regions (Regional Association of South Lower Saxony with 531,814 inhabitants, Middle Upper Rhine Region with 1,043,465 inhabitants, and Regional Planning Association Halle with 740,278 inhabitants). These study areas were chosen in order to incorporate both urban and rural regions (BBSR 2022) as well as both Western and Eastern Germany. In principle, the questionnaire was intended to be filled out using paper-and-pencil; however, the respondents were also given the option to fill it out in web form. The addresses of contacted individuals came from a random sample of the municipal registration offices, whereby the minimum age for making contact was set at 15 years. A total of 129 municipalities were asked in all three survey areas, of which 118 municipalities (91.5%) participated and provided address data.

In the questionnaire, shopping behavior was obtained by asking about purchases made in the recent past. The individuals were asked about their three last purchases of different goods (including consumer electronics and furniture) and the expenditures related to each purchase/shopping trip. For any purchase, the specific shopping destination (or online shop) was noted (e.g., “Media Markt in street X of municipality Y,” “Amazon online,” “IKEA online”). The expenditures of individual consumer i at (physical or online) store j is the dependent variable in the hurdle model (Sij). As an initial question, consumers were asked about their shopping frequency for specific goods.

The construction of the “pro online” attitude of consumer i (POi) originally stems from the stated choice experiment by Schmid and Axhausen (2019). Their items related to risk perception for online shopping and attitudes toward in-store shopping were extended to include several statements connected with environmental and work-related effects of online shopping, as well as privacy issues (see Wieland 2021a, 2023 for a broader discussion). In the second part of the questionnaire, 15 attitude items on a 4-point Likert scale (1 = agree, …, 4 = disagree) were included. Two factors were extracted using an exploratory factor analysis (principal component extraction, Varimax rotation), one of which was expected to cover a “pro online” attitude.

The third part of the survey dealt with the pandemic situation from the respondent’s perspective. Fear of infection was targeted in order to examine any possible impact on shopping decisions. This section of the questionnaire included items from the Perceived Vulnerability to Disease (PVD) Scale (Duncan et al. 2009) in the same version (13 items) as used by Stangier et al. (2022). However, unlike this study, the PVD items were scaled on a 4-point Likert scale (1 = agree, …, 4 = disagree) to be congruent with the shopping attitude items. The two latent variables representing fear of infection (“perceived infectability,” “germ aversion”) were inferred using factor analysis in the same way as for shopping attitudes. One additional item on the same scale dealt with the perceived inconvenience of face masks in retail stores (“It bothers me to wear a mask while shopping”). The last part of the questionnaire asked for socio-demographic characteristics (age group, gender, employment status, and number of household members).

In the survey, 2,526 questionnaires were completed (see Table S1 in the online supplementary appendix). Considering neutral losses (e.g., invalid address, deceased), the response rate over the three survey regions was equal to 17.7%. There is a slight over-representation of women, whereas people from the lowest and the highest age group tended to be somewhat underrepresented. Nevertheless, this does not represent a problem as the model includes socio-demographic variables to control for such under- and overrepresentations. One-third of the respondents live in a large city (Göttingen, Karlsruhe, or Halle), as defined in the national classification system (BBSR 2022).

3.3 Collection of store data

With respect to CE retailing, the inclusion criterion for consideration in the model analysis was that the respective (physical or online) store offered at least the following product range groups: “Electrical Household Appliances, Lighting (comprehensive)” and “Consumer Electronics, Electronic Media (comprehensive),” as defined by GfK (2020). Big-box stores (e.g., Media Markt, Saturn, Euronics XXL) typically also provide further GfK product range groups (“Information Technology,” “Telecommunications,” and “Photography, Optics [comprehensive]”). In order to distinguish between these and more specialized stores, the latter stores were marked with the dummy variable Dnfrj (“not full range provider”) in the store choice model (see Sect. 3.1). Consequently, the collection of physical CE stores includes all big-box stores (e.g., Media Markt, Saturn, Expert) and specialty stores (e.g., EP), as well as some departments of department stores and hypermarkets (e.g., Galeria, Real). The same inclusion criterion was applied analogously to the online shops. The relevant online providers include, for example, the online shops of the omni-channel retail companies Media Markt, Saturn, and Expert, as well as pure online retailers with the corresponding range of products such as Amazon or OTTO. With respect to furniture shopping, all physical and online stores were included that offer the GfK ranges “Furnishings (comprehensive)” and “Household Products, Glass, Porcelain,” whereby a part of the range “Electrical Household Appliances, Lighting (comprehensive)” (especially lamps) is always covered (GfK 2020). The physical furniture stores considered therefore include the outlets of the chains of large stores such as IKEA, XXXLutz, or Poco. The online stores considered include the online shops of these chains as well as other purely online retailers such as Amazon.

For both physical and online stores, the number of articles offered (variable Aj in the store choice model) and information about their cross-channel integration were collected. The cross-channel integration of the chain or the specific store was ascertained via internet research, in particular, the specific online shop of the store/chain. “Cross-channel integration” is operationalized by two variables here, with the first indicating whether the specific store/chain has an “integrated online shop” (variable DIj) and the second indicating whether the “buy online, pick-up in store” (BOPIS) service is available (variable DCj). In the case of CE stores, both variables are included in the model. For the furniture stores, the variable DCCj is included in the utility function that includes both attributes: DCCj = DIj*DCj (see Sect. 3.1). An “integrated online shop” was defined as the web platform of a multi-/cross-channel retailing company which provides information about (1) the assortment of both the online shop and the associated physical stores, and (2) the availability of each product in a given physical store, and (3) in-store price as well as some product details. The opportunity of BOPIS was verified simply by reviewing the company’s terms and conditions and the product pages. All online providers were researched for their delivery costs and their specified average delivery time (variables scj and stj). For all relevant physical stores, their location (street address) and store size (selling space in sqm) was recorded, with the latter being necessary for the interpolation of missing values on the assortment size (see below). Data on store size was requested from the corresponding companies and/or was obtained from public authorities, e.g., in land use plans.

The number of articles was quantified via desktop research using the websites and online shops of the corresponding retail companies (e.g., Expert, IKEA, JYSK, Media Markt). Most of the online shops of cross-channel retailers also provide an availability check for articles of each store, which allows the number of articles of specific stores of a chain to be counted. This procedure was automated using self-created functions and scripts for web scraping based on the package httr (Wickham 2019) in R (R Core Team 2021). While the number of items was completely available for all relevant CE online stores (27 in total), there was no data on assortment availability for 43 physical stores (108 CE stores in total). Therefore, the data of the stores for which both the selling space (in sqm) and the number of articles were available (49 stores) was used to estimate a regression model (see Table S2 in the online supplementary appendix) with which the number of articles was interpolated for the other stores. From a total of 123 physical furniture stores in all three study areas, the number of articles could be determined for 74 of them, and the sales area was also established for 50 stores. As with the CE stores, the missing values for the remaining 49 furniture stores were interpolated using an auxiliary regression (see Table S3 in the online supplementary appendix).

3.4 Further data processing

The street addresses of the survey respondents (residential address) and physical stores were geocoded automatically by accessing the OpenStreetMap address database (OSM Nominatim). Based on these coordinates, travel times between consumer and store locations (variable tij) were calculated using OSRM (OpenStreetMap Routing Machine), with travel time being defined as the fastest route between origins and destinations in terms of car driving time in minutes. It was assumed that car travel time is the best proxy for the opportunity costs of the shopping trip, since (1) it can be assumed that the majority of purchases at mostly decentralized CE and furniture stores are made by car, and (2) car travel times are estimated much more precisely than, e.g., public transport travel times. An interaction matrix for all I consumers (i = 1, …, I) and J stores (j = 1, …, J) with I*J rows was constructed and merged with the travel time matrix. The travel time variable tij was set to zero for online stores. The dependent variable Sij (expenditures of consumer i at store j) was calculated from the survey data, whereby observed purchases without a record of the corresponding expenditures were excluded from the analysis. These steps were performed in R (R Core Team 2021) using the package MCI2 (Wieland 2021c), which provides access to the OSM services by utilizing functions from the OSRM package for R (Giraud 2022).

The cluster variable Cj for the spatial proximity to competitors was calculated based on the street addresses of the stores. This was based on the airline distance between stores j and k, djk, with the weighting exponent set to two. Dummy variables were also calculated for the individual chains. Both calculations mentioned were carried out in R using the REAT package (Wieland 2019).

Since the place of residence of the respondents was known, they could be assigned to the respective BBSR municipality type (BBSR 2022). This was necessary for the dummy variable DLi, which indicates whether a consumer lives in a large city (minimum of 100,000 inhabitants, as defined by the BBSR). It must be noted here that the division between “urban” and “rural” is of course relatively rough, especially since there are intermediate types in the spatial classification mentioned. However, the present analysis is not about differences in shopping behavior between region types in general, but very explicitly about the difference between (large) cities and (more or less) “rural” areas, which is discussed in the “innovation-diffusion hypothesis” (see Sect. 3.1). In the three survey areas, one can make a very clear distinction between the three “large cities” (Göttingen, Karlsruhe, Halle/Saale) and their (predominantly rural) surroundings. The next largest towns are many times smaller and the vast majority of surrounding communities in the three regions are “small towns” and “rural communities” (according to the aforementioned spatial type classification). In addition, the study aims (also) at a comparison with the time before the COVID-19 pandemic, whereby the results of previous studies (Wieland 2021a, 2023) are used for comparison. In these studies, a dummy variable was used to denote “large cities.” If in the present study the shopping behavior was broken down by other region types (which would in principle be more accurate), a comparison with the results from the pre-pandemic period would no longer be possible.

4 Results and discussion

4.1 Descriptive results toward shopping behavior

Tables 2 and 3 show descriptive results from the 2021 survey as compared to those of 2019 for each specific study area. Shopping frequencies stated by the respondents (median of CE or furniture purchases in the last 12 months), the shares of store formats and channels in the purchases and expenditures (in %), as well as the average travel times of the in-store purchases (car driving time in minutes, median) are displayed. Shares of CE and furniture purchases and expenditures separated by study area at the level of municipalities are shown in the online supplementary information (Figs. S3 and S4).

Table 2 Channel-specific purchases and expenditures by survey areas—Consumer electronics stores
Table 3 Channel-specific purchases and expenditures by survey areas—Furniture stores

If we look at the first two study areas, we can see that the frequency of consumer electronics and furniture purchases (measured in the form of the median) has remained the same. In this regard, there are no differences between the two or three study areas. It is noticeable that the average travel times for in-store consumer electronics purchases fell from 2019 to 2021 and that this applies to all three store formats considered. With respect to all physical stores of this industry, the decline is 21% (South Lower Saxony) and 31% (Middle Upper Rhine Region), respectively. A similar change can be seen in furniture purchases: the median travel times for purchases in physical stores fell by 9.3% in South Lower Saxony and by 33.2% in the Middle Upper Rhine Region. Even if this descriptive analysis precludes any causal conclusions, it is nevertheless an indication that there was less willingness to invest time in in-store shopping trips. A possible explanation for this could be that due to increased work-at-home during the pandemic, purchases on the way to work or at the place of work were reduced and instead purchases were made more frequently near the place of residence. In any case, it has now been proven that retail sales have shifted to the city suburbs during the pandemic, which is attributed to increased work-at-home frequency (Alipour et al. 2022).

In the first two study areas, the share of e-shopping in consumer electronics purchases is higher than in spending, which was already evident for 2019. This is also reflected in the average expenditures, which are higher in physical retailing than in online retailing. However, this does not apply to the third study area, where the share of expenditure is higher than the share of purchases and the median expenditure is nearly the same. The fact that expenditure in physical stores is higher on average than for online purchases can be seen with regard to furniture purchases in all three study areas. In line with expectations, the market share of e-commerce in consumer electronics and furniture retailing increased from 2019 to 2021, both in terms of purchases and expenditures. But the opposite picture emerges for consumer electronics purchases in the first study area. With regard to South Lower Saxony, it cannot be ruled out that the high share of expenditures for in-store consumer electronics shopping is due to “outliers” (some individual cases with extremely high purchase amounts) because the median expenditure in online stores is lower than in the Middle Upper Rhine Region. The online share of consumer electronics purchases is highest in the Middle Upper Rhine Region, the share of expenditures is highest in the Halle Region. Summarized across all three study areas, online retailing accounted for 46.4% of expenditure. For furniture purchases, online market share is highest for both purchases and expenditures in the third study area. In all cases, the share of purchases significantly exceeds the share of expenses since the average purchase amount is much lower for online purchases. Across all three study areas, the share of online expenditures equals 15.8%. Accordingly, online retailers play a noticeable role in shopping for consumer electronics and furniture products and the market share of online retailing has increased significantly.

4.2 Psychographic consumer characteristics: shopping attitudes and fear of infection

Since shopping attitudes and fear of infection play an important role as (possible) explanatory variables in this study, it makes sense to take a closer look at these interim results. Table S4 in the online appendix shows the relative frequencies of the shopping attitude items, the associated latent variables, factor loadings, and additional information such as Cronbach’s α and cumulative explained variance. Table S5 in the online appendix shows the same for the PVD items.

Out of the 15 shopping attitude items, ten can be attributed to the first latent variable, the “pro online” attitude (variable POi in the model), whereas five items (1, 6, 8, 11–12) can be associated with the second latent variable, “physical shopping pleasure.” When interpreting the factor analysis, the scaling of the variables should be kept in mind (1 = “agree,” …, 4 = “disagree”), e.g., the lower the agreement with statement 2 (“Online shopping is associated with risks”), the higher the value of latent variable 1 (“pro online” attitude). Based on this relationship, the assignments to the latent variables are plausible in terms of content. The association of the variables with the factors, the cumulative explained variance and the internal consistency (Cronbach’s α), essentially correspond to the result of the first survey in 2019 (Wieland 2021a, 2023), and thus, also largely to that of Schmid and Axhausen (2019), from whose questionnaire the majority of the items was acquired. The value for Cronbach’s α for the “pro online” attitude equals 0.79, which can, at the very least, be classified as acceptable, but may also be rated as good. For these reasons, it can be assumed that the items used here sufficiently represent a “pro online” attitude.

The thirteen items on fear of infection are associated to the two latent variables (“perceived infectability” and “germ aversion”) as follows: Eight items are attributed to the first latent variable (2, 4–5, 7, 9, 11–13) and five items (1, 3, 6, 8, 10) to the second. The scaling of the items must be considered in the interpretation, e.g., the higher the agreement with statement 7 (“In general, I am very susceptible to colds, flu and other infectious diseases”), the higher the value of the “perceived infectability” factor (latent variable 1). It must be noted here that, analogous to the original paper by Duncan et al. (2009), some items were recoded (see Table S5 in the online appendix). This also applies to the item just mentioned, so that the highest numerical value corresponds to the highest level of agreement. Therefore, the factor loading of this item is positive (0.834). Against this background, the factor loadings can be regarded as understandable and plausible. The factor structure identified here is very similar to that found in the psychological studies from which the items are derived; the same applies to internal consistency and the explanation of variance (Duncan et al. 2009; Stangier et al. 2022). The first latent variable (“perceived infectability”) has good internal consistency and the second (“germ aversion”), at least, sufficient internal consistency. Since the results are very similar to those of psychological research, it is therefore assumed that the construct, “fear of infection,” is sufficiently represented to be operationalized for checking whether this characteristic influenced shopping behavior during the pandemic.

4.3 Determinants of channel and store choice

For each of the three survey regions, one hurdle model consisting of two model parts was estimated. Tables 4 and 5 show the results with respect to CE and furniture shopping for the Middle Upper Rhine Region. Results for the other two survey areas can be found in the online appendix (Tables S6 to S9). The first column lists the explanatory variables, while the second column contains the coefficients of the Participation Equation (given in Eq. (6)), and the third column contains those of the Expenditure Equation (for all Sij > 0, given in Eqs. (6)-(7)).Footnote 1 In order to ensure the robustness of the models (e.g., with regard to possible collinearity effects), the estimation was carried out successively. Additional variables were gradually included in the models. This showed that the log-likelihood values could be increased and the results with regard to the significance of individual variables did not change (usually, changes in the coefficients only became apparent from the second decimal place). Correlations between individual independent variables where an association seemed possible were also checked; only weak correlations, if any, were found.Footnote 2

Table 4 Estimation results for consumer electronics in the Middle Upper Rhine Region
Table 5 Estimation results for furniture in the Middle Upper Rhine Region

First, the results related to the choice of shopping channel (channel utility, \({V}_{ij}^{C}\)) are considered, which represent one part of the participation equation. The innovation-diffusion hypothesis states that urban residents are more likely to buy online, which has been confirmed for South Lower Saxony and the Middle Upper Rhine Region with respect to consumer electronics and furniture purchases in 2019 (see the participation equations for 2019). However, with respect to consumer electronics purchases, the coefficient for the interaction term, DOj*DLi, β15, for the 2021 data is only significantly positive in the second study area, but not (anymore) in South Lower Saxony and Halle. Regarding the models for furniture shopping, the associated coefficient is also only significantly positive in the second study area. Therefore, the assumption that people in large cities tend to shop more online can only be confirmed for one of the three study areas. It is in any case questionable whether such a result can confirm the innovation-diffusion hypothesis at all. On the one hand, it is questionable whether e-shopping can still be considered an “innovation,” given that online shopping has been available for over 20 years. On the other hand, a tendency toward more online purchases in (large) cities, especially for durable goods, can also be explained differently. Big-box stores for durable goods such as consumer electronics and furniture are often located in peripheral commercial areas and the car density is significantly lower in large cities. Therefore, the accessibility of such stores in large cities is sometimes poor, which would rather speak for a confirmation of the efficiency hypothesis (Wieland 2021a, 2023).

There is a clear result regarding shopping attitudes. The coefficient of the interaction term, DOj*POi, β16, is significant and positive in all three regions for consumer electronics purchases and in two of three study areas for furniture purchases (the coefficient is positive but not significant in South Lower Saxony). This result was already found in 2019 in the first two study areas and is now confirmed for the Middle Upper Rhine Region and the Halle Region; it is also congruent with the experimental choice study by Schmid and Axhausen (2019) and real-world shopping behavior with respect to groceries (Wieland 2021b). Channel choice may, therefore, be regarded as highly impacted by attitudes toward the online channel, which only seems obvious at first glance. It is by no means to be assumed that stated attitudes (e.g., regarding the impact of online shopping on the environment or on working conditions) are congruent with actual behavior; this may be due, among other things, to the social desirability bias in surveys. The result is also important because the innovation-diffusion hypothesis relates to psychographic consumer characteristics, implying that online affinity is greater in urban areas. On the contrary, we find, in the current study, predominantly no influence of the place of residence, but of the attitude itself. Within the scope of this study, it cannot be clarified why the “pro online” attitude in the first study area (South Lower Saxony) no longer plays a statistically demonstrable role in channel choice for furniture purchases.

It was assumed that shopping online rather than in-store could be a strategy to reduce one's risk of infection. The coefficients for the two interaction terms of the online dummy with the latent variables “perceived infectability” and “germ aversion,” β17 and β18, are typically found to be not significantly positive. There are only two exceptions. First, in the Middle Upper Rhine Region is an increasing degree of perceived infectability associated with a higher probability of buying consumer electronics online. Fear of infection (as operationalized here using the psychological PVD scale) therefore plays little or no role in channel choice for consumer electronics purchases, at least in this survey from 2021. Second, a significant effect of germ aversion in channel choice for furniture purchases was found in the first study region (South Lower Saxony). There was no significant effect of perceived infectability in any of the three areas. The fact that neither of the two latent variables that represent fear of infection significantly influences channel choice is the norm with regard to the two product types considered and the three regions examined. To some extent, these results contradict those of Stangier et al. (2022), who found fear of infection as a predictor for (declared) changes in behavior (in terms of risk avoidance, etc.). It must also be taken into account that such attitudes may have changed; the work of Stangier et al. (2022) refers to the first year of the pandemic, whereas the results shown here concern shopping behavior in the second year. The same applies to studies that dealt with channel choice in the early days of the pandemic and found influences of fear of infection and perceived insecurity (e.g., Chenarides et al. 2021).

There is no indication that the rejection of masks in shops influences the choice of shopping channel in favor of online retailing, as the related coefficient, β19, is not significant and positive for either product type considered in any of the three study areas. With respect to CE purchases, in two out of three study areas there is instead a significantly negative coefficient, which contradicts the expected effect. The same applies to furniture purchases in the second survey area. The reason for this may lie in unobserved consumer characteristics (e.g., with regard to lifestyle).

As expected, a pronounced age effect can be determined for the socio-demographic control variables. Older consumers (65 years and older) shop significantly less online, which is reflected in most models. At the same time, it has also been partially shown that young consumers (under 25 years) shop online significantly more often. The findings of the most models also demonstrate that male consumers shop online significantly less often, which is not consistent with the results of previous studies. With regard to the employment status, no clear statements can be made as the associated interaction variable in the CE models is only significant in one study area and in the case of furniture purchases the results contradict each other (positive in the first region, not significant in the second, negative in the third).

With respect to store utility, the impact of assortment is found to have a significant impact, as the coefficient γ1 of ln Aj is significant in both model parts in all three survey areas for both consumer electronics and furniture shopping. This influence, however, is not uniform. In the Participation Equations for consumer electronics, which reflect store choice probability, the coefficient is between zero and one in the first two survey areas but negative in the third region. In the case of the Participation Equation for furniture purchases, the associated coefficient is within this range in all three study regions. A coefficient between zero and one implies a sublinear positive impact of assortment on consumer utility, as stated, e.g., in the Huff model (Huff 1962). The positive effect is smaller for online stores as the coefficient γ10 of the interaction term, ln Aj*DOj is significant and negative in the first two study regions. This may be explained by the fact that online shops have a much larger assortment (in terms of breadth and depth of product range), implying that expanding this range leads to a much smaller increase in consumer utility. With respect to South Lower Saxony and the Middle Upper Rhine Region, the results from the 2019 survey were confirmed. Comparable results can also be seen with respect to furniture shopping (Wieland 2023). However, the results with respect to CE shopping for the Regional Planning Association Halle are the opposite. This may be due to a fundamentally different retail structure and, related to this, different consumer needs in the Halle Region (representing East Germany). The CE market there is dominated by peripherally located big-box stores with a large assortment, and the share of expenditure in e-shopping is, for this area, by far the highest. In this competitive environment, product range extensions may no longer play a role in shopping decisions.

The results concerning the shopping transaction costs are very clear, at least assuming consumers strive to reduce these costs as much as possible (Chintagunta et al. 2012). For both consumer electronics and furniture purchases and in all three survey regions, the coefficient γ2 of travel time, tij is significant and negative in the Participation Equation, which means that the greater the travel time to the (physical) store, the lower its choice probability. Distance-dependent demand or reduction in consumer utility due to opportunity costs of travel is one of the fundamental statements of retail location theory (e.g., Christaller 1933; Huff 1962) and has been empirically confirmed countless times (see Sect. 2.1). The same applies analogously to the shopping transaction costs for online retailers, at least regarding the models for CE shopping. The coefficients of delivery costs, γ3, and delivery time, γ4, are significant and negative in the Participation Equations in all three study regions. While delivery costs represent real expenses, delivery time can also be interpreted as the opportunity cost of waiting. In the case of furniture shopping, however, this expectation can only be partially confirmed, as the delivery charges only show the expected significant negative effect in one of the three study areas (Middle Upper Rhine Region). In the 2019 survey, delivery charges in the furniture retail sector were of no significant importance. This might be explained by the fact that the high expenses incurred when ordering furniture means that shipping costs are no longer a decisive factor for consumers. Other aspects of the delivery policy of furniture online stores influence store choice, which is shown by the fact that the dummy variables Dscoj and Dscfj sometimes have significant influences. These impacts are not uniform across the study regions and are not the focus here.

In summary, it can be said that in the case of store choice, the transaction costs of the channel or the individual store regularly reduce consumer utility and choice probability, respectively. However, in the intensity equation, these costs do not always show a negative, but sometimes a positive effect. This seems contradictory at a first glance, but it is possible that consumers compensate for the increased effort (e.g., longer travel time, higher delivery costs) by making a larger purchase. The results regarding the shopping transaction costs largely confirm the findings from the 2019 survey.

With regard to CE purchases, a clear difference compared to the results on shopping behavior before the pandemic can be seen in the influence of the cross-channel integration of the stores. In contrast to the previous model-based analysis, there is no longer a positive effect of an integrated online shop or the “buy online, pick-up in store” (BOPIS) option, since the two relevant coefficients, γ5 and γ6, for the dummy variables DIj and DCj, are usually either insignificant or significantly negative. This applies without exception to the participation equations, which means that the cross-channel integration of CE retailers does not increase consumer utility and choice probability, respectively.

In the furniture retail sector, the result is similar, with one exception. In the first study area (South Lower Saxony), a significantly positive effect was found for the cross-channel integration of a store/chain in the 2019 survey, which can now no longer be confirmed because the associated coefficient γ6 of the variable dummy variable DCCj is no longer significant. Likewise, no such influence on store choice can be found in the Halle region, while the utility-enhancing effect of cross-channel integration continues to be confirmed in the Middle Upper Rhine Region. As a result, no increase in the consumer utility of a store through its cross-channel integration can regularly be determined. One possible reason for this are changes occurring in the retail landscape, whereby some physical stores have, on the one hand, set up additional online shops and, on the other, some companies without any cross-channel integration have left the market.

In addition, the enormously increased relevance of online shopping is easier for large chains to cope with when compared to non-chain stores. It is therefore possible that integrated online shops and BIPOS, at least in consumer electronics and furniture retailing, have become so common that they no longer represent a competitive advantage. This especially appears plausible in the context of the COVID-19 pandemic, where many physical retailers started to engage in multi-channel retailing. Thus, the declining importance of cross-channel integration as a competitive advantage would also be an indirect consequence of the pandemic situation and the governmental interventions associated with it.

Positive agglomeration effects due to the potential of multipurpose and comparison shopping cannot be detected in all but one case, as the corresponding coefficients of the variables Cj and DSMj (the latter being tested only in the case of CE purchases) are mostly insignificant, and in some cases, negative. However, the clustering variable has a significantly positive influence on store choice for furniture purchases in the second study area (Middle Upper Rhine Region). This result is unique and appears to be contradictory at first glance. It is most likely due to differences in the location structure of the furniture stores between the study areas. A negative effect indicates the clustering of competitors whose assortments tend to be substitutable. On the other hand, a positive effect, as in the Middle Upper Rhine, suggests a cumulative attraction of stores (Nelson 1958). For example, there is a furniture cluster in the city of Karlsruhe in survey area 2, which includes an IKEA store that opened in 2020 and stores of two other chains. Since this effect was not yet found in the 2019 survey in the same area and the IKEA opening represents the only change in the sector under consideration, it strongly suggests that the IKEA settlement made this cluster effect possible. There is empirical evidence for positive agglomeration effects arising from IKEA openings (Daunfeldt et al. 2019). It is also important to note that clustering increases the choice probability in the mentioned case but not the expenditure, since γ7 in the intensity equation is negative. This indicates that while there is a cumulative attraction, the actual expenditure is split between the competitors.

There are also several significant chain effects with respect to both product types considered (e.g., regarding IKEA, Media Markt and Saturn) which are most likely largely determined by the competitive structure of the respective study region, and which are outside the scope of this study. The control variable for total expenditures by consumer correlates, as expected, positively with the expenditures in the stores.

5 Conclusions and limitations

First, it is important to conclude that the competitive relationships between online and physical retailers can be incorporated into a store choice model for spatial shopping behavior. With regard to the assumptions concerning consumer behavior (e.g., reduction of transaction costs, accessibility), there are large overlaps between traditional retail location theory (e.g., Huff model, CPT) and newer concepts around multi-channel shopping behavior (e.g., transaction costs perspective, efficiency hypothesis). As the models presented here show, these concepts can be combined, which represents an advance of the theoretical and methodological concepts of retail geography. Another advantage of the model is that it works at the individual level meaning that individual characteristics of both consumers and stores can be considered. From a practical perspective, for the purpose of business expansion or estimating the impact of new retail projects in urban and regional planning, it makes sense to take online retailing into account. The store choice model developed here provides a possible approach to this.

Second, the determinants of shopping behavior have essentially not changed compared to the period before the pandemic. The preference for online shopping can be explained by psychographic consumer attributes, more precisely, by shopping attitudes toward the online channel, age of the consumers, and, partially, place of residence. The choice of the specific (physical or online) store is determined primarily by shopping transaction costs occurring during the purchase (travel time, delivery time, delivery costs), and store features such as assortment. Several substantial differences in spatial shopping behavior before the pandemic (2019) and during the pandemic (2021) have been identified, such as the decline of the average travel time for in-store purchases or the lack of cross-channel integration as a competitive advantage. These changes in shopping behavior mentioned can be regarded as indirect consequences of the COVID-19 pandemic.

Third, with respect to direct consequences of the pandemic situation, the results indicate that fear of infection and the obligation to wear a mask have no or only a very minor influence on channel choice. At least it cannot be generally stated that consumers would have avoided in-store shopping and shopped online instead because of fear of infection (in retail stores), or because of a rejection of face masks. The pandemic situation was therefore—at least during the period under review—not a key driver of competition between the shopping channels. This does not mean that this influence did not exist in the first year of the pandemic; it must be emphasized that the purchases examined took place in 2021, at a time when the COVID-19 vaccination was already available and a large part of the population was vaccinated.

Despite these important findings, the current study also has limitations. First, there is a concern regarding the econometric strategy. Although the model approach includes and distinguishes between channel choice and store choice, it does not reveal the order in which these two consumer decisions are made. For example, it is possible for the channel to be selected first and then the store to be selected. Another decision-making level may also be interposed with respect to the store format (big-box store, specialty store etc.). In order to find out whether the decision is made sequentially or simultaneously, more differentiated surveys and models are necessary. One way to depict a hierarchical decision would be a nested logit model (e.g., Suel and Polak 2017), but this type of model is also based on mathematical considerations regarding the substitution of alternatives and cannot directly show the order in which the decisions are actually made. In addition, in such a model only the decision is modeled and not, as in the hurdle model, also the expenditure; therefore this would not be a satisfactory alternative.

Second, with regard to the informative value of the model results, it should also be considered that part of the empirically recorded purchases had to be excluded (e.g., purchase directly from the manufacturer) in order to ensure comparability of the providers included within the model. Of course, this means that information is lost.

Third, the underlying consumer survey was conducted from September to December 2021, which means that the recorded purchases were made in the previous months. Thus, the empirical data used here relate to shopping decisions outside the peak times of SARS-CoV-2 infections (which was also deliberately integrated into the study design as there were no or few other Corona-related restrictions on retailing during this time). However, it is quite possible that during the peak of the infection waves—for example, in December or January—the effect of fear of infection on shopping behavior was significantly greater. In addition, it can also be assumed that fear of infection played a far greater role in the first few months of the pandemic (especially spring 2020) than in the second year of the pandemic. The results shown here do not allow this conclusion to be ruled out. Furthermore, in winter 2021/2022, i.e., following the purchases recorded here, there were new restrictions for retail stores (e.g., access restrictions for the unvaccinated), which were not able to be considered here.

Fourth, it must be remembered that the modeling approach used here operates entirely at the individual level, i.e., the individual consumer (and where they live) and the individual store. On the one hand, this is an advantage because there are no biases due to spatial aggregations. On the other hand, the modeling approach does not allow any statements about differences in shopping behavior between region types or intra-regional differences (i.e., within the three survey regions). The only difference considered in this study is that between (large) cities and rural areas, whereby this distinction is not made at a spatial level, but at the consumer level. In order to take the spatial scale more into account, either separate models for each region type or a large number of other interaction variables would be necessary, which is not done at this point.