Introduction

Clustering—geographic concentrations of interconnected entities in a particular field (Porter 1998)—has recently gained more traction in the marketing and management literature. Some published papers show the effect of clustering or proximity on outlet sales revenue (Butt et al. 2018; Kalnins 2004; Pancras et al. 2012), nursing home quality (Lu and Wedig 2013), communication (Ganesan et al. 2005), and franchisor performance (Zheng et al. 2020). A close examination of that research stream shows that most of this published research focused on the clustering of same-brand outlets, i.e., intrabrand clustering (e.g., Butt et al. 2018; Kalnins 2004). Further, it provides mixed evidence with both positive and negative performance outcomes of clustering (e.g., Butt et al. 2018; Lu and Wedig 2013; Pancras et al. 2012).

The overwhelming focus on intrabrand clustering ignores the potential impact of other-brand outlets located nearby, i.e., interbrand clustering. It is much more common to find same-brand outlets near other-brand outlets with similar offerings in our daily lives. For example, McDonald’s restaurants will most likely be located next to Burger King, Wendy’s, and KFC restaurants. The fact that outlets belonging to the same brand situate near other brands may significantly affect the outlets’ performance.

Further, firm size—the number of outlets of a brand—may significantly affect outlet performance when clustered. A larger firm size generally means more resources, more significant experience advantage, and higher efficiency that may lead to a better quality of processes, products, and services at lower costs (Bachmann et al. 2021; Kalyanaram et al. 1995; Scherer and Ross 1990; Liu 1995; Wang 2010). An outlet associated with a larger-sized firm may thus perform differently when clustered than the one related to a smaller-sized firm.

Relying on clustering literature in marketing, management, and economics (Butt et al. 2018; Lu and Wedig 2013; Porter 1998) and brand competition literature in branding and brand management (Carpenter and Lehmann 1985; Depeyre et al. 2018; Faulkner et al. 2014; González-Benito et al. 2010), this study aims to assess the effect of intrabrand clustering on outlet quality performance measured in terms of quality violations. Further, it investigates the potential role of interbrand clustering and firm size as moderators that may impact the relationship between intrabrand clustering and outlet quality performance.

This study uses publicly available New York State Department of Health (NYSDOH) data. The NYSDOH records the inspection of every food service establishment located within the New York State boundary. The dataset comprises approximately 21,000 food service establishments operating across New York State in 2019. I supplement these data with population, income, and location information at the zip code-level collected from the US Census Bureau and USDA. Additionally, I use ArcGIS software to compute intrabrand and interbrand clustering variables within a specified radius.

In what follows, I present the conceptual framework and hypotheses sections. This follows by describing my research context, model specification, and results. I conclude with a discussion of the implications of my study for theory and practice, limitations, and some future research avenues.

Literature review

Figure 1 represents my conceptual diagram. My interest focuses on the potential impact of intrabrand clustering on outlet quality performance and how interbrand clustering and firm size might temper the relationship between intrabrand clustering and outlet quality performance.

Fig. 1
figure 1

Conceptual diagram

According to Porter (1998), “clusters promote both competition and cooperation”.

Accordingly, prior work on clustering has emphasized its positive and negative outcomes. It has adopted several theoretical mechanisms to explain its impact, including knowledge sharing, monitoring efficiency, agglomeration effects, brand awareness, and competition. Table 1 provides selected empirical papers related to clustering/proximity, and Fig. 2 shows the intrabrand and interbrand clustering instances individually in Panels 1 and 2 and together in Panel 3.

Table 1 Selected prior empirical studies on clustering/proximity
Fig. 2
figure 2

Intrabrand and interbrand clustering

Prior research shows the advantage of clustering in disseminating knowledge among outlets sharing the same-brand name (Argote and Darr 2000; Bradach 1997; Butt 2017; Butt et al. 2018). Clustering allows the outlets’ operators to observe and meet each other more frequently and cost-effectively and, therefore, share knowledge without spending more time and effort (Argote and Miron-Spektor 2011; Ganesan et al. 2005). This knowledge transfer facilitation due to clustering likely improves outlets’ performance (Butt 2017; Butt et al. 2018; Kalnins and Mayer 2004).

Another theoretical mechanism links clustering of outlets with greater monitoring efficiency. As the proximity of outlets increases, it gets less time-consuming to monitor outlets owing to reduced travel time from one outlet to another. Therefore, the cost of monitoring per outlet within an area goes down significantly (Holmstrom and Milgrom 1991; Lafontaine and Slade 2007; Lu and Wedig 2013). This advanced efficiency to monitor better enables firms to expose violations and noncompliant behavior exhibited by the outlets (Brickley and Dark 1987; Lu and Wedig 2013), thereby increasing the performance.

As well, clustering is shown to create agglomeration effects in terms of heightened demand (Marshall 1920). The closely located outlets of different brands reduce customer search costs and give customers more variety and options, increasing the likelihood of their visits and purchases (Baum and Mezias 1992; Chung and Kalnins 2001). The heightened demand increases the overall market size for clustered outlets, positively affecting their performance.

Further, clustering of same-brand outlets can increase brand awareness and help forge positive brand associations with target customers (Avery et al. 2012). The clustered same-brand outlets are highly noticeable and provide more brand exposure to customers. Therefore, managers increasingly view stores as a form of promotion that has the potential to reach more customers with greater frequency and performs the same function as the brand’s billboard (Avery et al. 2012; Chang 2009).

Lastly, clustering yields some adverse outcomes too. It can harm outlet performance due to the intense competition (Porter 1998). Previous work supports the existence of more significant competition among clustered same-brand outlets (e.g., Butt et al. 2018; Kalnins 2004). The competition among closely located same-brand outlets likely results from the almost identical product and service offerings with little room for differentiation (Pancras et al. 2012), which results in customers viewing them as close substitutes leading to sales cannibalization (Davis 2006; Kalnins 2004).

Competition is also an established topic in branding and brand management literature. Prior research has studied various aspects of brand competition, including the impact of brand name, multiple products, and advertising on intrabrand and interbrand competition (Carpenter and Lehmann 1985), the strength of cannibalization effects versus competitive effects on other brands (González-Benito et al. 2010; Lomax et al. 1997), patterns of brand competition in emerging markets such as China (Faulkner et al. 2014), the effects of brand competition and coopetition strategies on business performance (Depeyre et al. 2018), and brand competition leading to undifferentiation and commoditization (Nandan 2005).

My research contributes to the brand management literature by considering brand competition arising due to geographical proximity or clustering of same- and other-brand outlets. This study acknowledges the possibility of both positive and negative effects of clustering and relies on the competition literature (Chung and Kalnins 2001; González-Benito et al. 2010; Pancras et al. 2012), monitoring efficiency (Brickley and Dark 1987; Lu and Wedig 2013), and agglomeration effects (Chung and Kalnins 2001; Marshall 1920) to describe relationships and state hypotheses.

Hypotheses

The main effect of intrabrand clustering

The proximity of same-brand outlets likely triggers intrabrand competition due to shared markets and customers. The same-brand outlets share similar traits; therefore, they have the potential to compete more intensely (Baum and Mezias 1992; Porter 1998). As closely located outlets of the same brand sell identical products to the same customer segments with little differentiation (Baum and Mezias 1992; González-Benito et al. 2010), customers are shared, and market size per outlet is reduced (Butt 2017; Lomax et al. 1997). Dealing with the increasing prospect of intrabrand competition, same-brand outlets are more likely to free-ride to take advantage of the common brand name (Rubin 1990) and shirk their quality obligations (Bergen et al. 1992; Lu and Wedig 2013). Therefore, I expect a negative association between intrabrand clustering and outlet quality performance. My baseline expectation is as follows:

H 1

The higher the intrabrand clustering, the lower the outlet quality performance.

As well, clustering of same-brand outlets increases monitoring efficiency due to reduced travel time of inspectors (Lu and Wedig 2013). The reduced travel time decreases the cost of monitoring per outlet (Lafontaine and Slade 2007), enabling firms to detect outlet quality violations (Brickley and Dark 1987; Lu and Wedig 2013). This credible threat of monitoring brought about by clustering likely deters outlets from shirking on quality aspects, increasing the quality performance of outlets. Therefore, my alternate expectation is as follows:

H 1Alt

The higher the intrabrand clustering, the greater the outlet quality performance.

The moderating effect of interbrand clustering

A higher intensity of interbrand clustering may significantly affect the relationship between intrabrand clustering and outlet quality performance. Specifically, I argue that interbrand clustering mitigates the adverse effects of intrabrand clustering on outlet quality performance, hence helping the outlets perform better.

A higher intensity of other-brand outlets likely increases the market size owing to agglomeration effectsFootnote 1—the external economies accruing from the collocation of different firms (Chung and Kalnins 2001; Liu et al. 2022; Marshall 1920). The presence of closely located outlets of different brands creates a destination marketplace for customers who want more variety in one place. It helps customers to evaluate the options presented by various brands situated close to each other, increasing the likelihood of their visits and purchases versus if the outlets had been located dispersedly (Chung and Kalnins 2001). This increase in market size and customer patronage may decrease the propensity of the outlet to free-ride and shirk on quality aspects, hence helping the outlet perform better.

In sum, in the presence of both intrabrand and interbrand clustering, there is an inflection point where the outlet decides whether it is better off free-riding to take advantage of a common brand or cleaning up its act. I argue that a higher intensity of interbrand clustering likely mitigates the adverse effects of intrabrand clustering and helps the outlet perform better thanks to the agglomeration effects.

About my alternate baseline hypothesis (H1Alt), I expect that the agglomeration effect generated by interbrand clustering strengthens the positive association between intrabrand clustering and outlet quality performance. The increase in market size and customer patronage discourages outlets from shirking on quality obligations. This effect, combined with monitoring efficiency generated by intrabrand clustering, further bolsters the positive impact of interbrand clustering on outlet quality performance.

H 2

A higher interbrand clustering weakens (strengthens) the negative (positive) association between intrabrand clustering and outlet quality performance.

The moderating effect of firm size

Firm size of clustered outlets may significantly affect the relationship between intrabrand clustering and outlet quality performance. Specifically, a larger firm size could be instrumental in mitigating the harmful effects of intrabrand clustering in at least two ways. First, a larger firm size generally means more resources and a more significant experience advantage that may lead to a better quality of processes, products, and services (Bachmann et al. 2021; Kalyanaram et al. 1995; Scherer and Ross 1990). Second, the larger firm generates greater economies of scale, reducing the cost of doing business and increasing efficiency relative to competition (Kalyanaram et al. 1995; Liu 1995; Wang 2010).

When clustered with same-brand outlets, there is an inflection point where the outlet decides whether it is better off free-ride or perform well. I argue that a larger firm size deters the outlet from indulging in free-riding and shirking on quality aspects due to more resources, experience, and efficiency effects, even in the presence of high intensity of intrabrand clustering. Therefore, I expect that the larger firm size mitigates the adverse effects of intrabrand clustering and helps the clustered outlets perform better.

Alternatively (H1Alt), I expect that the larger firm size of clustered outlets strengthens the positive association between intrabrand clustering and outlet quality performance due to greater resources, experience, and efficiency effects. A high level of intrabrand clustering serves as a credible threat of monitoring and discourages outlets from shirking on quality aspects. A larger firm size strengthens this effect and further prevents outlets from shirking.

H 3

A larger firm size weakens (strengthens) the negative (positive) association between intrabrand clustering and outlet quality performance.

Method

Empirical context and data collection

I investigate the performance consequences of intrabrand clustering in the context of food service establishments located in New York State. Food service establishments are mainly situated close to each other. Various brands compete directly with each other. Information on their precise locations and quality performance are publicly available, making this context suitable for studying the effects of intrabrand and interbrand clustering and firm size.

New York State is the 27th largest state by area in the USA, with a total area of 54,556 square miles. Further, it is ranked fourth in the US concerning population, with more than 20 million people (US Census Bureau). The GDP of New York State is the third-largest in the USA, behind California and Texas (Bureau of Economic Analysis).

New York State Department of Health (NYSDOH) inspects food service establishments to maintain food quality standards and reduce the incidence of foodborne illness. The food service establishments include full-service restaurants, quick-service restaurants, cafes, and snack bars. The mission statement of NYSDOH is “To protect public health by assuring that food service establishments are operated in a manner that eliminates hazards through design and management, resulting in a decreased incidence of foodborne illness in our communities”.Footnote 2

The data provided by NYSDOH are publicly available (https://health.data.ny.gov/). It includes approximately 21,000 food service establishments in New York State in 2019. Temporary food service establishments, such as mobile and vending machines, are excluded from this dataset.

This dataset comprises food service establishments' names and street addresses, inspection dates, inspection type (such as inspection or reinspection), and quality violations found during their inspections. The types of quality violations include food conditions (e.g., food found in rusty cans), food temperature (e.g., food not kept at recommended temperatures), food from unapproved sources, food contamination, plumbing and drainage issues, improper installation and maintenance of equipment, inaccessible handwashing facility, and inadequate lighting and ventilation. The quality violation details and descriptions are collected on inspection reports, including corrective actions for violations. Food service establishments are graded based on the observed number and types of violations.

I use the food service establishments’ street-level addresses to compute intrabrand and interbrand clustering variables using ArcGIS 10.7.1 software. Further, to control market-specific effects, I supplement the NYSDOH data with the population, income, and location type data collected at the most fine-grained level—the zip code-level from US Census Bureau and USDA. Table 2 displays the list of all variables used in this study and their data sources.

Table 2 variables and data sources

Operationalizations

The unit of analysis in this study is the individual food service establishment (outlet) inspected in New York State in 2019. My objective is to relate intrabrand clustering with outlet quality performance and assess the moderating effects of interbrand clustering and firm size.

I measure the dependent variable, outlet quality performance (QPERi), as the total count of quality violations committed by outlet i. Therefore, the lesser the number of violations, the greater the outlet quality performance. The number of quality violations determines inspection scores and restaurant letter grades that are displayed to consumers. Therefore, it is a crucial indicator of an outlet’s quality performance (Ibanez and Toffel 2020).

I measure the intrabrand clustering (INTRACLi) as the ratio of the number of same-brand outlets to the total number of outlets within a 5-mile radius of the outlet i. Similarly, I measure the interbrand clustering (INTERCLi) as the ratio of the number of other-brand outlets to the total number of outlets within a 5-mile radius of outlet i. I compute intrabrand and interbrand clustering measures using the Point Distance module of ArcGIS 10.7.1 software. The Point Distance module determines the distance of a point (location) to all nearby points (locations) within a specified search radius. Here, it calculates the distance of an outlet from proximally located same-brand and other-brand outlets within a 5-mile radius. The Point Distance module provided a table with a considerable amount of data that included the distance in miles of each of 21,000 outlets from all other outlets in my dataset. I counted the number of same-brand, other-brand, and total outlets within a 5-mile radius of each outlet from this table to compute my clustering variables. Considering prior research, I use a 5-mile radius as a suitable distance (e.g., Guler 2018).

I measure firm size (FSij) as the total number of outlets of brand j, the outlet i is affiliated with. Firm size reflects market dominance that measures the strength of a brand or firm in a geographic area relative to competitors (Athey and Schmutzler 2001).

I include several control variables in my model that may affect the outlet quality performance. To control for ownership effects, I include multiunit affiliation (MUAij) and measure it as a dichotomous variable that equals 1 if outlet i is affiliated with a multiunit brand j, otherwise 0 (Jin and Leslie 2009). To control the impact of multiple inspections, I include reinspection (REINSPi) that equals 1 if outlet i is reinspected within the same year, otherwise 0. NYSDOH reinspects a food service establishment considering its previous compliance history and risk category. Food service establishments with complex cooking processes, such as full-service restaurants, are considered high-risk, fast-food restaurants are considered medium-risk, and coffee shops fall under the low-risk category. Therefore, REINSPi variable controls an outlet's risk category and compliance. In my dataset, 1730 outlets were inspected more than once within the same year. I also control for cluster size around the outlet i. I measure cluster size (CLSi) as the total number of outlets (both same-brand and other-brand) within a 5-mile radius of outlet i.

Finally, I control market-specific effects by including the population (POPk) and median family income (INCk) at the zip code-level, the most granular level of available data. To control for the potential effect of outlets located in urban versus rural areas, I also include urban (URBk) variable that equals 1 when a zip code k falls in an urban area, and 0 otherwise. Table 3 displays the sample’s descriptive statistics and correlation matrix.

Table 3 Descriptive statistics and correlation matrix (raw values)

I undertook a multicollinearity diagnostic test to investigate a possible case of linear dependencies in my independent variables. The mean VIF (variance inflation factor) without interaction was 1.25, and with interactions was 2.00, well below the heuristic of 10 (Hair et al. 1995). Therefore, multicollinearity is likely not a concern in my model.

Model specification

I specify my model as follows:

$$ \begin{aligned} {\text{QPER}}_{i} & = \alpha_{0} + \alpha_{1} {\text{INTRACL}}_{i} + \alpha_{2} {\text{INTERCL}}_{i} + \alpha_{3} {\text{FS}}_{ij} \\ & \quad + \alpha_{4} {\text{INTRACL}}_{i} *{\text{INTERCL}}_{i} + \alpha_{5} {\text{INTRACL}}_{i} *{\text{FS}}_{ij} \\ & \quad + \alpha_{6} {\text{MUA}}_{ij} + \alpha_{7} {\text{REINSP}}_{i} + \alpha_{8} {\text{CLS}}_{i} \\ & \quad + \alpha_{9} {\text{POP}}_{k} + \alpha_{10} {\text{INC}}_{k} + \alpha_{11} {\text{URB}}_{k} + u_{i} \\ \end{aligned} $$
(1)

where QPER = Outlet quality performance (quality violations), INTRACL = Intrabrand clustering, INTERCL = Interbrand clustering, FS = Firm size (natural log-transformed), MUA = Multiunit affiliation, REINSP = Reinspection, CLS = Cluster size (natural log-transformed), POP = Population at the zip code-level (natural log-transformed), INC = Median family income at the zip code-level (natural log-transformed), URB = Zip code belonging to an urban area, and u ~ N (μ, σ2).

The hypothesized predictors—intrabrand clustering (INTRACLi), interbrand clustering (INTERCLi), and firm size (FSij)—are potentially endogenous. For example, a brand may strategically locate its outlet near the same-brand and other-brand outlets that creates a self-selection problem. Further, my specification could suffer from potential endogeneity caused by omitted variables that drive the regressors (intrabrand and interbrand clustering and firm size) and directly affect the outcome (outlet quality performance). To test the potential endogeneity of my regressors, I conducted the Durbin–Wu–Hausman test. The results confirm the endogeneity problem.

I account for the endogeneity of intrabrand and interbrand clustering and firm size by relying on the Gaussian Copula method (Park and Gupta 2012). The Gaussian Copula is an instrument-free method for endogeneity correction. It involves constructing a multivariate distribution to capture the correlation between endogenous regressor and structural error. The resulting model likely does not suffer from the endogeneity problem and generates consistent estimates for model parameters (Park and Gupta 2012). Several marketing studies have used the Gaussian Copula approach (e.g., Butt et al. 2021; Datta et al. 2015; Papies et al. 2017; Vomberg et al. 2020).

The Gaussian Copula method's identification assumption is the non-normality of the endogenous regressor (Park and Gupta 2012; Vomberg et al. 2020). I thus use the Shapiro–Wilk test of normality for each of my three endogenous variables. The results reject the null hypothesis that the endogenous regressors are normal (p < 0.01), suggesting that the Gaussian Copula approach is suitable for the endogeneity correction. I specify the final model as:

$$ \begin{aligned} {\text{QPER}}_{i} & = \beta_{0} + \beta_{1} {\text{INTRACL}}_{i} + \beta_{2} {\text{INTERCL}}_{i} + \beta_{3} {\text{FS}}_{ij} \\ & \quad + \beta_{4} {\text{INTRACL}}_{i} *{\text{INTERCL}}_{i} + \beta_{5} {\text{INTRACL}}_{i} *{\text{FS}}_{ij} \\ & \quad + \beta_{6} {\text{MUA}}_{ij} + \beta_{7} {\text{REINSP}}_{i} + \beta_{8} {\text{CLS}}_{i} \\ & \quad + \beta_{9} {\text{POP}}_{k} + \beta_{10} {\text{INC}}_{k} + \beta_{11} {\text{URB}}_{k} \\ & \quad + \delta_{1} \left( {{\text{INTRACL}}_{i} } \right)* + \delta_{2} \left( {{\text{INTERCL}}_{i} } \right)* + \delta_{3} \left( {{\text{FS}}_{ij} } \right)* + e_{i} \\ \end{aligned} $$
(2)

All terms are described previously, and e ~ N (μ, σ2). The parameters δ1–δ3 represent the Copula terms (estimated from the Gaussian Copula method) for the endogenous variables.

The dependent variable in this study, the count of quality violations, has excess zeros when no violations were observed during inspections (35% of the sample). It also has a greater level of variance (10.79) than the mean (2.38). I account for these excess zeros and overdispersion issues using a zero-inflated negative binomial (zinb) regression in Stata. Zero-inflated negative binomial regression models count variables with excess zeros, usually for overdispersed count outcome variables (Long and Freese 2003). Several marketing studies relied on the zero-inflated negative binomial regression (e.gDhaoui and Webster 2021; Dotzel et al. 2013; Li and Xie 2020; Rubera and Kirca 2017; Sorescu and Spanjol 2008).

Results

Descriptive statistics

Table 3 provides summary statistics for my sample. The average number of quality violations committed by an outlet is 2.38 (SD 3.28). The average count of same-brand outlets within a 5-mile radius of an outlet is 1.48 (intrabrand clustering 0.02, SD 0.08), which is considerably less than the average count of other-brand outlets, 68.64 (interbrand clustering 0.46, SD 0.49). These numbers show that interbrand clustering is more prevalent than intrabrand clustering in my data. The average firm size is 20.45 outlets (SD 69.70) (raw value).

On average, the probability of an outlet being associated with a multiunit brand is 0.38 (SD 0.49). The average possibility for an outlet to be reinspected is 0.09 (SD 0.28). On average, the cluster size within a 5-mile radius is approximately 69 outlets (raw value), the population per zip code is 19,200 (raw value), the median family income per zip code is USD 85,307, and the probability for a zip code to fall in an urban area is 0.86.

Regression estimates

Table 4 displays the results of the zero-inflated negative binomial regression. The full results are provided in Model 3 with the interaction terms and the endogeneity correction. The results support my baseline expectation for intrabrand clustering’s negative impact on outlet quality performance. I find that a greater level of intrabrand clustering increases an outlet’s quality violations (Coeff. = 5.91, p < 0.01), hence decreasing its quality performance. Therefore, I find strong support for hypothesis H1. The alternate hypothesis H1Alt is not supported.

Table 4 Zero-inflated negative binomial regression estimates

Hypothesis H2 predicted a weakening effect of interbrand clustering on the association between intrabrand clustering and outlet quality performance. I find support for H2; a higher intensity of interbrand clustering weakens the positive association between intrabrand clustering and quality violations (Coeff. = − 2.62, p < 0.05). It shows that the presence of other-brand outlets mitigates the negative impact of intrabrand clustering on outlet quality performance.

Hypothesis H3 predicted a weakening effect of firm size on the association between intrabrand clustering and outlet quality performance. Similar to H2, my results provide robust support for H3. A larger firm size weakens the positive association between intrabrand clustering and quality violations (Coeff. = − 0.83, p < 0.01). It shows that an outlet’s association with a larger-sized firm mitigates the negative consequences of intrabrand clustering. I will discuss these results in detail in the simple slopes analysis and discussion sections.

For control variables, I find that reinspection of an outlet (Coeff. = − 0.70, p < 0.01), cluster size (Coeff. = − 0.09, p < 0.01), and urban location (Coeff. = − 0.34, p < 0.01) decrease quality violations, therefore increasing outlet quality performance. Population (Coeff. = 0.03, p < 0.01) and income (Coeff. = 0.36, p < 0.01) increase quality violations, hence decreasing outlet quality performance. A mere affiliation with the multiunit brand does not significantly affect outlet quality performance (Coeff. = 0.07, ns).

Post hoc analysis of significant interactions (simple slopes)

I analyzed simple slopes for significant interactions to better understand the moderating effects of interbrand clustering and firm size (Aiken and West 1991). Table 5 and Fig. 3 display the simple slope analysis of two-way interactions.

Table 5 Simple slopes analysis
Fig. 3
figure 3

Simple slopes analysis

I find that a lower intensity of interbrand clustering strengthens the positive association between intrabrand clustering and quality violations (the simple slope of intrabrand clustering at low levels of interbrand clustering = 5.91, p < 0.01). However, the impact of intrabrand clustering on outlet quality performance weakens but stays positive and significant at higher levels of interbrand clustering (the simple slope of intrabrand clustering at high levels of interbrand clustering = 3.29, p < 0.05). This shows that interbrand clustering mitigates the negative outcome of intrabrand clustering and helps outlets perform better.

For firm size, I find that a higher intensity of intrabrand clustering increases quality violations at lower levels of firm size (the simple slope of intrabrand clustering at low levels of firm size = 5.91, p < 0.01). However, a higher level of firm size makes the relationship between intrabrand clustering and quality violations non-significant (the simple slope of intrabrand clustering at high levels of firm size = 0.91, ns). This result shows that the firm size has a more significant mitigating effect than interbrand clustering.

Supplementary analysis

I further test the impact of my main predictors on the alternate dependent variable—customer satisfaction (CSi) measured as the customer ratings of food service establishments listed on the online review platform, Yelp. On Yelp, customers can demonstrate their satisfaction by rating food service establishments on a 1–5-star scale (5 refers to highly satisfied). Over time, Yelp has become a handy and efficient tool for customers to find good food service establishments. When a food service establishment secures a higher rating on Yelp, it will likely receive more customer traffic (Dai et al. 2018).

I used web scraping in Python to extract the rating of every food service establishment located in New York State and listed on the Yelp website in 2019. The web scraping of the Yelp website provided ratings of 12,132 food service establishments (out of 20,226 food establishments in my data set). Other restaurants either had no rating or were not listed on Yelp. To account for potential bias in regression estimates due to rating data not missing at random, I used Heckman’s (1976) selection model in the first stage of the analysis that computed the selection bias parameter—Inverse Mills Ratio, which I included in the second stage of regression to control the selection bias. Similar to my main study, I corrected for the endogeneity of regressors using the Gaussian Copula approach (Park and Gupta 2012).

Due to the ordinal nature of the customer satisfaction measure, I relied on the ordered probit regression model for estimation. Table 6 displays the regression estimates. Despite a reduced sample size, my hypothesized results remain pretty robust. The results show that intrabrand clustering reduces customer satisfaction (Coeff. = − 9.49, p < 0.01). Firm size mitigates the negative impact of intrabrand clustering on customer satisfaction (Coeff. = 1.07, p < 0.01). These results are in line with my empirical findings in the main study. Although I find the impact of interbrand clustering on the association between intrabrand clustering and customer satisfaction as non-significant (Coeff. = 0.34, ns), it still supports my assertion that firm size plays a more substantial role than interbrand clustering in mitigating the negative effect of intrabrand clustering.

Table 6 Supplementary analysis: customer satisfaction as an alternate DV ordered probit regression estimates

Alternate specifications

I further assess the results' stability to alternate operationalization of clustering, the extent of clustering, alternate estimator, and alternate sample approaches.

Alternate operationalization of clustering variables

In this study, I use a 5-mile radius to measure intrabrand and interbrand clustering variables. To check the robustness of my findings, I recompute clustering variables using 3-mile and 7-mile radii instead. All my substantive results remain robust to this alternate operationalization of clustering variables. Table 7 displays the regression estimates with endogeneity correction.

Table 7 Robustness check with 3-mile and 7-mile radii of clustering zero-inflated negative binomial regression estimates

Alternate model to exclude nonlinearity effects of clustering

To rule out a possible effect of the extent of clustering and to exclude the nonlinearity, I run an alternate model by including a quadratic term of intrabrand clustering. All my results stay robust to this alternate specification.

Alternate estimator

I use the standard Poisson regression as an alternate estimator to test my results' robustness. Poisson regression is used to model count dependent variables, such as the number of quality violations in this study. All my results stay robust to this alternate estimator.

Alternate sample

I check the robustness of my results to the presence of outliers. To reduce potential outliers ' impact, I winsorize all my continuous predictor variables at 1% and 99% levels. All my results remain robust to this sampling variation.

Discussion

This research contributes to marketing theory and practice by extending the clustering and brand competition literature. Prior studies provided mixed results for the effect of clustering. While some studies demonstrated evidence of positive performance outcomes due to knowledge sharing (e.g., Butt et al. 2018) and monitoring efficiency (e.g., Lu and Wedig 2013), others showed its negative consequences owing to intense competition (e.g., Pancras et al. 2012). In brand management literature, research on brand competition provided empirical evidence mostly favoring its negative consequences (e.g., González-Benito et al. 2010).

This study shows that the relationship between clustering of same-brand outlets (intrabrand clustering) and outlet performance is not straightforward. It is contingent on what other-brand outlets are located nearby (interbrand clustering) and the firm size. My first hypothesis predicted both negative and positive effects of intrabrand clustering on the outlet’s quality performance. I find support for the adverse impact owing to the intense brand competition brought about by clustering. My second hypothesis predicted the positive effect of a critical moderator—interbrand clustering. It is common to find outlets of competing brands located next to each other. The existence of competing brands close to same-brand outlets might have significant performance consequences for the same-brand outlets. My results show that the proximity of other-brand outlets is beneficial for same-brand outlets as it increases market size due to the agglomeration effect. Finally, my third hypothesis predicted the positive effect of firm size as a moderator. I find strong support for my third hypothesis. Firm size of clustered outlets plays a crucial role in mitigating the negative effect of intrabrand clustering due to experience and economies of scale advantages associated with larger firm size. My results show a more robust mitigating impact of firm size than interbrand clustering.

Overall, my work contributes to the growing work on clustering and its consequences. My research contributes to the brand management literature by considering brand competition in clustering situations. Extant research on brand competition mainly focused on grocery brands placed next to each other in grocery and convenient stores (Faulkner et al. 2014; González-Benito et al. 2010) and competition and cooperation in luxury markets (Depeyre et al. 2018). That body of research used store-level scanner data and consumer surveys to provide critical insights into brand competition. However, the competition triggered by distance or geography is an under-researched topic in brand management literature. My research advances brand competition literature by focusing on geographical proximity or clustering of same- and other-brand outlets. Relying on the context of food service establishments located in New York State, this study provides novel insights to marketing and branding scholars and practitioners alike.

Managerial implications

The results of this study have some critical implications for marketing and brand managers who face such questions: Is it recommended to cluster same-brand outlets? Is it recommended to locate close to other-brand outlets? My study answers these crucial questions in the context of food service establishments located in New York State.

A crucial managerial implication of the results is that opening a food service establishment closer to the existing same-brand ones may have adverse performance outcomes. However, these negative consequences of intrabrand clustering can be mitigated by situating food service establishments near competing brands. Therefore, interbrand clustering helps same-brand food service establishments improve their quality performance.

The second important managerial insight relates to the firm size, which plays a more significant role in mitigating the negative effect of intrabrand clustering. The same-brand food service establishments associated with a larger-sized firm are less likely to be affected by intrabrand clustering. In fact, I find that a larger firm size is more instrumental in mitigating the adverse effects of intrabrand clustering. Therefore, marketing and brand managers of larger-sized firms may cluster their food service establishments without worrying about brand competition.

The results of some control variables have important managerial implications. I find that reinspecting a food service establishment decreases quality violations, which shows that more frequent inspections are better. However, there is a tradeoff. The regulators like NYSDOH are responsible for maintaining food and service quality and reducing the incidence of foodborne illness by inspecting food service establishments. They must perform their duties within budgetary limitations. As more frequent inspections mean more costs, NYSDOH may need to identify the food service establishments requiring more frequent inspections. For example, high-risk food service establishments are inspected more frequently; food service establishments with poor compliance history are expected to get more frequent inspections. Although costly, my results demonstrate the positive outcome of reinspections. Similarly, I find a positive effect of locating food service establishments in urban areas than rural areas. Food service establishments in urban areas commit fewer quality violations than rural ones.

I have also conducted a supplementary analysis using an alternate dependent variable—customer satisfaction—operationalized as the customer ratings of food service establishments listed on Yelp. In line with my primary study, I find that intrabrand clustering has adverse effects—it decreases customer satisfaction, and firm size plays a substantial role in mitigating the negative impact of intrabrand clustering on customer satisfaction.

In sum, the results of this study provide vital insights to marketing and brand managers who want to locate and manage their outlets closer to their same-brand and other-brand outlets.

Conclusion, limitations, and future research directions

This study assesses the performance consequences of intrabrand clustering and its context-dependency on two critical factors—interbrand clustering and firm size. My examination of approximately 21,000 food service establishments in New York State in 2019 finds that intrabrand clustering decreases outlet quality performance. Interbrand clustering and firm size moderate this relationship and help to mitigate the negative effect of intrabrand clustering on outlet quality performance. Further, I find that firm size has a more substantial mitigating effect than interbrand clustering.

Like any research, this study is subject to some limitations. First, this research uses the New York State data available for 2019. I believe that my research findings would be generalizable. However, future research that uses data across various markets observed over multiple years would confirm my expectations. The second limitation pertains to the lack of data regarding the specific identity of the outlet concerning ownership, i.e., company-owned or franchised. The NYSDOH database does not provide this information. Future work that combines the outlet ownership data with the NYSDOH dataset and investigates the consequences of clustering considering some critical context dependencies would be welcome.

Moreover, this study provides rich insights into the actual behavior due to an archival dataset. However, the reliance on archival data cannot measure the underlying mechanisms of free-riding or shirking. Future work integrating archival and experimental data, although challenging to undertake, would significantly contribute to this topic.

A promising avenue to expand this research is to study the impact of digitalization in clustering and branding contexts. The trend of online shopping has grown immensely, along with the delivery and drive-throughs during the COVID-19 pandemic. Many retailers have reduced their physical presence through closures. Many now use their physical outlets more as fulfillment centers than retail shops, also called ‘dark stores’. How this digital, delivery, and drive-through trend will change clustering, brand competition, and their impact on outlet performance represents a potentially fruitful avenue for future research.

Another important area for future research is to investigate the financial impact of clustering. Specifically, a future study that assesses the effect of clustering on the financial value of brands—for example, on sales, profit, and stock price—and how some key factors moderate this relationship would generate valuable insights for brand managers and other stakeholders.