Introduction

Over the last few years, the consumption of organic food has experienced a rapid and steady growth. In France, its value rose from € 5.9 billion in 2015 to € 9.7 billion—i.e., 5% of food expenditures—in 2018 (Agence Bio 2019b). In a recent French survey, about one in eight respondents declares eating at least one organic product a day (Agence Bio 2019a). Other quality labels—such as Protected Designation of Origin (PDO), Protected Geographical Indication (PGI), or label rouge—also represent a substantial and increasing share in food consumption. For some products, these labels dominate organic ones as a quality sign. This is notably the case for cheese, for which geographical indications represent 11% of the French market, a much higher share than that of organic cheese (2%).

The rising demand for quality-food products—defined here as products certified as organic, PDO, PGI, or label rouge—impacts farming practices, land use, and value chains. As an illustration, the area devoted to organic agriculture in France has increased by 48.3% from 2015 to 2018, reaching 7.5% of the total agricultural area. As for other quality signs, between 2015 and 2019, the volume of meat and dairy products, fish, and eggs produced under these labels has increased from 602 to 665 thousand tons, reaching 1.5% to 1.8% of market share in these categories (INAO 2019).

Providing consumers with healthier and higher-quality food while leveraging the potential health and environmental benefits from less intensive agricultural practices without affecting too negatively farmers’ income has attracted renewed interest from policy makers. The development of the production and consumption of quality-food products (organic food, but also other quality signs) has become a central objective in agricultural and food policies. EGalim, a French law on food and agriculture passed in 2018, provides a good illustration of this trend in public policies.

An adequate understanding of the demand-side determinants of quality-food consumption is thus critical to assess the potential impacts of such policies. In particular, it is important to determine whether the demand for these products comes primarily from a large base of occasional consumers or a small base of “regular” ones. The recent increase in quality-food consumption is consistent with both a larger consumer base and an increasing share of quality-food in individual purchases. Which explanation dominates remains however an open question. This raises further questions regarding consumers’ behavior with respect to quality-food products. Do consumers tend to routinely and systematically purchase a product under the same label? If so, is this behavior restricted to some products or consistent across the food basket? The answers to these questions may have strong implications for the design and targeting of the policies aiming to develop quality-food consumption.

In this article, we define a “regular” consumer—as opposed to an “occasional” one—as a consumer who predominantly buys a given product with a consistent quality. Regularity in quality-food consumption is thus related to how consumers perceive and value quality labels. In this regard, previous research has shown that credence attributes—i.e., attributes that the consumer cannot identify through search nor experience—play a major role in the decision to buy quality-food products (Massey et al. 2018; Padilla Bravo et al. 2013; Rana and Paul 2017). If the demand for quality-food products is primarily driven by credence attributes, one may expect that regularity should be the rule rather than the exception.

Yet, this intuition is at odds with recent evidence. In a study of organic food purchasing behaviors in France, Boizot-Szantai et al. (2017) find that organic food represents only 8.1% of total food expenditures of the consumers whose organic budget share is above the top quintile. This suggests that, even for the consumers who value organic food the most, a large share of their purchases goes to non-organic products. That is not to say, however, that regularity is necessarily absent from consumers’ behavior. A consumer may well always choose to buy the organic version of some products and systematically prefer the conventional version (or another quality label) of other products. The aggregation of all food expenditures at the food-basket level therefore may thus mask regularity patterns that occur at the product level.

Our main objective is to identify and document regularity patterns in food purchasing behaviors at various levels of aggregation and for various quality labels, and to study the interplay with other drivers of quality-food consumption such as price, availability, and socio-demographic characteristics of quality-food consumers.

Our study thus contributes to the large body of literature—in economics, but also in sociology and marketing science—that has examined consumers’ behavior and attitude towards quality-food products. The vast majority of this literature is focused on organic consumption, which covers a wider range of food products than other quality signs. This literature has investigated the effects of price, income, and socio-demographic variables on organic food consumption, as well as the role of the relative preferences for product objective characteristics (taste, color, etc.) and considerations regarding health, the environment, ethics, or animal welfare.

Quantitative studies in this literature are based on two broad types of approaches. First, a number of studies use surveys or controlled experiments. These studies rely on stated preferences to elicit consumers’ preferences or willingness to pay for various goods with different characteristics or attributes. The experimental design makes it possible to disentangle potential confounding factors. However, they often rely on small sample sizes and/or may be subject to sampling biases. Furthermore, stated intentions to buy quality food may overestimate actual purchases (Sun and Morwitz 2010). Studies in the second category are based on observational data. Those relying on scanner data are particularly relevant for the present study. By using detailed records of actual and repeated purchases by a large and representative sample of consumers, such studies allow to overcome some of the limitations faced by survey-based and experimental approaches. Observed purchases do not, however, directly inform on consumers’ preferences. In addition, such data mostly pertain to purchases for food-at-home consumption at the household level, which may differ from actual consumption at the individual level.

This literature has shed interesting light on the socio-demographic profile of quality-food consumers, as well as on the influence of product attributes on consumption behavior. For example, findings suggest that organic food consumers tend to be more educated and have higher income than conventional food consumers. They are more likely to be female and to have children, especially newborns (Gracia and de Magistris 2008; Hughner et al. 2007). They also spend more on food and have a healthier diet (Boizot-Szantai et al. 2017). Several studies underline the key importance of credence attributes in attitudes towards organic food. Consumers tend to place more value on credence attributes of organic products than on their search and experience attributes (Massey et al. 2018; Rana and Paul 2017). The influence of credence attributes on the probability of purchasing organic products has been found to be larger than that of education, marital status, and income (Padilla Bravo et al. 2013). The review by Rödiger and Hamm (2015) shows inconclusive evidence with respect to the effect of price on organic consumption. Results from observational studies indicate that the effect of economic determinants of organic consumption, most notably price and availability, varies across products and types of retail stores (Buder et al. 2014; Dimitri and Dettmann 2012; Padel and Foster 2005).

Regularity in organic food consumption has been much less studied than the influence of the socio-demographics variables, price, income, or product attributes. Some survey-based studies have examined the factors influencing the probability that a consumer defines himself or herself as a regular organic consumer (Barrena and Sánchez 2010; Baudry et al. 2019; Kesse-Guyot et al. 2013; Oates et al. 2012; Onyango et al. 2007; Pearson et al. 2013; Treu et al. 2017). However, as these studies are based on stated consumption averaged at the whole food basket level, it is difficult to draw robust conclusions on actual purchases at finer aggregation levels. Observational studies based on scanner data (Boizot-Szantai et al. 2017; Lacour et al. 2018) provide some interesting findings about purchasing patterns, but, the focus in these studies is on the share of organic food in total food expenditures. Considering all food products together may mask regularity patterns at a more disaggregated level.

Although expectedly important, the availability of quality-food products has also received less attention than other potential determinants (Dimitri and Dettmann 2012). One reason lies in the difficulty to accurately quantify availability. As a consequence, availability has been overlooked in most survey-based and observational studies of organic consumption.

In this paper, we use a large scanner dataset of food purchases in France, provided by Kantar®, which provides detailed records (quantities, price, type of retail store, etc.) of purchases of 237 food products by an unbalanced panel of 12,453 French households between 2011 and 2016, along with socio-demographic information about households (age, income, département of residence, household size, etc.). This dataset enables to decompose household food purchases at a fine level of disaggregation in terms of products. Importantly, it informs about the labels (organic, PDO, PGI, label rouge), if any, attached to each purchased product.

In order to identify regularity, we use two complementary approaches. For a given product or category of products, we first examine whether the distribution of purchases under a given label is compatible with a large base of occasional consumers or a small base of quality regulars. In a second step, we then investigate the influence of various determinants of the probability that a consumer is regular for a given product and label. The variables accounted for in this analysis include various characteristics of products (price, retailer’s brand, availability), retail stores, and households (socio-demographic variables, income, city size, etc.).

We take advantage of the richness of this information to address four main questions. First, can the consumption of a given (set of) product(s) be characterized as “regular”? In other words, do consumers purchase mostly one version of the product (conventional, organic, PDO, PGI, or label rouge) or do they buy a mix of various versions? Second, is regularity consistent across the food basket? Put differently, are quality regulars for a given product also regular for most of all other products they purchase? Third, how much of the total demand for quality-food products comes from regular consumers? Fourth, how do product attributes and household’s characteristics interplay in the probability that a given household is a quality regular for a given food product?

Our contribution is threefold. First, we address these questions for a wide range of products sold under various labels. Contrary to most of the literature, which is predominantly focused on organic products, we extend the scope of the analysis to other quality labels such as PDO, PGI, or label rouge. Products under these labels have several important features in common with organic products. In particular, their production is often based on extensive farming practices with a potential to improve the sustainability of food systems (Arfini and Bellassen 2019) and credence attributes play a major role in consumers’ decision to buy these products. They do however differ from organic products in a number of dimensions, such as the range of products covered by the label, availability, price, perception of environmental and health benefits, organoleptic characteristics, and aspect. We investigate whether different labels can be associated with different purchasing behaviors in terms of regularity. Besides, previous studies were often focused on only one or a few products. The large number of products (237) examined in this article allows us to further study how consumers’ behaviors vary with the type of products (Buder et al. 2014; Chekima et al. 2017; Padel and Foster 2005).

Second, we identify regularity patterns in quality-food consumption, but not for all products nor all labels. Such findings would not have been possible with the level of aggregation and coverage used in previous studies (Boizot-Szantai et al. 2017; Lacour et al. 2018).

Third, we propose a new indicator of availability of quality-food products. The main difficulty is that, availability of products that are not bought by consumers in the sample cannot be directly observed. We circumvent this difficulty by constructing a novel indicator based on web-scrapping techniques applied to online catalogues of retail stores. This indicator allows us to determine the share of retail stores where any given product can be found within a given region in the total number of stores visited by a given consumer. This indicator is found to be strongly and positively correlated with the probability that a consumer is an organic regular.

In sum, after detailing our methodology, we present the results organized in two axes, the first one on the identification of quality-food regular consumption and the second on its determinants, then discuss them and conclude.

Methodology

Theoretical framework

Consumers often have the choice between several versions of the same food product, e.g., between conventional, organic, PDO, PGI, or label rouge. Although different versions of the same product share some common characteristics, they may also differ along various attributes. Some of these attributes are specific to the product itself, whereas others are attached to perception of the label by the consumer. Some can be observable through search and experience. Others remain unobservable and are therefore akin to credence attributes.

Previous research has shown that credence attributes are major drivers of organic and other quality-food purchases (Gracia and de Magistris 2008; Barrena and Sánchez 2010; Massey et al. 2018; Rana and Paul 2017). Most of the attributes that define a product—regardless of the presence of a label—can be identified by search and/or experience. In contrast, most of the distinctive attributes of quality-food items are credence attributes. Note that some of the credence attributes associated with a given label may be shared by all products under the same label. It is thus important to disentangle consumers’ preferences towards (i) food products regardless of their quality label and (ii) credence attributes associated with quality labels (or absence thereof) across all products. Note also that preferences for a quality label may vary from one product to the other.

It is useful to illustrate the consumer’s problem by a simple theoretical framework inspired by Lancaster’s approach to consumer demand (Lancaster 1966) and akin to the individual utility model of the BLP model (Berry et al. 1995). This simple model accounts for search and experience as well as credence attributes. Consider the choice faced by consumer i over a set J of food products (indexed by j). Food products are available in various qualities indexed by l in L, including a conventional version (indexed by l = c) and different quality labels. Assume that the consumer’s preferences can be represented by a utility function Ui(qi0; {uijl(qijl)}j ∈ J, l ∈ L), where qi0 is the quantity of a composite non-food item, qijl is the quantity of jth product of quality l, and uijl(.) is the respective subutility function. For simplicity, assume that uijl(qijl) takes the following simple Cobb-Douglas form:

$$ {u}_{ij l}\left({q}_{ij l}\right)={a}_{ij l}\ln \left({\mathrm{q}}_{ij l}\right)={a}_{ij}^{se}\ln \left({\mathrm{q}}_{ij l}\right)+{a}_{il}^{cr}\ln \left({\mathrm{q}}_{ij l}\right)+{a}_{ij l}^{se\times cr}\ln \left({\mathrm{q}}_{i\mathrm{j}l}\right) $$
(1)

The formulation in Eq. (1) characterizes the consumer’s utility with respect to product j of quality l. It is parameterized by aijl > 0, which can be decomposed into three terms capturing the utility derived from search and experience attributes (se), credence attributes (cr), and an interaction term (se × cr). The parameter \( {a}_{ij}^{se} \) parametrizes the utility for the jth product regardless of its quality. It captures how much the consumer values product-specific attributes that can be apprehended by search and experience (e.g., in the case of an apple, characteristics such as the fact that it is a rather small, round, acid, and sweet fruit). \( {a}_{ik}^{cr} \) parameterizes the utility for quality l across all products. This parameter is thus label-specific and captures how consumer i values the credence attributes attached to label l (e.g., in the case of organic products, the implications that farming practices do not rely on pesticides nor synthetic fertilizers). The third term \( {a}_{ijl}^{se\times cr} \) captures the interaction between the search and experience attributes of product j with the credence attributes associated with quality l. This reflects that the credence attributes attached to a specific label may vary from one product to the other (e.g., the consumer may value more organically produced apples than organically produced eggs). Further assume that the price of the conventional version of the product (l = c) is lower than the price of all other quality versions of the product. For simplicity, we normalize the parameters in Eq. (1) so that \( {a}_{ic}^{cr}={a}_{ijc}^{se\times cr}=0 \) for all j.

The quantities qijl that maximize the consumer’s utility subject to the budget constraint depend on prices and income, as well as on the elasticities of substitution between products and qualities implied by the form of Ui(.). One expects that the demand in product j of quality l ≠ c to be close to 0 if \( {a}_{il}^{cr} \) and \( {a}_{ijl}^{se\times cr} \) are both close to 0—i.e., if the credence attributes attached to the label l are perceived as unimportant by the consumer relative to the search and experience attributes of the product. If this condition holds for all l ≠ c, the product is predominantly purchased in its conventional version and the consumer is said to be a conventional regular. Conversely, if \( {a}_{il}^{cr}+{a}_{ijl}^{se\times cr} \) is sufficiently large relative to \( {a}_{ij}^{se} \), the consumer is a quality l regular for product j.

Moreover, if \( {a}_{il}^{cr} \)is sufficiently large relatively to \( {a}_{ijl}^{se\times cr} \) for all j—that is, if the consumer mostly values the credence attributes associated with the quality label l—and Ui(.) is such that the consumer’s preferences exhibit some taste for product variety, one should expect the quantities of purchased food items of quality l to be positively correlated across products. If, to the contrary, the consumer mostly values the credence attributes when they are associated with a specific subset of search and experience attributes—corresponding to \( {a}_{ijl}^{se\times cr} \) being much larger than \( {a}_{il}^{cr} \) for a few values of j, one should expect little correlation between the purchased quantities of products of quality l.

Distribution of quality-food consumption

We first investigate regularity by analyzing the distribution of the quality consumption rate of a given product, i.e., the ratio of purchases of the quality version of the product over the total number of purchases during the time span the household remains observed in our dataset. Individuals who almost never buy the quality version of the product are characterized by a quality consumption rate close to 0. They are said to be “conventional regulars”; those who mostly purchase the quality version (i.e., quality consumption rate close to 1) are said to be “quality regulars.”

The first part of our investigation is organized around the three following hypotheses:

  • For a given product, consumers are regulars: they either purchase mostly its conventional version or its quality version, but not a mix of both (H1);

  • Quality regular consumption is consistent across the food basket: a consumer who is a quality regular for a product is a quality regular for an important part of his food basket (H2);

  • The total turnover of quality products purchased by regulars is substantial (H3).

If H1 holds true, the distribution of the quality consumption rate among the sample should be bimodal, with a mode around 0 (conventional regulars) and another around 1 (quality regulars). Figure 1 illustrates such a case. The dashed lines represent the two thresholds under and above which a consumer is considered as a conventional or organic regular, respectively.

Fig. 1
figure 1

Bimodal distribution and classification of quality-food consumers

Following Silverman (1981), it is possible to test if the density of a distribution has more than k modes using kernel density estimation. Indeed, the kernel density estimate for window width h based on the observations X1,...,Xn is defined by

$$ f\left(t,h\right)={n}^{-1}{h}^{-1}\sum \limits_{i=1}^nK\left\{{h}^{-1}\left(t-{X}_i\right)\right\} $$
(2)

where K is the normal density function in our case. The window width h is a parameter controlling to which extent the observations are smoothed in order to obtain the kernel estimate and t is the endogenously estimated mean of the kernel. If the data is multimodal, a large value of h will be needed to obtain a unimodal kernel estimate.

The test relies on the comparison between the estimated h for a unimodal distribution and hcrit, the k-critical window width, defined as:

$$ {h}_{\mathrm{crit}}=\min \left\{h,f\left(.,h\right)\ \mathrm{has}\ \mathrm{at}\ \mathrm{most}\ k\ \mathrm{modes}\right\}. $$

The lowest hcrit gives the most likely number of modes for the distribution. Using the Silverman test, we can thus test whether the distribution of quality-food consumption is bimodal for each product and product family.

To test H2, we define three types of behavior towards quality and organic food, illustrated in Fig. 1:

  • conventional regulars, who are almost never buying the quality version of a product (0–20% of quality purchases for a given product);

  • quality-food regulars, who are almost always buying the quality version (more than 80% of quality purchases for a given product);

  • and occasional quality-food consumers (20–80% of quality purchases for a given product).

The 20% and 80% thresholds are partly arbitrary. However, when a beta distribution is fitted to the organic consumption rate of products with a bimodal distribution, around 90% of the values are either above or below these thresholds (see supplementary materials (SM) 1 for the detailed procedure). Through this typology, we also assess the duplication of quality consumption behavior across different products (H2): if I am an organic regular for eggs, am I likely to be an organic regular for milk? (see Monier et al. (2009) for an example of this question in a two-products case).

Determinants of the probability to be a regular organic consumer

In a second step, we assess which product attributes or household characteristics influence whether a quality-food consumption (ith consumer × jth product) is regular. Because no regular behavior is identified for geographical indications and label rouge products (see “Consumption behavior of geographical indications and label rouge products is occasional″ section), this second step is restricted to organic products. We use the random utility discrete choice framework (McFadden 2001) as an application of the theoretical model. In this model, the utility function is assumed to be well-behaved (preferences are complete, reflexive, and transitive) and known by the consumer. Thus, the consumer can compare the organic and conventional alternatives of a product and rank them in order to purchase the product that maximizes his utility. However, some parts of the utility function, essentially the preference for the credence attributes per se, cannot be separated from other elements in scanner data. This is why the random utility model is used here to identify which product attributes or household characteristics influence quality-food regularity rather than a structural estimate of the coefficients in Eq. (1) of our theoretical framework. The random utility model describes the utility (Uijo) of the ith consumer purchasing the jth product in its organic version as the sum of the observed attributes (Vij) and a random component (εio):

$$ {U}_{ijo}={V}_{ijo}+{\varepsilon}_{ijo} $$
(3)

Similarly, the utility (Uijc) of the conventional product j’s choice is described as:

$$ {U}_{ijc}={V}_{ijc}+{\varepsilon}_{ijc} $$
(4)

The ith consumer will be an organic regular for the jth product at time t if Uijo > Uijc and the probability that this consumer will be an organic regular can be written as:

$$ P\left({Y}_{ijt}=1\right)=P\left({U}_{ijt o}>{U}_{ijt c}\right)=P\left({\varepsilon}_{ijt c}-{\varepsilon}_{ijt o}<{V}_{ijt o}-{V}_{ijt c}\right)=P\left(\varepsilon <{V}_{ijt o}-{V}_{ijt c}\right) $$
(5)

where Yijt is a binary choice variable:

$$ {Y}_{ijt}=\left\{\begin{array}{c}1\ \mathrm{if}\ \mathrm{the}\ i\mathrm{th}\ \mathrm{consumer}\ \mathrm{is}\ \mathrm{an}\ \mathrm{organic}\ \mathrm{regular}\ \mathrm{for}\ \mathrm{the}\ j\mathrm{th}\ \mathrm{product}\ \mathrm{at}\ \mathrm{time}\ t\\ {}0\ \mathrm{otherwise}\end{array}\right. $$

Defining f(εi) as the density function of εi, (3) becomes:

$$ P\left({Y}_{ijt}=1\right)={\int}_{\varepsilon }{Z}_{ijt}\left({\varepsilon}_{ijt}<{V}_{ijt o}-{V}_{ijt c}\right)f\left({\varepsilon}_{ijt}\right)d{\varepsilon}_{ijt} $$
(6)

where Zijt is a binary variable indicating if the term inside the parenthesis is true (Zijt = 1) or false (Zijt = 0), i.e., if the utility derived from the regular organic choice exceeds the one from the conventional choice. Furthermore, Zijt can be empirically described as depending on the ith consumer’s characteristics and the jth product attributes at time t. Because most households remain in the panel for several years—on average 5 years—a random household effect is introduced to correct for the within-household correlation in the error terms:

$$ {Z}_{ijt}=\beta {X}_{ijt}+{u}_i+{\varepsilon}_{ijt} $$
(7)

where Xijt = (xijt1, …, xijtk) is a matrix of variables explaining the choice of being a regular organic consumer, i.e., they represent the jth product attributes (price ratio between the organic and the conventional version, a binomial variable indicating if the product is processed or not, the type of shop where the product is bought…) or the ith consumer’s characteristics (age of children, income, size of the city in which they live, their socio-professional category…), β = (β0, β1, …, βk) is a vector of parameters to be estimated, εijt captures the idiosyncratic residuals, and ui is a random effect related to the ith household, accounting for the unobserved heterogeneity of the households and the correlation among the εijt.

To estimate Eq. (7) with scanner data, we assume that εiit follow a logistic distribution and thus that εijtc and εijtoare identically and independently distributed as type I extreme value (Onyango et al. 2007). This hypothesis on the distribution of the errors terms may sometimes be violated in empirical analysis, as the unobserved portion of utility, captured by the errors terms, can be correlated among the different consumer × product couples, but the potential induced bias is limited (Train 2003). Under this assumption, P(Yij = 1), the probability that the consumer i is an organic regular for the product j is given by the following logit model:

$$ P\left({Y}_{ij}=1\right)=F\left({Z}_{ij}\right)=F\left(\beta {X}_{ij t}+{\varepsilon}_i\right)=\frac{1}{1+\exp \left(-\beta {X}_{ij t}+{\varepsilon}_i\right)} $$
(8)

Furthermore, we construct two other logit models, one—logit product—using only the information on the product attributes and the other—logit household—using only the variables describing the consumers’ characteristics. Using a likelihood ratio test, we then compare the goodness of fit of these two models, in order to assess which set of characteristics—products’ vs households’—is the most important driver of regular organic behavior.

We repeat the analysis on a restricted sample containing only households who purchased at least 20% the product of interest as organic. This allows to examine whether the same variables are correlated with the probability of being a regular organic consumer for a product, compared with being either a conventional or occasional consumer (called thereafter “full sample logits”) or to the probability of being a regular organic consumer compared with being an occasional consumer only (“restricted sample logits”).

Data

This paper uses French data from Kantar WorldPanel which contains food-at-home purchases of French households. For the analysis, we use the number of organic, quality, and conventional products purchased, filtering out households who bought less than three times the considered products. To define the product families (level 1, 23 families), the classification from (Boizot-Szantai et al. 2017) is reused, with more detailed groups for fruits and vegetables. We also created a more in-depth classification (level 2, with 237 categories/products), which differentiate each fruit and vegetable (carrots, potatoes…) and types of meat (pork, beef…). In this panel data, we also have socio-economic information on 12,453 yearly active households, surveyed one or several years from 2011 to 2016 (5 years on average) whose descriptive statistics are presented below (Table 1). In addition, a map of the spatial distribution of our sample in France is available in SM 3.

Table 1 Descriptive statistics of the households in the sample (relative frequencies)

As shown in the “Introduction” section, the organic products’ availability and their price differences compared with conventional versions are presented as key drivers of organic consumption.

However, in the recent literature on organic consumption, availability is most often neglected. In order to assess the role of product availability on consumption behavior, an indicator of the availability of quality-food—organic, red label, or geographical indications—is developed for each consumer. This indicator is the share of shops which offer the quality food of interest out of all shops where the household is going and the exhaustive results are available in the supplementary data (SD) 1. It is computed after the following steps:

  • For each household, we define the set of shops where it is going as the shops where the household went at least three times.

  • For each shop, we estimate that a given quality product is available if it has been purchased by at least one household in a shop of the same retail chain, same size (hypermarket/supermarket), and in the same region. These three variables are indeed the most important in predicting the availability of organic products: together, they explain 68% of the variance in the number of organic items available for online purchase in the Burgundy region (see SM 2 for details on data collection and regression).

  • At shops where there are few purchases for a given product, the absence of organic purchases may be an artifact caused by data scarcity rather than actual unavailability. These shop × product combinations are filtered out as “no data” to avoid false negatives. For this purpose, the act of buying an organic product in a given shop is assumed to follow a binomial law using the nationwide average share of organic purchases for this product as its probability of success. We determine the critical (minimal) sample size, for a given significance level of 0.05 and a power level of 0.8, for which the hypothesis that the product is available but has not been purchased can be rejected.

  • Then, if the observed number of purchases of a product is above the critical sample size and if the organic version has never been purchased, we can assert that the organic version of the product is not available. However, if the organic version has been purchased at least one time, we directly assess that it is available in the shop considered.

  • The indicator of availability for each household is then defined as the proportion of shops offering the organic product among all shops in which the household purchases food. The indicator ranges between 0—the organic version of the product is never available to the household—and 1—the organic version of the product is available in all the shops the household attends to.

Similarly, we develop a price ratio indicator (organic price per kilogram divided by conventional price per kilogram) between organic and conventional versions of a same product, which varies depending on the region, the shop, and the product considered. In addition, we create an absolute price difference indicator (organic price per calorie – conventional price per calorie, the harmonization per calorie making it comparable across product families).

Results

Bimodality of quality-food consumption at different aggregation levels

Consumption behavior of geographical indications and label rouge products is occasional

For the few products for which information on quality purchases other than organic is available (labelled meat, fish, eggs, processed meat, and cheese), these quality purchases are not bimodally distributed, with the exception of processed meat (label rouge and PDO matured ham) which has a mode above 80% of quality purchases (Table 2). It must be noted that label rouge and PDO/PGI information are lacking for some products (i.e., meat, where certification information is available for chicken only). Consumers thus buy these products as exceptional purchases, possibly for special occasions. H1 is therefore invalidated for geographical indications and label rouge, and so is H2. Moreover, only 3% of the market value of these certified products is purchased by regular consumers, which invalidate H3 (for an example, see Table 4). As geographical indication and label rouge products are not subject to regular consumption, we do not perform the second part of the analysis (estimation of the determinants of regular consumption) for them.

Table 2 Distribution of the quality labels (PDO, PGI, label rouge) consumption per product family

Organic regulars are consistent for a given product, but not among product families

Fifty-six percent of the 23 families of products are subject to regular behavior from a significant share of consumers (Table 3): 35% of distributions are bimodal and 21% are multimodal with one mode in the 80–100% range). However, the rationale for agglomerating products into families is not necessarily suited to assess the consistency of consumer behavior. Indeed, consumers are more consistent at the product level (for exhaustive results see the SD 2 and 3): 80% of products are subject to regular behavior from a significant share of consumers (bimodal (44%) or multimodal with a mode in the 80–100% range (36%)). For example, the distribution of the organic fat product family is not bimodal while a deeper look at the distribution of the 5 products’ categories composing this family shows that 3 fat product categories have a bimodal organic consumption distribution (Fig. 2). H1 is therefore validated for organic products.

Table 3 Distribution of the organic consumption per product family
Fig. 2
figure 2

Distribution of the organic consumption of fat products

The basket of organic regulars remains dominated by conventional products

Even if the existence of organic regulars can be observed for 80% of food products, the consumers that purchase several products as organic regulars are scarce. More than 71% of our sample does not buy any product as organic regulars and only 6% purchase more than 5 products as organic regulars (see SD 4 for exhaustive results). The consumers in our sample purchase on average 80 different products. This shows that most consumers have a regular organic behavior for only a few products (Fig. 3). Thus, regular organic behavior is not very consistent across products and H2 is invalidated (this is consistent with the findings of Marian et al. (2014) in the case of organic red meat, chicken, milk, and hard cheese).

Fig. 3
figure 3

Distribution of the number of categories for which consumers are organic regulars

In our theoretical framework, this indicates strong interaction effects in the consumer utility function (Eq. (1)) between the organic (credence) attribute of a product and the search and experience attributes (roots vegetables, fruits growing in a tree …). In product families compounded of many categories such as fresh fruits and vegetables (F&V), the regular organic behavior seems coherent among very similar products (quasi-substitutes, e.g., lemons and oranges), but not among modestly close products (e.g., carrots and onions). The interaction effect could therefore reveal itself for search and experience attributes slightly more generic than the product level, but certainly more specific than the product family level.

There is however a non-negligible category of consistent regulars: 0.8% of consumers are organic regulars for more than 40 products, that is half of the average basket, noting that many minor products are often not available at all under the organic label. For this consumer category, the appetence for the credence attribute per se is strong (\( {a}_{il}^{cr} \) >> \( {a}_{ijl}^{se\times cr} \)) in our theoretical framework). This consumer category represents 3% of all regulars for at least one product.

Twenty-eight percent of the organic market is purchased by regular consumers

The share of the organic market in France purchased by organic regulars averages at 28%, which tends to validate H3. However, this average is heavily influenced by the market values of the products, which strongly differs between product families. For example, the market for organic wine is five times larger than the one of organic broccolis, but the share of the organic market purchased by regulars is only 8% for wine compared with 75% for broccolis. Thus, the size of the market and the unit values of products strongly drive down the average share of the organic market purchased by regulars. Among the detailed results, 40%, 33%, and 43% of the organic eggs, organic F&V, and the organic milk markets are purchased by regulars, respectively (Table 4).

Table 4 Examples of the organic market share represented by regular consumers (exhaustive results are available in SD 2)

Determinants of regular organic consumer behavior

Product availability and product family are key determinants

The logit model on the whole sample (Table 5, model 1) shows that regular organic behavior is more likely to occur when a consumer makes most of its purchases for a given product at a specialized organic store and if he purchases the organic product from the retailer’s brand. The price ratio between the organic and conventional versions of product is not significantly correlated with the probability of being a regular organic consumer while the absolute price difference between the two versions is positively correlated with it: a higher the price difference is associated with a higher share of organic regulars.

Table 5 Selected results of the logit models

The typical organic regulars are single, without children and have a lower body mass index (BMI), indicating that they have a healthier diet or practice more physical activities. They usually also have a high income, a managerial or superior professions or are own-account workers, and live in large cities. However, consumers living in rural areas (reference level) are more likely to be organic regulars than the ones livings in small cities.

The logit regressions reveal that some product families are more likely to be purchased by organic regulars than others: baby foods, biscuits, cakes and pastry, bread and flour, confectionary products, desserts, eggs, fat, beverages, meal substitutes, milk, sweeteners, processed F&V, and starchy food. The availability of the organic version of a product in the shops the consumers usually visit appears as a significant driver of regular behavior.

When using a smaller sample, compound of occasional and regular organic consumers only, the results are similar (Table 5, model 4). Nevertheless, in these restricted logit models, the role of prices differs: when comparing occasional and regular organic consumers, a higher price ratio between organic and conventional products is associated with a higher probability of being a regular consumer whereas the correlation with the absolute price difference becomes non-significant. In addition, in all logit models, the standard deviation of the households’ random effect is high, of comparable size with the largest treatment effect (eggs family), which show an important household-related variability of the probability of being an organic regular.

Products’ attributes matter more than households’ characteristics

Two other logit models are also computed, one using as predictor variables the information related to the products only (price ratio, frequency, processed or not…) and the other using the information on the household only (number of children, wealth, professional status…). Performing a likelihood ratio test, we conclude that the model using product information predicts better the regular consumer behavior (Table 6). These results still stand when using the restricted sample of occasional and regular organic consumers.

Table 6 Likelihood ratio tests between the models of product attributes and household characteristics, whole sample

Discussion

Strong interrelation between credence and search and experience attributes in determining regular organic consumption

The first part of this paper, focusing on the bimodality of organic consumption, outlines the product families for which regular consumer behaviors can be found: hot drinks, milk, eggs, baby foods, meal substitutes, desserts (which include dairy products), and bread and flour (Table 3). The results are confirmed by the logits on regular organic behavior, in which these product families have positive regression coefficients (Table 5). Except F&V and starchy food, these product families are the most purchased ones in the organic market (Agence Bio 2019a; Hill and Lynchehaun 2002). Clearly, product availability plays a role: product families that are difficult to find organically in conventional supermarkets (meat or seafood for example) are not dominantly subject to regular consumption behavior: their share of organic purchases is unimodal. Also, the fact that some product families are compound of more products than others (for example, fresh F&V family has 71 products while milk has only two) influences the distribution of organic consumption, as shown in Fig. 2. Moreover, the characteristics of organic products (price, quality, availability) likely vary between outlets. Consumers are likely to be influenced by the choice offered at the outlet where they shop most often, which may explain why consumers who mostly shop at specialized organic shops are more likely to be regulars than those shopping mostly in hard discounts. Moreover, all consumers do not have access to all outlet brands, depending on where they live. The availability indicator that we develop captures this disparity and appears as a key variable in the logit regressions on regular organic behavior (Table 5). Indeed, a lower availability of organic products compared with conventional ones is correlated with a lower organic consumption (Buder et al. 2014; Dimitri and Dettmann 2012; Massey et al. 2018).

In the same sense, very few consumers are organic regulars for a large number of product categories, as only 6% of them purchase as regular more than 5 product categories and only 0.7% of them have a regular behavior on more than 50% of the products they purchase (Fig. 3).

Following our theoretical framework, the most likely interpretation of this result is a strong interaction effect between the credence attribute (the product is organic) and the search and experience attributes (the product is an apple) in the consumer’s utility. Most consumers do not value much the organic sign per se, but they value it strongly for a few specific products. Relating that to the dominant health driver of organic consumption (Buder et al. 2014; Padel and Foster 2005), a possible explanation is that consumers are mostly concerned about the healthiness of a few specific products, e.g., because they have seen a documentary on the amount of pesticides in lemon or read a newspaper article on the amount of antibiotics in milk. Accordingly, they may become organic regulars for these specific products, for which they received health information, but this change may not spillover on their broader feeding routines.

However, at least two other reasons can explain this weak consistency of regular behavior across products. The first is again availability, as some products can only be found in specialized shops, but most of the consumers purchase food in several types of shops (conventional supermarket, open air market, specialized shops…). As the logit results show, if a product is mainly purchased at a specialized shop, the probability that the consumers will follow a regular behavior increases drastically. So, as consumers shop in different outlets, even if they choose to be regulars on some products that they usually purchase at their main shopping source, they may have to buy these products in others shops (supplemental purchases, oblivion, unexpected meal to prepare…) but the organic versions of these products may be unavailable, more expensive, or of a lesser quality and they will not purchase it. This interpretation is supported by the lower numbers of regulars found when the analysis is based on frequency (number of purchases) rather than volumes (liters or kilograms). Indeed, supplemental purchases weigh more on the shares of organic purchases when computed with frequencies. For the F&V family for example, we find that more households following a regular organic behavior for at least one product when the shares are computed with quantities (1548 households) than in frequencies (1303 households). Similarly, the number of F&V categories purchased by regular organic households is larger when they are computed in quantities (3317) than frequencies (2614).

The second reason is that the data we use is collected at the household level, and even if the main shopper is identified, there can be another shopper (husband/wife, teenagers…) that also sometimes purchases food. This occasional shopper may not have the same purchasing behavior or may not go to the same shopping source, and thus the household may not be regular even though each of its members are regulars of different styles. This effect seems weak however: couples are more likely to be organic regulars than singles (Table 5).

Finally, consumers are more likely to be organic regulars if they purchase products from retailer brands, which are generally cheaper than other organic brands (Ngobo 2011). However, the price ratio between the organic and conventional version of a product is not significantly correlated with regular behavior, and only the absolute difference in prices is significantly, and positively, associated with regular organic consumption. This surprising result—i.e., the more expensive the organic version of a product is, compared with the conventional version, the more likely the consumer will be an organic regular—can be understood as a weak price-elasticity of regular consumers to organic products prices. Indeed, if a consumer is an organic regular, most likely for health consideration, he will not be as reactive to the relative price of organic products than occasional or conventional consumers. This result is strengthened by the restricted logit models, in which the regression coefficient of the relative price of organic is positive and significant.

The characteristics of actual organic regulars are similar to those of “declared” pro-organic consumers

The description of regular organic consumers which can be drawn from the logit models is comparable with the main findings of the literature on the socio-economic characteristics of organic consumers (Kesse-Guyot et al. 2013). Indeed, we corroborate that regular organic consumers have a higher income, are more urban, and have a higher profession position than non-regular organic consumers (Table 5). Comparing regular consumers with occasional consumers only, we uncover that regular consumers are more represented in the upper-middle class and high professional status (Table 5).

Nevertheless, the comparison of the logit models with product attributes to the logit models with household characteristics only demonstrates that product attributes explain better regular organic behavior than the household characteristics (Table 6).

Public policy implications: product-specific targets

Of course, what matters most to policy makers is the total amount of organic production produced and sold, relatively to conventional production and the negative health and environmental externalities associated with it. However, an increase of organic production can be reached by sustaining organic consumption, which is currently done by French policies (“Ambition Bio 2022”, “Egalim” law in 2018). Our results from the analysis of organic consumption and their regular behavior advocate for more product-specific policies. Indeed, as much as 28% of organic value is purchased by households with a regular consumption pattern for a few specific products. Accordingly, public initiatives aiming at increasing organic consumption should focus both on promoting the organic label per se and on promoting specific organic products. They may also promote a “regular” attitude towards some organic products—e.g., “remove pesticides from your morning orange juice: buy organic oranges.” Targeting product families for which a regular consumption behavior is already frequent may be particularly promising, as regular organic consumption exists (H1 is validated) but is not yet very consistent across the food basket (H2 is invalidated). This is supported by our finding that product characteristics explain better the regular organic behavior than the household characteristics (Table 6). If there is a choice to be made between targeting specific products or specific consumers, one should go for specific products although some consumer segments—higher income, higher professional status, and more urban—are likely to be more receptive (Apostolidis and McLeay 2016).

Geographical indications and label rouge products on the other hand are purchased occasionally, making them somewhat comparable with luxury products. Marketing and promotion actions for these products may therefore want to get inspiration from non-food luxury products.

Conclusion

This paper demonstrates that quality-food with similar market shares can be subject to widely different consumption behaviors. Consumption of geographical indications and label rouge is always occasional while consumer attitude towards organic food is often regular: for a given product, some consumers tend to either purchase it always organic or always conventional. Indeed, conversely to previous studies using scanner data or surveys, we use observable information, the distribution of quality purchases, to categorize regular and occasional quality consumers. Doing so leads to a better comprehension of quality consumers’ behavior, i.e., that an important part (29% of our sample) is regular for at least one organic product, but that regular organic consumers for their whole food basket are scarce. Moreover, regular organic consumers purchase 28% of the total value of the organic market and up to 50% for some F&V, eggs, or milk. In this sense, public policies should target product categories instead of the whole food basket and develop organic regular behavior, using as levers the product attributes and household characteristics our analysis revealed as strongly related with regular organic consumption. However, other quality products, such as Geographical Indications, are purchased as extraordinary goods and so policies aiming at increasing their consumption should relate to those on luxury goods.

More precisely, we illustrate that the main product families for which organic consumption distributions are bimodal, i.e., for which regular consumer behaviors can be found, are eggs, milk, baby foods, meal substitutes, desserts, bread, and flour. When organic consumption is analyzed at product level, one sees that its distribution is bimodal also for most fruits and vegetables (processed or raw).

Besides, households which exhibit regular organic consuming behavior are richer, more urban, and have a higher profession statute than the others. These organic regulars also have a higher propensity to be in couples and to have fewer children.

The products categories that are consumed by organic regulars are more available and organic regulars do not seem to be influenced by the price of organic products. To the contrary, a higher price difference between an organic product and its conventional counterpart is associated with a higher share of organic regulars, possibly because organic regulars are willing to buy the organic alternative at all costs. Moreover, product attributes explain better organic regular behavior than household characteristics. Further research may explore the interaction effects in the consumer utility function between the organic attribute and other product attributes (thickness of skin for fruits, roots vegetables…). Computing price and income elasticities of quality-food products would also shed light on consumer behavior, especially in defining which products are luxury goods. Lastly, bridging the gap between organic production and consumption may help designing efficient policies: in light of our work, policies could focus on securing the regular consumption of products whose production in organic systems has high environmental or health value-added (production nearby water catchments for example).