1 Introduction

Global environmental agendas are putting increasing policy efforts to keep global temperatures below the target of 1.5 °C above pre-industrial levels advocated by the IPCC.Footnote 1 Human consumption plays an important role in these agendas: the production, delivery, and storage of all products available in the marketplace require the emission of greenhouse gases (GHG) (Camilleri et al. 2019; Manderson and Considine 2018; Panzone et al. 2020; Vermeulen et al. 2012). At household level, estimates indicate that food alone accounts for up to 37% of GHG emissions (Poore and Nemecek 2018; Springmann et al. 2018), that is their total carbon footprint measured in grams of carbon dioxide (CO2) equivalentFootnote 2 (gCO2e). The challenges of changing households’ choices and behaviour have led to calls for interventions targeting consumer behaviour specifically (Dietz et al. 2009; Vandenbergh and Steinemann 2007).

However, consumers have limited incentives to reduce the carbon emissions from their consumption: in the case of a global public good, the environmental impact of present consumption will be felt by society sometime in the future and often far away in space (Gifford 2011; Steg 2016; Weber 2006, 2018). At the same time, occasional changes in behaviour are unlikely to keep the increase in global temperatures below the 1.5 °C target: the achievement of this ambitious goal requires consumers to consistently privilege behaviours that have the least environmental impact (Galizzi and Whitmarsh 2019; Truelove et al. 2014; Ulph et al. 2023). This is particularly important for the case of grocery shopping, where shopping trips consist of a sequence of choices targeting a range of consumption goals (Panzone et al. 2021a; Sheehan and Van Ittersum 2018).

A key research question in this area is how to motivate consistent sustainable behaviour. Literature studying consumer decision-making retail environments presents three main tools to drive more sustainable consumer choices: carbon labelling (Muller et al. 2019; Potter et al. 2022; Suchier et al. 2023), carbon (or meat) taxation (Panzone et al. 2021b, c), and environmental nudges (Demarque et al. 2015; Kanay et al. 2021). Nudges could be an effective tool to motivate consistent sustainable consumption by strengthening self-control and focusing consumer attention to the environmental impact of their choices (Carlsson et al. 2021; Sunstein and Reisch 2014). Appropriate nudges could also facilitate consistent behaviour by activating relevant constructs that drive choices during the consumption process.

This study uses an incentive-compatible framed field experiment (Harrison and List 2004), to explore the effectiveness of commitments nudges and badges in increasing the environmental sustainability of consumer behaviour in the food domain. In this experiment, consumers shop in an online store that provides real-time basket-level carbon footprint and nutrition data. The store, containing over 900 products commonly consumed by the study population, allowed consumers to shop at any time (within a week) and from any location, with no interactions with the experimenter. Understanding the environmental impact of consumer choices in retail environment is increasingly considered key to the design of interventions that can increase the sustainability of consumption by driving large-scale changes in consumer behaviour (Macfadyen et al. 2015; Vadakkepatt et al. 2021). We focus on online shopping as this is a fast-growing retail segment in the UK (Panzone, Larcom, and She 2021a, b, c, d); moreover, online retail lends itself to significantly more engagement between retailers and consumers during the shopping trip, through stimuli that can adapt to consumer choices (Todd et al. 2013), leading to interventions that can be implemented at low-cost.

Results indicate that presenting consumers with a voluntary or forced commitments leads to a reduction in the carbon footprint of shopping baskets, also recording a drop in the amount of fats in the food baskets; the reduction in carbon footprint is of similar magnitude for both commitments. We find additional benefits in terms of consumption of fats and, for forced commitments, that of salt by 18%. However, a badge does not have a significant effect on behaviour. In the group where commitment is voluntary, the reduction is driven by those consumers who accept the commitment, who move towards the sustainability goal early in their shopping trip. The reductions observed in the commitment groups occur primarily through an increase in the budget allocated to the purchase of fruit and vegetables, and a reduction in the budget for dairy and eggs. Finally, Participants in the commitment groups were recorded as spending more time reading environmental labels, and were more likely to report an environmental goal.

This article contributes to the existing literature on sustainable food consumption by exploring ways to nudge consumers directly when making choices. Compared to previous research, this study focuses on the design of a motivational nudge, which is designed to increase consumer motivation to reduce the carbon emissions of their food baskets. In particular, we link the literature on goal pursuit (Fishbach and Dhar 2007) and bounded willpower (Baumeister 2002; Jolls et al. 1998) to the existing literature on sustainable consumer behaviour to design a behavioural intervention that aims at increasing the sustainability of food shopping in retail environments. Using an experimental approach, we are able to determine the causal relationship between a commitment or a badge, and the resulting change in the carbon footprint of the food basket in an online supermarket.

The next section presents some relevant theoretical background and experimental hypotheses, and a stylised model of behaviour. Section 3 outlines the data collection process, which used an experimental online supermarket to measure the carbon footprint of consumer choices. Section 4 explains the econometric model used in the analysis, with results presented in Sect. 5. Section 6 discusses the results, and Sect. 7 concludes.

2 Theoretical Background

2.1 Consumer Choices and Bounded Willpower

The pursuit of a consumption goal requires an individual to ensure all decisions are conducive to the achievement of this goal, exerting willpower when facing a conflict between short-term interests and the long-term objective (Baumeister 2002; Fishbach and Dhar 2007; Zhong et al. 2009). For instance, consumers aiming at minimising the carbon footprint of their food basket need willpower to prefer low-carbon options (e.g., vegetarian burger) when other high-carbon alternatives (e.g., beef burger) are more appealing. However, willpower is costly to exert, because it consumes physical and cognitive resources, which are finite. As a result, consumers often display bounded willpower, which leads individuals to prioritise short-term interest over long-term goals in some of their decisions (Gino et al. 2011; Jolls et al. 1998). Bounded willpower can be problematic from a policy perspective because it shows dynamic inconsistency (Baca-Motes et al. 2013; Gneezy et al. 2012; Sadoff et al. 2020; Ulph et al. 2023): consumers will mix high-carbon and low-carbon options in their shopping basket, particularly as willpower depletes, leading to baskets that have higher carbon footprint compared to a scenario with unlimited willpower. This problem may be solved by intervening directly when consumers make choices.

2.2 Commitment as a Tool to Motivate Pro-environmental Behaviour

A commitment to a clear, actionable goal can be a suitable strategy to motivate pro-social behaviour, and counter the depletion of willpower (Brocas et al. 2004; Bryan et al. 2010; Burke et al. 2018; Himmler et al. 2019). The literature presents several theoretical explanations of why commitments can reduce the carbon footprint of grocery shopping. First, research indicates consumers start shopping with fuzzy goals (Lee and Ariely 2006); in the presence of conflicting goals, a commitment increases the motivation to pursue a focal goal over others (Fishbach and Dhar 2005; Rogers et al. 2014). Second, a commitment may activate self-image concerns, which motivate consumers to respond by behaving in line with the core values of their self-image (Ariely et al. 2009; Baca-Motes et al. 2013; Falk 2021; Mazar et al. 2008). Third, commitments may activate the need to comply with social norms, leading to feelings of guilt if such social norms are broken (Charness and Dufwenberg 2006; Matthies et al. 2006; Theotokis and Manganari 2015). Fourth, individuals may have innate preferences for promise-keeping (Ellingsen and Johannesson 2004; van der Werff et al. 2019; Vanberg 2008), experiencing—or expecting—guilt when breaking promises (Charness and Dufwenberg 2006; Ellingsen et al. 2010). Finally, when the opportunity or requirement of a commitment is seen as coming from an authority, there may be a desire to comply (Karakostas and Zizzo 2016).

In our experiment, consumers were asked to commit to keeping their overall carbon footprint below an ambitious threshold (the bottom 20% of the pre-intervention distribution). We hypothesize that a commitment to a clear, actionable goal, whether voluntary or forced, can be a suitable strategy to nudge more consistent sustainable (low-carbon) consumer behaviour. Our first hypotheses are:

H1: A voluntary commitment leads to a lower carbon footprint than a control group with no commitment.

H2: A forced commitment leads to a lower carbon footprint than a control group with no commitment.

In support of these hypotheses, previous research in the environmental domain has shown that consumers who committed to an environmental goal were more likely to use public transport (Matthies et al. 2006); conserve water by taking shorter showers (Dickerson et al. 1992); reduce their energy consumption (van der Werff et al. 2019); and re-use their hotel bath towels (Baca-Motes et al. 2013). In a meta-analysis, Lokhorst et al. (2013) also shows that commitment motivates a range of environmental behaviours, with an effect that can last over time, although their study does not include examples in the food domain.

In our study, we focus on two types of commitment, which differ on whether the origin of this decision is internal or external to the decision-maker (in the same spirit as Gino et al. 2013). In a voluntary commitment, the participant chooses to commit or not. By gathering the interest on environmental preservation in motivated consumers, the commitment is expected to lead to a reduction in the carbon footprint of the food baskets, compared to a control with no commitment.Footnote 3 In a forced commitment, the participant is forced to commit to the goal in order to continue shopping online, and the decision is imposed externally, for instance by a paternalistic policymaker (or the research team in our online supermarket). Note that, while we use the term ‘forced’ to describe such a commitment, this exercise uses a soft commitment, which imposed only psychological (i.e., non-monetary) rewards (costs) for keeping (violating) a commitment (Brocas et al. 2004; Bryan et al. 2010; Burke et al. 2018; Himmler et al. 2019) and was not actually enforced; participants unwilling to commit could opt out from the experiment.

It is not clear ex ante what prediction can be made in terms of the relative performance of the two forms of commitment. The rejection of hypotheses H1 or H2 can include the lack of an effect (no change in carbon footprint), as well as an increase in carbon footprint if consumers experience reactance (Espinosa and Treich 2021; Sunstein 2017). Reactance refers to a psychological state experienced by individuals when they perceive their freedom of choice or decisional autonomy to be threatened or restricted by external influences. Consumers may feel this restriction when the commitment is forced; but also in the voluntary commitment, if they feel that the request violates their individual freedom to determine what is good for them. If they experienced reactance, consumers would respond by doing the opposite of what is requested to them as a way to assert their autonomy (Botti et al. 2008; Espinosa and Treich 2021; Sunstein 2017): in the case of this article, by increasing, rather than decreasing, the carbon footprint of their basket.

Putting the possibility of reactance aside, voluntary commitment may be less effective than forced commitment if those individuals who commit are already more sustainable than the average consumer: in this case, the marginal costs of reducing the carbon footprint of those who commit are high, leading to smaller changes compared to a commitment requested to those consumers with lower carbon abatement costs. The same effect may be observed in consumers where the commitment causes significant consumption losses (e.g., in consumers attaching a high value to the consumption of high-carbon goods, for instance meat), who will have the ability to refuse the commitment if voluntary (in line with Gino et al. 2013). At the same time, forced commitment may encourage motivated participants to explore alternatives that they do not usually consider (Larcom et al. 2017), as well as make a stronger case for the existence of a social norm to comply to. This leads to hypothesis H3a:

H3a: A voluntary commitment is less effective in reducing carbon footprint than a forced commitment.

However, if the failure to be sustainable is due to personal limitations, e.g., lack of knowledge, a voluntary commitment may be more effective than a forced commitment, because those who commit voluntarily are more motivated to search harder for low-carbon options. This point is particularly relevant for the case of carbon footprint, which consumers do not know well (Camilleri et al. 2019; Panzone et al. 2016, 2020). Moreover, as noted above, a forced commitment might cause reactance (Botti et al. 2008; Sunstein 2017), therefore reducing the effectiveness relative to a voluntary decision. Then, hypothesis H3a becomes:

H3b: A voluntary commitment is more effective in reducing carbon footprint than a forced commitment.

Gino et al. (2013) provide evidence to support H3a: allowing individuals to accept monitoring of their behaviour on a voluntary basis increases the likelihood of cheating relative to imposing monitoring or not monitoring at all. However, Gino et al. (2013) requested participants to accept third-party monitoring of their compliance to a goal, while our study asks consumers to accept a goal and self-monitor, a less intrusive request.

2.3 Rewarding Environmentally Friendly Behaviour Through Dynamic Badges

Along with commitments, in this study we also explore badges as a tool to reduce the carbon footprint of food shopping. Willpower can be motivated by providing (monetary or non-monetary) incentives for self-control (Dhar and Wertenbroch 2012; Fishbach and Hofmann 2015; Schmeichel and Vohs 2009; Walsh 2014); among these, of relevance to this study is the use of signalling pins and bracelets (Baca-Motes et al. 2013; van der Weele and von Siemens 2020). Badges are a gamification element that provides a soft (that is, psychological) reward when a specific goal is achieved (Edwards et al. 2016; Hamari 2017; Sailer et al. 2017). Gamification, which includes the use of badges in online settings, is becoming an important element of human–computer interactions (Hamari 2017; Hock et al. 2019; Sailer et al. 2017; Whittaker et al. 2021).

In the context of this article, badges may operate through four main pathways. First, a badge may increase the awareness of an environmental goal, effectively priming goal pursuit (Tate et al. 2014; Walsh 2014). Second, badges self-signal pure achievement of a goal (Sailer et al. 2017; Whittaker et al. 2021). Third, a badge may allow consumers to self-monitor behaviour (Edwards et al. 2016): the presence of a dynamic badge that appears when a basket is low in GHG signals to the individual that behaviour aligns with their own values of environmental preservation, and willpower to refrain from purchasing high-carbon items is not needed. Finally, a badge provides feedback on the behaviour being monitored (Sailer et al. 2017): the consumer learns of having done something “desirable” when the badge is present (a low-carbon basket), or “undesirable” when the badge is absent (a high-carbon basket).

Based on the literature presented above, our fourth hypothesis is:

H4: A visible badge leads to a lower carbon footprint than the control group.

Prior research presents mixed findings on the effectiveness of badges in other settings. Baca-Motes et al. (2013) show that signalling, in the form of a pin that has a specific meaning to the consumer only, but which is visible to others, had a significant, if relatively small, effect on the reuse of hotel towels. Conversely, van der Weele and von Siemens (2020) found that bracelets reminding of pro-social behaviour (e.g., donations to the Red Cross), visible to the decision maker only, failed to motivate further pro-social behaviour. In our setting, badges are only visible to the decision maker, a point we will return to in discussing the results.

3 Methodological Approach

3.1 NU-Food Portal

Sales data for all participants were collected using Newcastle University’s NU-food online supermarket. The store contains 908 different stock-keeping units (SKUs). For each SKU, the store contains information including: the name and image of the product; its price; its macro-nutritional information; and the carbon footprint from published sources, as in (Panzone et al. 2021c), Clune et al. (2017), Drewnowski et al. (2015), Scarborough et al. (2014). Carbon footprint is presented at product level, with varying granularity across food categories: for some categories, for instance eggs, carbon emissions vary by product, and substitution requires trading off GHG with other characteristics; for other categories, such as fresh beef, the carbon footprint per gram is the same for all products, although products will have different carbon footprint depending on their weight; finally, other categories, such as jam or ice cream, have no intra-category variability in carbon footprint, and products differ only in other attributes (e.g., flavour, price), making substitution random with respect to GHG.

Consumers could access the carbon footprint and nutritional composition of each product by moving a cursor onto a specific icon (Fig. 1); the system recorded how long the window with the information remained open. Consumers could search for products using a search box. The carbon and nutritional content of the basket was always accessible, and updated in real-time as consumers added or removed products. The NU-food portal could be accessed from anywhere and on any type of electronic device with Internet (computer, tablet, or phone), and consumers had no interaction with the research team during their experiment (other than via email for technical queries). These features were available to all participants in all weeks.

Fig. 1
figure 1

Graphical representation of the NU-food supermarket. Note The information on CO2e and macronutrients data was only visible to participants when they hovered over the respective icon with the mouse

3.2 Experimental Procedure

In a framed field experiment (Harrison and List 2004), for two consecutive weeks participants were given a virtual weekly budget of £25.00 to shop on NU-food.Footnote 4 Participants could enter the store as many times as they wanted during each experimental week (from 9 am on Monday to midnight on Sunday), but they could only complete each week’s overall transaction once. To ensure this was the case, the check-out functionality was automatically de-activated once the transaction was completed, and re-activated at 9 am on the following Monday. Participants could spend as much as they wanted from the £25 budget, knowing that any unspent budget would be given to them. After the second shopping session (including the final questionnaire) was submitted, one of the two weeks of shopping was randomly chosen by the computer for the participant to collect at Newcastle University; any unspent budget for the randomly selected week only was also returned to participants upon collection. Participants were told that their shopping collection would be at least one week after their second week of shopping.

3.3 Participant Recruitment

The experiment was advertised by posters in offices and leisure facilities, such as cafés and community centres, around the city of Newcastle upon Tyne (UK). 1355 people from the general public registered interest to participate in an online shopping study. Of these, 1206 were randomly chosen to participate, and randomly assigned to an experimental group (participants were unaware of group allocations until week 2 started). 780 consumers finished shopping in week 1, while 677 participants completed the two weeks of shopping in the main experiment. An additional 48 participants completed the experiment in a separate group, used to test whether information on carbon footprint in the store primed choices; results in Appendix 1 fails to reject the null hypothesis of no priming effects.

Each participant was paid a £5 show up fee. Before starting the experiment, participants had to register; at this stage, they were given an anonymous login, and could choose a personal password. Once registered, participants had to give explicit consent by filling an online form outlining the terms and conditions of the experiment.

3.4 The Sustainability Threshold and the Shopping Goal

The manipulations proposed in this research require the definition of a threshold that defines sustainability in terms of carbon footprint, measured in gCO2e, at a basket level, for both the badge and the commitment. The threshold was defined as 180gCO2e/100 g: food baskets were classified as low-carbon when below this threshold; and high-carbon if above the threshold. The value of 180gCO2e/100 g was identified as it refers to around the bottom 30% of the carbon footprint distribution in the baseline week of Panzone et al. (2021c); in this study, it corresponds to the bottom 20% of the baseline week.

The definition of a threshold in gCO2e/100 g of food gives consumers a goal in terms of carbon footprint per weight. Conversely, the policy aim of the interventions is to reduce the total carbon footprint of the food baskets consumers purchase in the experimental store. The normalisation of the goal by weight ensures the incentive provided with the consumer aligns with the goals of the retailer as well as the policymaker. In fact, a goal defined in terms of absolute carbon reduction (e.g., buy less than 20,000 gCO2e) in the experimental supermarket can be achieved by spending less during the experiment, cashing in more of the budget and use it to buy high-carbon options (e.g., meat) outside the experimental store. The same would apply in real life if a single retailer in a competitive marketplace was encouraging consumers to reduce their carbon emissions in their stores. On the other hand, the optimal strategy for consumers to reduce their carbon footprint per weight is buying low-carbon goods: buying less can decrease as well as increase the carbon footprint of the basket, depending on the composition of the basket, and the normalised goal can be only met by buying low-carbon items. This adjustment also has real-life appeal for retailers, because the normalised goal ensures that consumers reduce their carbon footprint whilst buying the same amount of goods.

3.5 Experimental Manipulations

The experimental design consists of a mixed design, using 2 (virtual badge vs. no virtual badge) × 3 (no commitment vs. voluntary commitment vs. forced commitment) orthogonal between-participants design (Fig. 2), over two experimental weeks (the within component). In week 1 of the experiment, participants shopped without any intervention in place; consumers were then randomly allocated to an experimental group in week 2. The three experimental stimuli are as follows. (Fig. 2).

Fig. 2
figure 2

Experimental design of the main treatments

3.5.1 Voluntary Commitment

A voluntary commitment is the voluntary decision to accept the promise to keep the carbon footprint of the food basket below the sustainable threshold. In this group, participants were asked “Will you commit to check out with a low carbon footprint basket?”, whilst being told that “Based on previous studies, a low carbon shopping basket is one which is lower than 180gCO2/100 g”. Participants could only proceed to the store after ticking either the acceptance of the commitment (“I am interested in protecting the environment; therefore, I commit to keeping the carbon footprint of my basket below 180gCO2/100 g”) or the rejection of the commitment (“I do not want to commit myself to keeping the carbon footprint below 180gCO2/100 g”) box, respectively (Fig. 3). The threshold was not enforced, and participants could check out independently of whether they exceeded the threshold or not.

Fig. 3
figure 3

Graphical representation of the Voluntary Commitment with Badge

3.5.2 Forced Commitment

A forced commitment required the consumer to accept the promise to keep the carbon footprint of the food basket below the sustainable threshold. The manipulation was identical to the voluntary commitment indicated above; however, participants could only tick the commitment acceptance box (“I am interested in protecting the environment; therefore, I commit to keeping the carbon footprint of my basket below 180gCO2/100 g”) to proceed into the store. Participants could not proceed without ticking the commitment box, and could not reject the commitment (Fig. 4), although they could simply leave the online supermarket if they did not want to commit. Participants who would try clicking on “Start shopping” to continue without accepting the commitment would receive an error window asking them to tick the box before proceeding (the software recorded this information). The threshold was not enforced, and participants could check out independently of whether they exceeded the threshold or not.

Fig. 4
figure 4

Graphical representation of the Forced Commitment with Badge

3.5.3 Badge

In week 2, participants in the Badge treatment were told that a badge would appear on the screen whenever they had a low-carbon basket. They were explained that “Based on previous studies, a low carbon shopping basket is one which is lower than 180gCO2/100 g”. Whilst shopping, the participant would see the badgeFootnote 5 (Fig. 5) whenever their carbon footprint was equal to or below 180 g CO2e/100 g, and would disappear if basket was above the threshold and only to return if the shopping basket was below (or equal to) the threshold.

Fig. 5
figure 5

The virtual badge

3.6 Final Questionnaires

After submitting their basket, each week participants completed a questionnaire. In both weeks, they were asked about their shopping trip (e.g., shopping goals, type of shopping trip), inventory, and recorded their self-control scale (Tangney et al. 2004) and moral self-image (Jordan et al. 2015). In addition, the questionnaire in week 1 collected demographic information (gender, age, postcode, education, income, household size); in week 2, it also collected attitudes and self-perception towards health and the environment (Cornelissen et al. 2008), health and environmental social and self-signalling (own elaborations, based on Bem 1967; Dubé et al. 2017), health and environmental identity (Aquino and Reed 2002), and their ethical mindset (Cornelissen et al. 2013).Footnote 6 Finally, a follow-up questionnaire was handed to participants when they came to collect their food, to explore whether consumers felt the commitment was binding, as explained in Appendix 2.

4 Econometric Model

The econometric estimation of the average treatment effect of a manipulation follows Panzone et al. (2021a), who use a Difference-in-difference (DID) estimator (Bertrand et al. 2004; Imbens and Wooldridge 2009; Wing et al. 2018). In the experiment, participants shopped over 2 successive weeks. Week 1 is a baseline week, where no intervention is in place. In week 2 participants are allocated to a treatment k = 0, 1, …, 5, where k = 0 is the control group, and k = 1, …, 5 are the experimental groups. Within each week t = 1, 2, participants i purchase a basket with total normalised carbon footprint Cit (in gCO2e): we label this as the consumer’s carbon footprint for short. The average treatment effect is then estimated as difference between the average change observed in the treated individuals and the change in individuals in the control group over the same time (Imbens and Wooldridge 2009), or

$$\phi_{k2} \equiv \left[ {\overline{C}_{k2} - \overline{C}_{k1} \left] - \right[\overline{C}_{02} - \overline{C}_{01} } \right]\quad k = 1,...,5$$
(1)

where \(\overline{C}_{kt}\) is average carbon footprint of individuals in experimental group k in week t.

This approach acknowledges that changes in behaviour between groups from the baseline week to the experimental week could be driven by factors other than the experimental stimuli (e.g., social media, interaction between participants), which the experimenter cannot see. Equation (1) removes all unobservable effects by removing the change that would have occurred in the absence of stimuli: this item corresponds to the change in the control group, captured by the term \(\left( {\overline{C}_{02} - \overline{C}_{01} } \right)\). Crucially, because the treatment is randomly allocated, there is no self-selection into a treatment, ensuring the absence of endogeneity in the DID estimator. In our analyses, we estimated the log-linear panel regression:

$$\mathrm{ln}({C}_{it})={\alpha }_{0i}+{\sum }_{k=1}^{k=5}{\alpha }_{1k}{G}_{ik}+{\alpha }_{2}{W}_{t}+{\sum }_{k=1}^{k=5}{\pi }_{k}{W}_{t}{G}_{ik}+{\nu }_{it}$$
(2)

where Wt refers to a dummy equal 1 if t = 2 (zero otherwise); and Gik refers to a set of dummies capturing the experimental stimuli (zero for the control group). Finally, α0i refers to individual-specific fixed effects, which capture any time-invariant personal attitudes and characteristics. The average treatment effect for the treated (ATT), \({\phi }_{k}\), is estimated as in Puhani (2012).

5 Results

5.1 Demographic and Shopping Characteristics of the Sample

Table 1 presents the summary characteristics of the 677 participants who shopped and completed the final questionnaire. The sample is characterised by a majority of professional (non-student) workers (84%), females (69%), primarily in the 25–44 year-old range (60%), most commonly in possession of an undergraduate degree or above (70%) and with a relatively high income. The sample is broadly comparable to that of the local population from the North-East of England in terms of total family size and income, but included younger individuals, with slightly less children, and more likely to have a graduate education. The sample also has more women, a feature typical of store-level data, where women are more likely to have responsibility for the family shopping within a household.

Table 1 Average demographics of the sample, by group

A series of χ2 tests shows that participants across treatments did not differ significantly in their demographic composition (Table 1), and their week-1 personal attitudes and beliefs (Table 2). All treatments register comparable in-store expenditures in week 1, with the only exception of the forced commitment treatment, whose expenditures are slightly larger than the control group only in week 1. All other summary basket characteristics, including carbon footprint, do not differ significantly across groups in week 1 (Table 3). Overall, these summary statistics indicate that the random assignment of subjects to treatments was effective.Footnote 7

Table 2 Average attitudes of the sample, by group
Table 3 Summary basket characteristics, averages by week and treatment group

At the same time, we find no evidence that attrition (that is, dropout rates) differed across treatment groups: a Pearson χ2 test performed on the 780 consumers who completed the shopping trip in week 1 indicated that the probability of dropping out in week 2 is unrelated to treatment group membership (χ2 (5) = 5.01, p = 0.415).

Table 3 indicates that the mean average spend in the overall sample was £23.47 in week 1 (range: £1.15–£25, N = 677), with 90% of participants spending £20.18 or more; and £23.00 in week 2 (range: £0.59–£25, N = 677), with 90% of participants spending £19.09 or more. Most shoppers saw this exercise as a top-up shopping trip (26% in week 1, 28% in week 2) or part of a full weekly shopping trip (58% in week 1, 57% in week 2); while for a minority of consumers this occasion was a full-size weekly shopping trip (12% in week 1, 9% in in week 2), or as an “unusual” weekly shopping trip (4% in week 1 and 6% in week 2).

Table 3 and Fig. 6 show that, apart from the control and the badge treatments, there was a reduction in the average total carbon footprint of shopping baskets in week 2, compared to week 1, with a leftward shift of the distribution for the whole sample across week (Kolmogorov–Smirnov test: D = 0.10, p < 0.001). An analysis of the carbon footprint in week 1 indicates that the low-carbon footprint threshold (180 gCO2e/100 g of basket) represents the bottom quintile of the distribution, representing an ambitious goal. The drop in carbon footprint came with no significant change in basket weight; while the voluntary commitment group, and the forced commitment with badge group recorded a drop in kilocalories (Table 3). Finally, while the forced commitment group spent slightly more than other groups in week 1, expenditures were in line with the rest of the sample in week 2 (Table 3).

Fig. 6
figure 6

Median carbon footprint, by week and group

The commitment manipulation also had an impact on goal pursuit and search. ANOVA-style tests (Table 4) reveal that participants in both commitment groups were more likely to indicate that they had an environmental goal in their shopping trip; Probit regression (Table 15) estimate the probability of reporting an environmental goal increased by 16–20%. Table 4 also indicates that participants in the commitment groups spent more time looking at the carbon footprint of products, searching for this information on more products. Finally, participants facing a forced commitment scored higher in self-signalling and, marginally, on self-control (Table 15).

Table 4 Repeated measure ANOVA, chi2 values

Finally, Fig. 7 shows how the consumer goal (in gCO2e/100 g) changed as participants added items to their baskets: in all groups, the median cumulative carbon footprint per 100g tended to start relatively low and grow, then declining towards the end of the shopping trip; in week 2, the commitment groups show an earlier decline in the carbon footprint by weight. Figure 8 shows that this decline happened particularly early for those who committed to the goal, for whom the median food basket remained below 200 gCO2e/100 g most of the time, and started moving towards the threshold from the fourth choice.

Fig. 7
figure 7

Median cumulative gCO2e/100 g of basket, by group and week. Note the horizontal black line refers to the low carbon threshold of 180 gCO2e/100g basket weight. Note that from choice 20 onwards, the number of consumers is always below 20 in each treatment, and less than 2 people per group make more than 25 decisions

Fig. 8
figure 8

Median cumulative gCO2e/100 g of basket, by group, week and commitment. Note the horizontal black line refers to the low carbon threshold of 180 gCO2e/100g basket weight

5.2 The impact of Commitment and Badges on the Carbon Footprint of Food Baskets

We now use the econometric model presented in Sect. 4 to test our hypotheses. The dependent variable is the natural logarithm of the total CO2e in the basket of the consumer in the experimental week, in gCO2e. Results refer to a fixed-effects panel Difference-in-difference (DID) estimator, with bootstrapped standard errors clustered at individual consumer level, and stratified by treatment group.Footnote 8 This approach estimates the change in carbon footprint in the presence of the experimental stimuli over time, removing the change over the same period observed in the control group. Coefficients refer to half-elasticities, which measures the % change in carbon footprint when the treatment dummy is 1.

Table 5 presents the key results. A regression with simple treatment effects is reported in Table 17 in Appendix 3 for reference; while results with all the interactions are available in Table 18 in the Appendix. Finally, Table 19 in Appendix 3 presents the same set of regressions using ln(CO2e/100 g) as the dependent variable.Footnote 9

Table 5 DID estimates

In Table 5 (and Table 18 in Appendix 3), model A regresses the carbon footprint of the food basket over a single joint commitment variable, equal to one for participants in any commitment group. Model B measures the average treatment effects of both voluntary and forced commitment separately, as well as that of the badge. It is the key model for testing all of our hypotheses. Model C separates those who voluntarily accepted and those who voluntarily rejected the voluntary commitment, to observe whether the two groups behaved differently. Importantly, in model C individuals self-selected in or out of the commitment group driven by personal preferences (as shown in the next section), and the results should not be interpreted as the causal impact of the commitment, but rather as the change in carbon footprint in the segment of consumers with strong preferences for an environmental commitment. All the three regressions are estimated with and without time-varying personal characteristics to remove unobserved heterogeneity: the weight of the basket (in kilograms); self-control; environmental self-image, self-signalling, and social signalling. Across all models, a 1% increase in basket weight is associated to an increase in carbon emissions by 0.55%; while the psychological scales do not explain differences in carbon footprint in any regression.

Model A indicates that the presence of a commitment (either forced or voluntary) reduces the carbon footprint of the food baskets by 10–11% (a net reduction of ~ 2 kgCO2e). Model B indicates that both commitments contribute to comparable reductions in carbon footprint: the voluntary commitment leads to a 9% reduction in carbon footprint (~ 1.8 kgCO2e); while the forced commitment causes a 10–12% reduction (~ 2.1 kgCO2e). The difference between each type of commitment is not statistically significant.

Result 1: Both a voluntary and a forced commitment lead to a lower carbon footprint than a control group with no commitment, in support with H1 and H2.

Result 2: There is no statistically significant difference in effectiveness between voluntary and forced commitments, not supporting H3a and H3b.

Model C indicates that the reduction caused by a voluntary commitment is driven by those consumers who accepted the commitment: they reduced their carbon footprint by over 15% (~ 2.7–2.9 kgCO2e), while those who refused recorded a small reduction, around 0.8–1.5% (− 0.15–0.27 kgCO2e), not significantly different from zero. In model C, a forced commitment caused a 11–12% reduction in carbon footprint (~ 2.1–2.2 kgCO2e). Effect sizes change slightly when interaction terms are included, with a particularly large increase in the size of the standard errors (Table 18, Appendix 3). To put the results into perspective,Footnote 10 driving 1 mile with an average passenger vehicle emits 398 gCO2e; while charging one smartphone emits 8.22 gCO2e. Overall, these results provide support to hypotheses H1 and H2. Wald tests found no statistical difference between the two types of commitments, providing no support for H3a and H3b.

We do not detect any statistically significant effect of introducing a badge on the carbon footprint: the presence of the badge led to a non-significant reduction of around 3.2–3.7% (0.67–0.72 kgCO2e) across all models. This result does not support hypothesis H4.

Result 3: A visible badge does not lead to a lower carbon footprint than the control group.

Table 6 summarises how consumers allocated their £25 across food categories and savings. This table indicates that in week 2 consumers in the voluntary commitment groups consumed more fruit and vegetables, and reduced their consumption of dairy and eggs, other products of vegetarian origin, and drinks. Those in the forced commitment treatment increased savings, and reduced consumption of dairy and eggs, and other products. Interestingly, consumers did not reduce their consumption of meat, despite the large potential carbon savings in this category (Poore and Nemecek 2018). Finally, Table 7 indicates that a forced commitment had an impact on the total amount of fats and salt in the food basket, which dropped by 26% and 18%, respectively, with no impact on the amount of sugar, proteins, and kilocalories in the basket; a voluntary commitment reduced total fats in the basket by 27%.

Table 6 Share of the £25 budget allocated for each food category and savings
Table 7 DID for total kilocalories and nutrients in the final basket

5.3 Who Commits?

Table 8 shows the commitment rates by treatment group. When consumers were asked to voluntarily commit, 54–56% accepted. When commitment was forced, 79–82% of participants ticked the box straight away; while 18–21% of consumers tried to proceed without ticking the commitment box, and ticked it once they were reminded. As a result, the forced commitment group presented no sample selection; whereas in the voluntary commitment group, interested participants self-selected into a commitment.Footnote 11

Table 8 Commitment rates by group

To understand self-selection in the decision to commit to a lower carbon footprint, we explore what drives individual commitment. A Probit regression (Table 9) indicates that the decision to voluntarily commit correlates with the symbolisation component of the environmental identity, and is higher in households with teenagers; the decision is unrelated to self-control, self-image, self-signalling or social signalling. Conversely, in the forced commitment treatment, those who try to proceed without ticking are older consumers, who may have low computer literacy, and consumers scoring low in the symbolised component of environmental identity. It is important to note that these results are correlational, not causal, due to endogeneity of these variables; they simply indicate that environmental preferences were related to the decision to commit. Finally, knowing that a badge will be present does not affect the likelihood of making a commitment.

Table 9 Probit regressions on the likelihood of commitment, by commitment type

A χ2 test shows that consumers who committed voluntarily were more likely to meet the low-carbon footprint threshold in week 2 than those who voluntarily did not commit (χ2 = 10.71, p = 0.001). A Probit regression (Table 10) shows that in week 2 the voluntary commitment increases the likelihood of meeting the low-carbon footprint threshold by around + 17%, relative to the control group, while a forced commitment increases this probability by + 14–15%. The presence of a badge has no impact on the likelihood of meeting the threshold. Finally, while males, older consumers and households with more adults are less likely to meet the threshold, consumers with high internalised environmental identity are more likely to meet it.

Table 10 Determinants of the achievement of the sustainability threshold, probit regression

6 Discussion

This research studied the role of commitment and badges in driving lower carbon footprint food choices. Being an exploratory study, we did not pre-register our hypotheses.Footnote 12 Results indicate that making a commitment prior to entering the store leads to a reduction in carbon footprint of around 9% when the commitment is voluntary, and 10–11% when this is forced. In the group where commitment is voluntary, the reduction is driven by those consumers who accept the commitment, which record a reduction of around 15% in carbon footprint. Compared to previous research, the effect of these intervention sits somewhere in the middle: manipulations that altered the architecture of a choice or prices (Panzone et al. 2021b, 2021c) show larger effects (reductions greater than 15%), while information (Kanay et al. 2021; Muller et al. 2019; Potter et al. 2022; Suchier et al. 2023) show the smallest reductions (even less than 5%), and significant variability depending on the information provided (Muller et al. 2019 report reductions of 8–12%). On the other hand, the goal-setting tasks in Kanay et al. (2021) indicate reductions (relative to the control group) of ~ 10–15%, close to the values observed in this study. Overall, this comparison is in line with Cadario and Chandon (2019), who find that affectively-oriented nudges are more effective than information provision, but less effective than structural changes in store (e.g., smaller portion sizes). This section summarises these results in more detail.

6.1 How Goal Commitment Influences Consumer Decisions

This study shows that asking consumers to commit prior to entering the online retailer increased the sustainability of their decisions. Compliance to the commitment is high even in the absence of any enforcement. Everyone committed when forced to do so, with no self-selection into the commitment. In the voluntary commitment, participants could self-select into the commitment, and unsurprisingly the largest correlation with a reduction in carbon footprint is with the subjects with environmental preferences who accept the commitment. The average treatment effect of the voluntary commitment, which includes both those who accepted and rejected the commitment (therefore unaffected by self-selection), is comparable to that in the forced treatment.

The mere presence of the commitment coming from a favourable authority (in this case, a trusted academic institution) might have been enough to motivate compliance (see also Karakostas and Zizzo 2016). This effect may have been especially evident in Kanay et al. (2021), because the way the goal is set is a representation of how an experimenter may demand a result from their subjects (de Quidt et al. 2018; Zizzo 2010). While insightful, this manipulation may be harder to replicate in natural world grocery shopping than our commitment.Footnote 13 The manipulation of Kanay et al. (2021), as well as our commitment manipulation, may operate also through a social norm compliance channel (as reviewed in Sect. 2). Thirdly, our commitment manipulation may specifically elicit a preference for promise-keeping (Vanberg 2008) or compliance to a social norm of promise keeping (Ellingsen and Johannesson 2004; van der Werff et al. 2019). On the other hand, we find no evidence that commitments operated by altering the self-image of the consumer.

Conversely, badges do not cause a significant reduction in carbon footprint, or in the likelihood of achieving the threshold for having a badge. While badges can increase self-efficacy (Sailer et al. 2017) and engagement (Hamari 2017), the limited impact indicates that consumers may not need a soft reward to motivate their compliance to a commitment. We conjecture that this result is likely linked to the private nature of the badge: in Baca-Motes et al. (2013), consumers received a signalling pin visible to third parties, therefore allowing for social signalling, which had a significantly positive impact on the reuse of hotel bath towels; on the other hand, van der Weele and von Siemens (2020) show that a bracelet—which cannot be seen by third parties—does not motivate pro-social behaviour. Our results support the latter research, suggesting that consumers use signalling badges to communicate their pro-social preferences to others, rather than to themselves. An alternative explanation is that the badge had no actual meaning to consumers: while respondents to a pilot survey viewed this badge as motivating, a more recognisable badge (e.g., a WWF badge) might have been more effective. Further research is needed to better understand the behavioural implications of soft rewards like badges.

6.2 Limitations and Future Research

A limitation of this research, typical of experimental work, is the short time window (two weeks), which limits the ability to observe the impact of a nudge over time. Having this article established the effectiveness of commitment in the short term, future research should look at having a longer time horizon. Another potential limitation refers to the ecological validity of the experiment. While the experimental procedure implemented an ecological context by delivering one food basket and removing interactions with the experimenter, it remains difficult to fully determine the external validity of this study, on two grounds. Firstly, many participants used the £25 budget; this decision may be a reflection of the preferences of participants, but may be also due to a desirability bias, whereby consumers expected their task was to reduce carbon emissions. Secondly, consumers may have reduced their carbon emissions in the experimental shop, using their own money to buy high-carbon food elsewhere, particularly as many participants considered this exercise part of a weekly shop. As commonly in experimental supermarkets, we are unable to control for such potential substitution between grocery shopping in the experiment and grocery shopping outside the experiment. As a result, our results may overestimate somewhat the absolute impact of the interventions we test. We note however that (Zizzo et al. 2021) provide evidence that the results of their supermarket interventions were unaffected by such substitution effects. Questions also remain over the scalability of the results in this article (Al-Ubaydli et al. 2019): moving from students (Panzone et al. 2021a, c) to the general population (in this study) gives slightly smaller effects, and research is needed to further explore the effect size in the real world.

Linked to incentive-compatibility, the ‘free’ £25 budget may have caused a house money effect (Thaler and Johnson 1990): participants may have felt the budget was a gift, which they spent on goods they would not normally buy (e.g., relative luxury goods). Experimental research shows this effect may not affect public goods (Clark 2002), and it did not occur in the pilot of another study using an experimental online supermarket (Zizzo et al. 2021).

6.3 Policy Implications: Retail Design and the Protection of the Public Good

The key policy implication of this research is that carbon footprint from grocery shopping can be reduced effectively through store design, designing motivating nudges that can be implemented quickly in store. Online environments are particularly suitable to the design of nudges and interventions targeting large-scale changes in behaviour (Lorenz-Spreen et al. 2020; Rogers et al. 2014; Todd et al. 2013). The commitments presented in this study can be easily implemented in online shops, in conjunction with rewards (a discount following a successful commitment in Schwartz et al. 2014), penalties (blocked withdrawals in Burke et al. 2018), or within a more complex gamified environment (Hamari et al. 2014; Hock et al. 2019; Whittaker et al. 2021). This is a promising area for further research. For instance, Schwartz et al. (2014) show that conditioning a promotion on the achievement of a health goal that consumers voluntarily commit to can increase fruit and vegetables sales. While Schwartz et al. (2014) enforced the commitment by linking it to the reward, our article shows that a soft commitment has the potential to be effective at scale (as in Burke et al. 2018; Himmler et al. 2019). Our commitment manipulations also have health benefits, particularly in terms of reduced purchased of fats but also, for forced commitments, that of salt.

Finally, the voluntary commitment nudge presented in this work can be seen as a hyper-nudge. Hyper-nudging refers to the use of nudges that are personalised around the user in either the choice task, or the method of nudging (Mills 2020; Yeung 2017). As an example, online retailers often target consumers with incentives (e.g., a discount on diapers for babies) based on past, related behaviour observed in the store (e.g., the purchase of baby food), using past purchases to reveal information (the household has a baby) otherwise unobservable to the marketeer. Hyper-nudges allow for more precise targeting in settings with large preference heterogeneity, optimising the use of resources. Compared with the forced commitment, the voluntary commitment contains information on the environmental preferences of the user, which the marketer can use to identify environmentally motivated individuals within the market. This information could be used to design different promotions that further reduce carbon footprint in the same domain (food) or in other environmental domains (e.g., energy).

7 Conclusion

This article reports the results from an experiment testing the role of commitments and badges promoting the reduction in carbon footprint from online food shopping. Using an experimental online supermarket and general public sample from the Northeast of England, we show that non-binding commitments can be significant in reducing the carbon footprint from food consumption. The findings presented in this article show that the design of the retail environment has an important impact on what consumers choose, and on the carbon footprint of their basket.