1 Introduction

Sensitivity to scope in nonmarket valuation refers to the property that people are willing to pay more for a higher quality or quantity of a nonmarket public good (Carson et al. 2001; Freeman et al. 2014; Mariel et al. 2021). Establishing significant scope sensitivity has been an important check of validity and a point of contention for decades in stated preferences (SP) research, primarily in contingent valuation (CV) surveys (Kahneman 1986; Mitchell and Carson 1989; Kahneman and Knetsch 1992; Desvousges et al. 1992; Diamond and Hausman 1994; Whitehead et al. 1998; Berrens et al. 2000; Heberlein et al. 2005; Lew and Wallmo 2011; Hausman 2012; Kling et al. 2012; Haab et al. 2013; Whitehead 2016).Footnote 1

At the one extreme, some researchers have claimed general methodological invalidity in light of the failure of some studies to establish statistically significant scope effects (Hausman 2012). Recently, however, several authors have made compelling arguments to the effect that the scope sensitivity and validity of a study cannot be assessed purely on the basis of tests of statistical significance (e.g., Amiran and Hagen 2010; Whitehead 2016; Lopes and Kipperberg 2020). The extent to which estimated scope effects are economically significant (McCloskey and Ziliak 1996; Thorbecke 2004) may be equally important. Related to economic significance are the concepts of adequacy, i.e., whether the estimated scope effects exceed a minimum threshold, and plausibility, i.e., whether the estimates are believable for the particular empirical context (Arrow et al. 1994; Whitehead 2016).Footnote 2

A specific measure proposed for assessing the economic significance of sensitivity to scope in CV studies is scope elasticity of willingness to pay (WTP) (Amiran and Hagen 2010). This elasticity measures the percentage change in WTP for a nonmarket good relative to the percentage change in its quantity or quality. Amiran and Hagen (2010) demonstrate that scope elasticities need only be greater than zero and less than one in the case of strictly convex neoclassical preferences. A resulting challenge is that elasticities close to zero may be difficult to detect statistically. Whitehead (2016) elaborates on the economic intuition underlying the concept of scope elasticity and applies it in a re-assessment of several CV studies that initially had their scope sensitivity questioned. He argues that the implied scope elasticities of WTP in these studies are within a plausible range and satisfy economic significance.Footnote 3

The issue of sensitivity to scope has also been explored in the discrete choice experiment (DCE) literature, but to a much lesser extent than for CV (Layton and Brown 2000; Lew and Wallmo 2011; Rakotonarivo et al. 2016; Johnston et al. 2017). As in CV studies, sensitivity to scope in DCEs implies that people are willing to pay more for a larger quantity or better quality of a good. For a good, as opposed to a bad, this is usually indicated as higher attribute levels, all else held equal. Depending on the experimental design, variation in the levels of quantitative attributes facilitates scope sensitivity examination through the estimation of indirect utility functions with linear and non-linear functional forms. For example, Layton and Brown (2000), estimate a piecewise linear indirect utility function to test whether WTP to avoid larger forest losses due to climate change is higher than WTP to avoid smaller losses. Lew and Wallmo (2011) perform scope tests across a number of protected endangered species as well as their protection levels. Both studies establish statistically significant scope effects. Neither study discusses adequacy, plausibility, or economic significance, though Layton and Brown (2000) refer to their results as “economically sensible” and “economically reasonable”.Footnote 4

It is important from both a methodological point of view and a policy perspective to further develop and include scope tests in DCE studies as well as in CV research. Methodologically, scope sensitivity continues to be discussed in relation to SP validity. As pointed out in the SP guidance by Johnston et al. (2017, p. 374): “Underlying the challenge for SP validity testing is the lack of general agreement on whether results from individual studies (or sets of studies) should be interpreted as evidence for or against the validity of the method in general. Recognizing this lack of agreement over what constitutes an acceptable validity test for SP studies, we recommend continued investigation of both current and new tests as an important area for future research.” From a practical resource management perspective, policymakers are typically interested in assessing different policy alternatives and associated attributes varying in magnitude (e.g., degree of environmental protection), with increasing opportunity costs. If the social benefits of the policy should turn out to be invariant to the public good provision levels, the optimal decision would be simple. The policymakers should choose the lowest cost alternative. In many circumstances, such a finding would seem implausible and not be useful for decision-making.

In this paper, we investigate scope effects through the lens of the scope elasticity of WTP concept. To our knowledge, no other DCE study has used this analytical framework. We provide a theoretical discussion, methodological perspectives, and a unique empirical application. We begin by formalizing scope elasticity of WTP both generally and specifically in the DCE context (Sect. 2). Then we review a selection of DCE studies from the environmental valuation literature and derive their implicit elasticity estimates (Sect. 3). The literature analysis leads to the following three observations: (i) explicit investigations of scope sensitivity in DCE studies seem uncommon; (ii) many studies assume unitary elastic scope sensitivities through their choice of restrictive functional form; and (iii) studies that utilize flexible functional forms tend to find inelastic effects, consistent with diminishing marginal utility from attribute improvements.

Following the literature discussion, we apply the scope elasticity of WTP concept to study preferences for expanding renewable energy in Norway (Sects. 4 and 5). We provide baseline results for two quantitative attributes (renewable energy production and wind power expansions) and investigate whether elasticity estimates vary across model specifications, geographic subsamples with different levels of familiarity and exposure, and experimental variation in the unit of measurement of the quantitative wind power attribute. This analysis is generally motivated by the lack of attention to DCE scope effects revealed by the literature review. More specifically, the exploration of familiarity and exposure is motivated by the existing literature on habituation to environmental change (e.g., Wilson and Dyke 2016; Zerrahn 2017) while the exploration of unit of measurement is motivated by emerging research on choice architecture and attribute representation (e.g., Hertwig and Grüne-Yanoff 2017; Ungemach et al. 2018). Both of these strands of research are relevant to the scope effects observed.

Overall, the analyses in this paper show that scope sensitivity can vary between attributes and across conceptual, methodological, and empirical dimensions of studies. Section 6 provides a summary of findings and concluding reflections with specific recommendations for future research.

2 Conceptual Framework

The concept of scope elasticity of WTP was first proposed by Amiran and Hagen (2010) to address the economic significance of scope sensitivity in CV research. Whitehead (2016) then applied the concept in simulation analysis and empirical illustrations. Existing CV studies that have subsequently reported scope elasticity estimates include Burrows et al. (2017), Borzykowski et al. (2018), and Lopes and Kipperberg (2020).

A major appeal of the scope elasticity of WTP framework is that it provides a unit-free measure of the ceteris paribus responsiveness of an endogenous variable of interest (in this case, WTP) to a change in an exogenous variable (in this case, environmental quality). As such, it is similar to other important elasticity measures in economics (e.g., own-price elasticisty of demand; input-price elasticity of supply; income elasticity of WTP). Specifically, the scope elasticity of WTP is defined as the ratio of percentage change in WTP to the percentage change in environmental quality. A scope elasticity of zero signals absence of impact, or no scope effect, whereas a scope elasticity of one means proportional responsiveness. Elasticity estimates within the 0 to 1 interval imply less than proportional, i.e., inelastic, impact. Such an elasticity would be expected under neoclassical microeconomic convexity priors regarding the trade-off between market and nonmarket goods (Amiran and Hagen 2010; Whitehead 2016). For example, a scope elasticity of 0.4 suggests that a 10% increase in environmental quality is associated with a 4% increase in WTP. However, the scope elasticity could also be greater than one, suggesting disproportionally large, i.e., elastic, responsiveness. Elastic WTP responsiveness to change in scope is consistent with increasing marginal utility of an economic good or increasing disutility from an economic bad (e.g., Layton and Brown 2000).

2.1 Defining the Scope Elasticity of WTP in General

Let \(WTP = WTP\left( {q,{\varvec{z}}} \right)\) represent a general value function for a representative consumer, where q is a scalar measure of the level of environmental quality and \({\varvec{z}}\) is a vector of other factors influencing the consumer’s valuation (including income). The scope elasticity of WTP (\(E_{WTP}\)) is then given by:

$$ E_{WTP} \equiv \frac{{\% \Delta WTP\left( {q,{\varvec{z}}} \right)}}{\% \Delta q} = \left( {\frac{{\partial WTP\left( {q,{\varvec{z}}} \right)}}{\partial q}} \right) \cdot \left( {\frac{q}{{WTP\left( {q, {\varvec{z}}} \right)}}} \right) $$
(1)

For a non-marginal improvement in environmental quality, say from q0 to q1, where q1 > q0, with associated change in WTP from WTP0 to WTP1 (WTP1 \(\ge\) WTP0), the midpoint formula can be utilized to define a scope arc-elasticity (\(\overline{E}_{WTP}\)) as follows:

$$ \overline{E}_{WTP} \equiv \frac{{\% \Delta WTP\left( {q,{\varvec{z}}} \right)}}{\% \Delta q} = \left( {\frac{{\Delta WTP\left( {q,{\varvec{z}}} \right)}}{\Delta q}} \right) \cdot \left( {\frac{{\overline{q}}}{{\overline{WTP} }}} \right) $$
(2)

where \(\Delta q = q^{1} - q^{0} > 0\), \(\Delta WTP = WTP^{1} - WTP^{0} \ge 0\), and \(\overline{q}\) and \(\overline{WTP}\) are, respectively, average environmental quality (\(\frac{{q^{0} + q^{1} }}{2}\)) and average WTP (\(\frac{{WTP^{0} + WTP^{1} }}{2}\)).

2.2 Defining Scope Elasticities in DCE

Scope sensitivity in DCEs means that people’s WTP for a specific attribute (good/bad) is (increasing/decreasing) in its level, all else held equal (Mariel, et al. 2021). However, multi-attribute discrete choice situations are typically motivated from a random utility model (RUM) framework, not via a direct valuation function, as above. Therefore, let indirect utility (U) be represented by \(U = V + \varepsilon\), where V is the deterministic component and \(\varepsilon\) is the random component (see e.g., Hensher et al. 2005). For the sake of simplicity, we ignore the latter term and focus on deterministic indirect utility. Let \(V = V\left( {{\varvec{p}},{\varvec{q}},M} \right)\) be a generalized deterministic indirect utility component, where \({\varvec{p}}\) is an exogenous price vector, \({\varvec{q}}\) represents nonmarket goods and amenities exogenously provided (including various environmental quantity and quality attributes), and \(M\) is exogenous consumer income. The utility an individual derives from any given policy or resource management scenario, say alternative j, is given by \(V_{j} \left( {{\varvec{p}},{\varvec{q}}_{j} ,M - F_{j} } \right)\), where Fj is the fee or tax payment for that scenario. Faced with J mutually exclusive alternatives, the consumer is assumed to prefer the alternative that yields maximum indirect utility, meaning that alternative i is chosen provided \(V_{i} \left( {{\varvec{p}},{\varvec{q}}_{i} ,M - F_{k} } \right) > V_{j} \left( {{\varvec{p}},{\varvec{q}}_{j} ,M - F_{j} } \right), \forall i \ne j\).

The ceteris paribus marginal willingness to pay (MWTP) for a change in the level of a specific attribute, say attribute s (\(q_{s} \in {\varvec{q}}\)), is given by the marginal rate of substitution (MRS) between that attribute and the consumer’s money income:

$$ MWTP\left( {q_{s} } \right) = MRS_{{q_{s} ,M}} = \frac{{\partial V\left( \cdot \right)/\partial q_{s} }}{\partial V\left( \cdot \right)/\partial M} $$
(3)

However, DCE researchers are often interested in non-marginal changes in amenity or attribute levels due to changes in public policy and management regimes. We therefore consider discrete changes in q and associated changes in WTP implied by the indirect utility given above. Let \(\Delta_{s}^{A} = q_{s}^{A} - q_{s}^{0}\) and \(\Delta_{s}^{B} = q_{s}^{B} - q_{s}^{0}\), \(\Delta_{s}^{B} > \Delta_{s}^{A}\) represent two different discrete increases in the level of attribute s, where both these increases are considered improvements. The two associated WTP measures (WTPA and WTPB) are defined implicitly from the indirect utility function in the following manner:

$$ V\left( {{\varvec{p}}^{0} ,{\varvec{q}}^{0} ,M} \right) = V\left( {{\varvec{p}}^{0} ,{\varvec{q}}^{j} ,M - WTP^{j} } \right),\;j = A\;or\;B$$
(4)

Subsequently, a scope arc-elasticity of WTP can be defined analogously to Eq. (2) as:

$$ \overline{E}_{WTP} \equiv \frac{\% \Delta WTP}{{\% \Delta q_{s} }} = \left( {\frac{{WTP^{B} - WTP^{A} }}{{(WTP^{B} + WTP^{A} )/2}}} \right)/\left( {\frac{{\Delta_{s}^{B} - \Delta_{s}^{A} }}{{(\Delta_{s}^{B} + \Delta_{s}^{A} )/2}}} \right) $$
(5)

For the linear specification of the deterministic indirect utility often employed in DCE research, that is, \( V_{j} = \alpha_{j} + {\varvec{\beta}}_{{\varvec{q}}} {\varvec{q}}_{j} + \beta_{M} \left( {M - F_{j} } \right)\), \(MWTP\left( {q_{s} } \right) = \beta_{{q_{S} }} /\beta_{M}\) and \(\overline{E}_{WTP} = 1\).Footnote 5 This means that the estimated scope elasticity is equal to one provided that the estimated MWTP is statistically significant and greater than zero. Since most researchers would agree that proportional responsiveness in a welfare estimate with respect to scope, i.e., \(\overline{E}_{WTP} = 1\), is economically significant, the restrictive linear functional form is meaningless for the purpose of attempting to distinguish between statistical and economic significance of scope effects. In order to explore such distinction, it is therefore necessary to employ more flexible functional forms.

2.3 Adequate, Plausible, and Economically Significant Scope Sensitivity

Amiran and Hagen (2010) show that neoclassical utility functions with strictly convex preferences have scope elasticity bounded by zero and one (Proposition 1, p. 59). Furthermore, \(\overline{E}_{WTP} = 1\) implies perfect substitution between environmental quality and market goods, whereas \(\overline{E}_{WTP} = 0\) suggests a perfectly complementary relationship. Importantly, many well-behaved preference representations can imply “arbitrarily small” scope elasticities. These results have important implications for empirical research. First, any given application may reveal relatively moderate, but legitimate, scope effects. Second, when the underlying scope sensitivity is low in the true data-generating process, it is more challenging to statistically distinguish scope elasticity estimates from zero.

Whitehead (2016) points out that the panel of experts formed by the National Oceanic and Atmospheric Administration (NOAA) to assess the CV method (Arrow et al. 1993) was as much concerned with economic significance as with statistical significance. Specifically, the NOAA panel was concerned with the adequacy or plausibility of estimated scope effects in CV studies (Arrow et al. 1993, 1994). Whitehead (2016) interprets adequacy as a sufficiency condition (i.e., a minimum threshold criterion). While the literature has yet to establish such a condition, it is evident from the conceptual analysis in Amiran and Hagen (2010) that it could be arbitrarily close to zero. In a follow-up to Arrow et al. (1993), Arrow et al. (1994) provide the following clarification: “Had the panel thought that something as straightforward as statistical measurability were the proper way to define sensitivity, then we would (or should) have opted for language to that effect. A better word than ‘adequate’ would have been ‘plausible’: A survey instrument is judged unreliable if it yields estimates which are implausibly unresponsive to the scope of the insult. This, of course, is a judgment call, and cannot be tested in a context-free manner”. In line with this sentiment, Whitehead (2016) favors using a case-by-base examination of whether scope effects are “plausible”, “believable” or “within the realm of possibility”. This recommendation is supported by his Monte Carlo scope elasticity simulations, which indicate that 95% of the draws lie between 0.630 and 0.998 in the case of a simple linear WTP function and between 0.177 and 0.971 in the case of a quadratic WTP function. A re-assessment of several previously contested CV studies reveals plausible scope elasticities between 0 and 1 (Whitehead, 2016).Footnote 6

3 Scope Elasticities in Previous DCE Studies

To our knowledge, no previous DCE study has explicitly analyzed scope elasticity of WTP for attribute improvements. Nonetheless, many studies report estimation results from which it is possible to infer or extract such scope sensitivity measure. Here, we first examine a purposive sample of studies from the broader environmental economics literature (Table 1).Footnote 7, Footnote 8 Since our own DCE application presented in Sect. 4 and 5 are set in the intersection between environmental and energy economics, we also examine prior DCE studies specifically related to wind power preferences (Table 2).Footnote 9 For each article, we identify the study context, which scope-relevant attributes were included in the DCE design, the functional form utilized in estimation, whether the article includes a discussion of the scope sensitivity issue, and the implied scope elasticities from its estimation results.Footnote 10

Table 1 Inferred scope elasticities of WTP in selected DCEs from environmental economics
Table 2 Inferred scope elasticities of WTP in previous wind power DCE studies

3.1 Examples From Environmental Economics

Table 1 summarizes the review of ten articles from environmental economics. Only three of these include a scope discussion of the reported results (Layton and Brown 2000; Zhao et al. 2013; Badura et al. 2020). Four studies employ a restrictive functional form that imposes unitary scope elasticity (Adamowicz et al. 1994; Zhao et al. 2013; Meyerhoff et al. 2016; and Ando et al. 2020). One study (Adamowicz et al. 1998) estimates both linear and quadratic functional forms, whereas one study (Badura et al. 2020) utilizes linear specification for one quantitative attribute (size of recreation area) and logarithmic for another quantitative attribute (distance to recreation site). The remaining four articles employ piecewise linear functional form, i.e., dummy coding for different attribute levels. The estimation results reported in Boxall et al. (1996), Sandorf (2019), and Hynes et al. (2020) imply scope elasticities less than one, while the results in Layton and Brown (2000) suggest elastic WTP responsiveness.

Adamowicz et al. (1994), Boxall et al. (1996), and Adamowicz et al. (1998) comprise three highly cited early DCE applications in environmental valuation. Adamowicz et al. (1994) use DCE as supplement to the travel cost method to analyze choice of recreational fishing site, with expected fish catch being a key attribute of the study. The fish catch attribute is highly significant in estimation with \(\overline{E}_{WTP} = 1\) by imposed linear functional form.

Boxall et al. (1996) compare DCE with CV methodology to study preferences for hunting sites. A main quantitative attribute is expected moose encounters. The implied scope elasticity of WTP for this attribute is 0.51 based on our extraction. Adamowicz et al. (1998) combine DCE and CV methodology to investigate non-use values associated with habitat conservation. The study has three quantitative attributes that lend themselves to scope analysis, namely, mountain caribou population, size of wilderness area, and number of forest industry jobs.Footnote 11 In their quadratic model the implied scope elasticity of WTP for the caribou population is 0.68. For the wilderness area attribute, the linear term is significant whereas the quadratic is not. This implies unitary scope elasticity for this attribute. The estimated linear and quadratic coefficients for the job attribute are insignificant, suggesting zero WTP and \(\overline{E}_{WTP} = 0\).

In two more recent studies on ecosystem service valuation, Sandorf (2019) reports welfare estimates for one attribute that lends itself to scope analysis (size of protected area) while Hynes et al. (2020) include three quantitative attributes (number of species, juvenile fish abundance, size of restored area). The inferred scope of elasticity of WTP is 0.8 in Sandorf (2019) and 0.32, 0.23, and 0.94, respectively, in Hynes et al. (2020).

Layton and Brown (2000) are the only authors to report welfare estimates that imply \(\overline{E}_{WTP} > 1\). The context of this study is preferences for avoiding adverse ecosystem impacts from climate change with a key attribute of interest being forest loss. The implied scope elasticity of WTP is 1.15 and 1.29 for a 60-year and 150-year time horizon, respectively. These estimates are indicative of increasing marginal disutility from forest loss and increasing marginal WTP to avoid this climate change impact.

3.2 Scope Elasticities in Wind Power DCE Studies

Table 2 summarizes our review of 22 wind power related DCE studies. As can be seen from the third column, this literature has explored a wide range of non-monetary attributes related to the renewable energy mix, characteristics of wind power expansions, landscape, ecosystem, and air pollution effects, and economic impacts. Noteworthy, none of the studies explicitly discusses the scope sensitivity issue or carries out any internal validity diagnostics related to scope.Footnote 12 Many of the attributes included in these studies preclude scrutiny of scope elasticity because they are described with categorical or qualitative representations (e.g., protection of cliffs, fauna, flora, and landscapes in Alvarez-Farizo and Hanley 2002). Furthermore, many of the quantitative attributes are entered linearly into estimation, which impose constant marginal utility and unitary scope elasticity of WTP (e.g., greenhouse gas emissions, electricity shortages, and jobs in Longo et al. 2008).

Piecewise linear specifications that permit testing for \(\overline{E}_{WTP} \ne 1\) are utilized in ten of the 22 studies: Ladenburg and Dubgaard (2009), Drechsler et al. (2011), Landry et al. (2012), Westerberg et al. (2013), Vecchiato (2014), Börger et al. (2015), Brennan and Van Rensburg (2016, 2020), Dalton et al. (2020), and Peri et al. (2020). For example, Drechsler et al. (2011) explore four quantitative attributes (size of wind farm, maximum turbine height, red kite population, minimum distance to residential areas).Footnote 13 The first two attributes do not exhibit statistically significant scope effects, implying zero scope elasticity, while the inferred scope elasticity of WTP is 0.76 for red kites and 0.29 for minimum distance.

Several other studies also include an attribute related to spatial proximity. The inelastic scope sensitivity with respect to minimum distance in Drechsler et al. (2011) is consistent with the inferred scope elasticities of 0.57 in Ladenburg and Dubgaard (2009), 0.88 in Westerberg et al. (2013), 0.35 in Vecchiato (2014), 0.39 in Brennan and Van Rensburg (2020), 0.88 in Dalton et al. (2020), and 0.36 in Peri et al. (2020). In contrast, the distance attributes in Landry et al. (2012) and Brennan and Van Rensburg (2016) do not exhibit significant scope sensitivity.

The key quantitative wind power attribute in our application below is the number of new wind turbines to be built in Norway. Six out of the 22 studies reviewed in Table 2 include such turbine attribute. This attribute is significant, with restricted unitary scope elasticity in Brennan and Van Rensburg (2016), Garcia et al. (2016), and Brennan and Van Rensburg (2020). In contrast, the turbine attribute is insignificant in Ladenburg and Dubgaard (2009) and Vecchiato (2014), and almost perfectly correlated with the statistically insignificant wind farm size attribute in Drechsler et al. (2011). Overall, this brief review reveals a general lack of attention to scope effects in the DCE literature to date.

4 Empirical Application

We analyze data from a recent DCE study of preferences relating to expansion of renewable energy production in Norway which had a specific focus on wind power externalities. The study was motivated by the Norwegian Government’s 2018 call for a long-term national plan for the expansion of wind power production on land. The Ministry of Petroleum and Energy assigned the Norwegian Water Resources and Energy Directorate (NVE) the tasks of providing an update of the scientific knowledge base and identifying the geographical areas of Norway that would be the most suitable for new wind farms.

The interest in expanding wind power production has two primary policy motivations. First, even though Norway is self-sufficient regarding renewable electricity, less than 2/3 of domestic energy consumption is met from renewable sources.Footnote 14 Second, the Norwegian Government wants to expand renewable production to meet international commitments towards transforming the global energy system and reducing carbon emissions. In 2018, the wind power industry generated 3–4 TWh per year on 30 sites with 610 wind turbines. An additional 30 projects with 600–700 new turbines had also been approved and were under planning or construction. With some of Europe’s best wind resources, the Government envisages that wind power production could reach 25 TWh per year by 2030, depending on production costs and prospective electricity prices (NVE 2019).

NVE’s work on the national plan started with the mapping of 43 areas distributed across different regions of Norway that were deemed to have high potential and meet basic eligibility criteria for new wind power deployment. NVE then examined each of these areas with respect to production and transmission capacity, stakeholder interests, and environmental impact. During this work, NVE commissioned multiple technical/scientific reports from external consultants, collaborated with the Norwegian Environment Agency, and solicited input from local and regional stakeholders in both the private and the public sector. This process led to the identification of a sub-set of 13 geographical areas proposed for future prioritization. The priority areas are located throughout Norway, with concentrations in Central and Western Norway, and comprise mostly coastal and mountain landscapes.

Despite the deliberate planning process, the final report (NVE 2019) met widespread criticism leading to intense debate in social and public media. Citizens expressed concern about the impact of wind power installations on Norway’s increasingly reduced pristine nature. Various environmental groups and outdoor recreation and tourism organizations protested. Local politicians objected on the basis that the plan would limit their local autonomy. Finally, the wind power industry itself opposed the plan because of the spatial constraints it placed on future expansions of production. Our study was conducted concurrently with NVEs planning process. Hence, we argue that our DCE study exhibits an unusually high degree of policy relevance and consequentiality.

Broadly speaking, consequentiality refers to ensuring that respondents believe that their responses to and overall results of the DCE could influence policy decisions with real monetary and public good-provision implications for them (Johnston et al. 2017). In addition to its timing, our DCE survey had many design elements to enhance consequentiality, including carefully explained attributes, realistic policy scenarios, and a credible payment vehicle. Furthermore, the information treatment leading up to the DCE choice tasks was carefully framed around the Government’s plan, the coordinating planning agency (NVE), and facts and figures from several scientific reports published prior to the plan’s release.Footnote 15 Next, we describe the features of our DCE design most central to the subsequent scope elasticity analysis in Sect. 5.Footnote 16

4.1 The DCE Design

The DCE survey was designed over a 15-month period starting in January 2018, with implementation in April 2019. An overarching design consideration was the objective of making the study relevant for national policy decisions. The selection and configuration of attributes and other elements of the choice architecture was the combined outcome of a careful review of the existing literature, input from a workshop with experts on valuation of wind power externalities, and feedback from two focus groups and several pilot tests, and following general SP guidance (e.g., Hoyos 2010; Johnston et al. 2017).

The final survey started with questions that elicited general opinions, awareness, and knowledge before guiding the respondents through information about Norway’s renewable energy production and potential plans for future expansions. Next, the respondents were provided details on the structure of the DCE, including careful descriptions of alternatives and attributes. At the core of the DCE, the respondents were asked to express their preferences on a series of choice cards. Standard debriefing, attitudinal, and socio-economic questions followed at the end of the survey.

Figure 1 provides an illustrative choice card. Each choice card contained three alternatives, status quo and two scenarios with expansion of energy production, varying in five attributes. The first attribute, new renewable energy production from all sources, had experimental levels of zero (no change), 10, 20, and 30 TWh per year. The second attribute, new wind turbines, had experimental levels of zero (no change), 600, 1200, and 3000 turbines. The third attribute designated prioritized region for new wind power production (no prioritization, Northern Norway and Central Norway, Western Norway, or Eastern Norway and Southern Norway). The fourth attribute was prioritized landscape type for new wind power production (no prioritization, coastal land, lowland and forest land, or mountain land). Finally, the fifth attribute, change in household’s monthly electricity bill (NOK), had experimental levels of − 450, − 150, zero (no change) + 150, and + 450.Footnote 17 We used the %ChoicEf and %MktBlock macros in the software SAS® along with the procedures described in Kuhfeld (2010) to generate the DCE design, with constraints imposed to avoid meaningless or unrealistic combinations of attribute levels.Footnote 18,Footnote 19 Specifically, a total of 24 choice sets were generated and divided into three blocks. Each survey participant was randomized into one block, thereby receiving eight choice tasks. The D-efficiency for the DCE design was 0.89.

Fig. 1
figure 1

Sample choice card (wind turbine version, translated from Norwegian)

The two quantitative non-cost attributes are of particular interest for the scope elasticity analysis in this paper. The first attribute is intended to broadly capture the nonmarket benefits of expanding Norway’s production of renewable energy. Both prior research and our focus group results indicate that people are positive to such expansion for reasons related to concern over energy security, support of greenhouse gas emission reduction, and a desire to stimulate economic activity. The second attribute is intended to capture specific preferences for wind power, holding constant the level of renewable energy production. As documented by prior research summarized in Mattman et al. (2016) and Zerrahn (2017), wind turbines and accompanying infrastructure (e.g., roads and power lines) have multiple adverse impacts. These impacts include habitat displacement, ecosystem fragmentation, negative effects on recreational experiences and visual landscape amenities, and issues related to noise and light-, shadow- and ice-casting. In total, these externalities can reduce the well-being of local residents (e.g., Gibbons 2015; Krekel and Zerrahn 2017), lower the growth potential of other regional industries such as tourism and recreation (e.g., Brökel and Alfken 2015), and generally threaten non-use values associated with the protection of pristine nature (Krutilla 1967).

4.2 Sampling Scheme, Experimental Design Variation and Implementation

During the survey development stages, previous experience and the likelihood of future exposure were identified as potentially important determinants of preferences. For this reason, it was decided to conduct the survey in two geographic regions with differential experiences and exposure. Specifically, we sampled Rogaland County in Western Norway and Oslo County in Eastern Norway with population sizes (shares) of approximately 476 000 (9%) and 681 000 (13%), respectively. Rogaland is the county that currently has most wind power production and could have substantially more in the future. In contrast, Oslo does not have wind power production and is also unlikely to have any in the future.

In our analysis, we investigate potential differences in scope elasticities across the two subsamples. The tentative a priori expectation is that wind power experience/exposure could affect both WTP and scope elasticity estimates. Previous research indicates that WTP to avoid adverse impacts from industrial development may be higher or lower as result of experience/exposure, depending on the mechanisms at play (Zerrahn 2017; Dugstad et al. 2020). However, this research is silent with respect to how experience/exposure might affect scope sensitivity. Consequently, we do not hypothesize a specific sign on expected difference in scope elasticities between the two counties.

In addition to the dual-region sampling scheme, we also implement experimental variation in the unit of measurement of the wind power attribute. Half the respondents were given choice cards with new wind turbines (as in Fig. 1), while the other half received cards with new production sites. The two survey versions were otherwise identical. These two measurement units were perfectly correlated (1 production site = 30 wind turbines; 30 wind turbines = 1 production site). This implies identical wind power production, land capture, and environmental impacts across the experimental versions. The motivation for this experimental treatment is an emerging literature on attribute translations, choice architecture, and signposting/nudging (e.g., Hertwig and Grüne-Yanoff 2017; Ungemach et al. 2018), which suggests that how an attribute is presented in a choice context, including its unit of measurement, is not arbitrary. Specifically, different measurement units can invoke different motivational associations or activate different objectives/goals (e.g., Dellaert et al. 2018; Schlüter et al. 2017). Consequently, the representation of an attribute may cause people to weight the attribute differently in the decision-making process. A change in unit of measurement could also potentially shift the weight of the attribute in question relative to other choice dimensions. Here, we investigate whether a seemingly innocuous change in unit of measurement, from number of wind turbines to number of production sites, alters scope elasticity estimates. This is particularly interesting since elasticities are unit free. In our split-sample DCE, a one percent increase in number of wind turbines is the same as a one percent increase in number of production sites. Hence, our tentative a priori expectation is that unit of measurement will not have an impact on scope elasticity estimates.

The data collection was implemented as an online survey using the pre-recruited internet panel of NORSTAT,Footnote 20 one of the leading survey companies in Norway. In total, 4404 individuals were invited to participate in the survey. The topic of the survey was not revealed in the survey invitation. The response rate was 24% and the dropout rate was 12%. Table 8 in the appendix provides basic descriptive statistics for the full dataset, the geographic subsamples, and the unit of measurement subsamples.

5 Empirical Analysis

The stated preferences in our DCE are motivated from a standard RUM framework with \(U_{njt} = V_{njt} + \varepsilon_{njt}\), where \(U_{njt}\) represents total indirect utility, \(V_{njt}\) is deterministic utility, and \(\varepsilon_{njt}\) is stochastic utility for respondent n associated with alternative j in choice occasion t. The underlying assumption of the RUM framework is that preference expressions are consistent with utility maximization such that alternative i is chosen by individual n in choice occasion t if and only if \(U_{nit} > U_{njt}\). The goal of the data analysis is to estimate deterministic utility as a function of observable attributes (\({\varvec{x}}_{njt}\)) and unknown preference weights or parameters (\({\varvec{\beta}}_{n}\)), in general, \(V_{njt} = V_{njt} \left( {{\varvec{x}}_{njt} , {\varvec{\beta}}_{n} } \right)\). The most common econometric approach is to use panel mixed logit techniques (Hensher et al. 2005; Train 2009). If we let \({\varvec{i}}_{n}\) represent the sequence of preference expressions made by respondent n over \(J\) alternatives in \(T\) choice occasions, then the joint panel mixed logit probability is given by:

$$ {\text{Prob}}\left( {{\varvec{i}}_{n} {|}{\varvec{\theta}}} \right) = \smallint \mathop \prod \limits_{t = 1}^{T} \frac{{\exp \left( {V_{int} } \right)}}{{\mathop \sum \nolimits_{j}^{J} \exp \left( {V_{jnt} } \right)}} f\left( {{\varvec{\beta}}_{n} {|}{\varvec{\theta}}} \right)d{\varvec{\beta}}_{n} $$
(6)

where \( f\left( {{\varvec{\beta}}_{n} {|}{\varvec{\theta}}} \right)\) represents a distribution function for the unknown preference parameters (\({\varvec{\beta}}_{n}\)) described by a set of coefficients (\({\varvec{\theta}}\)), typically central tendencies, variances, and covariances, to be estimated. This probability expression lacks a closed-form solution and must instead be simulated.Footnote 21

In our application, we specify a multivariate normal distribution for the preference parameters on non-cost attributes in order to account for multiple observations per respondent and preference heterogeneity. The normality assumption is flexible in the sense that it supports both positive and negative preferences, which is appropriate in our case. For example, many respondents will associate new wind turbines with disutility due to negative environmental impacts, while others may perceive them as generating a net good. Furthermore, the joint nature of the distribution permits preference weights for specific attributes or attribute levels to be correlated. For example, it seems reasonable to believe that high disutility (or utility) from one level of wind power expansion (e.g., 600 new turbines) is positively correlated with preferences for another level of expansion (e.g., 3000 new turbines).

Below we report results from three different specifications of deterministic indirect utility (\(V_{njt}\)). For convenience, the variable names and descriptions are provided in Table 3. The first specification (LINEAR) is restrictive and only included for comparison purpose. It imposes constant marginal utilities and scope elasticity of one for new renewable energy production and wind turbines. The second specification (QUADRATIC) brings more flexibility by adding squared terms for these two attributes. The third specification (PIECEWISE) enters the levels of these two attributes with separate indicators (i.e., dummy variables), which arguably provides the highest degree of functional form flexibility (Layton and Brown 2000). Deterministic utility for this specification can be written out as follows:

$$ \begin{aligned} V_{njt} = & \alpha_{SQ} + \beta_{1} COST_{it} + \beta_{2,n} TWH10_{njt} + \beta_{3,n} TWH20_{njt} + \beta_{4,n} TWH30_{njt} + \\ & \beta_{5,n} TURB600_{njt} + \beta_{6,n} TURB1200_{njt} + \beta_{7,n} TURB3000_{njt} + \beta_{8,n} MOUNT_{njt} + \\ & \beta_{9,n} LOW_{njt} + \beta_{10,n} COAST_{njt} + \beta_{11,n} NORTHMID_{njt} + \beta_{12,n} WEST_{njt} + \beta_{13,n} EASTSOUTH_{njt} \\ \end{aligned} $$
(7)
Table 3 Variables used in the estimation of deterministic indirect utility

The variable COST represents change in the household’s monthly electricity bill, while the variables TWH10, TWH20, and TWH30 are indicators for the levels of new renewable energy production and TURB600, TURB1200, and TURB3000 are indicators for numbers of new wind turbines. The remaining variables (NORTHMID, WEST, EASTSOUTH, MOUNT, LOW and COAST) are indicators for regional and landscape prioritizations. The parameter on the COST attribute (\(\beta_{1}\)) identifies marginal utility of money and is essential for deriving monetary values associated with changes in the level of non-cost attributes. For example, respondent n’s implied WTP for 10 TWh new renewable energy production is given by the expression \(- \beta_{2,n} /\beta_{1}\) Similarly, individual n’s incremental WTP for avoiding 3000 instead of 600 new wind turbines is given by the expression \((\beta_{7,n} - \beta_{5,n} )/\beta_{1}\).Footnote 22, Footnote 23

5.1 Baseline Results and Comparison Across Functional Forms

Estimation results for the full dataset are reported in Table 4. Overall, the three model specifications yield consistent patterns for key utility parameters. The estimated COST parameter is negative and highly significant, as expected. The average respondent obtains positive utility from expansion of renewable energy production and disutility from increasing the number of turbines, as indicated by the signs of the mean coefficients of the linear terms (TWH and TURB). The signs of the coefficients of the quadratic terms (THW2 and TURB2) in the QUADRATIC model indicate diminishing marginal utility from new renewable energy production and diminishing marginal disutility from new wind turbines. These preference patterns are also reflected in the PIECEWISE estimation. For example, the difference between the mean coefficients of TURB600 and TURB1200 is larger than the difference between the mean coefficients of TURB1200 and TURB3000. The results for the indicators of prioritized regions and landscapes, which are of second-order interest for the research focus of this article, are mixed.Footnote 24 The estimated standard deviation coefficients are generally large and significant, suggesting substantial preference heterogeneity. Lastly, the overall goodness-of-fit statistics (pseudo-R2, AIK, and BIC) indicate that the PIECEWISE model is statistically superior.Footnote 25

Table 4 Full sample panel mixed logit parameter estimates for different functional forms of deterministic indirect utility (linear, quadratic, and piecewise linear)s

Figure 2 illustrates the estimated scope effects by providing empirical scope lines, which are linear interpolations of welfare estimates across the experimental attribute levels (Lopes and Kipperberg 2020). Specifically, the figure provides scope lines for the PIECEWISE and QUADRATIC models based on the estimated mean WTP for 10, 20, and 30 TWh of new renewable energy production and the estimated mean WTP to avoid 600, 1200, and 3000 new wind turbines, respectively. The numbers are in Norwegian kroner (NOK) on a per household per month basis.Footnote 26

Fig. 2
figure 2

Empirical Scope lines—Full sample WTP per household per month (mean and 95% CI) for PIECEWISE and QUADRATIC for attributes TURB (number of turbines) and TWH (renewable electricity production in TWh). Note: Confidence intervals are computed by the Krinsky & Robb method (1986)

Table 5 offers a full statistical analysis of the welfare estimates (WTPs) from all three models and estimated scope elasticities (\(\hat{E}_{WTP}\)) between the lowest and highest attribute levels. As can be seen, the LINEAR model has the lowest welfare estimates, which increase monotonically due to the constant marginal utility restriction. The QUADRATIC and PIECEWISE specifications generate somewhat higher WTP estimates. For example, the estimated mean WTP to avoid 600, 1200 and 3000 turbines are NOK 41, 82, and 205 in the LINEAR model versus NOK 256, NOK 362, and NOK 373 in the PIECEWISE model.

Table 5 Welfare estimates and scope elasticities (mean and 95% CI) from full sample panel mixed logit estimations for different functional forms of deterministic indirect utility (linear, quadratic, and piecewise linear)

The estimated utility coefficients in Table 4 together with the corresponding welfare measures in Fig. 2 and Table 5 establish the presence of scope sensitivity. In general, the welfare measures for higher attribute levels are greater than those for lower levels. Furthermore, the estimated scope effects are statistically significant. For the LINEAR model, statistical significance follows directly from the significance of the estimated mean coefficients of THW and TURB. In the QUADRATIC and PIECEWISE cases, statistical scope significance is indicated by fact that the differences in WTP estimates for the highest versus the lowest attribute levels are statistically greater than zero. For example, the simulated difference in mean WTP to avoid 3000 vs. 600 new wind turbines is NOK 117 in the PIECEWISE model. The 95% confidence interval for this difference has a lower bound of NOK 51 and an upper bound of NOK 179.

With regard to the estimated scope elasticities, the LINEAR model imposes unitary elastic scope sensitivities. Hence, \(\hat{E}_{WTP} = 1\) for both attributes. In the QUADRATIC and PIECEWISE models, \(\hat{E}_{WTP}\) for avoiding new wind turbines evaluated from 600 to 3000 wind turbines is 0.416 and 0.288, respectively. Similarly, \(\hat{E}_{WTP}\) for new renewable energy production evaluated between 10 and 30 TWh is 0.264 in the QUADRATIC model and 0.269 in the PIECEWISE model. The confidence intervals around these estimates indicate that both elasticities are statistically greater than zero and less than one, i.e., suggesting inelastic responsiveness in WTP. Combined, these results suggest that the functional form of deterministic indirect utility may influence scope inference in DCE studies. Below, we investigate the data further at the subsample-level using the PIECEWISE specification. Results are reported in Figs. 3 and 4 and Tables 6 and 7.

Fig. 3
figure 3

Empirical Scope lines—WTP per household per month (mean and 95% CI) by geographic subsample (Oslo and Rogaland counties) for the attributes TURB (number of turbines) and TWH (renewable electricity production in TWh) based on piecewise linear specification. Note: Confidence intervals are computed by the Krinsky & Robb method (1986)

Fig. 4
figure 4

Empirical Scope lines—WTP per household per month (mean and 95% CI) by unit of measurement subsample (1 site = 30 turbines) for the attributes TURB (no. of turbines) and TWH (renewable electricity production in TWh) based on piecewise linear specification. Note: Confidence intervals are computed by the Krinsky & Robb method (1986)

Table 6 Sub-sample panel mixed logit parameter estimates for piecewise linear functional form (PIECEWISE) of deterministic indirect utility (Oslo vs. Rogaland counties; sites vs. turbines unit of measurement)
Table 7 Sub-sample welfare estimates and scope elasticities (mean and 95% CI) from mixed logit estimations for piecewise linear functional form of deterministic indirect utility (Oslo vs. Rogaland counties; sites vs. turbines unit of measurement)

5.2 Comparison Across Geographic Subsamples

The patterns of estimated utility parameters in terms of signs, significances, and relative magnitudes from the geographic subsample estimations (Table 6) are similar to those for the full sample estimation (Table 4). Furthermore, all empirical scope lines in Fig. 3 are upward sloping, suggesting presence of scope effects in both subsamples. However, the scope lines for ROGALAND appear steeper than those for OSLO by visual inspection, especially when it comes to the wind turbine attribute. This observation is supported by the estimated welfare effects and scope elasticities summarized in Table 7. The estimated difference in WTP to avoid 3000 vs. 600 wind turbines in the ROGALAND model (436 − 201 = NOK 235) is higher than in the OSLO model (307 − 237 = NOK 70). Correspondingly, \(\hat{E}_{WTP}\) is 0.576 for the ROGALAND subsample versus 0.216 for the OSLO subsample. This subsample difference in scope elasticity is statistically significant at a 10% level (P-value = 0.077) by the full combinatorial convolution method suggested by Poe et al. (2005), see the third numeric column of Table 7. In combination, the higher WTPs and scope sensitivity associated with the turbine attribute in the Rogaland subsample suggest that experience and exposure may adversely affect wind power acceptance in Norway.Footnote 27 When it comes to scope elasticity of WTP for new renewable energy production, \(\hat{E}_{WTP}\) between 10 and 30 TWh is slightly higher in the ROGALAND model (0.346) than in the OSLO model (0.268). However, the difference is not statistically significant (P-value = 0.347).

5.3 Comparison Across Unit of Measurement Subsamples

Figure 4 and Table 7 summarize WTP and scope elasticity estimates for the two unit of measurement subsamples, while the underlying panel mixed logit results can be found in Table 6. The estimated models are referred to as TURBINES and SITES, respectively. Bear in mind that the only difference between the two DCE versions was the unit of measurement for the wind power attribute. Specifically, the number of wind turbines in one version versus the number of production sites in the other, where one production site was described as comprising thirty wind turbines. The WTP estimates for the wind power attribute are reported on a per turbine basis for comparison.

Figure 4 shows the scope lines overall to be upward sloping, suggesting presence of scope sensitivity. However, the scope line associated with WTP to avoid wind turbines is flatter and non-monotonic in the SITES model, while the scope line associated with WTP for new renewable energy production is flatter in the TURBINES model. These visual patterns are reflected in the estimated scope elasticities reported in Table 7. \(\hat{E}_{WTP}\) for avoiding new wind turbines is 0.578 in TURBINES versus 0.325 in SITES, while \(\hat{E}_{WTP}\) for new renewable energy production is 0.551 in SITES versus 0.130 in TURBINES. While the former difference is not statistically significant (P-value = 0.167), the latter is significant at a 10% level (P-value = 0.061). Interestingly, the TURBINES model in general yields higher WTP estimates than the SITES model. For example, the difference in WTP to avoid 3000 vs. 600 wind turbines is NOK 226 in the TURBINES model and NOK 95 in the SITES model. The difference across models is statistically significant with P-value = 0.041.

Contrary to our tentative a priori expectation, it appears that the choice of unit of measurement for a key attribute, which in our case is the one that reflects deployment of wind power in Norway, influences welfare estimates and scope elasticities. One potential explanation for the observed difference could be that wind power turbines may invoke stronger associations to negative environmental impacts than the term wind power production sites. Interestingly, the unit of measurement for the wind power attribute also seems to have an impact on the analysis of other attributes of interest, in our case, new renewable energy production. In combination, these findings suggest that choice of attribute representation may influence scope inferences in DCE studies, even when the difference in the available metrics may seem innocuous from a design perspective. While the influence of attribute translations (Hertwig and Grüne-Yanoff 2017; Ungemach et al. 2018) has not been a primary focus of this paper, we are not aware of prior explorations of this topic in the environmental valuation literature, at least not in the context of scope sensitivity analysis.

5.4 Subsample Results in WTP Space

We conduct robustness checks of the findings in 5.2 and 5.3 by re-estimating the subsample models in WTP space with lognormal cost parameter (Train and Weeks 2005). Estimated WTPs and scope elasticities results are summarized in appendix Tables 10 and 11. The results are generally similar to those reported above with respect to WTP and elasticity magnitudes and comparisons across geographic subsamples (ROGALAND vs. OSLO) and unit of measurement subsamples (TURBINES vs. SITES). One difference is that estimation in WTP space appears to yield tighter confidence intervals around the estimates, which amplifies the differences across the subsamples. There is also a tendency towards slightly higher, albeit still inelastic, scope elasticity estimates in WTP space. For example, \(\hat{E}_{WTP} \) for wind turbines in the OSLO model is 0.444 in WTP space versus 0.216 in preference space. However, due to the imprecision of the latter, this difference in not statistically significant (P-value = 0.110). A notable exception is that \(\hat{E}_{WTP}\) for wind turbines is insignificant in the SITES model estimated in WTP space. The estimate of 0.021 is statistically different from the estimate of 0.325 from the corresponding model estimated in preference space with fixed cost parameter (P-value = 0.037). This finding suggests that econometric model assumptions and specifications may also affect estimated scope effects.

6 Concluding Remarks

Investigating the significance of scope sensitivity remains an important validity check in SP research, but it is important to make a distinction between statistical and economic significance (Amiran and Hagen 2010; Whitehead 2016; Lopes and Kipperberg 2020). This paper is the first to carry out a systematic investigation of the significance of scope effects in DCE studies using the scope elasticity of WTP concept. Specifically, it provides a conceptual exposition and a twofold empirical analysis. First, we analyze a selection of DCE studies from the environmental valuation literature and derive their implicit scope elasticities. Second, we apply the scope elasticity concept in an analysis of primary data from a DCE on preferences for renewable energy expansions in Norway.

From the literature analysis we observe that explorations of scope sensitivity as a validity check, or for any other reason, are scarce in DCE studies. Furthermore, many studies employ a restrictive linear functional form for deterministic indirect utility, which implies a scope elasticity of one. When more flexible specifications are employed, such as quadratic or piecewise linear, there is a tendency toward inelastic scope sensitivity, i.e., a scope elasticity greater than zero but less than one.

There are several possible explanations for the lack of scope sensitivity focus in DCE research. One obvious explanation is that the DCE design process of identifying salient choice features typically ensures that key attributes are statistically significant in estimation. Related to this is the added statistical precision that comes from the practice of utilizing multiple choice tasks per respondent. However, as pointed out in this paper, statistical significance is not the same as economic significance. It is worth reflecting on the fact that the dearth of scope investigations in the environmental DCE literature coincides with a general lack of attention to functional form and utility-theoretic properties in RUM applications (Hoyos 2010; Mariel et al. 2021). In turn, this deficiency has implications for the ability to differentiate between statistical and economic significance in estimated effects. As observed by Johnston et al. (2017: p. 361): “Many published SP studies facilitate estimation by assuming a utility function that is linear and additively separable (with constant marginal utilities). Although such functions may serve as a useful local first approximation, these implicit assumptions will not always hold. Among the concerns in this area is the likelihood that preferences will exhibit nonlinearity (e.g., diminishing marginal utility or nonconstant marginal rates of substitution between attributes). Such possibilities can be accommodated using richer specifications for preference or welfare functions.

In our own empirical application, we find positive mean preferences for new renewable energy production and negative mean preferences for new wind turbines, ceteris paribus. For example, estimated mean WTP for avoiding 600 and 3000 wind turbines are NOK 256 and NOK 373 per household per month, respectively, in the flexible piecewise linear specification for the full sample. The simulated scope arc-elasticity of WTP between these two attributes levels is 0.288. In general, our estimation results suggest sizeable differences in WTP across attribute levels, with estimated scope elasticities varying between 0.13 and 0.58, depending on the attribute analyzed (new renewable energy or new wind turbines), model specification (quadratic or piecewise linear), geographic subsample (Rogaland County versus Oslo County), and the unit of measurement (wind turbines versus wind power production sites).

While there is no strict, universally applicable benchmark for assessing the economic significance of scope effects, we deem these scope elasticities to be of an adequate and plausible order of magnitude. Thus, the results from our study can be used to inform policy decisions related to renewable energy investments. In particular, the welfare estimates for different levels of wind power expansion can be utilized as valid inputs in benefit–cost analyses and optimization models for the sizing and siting of future wind power in Norway. It may also be possible to employ scope elasticity estimates from our study, or scope elasticities derived from other existing studies, in benefit transfer exercises, provided context-specific baseline welfare estimates are available.

We end this paper by providing some take-home messages of best practices and directions for future research. First, we think it should be standard practice for DCE practitioners to include explicit statements regarding internal validity in general and scope sensitivity in particular in reporting from their studies. Second, the selection of functional form of the underlying value function (e.g., the deterministic component of indirect utility in the standard RUM framework) should be carefully explained and justified. The piecewise linear specification recommended by Layton and Brown (2000) and employed in our own analysis provides a high degree of flexibility, while the simple linear functional form should be avoided. Third, formal scope sensitivity diagnostics should be carried out for all quantitative attributes. Specifically, we recommend that scope elasticity estimates are reported as part of the standard output, i.e., alongside the usual econometric estimation results and monetary value estimates. Fourth, ex ante DCE design adaptations should be made to facilitate ex post scope sensitivity analysis. In particular, the selection of units of measurement, attribute levels, and experimental design must enable estimation of models that relax the assumption of constant marginal utility and facilitate flexible identification of scope elasticities.

A fruitful general direction for future research on scope sensitivity in DCE is to systematically examine how scope elasticities are influenced by various conceptual, methodological, and empirical dimensions. In this paper, we limited our data analysis to the mean scope elasticities and examined how these varied across attributes, geographic sub-samples, and experimental variation in the wind power attribute. However, scope elasticities are likely to vary across individuals, valuation contexts, and a multitude of choice architecture dimensions. For this reason, one specific direction for future research would be to expand the literature analysis in this paper with the aim of compiling a large dataset of extracted scope elasticities, which could then lend itself to meta-regression analysis. Such analysis could be limited to DCEs or include CV studies. A second specific direction for future research would be to analyze determinants of individual-specific scope elasticities. For example, it is possible to extract conditional, individual-specific WTP estimates from panel mixed logit estimations, which, in turn, can be utilized to predict individual-specific scope elasticities. By linking these elasticities to respondent-level information, one would be able to explore a wide range of conceptual hypotheses on factors that may influence scope sensitivity (e.g., income, age, education, knowledge, perceptions, familiarity, etc.) through auxiliary regressions. Such analysis would be particularly attractive if it involved several datasets spanning multiple attributes from various valuation contexts. A third specific direction for future research would be to explore potential differences in sensitivity to scope across experimental variations in the experimental design. For example, a so-called design-of-design approach (e.g., Hensher 2006; Weller et al. 2014) could be taken to analyze how choice architecture and researcher’s design decisions influence welfare estimates per se and the estimated scope elasticities. Ultimately, an important aim of all this research would be to better understand the underlying reasons for scope (in)sensitivity and improve designs to better capture true preferences for environmental change in future SP research. This is also crucial for making resulting welfare estimates (more) useful for benefit-cost analysis, both in applications of the original estimates and in benefit transfer exercises.

Finally, we emphasize that we believe that true scope sensitivity varies naturally across people and valuation objects in the real world. Consequently, it is not possible to define a single threshold value from which internal scope validity of a study can be inferred. Therefore, we believe that the adequacy, plausibility, and economic significance of estimated scope effects must be assessed on a case-by-case basis.