1 Introduction

Humanity faces the crucial challenges: increasing global food production and transitioning quickly to sustainable and climate-smart agricultural production systems (Sarkar et al. 2020) at limited and contested natural resources (Li et al. 2019; Popp et al. 2014). Intense traditional farming practices contribute to loss of biodiversity, greenhouse gas emissions and groundwater contamination (Conijn et al. 2018; Foley et al. 2011). Many of the ecological improvements in farming associated with low-input or organic production, cannot maintain productivity levels and may even cause land conversions (Smith et al. 2019).

Given that farms operate in the complex social-ecological system, i.e. farming extracts and markets ecosystem services and provides maintenance by interacting with the natural system (cf. McGinnis and Ostrom 2014), a number of crucial farm management and production decisions may have adverse effects on ecosystem functioning. Hence, the concept of Sustainable Intensification (hereafter SI) has been proposed to offset the adverse effects of agricultural production (Balaine et al. 2020; Baulcombe et al. 2009), and to sustain rural economies (Godfray and Garnett 2014). SI-based production systems have been developed to reduce environmental harm while maintaining yield levels (Pretty 2018), or foster yield growth in developing countries while preserving ecosystems (Pretty 1997; Schut and Giller 2020) and closing yield gaps (Ray et al. 2012). SI thus governs the socio-economic dimension of sustainability, while enhancing functionality of the natural ecosystem, i.e. the ecological dimension of sustainability (e.g., Gunton et al. 2016). Other conceptual definitions include aspects of the social dimension of sustainability explicitly, for instance cultural or ethical aspects (Garnett and Godfray 2012).

Practical evaluations of the outcomes of adopting SI measures on farm level are mainly based on field trial data (e.g., Paul et al. 2015; Townsend et al. 2016) or simulation-based approaches (e.g., Devkota et al. 2016; Mao et al. 2015; Scherer et al. 2018) and strongly focus on yield effects. Yet holistic approaches seem underrepresented. Few studies exist demonstrating that farms in developing and developed countries can improve their ecological and economic performance by adopting SI measures (e.g., Kassie et al. 2015). For European cases, Areal et al. (2018) or Gadanakis et al. (2015) assess SI with indicators to track economic and ecological farm outcomes but do not causally link the outcome indicators to the adoption of specific SI measures. Petersen and Snapp (2015) indicate that assessing SI is challenging as adopting and combining different agronomic measures may impact the production process in various ways. This may explain the continuing debate about which measure could achieve an improved balance between the economic and ecological sustainability of agricultural production. Mahon et al. (2018) even emphasise the need for concrete outcome specifications of SI, while acknowledging the processes of ecosystem maintenance and service extraction under the respective local natural system and governance settings.

In this paper we analyse agronomic examples of SI measures (e.g., wider crop rotations, reduced tillage, integrated pest management, and plant and/or site-specific technologies) and their potential offset environmental harm without sacrificing productivity (cf. Weltin et al. 2018 for an overview of SI measures). In order to evaluate the benefits of these different measures, we propose a meta frontier approach for measuring farms’ eco-efficiency and respective improvement potentials. We use rich survey data for a sample of SI adopters and non-adopters throughout the northern German Plain, which is characterised by peatlands. We follow the fundamental SI definition which states that SI practices can improve the balance between the economic and ecological sustainability of agricultural production without reducing either (Dicks et al. 2019).

We base our evaluation on a utility maximization model, where farmers opt consciously for a SI technology, and on a two-output (economic and ecological) production model and a directional meta-frontier approach to measure eco-efficiency (cf. Beltrán-Esteve and Reig-Martínez 2014; O’Donnell et al. 2008). The production frontiers of SI technology and traditional farming technology are enveloped by an eco-efficient system frontier. We assume that the system frontier offers the highest possible outputs in either direction. A farm’s eco-inefficiency in direction of the ecological output reflects the improvement potential in environmental performance, i.e. maintaining or enhancing the eco-system’s functionality, at a given level of economic outcome (cf. Kuosmanen and Kortelainen 2005).

We hypothesize that adopting SI offers an eco-efficient reduction of farms’ improvement potentials and farms provide more maintenance in the form of ecological output at no economic cost. Our analysis involves three aspects: we examine whether SI adopters determine the system frontier in the ecological output direction and refer to this as the technology effect. As a second aspect, we explore whether adopting the SI technology reduces farms’ improvement potentials, i.e., the distances to the system frontier in the direction of the ecological output, and as a third aspect, we acknowledge that farms’ performance in the chosen technology can impact the improvement potential and refer to this as the performance effect.

Given that farmers make production decisions based on utility maximization, observed differences in the environmental performance improvement potentials between the SI adopters and non-adopters can also relate to the structural differences of the two groups (e.g., natural and socio-economic conditions or environmental preferences). In line with Mayen et al. (2010), linking technology adoption decisions and eco-efficiency analysis is a pre-condition for identifying causal relationships. Therefore, following the theoretical behavioural framework of Chabé-Ferret and Subervie (2013), we assume a representative farmer will choose the SI technology, and the inputs and outputs that maximise utility and improve ecological and economic potentials. Figure 1 illustrates our research plan and the effects we investigate.

Fig. 1
figure 1

Research plan

We believe this is the first paper to provide a meta-frontier approach for evaluating SI by comparing improvement potentials in the direction of ecological outputs. We contribute to the environmental economics literature by using a matching algorithm to generate a control sample that reduces potential bias and causally interprets the differences in eco-efficiency through SI measures. The proposed meta-frontier approach separates the differences in improvement potentials between the SI adopters and non-adopters into a technology effect and a performance effect.

The remainder of this paper is organized as follows. Section 2 introduces the analytical framework, theoretical background and hypotheses. Section 3 describes the empirical model specification, data, and study region. Section 4 presents the results. Section 5 discusses the results. Section 6 concludes and offers suggestions for future research.

2 Conceptual Approach

2.1 Eco-Efficiency Analysis for SI Evaluation

The eco-efficiency analysis used in this paper offers an instrument for measuring the capability of farms to produce goods at the least environmental harm (e.g., Chen and Delmas 2012; Halkos and Petrou 2019; Huppes and Ishikawa 2005). Typically, the production possibility frontier in a two-output model consists of an economic and environmental output dimension (see Sect. 2.2). Adopting and implementing SI measures successfully governs the two dimensions by mitigating or eliminating the undesirable environmental output and maintaining the economic production output, or vice versa. The eco-efficiency approach is thus preferable to using a single evaluation of indicators, such as a cost–benefit analysis of agricultural income before and after SI adoption (see Smith et al. 2017 for an overview).

In this paper farm-specific deviations from the production possibility frontier denote eco-inefficiencies. Farm-specific but directional eco-efficiency measures indicate respective inefficiencies in the ecological and economic output direction separately (Callens and Tyteca 1999; Picazo-Tadeo et al. 2012; Tyteca 1999), and the measured eco-inefficiency scores reflect farms’ improvement potentials in the economic and ecological direction (Asmild et al. 2016).

Key for evaluating SI is the investigation to which extent farms exploit the respective improvement potentials. We follow the idea of sequential preferences by Asmild and Hougaard (2006), and presume that farm managers first aim at optimizing technical efficiency, that is, focus on optimizing in the economic output direction. Second, after meeting a certain economic threshold, and depending on their environmental preferences, they may improve the environmental output, along with SI adoption (see Sect. 2.3).

Our non-parametric directional approach based on Data Envelopment Analysis (DEA) relies on linear programming approaches to retrieve the production possibility frontier as the best practise frontier (Bogetoft and Otto 2011). Free of any distributional assumptions, the approach does not suffer from potential bias due to the functional form misspecification of the frontier. Farm-specific deviations from the best practise frontier in the ecological direction capture the ecological outcome improvement potentials without sacrificing economic performance (Zhou et al. 2018).

DEA approaches have been applied to eco-efficiency analyses of agricultural production, particularly for evaluating production systems that govern economic value creation at lower externalities, such as dairy farms’ greenhouse gas emissions (e.g., Pérez Urdiales et al. 2016), environmental nutrient impacts (Iribarren et al. 2011), soil loss due to erosion (Eder et al. 2021), pesticide uses for olive production (Gómez-Limón et al. 2012; Picazo-Tadeo et al. 2012) and crop production (Bonfiglio et al. 2017). DEA approaches have been applied to production types related to policy support, such as participation in agri-environmental schemes (Picazo-Tadeo et al. 2011), sustainable coffee production certification (Ho et al. 2018), labour- versus capital-intensive farming systems (Grzelak et al. 2019), regional differences (Coluccia et al. 2020) and farm sizes (Stępień et al. 2021).

We base our analysis on the behavioural model of Chabé-Ferret and Subervie (2013) to frame the decision to adopt SI measures for causal identification of farms’ respective improvement potentials. The model of was originally developed to identify (windfall) effects causally linked to participation in agri-environmental schemes in France; an issue that has been discussed for other regions (Arata and Sckokai 2016; Calvet et al. 2019), other policy measures, for instance decoupled payments (Esposti 2017a, b), and for other voluntary production choices such as organic farming (e.g., Cisilino et al. 2019). The behavioural model frames the role of environmental preferences while investigating farm level decisions to enter such policy schemes and has been mirrored in follow-up applications (Kuhfuss et al. 2016; Laukkanen and Nauges 2014; Mennig and Sauer 2020; Udagawa et al. 2014).

2.2 The Production Economic Model

We assume that a representative farm \(i\) produces output vector \({\varvec{Y}}\), with an economic \(Y_{econ}\) and ecological dimension \(Y_{ecol}\). We use agronomic SI measures to categorize the SI technology and the traditional farming technology with respective production technology sets \(\Psi_{j}\) with \(j = \left\{ {0;1} \right\}\), where \(j = 0\) indicates production without SI measures and \(j = 1\) indicates the SI-adjusted production system. In both technology sets, farm \(i\) chooses a variable input \(X\) and on-farm labour \(H\) to produce output \({\varvec{Y}}\). Fixed inputs \({\varvec{I}}\), such as human and physical capital, and unobserved factors \({\varvec{\varepsilon}}\), such as land quality, weather conditions or managerial ability, enter the production possibility sets:

$$\Psi_{j} = [(X,H,{\varvec{I}},{\varvec{\varepsilon}},\varvec{Y })|X,H,{\varvec{I}},\varvec{\varepsilon }\;can\; produce \;{\varvec{Y}}].$$
(1)

Following O’Donnell et al. (2008), the respective technologies, \(\Psi_{j} ,\) determine a common production system frontier, \(\Psi_{m}\), enveloping the SI and non-SI production possibility frontiers. The system production possibility frontier denotes the achievable production output for farms in a region, given the natural ecosystem and the governance settings. Figure 2 illustrates our production economic model.

Fig. 2
figure 2

System frontier \((\Psi_{m} )\) and the two group-specific frontiers, non-SI \((\Psi_{0} )\) and SI \(\left( {\Psi_{1} } \right)\)

The solid black line represents the system frontier \(\Psi_{m}\) and the dashed lines represent the respective production possibility frontiers of the two technologies (SI, \(\Psi_{1}\), versus traditional farming, \(\Psi_{0}\)). Farms producing on the system frontier \(\Psi_{m}\) such as A in Fig. 2 fully exploit their improvement potentials in both directions (eco-efficient), whereas farms A’ and A’’ deviate and can improve in either direction against the overall system frontier (economic or ecological). Directional distances from these points measured against A give the degree of eco-inefficiency in the farm’s ecological output direction for given economic outcome (e.g., Picazo-Tadeo et al. 2012). As we hypothesize that the SI technology determines the system frontier in ecological direction, A’’, offers improvements in ecological direction to A at the maximum under SI. If farms do not adopt SI and operate efficiently (A’ in Fig. 2 on \(\Psi_{0}\)), they still have unexploited improvement potentials when compared to the system frontier (technology effect). In Fig. 2, such improvement potential in ecological direction is denoted by the directional distance between A’ and A. While the distance A’’ to A’ suggests efficiency gains and efficient production when measured against the non-SI frontier, evaluated against the SI and the system frontier, inefficient.

Assuming that a farm producing efficiently under SI (\(\Psi_{1}\)) produces closer to the system frontier in an ecological direction than farms using traditional farming technology, leads to the following hypothesis:

Hypothesis 1

The SI frontier locates in the direction of the ecological output closer to the system frontier. Hence, SI adopters in this direction determine the system frontier.

2.3 The Behavioural Model

We assume that the observed and measurable respective improvement potential of a farm, denoted as \(\tilde{Y}_{j}\), results from two sequential decisions. First, the farm household’s decision to adopt SI determines the possible improvement in the ecological direction by the respective group-specific frontier. Second, the farm household’s decision regarding input allocation and intensity determines how eco-efficiently to operate with the chosen technology. Following Chabé-Ferret and Subervie (2013), we assume farms maximize utility as represented by utility function \(U\), and evaluate optimized production input levels \(X_{j}^{*}\) and on-farm labour time allocation \(H_{j}^{*}\) for both technologies. Both \(X_{j}^{*}\) and \(H_{j}^{*}\) are functions of the exogenous variables, such as prices, consumption shifters, preferences and fixed inputs, as denoted by \(g_{j}\) and \(h_{j}\), respectively.

Presuming that sequential preferences guide the farm household’s decision to adopt SI measures (cf. Asmild and Hougaard 2006), the SI farm aims to reduce the improvement potential in the ecological direction compared to their non-SI reference situation. We thus model their first stage decision-making such that the results from previous years and their experience enter the estimation of the reference improvement potential: \(\tilde{Y}_{0}^{*} = \tilde{Y}_{0} \left( {X_{0}^{*} ,H_{0}^{*} } \right)\).

The farm household’s utility maximisation problem is given by:

$$\mathop {\max }\limits_{{C, L, H, H_{off} , X}} U\left( {C, L, H,X, {\varvec{S}}, {\varvec{\eta}}} \right)$$
(2)

subject to:

$$Y_{econ} = f_{j} \left( {X,H,{\varvec{I}},{\varvec{\varepsilon}},Y_{ecol} } \right)$$
(3)
$$C = pY_{econ} - p_{x} X + wH_{off}$$
(4)
$$T = L + H + H_{off} ,$$
(5)

where utility \(U\) depends on levels of consumption \(C\), leisure \(L\), variable input \(X,\) and on-farm labour hours \(H.\) Variable input \(X\) and on-farm labour hours \(H\) reflect the dependence of utility on the farm’s preference for or rejection of certain input compositions. Consumption shifters \({\varvec{S}}\), such as age or education, and unobservable taste shifters \({\varvec{\eta}}\), such as ecological preferences or idiosyncratic non-farm profit opportunities also enter \(U\). Equation (3) above gives the transformation function in an explicit form regarding \(Y_{econ}\) according to the implicit function theorem (e.g., Sauer and Wossink 2013), and Eq. (4) states that the farm household sells \(Y_{econ}\) for price \(p\) with input costs at price \(p_{x}\) and quantities \(X\). The farm generates additional income from \(H_{off}\) hours of off-farm work remunerated by wage rate \(w\). Equation (5) constrains the total available time \(T\) of hours for on- and off-farm labour and leisure time.

Optimal input levels under non-SI are given by:

$$X_{0}^{*} = { }g_{0} \left( {p, p_{x} , w, T, {\varvec{I}}, {\varvec{S}},{{\varvec{\upeta}}}, {\varvec{\varepsilon}}} \right)$$
(6)

and

$$H_{0}^{*} = { }h_{0} \left( {p, p_{x} , w, T, {\varvec{I}}, {\varvec{S}},{{\varvec{\upeta}}}, {\varvec{\varepsilon}}} \right).$$
(7)

When applying SI, the farm’s input allocation is guided such that the improvement potential in the ecological direction, \(\tilde{Y}_{1}\), does not exceed the optimized improvement potential of the reference situation, \(\tilde{Y}_{0}^{*}\). This provides an additional voluntary constraint to the utility maximization problem. The constraint becomes applicable when the farm adopts SI (\(D = 1\)), where the reference improvement potential enters as a constant:

$$D\left( {\tilde{Y}_{1} \left( {X,H,{\varvec{I}},{\varvec{\varepsilon}},\varvec{ Y}} \right) - \tilde{Y}_{0}^{*} } \right) \le 0.$$
(8)

The voluntary constraint of Eq. (8) enters the first-order conditions:

$$\frac{\partial U}{{\partial C}}\left( {p\frac{{\partial f_{j} }}{\partial X} - p_{x} } \right) + \frac{\partial U}{{\partial X}} - \lambda \left( {\frac{{\partial \tilde{Y}_{1} \left( {X,H,{\varvec{I}},{\varvec{\varepsilon}},\varvec{ Y}} \right)}}{\partial X}} \right)D = 0$$
(9)
$$\frac{\partial U}{{\partial C}}\left( {p\frac{{\partial f_{j} }}{\partial H} - w} \right) + \frac{\partial U}{{\partial H}} - \lambda \left( {\frac{{\partial \tilde{Y}_{1} \left( {X,H,{\varvec{I}},{\varvec{\varepsilon}},\varvec{ Y}} \right)}}{\partial H}} \right)D = 0,$$
(10)

where \(\lambda\) denotes the respective Lagrangian multiplier.

Therefore, the optimized input choices under SI depend on the reference situation’s improvement potential, \(\tilde{Y}_{0}^{*}\). This counterfactual improvement potential works as a lower bound against which farms compare the respective improvement potential under SI. If the constraint is binding (\(\lambda \ne 0\)), the farm will adjust \(X\) and \(H\) but may be compensated by increases in utility, whereas if the constraint is not binding (\(\lambda = 0\)), the farm has no costs in terms of the constrained use of \(X\) and \(H\) when applying SI. Optimized input and labour allocation under SI are given by:

$$X_{1}^{*} = { }g_{1} \left( {p, p_{x} , w, T, {\varvec{I}}, {\varvec{S}},{{\varvec{\upeta}}}, {\varvec{\varepsilon}},\tilde{Y}_{0}^{*} } \right)$$
(11)

and

$$H_{1}^{*} = { }h_{1} \left( {p, p_{x} , w, T, {\varvec{I}}, {\varvec{S}},{{\varvec{\upeta}}}, {\varvec{\varepsilon}},\tilde{Y}_{0}^{*} } \right).$$
(12)

We note that if the farm’s expected improvement potential under SI, \(\tilde{Y}_{1}^{*}\), remains insufficiently large enough to increase utility compared to \(\tilde{Y}_{0}^{*}\) according to the environmental preferences, the farm will not adopt (\(D = 0\)) and Eq. (8) becomes irrelevant.

We use the farm’s decision outcome to measure the respective improvement potential \(\tilde{Y}_{j}\) as shown in Fig. 2. The farm decides on SI based on the indirect utilities, \(V_{1}\) and \(V_{0} .\) Indirect utilities depend on the same variables as \(\tilde{Y}_{1}^{*}\) and \(\tilde{Y}_{0}^{*}\). The implementation of SI may, however, induce search, implementation or information cost denoted by \(V\). Cost \(V\) potentially varies with education and experience and reduces indirect utility when choosing SI. The farm will adopt SI (\(D = 1\)) when the expected increase in indirect utility outweighs the cost of adoption:

$$D = {\varvec{1}}\left[ {E\left[ {V_{1} - V_{0} {|}{\varvec{Z}}} \right] - V \ge 0} \right]],$$
(13)

where \({\varvec{Z}}\) denotes the determinants of the adoption decision. The determinants may coincide with the determinants of input choices, such as environmental preferences, consumption shifters or fixed inputs. Since the adopters and non-adopters may systematically differ regarding their environmental preferences, we need to ensure comparability between the two groups before we assess the observed improvement potentials.

Thus far, we have assumed eco-efficient production under the respective technology. In the short-run, however, inefficiencies within the chosen technology may occur and farms may tolerate the adjustment costs of technology adoption in terms of output reductions (Ang and Oude Lansink 2017), although ignoring the possible inefficiencies of SI adopters would bias the retrieved improvement potentials (performance effect). For instance, a fully efficient non-SI farm could have a lower improvement potential compared to a weakly efficient SI-farm. Figure 3 illustrates two examples of non-SI reference situations.

Fig. 3
figure 3

Improvement potentials for different reference situations of farms A and B. SI adoption may shift farm production to A′ and B′ with reductions in improvement potential. B′′′ represents a situation when eco-inefficiencies in the SI technology increase the improvement potential compared to B

If farm A could achieve A’’, the improvement potential turns to zero, otherwise farm A may only be able to reduce the improvement potential up to a point A’, which is also achievable in the non-SI technology, due to eco-inefficiencies within the SI technology. Farm B is eco-efficient within the non-SI technology but could exhibit eco-inefficiencies in the SI technology that prevent moving to a point B’ that reduces the improvement potential. Assuming that performance B’’’ under SI corresponding to the ecological output level of A’ could even increase farm B’s improvement potential leads to the following hypotheses.

Hypothesis 2a

At the mean, for the same economic outcome, SI adopters produce at a lower ecological improvement potential than comparable non-adopters.

Hypothesis 2b

If SI adopters have a low within-technology performance in the chosen technology and comparable non-adopters have a high within-technology performance, the non-adopters have lower improvement potential than the adopters in some cases.

3 Data and Empirical Approach

3.1 Study Area, SI Measures, SI-Related Outcomes and Summary Statistics

Our study area is the northern German Plain characterised by abundant peatlands and lowland farming systems. These areas are classified as high-priority intervention areas in the European Common Agriculture Policy framework to protect carbon-rich soils and require adapting farming practices to meet Germany’s climate protection and biodiversity goals (TEEB 2015) and EU’s climate protection goals (European Commission 2020).

We use data from a survey of farming practices conducted between February and June 2017 for farms in the federal states of Brandenburg, Mecklenburg Western Pomerania, Saxony-Anhalt, Lower Saxony and Schleswig Holstein (for brevity, the full questionnaire is provided with the supplementary material). For the survey, farms in Brandenburg, Mecklenburg, and Western Pomerania were recruited in areas with at least 20% peatland area and 1000 ha of peatlands in total, and with more than 5000 ha of peatlands in total, based on postal code. Additional respondents were recruited via farmers’ associations in all five federal states. From the 464 observations in the spatial expansion, we used the 410 farms for which we observed adoption decisions for SI measures, and excluded 26 farms below 5 ha, for a total of 384 farms. Figure 4 illustrates the study area.

Fig. 4
figure 4

Map of the spatial expansion of the sample and response rates in North German Plain (based on Weltin and Zasada (2018). Note: 22 farms that did not give postal codes are excluded

The set-up and data, including sampling choices, selected SI measures and outcome definitions may affect the location of the eco-efficient frontier, and thus sensitivity respective improvement potentials (cf. Areal et al. 2018; Gadanakis and Areal 2018). Therefore, our approach contains a robustness analysis. We exclude the additional recruitments in in the first robustness scenario.

This evaluation study targets at investigating the contribution of agronomic SI measures in improving farms’ eco-efficiency, and therefore, regionally relevant agronomic SI measures have to be identified. Following Dicks et al. (2019) and Firbank et al. (2013), we support our choice by stakeholder opinions. Thus, prior to conducting our survey, we invited farmers, representatives of environmental protection agencies, and local administrators, to a workshop in the Rhinluch in Brandenburg, explaining the research and to select our SI measures. Given the study region’s urgencies to meet climate-protection and biodiversity goals by SI, our ecological outcome measures focus on on-farm and on-land diversity, and we selected six agronomic SI measures with potential contributions to biodiversity and thus ecosystem functionality of grassland and arable farming: (1) reduced tillage, (2) intercropping, (3) growing legumes, (4) integrated pest management, (5) grazing, and (6) extensive use of grassland. While in this study we focus on the economic and ecological sustainability dimensions, SI measures discussed in the workshop also governed social aspects (see Weltin et al. 2021).

To specify the ecological output dimension in the eco-efficiency model, \(Y_{ecol}\), we primarily focus on farm, farmland and crop diversity. We rely on several indicators classified as indirect or related to farmland use and management that correlate with biodiversity (cf. Bockstaller et al. 2011). Since landscape simplification is a strong predictor for losses in species richness and thus ecosystem functionality (Dainese et al. 2019), we follow a more holistic approach by Gibson et al. (2007) and consider farm-level heterogeneity between different landscape elements (on-farm diversity) and the diversity within each land use type (on-land diversity). We assign equal weight to both aspects of diversity in the final ecological output indicator (cf. Gan et al. 2017). For robustness scenarios 2 and 3, we vary the weighting scheme by assigning a higher weight of 80 percent to one diversity aspect. Table 1 reports the calculations of the indicators.

Table 1 Environmental output indicators and calculations

We use the Simpson diversity index to indicate on-farm diversity on cropped and non-cropped area, i.e. share of arable land, extensive grassland, and other grassland (cf. Van Eck and Koomen 2008). For non-cropped land, we observe the presence but not the amount of fallow land and flower or buffer strips. Acknowledging their high value for biodiversity (Herbst et al. 2017; Weibull et al. 2003), we assign them 50% of the weight in the overall indicator for on-farm diversity.

For on-land diversity, we measure the biodiversity in arable land by the number of different crops grown within a year (Matson et al. 1997). For grassland we use three indicators: we include the share of permanent grassland to total grassland. The indicator captures the carbon sink function of grassland (Barnes and Poole 2012). We use farms’ shares of permanent pasture that exceed regional averages to capture a biodiversity surplus extending the indicator of Areal et al. (2012). We use the third grassland indicator for the abundance of peatlands extensively managed or in conditions close to nature, with a high impact on carbon capture and biodiversity (TEEB 2015). We weight the three grassland indicators equally. We weight the indicators for arable and grassland by the respective share of each land-use type on the farm in the composite indicator for on-land diversity to reflect the heterogeneity of farm types in the sample. To account for potential farm-type effects, in the fourth robustness scenario, we exclude fully specialised farms that operate either only on arable land or only on pastures. Figure 5 maps the distribution of peat- and wetland areas in the studied region and the main land use types.

Fig. 5
figure 5

Map of main land use types and peatland areas in study region. Note: Land use visualistions based on CORINE Land Cover—10 ha (see http://sg.geodatenzentrum.de/wms_clc10_2012); red denotes artificial surfaces, yellow denotes agricultural areas, and green denotes forest and seminatural areas (for details see https://land.copernicus.eu/pan-european/corine-land-cover/clc-2012?tab=mapview)

We consider economic outputs and ecological outputs without modelling all inputs in the production process. We use the agricultural area as input for deriving the improvement potentials for farms of comparable size. To specify the economic output dimension in the eco-efficiency model, \(Y_{econ}\), we rely on a farm profit indicator provided on an ordinal scale with twelve categories and acknowledge its limitations compared to direct measures, such as agricultural income and associated risk (Smith et al. 2017).Footnote 1

Our modelling relies on a binary distinction between SI and non-SI technology so we need to define a cut-off value that distinguishes SI adopters from non-adopters. If we assume that SI measures best exploit their benefits via combinations (Benton et al. 2003; Kassie et al. 2015), we need an SI adopter to apply at least two SI measures. We use the median number of adopted measures in our sample, that is three measures, to avoid potentially overestimating the performance of SI adopters. In our robustness analysis, we define further scenarios with cut-offs at two and four measures, respectively.

We characterise farmers by their responses to the survey questions and exclude 43 observations with missing values for a total of 265 observations.Footnote 2 Table 2 lists the characteristics of the SI and non-SI farms and the summary statistics.

Table 2 Summary statistics for SI and non-SI farms before matching

The table indicates that SI farms are more likely to be full-time operations, operate at larger scale, their farmers are more highly educated (e.g., agricultural degree), and use agricultural extension services more often compared to non-SI farmers. SI farmers also have a stronger affinity for regional entrepreneurship. Differences in the unconditional means of variables self-indicated environmental awareness and regional attachment, however, are small.

3.2 Efficiency Model Specification and Matching Approach

We use a meta-frontier approach to measure the eco-efficiency scores to the system frontier and within-technology performance to the group-specific frontiers. Therefore, we rely on directional distance function (DDF) for measuring eco-inefficiency in either direction. We specify the DDF with outputs \({\varvec{Y}} = \left( {Y_{ecol} , Y_{econ} } \right)\), agricultural area input \(Q\) and define the direction vector \(g_{y} \left( {Y_{ecol} ,0} \right)\):

$$\vec{D}_{ecol, j} \left( {Q, {\varvec{Y}}; g_{y} } \right) = Sup\left[ {\beta_{ecol, j} :\left( {{\varvec{Y}} + \beta_{ecol, j} g_{y} } \right) \in \Psi_{m} } \right].$$
(14)

For within-technology performance, \(\Psi_{j}\) in Eq. (14) replaces \(\Psi_{m}\). Symbol \(\beta_{ecol, j}\) represents the proportion by which \(Y_{ecol}\) could be increased to reach the respective frontier. The ratio \(1/(1 + \beta_{ecol,j} )\) determines the fraction of the feasible output, which is the farm’s eco-efficiency in the interval \(\left[ {0;1} \right]\). The following relationship holds: eco-efficiency to the system frontier equals the meta-technology ratio (MTR) multiplied by group-specific eco-efficiency. The MTR is the farm’s distance to the system frontier if the farm produces on its group-specific frontier (Gómez-Limón et al. 2012). An MTR of 1 implies that the group-specific frontier coincides with the meta-frontier and offers to assess the technology effect of SI. Similarly, we calculate the farm’s eco-efficiency in the economic direction by setting the direction vector to \(g_{y} \left( {0,Y_{econ} } \right)\).

We use directional non-parametric DEA and opt for a full disposable hull technology to obtain the most cautious estimates of the respective improvement potentials. Since DEA results are sensitive to outliers with regard to inputs and outputs (Bogetoft and Kromann 2018), we use the minimum covariance determinant estimator for outlier control (Rousseeuw and Driessen (1999). We estimate robust Mahalanobis distances to assess the distance between an observation and the centre of the data (cf. supplementary material Sect. 2 for details). We observe land, profit and biodiversity indicators for 325 observations and eliminate 17 outliers. We use the R packages Benchmarking and Robustbase.

The SI adopters may differ in their farm(er) characteristics from the non-adopters as well, so that a comparison of the observed improvement potentials will not suffice to identify the causal differences. As sub-sample homogeneity is a precondition for the causal interpretation of outcomes when selectivity issues prevail (Bogetoft and Kromann 2018), we combine the eco-efficiency analysis with a matching approach. Using observed farm(er) characteristics \({\varvec{Z}}\) as covariates, the matching approach offers generating a counterfactual control group with the characteristics that resembles the group of SI adopters. Comparing the eco-efficiency of SI adopters and these estimated counterfactual group of non-adopters offers a causal link of eco-efficiency and SI. We use kernel density matching based on Mahalanobis distances, offering more robust results in smaller samples (Zhao 2004), and the Epanachnikov kernel function. Kernel matching allows us to assign several control observations to each SI adopter, thus reducing the variance of the estimation. We determine the bandwidth of the estimator by cross-validation to minimize the mean squared error regarding the averages of the covariates. We use the command kmatch in Stata14 and generate a sample of control observations from the weighted averages of the matched controls.

For the matching covariates \({\varvec{Z}}\), we select farmers’ education and experience, farm characteristics (full-time operation, specialisation in arable and organic farming), labour intensity to reflect the intensity of the farming operation., and use of advisory services offering knowledge input for the farm business. As farmers’ environmental preferences and sustainability attitudes have proven relevant (e.g., Hansson et al. 2018; Jongeneel et al. 2008) and our survey remains limited, we additionally consider farmers’ self-assessments for regional attachment, environmental awareness, and entrepreneurial attitude. Due to missing values, we exclude 43 observations (cf. Table A1 in Appendix A for descriptive statistics before matching).

4 Results

To increase the precision of the estimates, we exclude the eight SI farms and 16 non-SI farms out of common support (cf. Lechner and Strittmatter 2017). The final sample with achieved covariate balance (all standardized differences are below 0.25 (cf. Stuart 2010) for DEA consists of 149 SI farms and their matched control non-SI farms. On average, an SI adopter has 11.49 control observations as matches (see Appendix Table B1 for the standardized differences and group means before and after matching).

The results in Table 3 lend support to our three hypotheses (also see in the supplementary material Table 3.1 for the respective results of the robustness scenarios).

Table 3 Eco-efficiency scores in the direction of the ecological output for SI adopters and their matched controls

4.1 System Frontier and Technology Effect

The meta-technology ratio that captures the eco-efficiency to the system frontier is 0.99 on average and is equal to 1 for 78 SI farms. In other words, if all SI farms were eco-efficient to their group-specific frontier, they would also be or nearly be eco-efficient to the system frontier. An average MTR of nearly 1 with a small variance (max. 0.05) for adopters is a consistent pattern across the robustness scenarios. The average MTR is 0.86 for non-SI farms, where only 26 have an MTR of 1. The MTR range for non-adopters in the robustness check is between 0.77 and 0.93. We use the distributions of the eco-efficiency scores to test the differences in the location of the frontiers. Based on a Kolmogoroff–Smirnov test, we reject the null hypotheses that the distributions of eco-efficiency scores to the group-specific frontier and system frontier are identical for non-SI farms (D = 0.42; p = 0.00), but cannot reject them for SI farms (D = 0.07; p = 0.89). The results lend support to hypothesis 1 (The SI frontier locates in the direction of the ecological output closer to the system frontier. Hence, SI adopters in this direction determine the system frontier.).

4.2 Performance Effect

Table 3 above also shows that SI farms are on average more eco-efficient to the system frontier (0.75) than matched control non-SI farms (0.63), or a difference of 0.12 score points. The result is consistent with similar differences for all scenarios of the robustness analysis except for raising the weight of on-land diversity in the ecological outcome indicator (i.e., adopters are still more eco-efficient but the gap narrows to 0.04 score points; lowering the SI cut-off to the adoption of at least two SI measures yields the maximal difference of 0.14 score points). The results lend support to hypothesis 2a (At the mean, for the same economic outcome, SI adopters produce at a lower ecological improvement potential than comparable non-adopters.). SI farms also have a higher mean eco-efficiency score to the system frontier in the direction of the economic output (0.57) than without SI (0.37) (see Appendix Table B2).

Despite the on average higher eco-efficiency scores of SI farms, the empirical cumulative distribution of eco-efficiency scores in Fig. 6 shows a heterogeneous distribution of the respective eco-efficiency scores, although SI measures offer a higher potential to produce on the system frontier. Only 25 SI adopters and three non-adopters produce on the system frontier. Comparing distributions shows similar patterns to those in Fig. 6 (see supplementary material Supp. Figures 3.1a–g).

Fig. 6
figure 6

Empirical cumulative distribution of eco-efficiency scores to the system frontier for SI and non-SI farms

For SI farms, Fig. 7a shows that the empirical cumulative distribution of eco-efficiency scores in the ecological direction of SI farms to their group-specific frontier is very close to the distribution of scores to the system frontier with an average distance of 0.76. 66% of SI farms could improve their within-technology performance. For non-SI farms, Fig. 7b shows a divergence of empirical cumulative distributions of distances to group-specific frontier and system frontier. Non-SI farms’ average eco-efficiency score regarding the group-specific frontier is 0.74. The robustness analysis yields within-technology performance scores between 0.63 and 0.84 for adopters and 0.48 and 0.81 for non-adopters, respectively.

Fig. 7
figure 7

a Empirical cumulative distribution of eco-efficiency scores to the system and group-specific frontier for SI farms. b Empirical cumulative distribution of eco-efficiency scores to the system and group-specific frontier for non-SI farms

Figure 8 shows the effect of heterogeneity in within-technology performance on improvement potentials. For SI farms, 70% have a higher eco-efficiency score to the system frontier than their respective matched control non-SI farms. The increase in eco-efficiency is on average 0.22. Figure 8 also shows that 29% of SI farms have a lower eco-efficiency score than their matched control farms. The decrease in eco-efficiency is on average 0.12.

Fig. 8
figure 8

Difference in eco-efficiency to the system frontier for SI farms compared to their matched control non-SI farms

The results lend support to hypothesis 2b (If SI adopters have a low within-technology performance in the chosen technology and comparable non-adopters have a high within-technology performance, the non-adopters have lower improvement potential than the adopters in some cases.).

5 Discussion

Our aim was to evaluate agronomic SI in regional conditions characterised by peatlands at the farm level by using an eco-efficiency analysis. We relied on core SI definitions stating that the adoption of SI technology improved the balance between the economic and ecological sustainability of agricultural production without reducing either, and assumed that farms in our case study opt consciously for the SI technology. To account for farms’ selection process into SI, we combined a behavioural model with a matching approach.

Based on the Mahalanobis distance matching, the sampled SI and non-SI farms are comparable with regard to socio-economic and farm-specific characteristics and the measures that captured behavioural factors (see Appendix Table B1). These measures, however, rest on self-assessments and thus remain limited in interpretation. While differences in mean regional attachment were minor before matching, SI farms indicated a slightly higher entrepreneurial attitude (6.62 versus 5.25, see Table 1), whereas environmental awareness indicated small differences in mean (7.10 versus 6.97). These numbers imply unimportant differences in farmers’ self-perception between the unmatched groups, but could also be skewed by farmers’ responses to the questionnaire (i.e., only environmentally aware farmers responded or they responded by intention socially desirable), or by farmers’ homogenous awareness of the natural ecosystem in the study region. We note that the potential for bias could further limit investigating the behavioural factors for SI adoption (Weltin et al. 2021).

Results of the directional DEA meta-frontier showed that few farms could be eco-efficient to the system frontier without adopting SI. Hence, SI adopters mainly determined the system frontier and a technology effect of SI was clearly evident. This is in line with our argumentation of hypothesis 1 that SI farms could produce higher environmental output without sacrificing economic output when they took full advantage of the SI technology.

This suggests the following: in the short run, possible reductions in improvement potentials by SI farms were offered by the outward shift of the SI frontier (closer to the system frontier) compared to the non-SI frontier (see Fig. 2). For the majority of the sampled SI farms, adopting SI could in fact reduce the environmental improvement potential compared to their matched control references (i.e., eco-inefficiency in the ecological direction measured against the system frontier). This finding was in line with other studies on the effects of agronomic SI adoption on ecological output, such as diversity gains (Redlich et al. 2020).

SI farms produced 75% at the mean of the potentially possible ecological output, keeping land and economic output constant whereas non-SI farms produced 63% at the mean. This lends support to hypothesis 2a. Also, a performance effect of SI was evident, i.e., eco-inefficiencies within the SI technology could impede the SI technology’s positive effects, lending support to hypothesis 2b. For non-SI farms, improvement potentials resulted from a mixture of inefficiencies to the group-specific frontier and the fact that non-SI technology did not allow reaching the system frontier in most cases.

The almost perfect overlap of the system frontier and group-specific frontier of SI adopters in ecological direction was consistent across all scenarios of the robustness analysis. To some extent, the cut-off value of adopted SI measures, sampling or farm-type heterogeneity influenced eco-efficiency scores but our general observations on technology and performance effects did not.

Results also lend support to our assumption of sequential preferences (i.e., farm managers could prioritize to become more efficient in an economic direction prior to adopting SI measures and improving in the ecological direction). It is possible that our assumption explained why non-SI farms with traditional farming technology and rather high eco-efficiency scores in the environmental direction did not adopt SI, or that intentional inefficiencies (Hansson et al. 2018) explained SI farms’ larger distance to the frontier in an economic direction than in an ecological direction. After meeting a certain threshold in economic direction, SI farmers could have prioritised ecological outcome above economic efficiency gains when including non-financial values according to their environmental preferences in their utility function. Perhaps eco-inefficiency in the economic direction represented an adjustment or learning cost as farmers reduced ecological improvement potential with new technologies (Ang and Oude Lansink 2017).

In summary, our results empirically supported the core concept of SI that ecological improvement is possible without compromising economic outcomes. The heterogeneous distribution of the eco-efficiency scores is consistent with previous research on multidimensional performance assessments (e.g., Sidhoum et al. 2019). We suggest that the heterogeneity and increasing improvement potential for some SI adopters may be attributed to insufficient understanding of complex SI production systems (Kassam et al. 2011), and that biodiversity effects may only be obtained over time (Gabriel et al. 2013). High complexity and delayed visibility of ecological and/or economic benefits may also prohibit adopting SI technologies that improve ecological and economic outcomes in the short run (Dessart et al. 2019; Yeboah et al. 2015).

We note the following limitations of our research. The use of matching acknowledges the SI selection processes and ensures comparability between adopters and non-adopters, but the observational data and availability of measures for the behavioural factors were limited. We hope to address these shortcomings in future research with a more comprehensive investigation of the dispositional, cognitive, and social factors (Dessart et al. 2019), farmers’ opinions of SI and traditional technology, their understanding of the potential advantages and risks of adoption, and their knowledge of the region’s natural ecosystem, and governance system (e.g., Stupak et al. 2019). Encouraging large-scale adoption of SI and sustainable farming requires understanding of origins of eco-inefficiencies and overall performance effects in economic and ecological direction. Addressing climate protection goals further requires monitoring and coordinating activities at landscape scale given the correlation of eco-functionality with diversity in land use and management (Prestele and Verburg 2020). Larger field size and large-scale management will necessarily require more research staff and analysts.

Selecting the sustainability dimensions and SI measures could have introduced bias. While some studies take a system-perspective when analysing SI (cf. Mahon et al. 2018), we rely on agronomic SI measures governing the economic and ecological sustainability dimension only and ignored social, cultural or ethical aspects of SI as proposed by the broad SI concept (cf. Garnett and Godfray 2012). It is also possible that these aspects were not raised in our workshop to design the survey questionnaire. Our perspective allows us to identify the channels contributing to the heterogeneity in improvement potentials but may underestimate cross-effects from SI measures originally proposed to improve the social dimension of sustainability, for instance. Despite the robustness of our results to different specifications of SI adoption and outcome indicators, another limitation is the biodiversity measure by proxy indicators based on survey data at the farm level. We can only measure exact outputs on the farm, and comprehensive ecological indicator sets and appropriate weights are difficult to develop. Reliable region and country scale data and consistent definitions and applications of sustainability indicators could facilitate analysis at EU scale, where enhancing the Farm Accountancy Data Network could offer a starting point (Kelly et al. 2018). Too fine a regional perspective, particularly for heterogeneous regions, however, may limit generalization and transferability of the results to different agroecological, cultural and policy systems.

Our study’s narrow regional focus on EU’s high-priority intervention areas to protect carbon-rich soils however bears the advantage to analyse effects of contextual SI adoption from a microstructural perspective for a region largely shaped by peatland areas as potential carbon sinks (Busse et al. 2019; Häfner and Piorr 2020). Our approach and the results are thus argued to be transferable to comparable peatland regions under the same policy framework (Buschmann et al. 2020). We further suggest that our research will benefit investigations of regionally contextual farming measures that aim to overcome trade-offs in the objective function (e.g., building on the work of Kassie et al. (2015)).

Some authors also criticize the SI concept by not precluding specific measures to qualify for the concept “[…] to improve efficiency within existing market, production and land use constraints.” (Franks 2014, p. 73). Even though SI can act as an enabler for reaching eco-efficiency it may also contribute to a status-quo bias in not seeking a system change (Petersen and Snapp 2015). Thus “smart” combinations various productions systems, including organic production, eco-intensified systems such as agroforestry (Mosquera-Losada et al. 2018) or ecological modernized systems relying on circular systems (Rocchi et al. 2020), have been proposed (Meemken and Qaim 2018). SI production systems could be a baseline to build such combinations on. This is left for future research.

Our results have policy implications. Policies to support farmers in increasing their production efficiency should accompany support of technology adoption, for instance by trainings and organized field days to offer exchange platforms (Hüttel et al. 2020), e.g. these might reduce operational complexity and thus support learning about efficient use of SI practises. Besides fostering process improvements, we suggest a policy mix targeting the ecological outcome dimension directly. Remuneration of potentially foregone returns from altering the production process, e.g. by input reduction as in agri-environmental programs of the EU, has been shown to be less effective (e.g., Brown et al. 2021; Uehleke et al. 2022). Also, in our study participation in such programs seems not generally associated with SI adoption: 35% of sampled SI adopters do not participate in such programs, where only 15% receive remuneration from such programs for adoption of at least three SI measures, i.e. the cut-off used in this study. Thus, result-based agri-environmental support that rewards farmers for achieving ecological improvements could complement the policy mix (e.g., Burton and Schwarz 2013; Russi et al. 2016). Large-scale adoption of these schemes could further foster pro-environmental behaviour, reinforce SI and scheme adoption decisions, and increase overall environmental performance of farming (Pe'er et al. 2020).

6 Conclusion

In this paper, we proposed directional meta-frontier approach combined with matching to assess selected agronomic SI measures for improving ecological outcomes in agricultural land use. Connecting the theoretical behavioural framework of Chabé-Ferret and Subervie (2013) with a production economic framework, we accounted for farmers’ selection processes to either rely on traditional farming practises or adopt SI technology. Eco-efficiency scores to the system frontier estimated by directional DEA, captured farms’ ecological improvement potential. Matching ensured comparability of SI adopters and non-adopters, and to link the differences in eco-efficiencies to SI adoption. Survey data on farms in the northern German Plain was combined with information from stakeholder workshops used to determine context-specific SI measures and a composite indicator for biodiversity as an ecological outcome. SI farms had higher eco-efficiency scores to the system frontier compared to farms staying with traditional farming at mean. The relatively low within-technology performance of many however hampered full environmental improvement. In fact, there was only a small probability of reaching the system frontier without adopting SI. Future research should examine farmers’ characteristics and preferences in order to better compare the differences in eco-efficiency scores. We believe that our research, although confined to a small area, is also applicable to climate and sustainable policy-making at region, country, and EU levels. To facilitate exploiting full ecological improvement potentials, we suggest smart mixes of policy measures including result-based support to offer incentives for large scale SI adoption and for improving within-technology efficiency.