Background

Assessments of bioenergy systems

The last decade has seen major development and adoption of bioenergy in Europe, particularly in Germany [13]. This has resulted in a scattering of decentralised bioenergy plants across the landscape, mainly due to their dependency on spatially diffuse biomass resources. Such configurations of biomass and bioenergy technologies in the landscape make the influence of bioenergy production obvious to the eye (e.g., maize and biogas plant dominated landscapes [4]) and thus open to scrutiny for a broad list of potential environmental burdensFootnote 1, to soil, to water, and land use [58]. In general, life cycle assessment (LCA) is the most popular assessment approach used for investigating the environmental burdens associated with bioenergy production [911]. However, more regionalised approaches are promoted to assess the potential environmental implications of such bioenergy configurations (i.e., bioenergy plants and their biomass catchments). Mainly, as at this scale, assessments can help to identify the distribution and range of geographical variances (e.g., climate, soil) and management practices (e.g., fertilising, tilling), which can influence the potential environmental burdens (e.g., emissions from soils) associated with biomass for bioenergy production and thus the environmental sustainability of a bioenergy product [1214]. Such geographical and regional variances are difficult to capture using average or national values. Therefore, with regional and spatially representative life cycle assessments of bioenergy systems still evolving [10, 1518], the use of three regional contexts were proposed in order to support more regional and spatial life cycle thinkingFootnote 2 for assessing regional bioenergy production systems [19].

Regionally contextualised life cycle thinking

LCA is essentially not designed to be a regional or spatially disaggregated approach and is considered more a global assessment tool [20]. The main reason being, that most production systems are linked to the diversity of the world’s economy, making it impossible to provide a spatial orientation for the thousands of associated production activities [21]. To simplify such complexity, for life cycle assessment, all environmental burdens are aggregated across the full life cycle of a product (i.e., from the thousands of global sites with associated production activities), removing the regional or spatial patterns relating to the production and distribution of such environmental burdens [22]. However, with many environmental burdens associated with bioenergy production occurring at the regional level [13, 23, 24], there is a need to produce more regional and spatially representative life cycle assessments of bioenergy systems [10, 15, 16, 18, 25]. Therefore, the challenge is to find a balance between the increased focus on regional activities and direct environmental burdens associated with bioenergy production (e.g., emissions produced within the regional foreground) and the requirement to account for the non-regional environmental burdens also associated with such regional bioenergy production (i.e., emissions produced upstream, not within the regional foreground, but elsewhere outside the region). In this way, the burdens related to production activities for the full life cycle of a bioenergy product can be taken into account.

O’Keeffe et al. [19] in their review of the literature identified regional and non-regional environmental burdens associated with bioenergy production. Considering these burdens, they aimed to determine how much of a regional focus could make sense for assessing the environmental performance of a particular bioenergy system, while maintaining the major concepts of life cycle thinking. Defining the regional scope as one scale lower than a country [12]Footnote 3 and a bioenergy production system as the combination of different feedstocks and technologies used to produce a particular bioenergy product (Fig. 1), they identified three regional contexts which have been used to frame regionally focused life cycle thinking. With many burdens of bioenergy production strongly influenced by the regional variability (e.g., management, climate, soil) of biomass production [5, 6, 23, 24, 2628], O’Keeffe et al. [19] identified the need to begin determining what is happening “within” a regional context for a bioenergy producing region. They also identified that life cycle thinking framed in a regionally contextualised manner is at a nascent stage, particularly with regard to implementing a suitable or appropriate life cycle impact assessment phaseFootnote 4. For this reason it was suggested, as a first step, to begin exploring regionally contextualised life cycle thinking with the development of life cycle inventory assessments.

Fig. 1
figure 1

A graphical representation of a transesterification bioenergy system, showing two biodiesel plant concepts in a regional setting. The regional foreground activities are the focus of the zoomed in graphical section (adapted from [19])

Aim of paper

Therefore, the aim of this paper is to outline the steps, along with their challenges and options, for developing one of the regional contexts outlined by O’Keeffe et al. [19] into a practical life cycle approach. “RELCA” is a regional life cycle inventory for assessing the environmental burdens of producing a bioenergy product “within” a regional context. The objective of RELCA is to establish a regionally distributed life cycle inventory, which combines both regional bioenergy technologies with their associated biomass feedstocks for an entire region. This enables the use of bioenergy configurations (e.g., bioenergy technologies and biomass) to explore the regional variability of direct regional environmental burdens (i.e., those occurring within the regional foreground), as well as the indirect non-regional burdens (i.e., those occurring outside the region of focus) associated with the production of a regional bioenergy product (e.g., biodiesel, bioethanol, biogas), thus helping to determine the environmental performance of a regional bioenergy product.

As a first step, a brief outline of the RELCA approach is provided. Key issues and aspects relating to the five major steps involved in developing RELCA (Fig. 2) are then outlined. Additionally, a simple illustrative example, using greenhouse gas emissions (GHG) is provided, to show how RELCA can be used to identify the potential distribution of environmental emissions associated with the regional production of a bioenergy product, such as biodiesel. To note, if the concepts of regionally contextualised life cycle approaches for assessing bioenergy systems are not familiar to the reader, it is recommended to read the review of O’ Keeffe et al. [19] for a more in-depth understanding of the underlying theoretical concepts for this approach.

Fig. 2
figure 2

Outlines the RELCA approach and the various integrated modelling steps. (1) CRAMod-crop allocation modelling. (2) BioMod-biomass inventory modelling. (3) CPMod-conversion plant modelling. (4) CAMod-catchment allocation modelling. (5) NoRiMod-non regional inventory modelling

Methods

What is RELCA?

RELCA is a life cycle inventory (LCI) assessment approach which can be used to assess the environmental performance of bioenergy produced “within a regional context” [19] (Figs. 1 and 2). It is a retrospective scoping approach, applicable to projects which aim to either “monitor” the status quo of a region’s existing bioenergy production (e.g., retrospective, attributional LCI [29]), or which aim to explore different options for improving the burdens relating to the existing regional bioenergy production “micro change” (e.g., system expansion through substitution [29]). Goal depending, two major functional units can be used in this approach, either emissions per function of energy produced (MJ), or emissions per function of land used (ha) [30]. It can be used to answer more spatially orientated questions such as: how much of an environmental burden is coming from within my region of activity (e.g., due to cultivating biomass), how much is non-regional? Where are the burdens of my regional activities being produced? Are there regional hotspots (i.e., areas within my region which indicate a higher potential for emissions)?

RELCA consists of three major steps to produce two regionally distributed inventories (Fig. 2). Two steps are required for producing the regionally distributed biomass inventory, with one step required for establishing the regionally distributed bioenergy technologies inventory. In a succeeding step, catchment delineation is used to integrate both regionally distributed inventories to form inventories for each individual bioenergy configuration (e.g., bioenergy plant and supplying biomass). In a final step, these catchment delineated inventories are linked to the associated non-regional inventory (Fig. 2). The main modelling steps involved are as follows:

  1. 1.

    Crop Allocation Modelling (CRAMod) to generate the potential biomass availability or regional distribution of energy crops which can theoretically be used by regional bioenergy systems.

  2. 2.

    Biomass inventory Modelling (BioMod) to develop the mass and energy balances (inputs and emissions) relating to the cultivation of the associated energy crops used in the bioenergy system of interest.

  3. 3.

    Conversion Plant Modelling (CPMod) to develop representative bioenergy plant models (process based) for the bioenergy conversion systems in the region, along with determining their spatial distributions.

  4. 4.

    Catchment Modelling (CAMod) to assign the regionally distributed bioenergy crops to the various bioenergy conversion plants across the region.

  5. 5.

    Non-Regional Inventory Modelling (NoRIMod) to identify and link the indirect upstream emissions, to the direct emissions produced within the region. Upstream emissions are assumed to be produced externally to the region.

There are no hard or fast rules for which modelling tools should be implemented for each of the associated steps, as this depends on the goal of a particular study or the skill sets of a particular research group. However, to be considered a RELCA approach, all five steps need to be carried out in the manner outlined in Fig. 2. As RELCA consists of a regionally distributed foreground inventory, geographic information system (GIS) software is a key support tool for modelling and spatialisation of the inventory data. This can then be used in combinationFootnote 5 with python or other programming and scripting tools (e.g., C++, MATLAB, or R)Footnote 6, as well as Microsoft Excel to model and compute the necessary flows (mass, energy, emissions) outlined in the different steps of RELCA. The use of life cycle software is also a key tool for supporting the life cycle inventory structure and for connecting the non-regional flows (e.g., upstream unit processes from Ecoinvent) with the regional foreground flows (derived for the particular study). In the following sections we discuss the challenges and options for the different steps involved in the RELCA approach (Fig. 2).

Step 1. CRAMod step—crop allocation modelling

The lack of more exact spatially differentiated data (e.g., what is grown where) for energy crop production, (e.g., due to confidentiality issues ), is a known bottleneck in making more spatially detailed assessments of biomass-to-bioenergy pathways [31, 32]. Therefore, alternative approaches have been developed to deal with the issue of crop allocation [33]; here, we refer to them as CRAMod approaches. Many approaches used for spatially allocating crops are based on principles of economic optimization (e.g., farmers or plant owner's profitability) such as; agent based modelling [34], linear optimization modelling [35, 36], or Marchov chain modelling [37]. However, other non-economic approaches for spatially allocating crops also exist. These use a combination of geographical constraints such as crop growing preferences (e.g., soil type, climate), detailed regional land use, and cropping statistics [31]. The latter might be more preferable where economic outcomes are not part of the study’s focus.

For many European based studies investigating the spatial aspects of biomass production, the Corine land cover dataset (CLC) [38] is used as a base layer to identify the distribution of arable land used within a particular region [39]. This is then overlaid with other geo datasets containing information on various environmental properties (e.g., soil type, annual rainfall, temperature) [31, 39]. In many cases for such spatial modelling, geographical data is divided into “land parcels” or grids cells, as this enables a particular process to be modelled for each of the constituent grid cells [4042]. These grid cells, attributed with all the relevant information (e.g., crop, yields, soil type and climate) are then used as the input for the CRAMod approaches outlined above.

The structure and aggregation of geo data into these “grid cells” is, however, a key consideration for any regionally distributed biomass inventory. Therefore, at the beginning of the study deciding on the most appropriate unit(s) of aggregation ensures that the regional inventory flows include the relevant geographical characteristics to (1) enable the estimation of potential regional crop distributions [31], (2) account for the desired regional and spatial variances, which influence the potential emissions of interest in relation to biomass production [43, 44], and (3) helps reduce aspects of MAUP (modifiable areal unit problem) an unavoidable issue in geographical studiesFootnote 7.

The output from the CRAMod step is the cornerstone of the RELCA approach. It is a geodataset comprising of constituent regional grid cells, with each grid cell containing important regional geographical variables (e.g., climate, soil types, agricultural suitability), as well as the energy crop allocated to a particular grid cell by the crop allocation model implemented. Thus, this provides the regional biomass availability required for the CAMod step. It also contains the geographical variables to model management and emissions, for the associated energy crop cultivation which should be used in the next step, BioMod.

Step 2. Biomod—biomass inventory modelling

The magnitude of emissions from energy crop production depends on the production locations within a region and therefore, the associated regional variations in turn can have a significant influence on the environmental performance of the associated bioenergy systems [28, 4547]. For this reason the purpose of the BioMod step is to determine the regional management practices, as well as the direct emissions associated with producing energy crops within a region. These management flows and emissions are calculated for each of the regional grid cells assigned to the particular energy crop, determined in the previous step. The calculations for the BioMod step can be performed using either a python based scripting tool in Esri ArcGIS® or through other programming languages such as C++ or MATLAB6 (see Additional file 1, S1 for an example).

For ease of discussion and because of its relevance for bioenergy production, we focus the discussion here on one category of emissions—greenhouse gases (GHG). In order to determine GHGs associated with biomass production, we need to first determine the potential biomass management practices implemented within the region. Modelling exact management practices of different farmers across a region is near impossible, unless such empirical data is available (e.g., farm survey data) [48]. However, an alternative approach to bridge this data gap is to use recommended best energy crop management practices and available information provided by regional agricultural institutes or advisories as proxies [39, 49]. The application of nitrogen fertiliser is an important example of such management practices, as this has been identified as contributing significantly to the GHG balance of energy crop production [45, 50, 51]. We therefore, use nitrogen management as an example to show how the BioMod step is carried out (Fig. 3, see also Additional file 1, S2 for a more detailed explanation of the calculations).

The management flows are split in two (Fig. 3): the foreground input flows, which are the main nutrient flows (AppFm) in the regional foreground (e.g., nitrogen) and the background non-regional input product flows (FPi), which are supplying the nutrients (e.g., the fertiliser product calcium ammonium nitrogen (CAN)). In the end, as shown in Fig. 3, each grid cell will be attributed with the direct regional foreground flows (e.g., 50 kg of nitrogen), as well as the total induced demand for non-regional products supplying the foreground flows (e.g., 185 kg of CAN). Similarly the steps employed for estimating the fertiliser management practices can also be applied to estimate other cultivation practices such as, crop protection (e.g., regional flows = active ingredientsFootnote 8/non-regional flows = pesticide products). Additionally, the diesel consumption and emissions associated with mechanical field operations (e.g., ploughing, harvesting) can be modelled for each grid cell. This is done using the potential tractability of the attributed soil properties, e.g., heavy or light soil (usually based on % clay) [48, 52].

After the regional management input flows have been determined, the next step is to estimate the associated direct regional GHG emissions induced as a result of such management practices, e.g., kg N2O released per kg nitrogen fertiliser applied. The IPCC provides three Tiers of accounting [53]. For the regionally distributed approach, Tier 2 is the basic calculation which should be used to estimate the potential greenhouse gas balances, as this includes basic geographical considerations (e.g., yields, climate). However, the more idealised means of life cycle accounting is the inclusion of results (Tier 3) from more complex process based models (e.g., DNDC [5456], DAYCENT [50]), which can be used to estimate GHG emissions. Indeed the integrated use of GIS and biogeochemical models is increasingly being employed to estimate regional distributions of greenhouse gas (GHG) emissions [5760].

Fig. 3
figure 3

Example of management input flows per grid cell, all inventory flows for the regional biomass production are estimated in a similar manner. CAN = Calcium Ammonium Nitrate fertiliser and the N2O emissions are assumed to be at 1 % of the applied nitrogen [53] . (See Additional file 1 for more detailed calculation descriptions)

The combined output of the CRAMod Step and the BioMod step produces the regionally distributed biomass inventory (Fig. 2). This is a geodataset which contains, for each grid cell, the direct foreground flows (e.g., nutrients, active ingredients, emissions from soils, exhaust emissions from farm equipment) relating to activities of biomass production within the region. It also contains, for each grid cell, the demands for the non-regional product flows, as shown in Fig. 3 (e.g., fertiliser products, crop protection product, diesel demand).

Step 3. CPMods—conversion plant models

The purpose of the CPMod step is to determine representative bioenergy plants or model plants for the bioenergy system or systems of interest. We refer to regional bioenergy systems (e.g., transesterification), as the total combination of different regional bioenergy chains producing the same energy product (e.g., biodiesel), but which may comprise of different feedstock/technology combinations and hence different regional configurations [19]. A bottom up process-based approach is taken to develop the mass and energy balances for each representative model plant. Additionally, the CPMod step determines the spatial locations and distribution of the different bioenergy plants associated with a bioenergy system, within a particular region (i.e., a point shapefile).

To date (at time of writing), there is little or no available literature on how to develop inventories for existing bioenergy technologies within a region (i.e., process based mass and energy balances), as it is not something commonly done. This is mainly due to lack of data, it is too time intensive, or the development of bioenergy has not been so extensive. However, one approach has been outlined and we use this as a guideline for the discussion here [15]. The purpose of the bioenergy plant inventory is to develop conversion plant models (CPMods) for a conversion system (e.g., anaerobic digestion—model biogas plants, transesterification—model biodiesel plants), which are representative of the regional situation (Fig. 1). The development of the representative conversion plant models can be carried out through a combination of Microsoft Excel®, Aspen Plus® and life cycle software (e.g., GaBi, SimaPro, Open LCA) 6. However, the challenges for developing representative regional model concepts are different for each conversion system and therefore, need to be approached and handled differently.

For large “non-diffuse” bioenergy conversion systems (e.g., fermentation—consisting of a few large-scale bioethanol plants), direct partnership with the plant operators is the ideal means for developing these specific conversion plant inventories (process-based mass and energy balances) [61]. Unfortunately, this is not always possible (e.g., confidentially issues, time constraints), therefore, an alternative option is to try and obtain information from annual reports and regulatory environmental documents, which they are legally obliged to publish each year. A combination of literature values and expert opinion, as is what is traditionally done to complete the life cycle inventory, could then be used. However, as the conversion plant models are required to be representative of a region, the limitation of its regional representativeness should also be acknowledged in the interpretation of the results [62, 63].

For spatially diffuse bioenergy systems (e.g., anaerobic digestion - consisting of many biogas plants scattered across the region), it is not always easy to obtain detailed information for each individual plant, or process steps, due to the sheer number of bioenergy plants and the potentially vast number of feedstock and technology combinations (e.g., as observed in anaerobic digestion and combustion systems). Therefore, bioenergy plant clusters can be developed [64]. There are a number of options for inventory collection, such as: surveys, on-line databases [65, 66], or data provided by grid operators (e.g., 50 Hertz in Germany) [2, 3]. In the ideal case where sufficient data has been collected, various different statistical techniques can be employed for data mining e.g. summary statistics, correlations, distribution analysis (both spatial and non-spatial) [15, 67]. This helps to identify key parameters for clustering the conversion technologies (e.g., feedstocks, installed capacities etc.). These clusters can then be used to develop the process based conversion plant models which are representative of regional conditions [15].

The spatialisation of bioenergy plant data is an additional challenge for the bioenergy technology inventory. For non-diffuse bioenergy plants, determining spatial locations should be relatively straightforward processes (e.g., low numbers present, large and easily seen in Google maps), whereas the spatial distributions (point locations) of spatially diffused conversion systems are a little more challenging. In many cases, addresses for the individual bioenergy plants are provided in tabular format [65, 66], which require further processing for use in spatial analysis software (e.g., GIS). Options can range (depending on number of bioenergy plants) from manual (x, y) coordinate conversion in Google maps, to using a combination of Google Fusion and Google Earth to geocode and convert to a GIS compatible file format, such as keyhole mark-up language (KLM) [68]. For larger data sets using more complex spatialisation approaches, it is also suggested to conduct a visual assessment with Google maps using a representative sample number of bioenergy plants in order to determine the potential level of uncertainty associated with such point data. Additionally, the final locations of these bioenergy plants should also be crossed reference with another data set if available [69].

The output from the CPMod step is the regionally distributed bioenergy technology inventory. In other words this is a geodataset (i.e., point shape file) containing information on the locations, installed capacities, production capacities, as well as the categories of conversion plant models assigned to each bioenergy plant. Additionally, it should also contain the distributed demand functions for each bioenergy plant, which is an important parameter to be used in the CAMod step. Furthermore, for each of these conversion plant models, process models should be developed concurrently in the life cycle software of choice (e.g., GaBi, SimaPro, Open LCA)7.

Step 4. CAMod—catchment allocation modelling

The purpose of the CAMod step is to combine the regionally distributed bioenergy technology inventory with the regionally distributed biomass inventory as in Fig. 2. This is done by assigning the biomass to the associated conversion plants using the demand function determined in the CPMod step (i.e., tonnes of bioenergy crop required for their annual production capacities). In this way the spatial configuration of the bioenergy plants’ catchment areas, or area required to supply biomass to the conversion plants are estimated. Many factors influence the size of the catchment area such as yields, biomass availability, economics, operational capacity of the bioenergy plants, and its efficiency.

With the application of GIS software (e.g., Esri ArcGIS® buffer tool), many assessments use a circular based configuration for the catchment area of a bioenergy plant, with the radius of the circle used to estimate the transport distance to the plant [7072]. The calculation of this radial transport distance can be based on simple demand–supply properties (i.e., x tonnes of biomass required and it takes y number of hectares with a yield z), or more advanced calculations using the approach of [73]. The latter includes aspects of biomass availability and tortuosity (e.g., windiness factor of the landscape/roadways). Additionally, more complex cost supply curves can also be implemented, which use a combination of geographical and economic variables to determine the potential distances travelled for harvesting the biomass (e.g., using various cost accounting methods) [7476]. Which approach is implemented depends on the goal of the particular study. However, most approaches can only process the calculation of one catchment at a time, or several catchments if they are assumed to have the same capacity. Therefore, with bioenergy plants of various different capacities installed in a region, a more flexible approach is required to estimate all the potential catchments simultaneously, such as [77], who used a programming approach (FORTRAN) to model the heterogeneous bioenergy plant catchments.

The output of the CAMod step is catchment delineated inventories, for each bioenergy configuration (e.g., bioenergy plant and associated biomass), associated with the bioenergy system(s) of interest. In other words, both biomass and bioenergy plant inventories are now combined for each bioenergy configuration as a result of the CAMod step (Fig. 2). Delineating the inventory for each bioenergy configuration is one of the most important calculations in the RELCA framework, as the catchment area sets the boundary for aggregating all the relevant foreground flows (e.g., mass, energy and emissions) [75, 78, 79]. It is also the most basic indicator of land use (land occupation) within the region [25].

Step 5. NoRIMod—non-regional inventory modelling

The purpose of the NoRiMod step in the inventory accounting is to link the indirect upstream emissions generated outside the region, to the direct emissions produced within the regional boundary. The regional boundary denotes the foreground activities relating to the bioenergy systems being assessed [12, 29, 80, 81]. However, such foreground activities also require inputs from outside the region (e.g., fertiliser products, fossil fuels, grid energy), these can be considered as flows from the “non-regional background” (Figs. 1 and 3). This means, that the activities involved in producing these products (or flows), as well as their associated burdens, are released everywhere else but in the region of focus, i.e., they are non-regional burdens.

As the focus of RELCA is “within the regional” boundary, the spatial locations of where potential burdens occur within the region are of interest. Therefore, the direct regional burdens are estimated for each constituent regional grid cell, thereby producing a regional distribution of direct burdens (see Table 1) associated with the regional foreground activities of bioenergy production. For the RELCA approach the spatial locations for the indirect non-regional burdens (i.e., where in the world they are produced) are not of interest and they can be summated to a single aggregated value (see Additional file 1, S.3).

For biomass production the bulk of the non-regional burdens come from the production of fertiliser products, crop protection products (e.g., pesticides, insecticides) and fossil diesel (Table 1). For many bioenergy plants the majority of non-regional burdens come from production of auxiliaries and upstream energy supply. Indeed for the bioenergy conversion step the majority of associated emissions are more than likely to occur outside of the region [8284], of course this depends on regional plant designs. However, for some bioenergy conversion technologies, such as combustion, it may be necessary to have foreground modelling of the conversion step, as direct emissions from chimney stacks may be of key regional concern [8587]. In this case, such considerations should be made during the CPMod step.

For the simplest NoRIModFootnote 9 accounting, the mean mass and energy flows for the total biomass supplied to a bioenergy plant (i.e., the bioenergy plant catchment), can be used as input for the designated life cycle software (e.g., GaBi, SimaPro, Open LCA)6, similar to conventional life cycle approaches (i.e., taking the mean). Once all the mean foreground flows (e.g., kg of nutrients) have been estimated and the associated upstream products (e.g., kg of fertiliser products) have been determined, as shown with the nitrogen fertiliser example in Fig. 3, the connections between the regional foreground and background flows can be made with the help of the life cycle software (i.e., using process units for each of the different biomass producing activities). Again similar to how conventional life cycle is conducted, the mass, energy and emission flows associated with the biomass production of a particular bioenergy plant catchment are then linked, within the life cycle software, to the bioenergy conversion plant models, which have already been established as part of the CPMod step.

The output of this step is a connected life cycle inventory which differentiates between within regional environmental burdens and non-regional environmental burdens (see Additional file 1).

Regional life cycle inventory assessment (RELCA)

Continuing with the example of GHG emissions, once the NoRIMod step is completed, a life cycle inventory assessment for the related GHG burdensFootnote 10 can be carried out for each bioenergy configuration within the region (see Additional file 1, S.3: for a fuller description). Carrying out a life cycle inventory assessment for each bioenergy configuration can be a viable option for non-diffuse bioenergy plants (i.e., low number of bioenergy plants within the region). However, for conversion systems which are more spatially diffuse (e.g., biogas) and have a greater number of bioenergy plants scattered across the region, it would be time intensive to calculate an inventory assessment for each of these bioenergy configuration. For this reason, the aggregation of individual bioenergy configurations to their assigned bioenergy clusters, determined during the CPMod step, may be a preferable means to handle such information. In this way the regional ranges for the cluster, i.e., the min, max and mean mass and energy flows are determined. These values can then be inputted into the life cycle software to provide an LCI for the range of potential emissions (e.g., min, mean, max) associated with a bioenergy cluster, which could consist of hundreds of bioenergy plants [14]. Additionally, RELCA can also be used to estimate both the direct and induced indirect input flows and emissions estimated per functional unit of energy or per hectare supplied for the constituent grid cells of a region (see Additional file 1, S.3 for a more detailed explanation).

Outputs of RELCA-inventory results

There are two output inventory results from the RELCA approach, the regionally distributed and the non-regionally aggregated (Fig. 2). Both outputs should be used together to identify the potential regional environmental burdens of producing a bioenergy product. The regionally distributed inventory results in the GIS software can help to identify where potential burdens are occurring at higher or lower intensities, i.e. potential geographical “hotspots” for bioenergy production. This also enables the exploration of potential underlying geographic influences on the associated life cycle balances. The aggregated inventory result from the life cycle software provides an indication of the overall performance of the regional bioenergy production, identifying the activities within and outside the region that induced the overall greatest environmental burdens.

Application to a simple illustrative example

To show how RELCA can be implemented, we present a simple illustrative example, based on the regional data of two East German federal states (Saxony and Thüringen). While aspects of the example pertain to the geographical characteristics of the actual region, we have modified certain details (e.g., bioenergy plant numbers) in order to provide a simple illustrative example of how the RELCA approach can be employed. The goal of the simple example is to use RELCA to determine for one operational base year: (1) the potential regional distribution of direct GHG burdens (emissions) associated with the production of 1 MJ of biodiesel, which is also the functional unit and (2) to determine the percentage direct emissions and indirect emissions for the overall regional biodiesel production. The RELCA approach applied was retrospective and complied with the ISO LCA standards [88]. An attributional life cycle accounting approach was implemented [19, 29] and for simplification, all GHGs emissions were allocated based on energetic content.

Step 1. CRAMod for rapeseed availability inventory

In order to determine the distribution of arable land for the study region the Corine land cover (CLC) [38] was used and organised into land use parcels or grid cells of 25 hectares (500 × 500 m2) using the Fishnet function of ESRI ArcGIS 10.1. This gridded land use layer was then overlaid with different geographical data for the region (e.g., soil types and climate). The crop allocation model of Wochele et al. [29]was implemented, which used the geographical attributes of the gridded land use layer (soil type [89], Ackerzahl valueFootnote 11 [90]) in combination with regional cropping statistics for the year 2010 [91], in order to produce a regionally distributed energy crop layer. From this the regional biomass availability of rapeseed with yields ranging from 2.31 to 4.41 tha−1 (fresh matter) was determined for the example study region.

Step 2. BioMod—biomass inventory modelling

The key direct emissions associated with biomass production are outlined in Table 1, with the relevant inventory assumptions also provided (see also Additional file 1, S.4). It was assumed that farmers in the region used “best farming practices” for rapeseed production [92]. All direct regional management practices and associated emissions were allocated and estimated for each rapeseed grid cell using MATLAB 2012b6 based scripts (see Additional file 1, S.1 for BioMod step). The resulting output is the GHG emissions associated with producing one hectare of rapeseed.

Table 1 Examples of environmental regional and non-regional GHG emissions for the example case study’s transesterification bioenergy system (excluding aspects relating to infrastructure). All regional burdens (GHG emissions) were calculated for each constituent regional grid cell. Non-regional burdens were calculated in an aggregated manner, i.e., no spatial resolution taken into account

Step 3. CPMod inventory

The operational base year selected was estimated from the point of harvest in autumn, through to the autumn of the following year. For this simple example five biodiesel plants were assumed to be operating in the region, with rapeseed as the main oilseed feedstock. Installed capacities ranged from 2000–55,000 tonnes of biodiesel per year. Those plants with installed capacities of less than 10,000 tonnes of biodiesel per year were classified as small scale, with plants above this assumed to be medium scaled (Table 2). For the operating base year, the annual production capacities were also assumed to be 52 % of installed capacity [3]. The mass and energy balances, for the model plant concepts are outlined in Table 2, as well as important modelling considerations for the modelling step. Additionally, the associated GHG emissions are also outlined in Table 1. The rapeseed demand vector for each model biodiesel plant was determined by (1) their installed capacities, (2) the assumed operational capacities (i.e., 52 % [3]), and (3) the estimated conversion efficiency from rapeseed to biodiesel. Additionally, the locations for each biodiesel plant within the region were determined and the coordinates generated using Google maps.

Table 2 Activity parameters for conversion plants in the regional foreground. Model plant concepts representative of the region were developed in collaboration with the German Biomass Research Centre (DBFZ) [64]. All flows unless otherwise stated are kg/MJ biodiesel produced, values have been rounded up to the nearest decimal place

Step 4. CAMod

Using MATLAB2012b6 generated scripts, the Euclidian distance rj was estimated for all biodiesel plants (xn, yn,) to every rapeseed pixel (x i, y i) (Eq. 1). Distances for each rapeseed grid cell to a biodiesel plant (r j) were then sorted for each plant according to increasing distance from the particular biodiesel plant. Each grid cell had also an associated yield

$$ {r}_j=\sqrt{{\left({X}_n-{X}_i\right)}^2+{\left({Y}_n-{Y}_i\right)}^2}. $$
(1)

It was assumed that the smaller biodiesel plants are more likely is to use rapeseed sourced from the region, therefore the model runs start with the smaller biodiesel plants first. The catchments grew in size, (i.e., the consecutive yields were added) until the demand for rapeseed (i.e., tonnes required per annum) of all the biodiesel plants were satisfied in one simulation run. Additionally, if a rapeseed grid cell was closer to one biodiesel plant than another, the rapeseed grid cell was allocated to the closest biodiesel plant to avoid catchment area overlap (i.e., things closer are more likely to be related [103]). The particular catchment for each biodiesel plant was then used to estimate the associated life cycle inventory (field-to-gate). In order to keep the simulations simple, it was also assumed that there was no other demand for the rapeseed, other than biodiesel production.

Step 5. NoRIMod—estimating emissions

The key indirect emissions associated with biomass production and bioenergy conversion are outlined in Table 1. The relevant inventory assumptions are also provided in Tables 1 and 2 (see also Additional file 1, S.3). Flows relating to biomass production (inputs and emissions) were aggregated per catchment area and then divided by the amount of biomass used by the bioenergy plant (e.g., total kg rapeseed used per operational year). These catchment averages were then used as input flows for a rapeseed model developed in the life cycle software GaBi 6.06, enabling the connection to be made with the upstream non-regional flows (e.g., fertiliser products). This was then linked to the relevant regional biodiesel models (Table 2) modelled in GaBi 6.06 and coupled with Ecoinvent 2.2 inventory [104]. The resulting output of the modelling step was the aggregated emissions per functional unit of mega joule biodiesel produced. For the example presented here, GHG inventory results could be calculated for each configuration10.

Output results

The results of the RELCA simulations for the simple illustrative example indicate the overall regional average for producing 1 MJ of biodiesel to be 35.4 g CO2eq. MJ−1. The regional emissions for the different biodiesel configurations ranged from 32.4–40.0 g CO2eq. MJ−1 (Fig. 4a). The cultivation step was found to contribute the majority of GHG emissions, ranging from between 65–80 % of the total GHG emissions. The largest direct regional emissions were identified as coming from soil (e.g., N2O) and the largest indirect GHG emissions associated with the production of nitrogen fertiliser (Fig. 4b).

Fig. 4
figure 4

a The GHG emission (g CO2eqMJ-1) for each process step associated with the regional bioenergy system, cultivation, conversion, and transport (to biodiesel plant) are shown for each of the five catchments included in the example case study. b The contribution of the different parameters to the total GHG emissions per mega joule of biodiesel. For meaning of abbreviations, refer to the list of abbreviations

Across the various catchments the share of direct emissions (i.e., those released directly in the region) were found to be between 37–42 % of the total emissions, whereas, the indirect emissions (i.e., those released outside of the regional foreground) were found to be between 58– 63 % of the total emissions. Nitrogen fertiliser production contributed between 49–59 % of the total indirect emissions. Energy production required to meet the thermal and electrical demands of the conversion plants contributing between 17–35 % of the total indirect emissions (non-regional).

Direct emissions from soil (e.g., N2O) contributed between 84–87 % of the total direct emissions, whereas, the other direct emissions in the simple example (e.g., exhaust emissions from field operations) contributed between 13–16 %. The distribution of the total direct GHG emissions per mega joule output for each biodiesel configurations can be seen in Fig. 5.

Fig. 5
figure 5

Exemplary map to visualise the catchment delineated distributions of total direct GHG emissions (mg CO2eq. MJ−1) for cultivation of rapeseed associated with the different biodiesel configurations within the example region. The variation for the different biodiesel configurations is dominated for this functional unit by the different installed capacities, with larger biodiesel configurations showing lower emissions per mega joule. However, greater spatial variation can be observed when looking at a map of emission intensity per hectare of supplied biomass (see Additional file 1, S.5). Map produced using Arc GIS® software by Esri

Interpretation and discussion of example outputs

Due to RELCA taking into account the regional and spatial variability of regional bioenergy production, it is not so easy to find comparable approaches in the literature, indeed as pointed out by Müller-Langer et al. [61], such direct comparisons between life cycle results should be done with care. However, one comparable study is that of Bernesson et al. [105]. They took regional averages to determine the potential GHG emissions associated with biodiesel production for a region in Sweden (cradle-to-gate). They found GHG emissions to be between 39.5–40.3 gCO2eqMJ−1 for medium and small scale operations, respectively (using energetic allocation). Another bench mark comparator is the EU RED [106], which requires biodiesel to be produced with a GHG emission of less than 42 g CO2eq MJ−1. Therefore, when comparing against these literature benchmarks, the RELCA results, for this example appear to be realistic and indicate that most biodiesel configurations presented in this simple example, have relatively better GHG performances.

The results of RELCA also indicate, as with other life cycle studies of biodiesel, that the cultivation of rapeseed contributes substantially to the overall GHG balance of biodiesel [105, 107, 108]. Therefore, the ability to determine the GHG performance of the different biodiesel configurations and the regional variance of this performance is a critical component of the RELCA approach. The ability to do this is due to the more spatially detailed inventory for the production of the rapeseed feedstock. Indeed Malça et al. [108] in their review of GHG balances for European biodiesel chains identified more detailed modelling is required for soil emissions, such as N2O, which are sensitive to the regional variability (e.g., fertiliser application, rotation) [5, 6, 23, 24, 2628]. Such modelling advancements are crucial for improving the results of life cycle studies, and this is one of the objectives of the RELCA approach. For the simple example presented here, field emissions for rapeseed production were estimated by RELCA (v1.0), to range on average for the catchments from 3.02–4.66 kg N2O ha−1 a−1. These values are within the ranges estimated in the literature for Eastern Germany of 1.9-14.8 kg N2O ha−1 a−1 [39, 109]. However, the ability to validate such emissions is limited, as there is in general a lack of spatially distributed empirical studies which could enable an effective validation process [97], and this is a limitation of the RELCA approach. Therefore, it is important that the interpretation of RELCA results does not focus on the individual values themselves (e.g., for one particular catchment), but rather the ranges found throughout the entire region and across all bioenergy configurations. This discussion is continued further in the next section.

Discussion

RELCA (v1.0) the first step

With many burdens of bioenergy production strongly influenced by the regional and spatial variability (e.g., management, climate, soil) of biomass production [5, 6, 23, 24, 2628] more regional and spatially representative life cycle approaches for assessing bioenergy systems [10, 16, 18, 25] are required, to support the sustainable use of natural resources [11], such as biomass. The aim of this paper was to outline the challenges and options for developing a life cycle approach for assessing not only the regional environmental performance of bioenergy production, but also the spatial variability of that performance “within” a regional context [19].

As a first step, the ability to estimate GHG emissions for regional bioenergy systems has been used as a proof of concept, in order to show that adapting life cycle thinking to the “within regional context”, as outlined by O’Keeffe et al. [19], can provide valuable insight into the potential regional and spatial variability (e.g., yields, nitrogen fertiliser, soils, climate) which may influence the environmental performance of regional bioenergy production. The RELCA approach is relatively straight forward with regard to integrating spatial data into a life cycle inventory accounting approach. It combines conventional geographical modelling with conventional life cycle software through the use of catchment delineation. RELCA is therefore, the first approach to explore catchment delineation as an option for including greater spatial details within a life cycle approach for bioenergy production. It has been shown here to work for the estimation of potential GHG balances (Additional file 1, S.3).

The result of the simple illustrative example helps to show how RELCA can work. It also indicates how the outputs can be used to explore the regional and spatial variation and how the emissions can change across and between the different catchments. This is mainly due to the different characteristics of the regional locations where biomass is produced (e.g., soils, climate, management) and due to the different characteristics of the different bioenergy plants (e.g., capacity and efficiency). This interaction in turn influences the final result of a bioenergy configuration (i.e., bioenergy plant and catchment) and thus, the overall potential GHG emissions associated with a regional bioenergy product. Therefore, as a retrospective scoping approach, RELCA can be used by policy makers to help first identify potential issues relating to regional bioenergy production e.g. production capacities too concentrated in one area. As well as helping to identify opportunities for improving regional bioenergy production, e.g., the need to adapt biomass management strategies for certain locations within the region due to relatively higher emissions. Furthermore, it can also help to identify areas or locations within the region which could benefit from more fundamental research, e.g., establishment of field trials for areas or soils which may be sensitive to greater emissions. The extent, to which this support can be given of course, depends on the quality of data acquired and robustness of modelling implemented for each of the modelling steps.

While RELCA (v1.0) proposed here is the first approach for assessing the environmental implications of regional bioenergy production and bioenergy configurations within a focus region, it is not without some practical drawbacks. These include time constraints, data availability at the required spatial resolutions, as well as the need to integrate different and complex datasets. Additionally, it is also important to consider the uncertainties in the interpretation and reporting of the RELCA results [110114]. Both the biomass and bioenergy technology inventories will have their own unique sets of uncertainties, due to the broad spectrum of data sources and different disciplines used to derive such data. As mentioned previously for the biomass inventory, it will be difficult to validate the assumptions made for management practices and emissions, as empirical production and environmental data (i.e., farm surveys, field measurements) relating to biomass production are simply not monitored or measured at such spatial resolutions [48]. For bioenergy technology inventories, there are also no extensive or complete datasets available for cross checking, or comparison [69], making the validation of such results also challenging. Additional uncertainties and limitations, relate to the estimation of the NoRIMod indirect emissions, as many of these emission estimates are sourced from databases which may only provide generic or already pre-allocated values (see [84]). These uncertainties are not only relevant for bioenergy plant models, as many of the environmental burdens are indirect (i.e., upstream, non-regional), but also for biomass modelling, especially for the estimation of fertiliser production [45, 84]. Furthermore, sensitivities to the underlying assumptions and modelling approaches implemented for estimating the potential environmental burdens or emissions should also be taken into account when interpreting the results of the approach. Therefore, the focus of the analysis should be on the potential ranges observed for the region and not on the individual results themselves, or for any one particular location. Contribution analysis and sensitivity analysis for parameters with a known higher level of uncertainty could also be used to help in the interpretation of the results [110, 112, 114].

Concept and design of RELCA—potential future developments

The overall concept and design of RELCA is meant to be adaptable and flexible, thus enabling further complexity to be introduced in successive steps, with regards to the number of environmental burdens considered and spatio-temporal considerations (e.g., adapting crop rotations and land use). The current version of RELCA (v1.0) is unable, as of yet, to model additional environmental burdens, which lead to more regional and spatially dependent impacts, such as impacts to soil or water. There are two major challenges for this; the first relates to the need to integrate different spatial models in the regional foreground to assess for a greater number of burdens. The second relates to the development of NoRIMod approaches required to deal with the associated upstream non regional environmental burdens and linking them to the regional foreground environmental burdens.

Modelling the regional foreground is relatively straight forward when focusing on one set of emissions (e.g., GHGs); however, trying to combine two or more classification of emissions introduces greater complexity. This is particularly relevant when choosing the most appropriate spatial units and identifying which types of modelling approaches are compatible (e.g., which biophysical models can be combined) [19]. To date, there has been relatively little or no literature on the integration of different spatial models for determining different environmental burdens within life cycle approaches. There is also a lack of literature on how to handle the different spatial units required for the different modelling approaches (e.g., hydrological units for eutrophication emissions vs. simple square grids for GHG emissions). In general, singular approaches for estimating individual emission categories have been proposed [24, 115] and only one approach has been outlined for dealing with the uncertainties of using different spatial units for the full life cycle assessment [69]. Therefore, further research is needed to support the enhancement of more regionally distributed life cycle approaches, such as RELCA, which would enable them to include more than one potential environmental burden within a regional context.

Additionally, while RELCA can provide a static monitoring-based result for potential land occupation of certain biomass-to-bioenergy systems within the regional foreground, it cannot estimate the potential dLUC (direct land use change within the region) and iLUC (indirect land use change) occurring within and outside of the focus region. The use of more dynamic economic modelling approaches have been identified as the best means of estimating changes in land use, for example, equilibrium and agent-based modelling [116118]. Such modelling can provide a better understanding of market interactions, which might lead to dLUC within a focus region or indeed lead to induced iLUC impacts outside the focus region. However, this is still an area which needs further consideration and indeed to model such related burdens and impacts, may require a different regional context, a more regionalized context (i.e., capturing interactions between regions) than that provided by RELCA (see [19]).

The current version of RELCA (v1.0) is at its simplest form for estimating GHG emissions, mainly because GHG10 burdens, due to their global impact potential, can be normalised to CO2 equivalents. This makes it easier to create links with life cycle software in the NoRIMod step. When it comes to estimating further environmental burdens (i.e., those relating to soil and water), this will require much greater effort to determine the associated non-regional upstream burdens. This includes, identifying all relevant non-regional upstream burdens from various different data sources and inventories, e.g., Ecoinvent, GEMIS, CML6 (i.e., which emissions need to be included and to which tier), as well as organising and structuring the data flows, in order to integrate them into the rest of the RELCA modelling approach. This in the end could be very time consuming, it may also require the modification of the life cycle matrices.

Additionally, while RELCA in its current iteration is designed to conduct a life cycle inventory assessment, efforts should still be made during the development of the inventory, to enable the future possibility of linking to plausible and robust impact assessment methods (i.e., adding an LCIA phase to account for impacts to water, soil, or biodiversity). However, many challenges have been identified with regards to turning the potential environmental burdens (LCI) derived in a spatially distributed manner, into potential impacts (LCIA) for a regionally contextualised approach. The ability to do this will depend on the availability of characterisation factors at the appropriate scale, or indeed the ability to calculate a chosen impact or impacts through direct modelling, e.g., fate and transport models or land-use models [19]. Therefore, further research into this possible development of RELCA is also required.

Despite the limitations of the current version, RELCA (v.1.0) is the first step towards materialising a regional and spatial life cycle approach for assessing the environmental implications of bioenergy production and bioenergy configurations (e.g., bioenergy plants and catchments) within a focus region. It also provides a good starting point for further discussion, improvements and modelling developments for assessing the regional and spatial environmental implications of bioenergy production (e.g., such as impacts to soil, water, and biodiversity) for a within regional context.

Conclusions

Life cycle thinking is ever increasingly being used in various sustainable development policies and to support how environmental issues and decisions are deliberated and handled [119, 120]. This is exceedingly the case for bioenergy and as decision makers work in real time and cannot wait for precise results, there is a demand on life cycle approaches to simplify complex systems. [10, 119] This is what the RELCA approach sets out to do. The regional life cycle inventory assessment (RELCA) outlined in this paper is the first approach to assess not only the regional environmental performance of bioenergy production, but also the spatial variability of that performance “within” a regional context. It does this through combining conventional geographical modelling with conventional life cycle software through the use of catchment delineation. RELCA is therefore, the first approach to explore catchment delineation as an option for including greater spatial details within a life cycle approach for bioenergy production. It has been shown here to work for the estimation of the potential GHG performance of a regional bioenergy product. However, RELCA (v1.0) is not without its limitations. Despite these, it still provides a good starting point for further discussion, improvements and modelling developments for assessing the more regional and spatial environmental implications of bioenergy production (e.g., such as impacts to soil, water and biodiversity) for a within regional context. Indeed, with a greater emphasis being placed on the role of biomass in supporting a future bioeconomy, (i.e., the use of biomass resources for the production of food, materials, chemicals, fuels and energy) [121, 122], the development of approaches, such as RELCA, will be necessary for the sustainable management of natural resources, such as biomass.