1 Introduction

There is a worldwide interest today in designing a greener economy (D’Amato et al. 2017), in which the potential production of bio-molecules for a diversity of species could be harnessed. By doing so there is some hope that society can substitute petrochemical-based products with renewable ones (Mukhopadhyay 2006; Philp 2018), or design new ones. For instance, bio-molecules extracted from trees (also known as wood extractives) can be used to produce novel and high-value products, e.g. pesticides and fungicides (Anttila et al. 2013; Valette et al. 2017), products for the construction materials industry (Lacoste et al. 2015; Pizzi 2019), food colours and supplements, cosmetics and drugs (Stevanovic et al. 2009; Jablonsky et al. 2017; Bhardwaj et al. 2021).

In practice, supply chain industries from the forestry sector (e.g. sawmills, pulp and paper mills, panel board mills, and lumber brokers) are responsible for finding and extracting bio-molecules from tree species and locations. When doing so, they have to avoid conflicts with the production of other wood products (e.g. wood used for construction, food, industry, energy) and to comply with environmental regulations (e.g. GHG emissions, conservation of forest biodiversity).

Given the above-mentioned constraints, supply chain actors require prior knowledge about many biological, physical and socio-economic variables at different scales of observation (Fig. 1):

Fig. 1
figure 1

Generic knowledge and decision pathway that supply chain actors can entirely or partially follow to decide whether to invest in the extraction of bio-molecules from wood resources

First, supply chain actors require a detailed breakdown of the composition and concentration of bio-molecules (Royer et al. 2013) in order to decide which species to select, trees to fell, wood logs to cut, and biomass components to split, transform and use for industrial extraction (Step 1 in Fig. 1). All trees produce bio-molecules that belong to different groups of polyphenols, including tannins, flavonoids, and lignans. For these bio-molecules, actors need to know how their concentration vary among species (from less than 0.3 to 35% (Willför et al. 2004b; Benković et al. 2014; Kebbi-Benkeder et al. 2015; Bianchi et al. 2016; Kebbi-Benkeder et al. 2017; Nisula 2018)). They also need to know how they vary within a species between the bottom and the top of the trees (Pietarinen et al. 2005, 2008; Kebbi-Benkeder et al. 2017; Nisula 2018; Brennan et al. 2020a, 2020b, 2020c), between young and old trees (Dedrie et al. 2015; Jyske et al. 2014), and between the biomass components of trees (heartwood, sapwood, bark, knots and branches, and branch insertions) (Sinclair and Dymond 1973; Kebbi-Benkeder et al. 2015). For example, knots contain 5–20% polyphenols, like lignans in conifers and flavonoids in deciduous species (Holmbom et al. 2003; Willför et al. 2003, 2004a, 2004b, 2004c, 2005; Pietarinen et al. 2005; Sekine et al. 2013; Kebbi-Benkeder et al. 2015; Nisula 2018). Likewise, they need to know whether the diversity and concentration of bio-molecules differ from the normal level. This may occur in response to some environmental factors at the growth site (Kilulya et al. 2014; Piispanen et al. 2008), or around scars and specific biomass components after trees have been pruned, de-barked or wounded due to wind breakage of branches, silviculture or herbivory (Lewinsohn et al. 1991; Kopaczyk et al. 2020).

Then, supply chain actors need to locate forest catchments that contain the requested quantities of bio-molecules (Step 2 in Fig. 1). They need to do that to identify the land owners they would contract with for using already felled trees or harvesting new ones (IGN 2018). But getting this information requires an extended knowledge of how the composition and dynamics of the bio-molecules at the scale of a population of standing and felled trees is impacted by various biological and environmental disturbances, as well as by silvicultural practices. Eventually they also need to know whether there is a suitable road infrastructure, and whether there are nearby industries that procure wood from the forest landowner. Geographic information systems (GIS) could be used to synthesise and project across space and time the composition and quantities of bio-molecules for any given species. But to the best of our knowledge, there is no published work or map of this kind, not even for the most advanced studies in the fields, such as in Canada (see Royer et al. 2010, 2012, 2013) or in Scandinavia (Holmbom 2011; Näyhä et al. 2011; Bergström and Matisons 2014; Routa et al. 2017; Nisula 2018; Verkasalo et al. 2021).

Next, supply chain actors need to select suppliers that can transform wood biomass into extractable wood fractions (Step 3 in Fig. 1). The problem is that detailed knowledge of supply chains is rarely shared. They represent complex and strategic networks of forest industries, involving multiple pathways and loops of purchasing, selling, and the transformation of unprocessed and processed wood biomass (Velema 2004; Roitto et al. 2016; Routa et al. 2017; Pichancourt et al. 2018; Hassegawa et al. 2018). The lack of such public data makes it difficult to predict where and how much of the richest fraction of bio-molecules of the biomass components are separated during biomass processing (e.g. bark, sawdust, chips and knotwood), where they are aggregated into composite by-products that could be more cost-effectively bought and collected for the industrial extraction of bio-molecules (Holmbom et al. 2003; Nabarlatz et al. 2007; Rustad et al. 2011) and, finally, where they could be collected without disturbing traditional uses of forest biomass (Velema 2004; Roitto et al. 2016; Pichancourt et al. 2018; Hassegawa et al. 2018; Verkasalo et al. 2021).

Finally, supply chain actors need to select an industrial extractor specialised in biorefining (Step 4 in Fig. 1). However, beyond the cost of industrial extraction itself, they need to know whether the process of extraction actually improves the carbon balance of their activity, as opposed to if they were selling their wood biomass for another use (e.g. for bio-energy). By estimating that, they may be able to claim (additional) carbon credits (Brunet-Navarro et al. 2016; Pichancourt et al. 2018).

Altogether, the strategic pieces of information that we present at the different scales of observation form what is known as a “knowledge infrastructure”. Without it, supply chain actors may not have the necessary “visibility horizon” (as per Carter et al. 2015) to adapt their activities to this new bio-economy. This knowledge infrastructure is not easy to produce and mobilise. In this study, we show how it can be achieved by developing a generic methodological framework that can be used to select the right data, models and analyses to produce information at different scales (Fig. 2). We then apply it to a case study where we worked with supply chain actors from the Grand Est region of France who are interested in the industrial extraction of bio-molecules contained in silver fir (Abies alba).

Fig. 2
figure 2

Summary of the generic methodological framework used to produce information for Steps 1, 2, 3 and 4. For the biomass component of a tree, BWD represents basic wood density, RV: relative volume, and EC: extractive concentration

2 Methodological framework

2.1 Selection of the best tree species and biomass components to be used

For this first step (Step 1 in Fig. 1), the objective is to estimate the quantity of bio-molecule i that can be extracted from biomass components j (e.g. stem, bark, knots, branch insertions, branches), at tree height k for a particular tree species. To achieve this, we need to independently estimate the volume (Vjk [m3]) and the basic wood density (WDjk [kg.m− 3]) for each biomass component, as well as the dry mass proportion of the extractives (ECijk [kg.kg− 1] or [%]). Then, using simple arithmetic, we can deduce the total quantity of the extractable bio-molecule (Bijk [kg]), such that:

$$ B_{ijk} = V_{jk} \times WD_{jk} \times EC_{ijk} $$
(1)

To carry out this simple calculation, four prior scientific tasks need to be performed:

2.1.1 Selection, measurement and harvesting of trees

Studying the variability of bio-molecules at both a tree level and a species level is expensive and time-consuming. Therefore, the sampling effort at both levels should be adapted to the objective, time and money (see, e.g., Royer et al. 2010, 2012, 2013, Kebbi-Benkeder et al. 2017; Billard et al. 2020, Brennan et al. 2020a, 2020b or Brennan et al. 2020c). Regardless of the sampling effort, the spatial location and biological characteristics of standing trees must be measured for further allometric and chemistry analyses (e.g. breast height diameter (dbh), tree height, crown height, height of the first living branch, stem and branch diameters).

2.1.2 Preparation for allometric and chemical analyses

This task depends on the tree phenotype and sampling effort. When many species need to be extensively compared, then 1–2 transverse discs of several centimetres are sufficient, sawn at the stump and at the height of 1.3 m from the base (Kebbi-Benkeder et al. 2015) or, if possible, at 5 m if the species has butt swelling (like for some tropical trees) (Santana and Okino 2007). If more intensive analyses are needed to study the differences in bio-molecules between the biomass components, several evenly spaced discs are cut (e.g. 2–15), and their position along the trunk is recorded (Kebbi-Benkeder et al. 2017). If data must be further segregated based on how biomass components are used (e.g. for industrial uses, construction or wood for energy), we suggest sawing the discs along the trunk, from the base to the upper limit of wood suitable for industrial uses (Billard et al. 2020; Brennan et al. 2020a, 2020b, 2020c).

2.1.3 Estimation of the basic density and relative volume of biomass components

This task can be skipped if prior measurements of biomass have been performed directly on standing trees, or indirectly using allometric equations, i.e. predicted from stem or branch diameters (as described in detail by Gómez-García et al. (2015)). However, recent studies suggest that current manual and allometric knowledge may be misleading and should be improved (Bauer et al. 2019; Billard et al. 2020). In this case, some of the discs (after being oven-dried) should be analysed with an X-ray computed tomography scanner (CT-scanner) (Longuetaud et al. 2016; Billard et al. 2020)) to estimate the precise volume and basic density [kg.m− 3] of all of the biomass components. The statistical method, assumptions and source code to achieve this are described in details in Longuetaud et al. (2016).

2.1.4 Estimation of the composition and concentrations of bio-molecules

Discs and knots are then independently crushed to extract the bio-molecules of interest, as described, e.g., in Brennan et al. (2020a, 2020b, 2020c). The lab process should match the actual industrial process of extraction used by biorefineries and legal environmental constraints. Briefly, discs are milled, dried, and processed with a solvent in an accelerator for extraction, before estimation of the dry yield. Various solvents, such as water, ethanol, a mixture of the two, or even more efficient ones such as the toluene/ethanol mixture (if authorised by the legislation). Liquid chromatography is then followed by mass spectroscopy (LC-MS) and nuclear magnetic resonance spectroscopy (13C NMR) to quantify and identify polyphenolics and others.

Once the distribution model of extractives has been inferred for the desired species, it may be used together with other methods to support decision-making. However, the first step can still be repeated again for two independent reasons: either when it is a question of geographic region where the species ecology is known to be different, or when better pre-screening of some of the tree biomass components is necessary. This can be the case when the harvesting of wood biomass is legally constrained by prioritisation rules in some countries. For this purpose, we will show how it is possible to restructure the dataset in the application example section.

2.2 Selection of the best geographical location for harvesting

The methodological objective of Step 2 (Fig. 1) can be broken down into two tasks:

2.2.1 Scaling up the information about bio-molecules in the standing forest resource

The common method here is to develop a GIS map containing information about the composition and quantity of bio-molecules in the standing forest resource and felled logs. These layers of information can be crossed with other information layers, e.g. with the road network, the land register, or the distance from the nearest forest industries. This task obviously depends on the availability of spatial data related to the forest resource. The latter can be obtained from any National Forest Inventory (NFI) programme. NFIs provide statistically representative information (e.g. forest composition and structure, dendrometric data) on a large spatial scale based on sampling methods, geo-statistical interpolation and allometric models (Vidal et al. 2016). Various statistical methods can be used to represent NFI data on a GIS map, depending on the quality of the dataset (Vidal et al. 2016; Köhl et al. 2006). Equation 1 can then be combined with NFI data in a GIS to estimate the quantity of extractives.

2.2.2 Projecting standing resource information over time for different socio-ecological scenarios

The main method consists in designing and using a forest growth model to project the biomass and the composition in bio-molecules for different tree size classes and harvesting scenarios (Wernsdörfer et al. 2012). For this task, several types of forest growth models can been used Weiskittel et al. (2011). Mäkelä and Valentine (2020) provide guidelines to select the right model, depending on the objective, scale of projection, level of forest complexity, data available, and computational limitations. Once the right model is selected, we can produce GIS layers to project the spatial forest distribution and biomass density of bio-molecules in both standing trees and felled logs for different time horizons (Colin et al. 2017).

2.3 Selection of the best suppliers to buy by-products for extraction

The methodological objective of Step 3 (Fig. 1) can be broken down into two tasks:

2.3.1 Reconciling supply chain data

Since private supply chain data are strategic and frequently not shared, public data can be used as a surrogate. They usually present aggregated quantities of raw material that are harvested, and the annual proportions that are processed at a regional or a national scale by different groups of industries, e.g. all sawmills (Martin 2009; Lenglet et al. 2017; Pichancourt et al. 2018). However, this type of supply chain data is frequently heterogeneous, since it comes from different sources and scales of observation, and may thus benefit from being reconciled using optimal material flow analyses (Cencic 2016; Lenglet et al. 2017).

2.3.2 Connecting resource growth dynamics and supply chain data

The supply chain model is then used as a data integration tool to represent the supply chain structure and the associated flow of wood material and bio-molecules (Holmbom 2011; Bergström and Matisons 2014; Routa et al. 2017; Pichancourt et al. 2018). It is then used to test alternative scenarios of re-orienting wood biomass for extraction in order to find the right supplier and to quantify the sensitivity of the flow of bio-molecules in response to various levels of social and environmental risks and uncertainties (Vieira et al. 2020). There are different types of supply chain models and not all can achieve what has been mentioned so far. The reader may refer to Min and Zhou (2002), Vahid et al. (2016), and Pichancourt et al. (2018) or Vieira et al. (2020) among others, to select the most appropriate model.

2.4 The regional impact of redirecting wood biomass for industrial extraction

Finally, the methodological objective of the last (Step 4 in Fig. 1) is to assess the regional impact of constraints of re-routing wood biomass for other end-uses, such as industrial extraction. For instance, analyses of GHG emissions are considered a common business practice that requires standard calculations defined by the IPCC (Lindner et al. 2012; Pichancourt et al. 2018). A list of the supply chain models that include these calculations, with their pros and cons, can be found in Brunet-Navarro et al. (2016) and Pichancourt et al. (2018).

3 Application example: silver fir in the Grand Est region (North-Eastern France)

Our aim here was to show how this methodological framework could be applied, to help supply chain actors from the Grand Est region to extract high-value bio-molecules contained in the common tree species silver fir (Abies alba Mill.), using an industrial scale biorefinery. The application example combined many of the classic supply chain management problems predicted by the related theory (Carter et al. 2015), i.e. poor awareness and poor data sharing and coordination practices regarding the structure and dynamics of the entire supply of forest products (Ficko et al. 2011; Pichancourt et al. 2018; ADEME 2019). The tailoring of methods that we produced represents three years of multidisciplinary scientific work, plus feedback from various forest supply chain actors that we met in the region, during multiple thematic meetings, workshops and formal interviews.

3.1 Biomass components useful for extracting bio-molecules

We first estimated the total quantity of bio-molecules contained in most biomass components of eight silver fir trees harvested in 2018 at 49.97N, 4.06 E. The harvest took place in a 40-year-old naturally regenerated mixed-forest stand with 90% silver fir, 10% Norway spruce, and a small number of miscellaneous broad-leaved species. The wood chemistry analysis showed that the composition of bio-molecules per unit of wood biomass [kg/kg] varied between the biomass components. Stemwood represented the largest fraction of bio-molecules, followed by bark and knotwood with much smaller fractions of the total tree biomass.

CT-scan analyses showed that the variation in the total biomass of bio-molecules was a consequence of differences in the relative volume, wood basic density, and extractive concentration between the biomass components (Fig. 3). Dry weight of bio-molecules per cubic metre of stem volume was estimated to be \(\sim 15~kg.m^{-3}\), independent of tree size, and \(\sim 12~kg.m^{-3}\) and \(\sim 1~kg.m^{-3}\) in bark and knotwood, respectively. The highest concentrations of bio-molecules on a dry weight basis were observed in bark (0.2–0.25 kg.kg− 1) and knotwood (0.1–0.2 kg.kg− 1), and the lowest concentrations in stemwood (< 0.05 kg.kg− 1). We could not produce any information about bio-molecules for branches. Given that knots represent only \(\sim ~\nicefrac {1}{100}^{th}\) of the total above-ground volume of silver fir, their bio-molecules was estimated to be no more than \(\sim ~\nicefrac {1}{500}^{th}\) of the total biomass of a tree, even if knots were the densest wood component (700 ± 41 kg.m− 3), and almost twice as dense as stemwood (374 ± 18 kg.m− 3). Similarly, bio-molecules in bark represented no more than \(\sim ~\nicefrac {1}{50}^{th}\) of the total biomass of silver fir, even if barkwood density (514 ± 19 kg.m− 3) represented \(\sim ~\nicefrac {4}{3}^{rd}\) of stemwood density.

Fig. 3
figure 3

Bio-molecules that can be extracted from five biomass components of silver fir: (1) bark, (2) stem, (3) knot, (4) insertion of the branch, (5) branch (\(\varnothing \) = [0–4 cm, 4–7 cm, > 7 cm]). We measured the basic wood density (WD: [kg.m− 3]), the relative volume (RV: [%] of the above-ground volume of the tree with the exclusion of leaves), and the graphs represent the concentration in extractives, EC [kg.kg− 1]. For every biomass component, potential conflicts (construction and industrial use in this case) are presented if the wood materials are used for molecular extraction on the same harvesting sites

The concentration of bio-molecules varied along the tree. However, given the national priority rules for using wood biomass in France (ADEME (2019, p. 8): construction wood & industrial wood products > wood for chemical industry > energy wood), we categorised the results into industrial wood and energy wood (dbh ≤ 20 cm) and into sawlogs (dbh > 20 cm). Particularly higher grade bottom logs of silver fir are usually prioritised by sawmills for the supply of construction wood or packaging. Upper logs are prioritised for industrial wood (e.g. for panel or pulp&paper making), either coming from the stem part above the lowest living branches, or from young trees removed from thinning. Sawdust, knots and bark by-produced by sawmills, pulp and paper mills are usually sold there for making either panels, but mostly for energy wood in the form of pellets or cogeneration. Given those constraints, we show that knotwood, which are easier to obtain, contains more extractives than tree segments that are used for sawn timber (\(\sim \) 18%) or tree parts that are used for industry or energy wood (\(\sim \) 10%).

Finally more detailed wood chemistry analyses revealed that bark and knotwood samples both contained a large diversity of families of extractable bio-molecules (Table 1). Bark contained 13 bio-molecules (nine identifiable ones) from different families, mostly flavan-3-ol, phenolic compounds and tannins. Knots contained seven bio-molecules at 10 to 100 times lower concentrations, from the following families of molecules: lignans, phenolics, terpene and from unknown origins. The largest diversity of bio-molecules was found in knotwood. Many of these bio-molecules are biologically active and have nutraceutical properties that can be used in food supplements (Bhardwaj et al. 2021). Moreover, isorhamnetin flavonoids found in barkwood are known for their anti-inflammatory and anti-tumoral effects, and lignans from knotwood for its anti-tumoural properties (Gong et al. 2020; Bhardwaj et al. 2021).

Table 1 Knot and bark extractive yield from silver fir trees in laboratory conditions and using a mixture of Water/EtOH (1:1)

3.2 Locations where trees contain the desired bio-molecules

Combining GIS projections and French NFI data, we projected the regional distribution and variability of the biomass of a diversity of bio-molecules contained in one hectare of forest with silver fir trees (Fig. 4). As an example knotwood of standing trees in one hectare of forest from the Vosges mountains should contain \(\sim \) 1 to 40 kg of different types of lignans with potentially useful nutraceutical properties; whereas two to six hectares would be necessary to obtain the same amount of isorhamnetin from barkwood (known for its anti-tumoural properties).

Fig. 4
figure 4

Locations and amount of bio-molecules per hectare of standing forest containing silver fir in the Grand Est region. The results were generated using the GIS platform that can be found at http://extraforest.ign.fr/

We then combined the GIS information with a size structured forest growth model, calibrated on the basis of regional inventory data (Wernsdörfer et al. 2012; Colin et al. 2017) (Table 2). We used it to project the expected space, time and tree size structure of the bio-molecules for the next 20 years, in both standing trees and trees that were expected to be logged for other uses (i.e. for construction, industrial and energy wood). In our case, the future scenarios of potential harvesting were constructed in collaboration with relevant forest agencies. We estimated the future biomass of bio-molecules in standing and felled logs, and the uncertainty was estimated using a bootstrapped sensitivity analysis (1000×). These results are not presented in this section, but were used to produce molecular biomass information along the supply chain in the following section.

Table 2 Summary of the National Forest Inventory (NFI) plots used to generate the growth forecasts with the matrix population model MARGOT (Wernsdörfer et al. 2012; Colin et al. 2017) applied on the silver fir population of the Grand Est region for the 5-year period 2017–2037

3.3 Bio-molecules contained in by-products coming from suppliers

We used the 2017–2050 projections for the biomass of bio- molecules contained in felled logs as input for a supply chain model (Pichancourt et al. 2018, 2020). We projected them along a scenario of a regional supply chain reconstructed using several sources of data (Fig. 5): (i) the latest 2017 data for the Grand Est region from the French national forest and timber supply chain survey (Normandin 1990; Paquet and Deroubaix 2003; AGRESTE 2017); (ii) results from a previous material flow analysis for the region (Lenglet et al. 2017; Pichancourt et al. 2018); and (iii) data that we collected during semi-structured interviews with specific industrial actors, under confidential data agreements.

Fig. 5
figure 5

The silver fir supply chain operating in the French Grand Est region was modelled using the flux manager module of a wood supply chain analysis software (Pichancourt et al. 2018). In this scenario, 20% of the by-products (bark, knots) of the manufacturing of sawn timber, pulp and paper (initially sold for energy wood) were reallocated to extract bio-molecules.

For the scenario, 1000 runs of pair-wise Monte Carlo uncertainty and sensitivity analyses applied on various parameters of the supply chain model showed that significant annual quantities of the previously mentioned bio-molecules could be extracted during the next 20 years from the by-products made of knotwood (Fig. 6a) or barkwood (Fig. 6b) from sawmills and paper mills.

Fig. 6
figure 6

Bio-molecules contained in wood by-products from sawmills and paper mills. Predicted cumulated quantities (2017–2037) of families of bio-molecules that can be extracted from (a) knotwood by-products and (b) barkwood by-products, produced by forestry industries using silver fir harvested across the 1,465,034.8 ha of forests present in the Grand Est region of France. Values were estimated using the wood supply chain software presented in Pichancourt et al. (2018) and connected with a bio-chemistry module (Pichancourt 2020). Bars represent the 95% confidence intervals estimated using 1000x bootstrapping

3.4 Regional impact assessment of the extraction of bio-molecules

Reorienting part of the by-products for industrial extraction of bio-molecules is expected to represent a very small fraction of the wood biomass (measured in product biomass [kg]) that should have been used for energy for the next 20 years (Fig. 7a). This amount is also insignificant when compared to the total wood biomass fraction used to build all other classes of end-products (buildings, furniture, paper, wrapping). For carbon balance accountability, the production and use of these bio-molecules is predicted to reduce the carbon footprint of the regional supply chain (re-scaled as CO2 content [kg CO2eq]: Fig. 7b) by avoiding the burning and release into the atmosphere of several thousand tonnes of CO2 from wood residuals over the next 20 years.

Fig. 7
figure 7

(a) Expected cumulated biomass of the various types of wood products, including the bio-molecules (extractives), that can be produced by 2037 in the Grand Est region of France. (b) Cumulated GHG emissions saved by re-routing by-products produced by the forestry industries, for the industrial extraction of bio-molecules. Values were first estimated per hectare using the carbon accounting module of a wood supply chain software presented in Pichancourt et al. (2018). They were then re-scaled for the 1,465,034.8 ha of forests containing silver fir present in the Grand Est region. The error bars in (a) and grey ribbon in (b) represent 95% confidence intervals using 1000x bootstrapping

4 Discussion

In our study, we demonstrated the possibility to produce and articulate diverse information about bio-molecules obtained at different scales. Based on this unique experience, we now discuss some problems and methodological challenges that can emerge for this procedure.

4.1 Challenges for the selection of the best species and biomass components

The supply chain actors that we met had quantitative knowledge of the extractable fraction of lignocellulosic macro-molecules, because of their historical economic interest in producing wood products and biofuel, their low requirements for homogeneity, and low variability among species (Naik et al. 2010; Jayasinghe and Hawboldt 2012). However, their knowledge of other bio-molecules was not as good for the reasons we mentioned above. In this first exploratory study, we show that intensive data collection and analysis at a tree level were required to provide a complete picture of the bio-molecules for silver fir, justifying the fact that we only processed eight trees. The same problem has been encountered for similar initiatives in Canada (Royer et al. 2010; Royer et al. 2012; Royer et al. 2013). Before extending our work to other species, our results would have to be validated on more trees and compared to direct allometric model approaches that predict the biomass of some tree components (Gómez-García et al. 2015). Our allometric models could further be extended to predict how the biological and chemical characteristics of the biomass components vary with local ecological factors (Piispanen et al. 2008; Rivoire et al. 2010; Genet et al. 2011; Hounzandji et al. 2014; Kilulya et al. 2014), or with other environmental stressors on standing trees (Lewinsohn et al. 1991; Kopaczyk et al. 2020), or even on felled trees at logging or stocking sites (Anttila et al. 2013). However, regardless of how well we can simplify the sampling protocol, the choice of one species or biomass component is not only about wood chemistry. Supply chain actors that we met mentioned the need to consider the monetary value of bio-molecules and market demand (Fig. 3). They want to run cost-benefit analyses, rank the species and biomass components, and decide whether it is worth moving away from other timber uses. In this case, multi-criteria optimisation methods need to be developed.

4.2 Challenges for the selection of the best harvesting locations

To the best of our knowledge our work represents the first attempt to publish a distribution map or temporal projections of the composition and quantity of bio-molecules per hectare for a species (see Fig. 4). Nevertheless, several methodological challenges still remain to be solved.

The first challenge is related to the resolution of the NFI sampling scheme. The number of trees currently sampled statistically limits the spatial scale at which we can produce results for bio-molecules across space and time without producing interpolation errors (IGN 2018). Because the number of silver fir trees varied from one place to another, it was difficult to predict the minimal scale at which we can make interpolation beforehand. The GIS platform that we developed automatically integrates these constraint, prevents further analyses, and suggests that the user should select a larger area. However, our interactions with forest actors during thematic meetings and workshops suggested that the scale at which they could make predictions for silver fir was not always adequate. For instance, small sawmills preferred to obtain information about bio-molecules at the scale of a small harvesting catchment or even of a single private forest farm, which we could not provide. For them, we would need to develop better methods to combine data from current NFI sampling schemes with higher resolution of remote or local inventory data (see, e.g., Kangas et al. 2019).

The second challenge is related to determining the best forest resource catchments to invest in for future industrial extraction. In our case, criteria other than biological ones would need to be considered, such as the cost of transportation, the presence/absence of a road network, of forestry infrastructures, of land conflicts between timber objectives and some ecosystem services (e.g. carbon sequestration, biodiversity conservation, cultural services). Our GIS platform already had these layers of information, but the supply chain actors that we met were eager to select the best location by considering this diversity of social-ecological criteria. We think that the spatial optimisation algorithms presented by Ananda and Herath (2009) could be of use to solve these multi-dimensional problems.

The last challenge is related to the use of GIS platforms for the prospection of bio-molecules at a large scale. Our platform was tested on the Grand Est region for precise molecular estimates, but could be used at a national scale for less precise metrics. Moreover, some scientists and supply chain actors from neighbouring countries asked us to extend this study to better match the spatial structure of forest resource catchments and some supply chains (Mukhopadhyay 2006). However, the problem is that NFIs are frequently not harmonised between neighbouring countries (Vidal et al. 2016), and even if data harmonisation and infrastructure are operational (as planned between the EU member states), this may not be sufficient to implement a cross-country forest information system for strategic reasons (Baycheva-Merger and Sotirov 2020). Targeted R&D initiatives like ours may help solve this problem.

4.3 Challenges for the selection of the best suppliers

For the supply chain step, we demonstrated that it was possible to use models to predict the best supplier and the biomass and diversity of bio-molecules that could be extracted. As of that point, we scan anticipate three remaining challenges.

First in our supply chain projections, we included various sources of uncertainties (see Figs. 3 and 4). However, others have not yet been factored in, including those found in the literature. A first source of uncertainty is related to knot removal by transformation industries. Interviews that we conducted with industrial actors showed that sawmills and paper mills use machines that cut a bit more wood around the knots (1–3%) than what was measured by CT-scan (\(\thicksim ~1\%\)). The exact quantity can change from one mill to another, depending on the exact parameterisation of the machine used. However, we do not know its variability at a regional scale. A second source of uncertainty is related to carbon balance calculations when modelling the industrial process of extraction. We do not have any information about the carbon that would be emitted during the process. However, various industrial extraction pathways can be used, each with its own GHG emission efficiency (Liao et al. 2020). Including these two sources of uncertainties into supply chain projections may change the final results we presented in Figs. 6 and 7.

Next, using our simulation-based supply chain model, we analysed whether re-routing the biomass from two supply chain actors could significantly increase the supply of bio-molecules. The non-analytical nature of our model required us to make targeted sensitivity analyses by presenting a specific scenario of supply chain adaptation. However, extending this logic to analyse the sensitivity of the entire supply chain structure in relation to each and every industry would have been preferable. Global sensitivity analyses of this kind are complex analytical questions, and the literature on the topic is sparse (Rabelo et al. 2011). An interesting avenue would be to instead use mathematical models of supply chain dynamics. They are better designed for implementing global analytical sensitivity analyses, like those that have been already applied, for example, in population dynamics (Caswell 2001). The other interest is that the forest growth models could be incorporated into the same system of equations, given that these models are frequently based on the same mathematical formalism (like the one we used) (Wernsdörfer et al. 2012; Colin et al. 2017). By doing this, it would be possible to further perform novel global sensitivity analyses at the intersection between the ecological dimension and the economic dimension. For instance it would be possible to analyse the sensitivity of specific resource and supplies of bio-molecules in response to the perturbation of the demand for other bio-molecules or wood products.

Finally, regardless of how well we perform such sensitivity analyses, actors still need to evaluate whether some solutions would be more optimal or viable than others when considering other criteria. In this regard, several questions arise that would benefit from applying better optimisation algorithms. For instance, is it preferable that each industry strategically invest in its own specialised biorefinery? Or should they collectively invest in a big generic one that is strategically placed for everyone to contract with? Similar questions also arise for bio-energy conversion facilities (Elia and Floudas 2014; Thiriet et al. 2020). Then, once the biorefinery(ies) is(are) constructed, where should actors tactically establish the stocking sites to sustain the level of extractable bio-molecules inside the logs (Melo et al. 2009)? For such mid-term problems, viability theory algorithms are well suited to help minimise such risks under well defined constraints (Aubin 2009; Chavas 2015). Third, once the stocking sites are established, industrial actors need to sign contracts to supply by-products to one of the potential biorefineries. The nature of the contracts depends on many supply chain and ecological constraints. We think that Elia and Floudas (2014) and Rahmani et al. (2016) and Cambero and Sowlati (2016) present interesting optimisation tools for such complex multi-dimensional problems.

5 Conclusion

Before making decisions, supply chain actors with an interest in bio-molecules require specific background knowledge. It is not easy to obtain since it requires an extensive effort of scientific coordination to produce knowledge that spans different scales of observation, i.e. at the scales of bio-molecules, biomass components of trees, regional resources and, finally, the entire supply chain covering different transformation industries with their primary products and by-products, and other stakeholders. A major issue still is that adding new scales added more dimensions to the problem, making it harder to connect data, models and software and, ultimately, make sound decisions. Nevertheless, we demonstrated that once this connection was achieved, it was possible to draw a simple knowledge and decision pathway, containing a reduced set of steps. We demonstrated that the information produced for one step (scale) could be reasonably used to produce new information and solve a decision problem at another scale. We suggested efficient analytical, modelling and decision theory methods for those who want to optimise the decision process according to a diversity of socio-economic criteria. We hope that this study will help to design a more generic knowledge infrastructure and methodological framework, and that they could be used to solve a wide diversity of decision problems within the context of various species, resources and supply chains.