FormalPara Contents

1 Introduction

Throughout history, controversies have played a critical role in advancing scientific progress. They are essential to the emergence, development, and critical evaluation of concepts and methods. Disagreement can also stimulate new ways of interpreting data and help integrate different viewpoints on a subject (scientific, economic, ethical, political) (Dunlop and Veneu 2019). Although this process is essential to the advancement of research, scientific controversies are sometimes misinterpreted by the agricultural community and the general public and may undermine support for the scientific community. Scientific controversies also hamper the provision of univocal scientific knowledge to inform decision-making.

The recent example of the COVID-19 crisis has illustrated the difficulty of taking rapid action in the context of scientific controversy. In a similar way, ongoing debates about the practices and mechanisms controlling the storage of carbon (C) in soil run the risk of impeding climate change mitigation strategies. In particular, the “4 per 1000” initiative, which aims to promote actions and practices that can store C in soil—with associated benefits for food security and climate (Soussana et al. 2019)—has generated intense debates (Minasny et al. 2017 and the subsequent related commentary papers). The controversies associated with 4 per 1000 are both technical and political. Technical considerations relate to the methods used to calculate soil C stocks, possible overestimations, quantification of the effects of biomass uses, issues related to nutrient cycling or greenhouse gas emissions, and to the definition of an initial baseline (Larrère 2018). On the political side, the initiative could be used as a pretext for not reducing anthropogenic C emissions.

Debating some of the current technical controversies related to the “4 per 1000” initiative was the objective of the second seminar of the scientific network CarboSMS (Carbon Stabilization Mechanisms in Soil), which serves as the inspiration for this review. The CarboSMS collective brings together both researchers and stakeholders from the French-speaking community who work with scientific and operational issues related to the mechanisms that affect C storage in soil (Dignac et al. 2017) (Fig. 1 and 2).

Fig. 1
figure 1

The future of soil C sequestration is in our hands (credit: D. Derrien).

Fig. 2
figure 2

Everything we do on the soil has an impact on the soil (credit: J. Balesdent).

This review addresses the subject of C storage in agricultural and forest soils by adopting an original approach: it highlights various differences of opinions and proposes some opportunities for reconciliation as well as orientations to support decision-making despite the controversy. Our aim is not to be exhaustive, but to stimulate debate and research in critical areas. In the first part, we describe some current disagreements regarding the mechanisms of C accumulation and loss in soil (Fig. 3). The second part examines the advantages and disadvantages of processing massive datasets and using mechanistic models, with the goal of synthesizing knowledge on C storage mechanisms and generating predictions of C stocks for discussion with stakeholders. The third section examines the main controversies regarding biomass harvest, as well as agricultural and forest management practices that are recommended for increasing soil C stock. It provides advice and orientation for scientific action to better support practitioner and policy-maker decisions (Fig. 4).

Fig. 3
figure 3

Controversial topics related to the mechanisms that control C storage in soils. (SOM: soil organic matter, OM: organic matter). Recalcitrance has long been considered the major process driving OM persistence in soil, but this has been questioned by a large number of publications in the last decade (section 2.1). A debate has emerged on the nature of persistent C. Often considered as microbe-derived, can it be also plant-derived (Section 2.2). The relevance of particulate organic matter (POM), which contains relatively young C, for accumulating additional C in soil has been recently debated (Section 2.3). Two main theories are discussed in the literature on the factors controlling C dynamics in soil: spatial inaccessibility and a theory based on the bioenergetic constraints of SOM degradation, which we call “energetic inaccessibility” (Section 2.4).

Fig. 4
figure 4

Linking research questions on C storage mechanisms to operational questions.

2 Controversial issues and new challenges in research on mechanisms controlling C storage in soil

Additional C storage in agricultural and forest soil results either from additional C inputs or from increased preservation of soil organic carbon (SOC), which reduces C loss from soil. Both types of processes have a positive effect on climate change by alleviating the amount of C transferred from the soil to the atmosphere as CO2 or CH4 and should therefore be favored by soil management practices. However, there is a lack of consensus in the scientific community on the mechanisms leading to C storage. In this section, we discuss the currently conflicting views on (1) the notion of chemical recalcitrance, (2) the importance of microbial transformations for C persistence in soil, (3) the contribution of particulate organic matter (POM) to additional C storage, and (4) the preservation of organic matter (OM) in soil due to its spatial or energetic inaccessibility (Fig. 3). We finally conclude with some opportunities for reconciliation among the controversial theories.

2.1 Chemical recalcitrance: should it be rehabilitated?

Different definitions of “recalcitrance” can be found in the literature. The most commonly accepted definition in soil science, which we adopt here, is the intrinsic chemical property of a molecule that makes it resistant to decomposition (Kleber et al. 2011; Sollins et al. 1996). Chemical recalcitrance is usually associated with complex and/or polymerized structures (Bertrand et al. 2006; Marschner et al. 2008). These recalcitrant structures may be (i) initially present in plant litter, such as lignin aromatic structures and some aliphatic structures (e.g., Berg 2014), (ii) formed during decomposition (Stevenson 1994) or (iii) formed during thermal degradation (e.g., charcoal formed by wildfires). Recalcitrance is one of the older theories about C persistence in soil (Fig. 5) and has long been considered the main process driving the persistence of OM in soil. However, in recent decades, this concept has been questioned by a large number of publications (e.g., Dungait et al. 2012; Kleber 2010; Kleber and Johnson 2010). We discuss here the approaches that can be used to identify recalcitrant OM, the issues that have made this concept questionable, and whether it can be rehabilitated.

Fig. 5
figure 5

Timeframe of the emergence of the different theories about OM persistence. These theories are not mutually exclusive and can, in most cases, be reconciled. In most ecosystems, several processes act at the same time. In addition, priming appears as a cross-cutting theory, which can explain the shift from persistent to non-persistent SOM. Some theories appear contradictory, such as the persistence of microbial products suggested in the 2000s and the recalcitrance theory suggesting that simple, small molecules are not persistent (see Section 2.2). This does not necessarily invalidate one or both of the theories but highlights the importance of environmental conditions in shaping these processes.

2.1.1 A concept partly based on questionable methods

Two methods have historically been used to characterize chemical recalcitrance: litter incubations and measurements of C age in soil organic matter (SOM) fractions (Marschner et al. 2008). Both methodologies have demonstrated a link between chemical structures and C dynamics. However, the C age method estimates recalcitrance on the basis of field experiments examining a timescale of several decades or even centuries, whereas the litter incubation method assesses recalcitrance on the scale of months to years, often in laboratory conditions. The first limitation of the concept of recalcitrance is thus that it is based on methodologies that refer to different time scales, which prevents the development of a single coherent narrative.

A major debate has arisen around the recalcitrance concept in the 2010s with the criticism of the humification theory. According to this theory, products of the degradation of organic substrates are recombined into complex polymers enriched with aromatic functional groups, called humic substances (Schulten and Leinweber 1996). Initially, humic substances were isolated using a soil chemical extraction technique designed to separate organic from mineral compounds (Stevenson 1994). The isolated fraction was found to be enriched in aromatic compounds containing 14C that was old in age and was interpreted as recalcitrant. Such results are now widely questioned because, with this extraction protocol, it is possible to generate artifacts in the form of supramolecular assemblies that form under these very specific chemical conditions and are therefore not representative of soil compounds (Myneni 2019; de Nobili et al. 2020; Olk et al. 2019; Sutton and Sposito 2005). Furthermore, despite the use of spectroscopic techniques with very fine-scale resolution, the existence of complex polymeric humic substances in soil has not been proven and some authors have suggested that the thermodynamic investment for decomposers to condense hypothetical “humic substances” would be too high (Kleber and Lehmann 2019; Lehmann and Kleber 2015).

Nevertheless, the theory of SOM condensation still has its defenders who argue that complex structures may naturally form through the condensation of certain organic compounds via the influence of reactive oxygen species generated by brown rot saprotrophic fungi (Goodell et al. 2017; Yu and Kuzyakov 2021) or by mineral phases under specific conditions (Kleber et al. 2021). However, the contribution of these observed condensation processes to the genesis of persistent OM in soil remains to be elucidated.

2.1.2 A concept partly based on misunderstood results

At the same time as the concept of humification was called into question, a growing number of studies were quantifying the mean residence time of C in specific molecular structures. In most molecules, the mean residence time of C was found to be similar to that detected in bulk soil (Amelung et al. 2008; Schmidt et al. 2011), contradicting the idea of a specific recalcitrance of some compounds. Even lignin C, which was suspected to persist in soil, was shown to turn over more rapidly than bulk SOC and to exhibit shorter mean residence times (Amelung et al. 2008; Dignac et al. 2005; Gleixner et al. 2002). The only reported exception was pyrogenic C (Schmidt et al. 2011).

This paradox can be explained by several biases encountered in studies of C turnover time in soil molecules. First, an individual biochemical family comprises a very large number of molecules with diverse properties. For example, in the lignin family, the chemical structures of wood lignins are very different from those of the stems of annual plants. Thus, the residence times obtained for C in lignin extracted from maize (rapid turnover, Dignac et al. 2005) cannot be generalized to lignin molecules from other plant species. Then, only some of the members of a given molecular family can be analyzed, i.e., those that can be isolated by wet-chemical methods and that are analytically detectable. It is generally assumed that their C residence time is representative of that of the whole family, which is highly debatable. Another bias is related to the misuse of certain models to assess the mean residence time of C in specific compounds (Derrien and Amelung 2011). Typically, models assessing C mean residence time in microbial compounds must also take into account the initial transit of C in a plant residue, what is not always done. Similarly, there is often a confusion between the age of a molecule and the age of C in that molecule. As an example, let us consider a tree that is several hundred years old. When it dies, the fungi that degrade its wood use the “old” C to create new molecules. The C in these molecules is much older than the fungal molecules themselves and its age is completely unrelated to their recalcitrance. For this reason, one should not claim that a molecule with old C atoms is a recalcitrant molecule. It is necessary to separate the concept of recalcitrance from C age (Kleber et al. 2011): the old C in a molecule may have been recycled from another molecule, and this process may have occurred many times (Derrien et al. 2006).

2.1.3 Revisiting recalcitrance in light of other persistence processes

The concept of chemical recalcitrance has also been challenged in the last couple of decades by a considerable number of studies investigating other processes of SOM dynamics (Fig. 5) (Ekschmitt et al. 2005; Schimel and Schaeffer 2012). In particular, a significant research effort has been made since the late 1990s to understand how OM can be preserved through association with mineral phases, which reduces accessibility to decomposers (Kravchenko and Guber 2017; Panettieri et al. 2017; Virto et al. 2010). At the same time, microbial ecology has also provided new insights by showing that OM breakdown depends on the interplay of different microbial communities characterized by their functional diversity (Fanin et al. 2016; Schneider et al. 2012; Bardgett and Van Der Putten 2014). OM is degraded by a succession of microorganisms producing extracellular enzymes that progressively depolymerize and oxidize molecules, releasing small organic compounds and nutrients available for microbial uptake (Amin et al. 2014; Sainte-Marie et al. 2021; Schneider et al. 2012). Although most of the soil enzymes are redundant in terms of the attacked biochemical classes, enzymes greatly differ in terms of structure. As a result, each enzyme breaks some specific bonds with a particular efficiency. The nature of persisting SOM in turn depends on microbial diversity, which controls enzyme functional diversity. Additionally, the activation energy necessary to access and decompose a molecule can be impossible to provide for some microorganisms but provided by others (see Section 2.4). For example, white and brown rot fungi produce enzymes capable of attacking lignins in litter, which most soil microorganisms are unable to do (Floudas et al. 2012; Janusz et al. 2017). Nevertheless, all of the known processes that favor the persistence of SOM (Fig. 5) are related to OM chemistry. For example, the nature of the functional groups in OM governs its interactions with the protective mineral phases, as well as with other organic compounds in the soil. To summarize, the intrinsic properties of organic molecules undoubtedly play a role in C dynamics in soil, but they need to be considered together with the biotic and abiotic parameters of the environment. For this reason, the term recalcitrance should no longer be applied to qualify the global dynamics of a molecule in soil; instead, the term persistence should be used.

2.1.4 How can recalcitrance be rehabilitated, and how can it be used in a practical way?

As discussed above, the chemical properties of a molecule are not the predominant driver to explain its dynamics, especially in consideration of different sites with contrasting biotic and abiotic properties. However, for a given set of pedoclimatic conditions, molecule properties do have an influence (e.g., in Versailles, France, different turnover for C in lignin and plant polysaccharides, Derrien et al. 2006; Dignac et al. 2005). The long residence time of the condensed structures of pyrogenic OM in soil (Lehmann et al. 2015) is another good reason to avoid sweeping away recalcitrance.

To rehabilitate the controversial concept of chemical recalcitrance, we propose to define distinct scales of recalcitrance that are applicable in the major pedoclimatic contexts. Practically, a set of targeted pedoclimatic conditions should be identified, encompassing various ranges of nutrient availability, mineral phase properties, decomposer needs and functionalities, and so on. The operational calibration of these recalcitrance scales should be based on standardized degradation tests carried out in the selected pedoclimatic conditions for various molecules/substrates at monthly to pluriannual timescales. Such recalcitrance scales could be disseminated to stakeholders in order to support the selection of additional organic inputs to the soil that favors SOC accumulation under local pedoclimatic conditions (see Section 4.2). They could also contribute to the improvement of Earth System Models by prompting updated versions that go beyond the old concept of recalcitrance.

2.2 Microbial transformation of plant OM: is it a prerequisite for its persistence in soil?

The plant or microbial origin of OM persisting in soil on a multidecadal scale is still a topic of much debate. It was long assumed that persistent SOM mainly originates from plant inputs and forms according to mechanisms that depend on the biochemical quality of these inputs. Some plant biopolymers, such as lignins and long-chain lipids, were thus considered slowly degradable or recalcitrant (see Section 2.1). However, studies conducted over the last decade have suggested that microbial compounds are a major contributor to persistent organic C, and currently, the prevailing opinion is that microbial transformation is a prerequisite for SOM stabilization (e.g., Dynarski et al. 2020; Kallenbach et al. 2015; Liang et al. 2019). We review here how microbial compounds persist in soil despite being theoretically easily assimilable by decomposer. We also present evidence of contexts that lead to the persistence of plant compounds and discuss the preferential use of preserved plant or microbial compounds by decomposers.

2.2.1 How microbial compounds are preserved in soil

Since the early 2000s, technological advances in molecular analysis have improved our knowledge on the chemistry of persistent SOM (Kögel-Knabner 2002; Amelung et al. 2008; Gleixner 2013). Using these new techniques, some studies have argued that the molecular composition of persisting OM is closer to that of microbial constituents than to that of plant tissues (Clemmensen et al. 2013; Kallenbach et al. 2016; Liang et al. 2017; Liang et al. 2019; Miltner et al. 2012; Wang et al. 2021). Furthermore, it has been shown that persisting OM is preferentially found in fine soil fractions (von Lützow et al. 2007), which are known to be enriched in compounds derived from microbial OM (e.g., Kopittke et al. 2018).

To understand the mechanisms that could explain the persistence of microbial compounds, we first need to examine their chemical nature. Products of microbial metabolism are exuded and excreted by active microbes or released after cell death (Hofman and Dušek 2003), and range from low-molecular weight and soluble compounds (such as simple sugars, organic acids, and amino acids), to proteins and storage and structural polysaccharides (such as starch and chitin). In theory, these structures are not particularly “recalcitrant,” that is, not complex in their chemical structure (see Section 2.1) (Hopkins and Dungait 2010; Lorenz et al. 2021; Malik et al. 2016). In vitro decomposition experiments under controlled conditions have shown that they may decompose very rapidly (sometimes in less than an hour), probably because they are rich in energy or nutrients, easily accessible to organisms, and rapidly assimilated (Boddy et al. 2007; Hill et al. 2008).

There are several explanations for the apparent paradox of non-complex molecules remaining in soils for a very long time. First, we must remember (Section 2.1) that the age of the C in a microbial molecule does not reflect the age of the molecule, which may be constituted of C that was recycled from other molecules. Nevertheless, it has been argued that the large contribution of microbial compounds to SOM only makes sense if they are somehow protected from continuous microbial recycling (e.g., Derrien et al. 2006). This protection could be due either to chemical or physico-chemical bonding to minerals or to physical occlusion in small aggregates, metal oxides, or short-range-ordered minerals (Kögel-Knabner et al. 2008; Berhe et al. 2012). OM that becomes physically or chemically inaccessible as a result of association with minerals is referred to as mineral-associated OM (MAOM). This is a finer, heavier fraction of OM, which, in addition to the larger and/or lighter particulate organic matter (POM), constitutes the bulk of SOM. We adopt this definition of MAOM in this paper, but must point out that other authors use a more restrictive definition of MAOM that only includes OM chemically bound to mineral surfaces and excludes OM physically protected in aggregate (see Section 2.3). Molecules biosynthesized by microbes have a stronger affinity for protective mineral phases than plant compounds (principally because of their polarity, Kleber et al. 2021), which could be a reason for their large contribution to persisting OM (Cotrufo et al. 2015; Hatton et al. 2012; Kallenbach et al. 2015; Kallenbach et al. 2016; Liang et al. 2017; Miltner et al. 2012; Rillig 2004; Traore et al. 2000). When associated with mineral phases, microbial compounds become less accessible to decomposer enzymes and uptake by decomposer (see Section 2.4). Moreover, accessible mineral surfaces are hot spots of microbial growth and, thus, hot spots of preserved microbial necromass in the form of MAOM (Uroz et al. 2015; Witzgall et al. 2021). Finally, the persistence of microbial compounds could be related to their lower energy content compared to plant compounds, as measured using thermal analysis (see Section 2.4). Indeed, under anaerobic conditions, certain microbial lipids with a low oxidation state might not be used by decomposers due to energetic restriction (Keiluweit et al. 2016).

2.2.2 How plant compounds are preserved in soil

The theory that OM persisting in soil is mainly composed of microbial metabolites is contradicted by numerous observations at long-term experimental sites of partially decomposed plant organic residues that persist several decades after their input into the soil (e.g., Amelung et al. 2008; Barré et al. 2018). These studies demonstrate that both microbial and plant-derived compounds contribute to persistent SOM.

In some ecosystems, the persistence of OM of plant origin in soil may be due to environmental conditions that limit microbial activity, such as anoxia, lack of water, low temperature, or acidic pH (e.g., Keiluweit et al. 2016; Trumbore 2009) (Fig. 6). This is the case in alpine soils (Budge et al. 2011), high-latitudes soils (Kohl et al. 2018), peatlands (Leifeld and Menichetti 2018), permafrosts (Pengerud et al. 2017; Schuur et al. 2008), and built-up or urban soils (Allory et al. 2022; Cambou et al. 2018; Rees et al. 2019). As an example, in soil submitted to anoxia, the decomposition of certain plant lipids and aromatics could be thermodynamically hampered—as mentioned above for microbial lipids (Keiluweit et al. 2016) (see Fig. 6 and Section 2.4).

Fig. 6
figure 6

Examples of conditions and processes leading to plant residues persistence.

The persistence of plant residues may also be due to their association with protective mineral phases. A meta-analysis by Angst et al. (2021) indicated that 50% or more of MAOM (in the broad sense defined above) would be of plant origin, based on the quantification of amino sugars as biomarkers of the microbial-derived OM. The formation of MAOM of plant origin derives from the direct occlusion of particulate plant OM within microaggregates (Lehmann et al. 2008; Virto et al. 2010), which is promoted by detritivore activity (Lavelle and Spain 2001) (Fig. 6). Plant-derived MAOM may also form via the interaction of reactive mineral surfaces with soluble plant-derived compounds (e.g., rhizodeposits, forest floor leachates) (Hagedorn et al. 2015) or with plant polymers transformed by enzyme oxidation or depolymerization that increase their reactivity toward mineral phases (Kleber et al. 2015; Liang et al. 2017; Liang et al. 2019; Wang et al. 2017). This association with protective minerals can occur as soon as plant OM enters the soil. In particular, plant belowground inputs in the form of root tissues or exudates may penetrate aggregates (Freschet et al. 2018; Rasse et al. 2005; Poirier et al. 2018).

The relative contributions of microbial and plant products to SOM are influenced by both land use and soil type (see Section 4.2). The contribution of microbial compounds is reported to be greater in grassland soils as well as in fertile Chernozems or Luvisols (Angst et al. 2021). Instead, plant compounds predominate when soil conditions are less favorable to microbial growth or in coarse-textured soils characterized by organo-metal complexes or short-range-ordered minerals, such as forest soils, Podzols, Ferralsols, Gleysol, or Alisols (Angst et al. 2021; Hall et al. 2020; Kögel-Knabner and Amelung 2021). Furthermore, the overall contribution of MAOM to C storage also varies among soil types (see Section 2.3).

Thus, evidence from the literature of the preservation of both microbial- and plant-derived compounds contradicts the view that microbial transformation is a prerequisite for OM preservation in soil. To improve our understanding of the preferential persistence of microbial or plant-derived OM, we should further investigate patterns of their preferential utilization by decomposers to meet energy or nutrient needs (Bernard et al. 2022) and the likelihood of random encounters between soluble substrates and decomposers (Don et al. 2013; Dungait et al. 2012). There is notably a significant knowledge gap on how variations in micro-environmental conditions affect the accessibility of different OM sources (Bailey et al. 2020) and the energy gained by microbes through their consumption (see Section 2.4).

2.3 Is particulate organic matter an effective lever for climate change mitigation?

The POM fraction refers to organic soil particles that are not associated with minerals. POM consists of coarse plant debris, characterized by a turnover that is generally faster than that of bulk SOM and rather young C (generally a few years old) (Antón et al. 2022). Therefore, POM may not seem to be the most relevant fraction for accumulating additional C in soil (Dynarski et al. 2020). However, one may still question whether POM accumulation could make a meaningful contribution to climate change mitigation. Here, we review the POM properties, its contribution to SOC stock, and its mean residence time, before examining the detailed arguments supporting the use of POM as a means of climate mitigation.

2.3.1 Properties of POM

Before discussing properties of POM, an important distinction should be noted. POM has been defined either as a conceptual pool that complements MAOM (see Section 2.2) (Lavallée et al. 2020) or as an operational pool (Poeplau et al. 2018), which is isolated by density or size combined with density, using physical fractionation techniques (typically, density < 1.4–1.8 g cm-3 and size > 50 μm) (Hénin and Turc 1949; Leifeld and Kögel-Knabner 2005; Poeplau and Don 2013; Poeplau et al. 2018). The difficulty of measuring conceptual pools and the diversity of fractionation protocols (Lavallée et al. 2020) have led to variation in the properties of POM and MAOM among studies and have impeded generic characterization. Consequently, scientists must ensure in their studies that they clearly explain how they define POM and MAOM (see also Section 2.2.1). In the following, POM is considered to be mineral-free and to mainly consist of decomposing plant fragments—with this definition, “occluded POM” is considered as part of MAOM. POM originates from roots and aerial parts of on-site vegetation or from inputs of organic amendments. It is progressively fragmented and incorporated into the soil through the action of living organisms. Pyrogenic C (e.g., charcoals from forest fires) is also found in this fraction (Paetsch et al. 2017). The size of the POM reservoir and its biochemical composition depend on vegetation type, soil biodiversity, and management practices (Section 4.2). Indeed, the magnitude of biomass harvest has a direct impact on plant inputs to soil and on the size of the POM pool (Section 4.1).

2.3.2 Contribution of POM to soil C stock

The contribution of POM to SOC is highly variable (Georgiou et al. 2022) and is strongly affected by soil type and depth (Kögel-Knabner and Amelung 2021; Soucémarianadin et al. 2019). It is higher in alpine soils, cryosols, and soils with few reactive minerals (Hagedorn et al. 2019; Kögel-Knabner and Amelung 2021; Leifeld and Fuhrer 2009) than in clay soil from temperate low-elevation regions. POM also depends on soil use and vegetation cover. At the scale of the European LUCAS Soil Network, POM can account for more than a quarter of SOC in the 0–20-cm layer of forests and grasslands, while this fraction is generally smaller in well-drained arable mineral soils (Cotrufo et al. 2019; Lugato et al. 2021; Wander 2004). In forest soils, the relative abundance of POM depends on tree species and is typically lower under broadleaves than under conifers (Cotrufo et al. 2019). In soils that are relatively poor in OM, C tends to be stored more as MAOM than as POM (Cotrufo et al. 2019). C storage in MAOM would be limited to a maximum saturation value (Cotrufo et al. 2019; Georgiou et al. 2022), while POM, instead, does not appear to be limited. The POM fraction, because of its unlimited size, would therefore be an interesting lever for additional C storage and may represent the main opportunity to increase C storage in soil with few reactive mineral phases. However, this additional C storage capacity of POM is still controversial since the duration of this storage, before POM degradation by decomposers, may represent a limit for its contribution to long-term SOC stock build-up.

2.3.3 Residence time of C in the POM fraction

The quantities and biochemical composition of POM are dynamic and transient in nature due to the seasonality of plant litter inputs and fluctuation in microbial activity (e.g., Puissant et al. 2017). Because POM is not physically protected from decomposers, it is often considered readily biodegradable (Baldock and Skjemstad 2000; Cotrufo et al. 2019; Paul 2016). However, the biochemical constituents of POM (e.g., cellulose, lignins, pyrogenic C) may exhibit a range of intrinsic biodegradation properties (see Section 2.1 about recalcitrance). In addition, POM use by decomposers depends on its N and P contents (which are often low), on the energy resources it provides to microorganisms, and on the energy required to break bonds within POM biopolymers (Berg 2014; Schmidt et al. 2011; Soucémarianadin et al. 2019) (see section 2.4). As a result, C residence times in POM can range from months to a few decades (Antón et al. 2022; Balesdent 1996; Derrien and Amelung 2011; Paul 2016). The notable exception to this is POM that contains substantial amounts of pyrogenic C, which often has average residence times greater than 100 years (Bird et al. 2015; Chassé et al. 2021; Lehmann et al. 2015).

The residence time of POM is further modulated by soil and climate conditions: it is increased by conditions unfavorable to microbial decomposition (waterlogged soils, nutrient limitations, anoxia, high latitudes and altitudes, arid zones, deep horizons) (Budge et al. 2011; Leifeld and Fuhrer 2009) and under plant species with inhibitory actions on microbial activity (e.g., soil acidification). Peatlands represent an extreme case where the residence time of C in POM can reach thousands of years (Leifeld and Menichetti 2018).

2.3.4 Limits and benefits of POM for C storage in soil

The generally short residence time of C in POM is a strong limitation to its utility for additional C storage over the long term. The sensitivity of POM to environmental changes that affect plant inputs or decomposer activity also carries a risk of C storage reversibility by the effects of climate change (Hagedorn et al. 2019; Rocci et al. 2021). This is the reason why some calculation methods of C sequestration potential at regional and national scales (e.g., Alvarez and Berhongaray 2021; Chen et al. 2018) do not consider the accumulation of C in POM but rely only on the finite capacity of the fine soil fraction to sequester SOC in the long term, as first conceptualized by Hassink (1997). However, estimating SOC storage potential only from the storage capacity of the fine fraction, in which C is assumed to persist but to reach saturation, is still a matter of debate. Some research has described MAOM as a composite fraction with some components turning over rapidly on a decadal scale (Schrumpf et al. 2021; Virto et al. 2010), while other authors have argued that soil sequestration potential is primarily determined by inputs rather than by the finite capacity of minerals to accumulate C (Barré et al. 2017).

Nevertheless, POM possesses several characteristics that are advantageous for its accumulation in soil. First, its rapid response to changes in land use and management (Lugato et al. 2021) and its potential pool size—which, unlike MAOM, is not limited—make it a relevant reservoir that can be mobilized in the short term for additional C storage via the implementation (and maintenance over time) of adapted practices (see Section 4.2). Second, increasing C storage in soil in the form of POM would not require additional immobilization of nutrients, in contrast with MAOM, for which stoichiometric requirements can act as limiting factor (see Section 4.2). Finally, the loss of free POM does not mean that its C has been lost from the soil: it may simply move to other compartments of SOM, such as MAOM. For this reason, in all conceptual models of SOM formation—such as humification, selective preservation, microbial decomposition, or the soil continuum model (Basile-Doelsch et al. 2020; Lehmann and Kleber 2015)—POM is involved at the very beginning of persistent OM formation, which it feeds through different mechanisms. Soil faunal activity can lead to the inclusion of POM in soil aggregates (Angst et al. 2019; Le Mer et al. 2020; Six et al. 1999; Vidal et al. 2019). In such circumstances, the age of C in trapped POM (“occluded POM” or “intra-aggregate POM”) can be older than the average age of the SOC (Virto et al. 2010). Some of the C in free POM is also converted to microbial metabolites that have a strong affinity for mineral surfaces and can be preserved over the long term in organomineral assemblages (e.g., Hatton et al. 2012; Six et al. 2006) (see section 2.2). Management practices must promote, as much as possible, this flow of C from unprotected POM to protected MAOM (Kallenbach et al. 2019). More generally, the fact that POM serves as hotspots of microbial activity can drive the formation of organomineral associations. The exopolysaccharides and hyphal network produced by POM-consuming microbes glue together finely sized minerals and constitute a nucleus for aggregate formation and soil C persistence (Witzgall et al. 2021).

2.4 Is soil organic matter persistence driven by spatial or energetic inaccessibility?

Regarding the factors that control C dynamics in soil, two main theories have been recently presented in the literature (Fig. 5): one based on spatial inaccessibility (Chenu and Stotsky 2002; Pinheiro et al. 2015) and the other based on the bioenergetic constraints of SOM degradation (Fontaine et al. 2007; Barré et al. 2016), which we will refer to here as energetic inaccessibility. Whether spatial or energetic, limitations on SOM degradation correspond to the difficulty of substrate use by decomposers. Is it possible to reconcile these two theories? In this section, we define the spatial and energetic inaccessibility of soil C and examine their drivers. We discuss whether spatial inaccessibility may be examined from an energetic perspective and also consider nutrient inaccessibility as a third possible factor controlling soil C dynamics.

2.4.1 Spatial inaccessibility

A substrate might be spatially inaccessible as a result of two distinct processes: (i) its inaccessibility to decomposer enzymes and (ii) its impossible uptake by decomposers while it occurs as oligomer or monomer. Spatial accessibility is an issue at both the nano- and microscales, e.g., through the adsorption of OM onto mineral phases or its entrapment in environments with reduced pores, such as soil microaggregates or mineral-organic coprecipitates (Erktan et al. 2020; Hemingway et al. 2019; Kleber et al. 2015; Kravchenko and Guber 2017; Rowley et al. 2018). It can also be an issue at the macroscopic scale, for example, in the case of a mulch applied to the soil surface that is not in contact with soil organisms (Bleuze et al. 2020). Spatial inaccessibility depends on soil physico-chemical properties that determine the physical protection of OM such as the nature of the mineral phases, pH, the composition, and ionic strength of the soil solution (Newcomb et al. 2017; Sposito 2008), and the soil water content, which controls the diffusion of OM/nutrients and the mobility/motility of organisms in the soil porosity (Kleber et al. 2021). It also depends on the actions of mesofauna and earthworms, which alter pore geometry and connectivity (Erktan et al. 2020).

2.4.2 Energetic inaccessibility

Energetic inaccessibility corresponds to an unfavorable balance between the energy invested by decomposers to obtain access to a substrate and the energy that is gained from its mineralization (Fontaine et al. 2007; Kleerebezem and Van Loosdrecht 2010; Rovira et al. 2008; Williams and Plante 2018). This concept is therefore dependent on the properties of a substrate (diversity and number of monomers and strength of bonds between them) that makes it either thermodynamically expensive or appealing for decomposers (Lashermes et al. 2014; Lashermes et al. 2016; Moorhead et al. 2013).

This cost/benefit ratio can be estimated by thermal analysis (Dufour et al. 2021; Williams and Plante 2018). The cost corresponds to the activation energy required to overcome the energy barrier for mineralization; this is evaluated by the stability of a compound subjected to a temperature increase. It can also be assessed using incubations that aim to quantify the substrate decay rate at different temperatures, in order to calculate the activation energy as a proxy of the energy cost (e.g., Leifeld and von Lützow 2014; Hemingway et al. 2019). The benefit corresponds to the energy released during mineralization of the compound (e.g., by respiration) and is measured by calorimetry. Several studies have reported that the energetic characteristics of persistent SOC are different from those of fresh SOC. Persistent SOC tends to be more thermally stable and its combustion generates less energy (e.g., Barré et al. 2016; Hemingway et al. 2019; Henneron et al. 2022; Plante et al. 2011; Plante et al. 2013). These observations suggest that SOC persists when the cost of its degradation is too high relative to the benefit to microorganisms. Some experimental (Barré et al. 2016; Cécillon et al. 2021) and theoretical (LaRowe and Van Cappellen 2011; Manzoni et al. 2012) studies have indicated that, under oxic conditions, reduced compounds provide more energy to microorganisms than oxidized compounds, and better promote microbial growth. These studies have also suggested that plant-derived compounds provide more energy than microbial compounds in well-aerated soils. In contrast, in anoxic environments, reduced compounds persist because their decomposition would require too much activation energy and their fermentation would yield too little energetic benefit (Boye et al. 2017; Keiluweit et al. 2016).

With respect to the interpretation of thermal data, though, it should be noted that thermal methods are integrative and take into account the total OM of the analyzed sample, whereas under real conditions, decomposers have access to only a small fraction of this OM (Leifeld and von Lützow 2014). Alternatively, energy investment can also be considered as the cost of exoenzyme production (Bosatta and Ågren 1999). In this context, analyses must consider the temporal dimension since microorganisms produce enzymes using previously acquired resources (Amin et al. 2014; Fontaine et al. 2007; Klotzbücher et al. 2011).

2.4.3 The energetic dimension of spatial inaccessibility

Spatial inaccessibility may be altered by changes in local environmental conditions. Such modifications can be the result of energetic investment by decomposers in the production of weathering agents (e.g., protons, siderophores), that desorb substrates associated with minerals and increase their accessibility (Henneron et al. 2022). The extent of this investment varies with the type of bond to be broken: more investment is required to break covalent bonds between OM and mineral surfaces than to break electrostatic bonds between oppositely charged organic and mineral ions (Kleber et al. 2015). With an increased understanding of microbe-driven desorption efforts, it may be possible to better integrate the concepts of spatial and energetic accessibility.

2.4.4 Nutrient issues, a third mechanism of inaccessibility limiting soil OM decomposition

Beyond the acquisition of C, the benefit of an organic substrate for microorganisms must also be evaluated in terms of acquisition of nutritive elements (e.g., N, P, Ca, Mn). Indeed, nutrients are essential for metabolite production and decomposer growth (Hemkemeyer et al. 2021; Monod 1949; Saadat et al. 2020). OM utilization is thus not just a matter of whether an energy balance is favorable or not for microorganisms, but also, whether the stoichiometric needs of microorganisms are satisfied (Kleerebezem and Van Loosdrecht 2010; Margida et al. 2020; Moorhead et al. 2012; Torn et al. 2005; Zechmeister-Boltenstern et al. 2015). Stabilized SOM has near-constant C:N:P ratios across ecosystems (Bertrand et al. 2019; Kirkby et al. 2013), which could imply that additional C storage would be necessarily associated with nutrient immobilization—a hidden cost of C persistence in SOM (Richardson et al. 2014) (see section 4.2).

Microbes are not completely powerless in the face of soil nutrient depletion; many are able to modulate their metabolism to better forage for nutrients (Bertrand et al. 2019; Recous et al. 2019). For example, microorganisms may produce enzymes to mineralize nutrient-containing SOM, such as nitrogenous OM. This selective mining leads to an acceleration (or priming) of SOM mineralization kinetics (Craine et al. 2007; Fontaine et al. 2011; Shahzad et al. 2015; Hicks et al. 2021). Microorganisms can also retrieve nutrients such as phosphorus, magnesium, calcium, or iron from mineral phases by producing weathering agents such as siderophores or organic acids (Bailey et al. 2020; Keiluweit et al. 2015; Uroz et al. 2009; Uroz et al. 2020). If these mechanisms are not efficient enough to meet the stoichiometric requirements of microbes, though, microbial activity decreases, limiting the decomposition of OM.

In conclusion, OM persistence in soil is never driven by purely spatial or purely energetic constrains. A holistic view of these processes must be adopted, with particular attention to the impacts of environmental parameters and practices on inaccessibility processes (Fig. 7).

Fig. 7
figure 7

The persistence of OM (whatever its nature: plant-derived, decomposer-derived, pyrogenic or dissolved) results from the conjunction of different mechanisms controlling OM utilization by decomposer. These mechanisms are linked to stoichiometric, energetic, or spatial constraints. The chemical properties of OM drive stoichiometric and energetic limitations. Enzyme production and mineral dissolution drive energetic limitation. Organo-mineral associations, soil properties, and bioturbation by soil fauna drive spatial limitation. Dotted arrows indicate links between constraints.

2.5 Reconciling competing theories of C persistence

In this first part, we have described how the apparent conflict between theories of soil C storage mechanisms is not actually rooted in any definitive antagonism (Figs. 5 and 7). We have shown that recalcitrance is not an all-or-nothing concept, but must be seen as a regulator of OM persistence. Recalcitrance could be defined for each major pedoclimatic context, using a quantitative index in standardized conditions. The debate on the relative importance of microbial and plant origins of persistent OM can also be resolved by considering pedoclimate properties, which control (i) the intensity of microbial activity and transformation of plant OM into microbial OM, and (ii) the presence of reactive mineral phases crucial for the preservation of thermodynamically labile microbial compounds. We have explained how POM could be an asset for C accumulation. Although its rapid turnover and smaller size (relative to MAOM) are detrimental for additional C storage, this effect may be offset by its rapid response to management practices, its potential non-limited size, its stoichiometry, and its ability to feed and promote C preservation in MAOM. Finally, we have clarified that, when environmental properties are considered, theories of SOM inaccessibility—spatial, energetic, or related to nutrient availability—are complementary rather than conflicting, and need to be considered together to explain C persistence in soil.

3 Integrating mechanisms into databases and models

Databases and models have important roles to play in the efficient synthesis and dissemination of knowledge on mechanisms involved in SOM storage as well as in the translation of scientific knowledge into recommendations for management practices beneficial to C storage. Nevertheless, caution must be exercised in the use of databases, and the scientific community is still divided on the relevance of increasing the number of processes described in models. We review here the benefits and pitfalls of (1) large databases and (2) process-based models, and conclude with a discussion of the good practices for their use in soil science.

3.1 Big data: opportunity or danger for research in soil science?

Data are produced or collected on a massive scale by researchers, practitioners (Billings et al. 2021), and citizens involved in participatory research. Big data, i.e., the analysis of massively acquired data, offers a complementary alternative to experimental research on mechanisms, which are necessarily limited by cost and/or complexity of advanced analytical techniques. We discuss here the opportunities opened by the increasing availability of soil-related data and underline critical points of caution before proposing a roadmap for big data deployment in soil science.

3.1.1 Opportunities of big data

Massive datasets represent a real opportunity to synthesize and make available to the community a wide range of knowledge obtained on large spatial scales (e.g., research observation networks, national or continental scale), as proposed, for example, by international databases on soil respiration or radiocarbon (Jian et al. 2021; Lawrence et al. 2020). The exploration of such massive datasets then serves as a means of generating new insights from multiple data aggregation. For instance, through archiving recently acquired data and digitizing old data, it may be possible to monitor the evolution of soil C stocks and then gain insight into the mechanisms involved.

Another strength of big data is that it can massively combine data on the mechanisms controlling C storage in soil with georeferenced environmental and even socioeconomic data (e.g., pedological, biological, geological, geomorphological, ecological, hydrological, climatic, agronomic, forestry, economic and historical data), thus promoting interdisciplinarity. It can increase our knowledge of mechanisms of soil C storage (Cécillon et al. 2015; Vestergaard et al. 2017) by (1) providing evidence for the genericity of certain processes, or, alternatively identifying pedoclimatic contexts that are characterized by distinct processes, and, by (2) ranking the importance or relevance of soil C storage mechanisms among different types of pedoclimate or plant cover. Statistical methods such as path analyses can be used to infer dependence and causality between environmental variables and soil storage mechanisms (Cotrufo et al. 2019; Lange et al. 2015). In addition, machine learning models can be constructed to directly predict the intensity of soil C storage mechanisms from environmental metadata readily available from online databases (Cotrufo et al. 2019; Sanderman et al. 2017). Machine learning models can also be developed using massive data obtained by applying simple and fast analytical techniques, such as near infra-red spectroscopy, to a large number of samples to predict OM characteristics (e.g., Dangal et al. 2019; Viscarra Rossel et al. 2019). Such characteristics are classically measured by more precise but complex or expensive analytical techniques, which are not accessible to all scientists. The extent of the data collected compensates for the lack of sensitivity of these massively deployed simple methods and enables detection of significant trends.

3.1.2 Points of caution associated with big data

With the emergence of big data, soil science and biogeochemistry are facing a profusion of data, which are sometimes inconsistent and may have few quality controls. Moreover, the skills to properly process and interpret such data are scarce. The danger is that the noise associated with inconsistent or non-validated data may prevent the detection of real trends. The deployment and use of big data must therefore be guided by a number of warnings from the scientific community. The pitfalls to be avoided when exploiting massive databases are (i) producing only obvious trends that have been understood for a long time (e.g., soil OM stock is related to clay content); (ii) seeking to detect only global trends, whereas situations outside the global trend may be of particular interest (e.g., risk of overlooking outliers such as a sandy soil with a lot of C, which could provide new information); (iii) aggregating and comparing data that are not comparable because they were obtained using different methods (e.g., soil organic carbon stocks obtained by wet chemistry, dry combustion, or loss on ignition are not equivalent—Tivet et al. 2012); and (iv) inferring irrelevant mechanisms or causality from large datasets (e.g., through misused machine learning without critical evaluation—Wadoux et al. 2020).

With the use of big data, there is also an increase in the distance between the people who use the data and those who carry out the soil sampling, data acquisition, and archiving in databases. Within the field of soil biogeochemistry, we must be careful not to deploy big data at the expense of soil science and metrology, which are and should remain essential pillars of the discipline. Big data also represents a risk for data ownership and access. To avoid uncontrolled exploitation of data, partnerships may be necessary, in particular with private companies.

3.1.3 Roadmap for the deployment of big data in soil science

Our community can contribute to several important projects on the implementation and rigorous use of big data. The first relates to the harmonization of data on C storage mechanisms (nature, unit, etc.). For this purpose, the use of international ISO metrological standards is essential when collecting data. Our community can also contribute to the development of new standards (Bispo et al. 2017) (e.g., European INSPIRE Directive; constitution of metadata catalogs; application of interoperability rules; FAIR data principles (Findable, Accessible, Interoperable, Reusable)). Finally, it is necessary to make the database frameworks compatible with future scientific advances to make sure that it will be possible to integrate novel data generated by innovative methodologies (e.g., "-omics") and associate them with new standards.

3.2 Is it necessary to integrate novel mechanistic knowledge in models of SOM dynamics?

A model is a simplified representation of reality. It is, however, often very useful for three main purposes in the field of SOC storage: (i) improving scientific knowledge on soil functioning, (ii) providing predictions of C storage in soil, and (iii) supporting policy-makers and practitioners for practices and management choices. A model can be built either on the basis of observations that are reproduced using a reduced number of equations and parameters (data-driven models, also called empirical, statistical or phenomenological models) or by gathering knowledge or hypotheses on soil functioning and translating each process into an equation, which leads to a much larger number of equations and parameters in the model (process-based models, sometimes called mechanistic models). There is also a vast range of “pseudo-mechanistic” models that are intermediate between data-driven and process-based models (see examples below) (Fig. 8). Technical advances in recent years have led to a considerable body of new knowledge on the factors controlling SOM dynamics, particularly regarding the functional diversity of decomposers and the importance of the spatial organization of the soil matrix (Blankinship et al. 2018; Kleber et al. 2021; Lehmann et al. 2020; Miyauchi et al. 2020; Zhang et al. 2016). Will the integration of this new mechanistic knowledge into models improve the accuracy of their projections? In an attempt to answer this question, we first survey the benefits and limitations of process-based models and then discuss how the expectations of the model’s end user should be considered in determining how precisely processes should be described in models.

Fig. 8
figure 8

A model’s purpose must be the foremost consideration in its design and the number of equations it includes. There is a continuum of models from data-driven models which include few equations on the one hand to process-based models, which rely on numerous mechanistic equations on the other, and pseudo-mechanistic models that are intermediate between the two end-members.

3.2.1 Benefits and limits of process-based models

A detailed description of processes in models can be very useful when the interactions among several processes and/or actors lead to complex dynamics that cannot be analyzed using the usual human deductive reasoning (Waring et al. 2020). Process-based models allow knowledge to be aggregated and simulations to be generated for scenarios, including changing environmental conditions. However, increasing the number of processes described in a model also increases the number of parameters as well as the uncertainty associated with predictions (Shi et al. 2018). Moreover, the acquisition of some of the data needed for parameter calibration and model validation might be experimentally difficult (Allison 2012; Blankinship et al. 2018; Lehmann et al. 2020). For all these reasons, it is very important to respect the principle of parsimony when choosing the mechanisms to be included in a model (Shi et al. 2018).

For models implemented at regional, national, or global spatial scales, the description of numerous mechanisms may require computational resources that are expensive or simply unavailable. In such situations, one solution may be to apply mechanistic models that analyze a limited number of carefully selected soils representative of typical pedoclimatic conditions in order to identify the emerging factors controlling OM dynamics. These determining factors are then introduced into simpler models deployed on a large scale for the same type of sites (Lehmann et al. 2020). Another way to circumvent the problems of process-based models is to create a “model of the model,” a so-called metamodel (Garcet et al. 2006). The metamodel is built by running the initial model in a wide range of conditions and by applying statistical approaches to the simulated outputs to determine new equations that require fewer parameters and a shorter computation time. Such metamodels are, however, more rigid than the original process-based model and may not give relevant predictions in complex, fast-changing environments.

It must also be kept in mind that models reflect the knowledge available at a given time. Models do not report an absolute truth and are likely to evolve with the progress of knowledge but also with the changing expectations of societies. Furthermore, the notion of a “mechanism” is itself complex and dependent on the scale at which the researcher is investigating. For example, a soil scientist identifies mechanisms of ecosystem functioning through observations that are in a large part only the macroscopic manifestation of processes taking place at much finer scales (e.g., molecular interactions, quantum physics) (Allison 2012). These limitations suggest that, despite the considerable amount of progress that has been made, current knowledge of the mechanisms contributing to ecosystem functioning remains incomplete.

3.2.2 Toward models adapted to users' expectations

The degree of mechanism that should be incorporated into a model depends on the modeler’s priorities among the three objectives previously defined (i.e., better understanding, scientific predictions, transfer to stakeholders) (Fig. 8):

  1. 1)

    To understand the functioning of a system, process-based (mechanistic) models may represent the best option. Playing with such models (e.g., “toy models” that have been designed to represent a theory or a mechanism in a simplified way) facilitates the identification of the roles of drivers and their interactions. Process-based models have led to significant advances in the understanding of biogeochemical cycles and supported the design of a new generation of experimental work (Barot et al. 2014; Daufresne and Loreau 2001; Moorhead et al. 2012; Perveen et al. 2014; Sainte-Marie et al. 2021; Sulman et al. 2017).

  2. 2)

    To predict alteration in ecosystem processes such as primary production and C storage and to share the results with policy-makers, the use of complex process-based models must be performed carefully. For short timescales (a few years to decades), data-driven models or pseudo-mechanistic models—such as RothC (Coleman and Jenkinson 1996), or AMG (e.g., Clivot et al. 2019; Levavasseur et al. 2020)—are probably the most effective tools. By incorporating only a few parameters and relying on a very large number of past observations, such models guarantee solid predictions in the short term in an environment that does not change much. In unstable conditions, however, the predictions of data-driven models could become questionable (e.g., Waring et al. 2020). When the environment evolves outside the framework for which the parameters were determined, these simple models may no longer be applicable (e.g., Georgiou et al. 2017). By explicitly representing the important processes in a system, mechanistic and pseudo-mechanistic models are more suitable for describing transient phenomena and the transition towards new equilibria (Finke et al. 2019; Keyvanshokouhi et al. 2019), and increase confidence in model predictions (Bradford et al. 2016). This is the strategy implemented in Earth System Models utilized for projections by the Intergovernmental Panel on Climate Change (IPCC) (e.g., Guenet et al. 2018). Moreover, to provide predictions to public policy decision-makers, it may be relevant to use several models based on different assumptions (Wieder et al. 2018). The range of simulations produced by the so-called ensemble modeling method provides uncertainty associated with the estimates (Farina et al. 2021; Shi et al. 2018; Sulman et al. 2018).

  3. 3)

    To integrate and transfer knowledge to stakeholders (students, citizens, politicians, practitioners and policymakers), models describing ecosystem functioning must also include economic and sociological components (e.g., multi-agent models—Bousquet and Le Page 2004). In this way, they can highlight the advantages, disadvantages, and trade-offs of various practices and management scenarios for human society (e.g., Pellerin et al. 2019). These decision-support tools are necessarily very integrative, but this integration comes at the cost of a simplification of the ecosystem mechanisms description. These models must be co-constructed with stakeholders to meet the needs of decision-makers, and with an interdisciplinary scientific community to better objectify the choices to be made for a simplified representation of ecosystems. The scientific community must also estimate the uncertainties associated with the predictions when possible. Finally, particular attention must be paid to the ergonomics of the modeling tool, to ensure that it has an intuitive interface that is easy to use by non-specialists (e.g., serious game, Jouan et al. 2020).

3.3 Databases and models: summary of benefits and pitfalls

We have shown here that big data and models offer a huge potential to improve our understanding of soil functioning, predict C dynamics, communicate and transfer scientific knowledge to stakeholders. Both tools must be used with caution, bearing in mind the benefits and limitations. Scientists must stay alert when developing and using these resources to avoid the pitfalls of database exploration and to optimize the robustness of model prediction. Because big data in soil science is still in its infancy, proper practices can contribute to its successful expansion and implementation. An increase in the number of processes described in a model framework must be justified by the model purpose, as a large number of parameters may be beneficial to knowledge production but detrimental for model use by non-experts. Our community should also favor collaborative exchanges between modelers and researchers who contribute to the production of experimental knowledge on processes in order to support model progress in a direction that is useful to stakeholders and to all scientists, whether experimentalists or modelers.

4 Debates on the effects of harvest and management practices: how can research support practitioner and policy-maker decisions?

While research on C storage in soil focuses on how biomass-derived SOM can contribute to climate change mitigation, there is at the same time an increasing demand for biomass harvest, which decreases C input to soil and modifies C stock. Indeed, plant biomass is also central to the global challenges of food security and fossil fuel dependency that need to be addressed simultaneously. In addition, although current knowledge on beneficial management practices has been summarized in several recent reviews (e.g., Amelung et al. 2020; Chenu et al. 2019), the rational mechanisms behind the observed benefits are still poorly understood or intensively debated, and the possible unsuspected interplay between processes may impair the expected effects.

In this section, we provide recommendations for research actions to better support stakeholders’ decisions regarding harvest and management practices, which are still highly debated for their actual contribution to climate change mitigation. First, we address the issue of biomass harvest by analyzing the trade-offs between directing plant residues to the soil and using them for food, energy, or materials. Then, we analyze the areas of contention related to agricultural and forestry management practices recommended for soil C accumulation.

4.1 How can acceptable trade-offs be designed regarding the use of plant biomass?

In managed agrosystems and forests, most of the aboveground plant biomass is harvested by humans to produce (i) food, (ii) biosourced energy or (iii) biomaterials and biomolecules (Fig. 9). The above-ground biomass that is not exported contributes to the transient storage of C in living standing biomass and feeds SOM (Fig. 9). Throughout history, biomass harvest has progressively spread over larger surfaces and its intensity has increased. This continuous and increasing harvest of biomass has led to a decrease in soil C stocks compared to a hypothetical world without agriculture (Sanderman et al. 2017).

Fig. 9
figure 9

Biomass fluxes among ecosystems and the bioeconomy: exports to food, bioenergy, biomaterial, and biomolecule sectors, return to the soil or standing.

The current demand for biomass is reaching a critical point. Food demand is projected to increase by 50% between 2012 and 2050 (The Future of Food and Agriculture—Trends and Challenges 2017), as a consequence of population growth. The biomass use for the production of bio-based energy, biomaterials or biomolecules is also increasing with the aim of substituting fossil C (Favero et al. 2020). To meet the needs of the bioeconomy, either the level of biomass harvesting must be increased or the amount of land devoted to agriculture or forestry must be expanded.

When designing trade-offs between exporting biomass or keeping it in the ecosystem, it is important to first take into account the expected duration of C storage. The storage duration in biomaterials is expected to be less than or equal to the one in soil. Indeed, the mean age of soil C is 100 years at 20 cm depth in the tropics (Balesdent et al. 2018), compared to a lifespan of 50 years for wood timber and 4 years for paper (Valade et al. 2018). In the soil C storage, duration can be increased by transforming biomass (e.g., into compost, biochar, methanization residues) before applying it to soil. A second important consideration is that the harvested biomass must have indeed substituted fossil C (Sathre and O’Connor 2010; Amelung et al. 2020), which could be confirmed by life cycle analysis that integrates the C emissions of intensified management, biomass conversion yields into energy or finished products, and off-farm transformation, among other factors (e.g., Valade et al. 2018). From this perspective, the C balance would obviously be more oriented towards C-saving for a territory capable of transforming locally produced biomass into locally consumed energy than for an area importing biomass and exporting the energy produced. Finally, soil fertility and production capacity should be carefully addressed, since biomass export affects not only C stock but also nutrient stocks (Achat et al. 2015; Durante et al. 2019; Legout et al. 2020). Under low fertility conditions, practices that favor the return of harvest residues to the soil should be encouraged. These practices will also participate to the preservation of other ecosystem functions (e.g., water quality and quantity, biodiversity) (Baveye et al. 2020; Hoffland et al. 2020).

4.2 Priorities for research on SOM mechanisms to support the implementation of practices beneficial to C accumulation in soil

Currently certain agricultural and forestry practices are recommended for promoting OM accumulation in soil (Chenu et al. 2019; Dynarski et al. 2020; Lugato et al. 2021; Schlesinger 2022). However, their benefits depend on soil characteristics (Amelung et al. 2020), and their net effect on C stock in the long-time has not yet been analyzed. It is possible that these practices may ultimately have some negative repercussions on climate mitigation or soil quality. Unfortunately, we cannot afford to wait until long-term studies have been performed: immediate action is needed to maximize the potential benefits of soils for climate change as soon as possible (Chabbi et al. 2017).

From a mechanistic knowledge basis, Table 1 summarizes the potential positive and negative effects induced by recommended management practices, categorized according to their targeted action: (1) increase in C input; (2) increase in N input; (3) enhancement of soil life, and (4) increase in spatial inaccessibility.

Table 1 Four categories of practices (see also the reviews by Dignac et al. 2017; Chenu et al. 2019; Amelung et al. 2020) with expected positive effects on C storage in soil are distinguished according to the mechanisms targeted: C input (practices that increase the input of C to soil); N input (practices that increase the input of N to soil); biology (practices that aim to maintain microbiological and biological life in soil); inaccessibility (practices that impede decomposers access to organic matter). Expected positive effects and potential negative effects of these practices.

The scientific community should make a particular effort to help quantify, with a site-specific approach, the possible adverse effects of the practices listed in Table 1, making use of long-term ecosystem research trials, database exploration, and modeling tools (see Section 3). Given the current challenges of climate change mitigation, food security, and bioeconomy growth, we consider that the practices favoring (i) soil life and the efficient nutrient cycling in the ecosystem and (ii) C input to the soil may be recommended more confidently than others (Fig. 10).

Fig. 10
figure 10

Synthesis of all the topics related to the increase of soil organic C storage that are addressed in this article. Management practices intended to promote C storage in a context of global constraints can be guided by results, theories, models, and debates arising from research, thanks to better communication between researchers and stakeholders. These practices aim to either increase C inputs or decrease C losses (see also Table 1).

With respect to practices that favor soil life, the enhanced formation of biogenic structures protecting OM and the enhanced production of microbial compounds with a strong affinity for protecting mineral surfaces in soil (see Section 2.2) are expected to counterbalance the higher respiration also induced by an enhanced soil life (Liang et al. 2017, 2019) (Table 1). With respect to practices that enhance C input to soil, a recent point of contention is that soil C accumulation would require an additional immobilization of nutrients to meet the stoichiometric constraints of SOM. Given a C/N ratio of 12:15 in SOM, 100 Tg of additional N would be needed per year to reach the 4 per mil program objective (Schlesinger 2022; Soussana et al. 2019; van Groenigen et al. 2017). They could be provided by the application of fertilizers (about 210 tons of atmospheric N are currently fixed per year by the fertilizer industry—Fowler et al. 2013). But this measure would hardly be compatible with the agenda for C neutrality. Additional N needs could also be met through the adoption of agro-environmental practices limiting nutrient leaching in soil, preventing soil erosion, and/or enhancing N biological fixation (current estimates of annual biological fixation in terrestrial ecosystems range from 52 to 120 tons of N—Davies-Barnard and Friedlingstein 2020).

In any case, the implementation of practices favoring soil life and C input to the soil should be considered “win-win” strategies that promote the storage of C, nitrogen, and other nutrients in the form of OM and a better coupling of biogeochemical cycles (Drinkwater and Snapp 2007; Janzen 2005, 2006; Hufnagl-Eichiner et al. 2011). It is of utmost importance, though, that C storage in soil does not occur at the expense of nutrients provision to plants, fauna, and microorganisms.

5 Conclusions and perspectives

By adopting an original approach based on the analysis of controversies on mechanisms of soil C storage, we demonstrate here that theories and concepts that appear at first glance to be antagonistic may in reality be quite complementary. Different theories can often be reconciled by considering variability in pedoclimatic properties, different decomposer functionalities, and changing environmental conditions. Future technical innovations will undoubtedly foster further debate and promote the development of novel emerging theories (Fig. 5).

Databases and models have an important role to play in the synthesis and dissemination of knowledge on mechanisms of SOM storage as well as in the translation of this knowledge into recommendations for practices beneficial to C storage. Nevertheless, the use of databases by the scientific community must be carried out cautiously, and the level of mechanistic description in models must be appropriate for the end user's goals and expectations.

While it is certainly true that much remains to be learned about mechanisms of C storage, their interactions, their relative importance in different environments, and how they can be integrated into databases and models, it is already possible to recommend and provide guidance on a certain number of management practices that have positive effects on soil C storage and for many ecosystem services (Fig. 10). When advising on beneficial practices, though, the scientific community must make it clear that there is no such thing as the absolute, definitive storage of C in soil (Dynarski et al. 2020). A given practice may lead to C storage for a certain period of time only. Moreover, this depends on the ecosystem’s characteristics and whether the practice can be maintained in the context of environmental, social or economic constraints. When choosing a practice, practitioners must be aware of the need to pursue it over the long term. The minimum storage timeframe considered in public policies, such as the Green Deal and C neutrality, is on the order of 20–30 years. The lack of quantitative knowledge on the impacts of recommended practices over time calls for long-term experiments over several decades in a wide variety of pedoclimatic situations, to simulate both the range of current conditions and those that we can expect with global warming, especially extreme climate events.

To conclude, it is critical that scientists are aware of existing debates in order to provide stakeholders with nuanced insights. They must ensure that a clear and consistent vocabulary is used and that the definitions and concepts that are debated within our community are explained and clarified for a wider audience, as done in this paper. Concurrently, stakeholders need to be aware of the need to interact with researchers as science evolves. To improve the translation of novel research findings into advice to policy-makers and practitioners, an improved dialogue between researchers and stakeholders (Fig. 10) is critical. This dialogue could be strengthened at all stages of research projects, from their construction—which could better involve stakeholders and more directly address questions from the field (Raous et al. 2020)—to the communication of research results in accessible media. For both communities, scientific networks that bring together researchers and stakeholders, such as CarboSMS in France, have an important role to play.