1 Introduction

Pollution, climate change, waste accumulation, and depletion of natural resources have increased at an alarming rate as a result of human activity [1]. Biorefineries, which produce biofuels and chemicals from lignocellulosic biomass, may address some of these challenges. Biorefinery processes might consist of five main unit operations: pretreatment, enzymatic hydrolysis, fermentation — combined or not with hydrolysis — product recovery, and wastewater purification [2]. All unit operations have to be adapted to the desired product and the available feedstock. In order to adapt the process to the feedstock and design the process steps, it is important to add structural knowledge on the different lignocellulosic biomasses. In Sweden, almost 70% of the land is forest, and the 83% of it is coniferous composed by 40% spruce, making spruce a viable feedstock for biorefineries. Spruce is composed of cellulose (45–47% w/w DM [3]), hemicelluloses (up to 15% w/w dry mass (DM) arabinoglucoronoxylan and up to 30% w/w DM galactoglucomannan [4]), and lignin (27–28% w/w DM [3]).

Low enzymatic hydrolysis efficiencies of softwood represent one of the main hurdles to industrial-scale biorefinery applications. Enzymatic hydrolysis of lignocelluloses is affected by substrate- and enzyme-related parameters. Enzyme-related parameters have been studied extensively, resulting in a relatively good understanding [5]. Apart from direct substrate features, such as the S/G ratio of lignin [6], substrate-related parameters are mainly associated with the accessibility of cellulose to cellulolytic enzymes [5]. Accessibility can be restricted by the hemicellulose and lignin matrix, in which cellulose fibers are embedded [7]. Additionally, hemicellulose and lignin can be covalently linked to form lignin-carbohydrate complexes (LCCs), further increasing substrate recalcitrance [8,9,10]. Another key role is played by cellulose ultrastructure (i.e., degree of organization on micro-, macro-, and fiber level) [5, 11,12,13]. A direct and quantitative link between a lower degree of organization (i.e., higher accessibility) and increased hydrolyzability has been established recently [14]. However, it is necessary to obtain a detailed understanding of substrate ultrastructure and how it affects the enzymatic hydrolyzability in non-model substrates that contain not only cellulose, but also hemicellulose and/or lignin [14].

Collectively, these substrate-related limitations require a pretreatment step to obtain sugar release by enzymatic hydrolysis. Steam pretreatment (STEX) offers a mature and flexible technology in this respect [15, 16], but necessitates the addition of acids to compensate for the low amount of organic acids in softwood [16,17,18]. Because spruce has acetyl groups only on the galactoglucomannan [4] — 1 acetyl group only every 3/4 hexose units [19] — up to 4% w/w of SO2 or H2SO4 have to be added to obtain high saccharification yields [17, 20].

The main disadvantage of harsh STEX conditions is the degradation of hemicellulosic carbohydrates and the formation of secondary degradation compounds, which may inhibit microbial fermentation [17, 20, 21] and diminish overall yields of the process. To develop a pretreatment process with high carbohydrate retention and good saccharification efficiency during enzymatic hydrolysis, better understanding of the impact of pretreatment on enzyme accessibility and biomass structure is of outmost importance [15, 16].

Therefore, the aim of the present study was to analyze the impact of steam pretreatment severity on composition, ultrastructure, and enzymatic hydrolyzability of Norway spruce. To address this objective, the chemical and supramolecular properties of spruce following steam pretreatment at six different severities were determined and correlated with the enzymatic hydrolysis efficiency. A combined severity factor (CSF) summarizing the different operating parameters (temperature, residence time, and pH) into a single value [22], allows for a comparison of different pretreatment conditions. Steam pretreatment conditions were defined by two different temperatures (180 °C and 210 °C), with/without the addition of various acids (CH3COOH, H3PO4, H2SO4, SO2). Overall structural properties were assessed by scanning electron microscopy (SEM). The effect of STEX severity on lignin and lignin-carbohydrate bonds was investigated by two-dimensional heteronuclear single quantum correlation nuclear magnetic resonance (2D HSQC-NMR). Finally, cellulose ultrastructure was studied by applying small/wide-angle X-ray scattering (SAXS/WAXS). The novelty of this work is the connection of different advanced structural characterization with the enzymatic hydrolyzability using six different steam pretreated spruce. The results contribute to add structural knowledge of mild steam pretreated spruce that is important to compromise between enzymes accessibility and hemicellulose retention in the solid fraction.

2 Materials and methods

2.1 Raw material and steam pretreatment

Milled Norway spruce (Picea abies) chips were kindly provided by the Widtskövle saw mill. Prior to pretreatment, the wood chips were reduced in size using a laboratory knife-mill (SK1; Retsch GmbH, Haan, Germany) and a 20-mm screen.

The milled spruce was pretreated by STEX then washed, pressed, and used for the hydrolysis experiment. The pretreatment was performed as previously described [18] using six different conditions as detailed in Table 1. One batch of raw material was used for pretreatment by autocatalysis and SO2; while a second batch was used for the remaining conditions. The raw material had a DM content of 55% w/w and 2 × 750 g of material were loaded in a preheated 10-L reactor for each condition. Pretreatment with a catalyst involved an overnight pre-incubation with acid. In the case of sulfur dioxide, the raw material was incubated for 20 min with 3% (w/w) SO2 inside a plastic bag and then exposed to air for 30 min before loading in the reactor.

Table 1 Summary of the conditions applied for pretreatment of Norway spruce biomass and combined severity factor

After STEX, the liquid and solid fractions were separated using a hydraulic filter press at 100 bar (HP5M; Fischer Maschinenfabrik, Neuss, Germany). The solid fraction was washed using 10-L warm deionized water per kg of material, stirred for 1 h, and filter-pressed. The pretreated materials were not further processed to perform the enzymatic hydrolysis. The pH of the liquid fraction was measured to calculate the CSF for each pretreatment condition (Table 1) as detailed in Sect. 2.7.

2.2 Compositional analysis

The carbohydrate composition of raw and pretreated materials was analyzed following the National Renewable Energy Laboratory/TP-510–42,618 protocol. In the case of the raw material used for pretreatment by autocatalysis and SO2, the analysis was performed on non-extracted material; for the raw material used for the other conditions, extracted material was used for the analysis. The difference in sample preparation of the raw material led to an overestimation of the acid insoluble lignin in the raw material not extracted (Table 2A) [49]. Monomeric sugar composition was analyzed in triplicate for each material as described in Sect. 2.7.

Table 2 Compositional analysis of lignocellulosic compounds in spruce before and after pretreatment. Data represent mean values ± standard deviation. Analyses were performed in triplicate. The glucan amount corresponds to the amount of glucose from cellulose and from galactoglucomannan

B

Raw material

(%w/w DM)

STEX210°C/HAc (%w/w DM)

STEX210°C/H3PO4

(%w/w DM)

STEX210°C/H2SO4

(%w/w DM)

Glucan

45.5 ± 0.9

60.5 ± 0.3

61.3 ± 0.4

61.0 ± 0.1

Xylan

5.7 ± 0.1

1.7 ± 0.0

1.3 ± 0.0

0.7 ± 0.0

Arabinan

0.7 ± 0.3

0.0 ± 0.0

0.0 ± 0.0

0.0 ± 0.0

Galactan

1.7 ± 0.1

0.1 ± 0.0

0.1 ± 0.0

0.1 ± 0.0

Mannan

13.2 ± 0.1

1.2 ± 0.0

0.9 ± 0.0

0.6 ± 0.0

ASL

3.7 ± 0.1

2.9 ± 0.1

2.8 ± 0.1

2.7 ± 0.1

AIL

26.4 ± 0.2

36.0 ± 0.1

36.9 ± 0.1

38.2 ± 0.4

Ash

BDL

BDL

BDL

BDL

Water extractives

3.8 ± 1.5

-

-

-

Ethanol extractives

0.8 ± 0.1

-

-

-

Recovery

102 ± 3

103 ± 1

103 ± 1

103 ± 1

  1. AIL, acid-insoluble lignin; ASL, acid-soluble lignin; BDL, below detection limit

2.3 SEM analysis

Biomass samples were studied by SEM using a FEI Quanta 200 instrument (FEI Company, Eindhoven, The Netherlands). A small amount of material was fixed on conductive polycarbonate stickers with admixed graphite (G3347; Plano GmbH, Wetzlar, Germany) without any coating or drying of the material. The electron microscope was operated at an accelerating voltage of 10 kV in a low-vacuum mode.

2.4 Milled wood lignin extraction and semi-quantification of linkages

First, the pretreated material was dried at 45 °C for 48 h, milled in a laboratory SK1 knife-mill using a 0.5-mm screen, and dried for another 24 h at 45 °C. The dry coarse powder was then ball-milled for 5 days in air-tight 1-L steel containers, filled with 20 g of material and 32 stainless steel balls (10 mm in diameter), at a speed of 35 rpm. Next, 3 g of ball-milled material was weighed in extraction thimbles (26 × 60 mm; Whatman, Maidstone, England), these were transferred to 100-mL blue-cap bottles, and 50 mL of 96 wt% 1,4-dioxane (diluted from 99.8% stock; Sigma-Aldrich Co., St. Louis, MO, USA) was added. The bottle was closed and placed in an 80 °C water bath for 16 h, after which the liquid fraction was removed, another 50 mL of solvent was added, and the procedure was repeated for 16 h. The solvent from both extraction steps was combined and evaporated (Multivapor P-6; BÜCHI Labortechnik AG, Flawil, Switzerland) at 55 °C and 60 mbar overnight. The remaining MWL solid was placed in an oven at 45 °C for 48 h to fully dry.

For 2D HSQC-NMR analysis, 100 to 150 mg of dry MWL was dissolved in 600 μL deuterated DMSO-d6. NMR spectra were recorded at room temperature using a Bruker Avance III HD 500 MHz spectrometer (Bruker BioSpin GmbH, Karlsruhe, Germany) equipped with a 5-mm broadband probe and Z-gradient coil. The “hsqcedetgpsisp2.3” pulse program was used for HSQC at the following acquisition settings: 80 scans, 0.12 s acquisition time, 2.0 s relaxation delay, 9.5 μs pulse width, 1199 to 390 size of free induction decay, and spectral width of 19.99 and 190.20 ppm. HMBC experiments were performed with a long-range coupling constant of 6 Hz [8]. NMR spectra were processed with MestReNova 12 (Mestrelab Research S.L., Santiago de Compostela, Spain) using a 90° shifted square sine-bell apodization window, and baseline and phase correction in both dimensions. The peaks were referenced to the central DMSO peak (δCH = 39.5/2.5 ppm), identities were assigned according to literature [24, 27], and the unsubstituted carbon 2 of the aromatic group on lignin was used as an internal reference for semi-quantification [25].

2.5 SAXS/WAXS

SAXS and WAXS measurements were performed using Mat:Nordic from SAXSLAB (Copenhagen, Denmark). The instrument was equipped with a Rigaku 003 + high brilliance microfocus Cu radiation source (Rigaku, Tokyo, Japan) and a Pilatus 300 K detector (Dectris, Baden, Switzerland). The X-ray wavelength was 1.54 Å−1, and the measured q-range was 0.007–0.25 Å−1 for SAXS, 0.07–2.7 Å−1 for WAXS, and 2.5–3.5 Å−1 for the secondary Pilatus 100 K detector. The q-axis was calibrated using a silver behenate sample before every new set of measurements. Wet samples were sealed in sandwich cells covered with mica windows and measured at room temperature. The measurements were performed on three different spots of a sample piece. The 2D scattering patterns were radially averaged using SAXSGui software. Background subtraction and peak deconvolution for WAXS analysis was performed using Fityk software. The diffractograms were deconvoluted using Gaussian functions. In addition, cells containing Milli-Q water and soluble spruce hemicellulose (galactoglucomannan) samples extracted from spent-sulfite liquor were measured to model the amorphous peaks more accurately using two or three Gaussian functions. The diffractograms of galactoglucomannan and water are reported in the Fig. S3. The detailed chemical composition of hemicellulose samples is presented elsewhere [39]. The crystallinity index was calculated according to Eq. (11):

$$\mathrm{Crystallinity\;index}=1-\frac{{\mathrm{I}}_{\mathrm{amorphous}}}{{\mathrm{I}}_{200}}$$
(1)

where \({I}_{\mathrm{amorphous}}\) and \({I}_{002}\) are the intensity of the amorphous peak (~ 19°) and the 200 reflection (~ 22.4°) [40], respectively. The Scherrer Eq. (2) was used to estimate the apparent crystallite size (L):

$$L=\frac{K\bullet \lambda }{\beta \bullet cos\Theta }$$
(2)

where K, \(\lambda , \beta\), and \(\Theta\) represent a constant of 0.94, the X-ray wavelength (1.542 Å), the half-height width of a diffraction peak, and the Bragg angle corresponding to the 200 crystalline plane, respectively. The positions of the peaks were first fixed to published values for 101 (15°), 10î (16.5°), 102 (20.4°), and 200 (22.4°) reflections and were fitted once the appropriate match was obtained. The 004 peak was not used due to poor data quality. The obtained model parameters are reported in the Table S1.

One-dimensional SAXS curves were plotted as Kratky plots (q vs. Iq2) to enhance certain features. The maximum of the peak (qmax) observed in Kratky plots at around 0.11 Å−1 is often used to estimate microfibril-to-microfibril distances [34, 35]. The peak maxima were determined by fitting with Gaussian functions. The dimensions (d) were then calculated using Eq. (3):

$$d=\frac{2\pi }{{q}_{\mathrm{max}}}$$
(3)

2.6 Enzymatic hydrolysis

Two enzyme preparations, Celluclast® 1.5 L and Novozym 188 (Novozymes, Lyngby, Denmark), were used for enzymatic hydrolysis. The enzymes were loaded based on their activity. Using the filter paper unit (FPU) assay [41] with some adjustments [42], the cellulolytic activity in Celluclast® was determined to be 50 FPU mL−1. The β-glucosidase activity of Novozym 188 was determined as described previously [43]. Briefly, 50 μL of diluted enzyme solution was mixed with 50 μL of 2.0 mM p-nitrophenyl-β-d-glucopyranoside (in 50 mM sodium acetate buffer, pH 5.0) and incubated at 50 °C for 10 min. The reaction was stopped with 100 μL of 1 M Na2CO3 and the activity was determined to be 20 U mL−1.

The enzymatic hydrolysis experiments were performed in triplicates, in 2-mL screw-cap tubes, and for a total reaction weight of 1.8 g. The loaded substrate (2% w/w of DM) was suspended in 0.15 M acetate buffer pH 5. The reaction mixture was autoclaved at 121 °C for 20 min, after which the sterile filtrated enzyme mixtures were added aseptically at 10 FPU g−1 DM for Celluclast® 1.5 L and 10 U g−1 DM for Novozym 188. The reactions were carried out at 40 °C for 48 h in an oven with a constant end-over-end rotation (Big S.H.O.T III™; Boekel Scientific, Feasterville, PA, USA) at 25 rpm. To terminate the reaction, the mixture was boiled for 10 min. The samples were then either stored at − 20 °C or processed directly as detailed in Sect. 2.7.

2.7 Analytical methods and data processing

DM content was measured in triplicate as dry weight at 105 °C [44].

The release of glucose, mannose, and xylose was measured by isocratic high-performance anion-exchange chromatography with pulsed amperometric detection (ICS-5000; Dionex, Sunnyvale, CA, USA) as described elsewhere [45]. The samples were centrifuged for 5 min at 13,000 rpm, and the supernatants were filtered through 0.22 μm syringe-driven filters before storage at 4 °C until analysis.

The CSF, correlating the effects of temperature, residence time, and medium acidity with the effectiveness of biomass pretreatment [46], was calculated as explained elsewhere [47]. The conversion of polymers to soluble monomeric sugars after enzymatic hydrolysis at low solids loading was calculated by Eq. (4) [48]:

$${Y}_{g }= \frac{{C}_{g}-{C}_{g0}}{{\mathrm{\varphi }}_{G}{C}_{is0 }{X}_{G0}}$$
(4)

where Yg is the conversion (expressed as %) of cellulose to glucose (taken as example), Cg (g L−1) is the concentration of solubilized glucose in the sample supernatant, Cg0 (g L−1) is the initial glucose concentration, φG is the molecular weight ratio of glucose-to-glucan monomer (φG = 180/162 = 1.11), Cis0 (g L−1) is the initial concentration of insoluble solids, and xG0 is the initial mass fraction of glucan in insoluble solids.

3 Results

3.1 Compositional characterization of mildly pretreated spruce biomass

The chemical composition of differently pretreated spruce biomass was determined. The effect of temperature and addition of acid catalysts, including acetic acid (HAc), phosphoric acid, sulfuric acid, and sulfur dioxide, was investigated using two different batches of raw material (Table 2). The CSF increased from 0.66 for STEX180°C/auto to 3.1 for STEX210°C/SO2, concomitantly with a decrease in pH from 3.9 to 1.5 (Table 1).

The main difference between the pretreated materials was the extent and composition of the remaining hemicellulose (Table 2). The hemicellulose content decreased from 20% w/w DM in the raw material to 0% w/w DM in STEX210°C/SO2. Loss of arabinose on arabinoglucuronoxylan was observed in all pretreatments. Almost complete loss of galactose on galactoglucomannan occurred already with STEX210°C/auto. During STEX210°C/HAc, STEX210°C/H3PO4, and STEX210°C/H2SO4, the content of xylan and mannan decreased from 5.7 to 0.7% w/w DM and from 13.2 to 0.6% w/w DM respectively, with increasing severity. It is important to underline that the glucan amount, reported in Table 2, comprehends the amount of both glucose from cellulose and from galactoglucomannan.

4 Impact of acid-catalyzed steam pretreatment on MWL

The effect of acid-catalyzed steam pretreatment on milled wood lignin (MWL) was evaluated by 2D HSQC-NMR. MWL extraction by a modified version of the Björkman method [23] was applied to the STEX210°C/auto, STEX210°C/HAc, STEX210°C/H3PO4, and STEX210°C/H2SO4. On average, the extracted MWL amounted to 25% (w/w) of total acid-insoluble lignin, with the autocatalyzed material having the lowest yield (21% w/w) (Table 3). The reported values are within the values presented in literature which are between 10 and 25% (w/w); where the lower percentage was found for ball milled spruce and the higher for hydrothermally treated spruce [24, 25]. Generally, increasing the severity of pretreatment results in higher MWL yields [26], owing to higher mechanical or chemical interbond cleavage, which in turn increases the amount of dioxane-soluble lignin and LCCs [24]. This trend is not immediately evident from the extraction yields obtained in this study (Table 3). The apparent discrepancy could be explained by an insufficient difference in severity, leading to only minimal structural changes between fractions, or by the extraction method reaching in a plateau.

Table 3 Results from the extraction of MWL at varying severities. The extraction yields and the content of the inter-monolignol linkages are highlighted in bold

MWL was dissolved in DMSO-d6 and analyzed using 2D HSQC-NMR. As shown in Fig. 1 for the autocatalyzed fraction (STEX210°C/auto), NMR signals in the aromatic region (6.1/102.0 ppm < δHC < 7.6/122.0 ppm) indicate that the lignin was of the guaiacyl type, with no signals for syringyl (δC < 102.0 ppm) and p-hydroxyphenyl (δC > 122.0 ppm), as expected for spruce lignin [27]. The major intermonolignol bonds identified were β-O-4, β-5, and β-β. Methoxyl and oxidized structures (α-carbonyl and coniferyl aldehyde) were also detected (Fig. 1). While the same peaks were detected in all fractions (Fig. S2), semi-quantitative data showed differences in bond occurrence between them (Table 3). The content of β-O-4 linkages was lower than the 36–45 units/100 aromatic rings reported for ball-milled spruce lignin [25]. This discrepancy can be attributed to the harsh conditions of STEX and catalyst addition, which are known to cleave β-O-4 linkages much more effectively than ball milling [25, 28, 29]. In our study, an increase in STEX severity led to a decrement of β-O-4 linkages. A similar trend was observed for β-5 linkages, although to a lesser degree, demonstrating the stable nature of lignin carbon–carbon bonds, including β-β linkages, in acidic conditions (Table 3) [29].

Fig. 1
figure 1

HSQC and HMBC spectra of MWL derived from STEX210°C/auto-pretreated material and dissolved in DMSO-d6. The numbers in the labels indicate the carbon number in the different substructures. The identified substructures in lignin, sugars, and LCCs are shown with structure colors and labels matching the peaks in the HSQC spectra. α-Carb. = α-carbonyl; BE = benzyl ether; C. Ald = coniferyl aldehyde; C-Ar = carbon in the aromatic ring; GE = gamma ester; M = mannose; X = xylose

NMR spectra showed signals for two types of LCCs, gamma esters and benzyl ether (Fig. 1), implying that the sugars found in the fractions (Table 3) could be coupled to lignin through benzyl ether and gamma ester linkages. The region for gamma esters (δCH: 62–65/3.8–4.5 ppm) contained several overlapping peaks due to the occurrence of two different classes of gamma ester bonds: acetyl esters (δCH: 62.5/4.24;3.84 ppm) and uronic acid esterified lignins [24].

One prerequisite for the identification of acetyl esters is the presence of a methyl signal in the region for acetyl groups (δCH ~ 20.0/2.0 ppm) and the absence of acetylated polysaccharide signals. The latter ensures that the methyl signals originate from acetylated lignin and not from polysaccharides. These conditions were observed in Fig. 1. Giummarella and Lawoko [24] reported that these acetyl esters were part of β-O-4 subunits, which gave a signal at a chemical shift of δCH ~ 81/4.62 ppm, which was also observed in Fig. 1 (βO4γester). This finding strengthened the results and assignment of this gamma ester bond.

Uronic acid esterified lignins were visible in Fig. 1CH: 65.3/4.33;4.11 ppm). These bonds are the result of the formation of gamma-ester linkages between lignin and uronic acids [24]. The C-6 of non-esterified uronic acid produces a signal at δC = 171.4 ppm in a carbon NMR spectrum [8]. However, in LCC gamma ester bonds, the same signal is shifted upfield to δC = 166.5–169.4 ppm. The C-6 of the uronic acid group is a carbonyl carbon and not visible by HSQC. However, correlations between the C-6 of uronic acid and H-γ (3 J long-range coupling) of lignin can be seen using heteronuclear multiple bond correlation (HMBC). Based on HMBC results (Fig. 1), a correlation was found at δC = 167 ppm for the signal at δCH: 65.3/4.33 ppm, thereby confirming that the signal was probably originating from the LCC gamma ester bond.

The HMBC results revealed an interesting correlation between the gamma ester signals at δH = 4.24; 4.11 ppm and a signal at δC = 162 ppm. A signal found by HSQC corresponded to the formyl ester at 162/8.22 ppm (Fig. 1). Based on the chemical shift region and assignment in literature, the signal was probably originating from a formyl group [30]. The formation of acyl groups (acetyl and formyl) is driven by Fischer’s acidic esterification of carboxyl groups on acetic and formic acid with the hydroxyl groups on lignin. The concomitant presence of formyl and acetyl esters likely explains the overlapping region of gamma esters. Semi-quantitative results (Table 3) revealed different trends for the two acyl esters; the amount of acetyl esters remained largely constant at varying pretreatment severity, while the amount of formyl ester increased with increasing severity. This trend could be related to the formation, strength, and structure of acetic and formic acids. Acetic acid is produced mainly by the hydrolysis of acetyl groups on hemicellulose, whereas formic acid is a thermochemical degradation product of polysaccharides; both of them increased with the severity of pretreatment [31]. Formic acid is stronger than acetic acid (pKa of 3.8 vs. 4.8) and is structurally more accessible due to a hydrogen on the carboxyl group versus methyl group on acetic acid. As a result, formic acid is more reactive than acetic acid [32]. As shown in Table 3, the detected gamma ester and benzyl ether bonds were stable under acidic STEX conditions, which could explain the recalcitrance of hemicellulose during steam pretreatment (Table 2) [24].

4.1 Impact of steam pretreatment severity on cellulose ultrastructure analyzed by SAXS/WAXS

The effect of lowering steam pretreatment severity on the crystallinity of cellulose microfibrils was studied by WAXS. Figure 2 showcases a typical diffractogram of a spruce sample subjected to H2SO4-catalyzed STEX. The WAXS profile consisted of five characteristic crystalline peaks (101, 10î, 102, 200, and 004) and a broad amorphous region at 19°. The curve above 26° was dominated by the scattering from water, due to the elevated moisture content in the samples. The remaining scattering curves are presented in the Fig. S2.

Fig. 2
figure 2

WAXS diffractogram (above) of a spruce sample treated with H2SO4. Dashed lines represent theoretical fits to the experimental data. A sharp peak at around 21° corresponds to the scattering from mica windows. SAXS curves (below) of untreated spruce (dark blue) and samples pretreated by STEX180°C/auto (purple), STEX210°C/auto (orange), STEX210°C/HAc (grey), STEX210°C/H3PO4 (green), STEX210°C/H2SO4 (light blue), and STEX210/SO2 (pink). Scattering curves were shifted for visual clarity

Once all the peaks were fitted, the crystallinity index and the size of the ordered domains were determined. Because the crystallinity index is a relative value affected by changes in both crystalline and para-crystalline parts of the cellulose structure, it is difficult to draw conclusions on the absolute content of crystalline domains. The crystallinity index of the raw material was estimated to be 63% ± 4% (Fig. 3A) and was similar to that of autocatalyzed materials. The size of the cellulose crystallite in the untreated spruce sample was 2.38 ± 0.04 nm, which is comparable to 2.2 nm reported in the literature [33]. A comparison of the crystallinity index between STEX210°C/HAc and STEX210/SO2 revealed a slight increase in crystallite dimension with increasing severity of pretreatment, suggesting the aggregation of cellulose microfibrils.

Fig. 3
figure 3

Crystallinity index (A) and crystallite size (B) of raw and steam pretreated spruce samples, as determined by WAXS analysis. Distance between fibril center (C) of steam pretreated spruce samples, as determined by SAXS analysis. Error bars correspond to a standard deviation of three measurements

SAXS provides information on the microfibril arrangement inside the plant cell wall. A typical scattering curve of wood samples often contains a characteristic peak at around 0.11 Å−1 (Fig. 2). The peak was more clearly visible in a Kratky plot (Fig. S4), which included the scattering curves of spruce samples before and after pretreatment. The dimensions obtained from the center of the peak can be correlated with the microfibril-to-microfibril distance [34, 35]. The peak in the SAXS curve of the untreated spruce sample was not sufficiently pronounced, preventing any interpretation. The distance between the microfibrils’ center is increasing as the pretreatments become harsher (Fig. 3C). Considering that WAXS indicated microfibrillar aggregation, we believe that the distance observed by SAXS is between cellulose microfibril aggregates instead, as the microfibrils are closely packed inside the aggregates and give rise to similar scattering.

4.2 Overall structural evaluation of mildly pretreated spruce

A macroscopic observation of the pretreated materials revealed some clear differences (Fig. S1). Compared to the raw material, the substrate presented increasingly smaller particle size and homogeneity with increasing severity of pretreatment.

SEM was used to study the effect of steam pretreatment on spruce overall structure (Fig. 4). The tracheids in softwood (Fig. 4, red arrows) are between 1 and 10 mm in length and function chiefly as a source of support (late wood) or for the conduction of fluids (early wood) [36]. The main effect of steam pretreatment was defibrillation of the wood chips’ surface, which was unnoticeable in the raw material (Fig. 4A) but became increasingly evident with rising severity of pretreatment. In STEX180/auto, the overall structure looked slightly corrugated (Fig. 4B), although still similar to the raw material. STEX210/auto showed an increased effect on the surface (Fig. 4C), but the overall structure was akin to the material pretreated at 180 °C. The addition of acids triggered further defibrillation, and the overall structure of the material started to show disruption (Fig. 4DF). In STEX210/SO2, almost complete disruption of the structure was reached because of the high severity of the pretreatment (Fig. 4G).

Fig. 4
figure 4

SEM images showing the ultrastructure of Norway spruce before and after pretreatment. A Untreated sample. BG Samples pretreated by B STEX180°C/auto, C STEX210°C/auto, D STEX210°C/HAc, E STEX210°C/H3PO4, F STEX210°C/H2SO4, and G STEX210°C/SO2. The structures marked with red arrows represent tracheids. The scale bar in the lower right hand corner in all figures represent 1 mm

4.3 Evaluation of enzymatic hydrolyzability yields

To investigate the enzymatic hydrolyzability of various steam pretreated spruce samples, enzyme-catalyzed sugar release was assessed. Glucose was released from both cellulose and galactoglucomannan (Fig. 5). The release of glucose increased from 0% w/w DM in the raw material to 19% w/w DM in STEX210°C/H2SO4 and 66% w/w DM in STEX210°C/SO2 (Fig. 5A). Xylose, which was released from the arabinoglucuronoxylan backbone, reached its highest value of 11% w/w DM in STEX210°C/auto (Fig. 5B). Similarly, the largest release of mannose from the galactoglucomannan backbone (17% w/w DM) was detected in STEX210°C/auto (Fig. 5C).

Fig. 5
figure 5

Enzymatic hydrolysis of raw and pretreated spruce substrates. Release of A glucose, B xylose, and C mannose after 48 h of hydrolysis with an enzyme loading 10 FPUg−1 DM of Celluclast and 10 Ug−1 DM of Novozyme 188. Yields were less accurate for materials pretreated at higher severities due to the small amount of residual hemicellulose in the solid fraction. Data represent mean values ± standard deviation of triplicate measurements

5 Discussion

In the current work, spruce biomass was pretreated by STEX at different severities, and its ultrastructure was analyzed. WAXS analysis of the untreated spruce, with its even surface (Fig. 4A), was used as a starting point to understand how the cellulose crystallite size was affected by increasing pretreatment severity. The peak in the SAXS curve of the untreated spruce sample was not sufficiently pronounced, preventing any interpretation. Furthermore, no enzymatic conversion was detected. Pretreatment was essential to increase enzyme accessibility as reported frequently in the literature [5, 15, 16]. The most applied catalysts to steam pretreat softwood getting high sugars release after enzymatic hydrolysis are either sulfur dioxide or sulfuric acid [16, 17].

In the present study, sulfur dioxide was chosen as the reference catalyst because its sulfonation of lignin favors saccharification efficiencies since lignin becomes more soluble [5]. In STEX210°C/SO2, the hemicellulose was completely solubilized (Table 2A) and the overall structure was disrupted as judged by visual inspection (Fig. S1-G) and SEM (Fig. 4G). The results are in line with the expected effect of the steam pretreatment at high severities [15]. Based on the attained structural evidence, STEX210°C/SO2 led to increased cellulose crystallite dimension (Fig. 3B) and the distance between the microfibrils’ center (Fig. 3C), thus providing the best accessibility to enzymes, as demonstrated by the highest glucose release after enzymatic hydrolysis (Fig. 5A).

The downside of steam pretreatment with sulfur dioxide is the loss of sugars from the solid fraction and the formation of inhibitors of microbial fermentation. Both factors contribute to a decrease in the overall process yields [37]. To lower the impact on process yields, we applied milder pretreatment conditions including (from low to high severity): STEX180°C/auto, STEX210°C/auto, STEX210°C/HAc, STEX210°C/H3PO4, and STEX210°C/H2SO4. The materials recovered from milder pretreatment were structurally different (Fig. 4) from STEX210°C/SO2 but showed also differences between each other. Specifically, the compositional analysis revealed the loss of arabinose in all the steam pretreated materials. Almost complete loss of galactose occurred already with STEX210°C/auto. During STEX210°C/HAc, STEX210°C/H3PO4, and STEX210°C/H2SO4, the content of xylan and mannan decreased increasing the severity. Surface defibrillation and disruption of wood chips increased with increasing pretreatment severity. This effect is important for the enzymes since the presence of the fibers on the surface facilitate enzyme access. The structural observations by SEM were in line with the SAXS/WAXS and NMR results, showing rearrangement of the cellulose fibers and depolymerization of lignin.

Collectively, the results point to greater enzyme accessibility, as shown by the increase in crystallite size and the distance between microfibrils, with rising pretreatment severity. The increase in the size of cellulose crystallites at elevated temperatures has been noted previously with different types of biomass [35, 38]. Hemicellulose acts as a spacer between microfibrils [35]; therefore, when it is removed during pretreatment, it might no longer prevent cellulose aggregation. The obtained structural information suggests that pretreatment leads to a rearrangement of cellulose microfibrils, which may cause the formation of larger pores and increase the distance between microfibril aggregates, ultimately providing easier access to enzymes [6, 12, 13]. These differences in the cellulose structure were supported by hydrolyzability results (Fig. 5A) since the glucose release increased increasing the severity of the pretreatment.

A comparison of the lignin recovered from STEX210°C/auto, STEX210°C/HAc, STEX210°C/H3PO4, and STEX210°C/H2SO4 pretreatments revealed a progressive loss of β-O-4 linkages. A similar trend but less pronounced was observed in β-5 linkages that confirmed the stable nature of lignin carbon–carbon bonds in acidic conditions (Table 3) [29]. NMR spectra revealed the presence of two different signals for LCCs: gamma esters and benzyl ether. We were not able to distinguish the single gamma ester signals because of partial peak overlap between acetyl esters and uronic acid esterified lignins.

Results from MWL extraction (Table 3) showed the presence of monomeric sugars that were not solubilized during samples preparation. This is probably indicative of recalcitrance of some polymers and is in line with the evaluation of hemicellulose hydrolyzability. The release of xylose during enzymatic hydrolysis (Fig. 5B) did not increase markedly with the severity of pretreatment; whereas that of mannose decreased (Fig. 5C) increasing the severity in the materials pretreated with acids. This finding may be explained by the low amount of hemicellulose, together with that the hemicellulose may be hard to access for the enzymes because of the presence of LCCs.

6 Conclusions

The material and energy-efficient use of lignocellulosic biomass is an important priority in maximizing the overall yield of biorefinery processes. By modulating the severity of pretreatment, the use of all parts of biomass and the enzyme accessibility can be improved. By applying advanced characterization methods, novel structural knowledge on the effects of mild steam pretreatment was generated. We investigated six different pretreatment conditions and found that lowering the severity of steam pretreatment had a differential impact on the overall structure, cellulose ultrastructure, and lignin-lignin bonds of Norway spruce. Surface defibrillation and lignin depolymerization were observed increasing pretreatment severity. Further, lignin-carbohydrate complexes signals were detected. Cellulose analysis revealed the rearrangement of microfibrils leading to the formation of larger microfibril aggregates during the pretreatment. This microfibril rearrangement likely contributed to the observed increase in enzymatic hydrolysis yields as better enzyme accessibility resulted. The added knowledge is important to understand the structural characteristics of mild steam pretreated spruce that can drive the enzyme choice for an efficient saccharification process. An efficient saccharification yield, is a prerequisite for high overall yields of the process.