Introduction

In order for a lignocellulose biorefinery to function as an economically viable enterprise, it must be developed to yield value-added coproducts in the process [1, 2]. Coupling the source of biomass to the waste stream of an associated commercial process is one way to create value in the overall system. The production of palm oil generates a substantial amount of empty fruit bunch (EFB) fibers that can be readily processed into sugars for biofuels production and a mixture of unfermentable components including lignin. The resulting lignin stream could then offer new opportunities for the development and isolation of value-added products derived from the oil palm industry [3]. For example, in 2013 Malaysia, the second largest oil palm producer after Indonesia, produced over 19 million tons of palm oil on a total plantation area of over 5 million ha, which also generated an estimated 70–80 million tons of biomass, mostly from oil palm EFB [4]. This biomass, which is a high-quality lignocellulosic fiber, has not been fully utilized commercially and is a largely untapped resource, although research in the biorefinery area has attempted to provide high value-added products via liquefaction [5], solvolysis [6] and pyrolysis [7].

Lignin, a stochastically generated biopolymer found in plants, forms by means of radical coupling reactions, primarily between a monolignol and the growing lignin polymer in an endwise process [8, 9]. In angiosperms, coniferyl and sinapyl alcohols are the predominant monomers and, once incorporated into the lignin backbone, produce guaiacyl (G) and syringyl (S) units in the resulting polymer. A monolignol couples overwhelmingly at its β-position to the growing polymer at available 4–O- (G or S) or 5- (G only) positions. Branching is introduced by 5–O–4- or 5–5-coupling reactions between preformed oligomers, with the latter occurring only between G units and the former involving at least one G unit. Evidence continues to accumulate, particularly from studies of the lignin biosynthetic pathway using mutant and transgenic plants, that lignins can incorporate monomers beyond the traditional three monolignols. As reviewed previously [811], these include (but are not limited to) the novel caffeyl and 5-hydroxyconiferyl alcohols from incomplete methylation in monolignol biosynthesis (OMT-deficient plants); the immediate hydroxycinnamaldehyde monolignol precursors, coniferaldehyde and sinapaldehyde, and the hydroxy-benzaldehydes (vanillin and syringaldehyde) derived from them from incomplete reduction (CAD deficiency); and products such as dihydroconiferyl alcohol and its derived guaiacylpropane-1,3-diol in wild-type and mutant softwoods.

Beyond the monolignol biosynthetic pathway, various monolignol conjugates have also been shown to participate in lignification [1214]. Acylated lignins of four types have been encountered in various plants to date. Angiosperm trees have long been thought to have low-level lignin acetylation [15], whereas kenaf bast fiber lignins and lignins in various other plants are extensively acetylated [1621]. Grasses produce the enzyme p-coumaroyl-CoA:monolignol transferase (PMT) [22, 23], which produces γ-p-coumaroylated monolignols that are incorporated into the growing lignin polymer, particularly at advanced maturity [2426]. Recently, monolignol ferulate conjugates were introduced into poplar trees through the introduction of a gene expressing feruloyl-CoA:monolignol transferase (FMT) [27], further demonstrating the plasticity of lignification and the role of monolignol conjugates. A few hardwoods have partially p-hydroxybenzoylated lignins, notably the willows (Salix spp.) (family Salicaceae) [28], the poplars and aspens (Populus spp.) (family Salicaceae) [29, 24, 3032], and Aralia cordata (family Araliaceae) [33]. In addition, the palms, which are a large family (Arecaceae) of economically important monocotyledonous plants comprising some 2400 species, also have partially p-hydroxybenzoylated lignins [3437]. Several low molecular mass cross-coupling products γ-acylated with p-hydroxybenzoate were isolated from actively lignifying poplar xylem [29, 32]. Enhanced interest in p-hydroxybenzoates has surfaced in conjunction with reduced lignification in aspen and poplar following down- or upregulation of genes in the monolignol pathway [3841].

Relatively little is known about the nature of p-hydroxybenzoylated lignins or how they arise. From oil palm trunk, nitrobenzene oxidation produced high levels of p-hydroxybenzoic acid (9.8–14.9 %) [37], which was also the main product of alkaline hydrolysis of parenchyma milled wood lignin (MWL) [31]. Small quantities of vanillic and syringic acids were also found. The authors concluded that the phenolic acids acylated the side-chain alcohol groups of lignin subunits. High levels of p-hydroxybenzoic acid were also released from oil palm lignins [4246, 34]. The 13C-NMR chemical shifts in Salix (willow) MWL samples showed good concordance with expected shifts for free-phenolic γ-p-hydroxybenzoate esters based on model compound studies [28]. Rather than suspecting that the lignin side-chains were acylated after the polymer had already formed, p-hydroxybenzoyl monolignol conjugates were proposed as the precursors to these structures in early research [47]; monolignol acylation was further conjectured [25] and eventually proven by the observation of novel β–β-coupling products in lignins that could only have arisen from pre-acylated monolignols [29, 14, 48, 32]. Recent studies have delivered the transferases p-coumaroyl-CoA:monolignol transferase (PMT) in grasses responsible for lignin p-coumarate esters via p-coumaroylation of monolignols [22, 23, 49, 50] and now an exotic feruloyl-CoA:monolignol transferase (FMT) that feruloylates monolignols prior to lignification [27]; the putative p-hydroxybenzoyl-CoA and acetyl-CoA monolignol transferases have not yet been identified.

The research presented in this paper seeks to answer several key questions regarding the intricacies of acylation of lignins by p-hydroxybenzoate, specifically: whether p-hydroxybenzoates exist as free-phenolic appendages on lignins, akin to the p-coumarates; whether they, like p-coumarates [51, 52, 23, 49] and acetates [18] in most plants, primarily acylate S units; whether the various lignin substructures carry these units and thus implicate the incorporation of preformed monolignol γ-p-hydroxybenzoate conjugates into the lignification process; and whether diagnostic β–β-coupled units can be found to provide proof that lignification utilizes preformed monolignol p-hydroxybenzoate conjugates, thus, also establishing that monolignol p-hydroxybenzoate conjugates are lignin precursors. Although a survey across all palms is beyond the scope of this work, a sago palm lignin previously examined by analytical pyrolysis [35] and prior studies on coconut (Cocos nucifera) coir fibers [36] are used to suggest a degree of generality in palms. Together, such studies will help to elucidate the roles of putative genes and enzymes associated with the required transferase activity, knowledge that may clarify the poorly understood reasons for widespread lignin acylation in the plant kingdom. At the same time, it will also shed light on future paths that may be worth exploring for value-added products making lignocellulosic biofuels a more viable option; depending on the level of p-hydroxybenzoates found in a given biomass material, and the ease with which it can be cleaved and recovered from the polymer, p-hydroxybenzoic acid or its derivatives may be accessible value-added products [34, 31]. A simple net search shows that p-hydroxybenzoic acid currently sells for ~$US4,000/ton and has uses: as a source of esters that have antibacterial and antifungal properties, as an intermediate in pesticides and antiseptics, and for preparation of co-polyesters (including liquid crystal polymers).

Materials and Methods

General

Commercial chemicals, including solvents, were of reagent grade or better and used without further purification. Analytical thin-layer chromatography (TLC) was carried out on EM Science TLC plates pre-coated with silica gel 60F254; TLC visualization was by UV. Purification by flash column chromatography was accomplished on silica gel (230–450 mesh). 4-Acetoxybenzoyl chloride p Bz Cl was prepared according to a previous procedure [53]. Nuclear magnetic resonance (NMR) data and assignments for all model compounds are from well-described 1D and 2D NMR experiments (on a Bruker Biospin Avance 500 MHz spectrometer). High-resolution mass spectrometric (HRMS) data were acquired using electrospray ionization (ESI) on a Waters (Micromass) LCT ESI/TOF MS instrument. UV–vis analysis performed in 1-cm quartz cuvettes at λ = 280 nm on a Shimadzu UV-1800.

Plant Materials

Palm Samples

The African oil palm (Elaeis guineensis) fractions used in this study came from two sources. The first sample, consisting of oil palm fronds and empty fruit bunch (EFB) fibers, was supplied by the Forest Research Institute of Malaysia and is the same sample used in a previous study by Sun et al. [45] and will be denoted here as EFB-1. The second sample of oil palm EFB fibers was provided by the Ladang Tai Tak Sdn. Bhd. Palm oil mill in Kota Tinggi, Johor, Malaysia, and will be denoted here as EFB-2. These EFB fibers were steam-treated (45 psi, 130 °C, 70 min) to remove the fruitlets and then shredded into fibers (4–6 in in length). Various data on both oil palm EFB samples are presented in Table 1. Isolated sago palm (Metroxylon sagu) lignin was the material previously described in a pyrolysis study [35].

Table 1 Analytical data on the two sources of oil palm EFB material and isolated MWL

Sample Preparation and Lignin Isolation

All fibrous oil palm samples were dried at 50 °C in an oven for at least 72 h before being ground. Two-step grinding was performed in a Wiley mill using a 2-mm followed by a 1-mm mesh. Removal of extractives from the samples was carried out by successive extractions with water, ethanol, acetone, and chloroform. Extractions were accomplished by refluxing the material in Soxhlet equipment for at least 8 h with each solvent. Prior to analysis, extracted samples, frequently referred to as whole-cell-wall preparations, were dried for 2 weeks in a desiccator under vacuum using P2O5 as drying agent, which was replaced regularly. The dried materials were ball-milled using a stainless steel vibratory ball mill for 1.5 h (with 0.5 h on, 0.5 h off, to minimize heating). The ball-milled materials (50 g) were submitted to enzymatic digestion with crude cellulases (Cellulysin, Calbiochem) for polysaccharide degradation [25, 54, 55]. As with aspen and poplar, the enzymatic treatment was particularly efficient at removing polysaccharides, leaving just 22.2 and 20.9 % of the cell wall from the EFB and frond samples, respectively. Samples of the enzyme lignins obtained (10 g) were then extracted with dioxane and water (96:4) in complete darkness at room temperature for 2 days [56]. The soluble fractions were obtained by filtration and lyophilized overnight. The isolated lignins were washed extensively with water and 6 mM EDTA aqueous solution (pH 8). This removed as much of the remaining low-molecular-weight saccharides and metal ions as possible [25]. The washed Björkman lignins were lyophilized again overnight and stored. The yields of isolated EFB lignin used in a previous study by Sun et al. [45] were 19.9 % of the original cell wall material. As seen in Table 1, the AcBr lignin contents, measured with ε 280 = 17.9, as previously described [57, 58], of the EFB-1 fibers were 22.7 ± 0.4 %, giving a MWL yield on a lignin basis of ~88 %. The pBz content of the MWL was 4.2 ± 0.1 %. For the EFB-2 sample, the AcBr lignin content was 18.3 ± 0.2 %.

Isolated lignins were acetylated using acetic anhydride and pyridine as previously described [59]. Solutions of the acetylated lignins in EtOAc were washed with 6 mM EDTA to more rigorously remove metal ion contaminants following methods described previously [25].

NMR of Lignins

Spectra have been run over many years on a variety of Bruker instruments, including DP 360 and Avance 400, 500 and 600 MHz instruments, using standard experiments and conditions [13]. The 3D total correlation spectroscopy-heteronuclear single-quantum coherence (TOCSY-HSQC) experiment was run as described previously [60, 55].

Derivatization Followed by Reductive Cleavage (DFRC) Method Adapted for the Analysis of Monolignol p-Hydroxybenzoate Conjugates

The derivatization followed by reductive cleavage (DFRC) method [61, 62] was adapted for HPLC analysis, as the monolignol p-hydroxybenzoate conjugates were found to be incompatible with GC-MS analysis. The modified procedure is detailed below.

In a 2-dram vial equipped with a polytetrafluoroethylene (PTFE) pressure-release cap and containing a stir bar was suspended each extract-free EFB whole-cell-wall sample (50 mg) in a 20 % solution of acetyl bromide in acetic acid (5 mL). The suspension was heated at 50 °C and allowed to stir for 3 h. The solvent was removed on a SpeedVac (Thermo Scientific SPD131DDA, 50 °C, 35 min, 1.0 Torr, 40 Torr/min, RC lamp on). The crude film was suspended in absolute ethanol (1 mL), and the ethanol was then removed on the SpeedVac (50 °C, 15 min, 6.0 Torr, 30 Torr/min, RC lamp on). To the residue was added a mixture of dioxane:acetic acid:water (5/4/1 v/v, 5 mL) and nano-powder zinc (250 mg). The sample was sonicated to ensure that the biomass film was dissolved and the zinc was in suspension. The reaction was stirred in the sealed vial in the dark at room temperature for 16–20 h. Additional nano-powder zinc was added to the vial as required to maintain a fine suspension. The reaction was quenched with a mixture of dichloromethane (DCM, 6 mL), saturated ammonium chloride (15 mL), and internal standard (ISTD, diethyl 5-5′-diferulate diacetate, DEDF, 54.0 μg). After isolating the organic fraction using a separatory funnel, the aqueous phase was extracted with DCM (3 × 10 mL). The combined organic fractions were dried over anhydrous sodium sulfate, filtered, and the solvent evaporated in vacuo. The free hydroxyl groups were acetylated by adding pyridine/acetic anhydride (1/1 v/v, 5 mL) and allowing the mixture to sit in the dark for 16 h, after which the solvent was removed to give an oily film.

To remove most of the polysaccharide-derived products, the crude DFRC product was dissolved in ethyl acetate (EtOAc, 0.15 mL), diluted with hexanes (0.15 mL), and loaded onto an SPE cartridge (Supelco Supelclean LC-Si SPE tube, 3 mL, P/N: 505048). The products were eluted with hexanes:ethyl acetate (1:1, 8 mL), and the solvent was removed on a rotary evaporator. The product was dissolved in ethyl acetate and filtered through a 0.2-μm PTFE filter into a high-performance liquid chromatography (HPLC) vial. Analysis of the purified products was performed on an HPLC–PDA-ESI-MS (Shimadzu LC-MS 2020) with an amino stationary-phase column (Phemonenex Luna NH2, 100 Å, 250 mm × 4.60 mm × 5 μm). The mobile phase was hexanes and EtOAc with 0.1 % triethylamine, running at 0.7 mL/min, holding at 30 % EtOAc for 1 min then ramping at 1.2 %/min up to 95 % EtOAc. The column eluent was split 80:20 between a PDA detector (265–400 nm) and an ESI-MS detector operating in positive-ion mode. External synthetic standards of the predominantly trans-isomers were used for determination of the retention time, calibration of the PDA response, and authentication of each compound by MS, Table 2. The DFRC data is also summarized in Table 1.

Table 2 HPLC-PDI-ESI-MS elution parameters and observed mass-ion-ratios for parent or fragment ions from synthetic standards CA, SA, CA-pBz, SA-pBz, and internal recovery standard, DEDF (see Fig. 4 for structures of the peracetates of these compounds)

Syntheses of γ-p-Hydroxybenzoylated β-Ether Model Compounds

As we determined by NMR and DFRC that p-hydroxybenzoates acylated essentially only S units, the prime model compounds of interest were the etherified and the free-phenolic S β-ether γ-p-hydroxybenzoate models 7 and their peracetates 8. These model compounds were prepared following the synthetic route shown in Scheme 1.

Scheme 1
scheme 1

Synthetic scheme for model compounds. [Note that, in the NMR assignments in the Materials and Methods section, compounds 7 and 8 have rings labeled as A, the lower S ring; B, the middle S/G ring; and C, the pBz moiety]

Preparation of 4-Acetoxy-3,5-dimethoxyacetophenone 1

To a solution of the starting acetophenone (3,5-dimethoxy-4-hydroxy-acetophenone, 51.0 mmol, 10.0 g) in pyridine (30 mL) was added Ac2O (30 mL). The solution was allowed to stir overnight at room temperature. Upon confirmation by TLC that the starting material had been fully consumed, the reaction mixture was diluted with EtOAc (300 mL) and washed successively with 1 M HCl (3 × 150 mL), sat. NaHCO3 (1 × 150 mL), and sat. NaCl (1 × 100 mL). The organic layer was dried over Na2SO4 and concentrated in vacuo. Residual AcOH was removed by azeotropic distillation with toluene (3 × 50 mL), followed by removal of the toluene with MeOH co-distillation (2 × 50 mL). Recrystallization from 20 % EtOAc in Et2O afforded 1 (11.0 g, 91 %) as a pale yellow crystalline solid: 1H NMR (CDCl3) δ 7.23 (s, 2/6), 3.89 (s, 6H, OMe), 2.61 (s, 3H, β), 2.36 (s, 3H, OAc); 13C NMR (CDCl3) δ 197.0 (α), 168.4 (OAc), 152.4 (3/5), 135.3 (1), 132.9 (4), 105.2 (2/6), 56.5 (OMe), 26.7 (β), 20.6 (OAc); HRMS (ESI) m/z calcd. for [M + NH4]+ = 256.1180, found 256.1180.

Bromination of compound 1 (preparation of compound 2)

Compound 1 (46.0 mmol, 11.0 g) was dissolved in EtOAc (460 mL), and the reaction solution was cooled to 0 °C. Pyridinium tribromide (55.4 mmol, 20.0 g) was charged into the reaction flask incrementally over 10 min, and the reaction was allowed to stir for 2 h, with the temperature gradually warming from 0 °C to room temperature (23 °C). If any starting material remained detectable by TLC at that point, additional portions of pyridinium tribromide (in increments of approximately 3 mmol, or 1 g) were added until TLC analysis showed full conversion. The reaction was then quenched with a solution of sat. NaHCO3 (400 mL) and, after separation, the aqueous layer was extracted with DCM (1 × 100 mL). The combined organic layers were then washed with 1 M HCl (2 × 200 mL) and sat. NaCl (1 × 100 mL), dried over Na2SO4, and concentrated under reduced pressure. After crystallization from 30 % EtOAc/hexanes, 2 (9.2 g, 63 %) was obtained as a pale yellow crystalline solid: 1H NMR (CDCl3) δ 7.25 (s, 1H, 2/6A), 4.42 (s, 2H, β), 3.89 (s, 6H, OMeA), 2.36 (s, 3H, OAcA); 13C NMR (CDCl3) δ 190.4 (α), 168.3 (OAc), 152.6 (3/5), 133.6 (1), 132.0 (4), 105.9 (2/6), 56.5 (OMe), 30.6 (β), 20.6 (OAc); HRMS (ESI) m/z calcd. for [M + NH4]+ = 334.0285, found 334.0289.

Etherification of bromide 2 (illustrated for preparation of compound 3 SG )

Upon dissolution of compound 2 (10.0 mmol, 3.20 g) in acetone (100 mL), K2CO3 (40.0 mmol, 5.5 g) and NaI (0.1 mmol, 15.0 mg) were charged into the reaction flask. After adding guaiacol (11.0 mmol, 1.4 g), the reaction mixture was heated to reflux and allowed to stir for 3 h. At that point, TLC showed complete conversion and after the reaction mixture was allowed to cool to room temperature, the acetone was removed in vacuo. The solids were then reconstituted in EtOAc (200 mL) and H2O (100 mL), and the aqueous layer removed. The organic layer was washed with 1 M HCl (1 × 100 mL) and sat. NaCl (1 × 75 mL), dried over Na2SO4, and concentrated in vacuo to a viscous oil which, upon allowing to stand in the freezer overnight, crystallized. Recrystallization with 20 % EtOAc/hexanes afforded 3 SG (2.74 g, 76 %) as a white solid: 1H NMR (CDCl3) δ 7.00–6.96 (m, 1H, 1B), 6.92 (dd, J = 8.1, 1.1 Hz, 1H, 2B), 6.89–6.83 (m, 2H, 5B/6B), 5.28 (s, 2H, β), 3.88 (s, 9H, OMe), 2.36 (s, 3H, OAcA); 13C NMR (CDCl3) δ 193.9 (α), 168.3 (OAcA), 152.5 (3A/5A), 149.8 (3B), 147.4 (4B), 133.4 (4A), 132.6 (1A), 122.8 (1B), 121.0 (6B), 114.8 (5B), 112.2 (2B), 105.3 (2A/6A), 72.6 (β), 56.5 (OMeA), 55.9 (OMe B ), 20.6 (OAcA); HRMS (ESI) m/z calcd. for [M + H]+ = 361.1282, found 361.1286.

Compound 3 SS (2.75 g, 71 %), white solid: 1H NMR (CDCl3) δ 7.38 (s, 2H, 2A/6A), 7.03 (t, J = 8.4 Hz, 1H, 4B), 6.59 (d, J = 8.4 Hz, 2H, 3B/5B), 5.14 (s, 2H, β), 3.88 (s, 6H, OMeA), 3.81 (s, 6H, OMeB), 2.36 (s, 3H, OAcA); 13C NMR (CDCl3) δ 194.1 (α), 168.4 (OAcA), 153.4 (3B/5B), 152.4 (3A/5A), 136.4 (4B), 133.2 (4A), 133.1 (1A), 124.4 (1B), 105.5 (2A/6A), 105.3 (2B/6B), 75.6 (β), 56.5 (OMeA), 56.2 (OMeB), 20.6 (OAcA); HRMS (ESI) m/z calcd. for [M + Na]+ = 413.1207, found 413.1199.

Procedure for hydroxymethylation of 3 (illustrated for preparation of compound 4 SG )

The improved method using dioxane as solvent as described previously [63] was used. Ether 3 SG (7.60 mmol, 2.74 g) was dissolved in 1,4-dioxane (76 mL), and K2CO3 (30.4 mmol, 4.20 g) was charged into the reaction flask. After adding formaldehyde (9.1 mmol, 0.7 mL, 37 wt% in H2O) to the flask, the reaction mixture was allowed to stir for approximately 4 h at room temperature, until the starting material could no longer be detected by TLC. The organics were concentrated in vacuo, with all residual solids then being reconstituted in EtOAc (200 mL) and H2O (100 mL). The layers were separated, and the aqueous layer was extracted with EtOAc (1 × 100 mL). After combining the organic layers, they were washed with 1 M HCl (1 × 150 mL) and sat. NaCl (1 × 100 mL), dried over Na2SO4, and evaporated under reduced pressure. Flash column chromatography (100 g SiO2) using gradient elution from 10 to 75 % EtOAc/hexane over 30 column volumes afforded 4 SG (2.28 g, 77 %) as a light yellow brittle foam: 1H NMR (CDCl3) δ 7.37 (s, 2H, 2A/6A), 7.02 (td, J = 7.8, 1.2 Hz, 1H, 1B), 6.92 (dd, J = 8.1, 1.2 Hz, 1H, 2B), 6.91 (dd, J = 8.0, 1.5 Hz, 1H, 5B), 6.85 (td, J = 7.7, 1.2 Hz, 1H, 6B), 5.36 (dd, J = 5.8, 4.8 Hz, 1H, β), 4.09 (s, 2H, γ), 3.84 (s, 9H, OMeA/B), 2.94 (s, 1H, OH), 2.35 (s, 3H, OAcA); 13C NMR (CDCl3) δ 195.8 (α), 168.3 (OAcA), 152.5 (3A/5A), 150.6 (3B), 146.8 (4B), 133.5 (4A), 132.9 (1A), 124.0 (1B), 121.4 (6B), 118.6 (5B), 112.4 (2B), 105.9 (2A/6A), 84.8 (β), 63.6 (γ), 56.5 (OMeA), 55.9 (OMeB), 20.6 (OAcA); HRMS (ESI) m/z calcd. for [M + NH4]+ = 408.1653, found 408.1659.

Compound 4 SS (2.25 g, 77 %), sticky white foam: 1H NMR (CDCl3) δ 7.39 (s, 2H, 2A/6A), 7.04 (t, J = 8.4 Hz, 1H, 1B), 6.59 (d, J = 8.4 Hz, 2H, 2B/6B), 5.09 (dd, J = 7.4, 3.2 Hz, 1H, β), 4.03 (ddd, J = 11.9, 7.4, 4.5 Hz, 1H, γ), 3.94 (dd, J = 9.0, 4.4 Hz, 1H, OH), 3.86 (s, 6H, OMeA), 3.82 (dd, J = 9.0, 3.1 Hz, 1H, γ), 3.75 (s, 6H, OMeB), 2.36 (s, 3H, OAcA); 13C NMR (CDCl3) δ 195.6 (α), 168.3 (OAcA), 152.9 (3B/5B), 152.3 (3A), 136.4 (4B), 133.6 (4A), 133.1 (1A), 124.6 (1B), 105.9 (2A/6A), 105.3 (2B/6B), 87.5 (β), 63.4 (γ), 56.5 (OMeA), 56.1 (OMeB), 20.6 (OAcA); HRMS (ESI) m/z calcd. for [M + H]+ = 421.1494, found 421.1494.

Procedure for 4-acetoxybenzoylation of the γ-OH in the β–O–4-linked dimers 4 (illustrated for preparation of compound 5 SG-pBz )

Alcohol 4 SG (1.3 mmol, 508 mg) was dissolved in dichloromethane (13 mL), and the resulting solution was cooled to 0 °C. DMAP (0.33 mmol, 40 mg) and Et3N (3.9 mmol, 0.54 mL) were charged into the reaction mixture, followed by 4-acetoxybenzoyl chloride (pBzCl; 1.6 mmol, 318 mg). The reaction was allowed to warm to room temperature gradually with continued stirring for approximately 2 h. Once TLC showed no detectable starting material remaining, the mixture was poured into 1 M HCl (50 mL). After separation, the aqueous layer was extracted with DCM (1 × 30 mL), and the combined organics were washed with sat. NaCl (1 × 50 mL). They were then dried over Na2SO4 and concentrated in vacuo. Purification by flash column chromatography (50 g SiO2, gradient elution from 5 to 50 % EtOAc/hexane) gave ester 5 SG-p Bz (547 mg, 76 %) as a white foam: 1H NMR (CDCl3) δ 8.00 (d, J = 8.8 Hz, 2H, 2C/6C), 7.56 (s, 2H, 2A/6A), 7.15 (d, J = 8.8 Hz, 2H, 3C/5C), 7.00–6.95 (m, 2H, 1B/5B), 6.87–6.80 (m, 2H, 2B/6B), 5.74 (dd, J = 7.3, 3.6 Hz, 1H, β), 4.99 (dd, J = 12.1, 3.7 Hz, 1H, γ), 4.68 (dd, J = 11.9, 7.3 Hz, γ), 3.86 (OMeA), 3.67 (OMeB), 2.36 (OAcA), 2.32 (OAcC); 13C NMR (CDCl3) δ 194.6 (α), 169.0 (OAcC), 168.3 (OAcA), 165.8 (9), 154.7 (4C), 152.5 (3A/5A), 150.5 (3B), 146.8 (4B), 133.5 (4A), 132.8 (1A), 131.5 (2C/6C), 127.2 (1C), 123.8 (1B), 121.8 (3C/5C), 121.2 (6B), 118.6 (5B), 112.7 (2B), 106.1 (2A/6A), 81.0 (β), 65.3 (γ), 56.5 (OMeA), 55.7 (OMeB), 21.3 (OAcC), 20.6 (OAcA); HRMS (ESI) m/z calcd. for [M + NH4]+ = 570.1970, found 570.1949.

Compound 5 SS-p Bz (381 mg, 69 %), white foam: 1H NMR (CDCl3) δ 7.90 (d, J = 8.8 Hz, 2H, 2C/6C), 7.57 (s, 2H, 2A/6A), 7.12 (d, J = 8.8 Hz, 2H, 3C/5C), 6.97 (t, J = 8.4 Hz, 1H, 4B), 6.51 (d, J = 8.4 Hz, 2H, 3B/5B), 5.58 (dd, J = 6.6, 4.1 Hz, 1H, β), 4.87 (dd, J = 11.9, 4.0 Hz, 1H, γ), 4.72 (dd, J = 11.9, 6.5 Hz, 1H, γ), 3.84 (s, 6H, OMeA), 3.64 (s, 6H, OMeB), 2.36 (s, 3H, OAcA), 2.31 (s, 3H, OAcC); 13C NMR (CDCl3) δ 195.0 (α), 169.0 (OAcC), 168.3 (OAcA), 165.6 (9), 154.5 (4C), 153.1 (3B/5B), 152.2 (3A/5A), 135.7 (4B), 133.6 (1A), 133.2 (4A), 131.4 (2C/6C), 127.4 (1C), 124.4 (1B), 121.7 (3C/5C), 106.5 (2A/6A), 105.1 (2B/6B), 82.2 (β), 65.1 (γ), 56.5 (OMeA), 55.9 (OMeB), 21.3 (OAcC), 20.6 (OAcA); HRMS (ESI) m/z calcd. for [M + NH4]+ = 600.2076, found 600.2092.

Procedure for acetylation of the γ-OH in the β–O–4-linked dimers 4 (illustrated for preparation of compound 5 SG-OAc )

Alcohol 4 SG (1.0 mmol, 400 mg) was dissolved in equal parts pyridine and Ac2O (2 mL/2 mL) and allowed to stir at room temperature overnight. The mixture was diluted with EtOAc (100 mL) and washed successively with 1 M HCl (3 × 50 mL), sat. NaHCO3 (2 × 50 mL), and sat. NaCl (1 × 50 mL). Residual AcOH was removed by azeotropic distillation with toluene (3 × 25 mL), followed by removal of the toluene by co-distillation with MeOH (3 × 25 mL). Ester 5 SG-OAc (407 mg, 94 %) was recovered as a white foam and used without further purification in the next step: 1H NMR (CDCl3) δ 7.49 (2A/6A), 6.99 (ddd, J = 7.8, 7.8, 1.3 Hz, 1H, 1B), 6.93 (dd, J = 8.0, 1.6 Hz, 1H, 5B), 6.88 (dd, J = 8.2, 1.4 Hz, 1H, 2B), 6.84 (ddd, J = 7.7, 7.7, 1.5 Hz, 1H, 6B), 5.64 (dd, J = 7.5, 3.6 Hz, 1H, β), 4.73 (dd, J = 12.0, 3.6 Hz, 1H, γ), 4.46 (dd, J = 12.0, 7.5 Hz, 1H, γ), 3.87 (s, 6H, OMeA), 3.76 (s, 3H, OMeB), 2.35 (OAcA), 2.07 (OAcγ); 13C NMR (CDCl3) δ 194.5 (α), 171.2 (OAc γ ), 168.3 (OAcA), 152.5 (OMeA), 150.3 (OMeB), 146.8 (4B), 133.4 (4A), 132.7 (1A), 123.7 (1B), 121.2 (6B), 118.1 (5B), 112.7 (2B), 105.9 (2A/6A), 80.6 (β), 64.7 (γ), 56.5 (OMeA), 55.9 (OMeB), 21.0 (OAc γ ), 20.6 (OAcA); HRMS(ESI) m/z calcd. for [M + NH4]+ = 450.1759, found 450.1740.

Compound 5 SS-OAc (254 mg, 92 %), white foam: 1H NMR (CDCl3) δ 7.49 (s, 2H, 2A/6A), 6.99 (t, J = 8.4 Hz, 1H, 1B), 6.54 (d, J = 8.4 Hz, 2H, 2B/6B), 5.46 (J = 6.4, 4.6 Hz, 1H, β), 4.61 (dd, J = 11.7, 4.6 Hz, 1H, γ), 4.53 (dd, J = 11.7, 6.4 Hz, 1H, γ), 3.85 (s, 6H, OMeA), 3.72 (s, 6H, OMeB), 2.35 (OAcA), 1.97 (OAcγ); 13C NMR (CDCl3) δ 194.8 (α), 171.0 (OAcγ), 168.3 (OAcA), 153.1 (OMeB), 152.2 (OMeA), 135.9 (4B), 133.6 (1A), 133.1 (4A), 124.4 (1B), 106.3 (2A/6A), 105.3 (2B/6B), 81.7 (β), 64.4 (γ), 56.4 (OMeA), 56.0 (OMeB), 20.9 (OAcγ), 20.6 (OAcA); HRMS(ESI) m/z calcd. for [M + NH4]+ = 480.1865, found 480.1884.

Typical procedure for reduction of α-ketones 5 (illustrated for preparation of the compound 6 SG-pBz )

Ester 5 SG-p Bz (0.43 mmol, 250 mg) was dissolved in EtOAc (4.4 mL) and MeOH (0.4 mL), and the reaction mixture was cooled to 0 °C. NaBH4 (0.86 mmol, 33 mg) was charged to the flask and allowed to stir continuously for approximately 1 h. After confirming complete consumption of the starting material by TLC, the organics were diluted with EtOAc (25 mL), washed with 1 M HCl (1 × 30 mL) and sat. NaCl (1 × 30 mL), dried over Na2SO4, and concentrated in vacuo. The resulting clear, viscous oil 6 SG-p Bz was used crude in the final deprotection, with an assumed 100 % yield; products 6 SG-p Bz , 6 SS-p Bz , 6 SG-OAc , and 6 SS-OAc were formed and used in the same way.

Typical procedure for deprotection of phenolic acetates (illustrated for preparation of the compound 7 SG-pBz )

Acetates were removed essentially as in the recently described method [53]. Benzyl alcohol 6 SG-p Bz (0.32 mmol, 175 mg) was dissolved in DMF (1.3 mL). Hydrazine acetate (1.1 mmol, 103 mg) was charged to the flask, and the reaction was allowed to stir at room temperature for 1 to 3 h. The solution was diluted with EtOAc (50 mL) and washed with H2O (5 × 15 mL) and NaCl (1 × 10 mL). After drying over Na2SO4, the organics were evaporated under reduced pressure. The resulting crude oil was purified by flash column chromatography (10 g SiO2, gradient elution using either 10 to 70 % EtOAc/hexane or 1 to 10 % MeOH in DCM) to afford 7 SG-p Bz (69 mg, 46 %) as a brittle white foam and a 70:30 mixture of erythro- (anti-) and threo- (syn-) isomers: 1H NMR (CDCl3) δ 7.85 (d, J = 8.7 Hz, 0.7H, 2C,t/6C,t), 7.81 (d, J = 8.7 Hz, 2H, 2C,e/6C,e), 7.18 (dd, J = 7.9, 1.5 Hz, 0.3H, 5B,t), 7.10–7.04 (m, 2.4H, 5B,e, 2B,t, 1B,e), 6.96–6.89 (m, 2.7H, 2B,e/6B,t, 6B,e, 1B,t), 6.83 (d, J = 8.8 Hz, 0.8H, 3C,t/5C,t), 6.81 (d, J = 8.7 Hz, 2H, 3C,e/5C,e), 6.66 (s, 2H, 2A,e/6A,e), 6.63 (s, 0.8H, 2A,t/6A,t), 5.55 (s, 0.3H, ArOHt), 5.48 (s, 1.3H, ArOHe/t), 5.46 (s, 1H, ArOHe), 4.99–4.92 (m, 1.4H, αt, αe), 4.60–4.54 (m, 2H, γe, βe), 4.46 (dd, J = 12.1, 3.5 Hz, 0.3H, γt), 4.42–4.36 (m, 1H, γe), 4.32 (quint, J = 4.1 Hz, 0.4H, βt), 4.28–4.23 (m, 0.7H, γt, αOHt), 3.89 (s, 1H, OMeB,t), 3.87 (s, 3H, OMeB,e), 3.86 (s, 6H, OMeA,e), 3.83 (d, J = 3.8 Hz, 1H, αOHe), 3.80 (s, 2H, OMea,t); 13C NMR (CDCl3) δ 166.45 (7C,e), 166.11 (7C,t), 160.59 (4C,t), 160.44 (4C,e), 151.55 (3B,e), 150.93 (3B,t), 147.75 (4B,t), 147.21 (3A,t/5A,t), 147.14 (3A,e/5A,e), 147.01 (4B,e), 134.78 (4A,t), 134.21 (4A,e), 132.06 (2C,t/6C,t, 2C,e/6C,e), 130.19 (1A,t), 130.14 (1A,e), 124.31 (1B,t), 124.30 (1B,e), 122.07 (1C,e), 121.89 (1C,t), 121.66 (B6,e), 121.61 (B6,t), 120.75 (5B,e), 120.56 (5B,t), 115.42 (3C,t/5C,t), 115.32 (3C,e/5C,e), 112.34 (2B,t), 112.32 (2B,e), 103.87 (2A,t/6A,t), 103.09 (2A,e/6A,e), 85.98 (βt), 84.36 (βe), 75.07 (αt), 72.60 (αe), 63.44 (γt), 62.95 (γe), 56.41 (OMeA,e), 56.33 (OMeA,t), 55.96 (OMeB,e), 55.91 (OMeB,t); HRMS (ESI) m/z calcd. for [M + NH4]+ = 488.1916, found 488.1928.

Compound 7 SS-p Bz (201 mg, 93 %), brittle white foam: 1H NMR (CDCl3) δ 7.86 (d, J = 8.6 Hz, 2H, 2C,t/6C,t), 7.83 (d, J = 8.6 Hz, 1.4H, 2C,e/6C,e), 7.05 (t, J = 8.8 Hz, 0.8H, 1B,e), 7.04 (t, J = 8.8 Hz, 1H, 1B,t), 6.82 (d, J = 8.7 Hz, 2H, 3C,t/5C,t), 6.79 (d, J = 8.8 Hz, 1.5H, 3C,e/5C,e), 6.63–6.58 (m, 7H, 2A,e/6A,e, 2A,t/6A,t, 2B/e/6B,e, 2B,t/6B,t), 5.55 (s, 1.3H, ArOHe/t), 5.15 (d, J = 8.1 Hz, 1H, αt), 5.06 (s, 1H, αOHt), 4.93 (s, 1H, αe), 4.67 (dd, J = 13.0, 3.9 Hz, 1H, γt), 4.61–4.55 (m, 1.3H, βe, γe), 4.52 (d, J = 2.7 Hz, 0.6H, αOHe), 4.42–4.36 (m, 0.6H, γe), 4.16–4.12 (m, 2H, βt, γt), 3.82 (s, 6H, OMeB,t), 3.81 (s, 4.7H, OMeA,e), 3.78 (s, 4.2H, OMeB,e), 3.70 (s, 6H, OMeA,t); 13C NMR (CDCl3) δ 166.59 (7C,e), 166.31 (7C,t), 160.85 (4C,t), 160.60 (4C,e), 153.76 (3B,e/5B,e), 152.96 (3B,t/5B,t), 147.13 (3A,e/5A,e), 147.20 (3A,t/5A,t), 136.63 (4B,t), 134.71 (4B,e), 134.59 (4A,t), 133.98 (4A,e), 132.01 (2C,e/6C,e), 131.99 (2C,t/6C,t), 130.34 (1A,t), 129.78 (1A,e), 124.59 (1B,e), 124.51 (1B,t), 122.24 (1C,e), 121.85 (1C,t), 115.39 (3C,t/5C,t), 115.22 (3C,e/5C,e), 105.33 (2B,e/6B,e), 105.30 (2B,t/6B,t), 103.80 (2A,t/6A,t), 102.83 (2A,e/6A,e), 87.30 (βt), 83.42 (βe), 75.18 (αt), 72.18 (αe), 64.38 (γt), 62.87 (γe), 56.41 (OMeA,e), 56.22 (OMeA,t), 56.19 (OMeB,e), 56.11 (OMeB,t); HRMS(ESI) calcd. for m/z [M + NH4]+ = 518.2021, found 518.2025.

Compound 7 SG-OAc (75 mg, 84 %), white foam: 1H NMR (CDCl3) δ 7.15 (dd, J = 7.8, 1.3 Hz, 0.3H, 5B,t), 7.09–7.03 (m, 2.4H, 1B,e, 2B,t, 5B,e), 6.96–6.91 (m, 2.8H, 1B,t, 2B,e, 6B,e, 6B,t), 6.62 (s, 2H, 2A,e/6A,e), 6.60 (s, 0.7H, 2A,t/6A,t), 5.55 (s, 0.3H, ArOHt), 5.52 (s, 1H, ArOHe), 4.86 (s, 1H, αe), 4.82 (d, J = 8.0 Hz, 0.3H, αt), 4.44–4.36 (m, 2H, βe, γe), 4.31–4.24 (m, 0.5H, αOHt, γt), 4.24–4.18 (m, 0.5H, βt), 4.15–4.09 (m, 1H, γe), 4.05 (dd, J = 11.9, 5.5 Hz, 0.3H, γt), 3.91 (s, 1H, OMeB,t), 3.88 (s, 3H, OMeB,e), 3.87 (s, 6H, OMeA,e), 3.86 (s, 3H, OMeA,t), 3.77 (s, 1H, αOHe), 2.04 (s, 1.5H, γOAct), 2.01 (s, 3H, γOAce); 13C NMR (CDCl3) δ 171.32 (γOAce), 170.72 (γOAct), 151.69 (3B,e), 150.92 (3B,t), 148.14 (4B,t), 147.18 (3A,t/5A,t), 147.11 (3A,e/5A,e), 147.00 (4B,e), 134.77 (4A,t), 134.17 (4A,e), 130.35 (1A,t), 130.10 (1A,e), 124.31 (1B,e), 124.24 (1B,t), 121.59 (6B,e), 121.57 (6B,t), 120.85 (5B,e), 120.55 (5B,t), 112.27 (2B,e), 112.23 (2B,t), 103.82 (2A,t/6A,t), 102.92 (2A,e/6A,e), 86.32 (βt), 84.58 (βe), 74.74 (αt), 72.21 (αe), 63.46 (γt), 62.90 (γe), 56.42 (OMeA,t, OMeA,e), 55.96 (OMeB,e), 55.92 (OMeB,t), 21.19 (γOAce), 20.94 (γOAct); HRMS (ESI) m/z calcd. for [M + NH4]+ = 410.1810, found 410.1817.

Compound 7 SS-OAc (143 mg, 79 %), white foam: 1H NMR (CDCl3) δ 7.06 (t, J = 8.4 Hz, 1H, 1B,e), 7.02 (t, J = 8.4 Hz, 1H, 1B,t), 6.63 (d, J = 8.4 Hz, 2H, 2B,e/6B,e), 6.59 (d, J = 8.4 Hz, 2H, 2B,t/6B,t), 6.57 (s, 2H, 2A,t/6A,t), 6.57 (s, 2H, 2A,e/6A,e), 5.56 (s, 1H, ArOHt), 5.53 (s, 1H, ArOHe), 4.99 (dd, J = 8.5, 1.8 Hz, 1H, αt), 4.82 (t, J = 3.0 Hz, 1H, αe), 4.80 (d, J = 1.9 Hz, 1H, αOHt), 4.54–4.48 (m, 2H, βe, γt), 4.37 (dd, J = 12.0, 8.1 Hz, 1H, γe), 4.33 (d, J = 2.9 Hz, 1H, αOHe), 4.14 (dd, J = 12.0, 3.5 Hz, 1H, γe), 4.00 (dt, J = 8.4, 3.1 Hz, 1H, βt), 3.90 (dd, J = 12.0, 2.9 Hz, 1H, γt), 3.87 (s, 6H, OMeA,e), 3.85 (s, 12H, OMeB,e, OMeB,t), 3.83 (s, 6H, OMeA,t), 2.05 (s, 3H, OAct), 1.99 (s, 3H, OAce); 13C NMR (CDCl3) δ 171.03 (γOAce), 170.78 (γOAct), 153.79 (3B,e/5B,e), 152.86 (3B,t/5B,t), 147.06 (3A,e/5A,e), 147.02 (3A,t/5A,t), 136.70 (4B,t), 134.52 (4B,e), 134.51 (4A,t), 133.87 (4A,e), 130.67 (1A,t), 129.74 (1A,e), 124.55 (1B,e), 124.36 (1B,t), 105.46 (2B,e/6B,e), 105.20 (2B,t/6B,t), 103.72 (2A,t/6A,t), 102.57 (2A,e/6A,e), 86.87 (βt), 82.99 (βe), 74.57 (αt), 71.61 (αe), 64.07 (γt), 62.64 (γe), 56.41 (OMeA,e), 56.31 (OMeA,t), 56.30 (OMeB,e), 56.07 (OMeB,t), 20.99 (γOAce), 20.90 (γOAct); HRMS (ESI) m/z calcd. for [M + NH4]+ = 440.1916, found 440.1914.

Typical procedure for peracetylation (illustrated for preparation of compound 8 SG-pBz )

Compound 7 SG-p Bz (10.0 mg, 20 μmol) was charged into a small amber vial, followed by addition of a solution containing equal parts pyridine and acetic anhydride (500 μL each). The resulting solution was allowed to stir overnight, at which point, all organic solvents were removed in vacuo to yield 8 SG-p Bz as a clear, colorless oil: 1H NMR (CDCl3) δ 7.98–7.94 (m, 2.7H, 2C,e/6C,e, 2C,t/6,C,t), 7.16–7.13 (m, 2.7H, 3C,,e/5C,e, 3C,t/5C,t), 7.02–6.96 (m, 1.7H, 1B,e, 2B,t, 5B,t), 6.89–6.81 (m, 3.8H, 1B,t, 2B,e, 5B,e, 6B,e, 6B,t), 6.72 (s, 2H, 2A,e/6A,e), 6.69 (s, 0.7H, 2A,t/6A,t), 6.18 (d, J = 6.2 Hz, 0.3H, αt), 6.13 (d, J = 5.6 Hz, 1H, αe), 4.84–4.80 (m, 1H, βe), 4.76–4.72 (m, 0.3H, βt), 4.60 (d, J = 5.0 Hz, 2H, γe), 4.50 (dd, J = 11.9, 4.2 Hz, 0.3H, γt), 4.37 (dd, J = 11.9, 6.0 Hz, 0.3H, γt), 3.79 (s, 6H, OMeA,e), 3.76 (s, 2H, OMeA,t), 3.74 (s, 1H, OMeB,t), 3.72 (s, 3H, OMeB,e), 2.33 (s, 3H, OAcA,t, OAcC, t), 2.32 (s, 5H, OAcA,e, OAcC,e), 2.11 (s, 3H, αOAce), 2.09 (s, 1H, αOAct); 13C NMR (CDCl3) δ 169.91 (αOAct), 169.72 (αOAce), 169.00 (OAcA,e/OAcC,e), 168.98 (OAcA,t/OAcC,t), 168.75 (OAcA,e/OAcC,e), 168.70 (OAcA,t/OAcC,t), 165.63 (7C,e), 165.47 (7C,t), 154.59 (4C,t), 154.56 (4C,e), 152.33 (3A,t/5A,t), 152.16 (3A,e/5A,e), 151.25 (3B,e), 150.99 (3B,t), 148.05 (4B,t), 147.30 (4B,e), 135.08 (1A,e), 134.89 (1A,t), 131.45 (2C,e/6C,e), 131.40 (2C,t/6C,t), 128.87 (4A,t), 128.76 (4A,e), 127.38 (1C,e), 127.31 (1C,t), 123.85 (1B,e), 123.58 (1B,t), 121.83 (3C,t/5C,t), 121.79 (3C,e/5C,t), 121.13 (6B,e, 6B,t), 119.88 (5B,e), 119.13 (5B,t), 112.62 (2B,e), 112.51 (2B,t), 104.48 (2A,e/6A,e), 104.16 (2A,t/6A,t), 80.80 (βt), 80.56 (βe), 75.08 (αt), 74.47 (αe), 64.07 (γt), 63.70 (γe), 56.32 (OMeA,e, OMeA,t), 55.80 (OMeB,e, OMeB,t), 21.32 (OAcA,e/OAcC,e), 21.28 (αOAct), 21.24 (αOAce), 20.64 (OAcA,t/OAcC/t).

Compound 8 SS-p Bz (colorless oil): 1H NMR (acetone-d 6 ) δ 7.98 (d, J = 8.8 Hz, 2H, 2C,t/6C,t), 7.85 (d, J = 8.8 Hz, 1.8H, 2C,e/6C,e), 7.27 (d, J = 8.8 Hz, 2H, 3C,t/5C,t), 7.21 (d, J = 8.8 Hz, 1.8H, 3C,e/5C,e), 7.00 (t, J = 8.4 Hz, 1H, 1B,t), 6.99 (t, J = 8.4 Hz, 1H, 1B,e), 6.86 (s, 2H, 2A,t/6A,t), 6.84 (s, 2H, 2A,e/6A,e), 6.65 (d, J = 8.4 Hz, 2H, 2B,t/6B,t), 6.63 (d, J = 8.4 Hz, 2H, 2B,e/6B,e), 6.24 (d, J = 6.5 Hz, 1H, αt), 6.20 (d, J = 4.4 Hz, 0.9H, αe), 4.89–4.85 (m, 0.8H, βe), 4.80–4.75 (m, 1H, βt), 4.64 (dd, J = 11.9, 6.3 Hz, 0.8H, γe), 4.52–4.47 (m, 2H, γe, γt), 4.23 (dd, J = 11.9, 5.0 Hz, 1H, γt), 3.80 (s, 6H, OMeA,e), 3.74 (s, 7H, OMeB,t), 3.73 (s, 7H, OMeB,e), 3.72 (s, 6H, OMeA,t), 2.30 (s, 3H, OAc), 2.29 (s, 3H, OAc), 2.22 (s, 3H, OAc), 2.21 (s, 3H, OAc), 2.16 (s, 3H, αOAce), 1.99 (s, 3H, αOAct); 13C NMR (acetone-d 6 ) δ 170.04 (αOAce), 169.95 (αOAct), 169.29 (OAc), 169.28 (OAc), 168.53 (OAc), 168.47 (OAc), 165.75 (7C,e), 165.67 (7C,t), 155.69 (4C,t), 155.59 (4C,e), 154.29 (3B,e/5B,e), 154.21 (3B,t/5B,t), 153.10 (3A,e/5A,e), 153.07 (3A,t/5A,t), 137.73 (4B,e), 137.60 (4B,t), 136.87 (1A,e), 136.66 (1A,t), 131.79 (1C,t), 131.75 (1C,e), 128.37 (4A,e), 128.34 (4A,t), 124.75 (1B,e), 124.56 (1B,t), 122.91 (2C,t/6C,t), 122.74 (2C,e/6C,e), 106.18 (2B,t/6B,t), 106.03 (2B,e/6B,e), 104.84 (2A,t/6A,t), 104.45 (2A,e/6A,e), 81.80 (βe), 81.69 (βt), 76.69 (αt), 75.60 (αe), 65.19 (γt), 64.22 (γe), 56.45 (OMeA,e), 56.41 (OMeB,t), 56.29 (OMeB,e), 56.21 (OMeA,t), 21.00 (OAc), 20.99 (OAc), 20.27 (OAc), 20.25 (OAc).

Compound 8 SG-OAc (colorless oil): 1H NMR (CDCl3) δ 7.02–6.94 (m, 1.7H, 1B,e, 2B,t, 5B,t), 6.91–6.80 (m, 3.8H, 1B,t, 2B,e, 5B,e, 6B,t, 6B,e), 6.68 (s, 2H, 2A,e/6A,e), 6.67 (s, 0.8H, 2A,t/6A,t), 6.09 (d, J = 6.3 Hz, 0.3H, αt), 6.05 (d, J = 5.7 Hz, 1H, αe), 4.69–4.64 (m, 1H, βe), 4.64–4.59 (m, 0.3H, βt), 4.46 (dd, J = 12.0, 5.7 Hz, 1H, γe), 4.32 (dd, J = 11.8, 4.5 Hz, 0.3H, γt), 4.29 (dd, J = 11.9, 3.8 Hz, 1H, γe), 4.09 (dd, J = 11.8, 5.7 Hz, 0.3H, γt), 3.81 (s, 1H, OMeB,t), 3.81 (s, 2H, OMeA,t), 3.79 (s, 6H, OMeA,e), 3.77 (s, 3H, OMeB,e), 2.32 (s, 4H, OAcA,e, OAcA,t), 2.10 (s, 3H, αOAce), 2.07 (s, 1.1H, αOAct), 2.04 (s, 3H, γOAce), 2.01 (s, 1.1H, γOAct); 13C NMR (CDCl3) δ 171.04 (γOAce), 170.78 (γOAct), 169.90 (αOAct), 169.69 (αOAce), 168.75 (OAcA,e), 168.72 (OAcA,t), 152.28 (3A,t/5A,t), 152.10 (3A,e/5A,e), 151.16 (3B,e), 150.88 (3B,t), 148.07 (4B,t), 147.28 (4B,e), 135.02 (1A,e), 134.95 (1A,t), 128.85 (4A,t), 128.72 (4A,e), 123.76 (1B,e), 123.46 (1B,t), 121.10 (6B,e, 6B,t), 119.52 (5B,e), 118.76 (5B,t), 112.59 (2B,e), 112.50 (2B,t), 104.47 (2A,e/6A,e), 104.16 (2A,t/6A,t), 80.46 (βt), 80.36 (βe), 74.89 (αt), 74.08 (αe), 63.23 (γt), 62.84 (γe), 56.35 (OMeA,t), 56.31 (OMeA,e), 55.89 (OMeB,t), 55.87 (OMeB,e), 21.24 (αOAct), 21.21 (αOAce), 20.97 (γOAce), 20.91 (γOAct), 20.64 (OAcA,e, OAcA,t).

Compound 8 SS-OAc (colorless oil): 1H NMR (acetone-d 6 ) δ 7.00 (t, J = 8.4 Hz, 2H, 1B,e, 1B,t), 6.81 (s, 2H, 2A,t/6A,t), 6.78 (s, 2H, 2A,e/6A,e), 6.67 (d, J = 8.3 Hz, 2H, 2B,t/6B,t), 6.66 (d, J = 8.3 Hz, 2H, 2B,e/6B,e), 6.10 (d, J = 7.0 Hz, 1H, αt), 6.07 (d, J = 4.2 Hz, 1H, αe), 4.72 (dt, J = 6.1, 4.2 Hz, 1H, βe), 4.59 (dt, J = 7.0, 4.2 Hz, 1H, βt), 4.42 (dd, J = 11.8, 6.1 Hz, 1H, γe), 4.28 (dd, J = 11.9, 4.1 Hz, 1H, γt), 4.18 (dd, J = 11.8, 4.1 Hz, 1H, γe), 3.91 (dd, J = 11.9, 4.4 Hz, 1H, γt), 3.81 (s, 5H, OMeB,e), 3.81 (s, 5H, OMeB,t), 3.80 (s, 6H, OMeA,e/t), 3.79 (s, 6H, OMeA,e/t), 2.21 (s, 6H, OAcA,t, OAcA,e), 2.14 (s, 3H, αOAce), 1.98 (s, 3H, γOAct), 1.94 (s, 3H, αOAct), 1.86 (s, 3H, γOAce); 13C NMR (acetone-d 6 ) δ 170.70 (γOAce, γOAct), 169.98 (αOAce), 169.92 (αOAct), 168.53 (OAcA,e), 168.50 (OAcA,t), 154.30 (3B,e/5B,e), 154.11 (3B,t/5B,t), 153.04 (3A,t/5A,t), 153.01 (3A,e/5A,e), 137.80 (4B,t), 136.81 (1A,t), 136.72 (1A,e), 136.53 (4B,e), 129.44 (4A,t), 129.20 (4A,e), 124.74 (1B,e), 124.45 (1B,t), 106.16 (2B,t/6B,t), 106.08 (2B,e/6B,e), 104.78 (2A,t/6A,t), 104.38 (2A,e/6A,e), 81.64 (βt), 81.46 (βe), 76.75 (αt), 75.36 (αe), 64.24 (γt), 63.31 (γe), 56.49 (OMe), 56.44 (OMe), 56.31 (OMe), 56.29 (OMe), 20.98 (αOAct), 20.94 (αOAce), 20.69 (γOAct), 20.62 (γOAce), 20.26 (OAcA,e, OAcA,t).

Note that compounds 8 SG-OAc and 8 SS-OAc , if they are the desired final products, are obviously directly available simply by reducing and acetylating compounds 4 as previously reported [64].

Syntheses of γ-p-Hydroxybenzoylated β–β-Unit Model Compounds

Peroxidase-H2O2 oxidative coupling reactions of sinapyl alcohol or/and acylated sinapyl alcohols were performed according to previously described procedures [16, 32, 14].

Syntheses of model compounds 911

Model compounds for 9a, 10a, and 11a (Fig. 5) in which the 4–O-etherification is simply via a 4–O-methyl group were synthesized from sinapyl 4-acetoxybenzoate and sinapyl alcohol by peroxidase-H2O2 oxidation, followed by separation of the individual products, methylation (MeI-K2CO3, acetone), and acetylation (Ac2O-Py). Analogous model compounds for 9b, 10b, and 11b were obtained in the similar way starting from sinapyl acetate and sinapyl alcohol as previously described [14].

Results and discussion

The following sections address the various issues posed in the introduction regarding lignin p-hydroxybenzoylation in palms.

The Nature of p-Hydroxybenzoates and Acylation Regiochemistry

Determining whether p-hydroxybenzoates are in fact free-phenolic entities acylating the γ-positions of lignin side-chains is most readily deduced from solution-state NMR of isolated lignins, analogously to similar determinations for p-coumarates [25]. Björkman milled wood lignins (MWLs) were prepared from the oil palm (Elaeis guineensis) EFB fibers, fronds, and trunk fractions; the EFBs had already been shown to be particularly rich in p-hydroxybenzoates [34]. Simple saponification of EFB whole-cell-wall (WCW) samples indicated that the EFB-1 and EFB-2 samples contained 1.09 and 1.05 wt% p-hydroxybenzoic acid or, based on AcBr lignin contents of 22.7 and 18.3 %, ~4.8 and 5.8 % of the lignins (Table 1). NMR of the underivatized (i.e., un-acetylated) isolated lignin was made difficult due to the high levels of iron in this sample, as previously reported [65]; unfortunately, we were not able to sufficiently effect its removal by complexation with EDTA. Nevertheless, 13C NMR (Fig. 1) readily revealed the presence of p-hydroxybenzoates, assumed at this point to be on lignins. It is worth noting that these signals have sometimes been misinterpreted as arising from p-hydroxyphenyl lignin units or p-coumarates [43]. Acetylation of the isolated lignin facilitated removal of the metal contaminants. The observed changes in chemical shifts of the aromatic carbons following acetylation indicated that p-hydroxybenzoates were present overwhelmingly in their free-phenolic form on the lignin, Fig. 1c–d. The sago palm lignin, used in a previous study in one of our labs [35], had similar features, Fig. 1a; in fact the p-hydroxybenzoate level was highest on this particular lignin fraction. Also striking, and not observed in previous studies, was the high level of natural acetylation of the EFB lignin, Fig. 1b–c. The presence of natural acetates on lignins is frequently missed for a variety of reasons, including that lignins are routinely peracetylated to improve NMR properties, extraction conditions may hydrolyze the acetates [35], and lignin purification steps involving acetic acid can introduce artifactual acetates. Circumventing some of these issues, solid-state NMR spectra of various palm fractions indicate the presence of acetates, Fig. 1b. Furthermore, acetates are clearly observable in spectra from other published reports, where they have typically been attributed to polysaccharides [66, 42]; historically, the contribution from lignin may have therefore been overlooked.

Fig. 1
figure 1

1D 13C NMR spectra of various palm lignins. a Solution-state spectrum from a sago palm MWL isolated as described previously [35]. b Solid-state (200 MHz) spectrum and c solution-state spectrum of MWL isolated from the oil palm empty fruit bunches (EFB) previously characterized [34, 43]. d Solution-state spectrum of acetylated EFB isolated lignin (Ac-MWL). The solution-state spectra were run at 400 MHz (100 MHz 13C); the acetylated MWL is in acetone-d6, underivatized MWLs in 9:1 acetone-d6:D2O. Red-colored peaks labeled pBz are for carbons in the p-hydroxybenzoyl moiety. The red lines between plot d and its axis denote the chemical shifts from model compound 8 SS- p Bz . S syringyl unit. The side-chain regions are quite disperse but generally labeled α, β, and γ. γ-OpBz signifies p-hydroxybenzoylated γ-alcohols, γ-OAc is (naturally) acetylated γ-alcohols. Cyan-colored peaks in bc indicate the natural γ-acetylation of these lignins; no acetates appear in the sago palm lignin in a presumably due to the isolation method [35]. Spectra of lignins isolated from other parts of the oil palm plant (trunk and fronds, not shown) are almost indistinguishable from those shown in cd

The gradient-enhanced 2D HSQC NMR experiment provides a highly detailed view of the molecular structure of plant cell walls with minimal processing of the sample [67]. The fast relaxation caused by the metals lead to extremely poor spectra, Fig. 2a–b, but the chemical shift data are consistent with acylation of the lignin. The p-hydroxybenzoate (pBz) groups are revealed in the aromatic regions of Fig. 2b, along with the G and predominant S aromatics. The side-chain region, Fig. 2a, offers good evidence that lignin γ-hydroxyls, and not the alternative α-hydroxyls, are acylated; acylated β-ether α-H/C correlations, for example, would be seen at ~6.2/76 ppm. After peracetylation of the lignins, followed by successive extraction into chloroform and washing with EDTA in order to remove metal contaminants, higher-resolution spectra, Fig. 2c–d, are obtained. At this point, the nature of the acylation (i.e., γ- vs. α-regiochemistry) is masked, and distinguishing acylation patterns indicative of p-hydroxybenzoates from those of acetates is less direct; this issue is somewhat ameliorated by the considerably higher resolution of the spectra. The shifts in the p-hydroxybenzoate HSQC correlations (especially pBz3/5) following acetylation (Fig. 2d) concur with earlier observations regarding the 1D 13C spectra, namely, that p-hydroxybenzoates on lignins exist in their free-phenolic form and are not etherified; were they etherified, the pBz group would not become acetylated and their correlations would be essentially unchanged from those in the unacetylated lignins (Fig. 2b).

Fig. 2
figure 2

2D HSQC spectra, and 2D HSQC (F2-F3) planes from 3D TOCSY-HSQC spectra, of oil palm EFB lignins. ab The unacetylated MWL (360 MHz); a side-chain region, b aromatic region. The unacetylated material contained magnetic material, presumably iron compounds, that could not be effectively washed out; the spectra are therefore extremely broad, but a still clearly shows the degree to which γ-hydroxyls are acylated, either by acetate or p-hydroxybenzoate. cd The EDTA-washed acetylated EFB MWL (500 MHz cryoprobe instrument); c side-chain region, d aromatic region. These much sharper spectra resolve the α-protons (and the corresponding correlations) of γ-p-hydroxybenzoylated B from γ-acetylated A units. eg HSQC slices (F2–F3 dimension) from a 3D TOCSY-HSQC experiment on acetylated EFB lignin over a small proton chemical shift range in the F1 dimension: e 6.29–6.19 ppm (α-protons in γ-p-acetoxybenzoylated β-ether units B), f 6.13–6.07 ppm (α-protons in normal β-ether units A, both syn- and anti-isomers), and g 6.04–5.98 (α-protons in normal β-ether units A, mainly anti-isomers). The colored symbols on the red contours in e are shifts from model compounds 8; clearly, only the SS-pBz moieties are present at significant levels in the lignin (with chemical shifts for the α-H/C correlations matching well with those from compounds syn- and anti-SS-pBz but not with those from the SG-pBz compounds; the β and γ data are not shown for the latter as they are less diagnostic). Units A are normal (acetylated) β-ether units, B are γ-p-acetoxybenzoylated β-ether units. G guaiacyl, S syringyl, pBz p-hydroxybenzoyl

p-Hydroxybenzoates are therefore confirmed to be free-phenolic pendant entities acylating the γ-hydroxyls of lignin side-chains. Additional proof is established below.

Which Lignin Units are p-Hydroxybenzoylated?

Next, we needed to establish which units in lignins were acylated by p-hydroxybenzoates. Consistent with observations on p-coumarates and acetates, were p-hydroxybenzoates largely on S units, and were they on a range of lignin unit types? In this case, NMR of the acetylated lignin derivatives could answer both of these questions as, fortuitously, the α-protons in acetylated γ-p-hydroxybenzoylated β-ether units B were rather well resolved from the normal acetylated β-ether units A (Fig. 2c).

The 1D proton projection on the HSQC spectrum in Fig. 2c indicates how well the α-protons in the major β-ether units are resolved by proton NMR. The γ-p-hydroxybenzoylated β-ether units B have α-proton/α-carbon correlations at 6.20/76.8 ppm (threo) and 6.19/75.7 ppm (erythro), whereas those from the normal acetylated units A are at 6.06/76.8 ppm (threo) and 6.02/75.4 ppm (erythro). Initially, the α-1H–13C correlation contours were not sufficiently resolved, but running higher-resolution spectra restricted to acquiring just the side-chain region resulted in sufficient resolution to readily distinguish the various correlations (Fig. 2c).

It is apparent from the aromatic region of HSQC spectra that the EFB lignin was S-rich, Fig. 2b, d. An S:G ratio of 83:17 can be derived from the data in Fig. 2d by volume integration of the S2/6 and G2 aromatic proton/carbon correlations. In support, the side-chain region (Fig. 2c) revealed very low levels of phenylcoumaran (β–5) and dibenzodioxocin (5–5/β–O–4) units that must involve G units—their contours appear below the levels plotted.

Distinguishing whether p-hydroxybenzoates are attached to S or G units requires long-range 1H–13C correlation experiments [13]. The heteronuclear multiple bond correlation (HMBC) experiment can correlate a proton with carbons up to three bonds away. The simplest probe is via the resolved α-protons in β-ether units (in the acetylated lignins), which should correlate with carbons γ (3-bond), β (2-bond), 1 (2-bond), and 2 and 6 (3-bond). The feature of this experiment that makes distinction of S- from G-units so elegant is that the identical 2/6 carbons in symmetrical S units resonate at about 105 ppm, whereas carbons 2 and 6 are distinctly different in the unsymmetrical G-units, resonating at 114 and 120 ppm. As shown in Fig. 3a, it is readily evident that the β-ethers in normal units A are both S (dark blue) and G (cyan), whereas it is only possible to detect γ-p-hydroxybenzoylated S units B (red). The prevalence of p-hydroxybenzoates on S over G units may be either a reflection of the control in the transferase-catalyzed acylation reaction or simply the timing of the conjugation and the levels of the two monolignols present at that time, i.e., that p-hydroxybenzoylation of sinapyl alcohol is either significantly favored over coniferyl alcohol, or that sinapyl alcohol is more prevalent at the time of acylation.

Fig. 3
figure 3

2D HMBC spectra, and 2D HSQC-TOCSY (F1–F2) planes from 3D TOCSY-HSQC spectra, of oil palm EFB acetylated lignin (Ac-MWL). a HMBC section showing long-range 1H–13C correlations to α-protons in β-ether units, elegantly revealing their S and/or G nature. Normal β-ether units A are clearly both S and G in nature; γ-p-hydroxybenzoylated β-ether units B are essentially S-only. b HMBC section showing long-range correlations to the p-acetoxybenzoyl and acetate carbonyl carbons proving acylation of lignin γ-hydroxyls. ce HSQC-TOCSY slices (F1–F2 dimension) from the 3D TOCSY-HSQC experiment on acetylated EFB lignin over a small proton chemical shift range in the F3 dimension: c 6.29–6.19 ppm (α-protons in γ-p-acetoxybenzoylated β-ether units B showing both syn- and anti-isomers of these units, d 6.13–6.07 ppm (α-protons in normal β-ether units A, mainly syn-isomers), e 6.04–5.98 (α-protons in normal β-ether units A, mainly anti-isomers). The colored symbols on the red and blue contours in ce are shifts from model compounds 8; again, it is clear that data from SS-pBz moieties match the model data well, whereas those of the SG analogs are not evidenced; for the acetates, there is evidence for the SG-Ac components as well as, obviously in this high-S lignin, the dominant SS-Ac analogs

The 3-bond correlations in the HMBC data between the γ-protons and the carbonyl (Fig. 3b) also provide compelling proof that the p-hydroxybenzoates acylate lignin γ-positions. The HMBC experiment can also be used to identify the variety of units that are acylated. This is achieved here by examining the γ-protons correlating with the carbonyl carbon of the p-hydroxybenzoate moiety (Fig. 3b). However, determining the range of products is not as easy as it was for p-coumarates, as there are insufficient model compounds, and the dispersion between the unit types is limited. For example, it is currently difficult to prove that p-hydroxybenzoates acylate the phenylcoumaran (β–5) units or hydroxycinnamyl alcohol endgroups; each of those unit types is present only at low levels in this S-rich lignin.

The dispersion that is afforded to the α-protons suggests that it should be possible for a 3D TOCSY-HSQC experiment to delineate whether p-hydroxybenzoates acylate both syn- (threo-) and anti- (erythro-) isomers of the β-ether units. This is because one can take 2D HSQC planes at given proton frequencies and exploit the α-proton differences between the γ-p-hydroxybenzoylated B and normal A units noted above. Similarly, one can take 2D HSQC-TOCSY planes through the α-proton region. Such 2D HSQC slices are shown in Fig. 2e–g, and HSQC-TOCSY slices are shown in Fig. 3c–e, as described in the captions. It is revealed that there are syn- and anti-isomers of the normal β-ether units (Figs. 2f–g and 3d–e), with anti-isomers predominating because of the S-rich nature of the lignin, as already well established [6872]. It is equally clear that p-hydroxybenzoates acylate both of the β-ether isomers (Figs. 2c and 3c). A comparison of the α-H/C correlation peak data for the lignin versus the overlaid data from the various model compounds (Fig. 2e) indicates that, in addition to the pBz’s being almost entirely on S units as noted above, the data for the two SS-pBz isomers match very well, but those for the β–O–4-G analogs SG-pBz do not. Similarly, the data match well for the same SS compounds and not (especially for the β-H/C correlations) for the SG analogs (Fig. 3c). As in the case made early on for p-coumarates acylating grass lignins [25], the presence of p-hydroxybenzoates indiscriminately on both isomers of β-ether units suggests that they arise by lignification with preformed monolignol p-hydroxybenzoate conjugates.

Determination of Regiochemistry and Quantification by DFRC

Proving γ-regiochemistry (although not its exclusivity) and the S versus G nature of the units involved with p-hydroxybenzoates is elegantly made by the DFRC (derivatization followed by reductive cleavage) assay [61, 62], Fig. 4a. This is possible as DFRC advantageously cleaves lignin β-ethers but leaves esters fully intact [62, 51, 18, 23, 27]. Unfortunately, the acetylated coniferyl and sinapyl p-hydroxybenzoate products anticipated from this method do not survive GC analysis conditions, so the products were analyzed by HPLC-ESI-MS, Fig. 4b. Quantification of the DFRC-released diacetates of coniferyl alcohol (CA), sinapyl alcohol (SA), coniferyl p-hydroxybenzoate (CA-pBz), and sinapyl p-hydroxybenzoate (SA-pBz) accounted for 0.9, 2.0, 0.1, and 1.3 wt% of the EFB-1 sample (Table 1). The monolignol component, uncorrected for recovery and losses, accounted for some 16.7 % of the lignin. The DFRC-released total S to G monomer ratio was 73:27, whereas the ratio of SA-pBz to CA-pBz was 96:4, confirming the NMR observations that the p-hydroxybenzoates are primarily located on S units. Similar data were obtained for the EFB-2 sample (Table 1).

Fig. 4
figure 4

DFRC proof of γ-p-hydroxybenzoylation. a A schematic representation of the steps in the DFRC analysis highlighting the bonds that are cleaved and the new functional groups that are formed. b The HPLC chromatogram (λ = 264 nm) of the DFRC product of oil palm EFB overlaid (colored traces) with extracted-ESI-MS-ion chromatograms for the base-peak masses corresponding to the various compounds of interest. The color-band-highlighted peaks correspond to DFRC products coniferyl alcohol (CA [M + CH3CN + H]+, m/z = 306, blue), sinapyl alcohol (SA [M + CH3CN + H]+, m/z = 336, purple), coniferyl p-hydroxybenzoate (CA-pBz, [M + Et3N]+, m/z = 486, orange), sinapyl p-hydroxybenzoate (SA-pBz, [M + Et3N]+, m/z = 516, red), and internal recovery standard diethyl 5,5′-diferulate (DEDF [M + H]+, m/z = 527, gray) all as their acetylated derivatives (as shown); the same compound names are used for the unacetylated compounds in Table 2 and elsewhere. The retention time, UV–vis spectrum, and mass spectrum of each compound were authenticated using external synthetic standards

Proof that Lignin p-Hydroxybenzoates Arise via Lignification with Preformed Monolignol Conjugates

All of the aforementioned data strongly suggest, but strictly do not prove, that the p-hydroxybenzoate groups inherent on these palm lignins arise from lignification incorporating preformed monolignol p-hydroxybenzoate conjugates. As we have described previously for acetates (e.g., on kenaf lignins), there is a way to provide compelling proof [62, 14]. Given that post-coupling rearomatization of β–β-coupled quinone methide lignification intermediates is the one reaction that is perturbed by the presence of an acylated γ-OH, observing novel acylated β–β-coupling products in the lignin provides convincing evidence that acylated monolignols are involved in lignification [14].

As seen in Fig. 5, evidence for all three types of products 9a11a indicating the involvement of sinapyl p-hydroxybenzoate monomers in lignification is provided by observing sets of signature H/C correlations in HSQC spectra, correlations that well match the data from synthesized model compounds for all of the structures 9a11a. These units include both those arising from the cross-coupling of sinapyl p-hydroxybenzoate with sinapyl alcohol (9a, 10a) as well as the homo-dimerization of sinapyl p-hydroxybenzoate (11a). These spectra also provide evidence (with correlations for structures 9b11b) that sinapyl acetate is a monomer conjugate in this palm EFB lignin, as was previously observed in kenaf [14].

Fig. 5
figure 5

Proof that preformed monolignol p-hydroxybenzoates (and acetates) are responsible for the lignin p-hydroxybenzoylation (and acetylation). Diagnostic HSQC correlation contour peaks matching those from model compounds are evidenced in the a α-, b β-, and c γ-regions of spectra for all three types of novel β–β-coupling products (9a, 10a, 11a) that would arise only from radical coupling reactions involving sinapyl p-hydroxybenzoate [and for analogous products (9b, 10b, 11b) from sinapyl acetate]. See text for explanation. Model compounds were of the 4–O-methylated structures shown; assignments of lignin correlation peaks (colored the same as the structures below) were made from the model data. Products in the lignin may be 4–O-etherified during lignification or may be 4–O-Ac here if the moiety in the lignin remained free-phenolic. Labeling is necessarily complex; 9a B for example is the α-, β-, or γ-H/C correlation from the diagnostic B unit of compound 9a; similarly, 9a B2 in Fig. 5c (γ-correlations) is simply the second (lowest field, highest ppm) γ-H/C correlation from the B unit of compound 9a

Evidence from metabolite profiling of poplar wood for novel β–β-coupled heterodimers between sinapyl alcohol and sinapyl p-hydroxybenzoate [29, 32] suggests that it is safe to contend that all lignin p-hydroxybenzoates, across the plant taxa that have them, derive from lignification using monolignol p-hydroxybenzoate conjugates. The evidence is therefore sufficiently compelling that currently unknown transferases are responsible for producing monolignol p-hydroxybenzoate conjugates that are used as ‘monomers’ for lignification in palms (as well as in willows, poplars, and aspen). The levels of p-hydroxybenzoates on lignins should be manipulable by normal methods of selection and breeding or by misregulating such genes, once found. If p-hydroxybenzoic acid or its derived products are sufficiently valuable as commodities and coproducts generated from a biomass conversion process, a long-term strategy might be to upregulate the p-hydroxybenzoate levels on lignin in target plants. As determined here, only some 23 % (Table 1) of lignin monomers are p-hydroxybenzoylated in palm EFBs, and the level is likely to be lower in poplar and willow, so there appears to be significant potential for its increase. Other factors being equal, for example, if all of the monolignols destined for lignification were p-hydroxybenzoylated, the amount of p-hydroxybenzoic acid available would be approximately 40 wt% of the lignin or roughly 8 wt% of the EFB fiber.

Conclusions

Our data provide compelling evidence that, as for acetates on kenaf (and many other plants) lignins and p-coumarates on grass lignins, p-hydroxybenzoates acylate the γ-hydroxyl of lignin side-chains and are found mainly on S units. Furthermore, like their p-coumarate analogs, the p-hydroxybenzoate moieties are not subjected to significant levels of radical coupling, or other etherification, during lignification and therefore remain overwhelmingly as free-phenolic pendant units on lignin unit side-chains and can therefore be easily cleaved off to deliver a single compound in quite pure form. As has been demonstrated for p-coumarates, and as there is no a priori reason that p-hydroxybenzoates can not undergo radical coupling, the logical reason is that such units are preferentially subject to radical transfer reactions to the G and S monomer and polymer phenolics present in the radical-limited lignification process [12, 73]. The data presented here also provide evidence that, analogously to the p-coumarates and acetates found acylating lignin units, p-hydroxybenzoate units arise from lignification using the preformed monolignol p-hydroxybenzoate conjugates and, at least in these palm lignins, are almost entirely derived from sinapyl p-hydroxybenzoate. It is now logical to contend that any of the acylation products seen in any of the native lignins is via acylation at the monomer level and that post-polymerization (or post-coupling) acylation reactions are unlikely. We therefore expect that p-hydroxybenzoyl-CoA:monolignol transferases are involved in lignification in the various willows (Salix spp.), poplars and aspen (Populus spp.), and palms (family Arecaceae) that have p-hydroxybenzoylated lignins, and that identification of the genes involved will allow p-hydroxybenzoate levels on lignin to be manipulated as recently demonstrated for p-coumarates [23]. To date, however, the in planta pathway to p-hydroxybenzoate (or its CoA derivative) remains ambiguous. If p-hydroxybenzoic acid becomes a viable coproduct from cellulosic bioenergy lignin streams, it could conceivably be upregulated or even introduced into other biomass plants. Right now, the current palm oil empty fruit bunch ‘wastes’ should be able to generate a sizeable stream of relatively clean p-hydroxybenzoic acid; when available at the multiton level, industrial applications are expected to increase beyond those already in use today.