Background

Lignin, a phenolic and hydrophobic polymer deposited in plant cell walls, confers rigidity to cell walls and plant bodies, facilitates water transport in tissues and organs, and protects plants against biotic and abiotic stresses [1,2,3]. Lignin is composed of three main monolignols: p-coumaryl, coniferyl, and sinapyl alcohols. After polymerization, these monolignols give rise to the three main building blocks of lignin: p-hydroxyphenyl (H), guaiacyl (G), and syringyl (S) units. Monolignols are biosynthesized from phenylalanine and/or tyrosine in the cytosol via general phenylpropanoid and monolignol pathways [1, 4]. After transportation to the cell wall, phenoxy radicals are generated from the monolignols by phenol oxidases, such as laccase and peroxidase, and, in contrast to other cell-wall polymers, polymerize spontaneously without the need for catalytic support (Fig. 1) [5]. Combinatorial coupling of radicals gives structural complexity to lignin, thereby supporting plant growth and development [1].

Fig. 1
figure 1

Schematic diagram of the general lignin biosynthesis pathway. The predominant route for the biosynthesis of three main monolignols is shown by red arrows. Dashed arrows indicate multiple metabolic steps. PAL, phenylalanine ammonia lyase; C4H, cinnamate 4-hydroxylase; 4CL, p-coumarate: CoA ligase; HCT, hydroxycinnamoyl-CoA: shikimate hydroxycinnamoyl transferase; C3H, p-coumarate 3-hydroxylase; CSE, caffeoyl shikimate esterase; CCoAOMT, caffeoyl-CoA O-methyltransferase; CCR, cinnamoyl-CoA reductase; F5H, ferulate 5-hydroxylase; COMT, caffeic acid O-methyltransferase; CAD, cinnamyl alcohol dehydrogenase; HCALDH, hydroxycinnamaldehyde dehydrogenase. The substrates expected to be converted by couA, calB, or F6H1 are shown in orange

However, in the utilization of secondary cell walls, lignin is highly recalcitrant during chemical and biochemical decomposition of plant biomasses for the production of biofuel and bio-based chemicals [6, 7]. The persistence of lignin even after chemical and physicochemical pretreatments hinders the efficient biorefining of plant biomass. Molecular characteristics of lignin in biomass, such as content, composition, frequency of interunit linkages, and molecular weight, influence this recalcitrance.

As an alternative to disrupting cell wall structure through pretreatments during the biorefining process to reduce lignin recalcitrance, the manipulation of plants through genetic engineering has been attempted. In early trials utilizing this approach, genes encoding monolignol biosynthetic enzymes were upregulated or downregulated to manipulate lignin content and composition [4]. One successful example was the overexpression of a gene encoding ferulate 5-hydroxylase (F5H) in transgenic plants. A gene expression cassette in which F5H transcription was driven by the C4H promoter was introduced into an Arabidopsis thaliana fah12 mutant, wild-type tobacco, and poplar to manipulate lignin composition [8,9,10]. F5H expression resulted in lignin with a high number of S units in the resultant plants without any decrease in plant growth. After hot water pretreatment, a much higher glucose yield was obtained from cell wall residues with S-enriched lignin of transgenic Arabidopsis compared with wild-type plants [11]. Furthermore, significant increases in pulping yield were observed when wood chips from transgenic poplar overexpressing F5H were compared with those from wild-type plants [12].

In addition to manipulating the expression of endogenous genes, expression of exogenous genes is effective for suppression and/or rerouting of monolignol biosynthesis. Recently, Wilkerson et al. expressed a gene encoding feruloyl-CoA monolignol transferase, which was isolated from Chinese angelica, in poplar and successfully increased incorporation of acylated monolignols into lignin [13]. After pretreatment with mild alkali or ionic liquid, significant improvements in saccharification rate and chemical pulping yield are evident in plants harboring the gene [14, 15]. Simultaneous expression of diketide-CoA synthase and curcumin synthase 2 originating from turmeric (Curcuma longa) also rerouted the monolignol biosynthetic pathway in transgenic Arabidopsis plants for heterologous production of curcumin [16]. The resultant curcumin could be incorporated into lignin and improved saccharification efficiency of the transgenic cell wall.

In addition to plant-derived genes, heterologously expressed genes of microorganisms are available for the manipulation of lignin structure. Eudes et al. generated transgenic Arabidopsis plants harboring a gene encoding hydroxycinnamoyl-CoA hydratase/lyase (HCHL) from Pseudomonas fluorescens [17]. HCHL, whose catalytic properties resemble those of hydroxycinnamoyl-CoA hydratase/lyase (CouA; described later), catalyzes the conversion of hydroxycinnamoyl-CoAs to their corresponding hydroxybenzaldehydes (HBAlds), such as vanillin, which are also known to be incorporated into natural lignin as pendant end units [18,19,20]. Expression of a chimeric HCHL construct increased incorporation of HBAlds into lignin, reducing in turn the molecular weight of lignin and increasing the saccharification efficiency of cell wall residues prepared from the transgenic plants [17].

To identify new genetic tools for reducing biomass recalcitrance by rerouting the monolignol biosynthetic pathway, we screened 48 expression constructs containing coding sequences of 50 enzymes (Additional file 1: Tables S1 and S2) potentially able to manipulate the lignin biosynthetic pathway. The selected enzymes could be classified into several categories, namely, enzymes to prompt biosynthesis of phenolics with chemically labile units such as cinnamate esters or amides; enzymes to inhibit radical formation through the reduction of double bonds in the side chain of monolignol intermediates or by methylation of free phenolic hydroxyl groups of monolignols; enzymes to synthesize compounds with C6-C1 structures; enzymes to introduce an additional free phenolic hydroxyl group into monolignol intermediates; and enzymes expected to reverse the metabolic flow in monolignol pathway via oxidation of monolignols or corresponding aldehydes.

These genes were expressed under the control of a promoter that is active in cells with secondary cell walls, and which is derived from the C4H gene of hybrid aspen [21]. We identified four genes that induced changes in staining intensity and color tone by Wiesner and Mäule reagents, and/or cell shape in stem tissues of transgenic plants: two bacterial genes encoding coniferaldehyde dehydrogenase (calB) and couA, and an Arabidopsis gene encoding feruloyl-CoA 6′-hydroxylase (F6H1) in addition to positive control gene ferulic acid 5-hydroxylase (F5H) from Arabidopsis. The enzymatic saccharification efficiency of cell wall residues from the transgenic plants carrying the two bacterial genes was improved compared with wild-type plants. Overexpression of F6H1 successfully functioned for production of coumarin and structural modification of lignin, but it could not contribute to reduce the lignin recalcitrance. Our simple procedure employing lignin staining is useful for the simultaneous screening of many genes with the potential to improve biomass characteristics.

Methods

Plant materials

Arabidopsis thaliana Columbia-0 (Col-0) plants were grown in soil under 16-h/8-h day/night (60–80 μmol−1 s−1) conditions at 22 °C. Transgenic plants were generated from 8-week-old plants by the floral-dip method [22]. Transgenic plants from harvested seeds were selected on solid Murashige–Skoog (MS) medium containing 0.8% agar with 50 mg L−1 kanamycin.

Gene selection and synthesis

Enzymes acting on monolignols and their precursors as the substrates were collected by searching MetaCyc [23], KEGG [24], UniProt [25], and BRENDA [26] databases and previous reports. The genes encoding these enzymes are listed in Additional file 1: Table S1. Properties of these enzymes are shown in Additional file 1: Table S2. In addition to a positive control (F5H), 49 enzymes were selected: 29, 18, and 2 from plants, bacteria, and fungi, respectively (Additional file 1: Table S1). Coding sequences were optimized for expression in Arabidopsis, chemically synthesized with attL Gateway recombination sites at 5´ and 3´ flanking regions, and cloned into pUC57 or pCC1 vectors (GenScript Inc.).

Plasmid construction

Chemically synthesized DNA fragments of candidate enzymatic genes were transferred into pDEST_PkC4H_HSP_GWB4, a binary Gateway destination vector for the expression of genes under the control of the C4H gene promoter (PkC4Hpro) cloned from hybrid aspen (Populus sieboldii × P. grandidentata Y-63) [21] and terminated by the Arabidopsis HSP18.2 terminator [27]. This destination vector was constructed by inserting approximately 2.5 kbp of the PkC4Hpro region into the pDEST_HSP_GWB4 vector, the latter derived from pGWB401 [28]. The PkC4H promoter was amplified by PCR from the plasmid pCYP73a [21] with primers PkC4Hpro_F (5´-GACTGGAAGGATGTATTGGTTGTTG-3´) and PkC4Hpro_R_Xho (5´-TTTAAACTCGAGTATCTTGGAACTGGTTTCTTTGTC-3´). Fifty genes including F5H (positive control) were individually cloned into pDEST_PkC4H_HSP_GWB4. pcaH and pcaG as well as pcaH2 and pcaG2 were placed in each same plasmid by amplifying expression cassette of pcaG and pcaH2 with primers M13F_Asc (5´-CCCTTTGGCGCGCCTCGTTGTAAAACGACGGCCAGTG-3´) and HSPter_R_Asc (5´-AATTTGGCGCGCCTTATCTTTAATCATATTCCATA-3´) and inserting the amplified fragment into AscI site of the vector carrying pcaH and pcaG2, respectively. Agrobacterium strains harboring individual expression constructs (a total of 48 constructs, including one for F5H; Additional file 1: Table S1) were cultured independently, and 2–8 Agrobacterium cultures were mixed into a single pool. Arabidopsis plants were subject to genetic transformation using the traditional floral-dip procedure with seven pools of Agrobacterium strains (see the column “Gene set” in Additional file 1: Table S3). After recovery of transgenic lines grown from T1 seeds on selective medium with kanamycin, the introduced gene(s) in each transgenic plant was confirmed by the method described below.

GUS staining

The GUS gene was transferred from a pENTR-gus vector (Thermo Fisher Scientific Inc.) into the pDEST_PkC4H_HSP_GWB4 vector. Transgenic Arabidopsis plants were produced by the floral-dip method as described above. T1 transgenic plants were grown for 5 weeks, and sliced sections of the inflorescence stem were stained in X-gluc solution containing 100 mM potassium phosphate, 20% (v/v) methanol, 0.5 mM potassium hexacyanoferrate, 0.5 mM potassium ferrocyanide, and 0.05 mg mL−1 5-bromo-4-chloro-3-indolyl-β-d-glucuronic acid.

Identification of introduced genes in transgenic plants

Because several constructs were mixed for Arabidopsis transformation, introduced genes needed to be identified in each transgenic plant. Genomic DNA was extracted from immature leaves of each transgenic plant (T1) using extraction buffer [200 mM Tris–HCl (pH 8.0), 250 mM NaCl, 25 mM EDTA and 0.2% SDS] and partially purified by ethanol precipitation. The introduced genes were amplified using ExTaq polymerase (Takara Bio Inc.) with the following primers: PkC4Hpro_pd (5′-AAACCCAAGCTCTCCTCATCCTGTTGC-3´) and HSPter_R (5′-GCCACAAATTCATAACACAACAAGCCA-3´). All PCR products were separated on 0.8% agarose gel, extracted with a gel extraction kit (Promega Inc.), and sequenced.

Preparation of cross sections and lignin staining

Transgenic plants were selected on 0.8% agar MS medium containing kanamycin for 2 weeks, transferred to soil, and grown for a further 5 weeks. Cross sections were prepared from the inflorescence stem 3 cm above the soil. For high-throughput lignin staining, several hand-sliced sections from each transgenic plant were placed into separate wells of a 96-well plate and rinsed with water several times before staining. Lignin was visualized by Wiesner and Mäule staining as previously described with some modifications [29] and observed using a SZ61 stereo microscope (Olympus Inc.). For Wiesner staining, sections were stained for 5 min with 5% (w/v) phloroglucinol dissolved in concentrated HCl, followed by replacement with 30% (v/v) HCl. For Mäule staining, sections were stained with 1% (w/v) KMnO4 for 5 min, followed by incubation in 10% (v/v) HCl for 5 min and replacement of the solution with 1.5 M Na2CO3. Stem segments of individual transgenic plants were mounted on a 5% agar block, and 50-μm-thick cross sections were prepared using a vibrating microtome (HM-650 V; Microm Inc.). The 50-μm-thick sections subjected to Wiesner or Mäule staining were observed under an Axioscop2 fluorescent microscope (Carl-Zeiss Inc.).

Lignin quantification

Senesced inflorescence stem was cut into 1-cm segments and fixed in methanol overnight. After three exchanges of methanol, the solution was replaced three times with acetone, and the same procedures were followed with methanol/chloroform (1:1, v/v). The resulting stem segments were rinsed twice with ethanol and dried overnight at 65 °C. Dried stem segments were ground using a Shakemaster Neo (Biomedical Science Inc.) with a stainless-steel bead and three zirconia beads. The resulting powder was gelatinized in 0.1 M sodium malate buffer (pH 6.0) at 65 °C for 10 min, and starch was then digested with 500 U mL−1 α-amylase (Megazyme Inc.) and 0.33 U mL−1 amyloglucosidase (Megazyme Inc.) in 0.1 M sodium malate buffer (pH 6.0) at 37 °C for 18 h. The destarched powder was washed with ultrapure water, rinsed three times with 100% ethanol, and dried overnight at 65 °C. The resultant powder was regarded as the alcohol-insoluble residue (AIR) and used to determine lignin content. Lignin content in the AIR was determined by the micro-Klason method described previously [30] with some modification; instead of water, 10% ethanol was used to rinse residues. AIR powder (2.0–3.0 mg) was hydrolyzed by the two-step sulfuric acid method, and the resulting residue was weighed as the acid-insoluble residue. Levels of acid-soluble lignin were calculated based on the ultraviolet absorbance of the supernatant after acid hydrolysis at 205 nm and a lignin extinction coefficient of 110 L g−1 cm−1.

Pyrolysis–gas chromatography/mass spectrometry (Py-GC/MS) analysis

The analysis of Py-GC/MC detection data was performed according to Van Erven et al. [31]. Pyrolysis was carried out with an EGA/PY-3030D multi-shot pyrolyzer (Frontier Laboratories Inc.) using the above-mentioned AIR powder. The pyrolyzer was connected to a gas chromatographer–mass spectrometer system (GCMS-QP2020, Shimazdu Inc.) equipped with a capillary column (30 m × 0.25 mm; 0.25 μm film thickness; Ultra ALLOY+-5; Frontier Laboratories Inc.). The conditions of pyrolysis, and GC and MS settings were based on those described in Van Erven et al., [31]. Pyrolysis of the AIR sample was performed at 500 °C for 2 min via the split/splitless injector on the column (at 250 °C). The split ratio was 1:30 and helium was used as carrier gas with constant flow at 1.72 ml min−1. The GC oven was programmed as follows: hold at 70 °C for 2 min, elevate to 270 °C at 6 °C/min, then hold at 270 °C for 10 min. MS detection was completed using the electron ionization method with 70 eV electrons. The source temperature was set to 250 °C, the scan range was set to 50-550 m/z, and the scan rate was 2000 scans/s. Conventional pyrolysis products were quantified using the response factors described in Van Erven et al. [31].

Thioacidolysis monomer analysis

Thioacidolysis of plant samples was performed according to the method of Yamamura et al. [32]. Briefly, freshly made thioacidolysis reagent containing 87.5% dioxane, 10% ethanethiol (97%, Alfa Aesar Inc.), and 2.5% boron trifluoride diethyl etherate (> 47.5% BF3, Sigma Aldrich Inc.) was mixed with AIR material (approximately 5–10 mg) in a 1-mL screw-cap reaction vial. The vial cap was screwed on tightly and kept on a heating block at 100 °C for 4 h with gentle shaking. After cooling the vial in ice water for 5 min, 200 μL of product mixture solution was transferred into a new vial cap, and 100 μL of 1 M sodium hydrogen carbonate was added to adjust the pH to 7. Next, 130 μL of 1 M hydrochloric acid solution was used to adjust the pH to below 3. The resultant solution was extracted three times with diethyl ether (250 μL). The combined organic phase was washed with saturated sodium chloride and then evaporated after removing water contamination by adding anhydrous sodium sulfate. The residues were redissolved in approximately 250 μL of diethyl ether, and 10 μL of the solution was silylated by adding 8 μL of N, O-bis(trimethylsilyl)acetamide. The resulting solution was analyzed with a gas chromatograph−flame ionization detector (GC-2010 Plus, Shimadzu Inc.) equipped with a DB-5 capillary column (25 m × 0.25 mm, 0.25 μm film thickness, Agilent Technologies Inc.). The column oven temperature was maintained at 160 °C for the first 1 min, then increased at a rate of 10 °C min−1 to 300 °C. The split injector (1:10) was kept at 220 °C, and the FID was maintained at 300 °C. The flow speed of the carrier gas (nitrogen) was 30 cm/s. Conventional thioacidolysis monomers were quantified using the response factors provided by Yue et al. [33].

Analysis of cell wall-bound phenolics

Levels of hydroxycinnamic acids linked to cell walls via ester or ether linkages were quantified by alkaline hydrolysis and subsequent analysis by GC. The AIR (50 mg) was treated with 4 mL of 4 N sodium hydroxide at 170 °C for 2 h. Tetracosane was added to the mixture as an internal standard. After removal by filtration of the residue, the solution was acidified with 1 N hydrochloric acid and saturated with sodium chloride. The solution was then extracted two times with diethylether and the ether-soluble fraction was recovered. This fraction was evaporated to dryness and the residue was dissolved in 250 μl of diethylether. The trimethylsilylated sample was analyzed by GC − FID (GC-2010 Plus, Shimadzu Inc.) equipped with DB-5 capillary column (25 m × 0.25mmID, 0.25 μm film thickness, Agilent Technologies Inc.). The column oven temperature was maintained at 120 °C for the first 2 min, then increased at 8 °C/min to 240 °C. The Split injector (1:10) was kept at 240 °C and the FID was kept at 300 °C. The flow speed of the carrier gas (nitrogen) was 30 cm/s. Although two hydroxybenzaldehydes (vanillin and syringaldehyde), two hydroxybenzoic acids (vanillic and syringic acids), and three hydroxycinnamic acids (p-coumaric, ferulic, and sinapic acids) were analyzed, differences among the tested samples were only detected in p-coumaric and ferulic acids (Table 1).

Table 1 Lignin content, lignin monomer composition, and level of cell-wall bound phenolics

Monosaccharide composition analysis

Monosaccharide composition was analyzed based on the UPLC-ABEE system as previously described [34]. AIR powder was hydrolyzed by two-step sulfuric acid hydrolysis, and the hydrolysate was neutralized with calcium carbonate. The neutralized supernatant was labeled with ethyl p-aminobenzoate (ABEE) reagent. Chromatographic separation and detection of labeled monosaccharides were performed using an ACQUITY UPLC H-Class system (Waters Inc.) equipped with an ACQUITY UPLC BEH C18 column (100 × 2.0 mm, 1.7 μm particle size, Waters Inc.) and fluorescence detector (ACQUITY UPLC FLR detector, Waters Inc.). The eluents used for the chromatographic separation were 200 mM potassium borate buffer (pH 8.9) and 100% acetonitrile.

Sample pretreatment and enzymatic saccharification

A diluted alkali pretreatment was applied using a slight modification of the method of Santoro et al. [35]. Weighed AIR powder (1.5‒2.0 mg) was mixed with 870 μL of 100 mM NaOH solution and incubated for 2 h at 90 °C. After cooling to room temperature, the solution was neutralized with 20 μL of 30% (v/v) HCl, and saccharification was then carried out with 110 μL of neutralized solution containing cellulase [100 mM citrate (pH 4.5), 0.02% (w/v) sodium azide, 0.2 CPU cellulase (Celluclast® 1.5 L, Merck Inc.), and 0.4 CBU cellobiase (Novozyme 188, Merck Inc.)]. After incubation for 24 h at 50 °C with shaking at 200 rpm, the amount of liberated glucose and xylose was determined with a Glucose CII test kit (Wako Inc.) and Xylose Assay kit (Megazyme Inc.), respectively. For non-pretreated samples, AIR powder was digested with 1,000 μL of cellulase solution [10 mM citrate (pH 4.5), 0.02% (w/v) sodium azide, 0.2 CPU cellulase (Celluclast® 1.5 L, Merck Inc.), and 0.4 CBU cellobiase (Novozyme 188, Merck Inc.)] without any pretreatment.

Statistical analysis

Using R software, the Shapiro–Wilk test (“shapiro.test” function) was used to determine if data were significantly likely to be normally distributed. If the data were assumed to be normally distributed, statistical analysis of growth parameters and biochemical data was performed using Welch’s t test with Bonferroni–Holm correction. However, if normal distribution was not significantly suggested (p < 0.05), the Brunner–Munzel test with Bonferroni–Holm correction was performed using the “brunner.munzel.test” function in the brunnermunzel package of R [36, 37].

Results

Promoter activity of PkC4H in Arabidopsis

To express enzymatic genes in Arabidopsis interfascicular and xylem fibers, the C4H gene promoter (PkC4Hpro) derived from hybrid aspen was selected in view of future application of the effective construct to woody plants. C4H is involved in lignin biosynthesis, hence PkC4Hpro is active in fiber cells and vessel precursor cells in hybrid aspen stems [21, 38]. To examine PkC4Hpro expression in Arabidopsis, the β-glucuronidase (GUS) gene fused with PkC4Hpro was stably introduced into the Arabidopsis genome, and GUS activity was monitored histochemically. GUS activity was evident in most tissues where secondary cell walls develop, including xylem and interfascicular fibers in inflorescence stems, root hypocotyls, siliques, and anther and leaf vasculature (Additional file 2: Fig. S1). This result suggested that the promoter activity was conserved as expected in Arabidopsis and thus appropriate for our purpose.

Positive-control experiment using the F5H expression construct

To confirm that the prepared vector was suitable for our purpose, we used F5H/CYP84A1 as a positive control. Overexpression of F5H under the control of the Arabidopsis C4H promoter has been found to increase the proportion of S units of lignin in an Arabidopsis F5H-deficient fah1-2 mutant, wild-type tobacco, and poplar [8, 9]. This overexpression also reduces the lignin content of transgenic tobacco and poplar plants. In the case of our transgenic Arabidopsis expressing F5H under the control of PkC4Hpro (T1 generation), growth characteristics were similar to those of the wild type (Fig. 2); however, the color intensity of interfascicular fiber cells after lignin staining with Wiesner reagent (phloroglucinol–HCl solution; Fig. 3a) was weaker in multiple T1-independent lines than in the wild type, an observation also reported for F5H-overexpressing tobacco [8]. Furthermore, some vascular vessel elements were slightly distorted (triangle in Fig. 3a); this phenomenon, usually referred to as the “irregular xylem (irx)” phenotype, was not reported in the fah1-2 mutant overexpressing F5H [9]. Vascular xylem cells and vessels of the F5H line were clearly stained in red by the Mäule reaction, while those of the wild type were stained in brown (Fig. 3b). As indicated by the results of py-GC/MS and thioacidolysis of AIR prepared from inflorescence stems, lignin in the F5H line was enriched with S units, as expected (Table 1, Fig. 4b, and Additional file 1: Table S4). From these observations, we concluded that the expected phenotypes were observed in the F5H line and that PkC4Hpro was thus appropriate for driving the expression of the tested genes in this study.

Fig. 2
figure 2

Lignin-modified transgenic plants exhibit no severe growth penalty. a Representative plants fully grown for 8 weeks and then dried for 4 weeks. Bars represent 5 cm. Stem height (b) and stem dry weight (c) of inflorescence stems of fully senesced transgenic lines (n = 9, wild-type; n = 8, PkC4Hpro::F5H; n = 9, PkC4Hpro::calB; n = 8, PkC4Hpro::couA; and n = 8, PkC4Hpro::F6H1) and the wild type. Error bars indicate the 95% confidence interval of the mean. Double asterisks indicate statistically significant differences (P < 0.01) of transgenic lines compared with the wild type according to Welch’s t-test or Brunner–Munzel test with Bonferroni–Holm correction

Fig. 3
figure 3

Wiesner (a) and Mäule (b) staining of hand-cut stem sections from transgenic plants harboring F5H, calB, couA, or F6H1 transgenes. The sections were prepared from 8-week-old T1 transgenic plants generated by the transformation of single Agrobacterium strains harboring PkC4Hpro::F5H, PkC4Hpro::calB, PkC4Hpro::couA, or PkC4Hpro::F6H1 constructs. The representative image for each line is presented. The lower panels are magnified images of corresponding upper panels. Scale bars = 100 μm

Fig. 4
figure 4

Lignin quantities of transgenic plants. Acid-insoluble lignin (AIL) and acid-soluble lignin (ASL) content (a), thioethylated monomer content (b) were measured in AIR prepared from inflorescence stems of wild-type and transgenic T1 plants harboring PkC4Hpro::F5H, PkC4Hpro::calB, PkC4Hpro::couA, or PkC4Hpro::F6H1. Error bars indicate 95% confidence intervals (n = 8 for the wild type, PkC4Hpro::F5H, PkC4Hpro::couA, and PkC4Hpro::F6H1; n = 9 for PkC4Hpro::calB). Single, double, and triple asterisks indicate statistically significant differences (* P < 0.05, ** P < 0.01, and *** P < 0.001, respectively) of transgenic lines compared with the wild type according to Welch’s t-test or Brunner–Munzel test with Bonferroni–Holm correction

Generation of transgenic plants and subsequent systematic screening by cross-sectional lignin staining

To test many enzyme-coding genes, we constructed 48 expression constructs encoding 50 different enzymes, including F5H, the positive control (Additional file 1: Table S1). All enzymes had the potential to act on metabolites in shikimate, general phenylpropanoid, flavonoid, or monolignol biosynthetic pathways (Additional file 1: Table S2). Agrobacterium strains harboring individual expression constructs were cultured independently, and two to eight cultures were mixed into a pool. Arabidopsis plants were transformed using the traditional floral-dip procedure and seven pools of Agrobacterium strains (see the column “Gene set” in Additional file 1: Table S3). After transformation and subsequent selection based on genomic PCR followed by sequencing of amplified DNA, we characterized 296 independent transgenic plants (T1 generation) harboring single or multiple transgenes (Additional file 1: Table S3). Nearly three-quarters of the transgenic plants carried a single transgene, whereas the rest carried two, three, or four transgenes (Additional file 1: Table S3). We failed to recover a transgenic plant carrying Atu1417 in the present study.

The resulting transgenic plants were grown until inflorescence stems were fully elongated. Hand-sliced cross sections of primary inflorescence stems from all 296 transgenic plants were stained with Wiesner and Mäule reagents in 96-well plates (Additional file 2: Figs. S2 and S3). In this way, we found that some plants possessing at least one of the genes calB, F6H1, or couA in addition to the positive control, F5H, showed apparent abnormalities (Additional file 2: Figs. S2 and S3). In the analysis with the Mäule reaction, in particular, inflorescence stems of lines transgenic for calB, couA, and F6H1 (CalB, CouA, and F6H1 lines, respectively) exhibited less staining in vascular xylem and interfascicular fibers (Additional file 2: Fig. S3).

Confirmation of transgene effects in T1 generations derived from independent transformation procedures

To verify the effect of the transgenes observed above, we individually recreated T1 plants carrying each identified gene and compared them with wild type and the positive control line harboring the F5H sequence. As shown in Fig. 2a, b, the stem height of the couA line was shorter than that of the wild type even though other transgenic lines were almost the same as the wild type. The stem dry weight of the couA line was also slightly lower than the wild type, but not significantly different (Fig. 2c).

Although not extremely obvious (Additional file 2: Fig. S2 and S3), calB, couA, and F6H1 lines appeared to be more weakly stained by both Wiesner and Mäule reagents compared with the wild type, a situation particularly evident in interfascicular fiber cells of the newly generated individual T1 lines (Fig. 3). A typical irx phenotype was observed in vascular vessels of the couA line, where the red color generated in interfascicular fibers by Wiesner staining was apparently faint compared with that in calB and F6H1 lines. These data suggest that the expression of these genes leads to changes in lignin content and/or composition.

Characterization of lignin and cell wall phenolics in transgenic lines

Lignin modification in the transgenic plant lines was confirmed by chemical analysis. Lignin content and the monomeric composition of lignin in senesced inflorescence stems were measured by micro-Klason, Py-GC/MS, and thioacidolysis procedures (Fig. 4, Tables 1 and 2, and Additional file 1: Table S4). The lignin content of all transgenic lines was not significantly different (Fig. 4a).

Table 2 Relative abundance of structural features within AIR-sample from wild-type and transgenic inflorescence stem on the basis of molar peak area identified by pyGC/MS

We next investigated lignin structure using thioacidolysis. Compared to wild type, a significant difference in the composition of two conventional monomers with G and S nuclei, indicators of relative amounts of β-O-4 structures in lignin, was observed in the calB, couA, and F6H1 lines (Fig. 4b and Table 1). The total levels of monomers released in these transgenic lines were also apparently reduced. These results indicate that expression of these genes leads to structural alteration of lignin without significant changes in lignin content. CalB can oxidize coniferaldehyde, sinapaldehyde, and benzaldehyde [39]. In addition, CouA is predicted to convert p-coumaroyl-CoA, caffeoyl-CoA, and feruloyl-CoA to their corresponding benzaldehydes [20]. It is thus presumed that levels of hydroxycinnamate and hydroxybenzaldehyde derivatives bound to cell walls are altered by the expression of these enzyme genes. However, contrary to expectations, no significant differences were observed in most of the phenolics released after alkaline hydrolysis of AIR, with the exception of p-coumaric and ferulic acids in the CouA line (Table 1).

To further confirm the structural change of lignin, we analyzed the pyrolysis products derived from cell wall samples of each transgenic line by Py-GC/MS. As listed in Table 2, the relative distribution of S units in all transgenic lines was slightly increased while that of G units was decreased. The data are consistent with the results of thioacidolysis, which indicated higher S/G ratio in the transgenic lines compared to the wild type. In CouA lines, vinyl-substituted products are noticeably more abundant than in the wild type. Van Ervan et al. [31] reported that vinylic pyrolysis products such as 4-vinylphenol and 4-vinyl guaiacol are derived from interunit linkages of lignin and wall-bound phenolics such as p-coumarate and ferulate. This data therefore suggests that these hydroxycinnamic acids bound to cell wall are more abundant in CouA line. Increase of these acids was actually observed by the alkaline hydrolysis followed by GC analysis (Table 1). Another marked difference was a decrease of Cγ-O pyrolyzed products in CouA lines. According to Van Erven et al. [40], these products are primarily derived from lignin interunit linkages with a three-carbon side-chain. Most of the Cγ-O products should be derived from β-O-4 structures in lignin since our pyrolysis conditions gave no secondary reaction to the initial three-carbon side-chain connected with β-O-4 linkage. This is consistent with the reduction of total monomer yield in CouA lines. In F6H1 lines, we detected scopoletin in the AIR sample using Py-GC/MS (Fig. 5). F6H1 is a key enzyme for scopoletin biosynthesis, primarily through feruloyl-CoA 6′-hydroxylation in plant roots [41]. Given this, our strategy using the F6H1 enzyme worked for the production of scopoletin bound to the cell wall, more specifically, possibly to the lignin in xylem tissues.

Fig. 5
figure 5

Detection of Scopoletin in F6H1 transgenic plants. Py-GC/MS was performed for inflorescence stem samples prepared from wild type and PkC4Hpro::F6H1 plants fully grown for 8 weeks and completely dried for 4 weeks without watering. a Total ion chromatogram of pyrolyzed products from wild type and PkC4Hpro::F6H1 lines. The triangle indicates the peak of scopoletin. b Mass spectra of pyrolyzed products of scopoletin. The upper and lower panels indicate mass spectra of scopoletin standard and pyrolyzed products, respectively, from the actual peak shown in (a). c Quantification of scopoletin content in AIR residue by Py-GC/MS. Boxplots represent the range of determined scopoletin content. Horizontal bars in the boxes indicate the median of the detected values. Upper and lower hinges of the boxes indicate 75% and 25% ranges of detected values, respectively. The upper and lower extreme bars of the boxplots indicate the maximum (upper) and the minimum (lower) of reported values, respectively. Each dot indicates raw scopoletin values in each F6H1 line

Monosaccharide composition and saccharification efficiency

We next analyzed the monosaccharide composition of AIR prepared from each transgenic line. We found slight changes in monosaccharide composition in CouA and F6H1 lines compared with the wild type, namely, glucose was decreased and 4-O-methyl-glucuronic acid was increased (Fig. 6a and Additional file 1: Table S5). In the CouA line, galactose, arabinose, and rhamnose were increased as well (Fig. 6a). However, total sugars per unit of AIR were not significantly changed in any of these lines from wild type (Fig. 6b and Additional file 1: Table S5). These results indicate that monosaccharide composition was affected by the modification of lignin in the transgenic lines.

Fig. 6
figure 6

Monosaccharide composition of cell walls of transgenic plants. Each individual monosaccharide (a) the total amounts of 10 monosaccharides (b) in cell walls of fully grown for 8 weeks and completely dried for 4 weeks plants without watering and are shown. Glc, Xyl, GalA, Rha, Man, Gal, Ara, mGlcA, GlcA, and Fuc represent glucose, xylose, galacturonic acid, rhamnose, mannose, galactose, arabinose, 4-methyl-glucuronic acid, glucuronic acid, and fucose, respectively. Error bars indicate 95% confidence intervals (n = 8 for the wild type; n = 16 for PkC4Hpro::F5H and PkC4Hpro::calB; n = 11 for PkC4Hpro::couA; n = 14 for PkC4Hpro::F6H1). Asterisks indicate statistically significant differences (* P < 0.05, ** P < 0.01, and *** P < 0.001, respectively) of transgenic lines compared with the wild type according to Welch’s t-test or Brunner–Munzel test with Bonferroni–Holm correction

To further evaluate the impact of lignin modification on cell wall characteristics of the transgenic lines, we measured levels of glucose and xylose released after enzymatic saccharification to determine the saccharification efficiency of AIR with or without alkaline pretreatment (Fig. 7 and Additional file 1: Table S6). Although not improved significantly without pretreatment, glucose release was increased after pretreatment in CalB, CouA, and F5H (positive control) lines, by 21%, 55%, and 31%, respectively, compared with the wild type. Although the efficiency varied, xylose release was also enhanced significantly by pretreatment in the same transgenic lines. Without pretreatment, improved release of xylose compared with the wild type was observed only in the CouA line. While lignin composition was changed just as in other transgenic lines, no improved sugar release was observed in the F6H1 line.

Fig. 7
figure 7

Lignin engineered transgenic plants improve sugar yield by mild alkali pretreatment. a, b Glucose yield by enzymatic saccharification of dried inflorescence stems from wild type and transgenic plants grown for 8 weeks and completely dried for 4 weeks without watering (PkC4Hpro::F5H, PkC4Hpro::calB, PkC4Hpro::couA, and PkC4Hpro::F6H1) without pretreatment (a) or with dilute alkaline pretreatment (b). c, d Released xylose by the enzymatic saccharification from the cell wall residue of wild-type and transgenic plants without pretreatment (c) or with dilute alkaline pretreatment. Error bars indicate 95% confidential intervals (a, cn = 8, wild-type; n = 11, PkC4Hpro::F5H; n = 14, PkC4Hpro::calB; n = 10, PkC4Hpro::couA; n = 8, PkC4Hpro::F6H1. b, d; n = 8, wild-type; n = 13, PkC4Hpro::F5H; n = 16, PkC4Hpro::calB; n = 11, PkC4Hpro::couA; n = 11, PkC4Hpro::F6H1). Scores above each bars were fold-change from wild-type (upper) and P values (lower, * P < 0.05, ** P < 0.01, *** P < 0.001, respectively) according to Welch’s t-test or Brunner–Munzel test with Bonferroni–Holm correction compared with wild type

Discussion

Lignin manipulation is one of the major techniques for reducing lignocellulose recalcitrance. Manipulation can be achieved by suppressing endogenous genes for lignin biosynthesis [7, 42, 43], and by the expression of heterologous genes to prompt biosynthesis of particular phenolics that can act as alternative lignin monomers [44,45,46]. To our knowledge, this is the first attempt to simultaneously screen a large variety of enzymes for the manipulation of lignin. The screening results were supported by the recreated individual T1 lines. Changes in lignin composition (Fig. 4, Table 1 and Additional file 1: Table S4) and saccharification efficiency (Fig. 7 and Additional file 1: Table S6) were clearly observed in these T1 lines. Our results indicate that systematic screening via a histochemical staining assay in Arabidopsis is adequate to identify lignin modifier genes.

calB encodes coniferaldehyde dehydrogenase from Pseudomonas sp. HR199, which catalyzes the NAD+-dependent oxidation of coniferaldehyde and sinapaldehyde to their corresponding hydroxycinnamic acids (Fig. 1 and Additional file 1: Tables S1 and S2) [39, 47]. calB expression was expected to lead to a reduction in conifer- and sinapaldehyde accumulation and an increase in their corresponding acids. This hypothesis was partially supported by the reduced intensity of red color in interfascicular fibers, as detected by Wiesner staining. However, contrary to expectations, levels of cell wall-bound p-coumaric and ferulic acids seemed to be lower compared to the wild type, while sinapic acid levels were lower than the detection limit in both the calB and wild-type lines (Table 1 and Additional file 1: Table S4). One of the reasons for this might be a significantly higher Km value of calB for coniferaldehyde (334 μM) [39] compared to those of Arabidopsis CAD-4 (65 μM) and CAD-5 (35 μM) [48], which have major role in the reduction of coniferaldehyde for monolignol biosynthesis. However, monomeric composition of lignin was apparently altered, and saccharification efficiency was significantly improved in the calB line, though the mechanism is unclear. Future studies should address the role of calB in improved lignocellulose recalcitrance.

The couA gene of Rhodopseudomonas palustris encodes a couA polypeptide that catalyzes side-chain cleavage of hydroxycinnamoyl-CoAs to generate acetyl-CoA and corresponding HBAlds [20], similarly to Pseudomonas fluorescens HCHL [19]. In one study, expression of hchl under the control of the xylem-specific cellulose synthase A4 promoter in Arabidopsis induced the irx phenotype in one of five independent transgenic lines. However, Wiesner and Mäule staining patterns in stem sections of the HCHL lines remained similar to those in the wild type [17]. In our study, by contrast, the couA line exhibited the typical irx phenotype with collapsed vessels (Fig. 3), along with faint coloration in Wiesner staining, especially in interfascicular fibers. Reduction of thioacidolysis monomer yield without decrease in total lignin content was observed in the HCHL line as well as couA line while the reduction in couA line (Table 1) was apparently larger than that in the HCHL lines [17]. This might be due to differences in catalytic properties of the enzymes, transcriptional activity of the employed promoters, generation of the analyzed transgenic plants (T2 in HCHL lines vs. T1 in couA lines), or a combination of these. Lignin modification often causes change in monosaccharide composition [49]. The impact of gene expression on the monomeric composition of cell wall polysaccharides was observed in the couA line (Fig. 6 and Additional file 1: Table S6). Interestingly, monosaccharide composition of couA lines in our study is changed from wild type; especially Rha, Gal, and Ara, major components of pectin, were clearly increased, suggesting a less developed secondary cell wall in the transgenic plants. In any case, the preferential expression of couA achieved lignin manipulation and subsequent improved sugar release from the transgenic plants without biomass yield penalty.

Similar to couA, F6H1 recognizes feruloyl-CoA, a key intermediate in monolignol biosynthesis, as a substrate. This enzyme is derived from A. thaliana and converts feruloyl-CoA to 6′-hydroxyferuloyl-CoA and 2-oxoglutaric acid (Additional file 1: Table S2) [50, 51]. The resultant 6′-hydroxyferuloyl-CoA is then subjected to non-enzymatic lactonization to generate 6-methoxyumbelliferone (scopoletin) [50]. Scopoletin is well known as a phytoalexin. The genes for its biosynthesis are mainly expressed in roots, where lignin content is limited, but scarcely expressed in stems [40]. While scopoletin is usually accumulated in the vacuole in its glycosylated form, in our study scopoletin was detected from alcohol-insoluble residue regarded as cell wall fraction. This suggests that scopoletin can be transported to the cell wall and then incorporated into lignin, cell wall polysaccharides, or both. A new lignin chain is initiated from its hydroxy residue and terminated at the same site. Even though it is still unclear whether scopoletin is incorporated into the transgenic plant lignin, it has the capacity to act as a lignin initiator and terminator. It is considered that these potential roles of scopoletin could improve enzyme saccharification efficiency of the cell wall as reported previously for both unusual initiator and terminator [17, 52]. Further characterization of lignin structure including studying the distribution of molecular weight will clarify the effect of scopoletin on lignin polymerization.

Conclusions

Efficient mining of candidate genes that manipulate lignin biosynthesis can be accomplished by in planta screening of numerous introduced genes in transgenic Arabidopsis based on staining of stem tissues using two different lignin-staining procedures. In-depth characterization of lignin structure using chemical methods accompanied by investigation of transgene expression and metabolites would provide valuable information for the reduction of biomass recalcitrance by using the identified genes. It should be noted that there are other efficient methods of screening like Fourier transform infrared spectrometer (FTIR) analysis and near infrared spectroscopy analysis (NIRS) and the application of our samples to them would uncover more genes which could change lignin.