Introduction

Due to its wide biotechnological application, the cellulose- and hemicellulose-degrading enzyme system of the filamentous fungus Trichoderma reesei has attracted many researchers and was consequently studied in considerable detail. Enzyme yields can exceed 100 g/l in industrial T. reesei fermentations, but the production and formulation of enzyme mixes still remains a bottleneck in the commercialization of products derived from lignocellulosic biomass. It is a peculiarity that virtual all strains used in industry and academia are derived from a single isolate T. reesei QM6a, which is regarded as the T. reesei reference strain and for which the genome sequence is available (Cherry and Fidantsef 2003; Martinez et al. 2008; Gupta et al. 2014; Bischof et al. 2016). Besides its potent cellulolytic enzyme machinery, T. reesei produces several xylanases which efficiently degrade xylan, the major component of hemicellulose and the second most abundant renewable biomaterial in plant biomass (Saha 2003; Ramoni and Seiboth 2016). In total, six xylanases (XYN1–6) are encoded in its genome belonging to the glycoside hydrolase (GH) families 10, 11, and 30 following the carbohydrate-active enzyme (CAZyme) classification (Lombard et al. 2014). The description of the first two GH11 family xylanases XYN1 and XYN2 dates back more than two decades (Tenkanen et al. 1992; Törrönen et al. 1992). These GH11 members are called “true” xylanases because they exclusively catalyze endo-β-1,4 cleavage (EC 3.2.1.8) of xylan. Enzymes in this family are particularly advantageous for various biotechnological applications, as they are small in size (∼20 kDa) and have high catalytic efficiencies with varying temperature and pH optima (Paës et al. 2012). XYN3 belongs to the GH10 family (Nakazawa et al. 2016; Wang et al. 2014), which exhibits endo-β-1,4-xylanase activity, but can also show endo-β-1,3-xylanase (EC 3.2.1.32) or xyloglucan/xyloglucosyl transferase activity (EC 2.4.1.207) as observed for plants (Derba-Maceluch et al. 2015). The GH30 XYN4 exhibits not only clear exo-activity but also some endo-activity, releasing mainly d-xylose from various linear xylooligosaccharides and xylans from the reducing end (Tenkanen et al. 2013). The appendage-dependent glucuronoxylan hydrolase XYN6, also of GH30 family, shows high specific activity on xylans and xylooligosaccharides that contain d-glucuronic acid or 4-O-methyl-d-glucuronic acid side substituents in an almost identical manner as bacterial GH30 xylanases do (Biely et al. 2014). However, in contrast to its bacterial counterparts, fungal XYN6 also revealed low activity on unsubstituted xylan and acidic xylooligosaccharides.

Transcription of the different xylanase genes depends beside other transcription factors mainly on the presence of the cellulase and xylanase regulator xylanase regulator 1 (XYR1; Stricker et al. 2006; Herold et al. 2013) and their induction is found in the presence of different sugars: Transcriptional regulation of the two GH11 family members revealed that xyn1 is induced by different monomeric breakdown products of xylan including d-xylose and l-arabinose. Xyn2 is constitutively expressed and can be further induced by d-xylose, in addition to common cellulase inducers such as cellulose and sophorose (Herold et al. 2013; Mach et al. 1996; Zeilinger et al. 1996). Similar to xyn1, xyn4 is induced by d-xylose and l-arabinose (Herold et al. 2013). Potential conflicting results to former studies regarding inducing carbon sources can be explained by the observation that these xylanases are easily repressed in a carbon catabolite repressor 1 (CRE1)-dependent manner by higher concentrations of the two pentoses and that they are only inducible in the presence of lower concentrations (Herold et al. 2013). Xyn3 and xyn6 are exclusively induced by typical cellulase inducers such as cellulose, l-sorbose, or sophorose but not by xylan-derived carbon sources (Herold et al. 2013; Xu et al. 2000; Jonas Ramoni and Bernhard Seiboth unpublished data).

A further GH11 xylanase, xyn5, was overlooked during the T. reesei QM6a genome annotation (Martinez et al. 2008) and only later reported in a transcriptome analysis of conidiation regulated genes (Metz et al. 2011). Although its transcript was detected in a number of studies (Bischof et al. 2013; Häkkinen et al. 2012; Herold et al. 2013; Ivanova et al. 2013), the corresponding enzyme was not found in the T. reesei secretome (Herpoël-Gimbert et al. 2008; Peciulyte et al. 2014; Saloheimo and Pakula 2012), purified or biochemically characterized. Our analysis of the xyn5 sequence and its corresponding protein of the genome sequenced T. reesei strain QM6a provides an explanation for this. In silico data shows that only a truncated, non-functional protein is encoded by this gene. However, sequencing of several xyn5 alleles from other T. reesei wild-type strains revealed that these harbor a putative functional version of this xylanase in their genomes. Here we report the recombinant expression and biochemical characterization of a full-length XYN5 in a T. reesei strain with a (hemi)cellulase-free background (Uzbas et al. 2012). The produced XYN5 was purified to apparent homogeneity and its substrate specificities and enzymatic profile determined. In addition, replacement of the mutated version of xyn5 by a functional xyn5 allele led to a significant improvement of the overall xylanolytic activity.

Materials and methods

Strains and growth conditions

Strains used were the wild-type strain T. reesei QM6a (ATCC 13631) and different recombinant strains derived from the early cellulase mutant QM9414 (ATCC 26921) including Δxyr1 (Stricker et al. 2006) and Δtku70 which were constructed as described (Guangtao et al. 2009), with the only difference that the selection marker for tku70 deletion was ptrA (Kubodera et al. 2002). QM9414 is derived from QM6a and was obtained by two rounds of irradiation in a linear particle accelerator (Mandels et al. 1971).

Other T. reesei wild-type strains were TUCIM283 (listed also under: C.P.K.283 or G.J.S.97-178), TUCIM665 (C.P.K.665 or TUB F-733), TUCIM936 (C.P.K.936 or G.J.S.89-7), and TUCIM1282 (C.P.K.1282, G.J.S.85-249 or CBS142139) from the TU Collection of Industrially Important Microorganisms and were described previously (Druzhinina et al. 2010; Kuhls et al. 1996; Samuels et al. 2012). Strain propagation, selection of transformants, and purification were performed on potato dextrose agar (PDA, Difco, Detroit, MI, USA) using 100 μg/ml hygromycin B (Carl Roth GmbH + Co. KG, Karlsruhe, Germany) as selective agent when needed. For liquid culture experiments, strains were grown in 250 ml Mandels-Andreotti (MA) medium (Mandels and Andreotti 1978) adjusted to pH 5 and supplemented with 1% (w/v) carbon source in 1-l Erlenmeyer flasks on a rotary shaker at 28 °C and 250 rpm. Escherichia coli Stellar™ (Clontech Laboratories, Inc., Mountain View, CA, USA) cells were used for plasmid construction and amplification.

Sequencing of T. reesei xyn5 alleles

Fungal DNA was extracted as previously described (Liu et al. 2000; Seiboth et al. 2004). Xyn5 including 1.3 kb promoter and 0.7 kb terminator region was amplified by PCR from genomic DNA with oligonucleotides Infuse_xyn5_for and Infuse_xyn5_rev (Supplementary Table S1) using Phusion High-Fidelity DNA Polymerase (Thermo Scientific, Waltham, MA, USA). Sequencing of the xyn5 coding region was performed by Sanger DNA sequencing at Microsynth AG (Balgach, Switzerland).

Protein analysis

In silico analysis of the encoded proteins was performed using SignalP 4.1 Server (http://www.cbs.dtu.dk/services/SignalP/), NetNGlyc 1.0 Server (http://www.cbs.dtu.dk/services/NetNGlyc/), and NetOGlyc 4.0 Server (http://www.cbs.dtu.dk/services/NetOGlyc/). The KEX2 processed aa sequence was used to predict the protein structure using I-TASSER (Roy et al. 2010; Yang et al. 2015; Zhang 2008) and PyMOL (The PyMOL Molecular Graphics System, Version 1.3r1 Schrödinger, LLC. 2010).

Transcriptional analysis

Strains were grown for 24 h on MA medium containing 1% (v/v) glycerol as carbon source. Subsequently, mycelia were collected, washed, and transferred to MA medium without carbon source. Following 30 min of incubation, d-xylose or l-arabinose to a final concentration of 1 mM was added to these cultures. Samples of mycelia were taken directly before adding the carbon source and 2, 4, and 6 h after addition. As control, strains were also cultured without carbon source addition. Mycelia were harvested, frozen in liquid nitrogen, and stored at −80 °C. Following RNA extraction (Chomczynski and Sacchi 1989), 5 μg of total RNA was reversely transcribed to complementary DNA (cDNA) using the RevertAid H minus first-strand cDNA synthesis kit (Thermo Scientific) with a 1:1 mixture of the supplied oligo-dT and random hexamer primers. The RT-qPCR reaction was performed in triplicates in an Eppendorf RealPlex2 Mastercycler (Eppendorf, Hamburg, Germany) in 96-well plates as described (Herold et al. 2013) using the transcription elongation factor 1α gene tef1 for normalization. Relative gene expression was analyzed using REST software (Pfaffl et al. 2002). Oligonucleotides are found in Supplementary Table S1. Results are from at least two biological replicates with technical triplicates.

Fungal strain construction

For expression of xyn5 from T. reesei TUCIM1282, its coding region and terminator region was amplified from genomic DNA using primers infuse_xyn5_for and infuse_xyn5_rev. The 1.4 kb amplicon was introduced by recombinational cloning using the InFusion® HD Cloning Kit (Clontech Laboratories, Inc.) in the ClaI digested pLH_hph1_cDNA1 (Uzbas et al. 2012). The resulting plasmid p_xyn5oe contains the hphB expression cassette for selection of transformants on hygromycin B and xyn5 under the control of the T. reesei cDNA1 promoter region which is highly active in glucose grown cultures (Nakari-Setälä and Penttilä 1995). Ten micrograms of circular p_xyn5oe was transformed in T. reesei Δxyr1 using protoplast transformation (Gruber et al. 1990). Transformants were selected on PDA plates containing 100 μg/ml hygromycin B and purified by single spore isolation on plates containing 0.1% (v/v) Triton X-100. Genomic DNA from purified strains was isolated (Liu et al. 2000), and integration of the xyn5 overexpression cassette was verified by PCR using primers seq_xyn5_3_for and seq_cDNA1_vec_rev. For positive strains amplification of a 1.2-kb fragment comprising the cDNA1 promoter region, the xyn5 open reading frame, and terminator was observed (Supplementary Fig. S1a).

The non-functional xyn5 present in the QM6a strain line was replaced in T. reesei Δtku70 by the TUCIM1282 xyn5 allele. Therefore, a 2.7-kb fragment containing 1.3 kb of the upstream and 0.7 kb of the downstream region of xyn5 was amplified from genomic DNA of TUCIM1282 using oligonucleotides p1f_p/txyn5_tgpd and P2R_Txyn5. This amplicon was introduced by InFusion® HD cloning into the XhoI digested pLH_hph (Hartl et al. 2007) which contains a hygromycin B resistance cassette for selection. The resulting vector was designated pxyn5hr. Using oligonucleotides P3F_Xyn5_5′flank and P4R_Xyn5_5′flank, 1.1 kb of a further upstream region of xyn5 was amplified and cloned into the XmaI digested pxyn5hr resulting in p_xyn5hr_flank. Primers Pop_Xyn5F and Pop_Xyn5R were used to amplify the 6.2 kb xyn5 replacement cassette which was transformed into swelling conidia of T. reesei Δtku70 by electroporation (Schuster et al. 2012). Transformants were selected and purified as described above. For verification of the integration of the full-length xyn5 of TUCIM1282 at the endogenous xyn5 locus in T. reesei Δtku70, PCR was applied using oligonucleotides PF_xyn5_tricho_col and PR_xyn5_tricho_col. These primers bind in the 5′ flanking region of xyn5 spanning the introduced hphB cassette. The amplicon of positive transformants was therefore 3 kb long, whereas the amplicon of the wild-type locus was 0.6 kb long. In case of an ectopic integration, both amplicons were detected (Supplementary Fig. S1b). Replacement was further verified by PCR amplification of xyn5 and restriction analysis of the amplicon (Supplementary Fig. S1c, e). Estimation of the xyn5 gene copy number (Supplementary Fig. S1d) was performed by qPCR (Tisch et al. 2011). The introduced xyn5 coding region was finally verified by sequencing using p1f_p/txyn5_tgpd and P2R_Txyn5 for amplification and Ver_xyn5_1 for sequencing. The two individual positive transformants +xyn5 A and +xyn5 B were used in the further experiments. All oligonucleotides are found in Supplementary Table S1.

XYN5 production and purification

Transformants harboring the xyn5 expression cassette were grown in MA medium containing 1% d-glucose. Culture supernatants were harvested, filtered through Miracloth (Calbiochem, San Diego, CA, USA) and stored at 4 °C. The supernatant of these strains was analyzed for the presence of an around 19 kDa band representing XYN5 by SDS-PAGE (data not shown). For enzyme purification, culture supernatants of strain xyn5g1 harvested after 36 h of cultivation were passed through a 0.22-μm pore size filter (Steritop Filter, Millipore, Billerica, MA) following concentration using Amicon Ultra filter units with 10 kDa cutoff membranes (Millipore, Billerica, MA, USA). XYN5 was then purified using a Mono S HR5/5 column (GE Bioscience, Chalfont, UK) which was equilibrated according to the operating manual with 10 mM NaAc pH 4.5 (buffer A). Bound proteins were eluted with a linear NaCl gradient ranging up to 0.1 M NaCl. Fractions were analyzed by xylanase activity tests and SDS-PAGE. Purified XYN5 protein concentration was determined by Nanodrop spectrophotometer (Thermo Scientific, Vienna, Austria). To analyze if XYN5 is glycosylated, the purified protein was treated with Endo-T (Stals et al. 2012) for 12 h at 28 °C. All culture supernatants were stored at 4 °C until further use.

Xylanase assays

For the enzymatic characterization of XYN5 the enzyme was purified as described above. Xylanase activity was measured with the DNS method with 1.5% (w/v) beechwood xylan (Sigma-Aldrich, St. Louis, MO, USA; Product number X4252) dissolved in 10 mM NaAc buffer pH 4 (Bailey et al. 1992; König et al. 2002). For all reactions, the reaction volume was adjusted according to König et al. (2002) to fit 1.5-ml tubes. All reactions were done in triplicates, and d-xylose was used as standard to calculate the amount of released sugars. One unit was defined as the enzyme amount liberating 1 μmol of reducing sugars in 1 min under the given conditions. Non-linear fitting for determination of the enzyme activity was performed using SigmaPlot v13.0 (Systat Software Inc., San Jose, CA). For the determination of the optimal temperature, xylanase activity was determined in 10 mM NaAc buffer (pH 4) using different temperatures from 30 to 70 °C. For the determination of the pH optimum, McIlvaine buffer was used from pH 2.5 to 6.5 and reactions were performed at 50 °C.

To determine the enzymatic activity in the xyn5 transformants +xyn5a and +xyn5b and the Δtku70 reference strain, strains were cultured on MA medium containing either 1% (w/v) lactose, beechwood xylan, or steam exploded wheat straw (kindly provided by Alexander Jäger of the University of Applied Sciences Upper Austria, FH Wels) in biological duplicates. Culture supernatants were filtered through Miracloth and stored at 4 °C until further use. Biomass was determined in technical triplicates by pelleting 1.5 ml of culture biomass which was subsequently washed with tap water, pelleted again, and dried to constant weight at 80 °C. For the determination of the endo-1,4-ß-d-xylanase activity, the chromogenic substrate S-AXBL (Megazyme International Ireland, Wicklow, Ireland) was used. Reactions were performed as described in the manufacturer’s manual but volumes were reduced 2.5 times to fit 1.5-ml reaction tubes.

Mass spectrometry

Protein identification was carried out on an UltrafleXtreme (Bruker Daltonik, Bremen, Germany) after in-gel digestion, by MS and MS/MS analysis. In brief, after excising the gel lanes and removing the Coomassie staining by incubating the gel pieces in acetonitrile/100 mM NH4HCO3 (pH 8.5) (1/1, v/v), the samples were reduced with 10 mM dithiothreitol, alkylated with 50 mM iodoacetamide and trypsinized (20 ng trypsin, porcine, Roche, Basel, Switzerland). After overnight digestion, peptides were extracted with acetonitrile/water, dried in a vacuum centrifuge, and micropurified with C18 ZipTips (Merck Millipore, Billerica, MA, USA) and then eluted onto a stainless steel target together with α-cyano-4-hydroxy-cinnamic acid (CHCA, 3 mg/ml in 50% acetonitrile containing 0.1% trifluoroacetic acid). For all enzymatic digestion data, autolytic tryptic products, keratin and gel blank artifacts were assigned and removed before database search using an in-house Mascot server (Perkins et al. 1999). The database search was performed with the following parameters: taxonomy fungi, monoisotopic mass values, peptide mass tolerance of ±0.3 Da, two missed cleavages, carboxyamidomethylation as fixed modification, and methionine oxidations as variable modification. A protein was considered correctly identified if the search result was above the statistical threshold for the peptide mass fingerprint and all respective sequencing experiments.

DNA sequences

T. reesei xyn5 nucleotide sequence data are available in the DDBJ/EMBL/GenBank under the accession numbers KX139136 (TUCIM283), KX139137 (TUCIM665), KX139138 (TUCIM938), KX139139 (TUCIM1282/CBS 142139), KX455497 (QM6a), and KX455498 (RUT C-30).

Results

A non-functional version of xyn5 is present in the reference strain T. reesei QM6a

By microarray analysis and genome mining, a yet uncharacterized xylanase, designated as xyn5, was discovered in T. reesei (Metz et al. 2011). A sequence comparison of the encoded protein, deposited under GenBank ID XP_006969745.1, to the XYN5 orthologues of other genome-sequenced Trichoderma spp. revealed that the deposited protein sequence of the T. reesei reference strain QM6a is truncated. Following this annotation, a 152 aa protein is produced as the reading frame is interrupted by a stop codon which leads to a loss of the C-terminal part including one of the two conserved glutamates of the active site (Supplementary Fig. S2). However, this alignment also revealed that at the N-terminus of XYN5, a signal peptide for export in the endoplasmic reticulum is missing. According to our sequence analysis which is based on the comparison of the T. reesei QM6a xyn5 to other orthologues, an ATG about 50 bp upstream from the annotated start ATG represents the true start codon (Fig. 1). But even when we used this start codon, we did not obtain a full-length XYN5 and the size of the encoded XYN5 was even reduced to 28 aa for T. reesei QM6a (Supplementary Fig. S2). This truncation of XYN5 is not the result of a sequencing error as our resequencing of the QM6a xyn5 confirmed the deposited nucleotide sequence. Additional evidence comes from the xyn5 sequence of the QM6a mutant RUT-C30 (GenBank ID: ETR96953.1) which is identical to the xyn5 of strain QM6a. As sequence variations between different wild-type T. reesei strains can be considerably high (Linke et al. 2015), we sequenced the xyn5 coding region of the four T. reesei wild-type strains TUCIM283, TUCIM665, TUCIM938, and TUCIM1282. Sequence analysis shows that in these strains, the xyn5 orthologues translate into a 226 aa protein which is in accordance to the size of the XYN5 orthologues found in Trichoderma atroviride (225 aa) and Trichoderma virens (226 aa) (Supplementary Fig. S2). The encoded XYN5 proteins in T. reesei TUCIM938 and TUCIM1282 have a 100% aa identity while XYN5 from TUCIM283 shows a 99% (224 of 226) and XYN5 from TUCIM665 a 98% (222 of 226) aa sequence identity to the XYN5s of TUCIM1282 and TUCIM938. The aa sequence identity of the XYN5s of T. reesei TUCIM938 and TUCIM1282 to the XYN5s of T. atroviride and T. virens is only 87 and 90% respectively. Comparison of the TUCIM1282 and different Trichoderma spp. xyn5 sequences to that of strain QM6a identified a number of nucleotide differences responsible for the truncation found in strain QM6a. These differences include a four nucleotide deletion in the 5′ part of the xyn5 coding region leading to a premature translational stop resulting in the 28 aa XYN5 version of T. reesei QM6a (Fig. 1). In addition, a C ➔ T transition leads to the truncation in the NCBI deposited XYN5 of strain QM6a (Supplementary Fig. S3).

Fig. 1
figure 1

Alignment of the first 100 nucleotides of xyn5 coding region of different Trichoderma spp. and Trichoderma reesei wild-type strains. Nucleotides were aligned and differences were manually annotated. When translated from the conserved start ATG found in all xyn5 genes, the ORF of T. reesei QM6a misses 4 bp which leads to a frameshift and the termination of the ORF by a TAA stop codon. The start codon of the original NCBI database entry (XP_006969745.1) is found 46 bp downstream which misses therefore the XYN5 signal peptide sequence. GenBank accession numbers for xyn5 are KX455497 (T. reesei QM6a), KX139139 (T. reesei TUCIM1282). The other sequences were retrieved from the JGI Genome Portal MycoCosm (http://genome.jgi.doe.gov/programs/fungi/index.jsf) and are Trias1|83211 (Trichoderma asperellum), Triat2|46014 (Trichoderma atroviride), TriviGv29_8_2|65505 (Trichoderma virens), and Triha1|118868 (Trichoderma harzianum). Residues highlighted by a black background are conserved in at least 90%, while residues highlighted by gray in 40% of the xyn5 sequences

Transcriptional regulation of xyn5

To address if xyn5 is expressed in T. reesei TUCIM1282, we performed a comparative transcriptional analysis under xylanase inducing conditions using d-xylose and l-arabinose as potential inducers. This analysis showed that xyn5 is inducible by both pentoses in TUCIM1282, whereby transcript levels on l-arabinose were higher compared to d-xylose (Fig. 2). We also tested xyn5 transcription in T. reesei QM6a and found a comparable carbon source-dependent regulation which is in line with earlier results for xyn5 transcription in the QM6a derivative QM9414 (Herold et al. 2013).

Fig. 2
figure 2

Transcriptional regulation of xyn5 in T. reesei TUCIM1282 (violet bars) and QM6a (red bars). Strains were grown for 24 h on 1% (v/v) glycerol as carbon source and then transferred to medium with 1 mM d-xylose (a) and 1 mM l -arabinose (b). The fold change of xyn5 expression in QM6a and TUCIM1282 was measured 2, 4 and 6 h after transfer to the inducing medium and normalized to the expression of tef1. Xyn5 transcript levels were compared to the corresponding time points of cultures without a carbon source. Values represent the mean ± SD of at least two biological replicates (color figure online)

In silico analysis and structural modeling of XYN5

According to previous reports which used the truncated XYN5 in their analysis (Häkkinen et al. 2012), it belongs to the GH11 family. The full-length protein consists of 226 aa including a 19 aa signal sequence leading to a protein of a predicted molecular mass of 24 kDa. The deduced protein shows higher sequence identity to XYN1 (69% aa identity) than to XYN2 (48% aa identity) of T. reesei. A comparison with the homologous T. reesei XYN1 revealed that XYN5 contains a putative dibasic (R-R) cleavage site for the endopeptidase KEX2 at position 47 and 48 (Supplementary Fig. S2). The KEX2 processed aa sequence was then used to predict the protein structure using I-TASSER and PyMOL. The analysis revealed typical GH11 structural motifs: the protein is formed mainly by antiparallel β-sheets with one α-helix. The β-sheets are thereby packed against each other and form the substrate binding cleft where the active site with two glutamic acids (E75 and E164) is present (Fig. 3) (Biely et al. 1997; Törrönen and Rouvinen 1997). Furthermore, three potential N-glycosylation sites (position 29, 75, 97) but no O-glycosylation site are found. In accordance with the other T. reesei GH11 xylanases, XYN5 lacks also a carbohydrate binding module.

Fig. 3
figure 3

3D surface model of XYN5 from T. reesei TUCIM1282 based on the structure of XYN1 (PDB: 1XYN). The protein is primarily composed of GH11 typical antiparallel β-sheets with a unique α-helix. These secondary structures form a substrate binding cleft, which is illustrated on the left side. The two GH typical catalytic residues, E75 and E164, are located within this cleft (right side)

Recombinant expression and biochemical characterization of XYN5

To demonstrate that xyn5 of T. reesei TUCIM1282 encodes a functional xylanase, xyn5 was recombinantly overexpressed under the control of the cDNA1 promoter region in T. reesei strain Δxyr1 (Uzbas et al. 2012) to guarantee a xylanase free environment caused by the deletion of the major xylanase regulator xyr1. Purified T. reesei transformants were analyzed for the integration of the xyn5 expression cassette (Supplementary Fig. S1a) and the production of an additional protein of about 19 kDa in the supernatant compared to the expression host Δxyr1. The supernatant of the positive strain xyn5g1 was chosen for purification of XYN5. The small number of background proteins present in the supernatant of strain xyn5g1 could be removed by cation exchange chromatography. Following this procedure, two protein bands found around 21 and 19 kDa were detected by SDS-PAGE (Fig. 4). Analysis of the two protein bands by MS/MS identified both as XYN5 (Supplementary Fig. S4). Following the treatment of the respective XYN5 fractions with Endo-T, an endo-N-acetyl-β-d-glucosaminidase hydrolyzing the linkage between the two N-acetylglucosamine units of N-glycans (Stals et al. 2012), the larger XYN5 band disappeared, indicating that the double band is the result of a differentially N-glycosylated XYN5 (Fig. 4). The purified fractions of XYN5 were then pooled to determine the enzymatic characteristics. The optimal pH of 4 for XYN5 (Fig. 5) was similar to the reported pH optima for XYN1 between 3.5 and 4.5 (Table 1) and thereby lower than XYN2 with optima between 4.5 and 5.5. Highest XYN5 activity was measured at 50 °C (Fig. 5), whereas for XYN1 40 °C and for XYN2 45 °C were reported. The V max of purified XYN5 at pH 4 and 50 °C on beechwood xylan was 2646 nkat/mg (Supplementary Fig. S5).

Fig. 4
figure 4

SDS-PAGE of the purified and deglycosylated XYN5 expressed in T. reesei strain xyn5g1. Following purification of XYN5 by cation exchange chromatography, two bands corresponding to T. reesei TUCIM1282 XYN5 were detected. This double band was reduced to a single band by Endo-T treatment of the XYN5 fraction. Lanes: 1, Endo-T; 2, molecular size marker; 3, purified XYN5 protein (2 μg) after cation exchange chromatography; 4, Endo-T treated purified XYN5 (2 μg)

Fig. 5
figure 5

pH and temperature optimum of the recombinantly expressed XYN5 from T. reesei TUCIM1282 in T. reesei Δxyr1. For determination of the pH optimum, beechwood xylan was dissolved in 0.1 M McIlvaine buffer and DNS-based assays were performed at 50 °C. For the determination of the temperature optimum, the substrate was dissolved in 10 mM NaAc buffer pH 4. Highest enzyme activity was detected at pH 4 and 50 °C. Mean ± SD of two biological replicates are shown

Table 1 Comparison of XYN5 to the other GH11 xylanases of T. reesei

Reintroduction of a functional xyn5 allele in T. reesei QM9414 increases total xylanase activity

The finding that xyn5 of T. reesei TUCIM1282 encodes a functional xylanase prompted us to analyze the effect of a reintroduction of the xyn5 allele into the T. reesei QM6a strain line. Therefore, we used as recipient strain a Δtku70 strain of the early cellulase mutant QM9414 (Mandels et al. 1971) and homologously exchanged its xyn5 with the xyn5 allele of strain TUCIM1282 under control of its native promoter region. Positive transformants were analyzed for homologous recombination at the xyn5 locus and for single copy integration of the xyn5 allele (Supplementary Fig. S1b–e). The recombinant strains were then analyzed for xylanase activity on different xylanase inducing substrates including beechwood xylan, wheat straw, and the soluble carbon source lactose. On all substrates, the overall endo-xylanase activity of the recombinant strains was improved compared to the parental strain (Fig. 6). Highest xylanase activities were recorded on beechwood xylan, where the reintroduction of full-length xyn5 improved the overall xylanolytic activity by 58% in comparison to the parental strain. The improvements in the overall xylanolytic activity were particularly present at later time-points of cultivation. This is in line with the finding that xyn5, together with xyn1, are highly expressed at low pH values which are typically reached at later stages of cultivation (Häkkinen et al. 2015).

Fig. 6
figure 6

Increase in xylanolytic activity upon homologous replacement of the non-functional xyn5 in T. reesei strain Δtku70 by the T. reesei TUCIM1282 xyn5. Total xylanase activity of the strains harboring the full-length xyn5 (blue line) was compared to the parental strain Δtku70 (red line) during growth on MA medium using beechwood xylan, lactose, or wheat straw as carbon source. Lactose and beechwood xylan samples represent the activities corrected for biomass, whereas wheat straw samples represent volumetric activities. Mean ± SD of two biological replicates is shown (color figure online)

Discussion

In the T. reesei genome, three xylanases belonging to the GH11 family are encoded. Two of them were already characterized in the 1990s (Tenkanen et al. 1992; Törrönen et al. 1992), while the third GH11 xylanase XYN5 was discovered only recently. Obviously, due to an improper annotation as a glycoside hydrolase family 18 protein, it was missing from the original publication of the genome sequence of the T. reesei reference strain QM6a (Martinez et al. 2008). Only later during the manual annotation of transcripts expressed during conidiation, this third xylanase of the GH11 family was discovered. Although transcription of xyn5 was reported, the enzyme did not appear in any proteomic or biochemical study (Borin et al. 2015; Dos Santos Castro et al. 2014; Jun et al. 2013; Peciulyte et al. 2014; Saloheimo and Pakula 2012). Based on our comparative analysis of xyn5 alleles and their corresponding aa sequences from different Trichoderma spp. and T. reesei wild-type strains, we show that the deposited protein sequence of the reference strain QM6a is too short since a start codon further upstream of the originally proposed start codon encodes the most likely beginning of the protein sequence. We also identified several nucleotide differences within the coding region of xyn5 from T. reesei QM6a which result in a very small truncated protein. However, the xyn5 alleles of four other T. reesei strains encode a full-length protein which indicates that the relevant mutations leading to the truncated form of XYN5 happened specifically and recently in T. reesei strain QM6a history.

This full-length XYN5 aa sequence reveals a higher structural similarity to the already described XYN1 compared to XYN2. But all three GH11 xylanases are highly similar in size and exhibit comparable biochemical properties. A direct comparison of the kinetic properties is difficult as the data found in different publications on XYN1 and XYN2 are quite divergent (Table 1). One possible explanation for this discrepancy might be the high variations observed for endo-1,4-β-xylanase assays when performed in different laboratories under non standardized conditions (Bailey et al. 1992). An interesting property of XYN5 is its temperature optimum. With 50 °C, it exceeds the temperature optimum of the other two GH11 xylanases and this enzyme seems therefore to be more suitable for reactions which run at higher temperatures.

Regarding the expression profile of xyn5 in T. reesei, we show that its transcription responds to the presence of l-arabinose and d-xylose in the wild-type strains TUCIM1282 and QM6a similar to an earlier study in the early cellulase mutant T. reesei QM9414 (Herold et al. 2013). Xyn5 is also upregulated during sporulation (Metz et al. 2011) and belongs to a group of xylanases which are induced by a broader spectrum of inducers. These include also typical cellulase inducers such as sophorose (Ghassemi et al. 2015). This is in contrast to the regulation of xyn3 and xyn6 which are exclusively induced by cellulases inducers (Herold et al. 2013; Xu et al. 2000).

Our study is a further example that a comparative approach within a species can be useful to identify mutated genes which have lost their function during evolution of the strain. In a previous study, we have identified mutations in the MAP kinase scaffold encoding ham5 in T. reesei QM6a which resulted in a female sterile phenotype of this strain. This non-functional HAM5 prevented sexual reproduction within the industrial and academic strain lines whereas the reintroduction of ham5 led to fertile strains (Linke et al. 2015). Here the reintroduction of xyn5 led to an improvement of the overall xylanase activity. Both studies display good examples that comparative analysis of the genes and genomes of different T. reesei strains is able to reveal non-functional genes and thereby expand the potential biotechnological use of this fungus. This dormant potential of less characterized wild-type strains might be further applied to identify additional non-functional CAZymes to improve the hydrolytic potential of this strain as this is still a major barrier towards cost-efficient production of second generation bioethanol. In addition, it can also be used to identify genes responsible for novel traits in the wild-type strains which are absent in the reference strain line of T. reesei QM6a. Such genes could be used, e.g., as dominant nutritional markers for genetic strain transformation. Introduction of these genes in the organism is described as cisgenesis, and the results are similar to conventionally strain breeding as genes are only transferred between closely related organisms. These homologous genes are considered to be advantageous over genes from heterologous origin, as expression of them usually does not need further optimization. They are preferred particularly in the food and feed industry over markers from other organisms or antibiotic markers as they facilitate the admission procedure for such products.