1H, 13C, 15N backbone and IVL methyl group resonance assignment of the fungal β-glucosidase from Trichoderma reesei

β-glucosidases have received considerable attention due to their essential role in bioethanol production from lignocellulosic biomass. β-glucosidase can hydrolyse cellobiose in cellulose degradation and its low activity has been considered as one of the main limiting steps in the process. Large-scale conversions of cellulose therefore require high enzyme concentration which increases the cost. β-glucosidases with improved activity and thermostability are therefore of great commercial interest. The fungus Trichoderma reseei expresses thermostable cellulolytic enzymes which have been widely studied as attractive targets for industrial applications. Genetically modified β-glucosidases from Trichoderma reseei have been recently commercialised. We have developed an approach in which screening of low molecular weight molecules (fragments) identifies compounds that increase enzyme activity and are currently characterizing fragment-based activators of TrBgl2. A structural analysis of the 55 kDa apo form of TrBgl2 revealed a classical (α/β)8-TIM barrel fold. In the present study we present a partial assignment of backbone chemical shifts, along with those of the Ile (I)-Val (V)-Leu (L) methyl groups of TrBgl2. These data will be used to characterize the interaction of TrBgl2 with the small molecule activators.


Biological context
The depletion of fossil fuel in combination with the increasing demand for energy worldwide has instigated research on alternative and sustainable energy sources such as biofuels. Lignocellulosic (LC) biomass such as wood, agricultural residues and dedicated energy crops are abundant and available at low cost and have received considerable global attention as the most promising alternative, renewable source for biofuel production. LC biomass, the structural backbone of all plant cell walls, is composed mainly of cellulose, in combination with hemicellulose and lignin (Service 2007). A mixture of enzymes that together are known as cellulase, catalyze cellulose degradation and comprise three categories of enzymes; endoglucanases (EC 3.2.1.4), exoglucanases or cellobiohydrolases (EC 3.2.1.91) and β-glucosidases (EC 3.2.1.21) (Sticklen 2008) (Brethauer and Studer 2015). Endoglucanases cleave the internal β-1,4-glycosidic bonds of cellulose microfibrils releasing small fragments. Subsequently, exoglucanases or cellobiohydrolases (CBH) act on the reducing and non-reducing ends resulting in short chain cello-oligosaccharides such as cellobiose, which are hydrolysed into glucose by the action of β-glucosidases (Sticklen 2008). However, the activity of β-glucosidase is a rate limiting step which results in accumulation of cellobiose and subsequent inhibition of other cellulases (Lynd et al. 2002;Bommarius et al. 2008;Resa and Buckin 2011;Brethauer and Studer 2015). Currently, this is overcome by using high concentrations of β-glucosidases increasing the cost of large-scale conversions. Many efforts have therefore been directed towards the improvement of catalytic activity and thermostability of the enzyme using mainly traditional genetic approaches (Lee et al. 2012).
Trichoderma reesei produces large amounts of thermostable cellulolytic enzymes, which make them attractive targets for industrial applications (Gao et al. 2017). Genetically modified β-glucosidases from Trichoderma reesei have been commercialized and are included in cellulolytic enzyme cocktails that have been developed by companies. Recently, Jeng et al. (2011) solved the crystal structure of the 55 kDa β-glucosidase from Trichoderma reesei (TrBgl2) at 1.63 Å resolution, which was found to adopt a classical (α/β) 8 -TIM barrel fold and be in tight association with a TRIS molecule at the active site (Jeng et al. 2011).
In previous studies, we have identified fragment-based activators of enzyme activity (Darby et al. 2014). We are currently characterizing fragment-based activators of TrBgl2. The work reported here is a partial assignment of backbone chemical shifts, along with those of the Ile (I)-Val (V)-Leu (L) methyl groups of TrBgl2. These assignments will provide the basis for a study of the interaction of TrBgl2 with the small molecule activators.

Sample preparation
The codon-optimized DNA sequence for TrBgl2 was inserted into the pET-YSBLIC3C plasmid with an N-terminal His 6 cleavable tag (Fogg and Wilkinson 2008). The plasmid was transformed into the E. coli BL21 (DE3) bacterial strain. Expression was performed using the method from Gans et al. (2010) with some modifications. Labelled samples were grown in M9/D 2 O minimal media containing 15 NH 4 Cl (1.0 g/L) and 13 C 6 -glucose (1.5 g/L). The cells were incubated at 37 °C until reaching an OD 600 of 0.4 when 2-Keto-3-(methyl-d3)-butyric acid-1,2,3,4-13C4, 3-d sodium salt (SIGMA, Cat. 637858) and 2-Ketobutyric acid-13C4,3,3-d2 sodium salt hydrate (SIGMA, Cat. 607541) were added (Gans et al. 2010;Tugarinov and Kay 2003). The cells were further incubated at 37 °C until reaching an OD 600 of 0.7-0.8. The temperature was then decreased to 16 °C and the protein expression was induced with 0.5 mM IPTG and cells were harvested after overnight incubation at 16 °C. The cells were resuspended in a buffer containing 20 mM TRIS-HCl pH 7.5, 400 mM NaCl, 10 mM Imidazole and then disrupted by using the Cell Disruption System (Benchtop, Constant Systems Ltd). After centrifugation, the soluble TrBgl2 was initially purified by Ni 2+ -NTA chromatography as described previously (Jeng et al. 2011). The sample was then loaded onto a size exclusion column (16/600 Superdex 200, GE Healthcare), and eluted with a buffer containing 50 mM TRIS-HCl pH 8, 100 mM NaCl, 3 mM DTT. The purified protein was concentrated by ultrafiltration with 10 kDa cut-off filter and analyzed by SDS-PAGE. Samples were prepared with 10% D 2 O (v/v) yielding concentration of 0.25 mM.

Assignment and data deposition
Spectra of TrBgl2 were initially assessed in a phosphatebased buffer at pH 6.0 and a TRIS-based buffer at pH 8.0. Spectral quality was markedly improved in TRIS buffer at pH 8.0, with several additional peaks becoming visible (data not shown). This observation, together the fact that a TRIS Fig. 1 The assigned 700 MHz [ 15 N, 1 H] TROSY NMR spectrum of TrBgl2. Assigned backbone N-H cross peaks are labelled with the corresponding residue number molecule was found at the active site of the crystallographic structure (Jeng et al. 2011), suggests that TRIS might stabilize apo TrBgl2.
The chemical shift assignment of TrBgl2 was performed in semi-automatic mode using the FLYA algorithm (Schmidt and Guntert 2012). The FLYA assignments were manually inspected and extended. Recombinant TrBgl2 consists of 488 amino acids (including an N-terminal His 6 cleavable tag) and has a molecular weight of 55 kDa. In total, we were able to assign 265 backbone amide resonances and 55% of the methyl group resonances of the IVL residues, from 459 non-proline residues (Figs. 1 and  2). Assignment completeness is limited possibly due to incomplete back-exchange of perdeuterated amide groups, with only ~ 75% of the expected resonances being observed in the [ 15 N, 1 H] TROSY spectrum of perdeuterated TrBgl2. Nonetheless, the assignments here reported provide a useful NMR basis for studies of the interaction of TrBgl2 with small molecules. The assignment of chemical shifts of TrBgl2 has been deposited into BMRB (https ://www. bmrb.wisc.edu/) with accession number 50158.
TrBgl2 secondary structure motifs were predicted by the TALOS-N software (Shen and Bax 2013) using the assigned backbone resonances as input data. The software indicated that the overall secondary structures are in agreement with the X-ray structure of TrBgl2 (PDB ID 3AHY) (Jeng et al. 2011) (Fig. 3).  Secondary structure prediction of TrBgl2 analyzed with TALOS-N using the assigned chemical shifts compared to the secondary structure of the X-ray structure of TrBgl2. Top: colored bars (red and blue bars indicate α-helix and β-strands respectively) show the secondary structure type predicted by TALOS-N. The bar height represents the prediction confidence. The black line shows the predicted S^2 order parameter, a measure of flexibility. Bottom: the secondary structure of TrBgl2 as determined by X-ray crystallography (PDB ID 3AHY) (Jeng et al. 2011) Acknowledgements EM is very grateful to Michael Plevin (University of York) for his help and support in sample preparation. This work was supported by the European Union's Horizon2020 MSCA Programme under grant agreement 675899 (FRAGNET); research in the group of R.E.H. was additionally supported by research Grants from the BBSRC (BB/N008332/1) and institutional infrastructure support from funds provided by the Wellcome Trust and EPSRC.
Funding This work was supported by the European Union's Hori-zon2020 MSCA Programme under Grant agreement 675899 (FRAG-NET); research in the group of R.E.H. was additionally supported by Research Grants from the BBSRC (BB/N008332/1) and institutional infrastructure support from funds provided by the Welcome Trust and EPSRC.

Data availability
The data that support the findings of this work are available on BMRB with the following entry assigned accession number: 50158.

Compliance with ethical standards
Conflict of interest I confirm that all authors of the manuscript have no conflict of interest to declare.
Informed consent I confirm that all authors of the manuscript have given consent to participate to this work.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creat iveco mmons .org/licen ses/by/4.0/.