Late metabolic precursors for selective aromatic residue labeling

Schörghuber, Julia; Geist, Leonhard; Platzer, Gerald; Feichtinger, Michael; Bisaccia, Marilena; Scheibelberger, Lukas; Weber, Frederik; Konrat, Robert; Lichtenecker, Roman J.

doi:10.1007/s10858-018-0188-z

Late metabolic precursors for selective aromatic residue labeling

Article
Open access
Published: 28 May 2018

Volume 71, pages 129–140, (2018)
Cite this article

Download PDF

You have full access to this open access article

Journal of Biomolecular NMR Aims and scope Submit manuscript

Late metabolic precursors for selective aromatic residue labeling

Download PDF

Julia Schörghuber¹,
Leonhard Geist²,
Gerald Platzer²,
Michael Feichtinger²,
Marilena Bisaccia¹,
Lukas Scheibelberger¹,
Frederik Weber¹,
Robert Konrat² &
…
Roman J. Lichtenecker¹

2877 Accesses
18 Citations
3 Altmetric
Explore all metrics

Abstract

In recent years, we developed a toolbox of heavy isotope containing compounds, which serve as metabolic amino acid precursors in the E. coli-based overexpression of aromatic residue labeled proteins. Our labeling techniques show excellent results both in terms of selectivity and isotope incorporation levels. They are additionally distinguished by low sample production costs and meet the economic demands to further implement protein NMR spectroscopy as a routinely used method in drug development processes. Different isotopologues allow for the assembly of optimized protein samples, which fulfill the requirements of various NMR experiments to elucidate protein structures, analyze conformational dynamics, or probe interaction surfaces. In the present article, we want to summarize the precursors we developed so far and give examples of their special value in the probing of protein–ligand interaction.

Anthranilic acid, the new player in the ensemble of aromatic residue labeling precursor compounds

Article Open access 31 August 2017

Towards cost-effective side-chain isotope labelling of proteins expressed in human cells

Article Open access 22 August 2024

Non-canonical amino acid labeling in proteomics and biotechnology

Article Open access 22 May 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The structure and interplay of proteins determine the cell’s proliferation, development, function and fate. A deep understanding of their complex conformational properties and interaction networks represents the key issue to unravel the principles of life at a molecular level. Only if the properties of the single nodes in e.g. protein signal cascades or metabolic networks are known and the interaction mechanisms understood, subtle changes leading to pathogenic progression and disease development can be addressed by developing non-endogenous therapeutic compounds. NMR spectroscopy is one of the three main methods, next to X-ray diffraction and cryo-electron microscopy, to investigate the properties of proteins at an atomic, or near-atomic resolution (Banci et al. 2010). The special value of protein NMR is given by the highly diverse set of possible pulse sequences, which can give information about structural properties, dynamic processes and the interaction with binding partners. This information can be obtained under near native conditions from samples in aqueous buffer solution. However, the NMR-based elucidation of proteins is limited by sensitivity and resolution issues (Ardenkjaer-Larsen et al. 2015), which are partly compensated by constant improvement of experimental techniques, hardware development and sample preparation (Campbell 2013), especially concerning novel developments in protein isotope labeling techniques (Ohki and Kainosho 2008).

Introducing ¹³C, ¹⁵N and ²H at defined atomic positions improves signal resolution and leads to spectra simplification. As a result, NMR signals are attributed to the corresponding nuclei more easily and can be transferred to automated assignment algorithms (Güntert 2009). Highly selective labeling shifts the molecular weight limit of proteins which are amenable to structure calculation based on NMR data. The number of high quality structure restraints can be increased in this case resulting in more accurate protein structures. Furthermore, most of the diverse NMR experiments, which have been developed to probe protein dynamics at different time scales require isolated spin systems (Ishima et al. 2001). The analysis of these ¹³C–¹H or ¹⁵N–¹H spin pairs is greatly simplified due to the lack of one-bond ¹³C–¹³C or three-bond ¹³C–¹H coupling, and thus allows for an accurate interpretation of the corresponding relaxation dispersion rates. The importance of protein NMR in the drug development process is constantly increasing as this method provides valuable information about binding sites and large interaction surfaces (Pellecchia et al. 2002). However, protein NMR is associated with high costs and still far from being a high throughput method. Highly selective, economic protein labeling can improve the situation by decreasing the minimal sample concentrations required for certain NMR experiments.

Two main complementary methods have been developed to implement defined protein isotope patterns. In cell-based approaches, a host organism is grown in media containing suitable isotopologues of metabolic amino acid precursors. After cellular uptake, these compounds are converted into the target residues within their metabolic pathways in-vivo. Such overexpression systems have been described for prokaryotic (E. coli) (Hoogstraten and Johnson 2008; Mondal et al. 2013), as well as eukaryotic cells (yeast, insect cell-lines) (Morgan et al. 2000; Takahashi and Shimada 2010). Especially when early metabolic intermediates are used as labeled nutrients, the danger of cross-labeling to unwanted positions is very high, thus resulting in unselective isotope patterns. Cross-labeling is avoided in the second method, which uses cell lysates in-vitro to generate the target proteins from isotope containing amino acids (Kainosho et al. 2006; Kainosho and Güntert 2009; Staunton et al. 2006; Takeda et al. 2010; Torizawa et al. 2004, 2005). These cell-free methods lead to very selective labeling patterns, but their use is often still hampered by high costs and limited applicability (Casteleijn et al. 2013).

The introduction of late α-ketoacid metabolic precursors for valine, isoleucine and leucine (Gardner and Kay 1998; Goto et al. 1999; Lichtenecker et al. 2004, 2013a, b), as well as methionine (Fischer et al. 2007) resulted in hitherto unrivaled labeling selectivity in cell-based protein overexpression. Further development led to techniques of stereoselective methyl labeling in leucine, valine or isoleucine (Ayala et al. 2012; Gans et al. 2010) and extended selective labeling to alanine and threonine (Ayala et al. 2009; Velyvis et al. 2012). The assembly of ¹³CHD₂ methyl groups (Chaykovski et al. 2003; Ollerenshaw et al. 2005; Weininger et al. 2012b) provides optimized isotope patterns to probe for conformational changes. Compared to all these advanced techniques of aliphatic residue labeling (reviewed by Kerfah et al. 2015), the methods to introduce defined isotope patterns into aromatic residues extensively lagged behind for many years. This is all the more surprising, because phenylalanine, tyrosine, tryptophan and histidine are regarded as sensitive reporters of protein dynamics, as well as being valuable sources of structural restraints. In addition, these residues are significantly overrepresented at protein interfaces and play a prominent role in guiding enzyme mechanisms (Bogan and Thorn 1998). The absence of a comprehensive toolbox of amino acid precursors for selective aromatic residue isotope labeling inspired us to identify novel compounds, which show effective cell-uptake and well-defined in-vivo conversion to the target residues in E. coli overexpression systems.

Precursor identification

Early metabolic precursors (biosynthetic intermediates upstream of the shikimate- or the pentose phosphate pathway) have been applied to achieve a certain isotope distribution in the corresponding target residues. Isotopologues of d-glucose (Teilum et al. 2006; Lundström et al. 2007, 2009a, b; Weininger et al. 2012a, 2013), glycerol (Ahlner et al. 2015; LeMaster and Kushlan 1996; Takeuchi et al. 2008), d-erythrose (Kasinath et al. 2013, 2015; Weininger 2017b), d-ribose (Weininger 2017a), acetate (Wand et al. 1995) and pyruvate (Guo et al. 2009; Lee et al. 1997; Lundström et al. 2008; Milbradt et al. 2015; Robson et al. 2018) have been particularly used to reduce unwanted ¹J_CC coupling in NMR-based analysis of protein dynamics. These methods have been frequently applied by the biomolecular NMR community, since the corresponding precursors are commercially available and additional synthetic organic chemistry is not needed. However, a significant degree of cross-labeling cannot be avoided when using early metabolic precursors, which not only leads to heavy isotope incorporation at unwanted atomic positions, but also results in labeling of others than the target amino acids. Since the compounds mentioned above are central to the biosynthesis of all four proteinogenic aromatic residues, generating residue-selective isotope patterns is impossible. In addition, the poor selectivity causes limited maximal isotope incorporation levels, which is to some extent compensated by supplying increased precursor concentrations in the overexpression media. The resulting high consumption rate of isotope labeled compounds is a serious expense factor and limits the sample throughput of protein NMR spectroscopy.

In order to address the issues mentioned above, we identified late metabolic precursors for aromatic residue labeling. Phenylpyruvate and (4-hydroxyphenyl)pyruvate are the substrates of the transaminase catalyzed conversion to the corresponding target amino acids l-phenylalanine and l-tyrosine, respectively (Scheme 1) (Lichtenecker et al. 2013c). These two compounds represent the only non-chiral intermediates in the corresponding biosynthetic pathway and are thus ideal structurally simple targets for isotopologue synthesis (Lichtenecker 2014). The α-ketoacid derivative of tryptophan, indolepyruvate, is not part of the amino acid biosynthesis, but the first intermediate in the corresponding degradation pathway. However, we could identify this compound as a selective precursor for tryptophan labeling (Schörghuber et al. 2015). The reversible character of the corresponding transaminase EC 2.6.1.27 reaction leads to efficient conversion of the precursor to the target residue in this case. In order to access isotopologues of the indole side-chain, we tested structurally more simple compounds for their use in selective tryptophan labeling. Considering the irreversibility of the anthranilate synthase EC 4.1.3.27 catalyzed elimination of pyruvate from chorismate, we could provide evidence that isotope patterns in anthranilate, as well as indole can be transferred to the tryptophan side-chain without losing heavy isotopes in the shikimate pathway (Schörghuber et al. 2015, 2017a). Regarding histidine labeling, we focused on the first intermediate of the minor degradation pathway, imidazolepyruvate. Again, the reversible transaminase EC 2.6.1.38 reaction ensured an effective in-vivo conversion to the target residue. In this case we applied the stable enol-tautomer of imidazolepyruvate as a labeling precursor (Schörghuber et al. 2017b). All of the identified precursor compounds mentioned showed highly selective labeling of the corresponding target residues in absence of any cross-labeling to undesired atomic positions.

Precursor synthesis

Scheme 2 summarizes the precursors we used for aromatic residue labeling so far. These compounds have been prepared via multistep organic synthesis, which we optimized in terms of robustness, yields, labeling selectivity and costs (see the corresponding literature for details). We used commercially available sources of ¹³C as starting materials or reagents, such as isotopologues of acetone, glycine, potassium cyanide and formaldehyde (Lichtenecker 2014; Lichtenecker et al. 2015). Deuterium patterns have been exclusively derived from deuterium oxide, which is the cheapest source of ²H available. 1-¹³C labeled precursors 1, 4, 6 and 14 have been applied to probe for labeling selectivity and precursor uptake in diverse model protein overexpression systems. Their straightforward synthesis (1, 4 and 6 are prepared from [1-¹³C]glycine in three steps) renders them economic tools for residue-selective backbone-labeling to be used for e.g. unambiguous signal assignment in high molecular-weight protein complexes.

The Phe-precursors 2 and 3, the Tyr-precursor 5, as well as the Trp-precursors 9, 10, 12 and 13 display well-defined deuteration patterns leading to isolated ¹³C–¹H spins. These systems are devoid of any additional scalar couplings, which otherwise distort relaxation rate analysis. Highly regioselective ¹H/²H exchange was performed on electron-rich aromatic rings in acidic deuterium oxide under elevated reaction temperatures. All these compounds can be accessed from one common synthetic intermediate (isotopologues of 4-nitrophenol 17 and 19), which is a significant advantage from an economic point of view (Scheme 3). Labeled histidines are important sensors for protein dynamics and help to elucidate the pK_a values of the imidazole ring (Hansen and Kay 2014; Hass et al. 2008). The ε-¹³C His-precursor 15 exhibits an inherently isolated ¹³C–¹H spin system and was developed to provide an optimal isotope pattern to probe this unique heteroaromatic side-chain. Compound 15 can be prepared via a straightforward 5-step route starting from [¹³C]formaldehyde (Schörghuber et al. 2017b).

Precursor uptake and labeling selectivity

Effective uptake of isotope labeled precursors by the overexpressing organism is of utmost importance, since this factor determines the highest possible isotope incorporation level at certain precursor concentrations in the corresponding media. Isotope incorporation at a given concentration may vary with different target proteins as a function of protein size, number of target residues and overexpression conditions. We identified the following precursor concentrations as being required in the E.coli overexpression media to achieve near-quantitative to quantitative isotope incorporation at the target atomic positions: 60–150 mg/L phenylpyruvate, 80–200 mg/L 4-hydroxyphenylpyruvate (Lichtenecker et al. 2013c), 20–60 mg/L indolepyruvate, 12–30 mg/L indole (Schörghuber et al. 2015), 8–30 mg/L anthranilic acid (Schörghuber et al. 2017a), and 50–100 mg/L imidazolepyruvate (Schörghuber et al. 2017b). This data has been deduced from labeling efficiency plots, which were obtained by overexpressing the corresponding protein sample in presence of different precursor concentrations. The low amount of late metabolic precursors required to achieve maximal heavy isotope contents is in sharp contrast to labeling methods using early metabolic intermediates. Examples from literature report concentrations of 1–4 g/L in this case, leading to isotope enrichment of 30–75% at the desired atomic positions in Phe, Tyr, Trp or His-residues (e.g. Kasinath et al. 2013; Weininger 2017b). These values are far from the quantitative isotope labeling, which we observed when applying the late metabolic precursor compounds illustrated in Scheme 2.

Using single atom ¹³C-labeled early metabolic precursors induces a certain pattern of ¹²C/¹³C isotopes, but despite of improving selectivity by exploiting auxotrophic organisms (e.g. LeMaster and Kushlan 1996) or supplying the growth media with enzyme inhibitors (e.g. Tong et al. 2008) a certain degree of cross-labeling cannot be ruled out. In various applications of our precursor toolbox, we did not observe any isotope scrambling so far. Our experiments indicate that any isotope pattern, which can be implemented onto the structures shown in Scheme 2 will quantitatively be transferred to the corresponding target residue. Most importantly, this is also true for patterns of (non-solvent exchangeable) deuterium atoms. Consequently, late metabolic precursors of aromatic residues can be applied to introduce aromatic ring protons into protein samples with high overall deuterium levels. In this case, uniform deuterium labeling can be achieved using deuterium oxide together with ²H-containing early metabolic isotope sources such as [all-²H]glucose.

Another strategy to achieve well-defined isotope patterns in protein samples applies labeled amino acids as additives to the overexpression media. Examples from literature show that amino acid concentrations in the low mg/L range may result in high incorporation levels (Kemple et al. 1994; Miyanoiri et al. 2011; Vuister et al. 1994). The concentrations required can even be further decreased when using auxotrophic expression strains (Lin et al. 2011; Yang et al. 2015). Other reports indicate that the use of labeled amino acids in E. coli-based overexpression systems is to some extent limited by metabolic product feedback control mechanisms, which may lead to decreased isotope uptake, retarded cell growth or cross-labeling (Krishnarjuna et al. 2011; O’Grady et al. 2012; Rowley 1953). In the case of aromatic residues, Phe, Tyr and Trp have shown to inhibit the E. coli DHAP synthase isoenzymes, which control the carbon flow into the shikimate pathway (Herrmann 1995). Additionally, certain levels of amino acid concentrations affect the translation machinery, thereby slowing down growth rates (Avcilar-Kucukgoze et al. 2016). Besides, organic synthesis of complex isotope patterns in the case of amino acids is considerably elaborate and expensive, due to the required implementation of at least one center of chirality, as well as the need of introducing ¹⁵N by additional synthetic steps if nitrogen-15 labeling is desired (Miyanoiri et al. 2011). In contrast to that, ¹⁵N-labeling of the target residues is straightforward in the case of applying metabolic amino acid precursors by adding ¹⁵N-salts to the expression media. In order to reduce synthetic efforts, more elaborate commercially available isotope sources can be applied. Phenylalanine, for instance, has been prepared from labeled tyrosine in two steps (Wang et al. 1999). The simplified synthesis, however, comes along with increased costs for the starting compound. In a recently published noteworthy economic approach, labeled phenylalanine was produced in E. coli from glycerol and secreted into the growth medium. The thus isolated amino acid subsequently served as an isotope source in recombinant E. coli protein overexpression (Ramaraju et al. 2017). However, the application of amino acids in cell-based overexpression required the addition of metabolic inhibitors and unlabeled amino acids in order to obtain high isotope incorporation levels also in this case. One literature reported protocol makes use of shikimic acid to generate protonated aromatic residues in an otherwise uniformly ¹³C-protein (Rajesh et al. 2003). It can be considered as rather improbable that this strategy will be transferred from reverse-labeling to selective ¹³C-labeling in aromatic residues in future, due to the required synthesis of ¹³C-shikimic acid.

Table 1 gives a -by no means exhaustive- overview concerning the various strategies of aromatic residue labeling in E. coli based and cell-free systems published so far. The data shown shall illustrate the differences in selectivity, precursor concentrations needed, synthetic effort and isotope incorporation levels.

Table 1 Overview of published protocols concerning aromatic residue protein labeling

Full size table

Applying late metabolic aromatic precursor compounds to investigate proteins and their interaction sites

Chemical shift mapping

We chose bromodomain 1 of bromodomain-containing protein 4 (Brd4-BD1) as an example to highlight the benefits of labeling isolated positions in aromatic side-chains. Brd4 is a chromatin reader that binds to acetylated lysines in histones and has proven to be a promising cancer target in the pharmaceutical industry (Zeng and Zhou 2002; Sanchez et al. 2014). We already described Trp-labeled Brd4-BD1 in previous work (Schörghuber et al. 2017a), and we want to elaborate on the application of selective aromatic labeling for probing ligand interaction further using this system. Figure 1 compares the ¹H–¹³C HSQC spectrum of uniformly ¹³C-labeled Brd4-BD1 (Fig. 1a) with the spectra of either Trp(¹³C^ε3/¹³C^η2)- or Tyr(¹³C^ε)-labeled Brd4-BD1 (Fig. 1b, blue and red spectrum). Figure 1b shows six tryptophan- and seven tyrosine resonances according to the three ¹³C^ε3/¹³C^η2-labeled tryptophans, as well as the seven ¹³C^ε-labeled tyrosines present in the Brd4-BD1 sequence. All the signals are well-defined and devoid of any splitting due to additional scalar couplings. On the contrary, the tryptophan resonances of the uniformly labeled sample barely exceed the noise threshold and all aromatic side-chain signals are affected by strong J_CC couplings to neighboring positions. Especially the Tyr-signals suffer from substantial signal overlap, which severely restricts their use for ligand induced chemical shift perturbation studies.

Figure 2 illustrates the chemical shift perturbations (CSPs), which have been induced by three different ligands in ¹H–¹³C HSQC spectra of Trp(¹³C^ε3/¹³C^η2)-labeled Brd4-BD1 (Fig. 2a) and Tyr(¹³C^ε)-labeled Brd4-BD1 (Fig. 2b). The corresponding protein samples were overexpressed in E. coli using precursor 12 (20 mg/L medium), or precursor 5 (100 mg/L medium) containing minimal medium, respectively. The low-molecular weight ligands added target the binding cleft of Brd4-BD1, which is lined by one tryptophan (Trp81) as well as two tyrosine residues (Tyr97 and Tyr139). The spectra reveal that the only resonances to be affected originate from residues in close contact to the binding site (Tyr97, Tyr139 and Trp81). The third shifting tyrosine resonance is very likely caused by Tyr98, which is in close contact to Tyr97 and thus also influenced by ligand binding. Importantly, strong CSPs in the proton dimension of selectively labeled sites provide crucial information about the proximity of aromatic ring-systems of interacting ligands. CSPs induced in the tryptophan spectrum (Fig. 2a) show only marginal differences between the three tested ligands. Only one of the compounds induces a strong upfield shift in the proton dimension of the η2-position of Trp81, indicating a binding mode, which places an aromatic ring-system on top of H^η2 (Fig. 2a, magenta spectrum).

Ligand induced cross-correlation rates

In additional experiments, we employed selective Trp ¹³C^ε3/¹³C^η2 labeling on Brd4-BD1 to evaluate the effect of different ligands on the Chemical Shift Anisotropy (CSA)–Dipol-Dipol (DD) cross correlation rate (CCR) η of the labeled carbon nuclei. As mentioned before, the Brd4-BD1 system comprises three tryptophans, one of which (Trp81) is embedded in a hydrophobic pocket that is targeted by various inhibitor molecules (Jung et al. 2014). Cross correlation between CSA and DD interactions in a system of coupled 1/2 spins can yield valuable information about local electronic structure and dynamics (Brutscher 2000; Kumar et al. 2000). Therefore, we chose precursor 12 to generate exclusive ¹³C^ε3–¹H/¹³C^η2–¹H spin pairs in the target residues. This pattern eliminates scalar coupling contributions from nearby C^ζ2–H and C^ζ3–H pairs. In order to calculate CCRs for Trp81^ε3 and Trp81^η2 we utilized a coupled version of a constant time ¹H–¹³C HSQC to extract peak intensities for the upfield (I₊) and downfield (I₋) components of the ¹³C doublets, which allows for the calculation of corresponding cross-correlation rates. Figure 3 displays the twelve signals that emanate from the three tryptophan residues present. Highlighted are the 1D slices for both Trp81C^ε3 and Trp81C^η2 showing the upfield (+) components in red, and the downfield (−) components in blue. The ratio of peak intensities (I₊/I₋) is related to the CCR (η) via ln(I₊/I₋) = 2Tη where T is the mixing time in the NMR experiment during which cross correlated relaxation is active (Brutscher 2000).

Figure 4 shows 2D spectra and extracted 1D traces in the presence of a small molecule ligand and their corresponding η values. CCR rates for Trp81^ε3 and Trp81^η2 were determined to 46.5 and 50.2 s⁻¹. Interestingly, these rates were smaller than in the apo-state of BRD4 (Trp81^ε3: 50.8 s⁻¹; Trp81^η2: 53.3 s⁻¹) presumably due to subtle changes of the CSA tensor and/or local conformational dynamics.

Structural restraints in intrinsically disordered proteins

Highly selective aromatic residue labeling holds promise to yield valuable additional distance information through well-defined NOE signals, even in the case of structurally flexible protein regions or intrinsically disordered proteins (IDPs). Various research groups are just beginning to explore the potential of IDPs as important targets in drug development (Zhang et al. 2015). The acquisition of structural restraints to identify potential preformation of specific secondary structure elements in the unbound state, as well as their conformational stabilization upon ligand binding, is however still a very challenging task (Konrat 2014). Figure 5 shows a ¹³C-NOESY-HSQC strip of an intrinsically disordered N-terminal fragment (residues 50-171) of Yes-associated protein (YAP). This region was identified to contain the binding site to the transcription factor TEAD (Vassilev et al. 2001). The YAP/TEAD interaction regulates the Hippo pathway, which is deregulated in various cancers and therefore represents a promising target for cancer therapy (Liu et al. 2012). A recent study indicates propensities for the preformation of an α-helix and an N-terminal β-strand in YAP50-171 in its unbound state (Feichtinger et al. 2018). These partially preformed secondary structural elements, together with an omega-loop (residues 86-100), form the interacting surface upon TEAD binding. For the omega-loop, no structural preformation was anticipated so far. In order to further investigate the potential preformation of certain structural elements in this IDP, a uniformly ¹⁵N-YAP 50-171 sample was additionally labeled using compound 3 in the corresponding growth medium. Significant long-range (side-chain) NOEs between Phe95/96-H^ς and Leu91-H^γ and Leu91-H^δ were observed (Fig. 5). This data shows that, although YAP is largely unfolded in absence of its binding partner, very selective labeling can identify direct distance proximities and thus long-range structural preformation of the Ω-loop region (residues 86-100). This property was not perceived by analyzing chemical shift data via secondary structure propensity score calculation (Marsh et al. 2006) in previous studies (Feichtinger et al. 2018).

Conclusion

To conclude this article, we want to summarize the various reasons, which make us think that our ensemble of heavy isotope containing aromatic compounds can be considered as a valuable tool for the NMR-based investigation of structure and dynamics in different proteins, as well as the NMR-guided drug development process. The use of metabolic amino acid precursors downstream of the shikimic acid pathway ensures for highly selective labeling with maximum incorporation rates. The effective cellular uptake and direct in-vivo conversion to the target residues not only reduces the amount of labeled protein sample required, but also creates new opportunities to tune certain isotope patterns to the demands of the NMR experiments applied. Well-defined isotope distribution results in highly resolved sets of NMR resonances even in the case of high molecular weight protein targets, which can be probed for changes upon ligand addition in a straightforward way. Binding of different ligand structures can thus be compared without the need of laborious full signal assignment. Concerning our synthetic routes to generate special precursor isotopologues, we laid the focus on isolated spin systems, which detect dynamic properties without interfering additional ill-defined relaxation pathways. An aspect of special significance is given by the reduced content of our labeled compounds in the growth media required to achieve quantitative protein isotope incorporation levels. Our precursors can be synthesized in gram scale from very cheap isotope sources, which significantly helps to reduce costs in the otherwise very expensive sample preparation process.

We believe that aromatic residue labeling using late metabolic precursor compounds will further develop fields of biomolecular NMR, where highly selective labeling is an essential precondition for effective data acquisition. Examples include protein NMR-based drug development, in-cell NMR spectroscopy, studies of IDPs, the investigation of high-energy conformations, as well as the probing of very weak protein–protein interactions.

References

Ahlner A, Andresen C, Khan SN, Kay LE, Lundström P (2015) Fractional enrichment of proteins using [2-¹³C]-glycerol as the carbon source facilitates measurement of excited state ¹³Cα chemical shifts with improved sensitivity. J Biomol NMR 62:341–351
Article Google Scholar
Ardenkjaer-Larsen JH, Boebinger GS, Comment A, Duckett S, Edison AS, Engelke F, Griesinger C, Griffin RG, Hilty C, Maeda H, Parigi G, Prisner T, Ravera E, van Bentum J, Vega S, Webb A, Luchinat C, Schwalbe H, Frydman L (2015) Facing and overcoming sensitivity challenges in biomolecular NMR spectroscopy. Angew Chem Int Ed 54:9162–9185
Article Google Scholar
Avcilar-Kucukgoze I, Bartholomaus A, Cordero Varela JA, Kaml RF, Neubauer P, Budisa N, Ignatova Z (2016) Discharging tRNAs: a tug of war between translation and detoxification in Escherichia coli. Nucleic Acids Res 44:8324–8334
Article Google Scholar
Ayala I, Sounier R, Usé N, Gans P, Boisbouvier J (2009) An efficient protocol for the complete incorporation of methyl-protonated alanine in perdeuterated protein. J Biomol NMR 43:111–119
Article Google Scholar
Ayala I, Hamelin O, Amero C, Pessey O, Plevin MJ, Gans P, Boisbouvier J (2012) An optimized isotopic labelling strategy of isoleucine-γ₂ methyl groups for solution NMR studies of high molecular weight proteins. Chem Commun 48:1434–1436
Article Google Scholar
Banci L, Bertini I, Luchinat C, Mori M (2010) NMR in structural proteomics and beyond. Prog Nucl Magn Reson Spectrosc 56:247–266
Article Google Scholar
Bogan AA, Thorn KS (1998) Anatomy of hot spots in protein interfaces. J Mol Biol 280:1–9
Article Google Scholar
Brutscher B (2000) Principles and applications of cross-correlated relaxation in biomolecules. Concepts Magn Reson 12:207–229
Article Google Scholar
Campbell ID (2013) The evolution of protein NMR. Biomed Spectrosc Imaging 2:245–264
Google Scholar
Casteleijn MG, Urtti A, Sarkhel S (2013) Expression without boundaries: cell-free protein synthesis in pharmaceutical research. Int J Pharm 440:39–47
Article Google Scholar
Chaykovski MM, Bae LC, Cheng M-C, Murray JH, Tortolani KE, Zhang R, Seshadri K, Findlay JHBC., Hsieh S-Y, Kalverda AP, Homans SW, Brown JM (2003) Methyl side-chain dynamics in proteins using selective enrichment with a single isotopomer. J Am Chem Soc 125:15767–15771
Article Google Scholar
Feichtinger M, Sára T, Platzer G, Mateos B, Bokhovchuk F, Chène P, Konrat R (2018) ¹H, ¹³C, ¹⁵N resonance assignment of human YAP 50–171 fragment. Biomol NMR Assign 12:179–182
Article Google Scholar
Fischer M, Kloiber K, Häusler J, Ledolter K, Konrat R, Schmid W (2007) Synthesis of a ¹³C-methyl-group-labeled methionine precursor as a useful tool for simplifying protein structural analysis by NMR spectroscopy. ChemBioChem 8:610–612
Article Google Scholar
Gans P, Hamelin O, Sounier R, Ayala I, Durá MA, Amero CD, Noirclerc-Savoye M, Franzetti B, Plevin MJ, Boisbouvier J (2010) Stereospecific isotopic labeling of methyl groups for NMR spectroscopic studies of high-molecular-weight proteins. Angew Chem Int Ed 49:1958–1962
Article Google Scholar
Gardner KH, Kay LE (1998) The use of ²H, ¹³C, ¹⁵N multidimensional NMR to study the structure and dynamics of proteins. Annu Rev Biophys Biomol Struct 27:357–406
Article Google Scholar
Goto NK, Gardner KH, Mueller GH, Willis RC, Kay LE (1999) A robust and cost-effective method for the production of Val, Leu, Ile (δ₁) methyl-protonated ¹⁵N-, ¹³C-, ²H-labeled proteins. J Biomol NMR 13:369–374
Article Google Scholar
Güntert P (2009) Automated structure determination from NMR spectra. Eur Biophys J 38:129–143
Article Google Scholar
Guo C, Geng C, Tugarinov V (2009) Selective backbone labeling of proteins using {1,2-¹³C₂}-pyruvate as carbon source. J Biomol NMR 44:167–173
Article Google Scholar
Hansen AL, Kay LE (2014) Measurement of histidine pK_a values and tautomer populations in invisible protein states. Proc Natl Acad Sci USA 111:E1705-12
Article ADS Google Scholar
Hass MAS, Hansen DF, Christensen HEM, Led JJ, Kay LE (2008) Characterization of conformational exchange of a histidine side chain: protonation, rotamerization, and tautomerization of His61 in plastocyanin from Anabaena variabilis. J Am Chem Soc 130:8460–8470
Article Google Scholar
Herrmann KM (1995) The shikimate pathway as an entry to aromatic secondary metabolism. Plant Physiol 107:7–12
Article ADS Google Scholar
Hoogstraten CG, Johnson JE Jr (2008) Metabolic labeling: taking advantage of bacterial pathways to prepare spectroscopically useful isotope patterns in proteins and nucleic acids. Concepts Magn Reson 32A:34–55
Article Google Scholar
Ishima R, Louis JM, Torchia DA (2001) Optimized labeling of ¹³CHD₂ methyl isotopomers in perdeuterated proteins: potential advantages for ¹³C relaxation studies of methyl dynamics of larger proteins. J Biomol NMR 21:167–171
Article Google Scholar
Jung M, Philpott M, Müller S, Schulze J, Badock V, Eberspächer U, Moosmayer D, Bader B, Schmees N, Fernández-Montalván A, Haendler B (2014) Affinity map of bromodomain protein 4 (BRD4) interactions with the histone H4 tail and the small molecule inhibitor JQ1. J Biol Chem 289:9304–9319
Article Google Scholar
Kainosho M, Güntert P (2009) SAIL–stereo-array isotope labeling. Q Rev Biophys 42:247–300
Article Google Scholar
Kainosho M, Torizawa T, Iwashita Y, Terauchi T, Mei Ono A, Güntert P (2006) Optimal isotope labelling for NMR protein structure determinations. Nature 440:52–57
Article ADS Google Scholar
Kasinath V, Valentine KG, Wand AJ (2013) A ¹³C labeling strategy reveals a range of aromatic side chain motion in calmodulin. J Am Chem Soc 135:9560–9563
Article Google Scholar
Kasinath V, Fu Y, Sharp KA, Wand AJ (2015) A sharp thermal transition of fast aromatic-ring dynamics in ubiquitin. Angew Chem Int Ed 54:102–107
Article Google Scholar
Kemple MD, Yuan P, Nollet KE, Fuchs JA, Silva N, Prendergast FG (1994) ¹³C NMR and fluorescence analysis of tryptophan dynamics in wild-type and two single-Trp variants of Escherichia coli thioredoxin. Biophys J 66:2111–2126
Article Google Scholar
Kerfah R, Plevin MJ, Sounier R, Gans P, Boisbouvier J (2015) Methyl-specific isotopic labeling: a molecular tool box for solution NMR studies of large proteins. Curr Opin Struct Biol 32:113–122
Article Google Scholar
Konrat R (2014) NMR contributions to structural dynamics studies of intrinsically disordered proteins. J Magn Reson 241:74–85
Article ADS Google Scholar
Krishnarjuna B, Jaipuria G, Thakur A, D’Silva P, Atreya HS (2011) Amino acid selective unlabeling for sequence specific resonance assignments in proteins. J Biomol NMR 49:39–51
Article Google Scholar
Kumar A, Grace RCR, Madhu PK (2000) Cross-correlations in NMR. Prog NMR Spectrosc 37:191–319
Article Google Scholar
Lee AL, Urbauer JL, Wand AJ (1997) Improved labeling strategy for ¹³C relaxation measurements of methyl groups in proteins. J Biomol NMR 9:437–440
Article Google Scholar
LeMaster DM, Kushlan DM (1996) Dynamical mapping of E. coli thioredoxin via ¹³C NMR relaxation analysis. J Am Chem Soc 118:9255–9264
Article Google Scholar
Lichtenecker RJ (2014) Synthesis of aromatic ¹³C/²H-α-ketoacid precursors to be used in selective phenylalanine and tyrosine protein labelling. Org Biomol Chem 12:7551–7560
Article Google Scholar
Lichtenecker R, Ludwiczek ML, Schmid W, Konrat R (2004) Simplification of protein NOESY spectra using bioorganic precursor synthesis and NMR spectral editing. J Am Chem Soc 126:5348–5349
Article Google Scholar
Lichtenecker RJ, Coudevylle N, Konrat R, Schmid W (2013a) Selective isotope labelling of leucine residues by using α-ketoacid precursor compounds. ChemBioChem 14:818–821
Article Google Scholar
Lichtenecker RJ, Weinhäupl K, Reuther L, Schörghuber J, Schmid W, Konrat R (2013b) Independent valine and leucine isotope labeling in Escherichia coli protein overexpression systems. J Biomol NMR 57:205–209
Article Google Scholar
Lichtenecker RJ, Weinhäupl K, Schmid W, Konrat R (2013c) α-Ketoacids as precursors for phenylalanine and tyrosine labelling in cell-based protein overexpression. J Biomol NMR 57:327–331
Article Google Scholar
Lichtenecker RJ, Schörghuber J, Bisaccia M (2015) Synthesis of metabolic amino acid precursors: tools for selective isotope labeling in cell-based protein overexpression. Synlett 26:2611–2616
Article Google Scholar
Lin MT, Sperling LJ, Frericks Schmidt HL, Tang M, Samoilova RI, Kumasaka T, Iwasaki T, Dikanov SA, Rienstra CM, Gennis RB (2011) A rapid and robust method for selective isotope labeling of proteins. Methods 55:370–378
Article Google Scholar
Liu AM, Wong KF, Jiang X, Qiao Y, Luk JM (2012) Regulators of mammalian Hippo pathway in cancer. Biochim Biophys Acta 1826:357–364
Google Scholar
Lundström P, Teilum K, Carstensen T, Bezsonova I, Wiesner S, Hansen DF, Religa TL, Akke M, Kay LE (2007) Fractional ¹³C enrichment of isolated carbons using [1-¹³C]- or [2-¹³C]-glucose facilitates the accurate measurement of dynamics at backbone C^α and side-chain methyl positions in proteins. J Biomol NMR 38:199–212
Article Google Scholar
Lundström P, Hansen DF, Kay LE (2008) Measurement of carbonyl chemical shifts of excited protein states by relaxation dispersion NMR spectroscopy: comparison between uniformly and selectively ¹³C labeled samples. J Biomol NMR 42:35–47
Article Google Scholar
Lundström P, Lin H, Kay LE (2009a) Measuring ¹³C^β chemical shifts of invisible excited states in proteins by relaxation dispersion NMR spectroscopy. J Biomol NMR 44:139–155
Article Google Scholar
Lundström P, Vallurupalli P, Hansen DF, Kay LE (2009b) Isotope labeling methods for studies of excited protein states by relaxation dispersion NMR spectroscopy. Nat Protoc 4:1641–1648
Article Google Scholar
Marsh JA, Singh VK, Jia Z, Forman-Kay JD (2006) Sensitivity of secondary structure propensities to sequence differences between α- and γ-synuclein: implications for fibrillation. Protein Sci 15:2795–2804
Article Google Scholar
Milbradt AG, Arthanari H, Takeuchi K, Boeszoermenyi A, Hagn F, Wagner G (2015) Increased resolution of aromatic cross peaks using alternate ¹³C labeling and TROSY. J Biomol NMR 62:291–301
Article Google Scholar
Miyanoiri Y, Takeda M, Jee J, Ono AM, Okuma K, Terauchi T, Kainosho M (2011) Alternative SAIL-Trp for robust aromatic signal assignment and determination of the χ₂ conformation by intra-residue NOEs. J Biomol NMR 51:425–435
Article Google Scholar
Mondal S, Shet D, Prasanna C, Atreya HS (2013) High yield expression of proteins in E. coli for NMR studies. Adv Biosci Biotechnol 4:751–767
Article Google Scholar
Morgan W, Kragt A, Feeney J (2000) Expression of deuterium-isotope-labelled protein in the yeast Pichia pastoris for NMR studies. J Biomol NMR 17:337–347
Article Google Scholar
O’Grady C, Rempel BL, Sokaribo A, Nokhrin S, Dmitriev OY (2012) One-step amino acid selective isotope labeling of proteins in prototrophic Escherichia coli strains. Anal Biochem 426:126–128
Article Google Scholar
Ohki S-Y, Kainosho M (2008) Stable isotope labeling methods for protein NMR spectroscopy. Prog NMR Spectrosc 53:208–226
Article Google Scholar
Ollerenshaw JE, Tugarinov V, Skrynnikov NR, Kay LE (2005) Comparison of ¹³CH₃, ¹³CH₂D, and ¹³CHD₂ methyl labeling strategies in proteins. J Biomol NMR 33:25–41
Article Google Scholar
Pellecchia M, Sem DS, Wüthrich K (2002) NMR in drug discovery. Nat Rev Drug Discov 1:211–219
Article Google Scholar
Rajesh S, Nietlispach D, Nakayama H, Takio K, D.Laue E, Shibata T, Ito Y (2003) A novel method for the biosynthesis of deuterated proteins with selective protonation at the aromatic rings of Phe, Tyr and Trp. J Biomol NMR 27:81–86
Article Google Scholar
Ramaraju B, McFeeters H, Vogler B, McFeeters RL (2017) Bacterial production of site specific ¹³C labeled phenylalanine and methodology for high level incorporation into bacterially expressed recombinant proteins. J Biomol NMR 67:23–34
Article Google Scholar
Robson SA, Takeuchi K, Boeszoermenyi A, Coote PW, Dubey A, Hyberts S, Wagner G, Arthanari H (2018) Mixed pyruvate labeling enables backbone resonance assignment of large proteins using a single experiment. Nat Commun 9:356
Article ADS Google Scholar
Rowley D (1953) Inhibition of E. coli strains by amino acids. Nature 171:80–81
Article ADS Google Scholar
Sanchez R, Meslamani J, Zhou MM (2014) The bromodomain: from epigenome reader to druggable target. Biochim Biophys Acta 1839:676–685
Article Google Scholar
Schörghuber J, Sára T, Bisaccia M, Schmid W, Konrat R, Lichtenecker RJ (2015) Novel approaches in selective tryptophan isotope labeling by using Escherichia coli overexpression media. ChemBioChem 16:746–751
Article Google Scholar
Schörghuber J, Geist L, Bisaccia M, Weber F, Konrat R, Lichtenecker RJ (2017a) Anthranilic acid, the new player in the ensemble of aromatic residue labeling precursor compounds. J Biomol NMR 69:13–22
Article Google Scholar
Schörghuber J, Geist L, Platzer G, Konrat R, Lichtenecker RJ (2017b) Highly selective stable isotope labeling of histidine residues by using a novel precursor in E. coli-based overexpression systems. ChemBioChem 18:1487 – 1491
Article Google Scholar
Staunton D, Schlinkert R, Zanetti G, Colebrook SA, Campbell ID (2006) Cell-free expression and selective isotope labelling in protein NMR. Magn Reson Chem 44:S2-S9
Article Google Scholar
Takahashi H, Shimada I (2010) Production of isotopically labeled heterologous proteins in non-E. coli prokaryotic and eukaryotic cells. J Biomol NMR 46:3–10
Article Google Scholar
Takeda M, Ono AM, Terauchi T, Kainosho M (2010) Application of SAIL phenylalanine and tyrosine with alternative isotope-labeling patterns for protein structure determination. J Biomol NMR 46:45–49
Article Google Scholar
Takeuchi K, Sun Z-YJ, Wagner G (2008) Alternate ¹³C-¹²C labeling for complete mainchain resonance assignments using Cα direct-detection with applicability toward fast relaxing protein systems. J Am Chem Soc 130:17210–17211
Article Google Scholar
Teilum K, Brath U, Lundström P, Akke M (2006) Biosynthetic ¹³C labeling of aromatic side chains in proteins for NMR relaxation measurements. J Am Chem Soc 128:2506–2507
Article Google Scholar
Tong KI, Yamamoto M, Tanaka T (2008) A simple method for amino acid selective isotope labeling of recombinant proteins in E. coli. J Biomol NMR 42:59–67
Article Google Scholar
Torizawa T, Shimizu M, Taoka M, Miyano H, Kainosho M (2004) Efficient production of isotopically labeled proteins by cell-free synthesis: a practical protocol. J Biomol NMR 30:311–325
Article Google Scholar
Torizawa T, Ono AM, Terauchi T, Kainosho M (2005) NMR assignment methods for the aromatic ring resonances of phenylalanine and tyrosine residues in proteins. J Am Chem Soc 127:12620–12626
Article Google Scholar
Vassilev A, Kaneko KJ, Shu H, Zhao Y, DePamphilis ML (2001) TEAD/TEF transcription factors utilize the activation domain of YAP65, a Src/Yes-associated protein localized in the cytoplasm. Genes Dev 15:1229–1241
Article Google Scholar
Velyvis A, Ruschak AM, Kay LE (2012) An economical method for production of ²H, ¹³CH₃-threonine for solution NMR studies of large protein complexes: application to the 670 kDa proteasome. PLoS ONE 7:e43725
Article ADS Google Scholar
Vuister GW, Kim S-J, Wu C, Bax A (1994) 2D and 3D NMR study of phenylalanine residues in proteins by reverse isotopic labeling. J Am Chem Soc 116:9206–9210
Article Google Scholar
Wand AJ, Bieber RJ, Urbauer JL, McEvoy RP, Gan Z (1995) Carbon relaxation in randomly fractionally ¹³C-enriched proteins. J Magn Reson B 108:173–175
Article Google Scholar
Wang H, Janowick DA, Schkeryantz JM, Liu X, Fesik SW (1999) A method for assigning phenylalanines in proteins. J Am Chem Soc 121:1611–1612
Article Google Scholar
Weininger U (2017a) Site-selective ¹³C labeling of histidine and tryptophan using ribose. J Biomol NMR 69:23–30
Article Google Scholar
Weininger U (2017b) Site-selective ¹³C labeling of proteins using erythrose. J Biomol NMR 67:191–200
Article Google Scholar
Weininger U, Diehl C, Akke M (2012a) ¹³C relaxation experiments for aromatic side chains employing longitudinal- and transverse-relaxation optimized NMR spectroscopy. J Biomol NMR 53:181–190
Article Google Scholar
Weininger U, Liu Z, McIntyre DD, Vogel HJ, Akke M (2012b) Specific ¹²C^βD₂ ¹²C^γD₂S¹³C^εHD₂ isotopomer labeling of methionine to characterize protein dynamics by ¹H and ¹³C NMR relaxation dispersion. J Am Chem Soc 134:18562–18565
Article Google Scholar
Weininger U, Respondek M, Löw C, Akke M (2013) Slow aromatic ring flips detected despite near-degenerate NMR frequencies of the exchanging nuclei. J Phys Chem B 117:9241–9247
Article Google Scholar
Yang CJ, Takeda M, Terauchi T, Jee J, Kainosho M (2015) Differential large-amplitude breathing motions in the interface of FKBP12-drug complexes. Biochemistry 54:6983–6995
Article Google Scholar
Zeng L, Zhou M-M (2002) Bromodomain: an acetyl-lysine binding domain. FEBS Lett 513:124–128
Article Google Scholar
Zhang Y, Cao H, Liu Z (2015) Binding cavities and druggability of intrinsically disordered proteins. Protein Sci 24:688–705
Article Google Scholar

Download references

Acknowledgements

Open access funding provided by University of Vienna. This work was supported by a scholarship of the uni:docs program from the University of Vienna (J. Schörghuber). L. Geist and G. Platzer were funded by the Christian Doppler Laboratory for High-Content Structural Biology and Biotechnology, Austria. The financial support by the Austrian Federal Ministry of Science, Research and Economy and the National Foundation for Research, Technology and Development is gratefully acknowledged.

Author information

Authors and Affiliations

Institute of Organic Chemistry, University of Vienna, Währinger Str. 38, 1090, Vienna, Austria
Julia Schörghuber, Marilena Bisaccia, Lukas Scheibelberger, Frederik Weber & Roman J. Lichtenecker
Christian Doppler Laboratory for High-Content Structural Biology and Biotechnology, Department of Structural and Computational Biology, Max F. Perutz Laboratories, University of Vienna, Dr-Bohr-Gasse 9, 1030, Vienna, Austria
Leonhard Geist, Gerald Platzer, Michael Feichtinger & Robert Konrat

Authors

Julia Schörghuber
View author publications
You can also search for this author in PubMed Google Scholar
Leonhard Geist
View author publications
You can also search for this author in PubMed Google Scholar
Gerald Platzer
View author publications
You can also search for this author in PubMed Google Scholar
Michael Feichtinger
View author publications
You can also search for this author in PubMed Google Scholar
Marilena Bisaccia
View author publications
You can also search for this author in PubMed Google Scholar
Lukas Scheibelberger
View author publications
You can also search for this author in PubMed Google Scholar
Frederik Weber
View author publications
You can also search for this author in PubMed Google Scholar
Robert Konrat
View author publications
You can also search for this author in PubMed Google Scholar
Roman J. Lichtenecker
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Roman J. Lichtenecker.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Schörghuber, J., Geist, L., Platzer, G. et al. Late metabolic precursors for selective aromatic residue labeling. J Biomol NMR 71, 129–140 (2018). https://doi.org/10.1007/s10858-018-0188-z

Download citation

Received: 28 February 2018
Accepted: 19 May 2018
Published: 28 May 2018
Issue Date: July 2018
DOI: https://doi.org/10.1007/s10858-018-0188-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Late metabolic precursors for selective aromatic residue labeling

Abstract

Similar content being viewed by others

Anthranilic acid, the new player in the ensemble of aromatic residue labeling precursor compounds

Towards cost-effective side-chain isotope labelling of proteins expressed in human cells

Non-canonical amino acid labeling in proteomics and biotechnology

Introduction

Precursor identification

Precursor synthesis

Precursor uptake and labeling selectivity