Introduction

Labeling of proteins with synthetic probes in living cells is an important approach to study protein function (Gronemeyer et al. 2006). However, radioactive and fluorescent probes need complex chemical modification and protein purification procedures, leading only in few cases to a specific labeling of proteins. The introduction of fusion polypeptides (tags), which allow to simplify and optimize purification methods (affinity tags) and/or detection procedures (by using specific antibodies) has been fundamental. Nevertheless, most tags do not satisfy the need of the in vivo specific detection of proteins of interest. The discovery of autofluorescent proteins as the Aequorea victoria green fluorescent protein (GFP) made possible the expansion of the studies on fusion proteins in several applicative fields of cellular and molecular biology (Chalfie et al. 1994; Tsien 1998). Despite their wide use in the last decades, GFP tag and its derivatives have some disadvantages and/or limitations: (1) their relative big dimensions (ca. 230 amino acids) in some cases might affect the target protein’s function and localization; (2) the isolation of the natural fluorophore in the active site makes GFPs and variants partially sensitive to environmental changes, as pH (Ashby et al. 2004; Campbell and Choy 2000); (3) because the formation of the internal fluorophore is O2-dependent, GFPs are not suitable tags in applications requiring anaerobic conditions (anaerobic organisms and tumor cells in particular stages); (4) the general use of most protein-tags is restricted to mesophiles and mild reaction conditions. Recently, thermostable GFPs variants have been developed as tags for thermophilic microorganisms (Aliye et al. 2015; Cava et al. 2008; Pédelacq et al. 2006); however, these tools, again, suffer from most of the same limitations listed above for their mesophilic counterparts.

O 6-alkylguanine-DNA-alkyl-transferases (AGTs or MGMTs, EC: 2.1.1.63) are ubiquitous proteins involved in the direct repair of alkylation damage in DNA. By a one-step SN2 reaction mechanism, they specifically transfer the alkylic group from the damaged base (O 6-alkylguanine or O 4-alkylthymine) to a cysteine residue in their own active site (Daniels et al. 2000, 2004; Fang et al. 2005; Pegg 2011; Tubbs et al. 2007; Yang et al. 2009) (Fig. 1a). AGTs are called suicide or kamikaze proteins because of the 1:1 stoichiometry of their reaction, leading to an irreversible inactivation and destabilization of the alkylated form of the protein (Fang et al. 2005; Yang et al. 2009). In 2003, the group of Kai Johnsson, studying the human AGT (hAGT), investigated the possibility to employ this protein as protein-tag, using benzyl-guanine-derivative substrates, also accepted by the active site of the protein (Keppler et al. 2003). The covalent and irreversible nature of the alkylated form of hAGT and its small size (ca. 20.0 kDa) allowed new biotechnological approaches, making it an interesting tool for in vivo and in vitro specific labeling of proteins, when fused to a protein of interest (Gautier et al. 2008; Keppler et al. 2003, 2004). A commercial variant of the hAGT (SNAP-tag, New England Biolabs) is already available to be covalently labelled with a large number of chemical groups conjugated with O 6-benzyl-guanine (BG-derivative) (Fig. 1b).

Fig. 1
figure 1

a Direct repair of alkylation DNA damage by AGTs: reaction mechanism of the irreversible transfer of the alkyl group from the damaged guanine to a cysteine in the active site. b A variant of the human AGT (SNAP-tag) can react in vitro and/or in vivo with BG-derivative substrates, leading to the labeling of a fused protein of interest (P) with a desired chemical group (L)

However, no thermophilic AGTs have been developed as biotechnological tools to date, making the SNAP-tag technology suitable only for mesophilic organisms and mild reaction conditions. Hence, a thermostable version of AGT could be useful for the development of protein-tags for organisms growing under harsh reaction conditions. We previously characterized an AGT from the thermoacidophilic archaeon Sulfolobus solfataricus (SsOGT), which has the same catalytic efficiency of hAGT with fluorescent BG-derivatives, but displays a marked stability over a wide range of temperature, pH, ionic strength and to common denaturing agents (Perugino et al. 2012). Here, we present the use of an engineered variant of the thermostable SsOGT protein, which lacks DNA binding activity (Perugino et al. 2012), as a protein-tag for extremophilic organisms.

Materials and methods

Reagents

All chemicals were purchased from Sigma-Aldrich, SNAP-Vista Green and CLIP-FI fluorescent substrates (referred to BG-FL and BC-FL throughout, respectively) were from New England Biolabs (Ipswich, MA). Synthetic oligonucleotides were from Primm (Milan, Italy) and listed in Table 1, Pfu DNA polymerase were from Stratagene (La Jolla, CA), as well as Escherichia coli ABLE C strain. Thermus thermophilus HB27EC strain is a derivative of DSM 7039 with enhanced natural competence because of a knockout mutation in the argonaute protein (Swarts et al. 2014). Protein concentration was determined by the BioRad protein assay kit (Bio-Rad Pacific), using BSA as standard.

Table 1 List of oligonucleotides used in this study

DNA constructs

The DNA fragment of the beta-glycosidase from S. solfataricus gene (lacS) was PCR-amplified using the pGEX-K-Gly construct as template (Moracci et al. 1996), and by using the LacS-fwd and LacS-rev oligonucleotides (Table 1), which possess an internal Pst I site. This allowed the ligation of this gene downstream and in frame with the ogtH 5 gene in the pQE-ogtH 5 construct (Perugino et al. 2012), and leading to the pQE-ogtH 5-lacS plasmid for the heterologous expression of the SsOGT-H5-Ssβgly fusion protein. pMK-ogtH 5 for the heterologous and constitutively expression in T. thermophilus of the SsOGT-H5 mutant (referred to H5 throughout) was obtained by multiple rounds of PCR amplification: the PlspA-fwd/OP-rev and PO-fwd/Ogt3′ oligonucleotides pairs were first used to amplify the PlspA promoter and the ogtH 5 gene DNA fragments, respectively. In a final round of PCR, the former two DNA fragments were fused to each other by the complementary nature of the PO-fwd and OP-rev oligonucleotides (Table 1), and amplified by using the external PlspA-fwd/Ogt3′ oligonucleotides pair. Finally, the obtained DNA fragment was ligated in the multi-cloning site of the pMK184 (Cava et al. 2007) vector by using the EcoR I/Hind III sites. For all the obtained constructs, regions encoding the cloned genes were verified by DNA sequencing (Primm, Milan, Italy).

Protein expression and purification

H5 and H5-Ssβgly were expressed in E. coli ABLE C and purified by His6-tag affinity chromatography, as described (Perugino et al. 2012, 2015), with the exception of a further purification step for the fusion protein: briefly, to remove E. coli contaminants, the pool of the eluted fractions from the affinity chromatography was incubated for 20 min at 70 °C, and centrifuged for 30 min at 30,000g. The soluble fraction was dialysed against PBS 1× buffer (phosphate buffer 20 mM, NaCl 150 mM, pH 7.3) and aliquots stored at −20 °C. To assess the purity of the protein samples and determine their concentrations, SDS-PAGE and Bio-Rad protein assay were performed, respectively.

In vitro and in vivo alkyl-transferase assay

The activity of the purified proteins wild-type SsOGT and H5, alone or in fusion with Ssβgly, was measured by a fluorescent-based assay using BG-FL and BC-FL substrates in standard conditions, as previously described (Miggiano et al. 2013; Perugino et al. 2012, 2015). Briefly, in a total volume of 10 μL, 5.0 μM of protein (0.1 mg mL−1) was incubated with 20 μM of the substrate in 1× Fluo Reaction Buffer (50 mM phosphate, 0.1 M NaCl, 1.0 mM DTT, pH 6.5) at different temperatures and times, as indicated. Reactions were stopped by denaturing and loading samples on SDS-PAGE, followed by fluorescence imaging analysis on a VersaDoc 4000 system (Bio-Rad) by applying as excitation/emission parameters a blue LED/530 bandpass filter, respectively. For the determination of the second-order rate constants, assuming the irreversible mechanism with 1:1 substrate/enzyme binding, fluorescence intensity data were corrected for the amount of loaded protein (by coomassie staining analysis), in order to estimate the relative amount of covalently modified protein in time-course experiments. (Gautier et al. 2008; Miggiano et al. 2013; Perugino et al. 2012, 2015). For the in vivo assay, transformed cells of T. thermophilus HB27EC strain with the pMK184 and pMK-ogtH 5 plasmids were grown at 60 °C in TB selective medium (tryptone 8 g L−1, yeast extract 4 g L−1, NaCl 3 g L−1, kanamycin 30 mg L−1; in mineral water, pH 7.5) as late as stationary phase (O.D.600 nm >1.5) (Cava et al. 2009). Cell pellets from an appropriate volume (typically 1.0 mL) were resuspended in 0.1 mL of TB medium in the presence of 3.0 μM of BG-FL and incubated at several temperatures for different times, as indicated. After the reaction, cells were first washed twice with 1.0 mL of TB medium, then denatured for 10 min at 110 °C in O’Farrell 1× buffer supplemented with EDTA 10 mM, and finally loaded on SDS-PAGE for the analysis as described above.

Protein stability analysis

The stability of the wild-type and H5 was determined by Differential Scan Fluorimetry (DSF), following an adaptation of a protocol previously described (Niesen et al. 2007; Perugino et al. 2015): samples containing 25 μM of each protein (0.5 mg mL−1) in PBS 1× buffer and SYPRO Orange dye 1× were subjected to a temperature scanning from 20 to 95 °C at 0.2 °C min−1 (5 min per cycle with an increase of 1 °C per cycle) in a Real-Time Light Cycler (Bio-Rad). Relative fluorescence data were normalized to the maximum fluorescence value within each scan. Finally, plots of fluorescence intensity as a function of temperature showed increasing fluorescence sigmoidal curves (which described a two-state transition), leading to the determination of the inflection points, indicating Tm values, as described by the Boltzmann equation (Niesen et al. 2007). Stability was also tested by protease attack resistance and by in vivo protein degradation in T. thermophilus cells. In the first analysis, in a total volume of 10 μL, 1.0 μM of each protein (0.02 mg mL−1) was incubated for 2 h at 25 °C in the presence of trypsin protease at different substrate/enzyme weight ratio, as indicated. Reactions were then denatured for 5 min at 100 °C in O’Farrell 1× buffer and immediately loaded on SDS-PAGE (0.2 μg per lane). Gels were subjected to Western Blotting analysis by using an anti-SsOGT antibody, as previously described (Perugino et al. 2012). For the preparation of the fluoresceinated form of the wild-type and H5, before the trypsin treatment, a reaction with BG-FL for 2.0 h at 25 °C was performed, followed by a gel fluorescence imaging to assess the complete labeling. To determine the stability of alkylated SsOGT in T. thermophilus HB27EC strain, we followed the same procedure described above to assay the protein activity; after incubation with BG-FL and washing, cells were incubated at 70 °C for different times, followed by denaturation, SDS-PAGE and fluorescence imaging analysis.

Beta-glycosidase assay

Glycoside hydrolytic activity of H5-Ssβgly was detected in vivo on LB Agar selective medium in the presence of the chromogenic substrate X-gal (Sambrook et al. 1989), as well as by calculating the steady-state kinetic constants with 4-nitrophenyl-β-d-galactopyranoside (4Np-gal) and 4-nitrophenyl-β-d-glucopyranoside (4Np-glc), by using a Cary E1 spectrophotometer (Varian) and following the same procedure used for the Ssβgly enzyme (Moracci et al. 1996, 1998).

Data analysis and softwares

Analysis of the solved 3D structures of the wild-type and the methylated DNA::SsOGT-C119A complex, as well as the modeling of the methylated DNA::H5 complex, were performed by using the MacPyMOL freeware (DeLano Scientific FCC). Data from activity and stability assays were fitted to appropriate equations by using the Prism Software Package (GraphPad Software) and GraFit 5.0 Data Analysis Software (Erithacus Software).

Results

Mutations at the HTH motif affect the DNA-binding by SsOGT

The 3D structure of SsOGT, recently solved by X-ray crystallography, is overall very similar to that of all other AGTs and consists of two domains connected by a loop (Perugino et al. 2015) (Fig. 2a). The role of the Nter domain of these proteins is not completely understood and is likely involved in the protein overall stability (Perugino et al. 2015) as well as activity (Miggiano et al. 2013), whereas the highly conserved Cter domain contains all the elements for the recognition and the repair of the damaged guanine. In contrast to most DNA binding proteins, AGTs contact the DNA via the minor groove through their helix turn helix (HTH) motif (Daniels et al. 2004) (Fig. 2b).

Fig. 2
figure 2

3D structure of: a free SsOGT (PDB ID: 4ZYE) and b the methylated DNA::SsOGT-C119A complex (PDB ID: 4ZYD)

We previously prepared a mutant version of SsOGT (H5), whose mutations in five conserved residues of the HTH motif (S100A, R102A, G105K, M106T, K110E; zoom in Fig. 2b) impair dramatically the DNA binding capacity of the protein (Perugino et al. 2012). The availability of the crystal structure of the methylated DNA bound SsOGT (Perugino et al. 2015) allowed us to obtain a reliable model of the H5-DNA complex 3D structure (Fig. 3), which showed that: i) the substitution of the positively charged K110 with a glutamic acid residue could result in repulsive effects toward DNA phosphate backbone; ii) the replacement of the arginine finger R102 by an alanine residue might hamper the stabilization of the transiently unpaired cytosine opposite the methylated G (Kanugula et al. 1995; Liu et al. 2007; Perugino et al. 2012; Zang et al. 2005); iii) the lysine in place of the G105 residue could produce a dramatic steric hindrance and collide with the phosphate backbone of the DNA strand close to the O 6-MG. This analysis provided a structural explanation of the complete inability of H5 to bind DNA, which is an important pre-requisite for its application as a protein-tag.

Fig. 3
figure 3

Comparison of the crystal structure of methylated-DNA::SsOGT-C119A complex of the SsOGT-C119A (gray) with a 3D model of the same DNA molecule in complex with the H5 mutant (cyan): DNA, O 6-MG and residues involved in the mutagenesis are represented as ball and sticks. A spheric representation in transparency in the SsOGT-C119A and H5 for the five residues involved in the mutagenesis is also shown. Atoms are colored by the CPK convention, whereas O 6-MG and the unpaired cytosine are colored in yellow and cyan, respectively

Activity and stability of H5

Previous biochemical analysis suggested that H5 might be a good candidate thermostable protein-tag because: i) it retained the catalytic activity on the fluorescent BG-FL substrate, and, like SsOGT, showed significant activity over a broad range of reaction conditions (Perugino et al. 2012); ii) the absence of any structural Zn2+ ion in the SsOGT structure (Perugino et al. 2015), makes it insensitive to the presence of chelating agents (EDTA up to 10 mM; Perugino et al. 2012). Furthermore, H5 displays a significant increase of the activity at any temperature tested, if compared to the wild-type SsOGT (Perugino et al. 2012): the second-order rate constant value at 25 °C on a BG-derivative substrate was one order of magnitude higher than the wild-type (Fig. 4a; Table 2). This result suggests that the replaced residues in the HTH motif mutated in H5 play a role in the overall flexibility of the protein during the protein-substrate complex formation.

Fig. 4
figure 4

a Time-course of wild-type (Perugino et al. 2012) and H5 under standard conditions with 20 μM of SNAP-Vista Green (BG-FL) at 25 °C. Analysis and quantification of protein activity, as well as second-order rate constant values determination were performed as previously described (Perugino et al. 2012), and shown in Table 2. b Stability of H5 in comparison with the wild-type protein by DSF analysis, as described in “Materials and methods”. T m values were obtained by measuring the relative fluorescence intensity of each protein from three independent experiments, as a function of temperature. c Resistance to protease attack of the wild-type SsOGT and H5: proteins (ca. 0.2 μg per lane) were incubated for 2 h at 25 °C in the presence of different amounts of trypsin, as indicated. After SDS-PAGE, gels were analyzed by fluorescence imaging and then subjected to western blot analysis using a polyclonal anti-SsOGT antibody (Perugino et al. 2012). Fluorescein-labelled forms of wild-type and H5 are indicated as wt-FL and H5-FL, respectively. d In vivo stability of fluoresceinated H5: after protein labeling procedure with BG-FL (see “Materials and methods”), intact T. thermophilus cells were incubated at 70 °C for different times as indicated. Acrylamide gel was analyzed by fluorescence imaging and coomassie staining (to correct fluorescence for the amount of loaded proteins)

Table 2 Second order rate constant values of wild-type and H5 under standard conditions on the BG-FL substrate

The substrate specificity of the wild-type and H5 proteins was evaluated by reactions in the presence of a benzyl-cytosine-derivative substrate (BC-FL): the failure of complete labeling hampered the determination of second-order rate constants (data not shown), confirming that alkylated cytosines are poor substrates for natural AGTs.

Previous experiments showed that H5 was only marginally less stable than the wild-type (Perugino et al. 2012). To compare the stability of the H5 protein with that of the SNAP-tag, we performed quantitative analysis by DSF. We first followed the protocol used for SNAP-tag (ramping of 1 %: 1 min per cycle with an increase of 1 °C per cycle; Mollwitz et al. 2012), leading to a T m value for the SNAP-tag protein of ca. 67 °C. However, these conditions were not sufficient to induce the H5 protein denaturation, since no fluorescence change was obtained up to 95 °C, as also observed for the wild-type protein (data not shown). Melting temperature values were achieved only increasing the incubation times by heating samples from 20 to 95 °C with a ramping of 0.2 % (5 min per cycle with an increase of 1 °C per cycle). Under these conditions, we calculate a T m value of 75 °C (Fig. 4b; Perugino et al. 2015). Although this value is comparable to that obtained for SNAP-tag (67 °C; Mollwitz et al. 2012), the stronger conditions used in the DSF analysis for H5 clearly indicate that this protein is more stable than its mesophilic counterpart. Stability was also analyzed by the intrinsical resistance to protease attack of free and labelled form of both wild-type and H5. An incubation for 2 h at 25 °C in the presence of different amounts of trypsin protease was performed (Fig. 4c). Due to their intrinsically stability, both proteins were unaffected by the action of the trypsin in most of the conditions tested (from 1:16 to 1:2 E:S ratio), notably higher that those used for hAGT (Kanugula et al. 1998; Mollwitz et al. 2012). To observe a reasonable reduction of the western blot signals, a very high amount of protease was needed (1:1 and 2:1 E:S ratio): as expected, alkylated (fluoresceinated) forms of both proteins were less stable than their unlabelled counterparts. Again, a slight difference between the wild-type and H5 proteins was found at high E:S ratios; overall data confirmed the general stability and resistance to protease attack of these thermostable AGTs.

Application of H5 as protein-tag

In order to test the possibility to use H5 as a protein-tag, we used as a model the S. solfataricus lacS gene coding for a thermophilic β-glycosidase (Ssβgly; Moracci et al. 1996); previous studies demonstrated that tags linked at the N-terminus of Ssβgly (such as a GST-tag) did not affect the overall folding of the latter (Moracci et al. 1996, 1998). The lacS gene was fused downstream to and in frame with the ogtH 5 gene, for the expression of the H5-Ssβgly fusion protein in the E. coli ABLE C lac () strain. Blue colonies on X-gal LB agar solid medium appeared, indicating the presence of a hexogenous β-galactosidase activity, and confirming that Ssβgly is active also as fusion protein even at mesophilic temperatures (Fig. 5a). Purification of the fusion protein by affinity chromatography through a nickel column was performed, exploiting the presence of a His6-tag at the N-terminus of the H5 moiety. Interestingly, the presence of Ssβgly did not interfere with the H5 activity, making it possible to follow all purification steps by SDS-PAGE and fluorescence imaging of the samples, previously incubated with the BG-FL substrate. Finally, we were able to calculate the kinetic constants of the hydrolytic activity of H5-Ssβgly at high temperatures, which were comparable to those previously described for the Ssβgly enzyme (Table 3) (Moracci et al. 1996). Importantly, the utilization of the thermostable protein-tag enabled us to proceed with a thermoprecipitation treatment to further purify the pool of affinity chromatography eluted fractions (Fig. 5b, lane 5), and to perform kinetic analysis on a thermophilic enzyme at high temperatures without the need to remove the tag.

Fig. 5
figure 5

a E. coli ABLE C strain was transformed by using pQE-ogtH 5 (Perugino et al. 2012) and the pQE-ogtH 5-lacS plasmids. Transformed cells were plated on LB agar in the presence of ampicillin and the X-gal chromogenic substrate. b SDS-PAGE of the expression and purification by His6-tag affinity chromatography of the H5-Ssβgly fusion protein: 1 cell free extract of E. coli ABLE C/pQE-ogtH 5-lacS; 2 protein molecular weight marker; 3 2.0 μg of H5 protein; 4 pool of fractions eluted by imidazole; 5 soluble fraction of the sample of lane 4, after thermal treatment (see “Materials and methods”); 6 whole E. coli ABLE C cells transformed with the pQE31 empty vector (Qiagen). All samples, except for the protein marker, were incubated with the BG-FL substrate under standard conditions, and loaded on SDS-PAGE. After, the gel was exposed for the fluorescence imaging analysis (left) and then stained by coomassie blue (right), as described in “Materials and Methods”. The high molecular weight band marked by an asterisk correspond to a partially denatured form of the fusion protein, which is particularly resistant to denaturation, as previously shown for the tetrameric Ssβgly (Moracci et al. 1996, 1998)

Table 3 Steady state kinetic constants of Ssβgly and the H5-Ssβgly fusion protein for the hydrolysis of chromogenic substrates at 65 °C in 50 mM phosphate buffer pH 6.5

Heterologous expression of H5 in T. thermophilus

Thermus thermophilus HB27EC was chosen as model thermophilic cell host to test our protein-tag because it is a naturally agt () strain, thus lacking any endogenous DNA-alkyl-transferase activity. It encodes a single AGT-like protein (ATL), which is inactive in DNA repair, but involved in the recruiting of nucleotide excision repair (NER) proteins (Morita et al. 2008). The ogtH 5 gene was cloned in the pMK184 plasmid (Fig. 6a) (Cava et al. 2007), a shuttle vector containing: i) both E. coli and T. thermophilus HB27 replication origins; ii) the kat gene for growth in kanamycin selective medium; iii) a multi-cloning site (MCS) upstream of the α-lacZ gene, allowing E. coli white/blue colony screening on X-gal containing plates. As in E. coli cells (Perugino et al. 2012), it was possible to check the in vivo activity of H5, by directly incubating intact cells in the TB medium supplemented with the BG-FL substrate. To test the permeability of T. thermophilus (a bacterium with a complex cell envelope, including an outer membrane) to this substrate, we incubated at different temperatures and also in the presence of non-toxic organic solvents, such as DMSO, to improve cell permeabilization. Figure 6b shows fluorescent signals at the same molecular weight of the purified H5 protein in whole cell extracts from TthHB27/pMK-ogtH 5 transformants, whereas no signals were seen in cells transformed with the empty plasmid (TthHB27/pMK184), confirming the lack of any endogenous alkyltransferase activity. The maximal activity was obtained at 60 and 70 °C, likely due to the higher permeability of the cells at their physiological temperatures concomitant with an increase of activity of the thermophilic H5 protein (Fig. 6, lanes 5, 6 and 9). The presence of organic solvents impaired the protein activity and/or the substrate permeability, as suggested by the absence of the unreacted substrate at the bottom of the gel at the highest concentration of DMSO (lane 8 in Fig. 6). As for most of model organisms used, BG-FL molecule resulted non-toxic for T. thermophilus HB27EC cells, as indicated by a TB agar plate (“spot”) assay (data not shown) (Wilkinson et al. 2012).

Fig. 6
figure 6

a pMK-ogtH 5 plasmid used to transform T. thermophilus HB27EC strain. b Whole transformed cells were incubated in TB medium in the presence of 3.0 μM of BG-FL, at temperatures, times and DMSO concentration as indicated. Entry 1: 1.0 μg of purified H5 protein. All samples were treated as described in Fig. 5

Moreover, the fluoresceinated form of H5 was reasonably stable in vivo in T. thermophilus cells, showing a slow fluorescence signal decay: ca. 50 % of the protein was still detectable after 24 h at high temperatures (Fig. 4d), a time span compatible with most of the experiments performed by using the SNAP-tag technology in vivo.

Discussion

SNAP-tag technology is rising as a powerful alternative approach to GFP-tags for the in vivo labeling of protein of interest. Despite the need to introduce an external substrate for the labeling, this method offers a lot of advantages, mainly the specificity of the reaction and the possibility to label a protein with an infinite number of chemical groups, if conjugated to a benzyl-guanine, a classical inhibitor of hAGT and a suitable substrate of its commercially available variant (Hinner and Johnsson 2010).

The discovery of a BG-sensitive thermostable OGT from the thermoacidophilic Archaea S. solfataricus opened the possibility to widen the SNAP-tag technology to organisms which thrive in extreme environments. Because of the intrinsic nature of proteins from hyperthermophiles, SsOGT displayed a strong stability under several harsh conditions, as extremes of pH, ionic strength, temperatures, presence of organic solvents (Perugino et al. 2012), and protease treatments. Mutagenesis at the expense of the HTH motif allowed the construction of a variant of this enzyme (H5), which retained most of the advantages of the native protein in terms of stability, and showed even enhanced activity at lower temperatures. We showed that H5 is correctly folded, expressed, functionally active and stable in both mesophilic E. coli and thermophilic T. thermophilus hosts. In order to obtain a smaller version of SsOGT, a preliminary attempt to reduce the wt protein by limited proteolysis was performed, obtaining a truncated polypeptide (ca. 14.0 kDa) which was still active in the presence of BG-FL substrate. Gel purification and N-terminal sequence analysis revealed the loss of the first 36 aminoacids. However, this “mini SsOGT” protein could not be obtained by heterologous expression in E. coli (data not shown), suggesting that the first 36 aminoacids are important for the correct folding of the protein during translation. Interestingly, inspection of the 3D structure revealed the presence of a disulfide bond at the Nter (C29-C31), which is important for the protein stability (Perugino et al. 2015).

The analysis of the H5-Ssβgly fusion protein demonstrated that it is possible to study both mesophilic and thermophilic proteins/enzymes fused to H5 under their own physiological conditions, without the need to remove the tag. These properties, together with the complete abolition of binding to DNA, make H5 a robust alternative as protein-tag for the application of the SNAP-tag technology for in vivo (protein–protein interaction, in situ localization, FRET experiment in combination with thermostable GFP variants, etc.) and in vitro studies (heterologous expression and purification of proteins of interest in extremophilic organisms). Finally, the high substrate specificity toward BG-derivatives makes H5 an interesting starting point to be modified by molecular evolution in order to obtain a thermostable variant active on BC-derivative substrates (like the commercial CLIP-tag): this orthogonal substrate specificity will allow simultaneously and specifically the labeling of different molecular targets in living cells (Gautier et al. 2008).