The TFE-induced transient native-like structure of the intrinsically disordered $$\varvec\sigma_{ 4}^{ 70}$$ domain of Escherichia coli RNA polymerase

Kaczka, Piotr; Winiewska, Maria; Zhukov, Igor; Rempoła, Bożenna; Bolewska, Krystyna; Łoziński, Tomasz; Ejchart, Andrzej; Poznańska, Anna; Wierzchowski, Kazimierz L.; Poznański, Jarosław

doi:10.1007/s00249-014-0987-4

The TFE-induced transient native-like structure of the intrinsically disordered $\varvec\sigma_{ 4}^{ 70}$ domain of Escherichia coli RNA polymerase

Original Paper
Open access
Published: 27 September 2014

Volume 43, pages 581–594, (2014)
Cite this article

Download PDF

You have full access to this open access article

European Biophysics Journal Aims and scope Submit manuscript

The TFE-induced transient native-like structure of the intrinsically disordered $\varvec\sigma_{ 4}^{ 70}$ domain of Escherichia coli RNA polymerase

Download PDF

Piotr Kaczka¹,
Maria Winiewska¹,
Igor Zhukov¹,
Bożenna Rempoła¹,
Krystyna Bolewska¹,
Tomasz Łoziński¹,
Andrzej Ejchart¹,
Anna Poznańska²,
Kazimierz L. Wierzchowski¹ &
…
Jarosław Poznański¹

2126 Accesses
8 Citations
Explore all metrics

Abstract

The transient folding of domain 4 of an E. coli RNA polymerase $\sigma^{70}$ subunit ($r{\text{EC}}\sigma_{4}^{70}$) induced by an increasing concentration of 2,2,2-trifluoroethanol (TFE) in an aqueous solution was monitored by means of CD and heteronuclear NMR spectroscopy. NMR data, collected at a 30 % TFE, allowed the estimation of the population of a locally folded $r{\text{EC}}\sigma_{4}^{70}$ structure (CSI descriptors) and of local backbone dynamics (¹⁵N relaxation). The spontaneous organization of the helical regions of the initially unfolded protein into a TFE-induced 3D structure was revealed from structural constraints deduced from ¹⁵N- to ¹³C-edited NOESY spectra. In accordance with all the applied criteria, three highly populated α-helical regions, separated by much more flexible fragments, form a transient HLHTH motif resembling those found in PDB structures resolved for homologous proteins. All the data taken together demonstrate that TFE induces a transient native-like structure in the intrinsically disordered protein.

Atomistic structural ensemble refinement reveals non-native structure stabilizes a sub-millisecond folding intermediate of CheY

Article Open access 08 March 2017

NMR characterization of solvent accessibility and transient structure in intrinsically disordered proteins

Article Open access 11 July 2019

Determining Transient Nucleic Acid Structures by NMR

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Intrinsically disordered proteins (IDPs) are involved in the essential processes of the living cell (Pancsa and Tompa 2012). Their sequences are generally enriched in polar and charged residues, but deficient in hydrophobic groups (Uversky 2002, 2011; Das and Pappu 2013). Free IDPs in a solution are highly flexible and do not display a stable secondary/tertiary structure (Dyson and Wright 2005; Tompa 2002). Under physiological conditions, IDPs sample a large conformational space, and some of the conformers, called a transient secondary structure, may resemble structured elements (Tompa 2005). In order to access the relationship between the structural dynamics of IDPs and the function, a continuum of thermodynamically accessible states has to be characterized in terms of both conformational free energy and interconversion time scales. Hence, a global and/or local flexibility provides the structural plasticity of IDPs, facilitating specific interactions with various partners (Wright and Dyson 2009), although the thermodynamics of partner-induced IDP folding must be linked directly to the intrinsic conformational properties (Uversky 2002, 2011; Das and Pappu 2013; Dima and Thirumalai 2004). IDPs, among others, are prominently represented in proteins associated with mRNA processing (Fu 1995; Buljan et al. 2012), apoptosis (Rautureau et al. 2010), signal transduction (Galea et al. 2008; Moncoq et al. 2003) and transcription regulation (Tompa 2002; Tompa 2005; Liu et al. 2006; Ozbudak et al. 2002; Wright and Dyson 1999).

In bacteria, two forms of RNA polymerase (RNAP), called holoenzyme (composed of α2ββ’ωσ subunits of a total weight of ~450 kDa) and a core (composed of α2ββ’ω subunits; ~400 kDa), are capable of transcribing DNA, but only the holoenzyme initiates transcription solely at the promoter sites. The identification of the σ-factor as a dissociable RNA polymerase subunit (Burgess et al. 1969) suggested that RNA polymerase may recruit other σ-factors to switch on particular regulons (Losick and Pero 1981). Seven different σ subunits have been identified in Escherichia coli (Maeda et al. 2000), each binding to the same core RNAP enzyme, thereby directing the transcription of a specific ensemble of genes. The actual model of bacterial transcription assumes a competition of sigma subunits for the limited amount of the core RNAP to which it can be bound (Maeda et al. 2000; Grigorova et al. 2006; Nyström 2004). In case of vegetative cell growth, σ⁷⁰ predominates at a concentration exceeding that of the core RNAP (Grigorova et al. 2006; Jishage et al. 1996) and displaying the highest affinity of all sigma proteins to the core RNAP (Maeda et al. 2000), contributing to the transcription of 70 % of the operons (Raffaelle et al. 2005).

In a solution, the free σ⁷⁰ subunit does not bind to promoter DNA because of direct intramolecular interactions between the N-terminal segment (region 1.1) and DNA-recognizing motifs located in the C-terminal part of the molecule (regions 3.2 and 4.2) (Camarero et al. 2002). The primary sigma factor of E. coli RNA polymerase (ECσ⁷⁰) is a 613-residue protein of 70 kDa, belonging to the large family of sigma factors composed of four strongly conserved domains (Raffaelle et al. 2005). Its regions (2.4 and 4.2) specifically recognize the promoter sequences of expressed genes, located 10 and 35 bp upstream from the transcription initiation site, respectively. Some parts of the 4.2 region have also been identified as targets for transcription-activating or anti-sigma factors (Typas and Hengge 2006; Lambert et al. 2004; Patikoglou et al. 2007). Free σ⁷⁰ in a solution must be regarded as carrying intrinsically disordered segments, as indicated by solution NMR studies on the segmental isotopic labeling of isolated region 4.2 of ${\text{EC}}\sigma_{4}^{70}$ (Camarero et al. 2002); this conclusion seems to be supported by the absence of a crystal structure of the free form of the sigma subunit, with only the exception of the solution structure of Thermotoga maritime $\sigma^{\text{A}}$ factor (TM $\sigma^{\text{A}}$) (Lambert et al. 2004).

At the moment, only two solution structures of ${\text{EC}}\sigma_{4}^{70}$ in complex with T4 Asia (record 1TLH) (Lambert et al. 2004) and E. coli regulator Rsd (2P7 V) (Patikoglou et al. 2007) are accessible in the Protein Data Bank. There is a known crystal structure of ${\text{EC}}\sigma_{4}^{70}$ in complex with a DNA-promoter fragment (3T72) (Blanco et al. 2011) and the structure of whole σ⁷⁰ in the RNAP holoenzyme of E. coli (4IGC) (Murakami 2013), as well as its complexes with small-mass ligands (4KN7, 4KN4, 4KMU and 4JK2, 4JK1) (Molodtsov et al. 2013; Mechold et al. 2013). There are also the structures of homologous domains from thermophilic bacteria (Lambert et al. 2004; Campbell et al. 2002a, b). However, there are still no structural data concerning free ECσ⁷⁰ in solution.

For our part, we have demonstrated that the population of secondary structure elements was substantially increased for $r{\text{EC}}\sigma_{4}^{70}$ in the presence of TFE (Poznanski et al. 2003). This generally agrees with the idea that TFE induces the closure of individual hydrogen bonds (Jaravine et al. 2001) and thus propagates the context-dependent formation of native-like secondary structures in polypeptides (Lawrence and Johnson 2002). Subsequently, we have monitored the TFE-induced conformational changes of $r{\text{EC}}\sigma_{4}^{70}$ with the aid of NMR spectroscopy (Kaczka et al. 2010). The drift of resonance signals in the ¹H-¹⁵N HSQC spectra indicated that in a mixed water-TFE solution, a non-specific uniform build-up of helical structures takes place for a THE content smaller than 10 % (v/v). However, for a TFE content exceeding 10 %, non-monotonic changes were observed for numerous residues located in either the linker between domains 3 and 4 or the regions separating helical structures in the putative HLHTH motif (helix-loop-helix-turn-helix), namely G564-Y571 (L) and D581-T583 (T) (see PDB record 4igc). This supports the statement that at higher TFE concentrations, $r{\text{EC}}\sigma_{4}^{70}$ would tend to fold in a transient native-like 3D structure found in the crystal structure of the E. coli RNAP whole holoenzyme (Murakami 2013). Yet, does ${\text{EC}}\sigma_{4}^{70}$ in a solution resemble a sigma factor from E. coli RNAP much more than the homologous domains of Thermus aquaticus (Campbell et al. 2002) and Thermus thermophilus (Vassylyev et al. 2002), or the solution structure of T. maritime (Lambert et al. 2004), and that of a distant one, $\sigma_{ 4}^{ 5 4}$, from Axuifex aeolicus (2ahq) (Doucleff et al. 2005)? In this work, on the basis of further CD and NMR studies, we show a low-resolution model of E. coli $\sigma_{ 4}^{ 70}$ fold induced by a 30 % TFE, and we analyze the internal motions of the protein backbone with the use of ¹⁵N relaxation.

Materials and methods

Materials

The preparation of the recombinant ${\text{EC}}\sigma_{4}^{70}$ fragment, r${\text{EC}}\sigma_{4}^{70}$ (86 C-terminal residues of the E. coli σ⁷⁰ preceded by a 21-aa segment carrying His₆-Tag) was described previously (Poznanski et al. 2001). The protein was found to be chemically stable for a prolonged time in both aqueous acidic and mixed aqueous/TFE solutions.

CD spectroscopy

Circular dichroism spectra were recorded on a Jasco J-815 spectropolarimeter equipped with a Peltier thermostatic cell holder. All the spectra were recorded for ~1.5 μM protein solution under a nitrogen atmosphere using a 10-mm path-length quartz cell. The exact protein concentration was determined for each sample based on UV absorption at 274 nm (ε = 1,400 M⁻¹cm⁻¹). Each CD spectrum was measured three times at 25 °C in the range of 195–270 nm. The contribution of the secondary structure elements was estimated with the aid of the CDNN program (Gerald 1997).

NMR spectroscopy

The 2.5-mM protein samples were prepared in a 30 % (v/v) TFE/water binary system (10 % D2O) at pH 4.6. All $r{\text{EC}}\sigma_{4}^{70}$ NMR spectra were collected at 298 K on either a Varian Unity Plus 500 or Varian VNMRS 800 spectrometer, processed with the aid of NMRPipe (Delaglio et al. 1995), and analyzed using SPARKY (Goddard and Kneller 2002). Prior to the Fourier transformation, resolutions in indirect dimensions were increased by a π/3 shifted squared sine-bell weight function, followed by zero filling. The zero-order baseline correction was applied in all dimensions. ¹³C and amide ¹H and ¹⁵N resonances were assigned using a combination of HNCBCA (Wittekind and Mueller 1993), CBCA(CO)NH (Grzesiek and Bax 1992) and C(CO)NH (Gardner et al. 1996) experiments. Additional HNHA (Kuboniwa et al. 1994) and HAHB(CO)NH (Wang et al. 1994) spectra enabled the assignment of the Hα and Hβ signals, while most of the remaining ¹H aliphatic side-chain resonances were assigned using the H(C)(CO)NH spectrum (Grzesiek et al. 1993). Carbonyl ¹³C resonances were assigned using the combination of HNCO (Grzesiek and Bax 1992) and HN(CA)CO (Clubb et al. 1992) spectra. To overcome a problem with strong resonance overlapping, sequence-specific assignments were additionally confirmed by the inspection of sequential contacts identified in ¹⁵N-edited 3D-NOESY spectra. The assignments were deposited in BMRB (Entry 15975).

Chemical shift analysis

Secondary chemical shifts, Δδ(i), were calculated for ¹³Cα, ¹³CO and ¹Hα nuclei according to the reference values for random-coil chemical shifts corrected for sequence-dependent contributions (Schwarzinger et al. 2001). The population of helically folded regions in $r{\text{EC}}\sigma_{4}^{70}$ was then evaluated according to the concept of the chemical shift index (CSI) (Wishart and Sykes 1994). The helical population of individual residues, p _α(i), was estimated as the ratio of Δδ(i)/Δδ_o(i), where the reference values of secondary chemical shifts, Δδ_o(i), were assumed to be 2.8, 2.3 and −0.4 ppm for ¹³Cα, ¹³CO and ¹Hα, respectively (Wishart and Sykes 1994; Schwarzinger et al. 2000). In view of the consistency of the Δδ(i) distribution pattern for ¹³Cα, ¹³CO and ¹Hα, the consensus local propensities toward a helical structure were expressed as the geometric average of the three descriptors referred to above (Kaczka et al. 2010). The population of helical structures were additionally estimated on the δ2D server (uk/d2D/ Jobs submitted: July 2014), the algorithm of which is based on the larger set of chemical shift data (¹Hα, ¹³Cα, ¹³Cβ, ¹³CO, N and HN) (Camilloni et al. 2012). The protein backbone φ and ψ angles, and the square of the order parameter for the backbone amides, S ², were additionally estimated with the aid of TALOS+ (Shen et al. 2009).

¹⁵N relaxation data

Relaxation parameters were measured at 298 K for a uniformly ¹⁵N-labeled protein sample dissolved in a mixed aqueous solution with 30 % TFE content (pH 4.6) on Varian UnityPlus 500, Varian VNMRS 600 and Varian VNMRS 800 spectrometers at 11.4, 14.1 and 18.7 T, respectively. The pulse sequences used for the determination of the ¹⁵N longitudinal (R ₁) and transverse (R ₂) relaxation rates were identical for all three spectrometers, being analogous to those originally reported (Farrow et al. 1994). For R ₂ measurements, a Carr-Parcell-Meiboom-Gill (CPMG) 180° pulse train with a refocusing delay of 650 µs was used during evolution (Meiboom and Gill 1958). Delays between proton π pulses, used for the suppression of cross-correlation effects between ¹H and ¹⁵N nuclei (Kay et al. 1992), were 5 and 10 ms in R ₁ and R ₂ measurements, respectively. The recycle delay was kept as long as 2.5 s. One thousand twenty-four and 128 complex data points in time domains were collected in the hypercomplex mode. ¹⁵N decoupling during the acquisition was performed as a 3.2-kHz GARP pulse scheme (Shaka 1985). The relaxation rates were estimated using ten delays for R ₁ (0.01, 0.09, 0.17, 0.29, 0.41, 0.55, 0.69, 0.85, 1.01 and 1.25 s) and eight delays for R ₂ (0.01, 0.03, 0.05, 0.09, 0.13, 0.17, 0.21 and 0.25 s). {¹H}-¹⁵N heteronuclear NOEs were measured according to the dynamic progressive saturation (Kowalewski 1978; Zhukov and Ejchart 1999) using the standard pulse sequence included in the BioPack (Varian Inc., Palo Alto, CA, USA) software. To overcome problems with peaks overlapping, the NOE intensities were assumed to be cross-peak’s heights. The R ₁ and R ₂ rates were then estimated by fitting an exponential decay curve in the form I(t) = I ₀exp(−R _i × t) implemented in SPARKY, where I is the signal intensity and t is the evolution time for appropriate magnetization. Severe signal overlapping limited the quantitative interpretation of relaxation parameters to only 84 out of 101 assigned backbone amide ¹⁵N nuclei.

Analysis of the ¹⁵N relaxation data

The ¹⁵N relaxation data (R ₁, R ₂, ¹H-¹⁵N NOE) were interpreted according to the reduced spectral density mapping, via sampling spectral density function, J(ω), at low (0), intermediate (ω _N) and high (ω _H) frequencies (Farrow et al. 1995).

$$R_{1} = \left( {\frac{{d^{2} }}{4}} \right)\;\left[ {J(\omega_{\text{H}} - \omega_{\text{N}} )\, + \;6J(\omega_{\text{H}} + \omega_{\text{N}} )} \right] + c^{2} J(\omega_{\text{N}} )$$

$$R_{2} = \left( {\frac{{d^{2} }}{8}} \right)\;\left[ {4J(0)\, + \;J(\omega_{\text{H}} - \;\omega_{\text{N}} ) + 3J(\omega_{\text{N}} ) + 6J(\omega_{\text{H}} ) + \;6J(\omega_{\text{H}} + \omega_{\text{N}} )} \right]\; + \;\left( {\frac{{c^{2} }}{6}} \right)[4J(0) + 3J(\omega_{\text{N}} )]\; + \;R_{\text{ex}}$$

$${\text{NOE}}\; = \;1\; + \;\left( {\frac{{d^{2} }}{{4R_{1} }}} \right)\;\left( {\frac{{\gamma_{\text{H}} }}{{\gamma_{\text{N}} }}} \right)\;\left[ {6J(\omega_{\text{H}} + \omega_{\text{N}} )\; - \;J(\omega_{\text{H}} \; - \;\omega_{\text{N}} )} \right]$$

$${\text{where }}c = \frac{{\omega_{\text{N}} \varDelta \sigma }}{\sqrt 3 },\;d = \left( {\frac{{\mu_{0} h}}{{8\pi^{2} }}} \right)\left( {\gamma_{\text{N}} \gamma_{\text{H}} } \right)\left\langle {\frac{1}{{r_{\text{NH}}^{3} }}} \right\rangle ,\quad\,\varDelta \sigma \; = \; - 160\,{\text{ppm}},\;r_{\text{NH}} = 1.02A,\;R_{\text{ex}} = \;a \times B_{0}^{2}$$

(1)

J(ω) is the spectral density of molecular motion at a given angular frequency, and the additional term R _ex, which scales with the square of magnetic field, stands for the contribution of micro- to millisecond motions to R ₂ (Korzhnev et al. 2001; Stone et al. 1992). The above equation system can be resolved against J(ω) (Farrow et al. 1995), giving:

$$J(0.87\; \times \;\omega_{\text{H}} )\cong(0.8/d^{2} )\; \times \;\left( {\gamma_{\text{N}} /\gamma_{\text{H}} } \right)\; \times \;({\text{NOE}} - 1)\; \times \;R_{1}$$

$$J(\omega_{\text{N}} ) = [(4R_{1} - 7d^{2} )\; \times J(0.87 \times \omega_{\text{H}} )]/(3d^{2} + 4c^{2} )$$

$$J(0) = [(6 \times R_{2} - (9d^{2} /4 + 3c^{2} ) \times J(\omega_{\text{N}} ) - 39d^{2} /4 \times J(0.87 \times \omega_{\text{H}} )]/(3d^{2} + 4c^{2} )$$

(2)

The residues with possible contribution of conformational and/or chemical exchange to the transverse ¹⁵N relaxation were identified from the field dependence of R ₂–½R ₁ (Eq. 3), as those for which the estimated R _ex was identified, according to Peirce’s criterion (Peirce 1852) as outliers in the overall distribution of R _ex.

$$R_{2} - \frac{{R_{1} }}{2} = \frac{{d^{2} }}{4}[2J(0) + 3J(\omega_{\text{H}} )] + \frac{{2c^{2} }}{3}J(0) + R_{\text{ex}} \cong \;\underbrace {{\frac{{d^{2} }}{2}J(0)}}_{{a_{0} }}\; + \;\underbrace {{\frac{{2c^{2} }}{3}J\left( 0 \right) + R_{\text{ex}} }}_{{a_{2} \times B_{0}^{2} }}$$

(3)

Since the studied protein was known to be highly unfolded (Poznanski et al. 2003; Kaczka et al. 2010), the standard Lipari-Szabo model-free approach (Lipari and Szabo 1982a, b), which attributes a global correlation time (τ _m) to the overall rotational diffusion, was found inappropriate (Alexandrescu and Shortle 1994; Brutscher et al. 1997). Hence, an alternative approach of the direct fitting of the spectral density function (Eq. 4) to the J(ω) data sampled at 0, ω _N and 0.87×ω _H was used.

$$J(\omega ) = \frac{2}{5}\left[ {\frac{{S^{2} \times \tau_{\text{local}} }}{{1 + (\omega \times \tau_{\text{local}} )}} + \frac{{(1 - S^{2} ) \times \tau^{\prime}}}{{1 + (\omega \times \tau^{\prime})^{2} }}} \right]$$

(4)

where S ² is the square of the residue-specific generalized order parameter for slow motions, 1/τ′ = 1/τ _local + 1/τ _e, and τ _local and τ _e are the effective, residue-specific correlation times for slow (>1 ns) and fast (<100 ps) motions, respectively.

Structural calculations

NOE-derived distance restraints were extracted from the ¹⁵N-edited 3DNOESY- HSQC (Talluri and Wagner 1996) (mixing time: 150 ms) and ¹³C-edited 3D-NOESY-HSQC (Muhandiram et al. 1993) (mixing times: 90 and 250 ms) experiments. The cross peaks in 3D-NOESY spectra were manually picked and assigned with the SPARKY program (Goddard and Kneller 2002), and their heights were then converted into distance restraints with the aid of the CALIBA procedure included in CYANA 2.1 (Guntert 2004). Additional constraints for backbone torsion angles were adopted from TALOS+ (Shen et al. 2009). Initial structural calculations were made with the simulated annealing (SA) protocol implemented in XPLOR 3 (Schwieters et al. 2003; Brunger 1992). The structure obtained with the homology modeling procedure was used as a template. The SA protocol started with 6-ns dynamics at 1,000 K, with dihedral angle restraints and NOE restraints scaled to 5 kcal mol⁻¹rad⁻² and 50 kcal mol⁻¹A⁻², respectively, and the van der Waals’ radii scaled down by a factor of 0.75. After that, the system was slowly cooled to 100 K during 3 ns, with the gradually increased terms for the repulsive van der Waals’ term (from 0.003 to 50 kcal mol⁻¹A⁴) and for dihedral angles (200 kcal mol⁻¹rad⁻²). Each SA cycle finished with 1 ns low-temperature evolution, followed by 1,000 steps of Powell energy minimization. This protocol was repeated 500 times, and a cluster of 11 lowest pseudo-energy structures was selected for further analyses. The final refinement was performed using additional constraints for experimental ¹³Cα, ¹³Cβ and ¹Hα chemical shifts, using a chemical shift protocol (Kuszewski et al. 1995a, b). Finally, the structures were then inspected with PROCHECK-NMR (Laskowski et al. 1996), and all of them were subjected to further 5 ns molecular dynamic simulations in a water box using the Yasara2 force field implemented in the Yasara Structure package (Krieger et al. 2002). To prevent the momentary unfolding of the protein, and to mimic the helix-promoting propensities of a 30 % TFE solution, the medium-range (i, i + 3) constraints and angular restraints, imported directly from the SA protocol, were applied during the first 2 ns of simulations.

Statistical tests

The relaxation parameters were analyzed in terms of their distributions, and the consistency of these data with normal distribution was initially tested with the aid of the Anderson-Darling goodness-of-fit test (Anderson and Darling 1952). Since some of the distributions significantly differ from the Gaussian one, the relaxation parameters obtained at different conditions (i.e., TFE concentration) or for different subsets of residues (e.g., His₆-tag vs. HLHTH motif) were compared using the appropriate nonparametric Mann-Whitney U test (Mann and Whitney 1947). All tests were done with the aid of the Statistica package (2011).

Computational methods

All the data analyses and presentations were performed using GnuPlot 4.6 (Williams and Kelley 2007).

Results

The HLHTH fold of $r{\text{EC}}\sigma_{4}^{70}$ in a solution

The protein was previously found soluble solely in a low-pH solution (Poznanski et al. 2003). Our previous studies demonstrated that the solubility of the protein substantially decreases above pH 4.5, which is far away from the theoretical pI of 9.1 ± 0.3, estimated as the average of nine different methods (Kozlowski 2007). This phenomenon should be directly addressed to the dissociation of Asp and Glu side-chain carboxyl groups at pH above 5. Dissociation of Asp and Glu side chains enables formation of numerous intra- and intermolecular salt bridges that compete with hydrophobic interactions (Poznanski et al. 2003). This destroys a hydrophobic core of the transiently folded protein, and the exposition of hydrophobic residues toward solvent results in intensive aggregation.

In order to stabilize the protein in a neutral solution, a number of solvent systems preventing the formation of salt bridges have been tested. Thus, 200 mM Na₂SO₄ and 1.8 M MgSO₄ were found to minutely improve protein solubility at neutral pH, additionally stabilizing helical forms. However, limited protein solubility and the required salt concentration precluded the application of any NMR techniques. The high concentration of arginine (~1 M) also improved $r{\text{EC}}\sigma_{4}^{70}$ solubility (Buchner et al. 1992), but such a high concentration precluded even the CD monitoring of a putative Arg-induced secondary structure formation. Substantial progress was achieved with the use of zwitterionic non-detergent sulfobetaine stabilizers (NDSB). NDSB195 does not affect $r{\text{EC}}\sigma_{4}^{70}$ at low pH (Supplementary Figure S1a), albeit its 100 mM solution efficiently screens electrostatic interactions between dissociated side chains of Asp/Glu and those of Arg/Lys, and the protein solubility approached 50 μM at pH 7.7. Moreover, in the presence of 100 mM NDSB195, an increase in pH of $r{\text{EC}}\sigma_{4}^{70}$ solution results in a detectable increase of the estimated contribution of helical forms, which is clearly evidenced in CD spectra recorded at pH ranging from 2.6 to 7.5 (Supplementary Figure S2a), while the further increase of pH unfolds the protein. An optimal range of solution pH at which a protein is properly folded is a common protein property (Dill 1990; Copeland 2000). All CD spectra, analyzed together under the assumption of two-state equilibrium, show that, even at neutral pH, the protein is only partially folded. The maximal contribution of a putative folded state (approximately 50 %) was achieved at the physiological pH of 7.5 (see the insert in Supplementary Figure S2a). It should be stressed that 100 mM NDSB195 does not preserve the protein structure for $r{\text{EC}}\sigma_{4}^{70}$ concentrations exceeding 2 µM (see Supplementary Figure S2b for details), so this solvent system could not be used for heteronuclear NMR studies. Altogether, it clearly indicates that the previously proposed TFE-water binary solvent system is the most suitable for the NMR studies on $r{\text{EC}}\sigma_{4}^{70}$ structural preferences. The 30 % TFE concentration was found optimal, and the further addition of TFE did not result in visible changes of protein structure (Supplementary Figure S3). It should be mentioned that the CD spectrum of $r{\text{EC}}\sigma_{4}^{70}$ recorded in the presence of NDSB195 at pH 7.5 (Supplementary Figure S1a) resembles that recorded in TFE-water binary solvent (Supplementary Figure S3). This impression is strongly supported by the deconvolution of CD spectra against the contribution of secondary structure elements, populations of which were estimated with the aid of the CDNN program (see Supplementary Table S1).

NMR resonance assignment and protein secondary structure preferences derived from CSI data in a 30 % (v/v) TFE solution

The resonance assignments for $r{\text{EC}}\sigma_{4}^{70}$ at 30 % TFE (cf. Methods) were deposited in a BMRB database under accession no. 15975. A consistent pattern of the downfield shifts of ¹³Cα and ¹³CO resonances, accompanying the upfield shift of the ¹Hα, was observed for numerous residues. The values of the score function, ${\tilde{p}_{\alpha } (i)}$ (see (Kaczka et al. 2010) for details) identify significantly populated helical patterns, located in the regions: 3.2 (L532-A542; H0), 4.1 (L551-F563; H1) and 4.2 (L573-L599; H2, H3). Another less populated fragment, S602-S609, is located in the C-terminal part of $r{\text{EC}}\sigma_{4}^{70}$ (Fig. 1a). The three clearly distinguished maxima coincide with the location of helical regions in the structures of homologous σ subunits (Lambert et al. 2004; Patikoglou et al. 2007; Blanco et al. 2011; Murakami 2013; Campbell et al. 2002), while the last one (S602–S609) could only be found in $\sigma_{4}^{\text{A}}$ of T. maritime (Lambert et al. 2004) and in E. coli $\sigma_{4}^{\text{A}}$ in complex with an Rsd regulator (Patikoglou et al. 2007) and in the recently solved structure of E. coli RNAP holoenzyme (Murakami 2013). The same trends, including both the location and population of helical structures, are also clearly visible when they are obtained with the aid of the δ2D server (Fig. 1b). The average population of helical structures in region 4.2 reaches 0.6 for a 30 % (v/v) TFE solution, while at 10 % (v/v) TFE it approached only 0.4 (Kaczka et al. 2010), respectively 0.33 and 0.37, when estimated using δ2d. The average population of helical structures for the whole protein was estimated to be 0.25 (Kaczka et al. 2010) and 0.38 for 10 and 30 % TFE solutions and respectively 0.25 and 0.26 when calculated using the δ2D approach. These values are close to the 0.35 estimated from the CD spectrum (see Supplementary table S1) and substantially higher than the 0.11 determined for the protein at low pH (Poznanski et al. 2003); 0.05 was determined using δ2D. This clearly indicates a considerable increase in the propensity of the $r{\text{EC}}\sigma_{4}^{70}$ backbone toward the α-helical fold in proportion to the increasing TFE concentration in an aqueous solution.

A comparison of the location of helical regions in the TFE-induced structure of $r{\text{EC}}\sigma_{4}^{70}$ with that found for its homologous domain of known structures (see gray vertical strips in Fig. 1, also highlighted in further figures) indicates that the C-terminal fragment of the H1 helix is considerably frayed, while H2 and H3, organized into the HTH motif, are in a similar location. The inspection of the estimated distribution of helical conformations along the protein sequence shows that the H3 helix is the most stable one, and it definitively does not propagate behind L598. The short helix H2 became significantly stabilized only at the highest TFE concentration (30 %), and the location of the loop region that separates H1 and H2 at a 30 % THE solution is similar to those seen in the crystal structures of the homologous proteins (Lambert et al. 2004; Patikoglou et al. 2007; Blanco et al. 2011; Murakami 2013; Campbell et al. 2002). It is worth noting that, according to the δ2D analysis (Fig. 1b), the residues located in the turn region of the HTH motif (D581-T583), which are in helical conformation at 10 % TFE, became non-helical at higher TFE concentration. This perfectly agrees with the break of monotonic drift in the HSQC spectra upon addition of TFE, previously observed for these residues (Kaczka et al. 2010). Altogether this proves that 30 % TFE does induce formation of a transient HLHTH structure, which is absent when the TFE concentration is lower than 15 % (Kaczka et al. 2010). The global analysis of chemical shift patterns performed with TALOS+ (Shen et al. 2009) also confirmed the presence of four helical regions in $r{\text{EC}}\sigma_{4}^{70}$ (Fig. 1c). The values of the φ and ψ backbone torsion angles were predicted as ‘good’ for 66 out of 103 analyzed residues. Among them, 32 are located in the putatively helical regions corresponding to the HLHTH motif. The conformation for the 20 residues located in the N-terminal region of the protein was consistently considered as dynamic. The next 17 residues, 9 of which are located within the HLHTH motif, were excluded from further analysis since their backbone conformation was predicted to be uncertain. Interestingly, the three N-terminal residues of the helix H1 (L551, T552, A553) were also predicted to be in non-helical conformation, which, according to the reported ‘good’ quality of the prediction, indicates that a TFE-induced helix H1 may be for $r{\text{EC}}\sigma_{4}^{70}$ shorter than these found in the known structures of homologous proteins. Similarly, residues Y571, T572 and G577, located at both of the termini of H2, are also predicted to be in non-helical conformation, which, together with the CSI data (Fig. 1a, b), proves that in ${\text{EC}}\sigma_{4}^{70}$, H2 is significantly shorter than those from homologous proteins (Lambert et al. 2004; Patikoglou et al. 2007; Blanco et al. 2011; Murakami 2013; Campbell et al. 2002).

The internal motions of $r{\text{EC}}\sigma_{4}^{70}$ in a 30 % TFE solution derived from ¹⁵N relaxation data

The R ₁ and R ₂ relaxation rates, and {¹H}-¹⁵N NOEs, were determined for $r{\text{EC}}\sigma_{4}^{70}$ in a 30 % TFE solution of 11.4, 14.1 and 18.7 T (see Supplementary Figure S4). As expected for substantially unfolded proteins, most of the {¹H}-¹⁵N heteronuclear NOEs do not exceed 0.5, while the values expected for the limited internal mobility of N–H vectors in the folded proteins of a comparable size are of the order of 0.7 (Brokx et al. 2004). However, the residues located in the putative HLHTH region display visibly increased NOE values (Supplementary Figure S4c). Moreover, the variations in transverse relaxation rates along the protein sequence are consistent with the {¹H}-¹⁵N NOE profile, clearly identifying two regions that differ significantly in their conformational flexibility. Elevated R ₂ values determined for residues R534-L598 (varying in the range of 15–45 s⁻¹) strictly coincide with the location of regions displaying propensities toward a TFE-induced helical structure (Fig. 1). It is worth noting that residues S539, L540, T552, M561, R562, Y571, T572, L573, R586, I587 and R588, most of which are located in the putative HLHTH motif, display substantially decreased rates of backbone ¹⁵N transverse relaxation, which indicates their increased flexibility. The ratio R ₂/R ₁ is commonly used for identification of residues that experience motion of a timescale that differs from average molecular tumbling. Thus, decreased values of R ₂/R ₁, which reflect the variation in R ₂, together with negative values of heteronuclear {¹H}-¹⁵N NOEs, thus again identify residues experiencing increased flexibility (see Fig. 2a). And high R ₂/R ₁ values point to residues with possible contributions of chemical exchange to the transverse relaxation. They may also be identified as those displaying substantial field dependence of R ₂–½R ₁. Both methods pointed to D581, R584, Q589 and K593 (Fig. 2a, b), which were also identified as outliers in the distribution of R _ex (Fig. 2c). Two of them are located in the turn region of the HLHTF motif, while the two other (Q589, K593) are located just in the middle of H3.

Internal $r{\text{EC}}\sigma_{4}^{70}$ dynamics deduced from the reduced spectral density mapping

This analysis of ¹⁵N relaxation data does not require any assumption about the nature of molecular motion (Farrow et al. 1995). The distributions of J(ω) along the protein sequence are presented in Supplementary Figure S5A-C. The J(0), which is most sensitive for slow motions on the nanosecond time scale (Supplementary Figure S5c), follows the trend observed for R ₂ (Supplementary Figure S4c). The residues T537, E538, T544 T545, K557, D581-R584, E591, K593 and A594, for which J(0) substantially varies with the magnetic field, may undergo slow conformational exchange processes on the μs-ms time scale, the contribution of which to the rate of transverse relaxation (R ₂) scales with the square of magnetic field. The extremely low J(0) values observed for residues located in the N-terminal part of the protein of His₆-tag and the linker fragment preceding Leu 528 (average of 0.24 ns/rad), together with visibly high correlation times for fast motions, J(ω _H) (>20 ps/rad), show that this N-terminal part of the protein is almost unfolded. The same trend is observed for the C-terminal part of the protein (i.e., for residues succeeding HLHTH motif), albeit the visibly higher J(0) (average of 0.9 ns/rad) and slightly lower J(ω _H) suggest that the motion of residues located in the C-terminal part of the protein is somehow restricted. The central part of the $r{\text{EC}}\sigma_{4}^{70}$ displays substantially higher values of J(0) and smaller values of J(ω _H). This is indicative for the folded proteins; however, the values of J(ω _H) higher than 7.5 ps/rad are indicative of residues experiencing significant internal flexibility (Viles et al. 2001). Interestingly, a few residues from the HLHTH motif are substantially more flexible than the others. All of them, R562, Y571, T572 and F573, are located in the loop region of the HLHTH motif. It should be thus concluded that this loop region undergoes fast conformational sampling. Finally, as expected, J(ω _N) strongly varies with the strength of the magnetic field.

Internal $r{\text{EC}}\sigma_{4}^{70}$ dynamics estimated according to the model-free approach

The simplest relaxation model describing the contribution of the slow and fast internal motions together with the order parameter for slow motion was used. In the case of poorly folded proteins, a correlation time for slow motions cannot be attributed to the correlation time characterizing the overall rotation of the protein, but must be regarded as a residue-dependent parameter. This makes the problem of fitting J(ω) to the relaxation data challenging. In general, only the simplest model (Eq. 4) was found reproducibly fitted upon several trial runs, while convergence of all others strongly depended on the starting values.

The distribution of the square of the generalized order parameter (S ²) along the protein sequence (Fig. 3a) demonstrates that the mobility of all residues located within the HLHTH motif is definitively much more restricted. It should be emphasized that the trend in S ² variation, estimated from the chemical shift data (a thick broken line in Fig. 3a) follows that obtained directly from the ¹⁵N relaxation data. However, the local extrema do not strictly coincide with the expected location of helical regions indicated by CSI analysis (see Fig. 1a) and the predicted backbone angles (see Fig. 1b) identified in the structures of homologous proteins. The S ² values for residues located in the HLHTH region (Table 1) are visibly smaller than those usually found for properly folded proteins (S ² > 0.9), but exceed the value of 0.6 observed for the TFE-induced helix of calmodulin in 35 % TFE aqueous solution (Brokx et al. 2004).

Table 1 Relaxation parameters determined for $r{\text{EC}}\sigma^{70}_{4}$ in an acidic solution and for a 10 % and 30 % TFE solution at pH 4.55, calculated separately for the residues located in the N-terminal His₆-tag and succeeding linker, in the HLHTH region, and in both regions flanking the HLHTH motif (others)

Full size table

The distribution of local correlation times for slow motions (τ _local; cf. Fig. 3b) remains in complete agreement with the conclusion derived from the data presented in Figs. 1 and 2. Again, the four regions of $r{\text{EC}}\sigma_{4}^{70}$ (L532-D546, R554-M561, E574-D581 and R584-R599), characterized by the elevated values of τ _local, are clearly visible. A considerable simultaneous decrease in S ² and τ _local values observed for some residues located between D566 and L573 additionally supports the increased flexibility of this loop region. The median of 11.6 ns/rad (see Table 1) is visibly higher than that expected for the proteins of a similar size, which may result from increased viscosity of TFE-water mixed solvents (Olive et al. 1996). The possible contribution of slow conformational motions identified for residues T537, E538, T544 T545, K557, D581-R584, E591, K593 and A594 may also result in overestimation of the τ_local values.

The distribution of local correlation times for fast motions, τ_e (Fig. 3c), diverges strongly, and the values estimated for the HLHTH region are close to 30 ps.

The low-resolution structure of $r{\text{EC}}\sigma_{4}^{70}$ in a 30 % TFE aqueous solution from the NOESY data

A total of 1,237 NOE contacts were assigned in ¹³C- and ¹⁵N-edited 3D-NOESY spectra recorded at a 30 % (v/v) TFE. This includes 778 intraresidual, 330 sequential, 106 medium-range (1 ≤ |i – j| ≤ 4) and 23 long-range (|i – j| > 4) ones. The number of medium-range contacts (see Table 2) significantly exceeds that previously identified at 10 % (v/v) TFE (106 vs. 15, respectively). However, most importantly, 23 long-range structural contacts between the regions that display a high population of secondary structure elements have been unequivocally assigned (see Supplementary Table S2 and Supplementary Figure S6). These contacts occur mainly between helical regions (e.g., H2–H3), but some involving loop regions (e.g., H1-L) can also be observed (see Table 2). Altogether, it clearly evidences the formation of at least a transient three-dimensional HLHTH structure. Such a small number of long-range contacts precluded any reasonable ensemble-based interpretation of NMR data (Rezaei-Ghaleh et al. 2012) and only representative low-resolution structures that generally agree with the CSI and NOE data.

Table 2 Long-range cross-peaks unequivocally assigned in ¹⁵N- and ¹³C-edited NOESY spectra recorded for $r{\text{EC}}\sigma^{70}_{4}$ at pH 4.55 in 30 % (v/v) TFE solution

Full size table

As shown in Table 2 and Supplementary Table S2, the assignment of NOESY medium- and long-range cross-peaks, together with the relaxation data (Figs. 2, 3, S4, S5) and CSI analysis (Fig. 1), unequivocally indicate that $r{\text{EC}}\sigma_{4}^{70}$ contains four helical regions, separated by two loops and one short turn, similarly as has been found in the solution structure of homologous domains. The location of these helical regions was further constrained in the structural modeling to access the agreement between the TFE-induced transient fold of $r{\text{EC}}\sigma_{4}^{70}$ with NOESY-derived distance constraints (see Supplementary Table S3 for short statistics).

The ensemble of the resulting structures is presented in Fig. 4a. The arrangement of the H1 and H3 remains the most stable (see Fig. 4b), providing a kind of framework around which the more flexible LHT motif (i.e., a loop and short H2 region, followed by the turn) is organized in a form resembling those of TM $\sigma_{4}^{A}$ (Fig. 4c) and recently solved the whole RNAP holoenzyme for ECσ⁷⁰ (Fig. 4d) (Murakami 2013). The average RMSD calculated for backbone heavy atoms for the H1 and H3 helical regions within the cluster of the 11 lowest (pseudo)energy structures was reduced to 2.1Å. The increased mobility of the LHT region may also be related to its physiological function (see the Discussion for details).

The MD study of the structural stability of a TFE-induced HLHTH fold of r$EC\sigma_{4}^{70}$

The conformational screening was performed by 11 5-ns MD runs, and each of them started from one of the 11 lowest (pseudo)energy structures. Most of these trajectories preserved the initial protein fold, but during one of them a visible reorganization of the HLHTH motif, involving the simultaneous breaking of the H1 helix at R554-E555 and of the H3 helix at E591-A592, was observed. Both locations agree with the increased contribution of the chemical exchange process to ¹⁵N relaxation rates found for residues and E591-R593 (Fig. 2).

Discussion

According to our earlier studies (Poznanski et al. 2003; Kaczka et al. 2010) and all the data presented in this work, $r{\text{EC}}\sigma_{4}^{70}$ in an aqueous solution must be regarded as intrinsically disordered, but capable of transient folding in the HLHTH motif in the presence of 30 % of TFE. In order to check to what extent the TFE-induced solution structure of $r{\text{EC}}\sigma_{4}^{70}$ resembles that of the homologous proteins, the most stable regions formed by the H1 and H3 helices of these two structures were compared (Fig. 4c, d). The backbone architecture of these two helices identified by us in the NMR-derived structure of $r{\text{EC}}\sigma_{4}^{70}$ is close to those of TM $\sigma_{4}^{A}$ in the solution and EC $\sigma_{ 4}^{ 70}$ in the crystal structure of the RNAP complex (Molodtsov et al. 2013), but the RMSD difference for the backbone atoms between the modeled solution structure and each of these two structures exceeds 10 Å. However, this difference is smaller than 2 Å, when calculated only for H1 and H3, confirming the high structural homology of these regions (see Fig. 4c, d). It has to be stressed that the sequences of all known reference $\sigma_{ 4}^{ 70}$ structures differ noticeably in their loop regions of the canonical HLHTH DNA-recognizing motif (I565-Y571 in E. coli). This loop is involved in the proper orientation of the recognition helix, interacting with the DNA major grove, and is also directly involved in an interaction with promoters (Campbell et al. 2002; Murakami et al. 2002).

The internal dynamics of the TFE-induced fold of r${\text{EC}}\sigma_{4}^{70}$

The Anderson-Darling goodness-of-fit test demonstrated that the distributions of S ², τ _local and τ _e estimated for $r{\text{EC}}\sigma_{4}^{70}$ are generally not Gaussian. The resulting medians accompanied by quartiles are presented in Table 1. For comparison, corresponding mean values and standard deviations are also presented. All the distributions were compared with the aid of the nonparametric Mann-Whitney U test.

The global effect of the TFE concentration on ¹⁵N relaxation is clearly visible (Fig. 5). S ² increases with the content of this agent (Table 1), reaching the maximum at 30 % TFE (v/v). Four helical regions indicated by the CSI analysis (see Fig. 1 for backbone angles estimated with TALOS+) agree with the distribution of relaxation parameters along a protein sequence (Fig. 3). It could thus be concluded that the extent of a transient fold of the protein increases upon TFE titration. According to the nonparametric Mann-Whitney U test (MW test), the distribution of S ² differs significantly (p < 10⁻⁶) for all three protein regions, and the differences are statistically significant for all conditions tested (low pH, 10 % TFE, 30 % TFE, see Table 1 details and Table 3 for exact p-values for comparisons).

Table 3 The comparison of the estimated distributions of relaxation parameters determined for $r{\text{EC}}\sigma^{70}_{4}$ in various conditions made with the aid of the Mann-Whitney test

Full size table

The medians for effective local correlation time for fast motions, τ _e, calculated for the HLHTH region at 0, 10 and 30 % (v/v) TFE concentrations equal to 48, 31 and 28 ps, respectively, thus confirming a TFE-induced restriction of fast motions within the HLHTH motif of r${\text{EC}}\sigma_{4}^{70}$. The distribution of slow motions (τ _local; Fig. 3b) remains in full agreement with the results obtained for S ² and τ _e. Again, the four regions of $r{\text{EC}}\sigma_{4}^{70}$ (L532-D546, A549-R560, V576- R582 and E585-R599) can be distinguished by the elevated τ _local. The medians for τ _local are: 4.8, 9.5 and 11.6 ns for 0, 10 and 30 % (v/v) TFE concentrations, respectively.

The majority of these distributions differ significantly (see Table 3), proving that TFE addition does induce at least a transient protein structure, as indicated by elevated S ² and restricted fast motions. A considerable simultaneous decrease in S ² and τ _local values, observed for residues D566-L573, identifies a flexible structure located at the same protein sequence as the putative loop region of the HLHTH motif.

The above data clearly evidence that the addition of TFE to a protein solution stabilizes the HLHTH motif much more than that of the other regions of the protein, as proved by the direct comparison of the HLHTF motif with the flanking residues (others in Table 3). Generally, the distributions of S ² and both correlation times estimated for residues located in the HLHTH motif significantly differ from those calculated separately for the peripheral residues located outside of this motif. However, it should be noted that all the TFE-related conformational changes generally lead to an increase in the population of the ordered structure, even for the residues outside of the HLHTH motif. This confirms that TFE acts as a non-specific structure-inducer, stabilizing the local secondary structures of folded proteins (Luo and Baldwin 1998; Bhakuni 1998; Shiraki et al. 1995), including α-helices (Nelson and Kallenbach 1989), β-turns (Cann et al. 1987) and β-hairpins (Ramirez-Alvarado et al. 1997). The thermodynamic effect of TFE should be connected with weakened non-local hydrophobic interactions, accompanied by slightly enhanced local ones, thus propagating context-dependent formation of a native-like secondary structure in polypeptides (Thomas and Dill 1993). This was reflected in an increase of the order parameter, S ², accompanied by a moderate decrease of the local correlation time for fast motions, τ _e. The structure-inducing effect observed for $r{\text{EC}}\sigma_{4}^{70}$ is definitively not uniform, but follows sequence-dependent propensities of the backbone already visible in a TFE-free low-pH protein solution (Poznanski et al. 2003).

All the results referred to above clearly demonstrate that IDPs, even in the absence of physiological partners, may obtain their functional folds. And these states can be sampled with a variety of commonly used stabilizing agents such as MgSO₄, arginine, proline, sucrose, sarcosine, glycerol, NDSBs, TMAO or even by TFE, which also may act as a stabilizer (Luo and Baldwin 1998).

References

Alexandrescu AT, Shortle D (1994) Backbone dynamics of a highly disordered 131 residue fragment of staphylococcal nuclease. J Mol Biol 242:527–546
Article CAS PubMed Google Scholar
Anderson TW, Darling DA (1952) Asymptotic theory of certain goodness of fit criteria based on stochastic processes. Ann Math Stat 23:193–212
Article Google Scholar
Bhakuni V (1998) Alcohol-induced molten globule intermediates of proteins: are they real folding intermediates or off pathway products? Arch Biochem Biophys 357:274–284
Article CAS PubMed Google Scholar
Blanco AG, Canals A, Bernues J, Sola M, Coll M (2011) The structure of a transcription activation subcomplex reveals how sigma(70) is recruited to PhoB promoters. EMBO J 30:3776–3785
Article CAS PubMed Central PubMed Google Scholar
Brokx RD, Scheek RM, Weljie AM, Vogel HJ (2004) Backbone dynamic properties of the central linker region of calcium–calmodulin in 35 % trifluoroethanol. J Struct Biol 146:272–280
Article CAS PubMed Google Scholar
Brunger AT (1992) XPLOR. A system for X-ray Crystallography and NMR. Yale University Press, New Haven
Google Scholar
Brutscher B, Brüschweiler R, Ernst RR (1997) Backbone dynamics and structural characterization of the partially folded A state of ubiquitin by 1H, 13C, and 15N nuclear magnetic resonance spectroscopy. Biochemistry 36:13043–13053
Article CAS PubMed Google Scholar
Buchner J, Pastan I, Brinkmann U (1992) A method for increasing the yield of properly folded recombinant fusion proteins—single-chain immunotoxins from renaturation of bacterial inclusion-bodies. Anal Biochem 205:263–270
Article CAS PubMed Google Scholar
Buljan M, Chalancon G, Eustermann S, Wagner GP, Fuxreiter M, Bateman A, Babu MM (2012) Tissue-specific splicing of disordered segments that embed binding motifs rewires protein interaction networks. Mol Cell 46:871–883
Article CAS PubMed Central PubMed Google Scholar
Burgess RR, Travers AA, Dunn JJ, Bautz EKF (1969) Factor stimulating transcription by RNA polymerase. Nature 221:43–46
Article CAS PubMed Google Scholar
Camarero JA, Shekhtman A, Campbell EA, Chlenov M, Gruber TM, Bryant DA, Darst SA, Cowburn D, Muir TW (2002) Autoregulation of a bacterial σ factor explored by using segmental isotopic labeling and NMR. Proc Natl Acad Sci USA 99:8536–8541
Article CAS PubMed Central PubMed Google Scholar
Camilloni C, De Simone A, Vranken WF, Vendruscolo M (2012) Determination of secondary structure populations in disordered states of proteins using nuclear magnetic resonance chemical shifts. Biochemistry 51:2224–2231
Article CAS PubMed Google Scholar
Campbell EA, Muzzin O, Chlenov M, Sun JL, Olson CA, Weinman O, Trester-Zedlitz ML, Darst SA (2002a) Structure of the bacterial RNA polymerase promoter specificity subunit. Mol Cell 9:527–539
Article CAS PubMed Google Scholar
Campbell EA, Muzzin O, Chlenov M, Sun JL, Olson CA, Weinman O, Trester-Zedlitz ML, Darst SA (2002b) Structure of the bacterial RNA polymerase promoter specificity σ subunit. Mol Cell 9:527–539
Article CAS PubMed Google Scholar
Cann JR, London RE, Unkefer CJ, Vavrek RJ, Stewart JM (1987) CD-NMR study of the solution conformation of bradykinin analogs containing alpha-aminoisobutyric-acid. Int J Pept Protein Res 29:486–496
Article CAS PubMed Google Scholar
Clubb RT, Thanabal V, Wagner G (1992) A constant-time 3-dimensional triple-resonance pulse scheme to correlate intraresidue H-1(N), N-15, and C-13 chemical-shifts in N-15-C-13-labeled proteins. J Magn Reson 97:213–217
CAS Google Scholar
Copeland RA (2000) Experimental measures of enzyme activity in enzymes. Wiley, New York, pp 188–265
Google Scholar
Das RK, Pappu RV (2013) Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residues. Proc Natl Acad Sci USA 110:13392–13397
Article CAS PubMed Central PubMed Google Scholar
Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, Bax A (1995) NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J Biomol NMR 6:277–293
CAS PubMed Google Scholar
Dill KA (1990) Dominant forces in protein folding. Biochemistry 29:7133–7155
Article CAS PubMed Google Scholar
Dima RI, Thirumalai D (2004) Asymmetry in the shapes of folded and denatured states of proteins. J Phys Chem B 108:6564–6570
Article CAS Google Scholar
Doucleff M, Malak LT, Pelton JG, Wemmer DE (2005) The C-terminal RpoN domain of σ54 forms an unpredicted helix-turn-helix motif similar to domains of σ70. J Biol Chem 280:41530–41536
Article CAS PubMed Google Scholar
Dyson HJ, Wright PE (2005) Intrinsically unstructured proteins and their functions. Nat Rev Mol Cell Biol 6:197–208
Article CAS PubMed Google Scholar
Farrow NA, Muhandiram R, Singer AU, Pascal SM, Kay CM, Gish G, Shoelson SE, Pawson T, Forman-Kay JD, Kay LE (1994) Backbone dynamics of a free and a phosphopeptide-complexed src homology 2 domain studied by 15N NMR relaxation. Biochemistry 33:5984–6003
Article CAS PubMed Google Scholar
Farrow NA, Zhang OW, Szabo A, Torchia DA, Kay LE (1995) Spectral density function mapping using 15N relaxation data exclusively. J Biomol NMR 6:153–162
CAS PubMed Google Scholar
Fu XD (1995) The superfamily of arginine/serine-rich splicing factors. RNA 1:663–680
CAS PubMed Central PubMed Google Scholar
Galea CA, Wang Y, Sivakolundu SG, Kriwacki RW (2008) Regulation of cell division by intrinsically unstructured proteins: intrinsic flexibility, modularity, and signaling conduits. Biochemistry 47:7598–7609
Article CAS PubMed Central PubMed Google Scholar
Gardner KH, Konrat R, Rosen MK, Kay LE (1996) An (H)C(CO)NH-TOCSY pulse scheme for sequential assignment of protonated methyl groups in otherwise deuterated ¹⁵N,¹³C-laheled proteins. J Biomol NMR 8:351–356
Article CAS PubMed Google Scholar
Gerald B (1997) CDNN. Institut für Biotechnologie, Martin-Luther-Universität, Halle-Wittenberg, Halle
Google Scholar
Goddard TD, Kneller DG (2002) SPARKY 3. University of California, San Francisco. http://www.cgl.ucsf.edu/home/sparky
Grigorova IL, Phleger NJ, Mutalik VK, Gross CA (2006) Insights into transcriptional regulation and If competition from an equilibrium model of RNA polymerase binding to DNA. Proc Natl Acad Sci USA 103:5332–5337
Article CAS PubMed Central PubMed Google Scholar
Grzesiek S, Bax A (1992a) Correlating backbone amide and side chain resonances in larger proteins by multiple relayed triple resonance NMR. J Am Chem Soc 114:6291–6293
Article CAS Google Scholar
Grzesiek S, Bax A (1992b) Improved 3D triple-resonance NMR techniques applied to a 31 kDa protein. J Magn Res 96:432–440
CAS Google Scholar
Grzesiek S, Anglister J, Bax A (1993) Correlation of backbone amide and aliphatic side-chain resonances in ¹³C/¹⁵N-Enriched proteins by isotropic mixing of ¹³C magnetization. J Magn Reson, Ser B 101:114–119
Article CAS Google Scholar
Guntert P (2004) Automated NMR structure calculation with CYANA. Methods Mol Biol 278:353–378
CAS PubMed Google Scholar
Jaravine VA, Alexandrescu AT, Grzesiek S (2001) Observation of the closing of individual hydrogen bonds during TFE-induced helix formation in a peptide. Protein Sci 10:943–950
Article CAS PubMed Central PubMed Google Scholar
Jishage M, Iwata A, Ueda S, Ishihama A (1996) Regulation of RNA polymerase sigma subunit synthesis in Escherichia coli: intracellular levels of four species of sigma subunit under various growth conditions. J Bacteriol 178:5447–5451
CAS PubMed Central PubMed Google Scholar
Kaczka P, Polkowska-Nowakowska A, Bolewska K, Zhukov I, Poznanski J, Wierzchowski KL (2010) Backbone dynamics of TFE-induced native-like fold of region 4 of Escherichia coil RNA polymerase sigma(70) subunit. Proteins–Struct Funct Bioinform 78:754–768
CAS Google Scholar
Kay LE, Nicholson LK, Delaglio F, Bax F, Torchia DA (1992) Pulse sequences for removal of the effects of cross correlation between dipolar and chemical-shift anisotropy relaxation mechanisms on the measurement of heteronuclear T1 and T2 values in proteins. J Magn Reson 97:359–375
CAS Google Scholar
Korzhnev DM, Billeter M, Arseniev AS, Orekhov VY (2001) NMR studies of Brownian tumbling and internal motions in proteins. Prog Nucl Magn Reson Spectrosc 38:197–266
Article CAS Google Scholar
Kowalewski J (1978) Determination of NOE factors using the dynamic overhauser enhancement technique combined with a nonlinear least-squares-fitting procedure. J Magn Reson 31:165–169
CAS Google Scholar
Kozlowski L. 2007–2012. Isoelectric point Calculator
Krieger E, Koraimann G, Vriend G (2002) Increasing the precision of comparative models with YASARA NOVA—a self-parameterizing force field. Proteins-Struct Funct Genet 47:393–402
Article CAS PubMed Google Scholar
Kuboniwa H, Grzesiek S, Delaglio F, Bax A (1994) Measurement of HN-H alpha J couplings in calcium-free calmodulin using new 2D and 3D water-flip-back methods. J Biomol NMR 4:871–878
Article CAS PubMed Google Scholar
Kuszewski J, Gronenborn AM, Clore GM (1995a) The impact of direct refinement against proton chemical-shifts on protein-structure determination by NMR. J Magn Reson Ser B 107:293–297
Article CAS Google Scholar
Kuszewski J, Qin J, Gronenborn AM, Clore GM (1995b) The impact of direct refinement against C-13(alpha) and C-13(beta) chemical-shifts on protein-structure determination by NMR. J Magn Reson Ser B 106:92–96
Article CAS Google Scholar
Lambert LJ, Wei Y, Schirf V, Demeler B, Werner MH (2004) T4 AsiA blocks DNA recognition by remodeling sigma70 region 4. EMBO J 23:2952–2962
Article CAS PubMed Central PubMed Google Scholar
Laskowski RA, Rullmann JAC, MacArthur MW, Kaptein R, Thornton JM (1996) AQUA and PROCHECK-NMR: programs for checking the quality of protein structures solved by NMR. J Biomol NMR 8:477–486
CAS PubMed Google Scholar
Lawrence JR, Johnson WC (2002) Lifson-Roig nucleation for α-helices in trifluoroethanol: context has a strong effect on the helical propensity of amino acids. Biophys Chem 101–102:375–385
Article PubMed Google Scholar
Lipari G, Szabo A (1982a) Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules. 2. Analysis of experimental results. J Am Chem Soc 104:4559–4570
Article CAS Google Scholar
Lipari G, Szabo A (1982b) Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules. 1. Theory and range of validity. J Am Chem Soc 104:4546–4559
Article CAS Google Scholar
Liu JG, Perumal NB, Oldfield CJ, Su EW, Uversky VN, Dunker AK (2006) Intrinsic disorder in transcription factors. Biochemistry 45:6873–6888
Article CAS PubMed Central PubMed Google Scholar
Losick R, Pero J (1981) Cascades of sigma factors. Cell 25:582–584
Article CAS PubMed Google Scholar
Luo YZ, Baldwin RL (1998) Trifluoroethanol stabilizes the pH 4 folding intermediate of sperm whale apomyoglobin. J Mol Biol 279:49–57
Article CAS PubMed Google Scholar
Maeda H, Fujita N, Ishihama A (2000) Competition among seven Escherichia coli sigma subunits: relative binding affinities to the core RNA polymerase. Nucleic Acids Res 28:3497–3503
Article CAS PubMed Central PubMed Google Scholar
Mann HB, Whitney DR (1947) On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat 18:50–60
Article Google Scholar
Mechold U, Potrykus K, Murphy H, Murakami KS, Cashel M (2013) Differential regulation by ppGpp versus pppGpp in Escherichia coli. Nucleic Acids Res 41:6175–6189
Article CAS PubMed Central PubMed Google Scholar
Meiboom S, Gill D (1958) Modified spin-echo method for measuring nuclear relaxation times. Rev Sci Instrum 29:688–691
Article CAS Google Scholar
Molodtsov V, Nawarathne IN, Scharf NT, Kirchhoff PD, Showalter HDH, Garcia GA, Murakami KS (2013) X-ray crystal structures of the Escherichia coli RNA polymerase in complex with benzoxazinorifamycins. J Med Chem 56:4758–4763
Article CAS PubMed Central PubMed Google Scholar
Moncoq K, Broutin I, Larue V, Perdereau D, Cailliau K, Browaeys-Poly E, Burnol AF, Ducruix A (2003) The PIR domain of Grb14 is an intrinsically unstructured protein: implication in insulin signaling. FEBS Lett 554:240–246
Article CAS PubMed Google Scholar
Muhandiram DR, Farrow NA, Xu GY, Smallcombe SH, Kay LE (1993) A gradient C-13 NOESY-HSQC experiment for recording NOESY spectra of C-13-labeled proteins dissolved in H₂O. J Magn Reson Ser B 102:317–321
Article CAS Google Scholar
Murakami KS (2013) X-ray Crystal Structure of Escherichia coli RNA Polymerase sigma(70) Holoenzyme. J Biol Chem 288:9126–9134
Article CAS PubMed Central PubMed Google Scholar
Murakami KS, Masuda S, Campbell EA, Muzzin O, Darst SA (2002) Structural basis of transcription initiation: an RNA polymerase holoenzyme-DNA complex. Science 296:1285–1290
Article CAS PubMed Google Scholar
Nelson JW, Kallenbach NR (1989) Persistence of the.alpha-helix stop signal in the S-peptide in trifluoroethanol solutions. Biochemistry 28:5256–5261
Article CAS PubMed Google Scholar
Nyström T (2004) Growth versus maintenance: a trade-off dictated by RNA polymerase availability and sigma factor competition? Mol Microbiol 54:855–862
Article PubMed Google Scholar
Olive F, Chaudhari SK, Patil KR, Coronas A (1996) Viscosity of the binary systems containing trifluoroethanol, water and tetraethylene glycol dimethyl ether. Prediction of the ternary viscosity from binary data. Can J Chem Eng 74:163–169
Article CAS Google Scholar
Ozbudak EM, Thattai M, Kurtser I, Grossman AD, van Oudenaarden A (2002) Regulation of noise in the expression of a single gene. Nat Genet 31:69–73
Article CAS PubMed Google Scholar
Pancsa R, Tompa P (2012) Structural disorder in eukaryotes. PLoS One 7:e34687
Article CAS PubMed Central PubMed Google Scholar
Patikoglou GA, Westblade LF, Campbell EA, Lamour V, Lane WJ, Darst SA (2007) Crystal structure of the Escherichia coli regulator of sigma70, Rsd, in complex with sigma70 domain 4. J Mol Biol 372:649–659
Article CAS PubMed Central PubMed Google Scholar
Peirce B (1852) Criterion for the rejection of doubtful observations. Astron J 45:161–163
Article Google Scholar
Poznanski J, Zhukov I, Bolewska K, Wierzchowski KL (2001) Sequence-specific H-1, N-15, and C-13 resonance assignments for the whole region 4 of Escherichia coli RNA polymerase sigma (70) subunit. J Biomol NMR 20:181–182
Article CAS PubMed Google Scholar
Poznanski J, Bolewska K, Zhukov I, Wierzchowski KL (2003) Characterization of the low pH solution structure and dynamics of the region 4 of Escherichia coli RNA polymerase sigma(70) subunit. Biochemistry 42:13438–13448
Article CAS PubMed Google Scholar
Raffaelle M, Kanin EI, Vogt J, Burgess RR, Ansari AZ (2005) Holoenzyme switching and stochastic release of sigma factors from RNA polymerase in vivo. Mol Cell 20:357–366
Article CAS PubMed Google Scholar
Ramirez-Alvarado M, Serrano L, Blanco FJ (1997) Conformational analysis of peptides corresponding to all the secondary structure elements of protein L B1 domain: secondary structure propensities are not conserved in proteins with the same fold. J Muscle Res Cell Motil 6:162–174
CAS Google Scholar
Rautureau GJP, Day CL, Hinds MG (2010) Intrinsically disordered proteins in bcl-2 regulated apoptosis. Int J Mol Sci 11:1808–1824
Article CAS PubMed Central PubMed Google Scholar
Rezaei-Ghaleh N, Blackledge M, Zweckstetter M (2012) Intrinsically disordered proteins: from sequence to conformational properties toward drug discovery. Chem Bio Chem 13:930–950
Article CAS PubMed Google Scholar
Schwarzinger S, Kroon GJA, Foss TR, Wright PE, Dyson HJ (2000) Random coil chemical shifts in acidic 8 M urea: implementation of random coil shift data in NMRView. J Biomol NMR 18:43–48
Article CAS PubMed Google Scholar
Schwarzinger S, Kroon GJA, Foss TR, Chung J, Wright PE, Dyson HJ (2001) Sequence-dependent correction of random coil NMR chemical shifts. J Am Chem Soc 123:2970–2978
Article CAS PubMed Google Scholar
Schwieters CD, Kuszewski JJ, Tjandra N, Clore GM (2003) The Xplor-NIH NMR molecular structure determination package. J Magn Reson 160:65–73
Article CAS PubMed Google Scholar
Shaka AJ (1985) Computer-optimized decoupling scheme for wideband applications and low-level operation. J Magn Reson 64:547–552
CAS Google Scholar
Shen Y, Delaglio F, Cornilescu G, Bax A (2009) TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts. J Biomol NMR 44:213–223
Article CAS PubMed Central PubMed Google Scholar
Shiraki K, Nishikawa K, Goto Y (1995) Trifluoroethanol-induced stabilization of the alpha-helical structure of beta-lactoglobulin—implication for non-hierarchical protein-folding. J Mol Biol 245:180–194
Article CAS PubMed Google Scholar
Stone MJ, Fairbrother WJ, Palmer AG III, Reizer J, Saier MH Jr, Wright PE (1992) Backbone dynamics of the Bacillus subtilis glucose permease IIA domain determined from 15N NMR relaxation measurements. Biochemistry 31:4394–4406
Article CAS PubMed Google Scholar
Talluri S, Wagner G (1996) An optimized 3D NOESY-HSQC. J Magn Reson Ser B 112:200–205
Article CAS Google Scholar
Thomas PD, Dill KA (1993) Local and nonlocal interactions in globular-proteins and mechanisms of alcohol denaturation. Protein Sci 2:2050–2065
Article CAS PubMed Central PubMed Google Scholar
Tompa P (2002) Intrinsically unstructured proteins. Trends Biochem Sci 27:527–533
Article CAS PubMed Google Scholar
Tompa P (2005) The interplay between structure and function in intrinsically unstructured proteins. FEBS Lett 579:3346–3354
Article CAS PubMed Google Scholar
Typas A, Hengge R (2006) Role of the spacer between the −35 and −10 regions in Ifs promoter selectivity in Escherichia coli. Mol Microbiol 59:1037–1051
Article CAS PubMed Google Scholar
δ2D server; Available http://www-vendruscolo.ch.cam.ac.uk/d2D/ jobs submitted: July 2014
Uversky VN (2002) What does it mean to be natively unfolded? Eur J Biochem 269:2–12
Article CAS PubMed Google Scholar
Uversky VN (2011) Intrinsically disordered proteins from A to Z. Int J Biochem Cell Biol 43:1090–1103
Article CAS PubMed Google Scholar
Vassylyev DG, Sekine SI, Laptenko O, Lee J, Vassylyeva MN, Borukhov S, Yokoyama S (2002) Crystal structure of a bacterial RNA polymerase holoenzyme at 2.6. A resolution. Nature 417:712–719
Article CAS PubMed Google Scholar
Viles JH, Donne D, Kroon G, Prusiner SB, Cohen FE, Dyson HJ, Wright PE (2001) Local structural plasticity of the prion protein. Analysis of NMR relaxation dynamics. Biochemistry 40:2743–2753
Article CAS PubMed Google Scholar
Wang AC, Lodi PJ, Qin J, Vuister GW, Gronenborn AM, Clore GM (1994) An efficient triple-resonance experiment for proton-directed sequential backbone assignment of medium-sized proteins. J Magn Reson Ser B 105:196–198
Article CAS Google Scholar
Williams T, Kelley C (2007) http://www.gnuplot.info
Wishart DS, Sykes BD (1994) The 13C chemical-shift index: a simple method for the identification of protein secondary structure using 13C chemical-shift data. J Biomol NMR 4:171–180
CAS PubMed Google Scholar
Wittekind M, Mueller L (1993) HNCACB, a high-sensitivity 3D NMR experiment to correlate amide-proton and nitrogen resonances with the alpha- and beta-carbon resonances in proteins. J Magn Reson Ser B 101:201–205
Article CAS Google Scholar
Wright PE, Dyson HJ (1999) Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. J Mol Biol 293:321–331
Article CAS PubMed Google Scholar
Wright PE, Dyson HJ (2009) Linking folding and binding. Curr Opin Struct Biol 19:31–38
Article CAS PubMed Central PubMed Google Scholar
StatSoft, Inc (2011) STATISTICA (data analysis software system) version 10. StatSoft, Inc
Zhukov I, Ejchart A (1999) Factors improving the accuracy of determination of 15N relaxation parameters in proteins. Acta Biochim Pol 46:665–671
CAS PubMed Google Scholar

Download references

Acknowledgments

This work was partially supported by the grant CRP/08/11 founded by the International Centre for Genetic Engineering and Biotechnology, Trieste, Italy.

Author information

Authors and Affiliations

Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawińskiego 5a, 02-106, Warsaw, Poland
Piotr Kaczka, Maria Winiewska, Igor Zhukov, Bożenna Rempoła, Krystyna Bolewska, Tomasz Łoziński, Andrzej Ejchart, Kazimierz L. Wierzchowski & Jarosław Poznański
Centre for Monitoring and Analyses of Population Health Status, National Institute of Public Health, National Institute of Hygiene, Warsaw, Poland
Anna Poznańska

Authors

Piotr Kaczka
View author publications
You can also search for this author in PubMed Google Scholar
Maria Winiewska
View author publications
You can also search for this author in PubMed Google Scholar
Igor Zhukov
View author publications
You can also search for this author in PubMed Google Scholar
Bożenna Rempoła
View author publications
You can also search for this author in PubMed Google Scholar
Krystyna Bolewska
View author publications
You can also search for this author in PubMed Google Scholar
Tomasz Łoziński
View author publications
You can also search for this author in PubMed Google Scholar
Andrzej Ejchart
View author publications
You can also search for this author in PubMed Google Scholar
Anna Poznańska
View author publications
You can also search for this author in PubMed Google Scholar
Kazimierz L. Wierzchowski
View author publications
You can also search for this author in PubMed Google Scholar
Jarosław Poznański
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jarosław Poznański.

Additional information

P. Kaczka and M. Winiewska have contributed equally to this work.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 537 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Kaczka, P., Winiewska, M., Zhukov, I. et al. The TFE-induced transient native-like structure of the intrinsically disordered $\varvec\sigma_{ 4}^{ 70}$ domain of Escherichia coli RNA polymerase. Eur Biophys J 43, 581–594 (2014). https://doi.org/10.1007/s00249-014-0987-4

Download citation

Received: 25 April 2014
Revised: 31 July 2014
Accepted: 08 September 2014
Published: 27 September 2014
Issue Date: December 2014
DOI: https://doi.org/10.1007/s00249-014-0987-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The TFE-induced transient native-like structure of the intrinsically disordered \(\varvec\sigma_{ 4}^{ 70}\) domain of Escherichia coli RNA polymerase

Abstract

Similar content being viewed by others

Atomistic structural ensemble refinement reveals non-native structure stabilizes a sub-millisecond folding intermediate of CheY

NMR characterization of solvent accessibility and transient structure in intrinsically disordered proteins

Determining Transient Nucleic Acid Structures by NMR

Introduction