3.1. 1DIR Spectra of Beta3s
The one-dimensional IR spectrum of each of the Beta3s conformational ensembles was calculated revealing significant differentiation between structures. (Figure 3) In the 1DIR, Amide-I band absorption at both high and low frequency peaks distinguish different secondary structure conformations and thus can be used to detect the degree of folding or point in the folding mechanism. Common Amide-I bands in proteins originating from the protein backbone configuration include, β structures absorbing at 1610-1640 cm-1(v⊥) and 1680-1690 cm-1 (v||), α-helix at 1640-1650 cm-1 and the 1650-1660 cm-1 random coil regions [16]. The low frequency (v⊥) β absorption is a result coupling perpendicular to the β-strand while the high frequency β absorption is (v||) a result of in plane coupling [58]. The 1DIR data presented in figure 3 and table 2 has been calculated for explicitly solvated conformational ensembles with a homogenous broadening parameter (Γ) of 1.0 (solid line), 5.0 (dashed line) and 10.0 cm-1 (dotted line). Experimental homogenous broadening parameters for the Beta3s structure would be expected to be around 8-10 cm-1 similar to a poly-l-lysine in extended AP β-sheet conformation with measured broadening of 8 cm-1 [59]. The simulation at 5 cm-1 and 10 cm-1 provide effects of broadening parameters that depend not only on the broadening parameter we set but also increases with the number of structures in the ensemble.
Table 2 2DIR Diagonal Peak Characteristics The native conformation 1DIR spectrum of Beta3s at Γ of 10 cm-1 shows a maximum peak at 1636 cm-1 which is consistent with the low frequency v|| mode for a β-sheet structure resulting from oscillations in phase perpendicular to the β-strands (Figure 3, Table 2) [16]. In the 10 cm-1 data the dominant high frequency peak occurs at 1666 cm-1 which is lower than the typical 1680 cm-1 characteristic high frequency peak v|| for β-sheets. Investigation of the data obtained at Γ of 1 cm-1 reveals the expected high frequency β-sheet v|| mode at 1680 cm-1. The data at 1 cm-1 also revealed a significant peak at 1654 cm-1, typically associated with random coil and α-helical secondary structure [16]. The presence of the 1666 cm-1and 1654 cm-1 peaks result from the turn region residues in the native conformation, although the absorption intensity for these modes is higher than expected. High absorption intensity is likely attributable to overlapping contributions from very similar modes occurring as a result of the structural homogeneity of the turn region versus the β-sheet regions in the ensemble. A comparison of the turn region versus β-sheet regions is described in Table S1 (see Additional file 1) and shows the β-sheet regions of Beta3s to be less homogenous in structure than the turn sections.
The major peak in the Ns conformation originated at 1653 cm-1 and was surrounded by a shoulder at 1630 cm-1 and another at 1671 cm-1 at Γ of 10 cm-1. (Figure 3, Table 2) The right shoulders correspond to the high v|| frequency β-sheet absorption while the central peak and left peaks result from increased random coil character and the v|| mode of the β-sheet in this conformation [16]. Interesting, the low frequency v||, β-sheet peak shifted approximately 6 cm-1 lower compared to the native state. A decreasing distance between the high and low frequency Amide-I bands of the β-sheets has been shown to correspond to decreasing β-sheet content [26]. This is expected because the Ns conformation lacks the fully formed N-terminal β-sheet, and thus contains less β-sheet content when compared to the Native state.
The Cs conformation incorporating an out of register C-terminal region contains a predominate peak at 1660 cm-1 surrounded by a large shoulder at 1636 cm-1 and a weak shoulder at 1676 cm-1. (Figure 3, Table 2) The 1DIR spectra at 1 cm-1 Γ further resolves these peaks particularly the weak shoulder at 1676 cm-1. Relative to the Native structure a 4 cm-1 decrease in width between the v|| (1636 cm-1) and v|| (1676 cm-1) modes of the β-sheet modes was observed indicative of the decrease in β-sheet structure in this conformation. Additionally, the evolution of a significant random coil peak as a result of the less structured C-terminal region was observed.
The Ch-Curl conformation is most similar to the Ns conformation containing a well ordered C-terminal β-sheet structure and disrupted N-terminal region. In the Ch-Curl structure however, the C-terminus turn is inverted. (Figure 1d) The 1DIR of Ch-Curl consists of 2 peaks at 1638 cm-1 and 1671 cm-1 in the 10 cm-1 Γ regime, additional resolution reveals multiple strong peaks from 1630 cm-1 to 1688 cm-1. (Figure 3, Table 2) Critical β-sheet peaks are present as expected with the well structured C-terminus in this conformation. Variation and multiple strong peaks in the range of ~1645-1660 cm-1 appear to result from the mostly unstructured N-terminal part of the structure, providing the "Curl" component of this ensemble. The β-sheet peak occurred at 1638 cm-1, 6 cm-1 higher than in the Ns conformation, likely a result of the different interactions between adjacent β-sheets due to the inversion of the C-terminus sheet [26].
The 1DIR of the 6-12 helical conformation contains a single strong peak at 1658 cm-1 which corresponds the an α-helix or random coil Amide-I absorption [16]. (Figure 3, Table 2) Further resolution at Γ 1 cm-1 shows a splitting of the main peak into a 1656 cm-1 and 1660 cm-1 peak which likely corresponds to the α-helix structure absorption from residues 6 to 12 and the remainder of the structure which is largely in a random coil configuration.
3.2. 2DIR Spectra Beta3s
The simulated two-dimensional IR correlation spectroscopy (2DCS) reveals three-dimensional structural information about protein structure by reporting on vibration couplings and correlations between vibrations contacts. Although 1DIR appears to be sufficient to distinguish the different conformations in the folding mechanism of Beta3s it lacks ability to reveal coupling between specific residues observed in the off-diagonal peaks of the 2D spectrum. Since 2DIR spectra are calculated with the same Local Amide Hamiltonian as the 1DIR spectra cross peak locations are identical for similar Γ parameters. The 2DIR as displayed in figure 3 was calculated for 3 different homogenous broadening parameters (Γ) of 1.0 (right), 5.0 (middle) and 10.0 cm-1 (left). In figure 3 each 2DIR spectrum is split into two regions, the upper left corner (blue peaks) and lower right corner (red peaks), containing signals originating from the 0->1 and 1->2 IR transitions respectively.
The native state 2DIR spectrum of Beta3s contains a full complement of cross peak interactions of the folded protein and was used as a point of reference for the other conformations. The native state exhibited 15 distinguishable off-diagonal cross peaks in the 2DIR spectra calculated at a line width of 5 cm-1 as noted in table 3 and figure 3. Three specific regions of the 2DIR spectra report on the general conformation of Beta3s, the 1620-1630 cm-1 and 1650-1680 cm-1 region contain signals resulting from the C-terminal β-structure. The peaks at 1636-1650 cm-1 in the middle of the spectra correspond to the N-terminal sheet, while the 1675 cm-1 to 1700 cm-1 region from 1620-1650 cm-1 reports on long-range coupling between C and N-terminal chains through the central β-sheet. The Ns 2DIR spectra contained 10 of 15 native cross peaks and a novel peak at 1621 cm-1 and 1660 cm-1 (Figure 4b, Table 3, 4) The Cs conformation included 4 of the native cross peaks and 2 unique signals. (Figure 4c, Table 3) The additional cross peaks were noted in the Cs conformation between the in 1642 cm-1's at 1674 cm-1 and 1678 cm-1. (Figure, 4c, Table 4) The Ch conformation exhibited the 6 of 15 native cross peak interactions likely all from the C-terminal sheet. The Ch configuration also exhibited 2 novel peaks the in 1660 cm-1's at 1684 cm-1 and 1698 cm-1 range. (Figure, 4d, Table 3, 4) Finally, the 6-12 helix displayed no native peaks and 5 new cross peaks. (Figure 4e, Table 4)
Table 3 Native Peak Assignment Table 4 Non-Native Peak Assignment 3.3. Residue Contributions and Peak Assignment
One of the primary goals of 2DCS is the assignment of specific cross peaks to particular residue interactions to reveal the three-dimensional structure of proteins [15]. This has also been among the most challenging tasks for the 2DCS spectroscopists. In this work the small size of Beta3s along with the multiple conformations of known structure helps facilitate peak assignment. This is an ideal situation, unique to computation, because experimentalists normally do not have atomic conformations for comparison or work with very large proteins that complicate structure assignment and necessitate isotopic labels to isolate specific peaks [15, 56].
Structure-peak assignment of Beta3s was determined by two methods, conformational difference analysis and normal mode decomposition (NMD). Examination of the Native structure ensemble alone suggests that Beta3s contains two distinct cross peak contributors originating from residues on the C-terminal and on the N-terminal β-strands. This, however, does not address which residues contribute to each peak. NMD analysis of the Native conformation of Beta3s allowed us to ascertain 15 readily identifiable residues that contribute to 15 peaks in the 2DIR spectra. (Table 3, Figure 4a) The eigenvalues from NMD allowed us to approximate the residue carbonyl group origin of peaks by assigning an IR absorption frequency to each residue. (Figure 4a, Table 3) The eigenvalues assigned to each residue by NMD are shown in Table 5. It is important to remember that NMD analysis provides absorption frequencies for each residue but that these do not fully incorporate delocalization and overlap between the modes of the residues.
Table 5 NMD Residue Eigenvalues NMD analysis showed 6 peaks originated from N-terminal strand residue interaction with the central strand and 6 peaks originating from C-terminal to central strand interaction. (Figure 4a, bottom left) Interestingly, NMD also revealed peaks resulting from long range coupling between C-terminal and N-terminal residues (coupled through the central strand) which provides an indicator of degree of Native structure.
NMD analysis assigns the peak at 1636 cm-1 to Ile3 and a peak at 1647 cm-1 to Asn5, both of which couple to central-strand Tyr11 and Asn13 at 1664 cm-1 and 1669 cm-1 respectively. (Table 3) This is consistent with our 1DIR data that under initial conformational analysis suggested a 1636 cm-1 peak originates on the N-terminal strand. Gln4 absorbing at 1648 cm-1 also interacted strongly with Tyr11 and Gln12 producing additional cross peaks at 1664 cm-1 and 1666 cm-1. Peaks originating from Gln4 and Asn5 are established by coupling in the NMD plots in figure 4a (bottom left) but less discernable in the 2DIR spectra. These nearly identical peak locations are likely a result of strong coupling between nearby residues, significant vibrational mode delocalization. Together the peaks from 1636 cm-1 to 1647 cm-1 provide a spectral region representative of the degree native-ness of the N-terminal sheet. Conformational analysis supports these peaks as an indicator of the N-terminal sheet since it was also found that these peaks are not present when the N-terminal region was interrupted as occurs in the Ns conformation.
The native conformation of Beta3s exhibited 6 peaks consistent with residue interactions on the C-terminal β-strand. Specifically, Trp10 and Gly14 were found to interact with Tyr19 and Thr20 producing cross peaks from v|| interactions at 1661 cm-1, 1674 cm-1 and 1682 cm-1 and a 1695 cm-1 respectively. (Figure 4a) Additionally, v|| interactions were noted between residue Lys17 at 1630 cm-1 in the C-terminal sheet and Lys9 and Asn13 at 1659 cm-1 at 1669 cm-1. Conformational analysis further supports the NMD results, since the majority of C-terminal strand residues do not produce cross peaks when interrupted in the Cs conformation. (Figures 4a)
In the Ns conformation the interactions involving Trp2 and Asn5 were not noted corresponding to the Ns out-of-register structural disruption. (Table 3, Figure 4b) Gln4 however, exhibited an interaction with residues Tyr11, Gln12 and Asn13 forming a cross peak between 1645 cm-1 and 1660 cm-1 as well as 1665 cm-1 and 1668 cm-1. Residues on the C-terminal strand produced similar peaks to that of the Native structure. A blue shift of ~5 cm-1 relative to the native conformation was noted for the 1656 cm-1 peak resulting from coupling to Trp10 on the central β-strand. The blue shift has been observed in prior work and occurs as the Amide modes localize as a result of β-sheet unfolding [58]. The majority of the long-range interactions between the C and N-terminal β-sheets were disrupted in this conformation as anticipated. (Figure 4b, Table 3)
The Cs conformation was noted to contain similar peaks to the Native state with the exception of anticipated disruption in the C-terminal strand. (Figure 4c, Table 3) A single C-terminal interaction was noted between Gly14 and Lys17 producing the peak at 1627 cm-1 and 1677 cm-1. Half of the N-terminal peaks were noted with peaks between Gln4 at 1652 cm-1and Lys9, Tyr11 and Gln12.(Figure 4c) Cross peaks at the N-terminal region as a result of Ile3 interacting with Tyr11 and Asn13 were not present in the Cs conformation. Additionally, N-terminal signals due to the Asn5 to Asn13 interaction were not noted in the spectra or by the NMD. Interestingly, new peaks evolved at 1642 cm-1 and 1674 cm-1 as well as 1678 cm-1a result of Gly6 interacting with Lys9 and Gln12 respectively. These spectral signatures reflect that structurally the N-terminal domain is perturbed in the Cs conformation as the C-terminal domain falls out of register. No blue shifts were noted relative to the native conformation in the Cs state. The contact map at the bottom of figure 4c displays the disruption of overall secondary structure. This is unsurprising, since it has been suggested that folding of Beta3s prefers to first fold a structurally stable C-terminal domain that forms a scaffolding to facilitate folding of the N-terminal domain [32–37]. In the spectra of Cs such structural changes are ultimately reflected in the red-shifting of noted peaks relative to the native state caused by the peptide backbone shifting from a β-sheet (1630-1640 cm-1) to a more α-helical conformation (1650-1660 cm-1).
The Ch-curled conformation exhibited complete disruption of the peaks corresponding to the N-terminal region. (Figure 4d, Table 3) Two additional interactions near the N-terminus indicated by peaks at 1663 cm-1, 1684 cm-1 and 1698 cm-1 attributed to Asn5 interacting with Tyr11 and Ile18. (Table 4) The remainder of the N-terminal domain did not produce cross peaks in this conformation. In the Ch-curl structure the C-terminal region is inverted relative to the native state and as such a new set of contacts causes the spectral cross peaks. The new peaks associated with the C-terminal sheet are red shifted as much as 10 cm-1 but are associated with residues similar to or close to those resulting in the C-terminal peaks of the native state. (Table 3 & 4) Ser7 and Thr8 interact with Thr16 at the same location as in the native state at 1629 cm-1 but the cross peak is far shifted Ser7 and Thr8 locations. Largely this is a result of the conformation of residues 7 and 8 which become helical in the turn region because of N-terminal domain disruption.
Most different from the Native structure, the 6-12 helical conformation of Beta3s exhibited none of the native cross peaks. (Figure 4e, Table 3, 4) NMD revealed that although some peaks and modal interactions appear similar to those in the Native state they originate from interactions of different residues. Six new interactions were noted as described in table 4. Local backbone interactions in the 1660 cm-1's between Trp2 and Gln4 and Gly6 as well as those between Asn13, Thr16 and Lys17 are typical of the random coil and α-helical structure noted here. Additionally, the NMD plot reveals significant coupled interactions along the diagonal indicating strong coupling to nearby N+1...N+3 residues [16]. (Figure 4e, bottom left) Such residue interactions are highly local and thus largely in the diagonal of the 2DIR spectra due to their very similar absorption frequencies.
3.4. The Folding Mechanisms
Computational studies by Caflisch et al. have suggested Beta3s folds through two possible pathways [33, 34]. The main folding pathway of Beta3s starts with the formation of the C-terminal side chain contacts followed by the N-terminal contacts. (Figure 2 and Figure 4) Additionally, the reverse was also found to be possible where the N-terminal structure forms first followed by the C-terminal ones. Contacts in the turn regions were also found to form first [33, 34]. The proposed folding pathways have been examined numerous times by many methodologies computationally but never experimentally [32–37]. The 2DCS IR data presented here can be coupled with 2DIR experiments to investigate the intermediates and the order in which they are sampled during the folding of Beta3s.
3.5. Spectral Signatures of Folding
In this study specific spectra-structure correlations have been established that can provide unambiguous indicators of conformational identity during a folding experiment. Specifically, during folding Beta3s may sample the 6-12 helical conformation on its way to the native state. Transition from the 6-12 helix conformation to a more native like structure (β-sheet) is well described by the diagonal signals which decrease from a 1660 cm-1 centered primary peak to one in the 1640 cm-1. Moreover, evolution of folding along the primary pathway, sampling the Ns conformation before the native state, is indicated by the evolution of 1648 cm-1 cross peaks a result of Gln4 coupling as well as signals from the full compliment of C-terminal peaks. (Figure 4b, Table 3) The existence of the Gln4 peaks also differentiates the Ns conformation spectrally from the Ch-Curl conformation which lacks this structure but contains a similar C-terminal spectral signature. (Figure 4d) Finally, in the alternate folding pathway the N-terminal region forms first, thus sampling the Cs conformation prior to the native state. 2DIR spectral analysis along this folding path would include the Gln4 associated cross peaks and nearly none of the C-terminal associated peaks in the 1666 cm-1-1680 cm-1 range. (Figure 4c, Table 3)
3.6. Isotopic Labeling Experiments
Isotope labeling can be used to manipulate 2DCS signals by enhancing desired spectral features as demonstrated by Hochstrasser [60]. 13C and 18O labeling of peptides can induce 65 cm-1 red shift of Amide-I bands providing detailed structural constraints. In this work the different conformations of Beta3s were known and so isotopic labeling is of minimal help in identifying structure-peak correlations because they can be determined by conformational analysis alone, however because the information is not available in experiments, labels may aid in tracking folding. In practice labeling of Gln4 and Tyr19 (both commercially available) could provide insight into coupling of the N-terminal sheet and C-terminal sheet respectively.