1 Introduction

Formation of chain reversals as centers of folding initiation by interactions between hydrophobic amino-acid residues in proteins was proposed for the first time by Matheson and Scheraga [1]. Muňoz and his coworkers [2] presented an idea for the mechanism of the creation of the central part of the β conformations (β-hairpins) by hydrophobic forces and hydrogen bonds. Further study by Karplus et al. [3] underlined that the hydrophobic forces are very important in the protein folding process. In the works that have been published recently [4, 5], it was determined that both local conformational preferences to form a bent structure, and long-range interactions (hydrophobic interactions and hydrogen bonds) are a crucial part in turn formation; however creations of salt bridges by the peptide groups of the main chain does not play the main role in this process.

One approach to investigate very first stages of the formation of the tertiary structure of a protein is based on conformational studies of the parts of proteins with well-defined secondary/supersecondary structure [610]. Investigation of small segments of proteins convey information of local interactions which are separated from the context of the whole protein, and therefore enable us to discern the short-range interactions that are important in formation of the secondary structure blocks in biomolecules. In aqueous solution, some fragments of small proteins can form similar structure to that that a given fragment has in full protein, although such structure is more flexible, has only a small fraction of native hydrogen bonds and residue–residue packing is not exactly as that in the full protein [11]. It has been already known that, for protein folding and stability, the local interaction play a crucial role. Therefore, it should be expected that selected fragments which have the secondary structure will populate native-like structures even with lack of the tertiary contacts.

The conformational analysis of the peptide with the following sequence: Ac-(X)2-(A)7-(O)2-NH2 (where X, A, and O denote diaminobutyric acid, alanine, and ornithine, respectively) by using the NMR method [12] led us to the conclusion that its structure is bent. Further potentiometric-titration studies [13] supported this conclusion because showed that pKa1 for this system is abnormally low (2.72), compared to the value of the model compound of the ornithine side chain. This observation suggests that the distance between the ionizable groups is small, implying a bent shape. Yet further studies [14], enabled us to formulate a hypothesis that the shape of alanine-based oligopeptides, flanked by charged amino-acid residues, depends probably more on the size of the amino-acid side chains of the non-neutral flanking groups of the sequence and less of the kind of their charge (like or opposite). The bent shape of the structure which those peptides assume because of the presence of charged groups (if the side chains are not too long) makes it easier to screen the uncharged alanine segment between them from the solvent. In the peptides which contained flanking residues with longer charged side chains, the effect of protection of alanine core was manifested to a smaller extent [14].

The β-hairpins from the FPB28 protein and from protein G are known to initiate folding of these proteins [4, 5]. They are, therefore, independently forming and not induced β-hairpins and, consequently, are excellent object with which to study the factors that influence β-hairpin formation. The β-hairpin from the FBP28 WW domain has two charged (lysine) residues on the opposite strand. According to our previous research [15], these residues might stabilize the structure. Conversely, the β-hairpin fragment of the IgG protein B3 domain has negatively charged (aspartic acid) residues on one side and a positively charged (lysine) residue on the opposite side. These residues probably stabilize the structure by forming a salt bridge [4, 5]. Consequently, these two systems and their variants are very suitable to investigate the influence of charged residues on β-hairpin formation (Fig. 1).

Fig. 1
figure 1

a NMR structure [44], of the FBP28 WW domain (PDB code: 1E0L); the boxed fragments of the FBP28 WW domain: FPB28 (12-18) D7, and FPB28 (11-19) D9 were synthesized and examined. b X-ray structure [45] of the protein G (PDB code: 1IGD) derived from Streptococcus—B3 domain of immunoglobulin binding protein; the boxed fragment of the 1IGD (51-56) Dag1 and its mutants were synthesized and examined

In our previous work [15] we investigated two peptides: FPB28 (11–19) (D9), with sequence YKTADGKTY-NH2, and FPB28 (12–18) (D7), with sequence KTADGKT-NH2, (Fig. 1a), by using potentiometric titration at different temperatures, circular dichroism spectroscopy (CD), and differential scanning calorimetry (DSC) measurements, respectively. Subsequently, by using the same experimental techniques, we studied [16] the DDATKT-NH2 peptide derived from protein IgG (PDB code: 1IGD 51–56) (Dag1), its variant in which D52 in the original sequence was replace with valine (DVATKT-NH2; Dag2), and two variants in which D52 was replaced with valine and D51 was replaced with ornithine or lysine, respectively (OVATKT-NH2; Dag3, and KVATKT-NH2; Dag4) (see Fig. 1b). These studies suggested that both D7 and D9 peptides undergo a folding-unfolding transition (as indicated by the presence of a maximum in the corresponding DSC curves); nevertheless, a trend of the calorimetric curve indicates that the equilibrium between conformation is established for both peptides. For D7 and D9, the Tm determined from the heat capacity curves is at t ~ 50 °C. Simultaneously, the potentiometric titration studies showed that the values of pKa1 and pKa2 are below 50 °C, which suggest that the two charged groups (lysines) are neighbours and the D7 and D9 have a tendency to be bent at low temperatures (especially D9). Above t = 50 °C (the melting point), pKa1 and pKa2 rise for D9, whereas for D7 the increase is less significant. This results suggests that the peptide turn bend to an shapeless structure and that the basic amino-acids (lysines) stabilizes the ordered state [15].

An analysis of temperature dependence of pKa values in the fragment of IgG protein and its derivatives enabled us to estimate the temperature of conformational transition [16]. In the works summarized above [1216] we did not, however, determine explicitly the structure of the investigated peptides. Therefore, to complete our study, in this work, we investigated the conformation of D7, D9 and Dag1, Dag2, Dag3, Dag4 peptides by using the NMR spectroscopy and molecular dynamics (MD) simulations with restraints.

2 Materials and Methods

2.1 Peptide Synthesis

All peptides under study were synthesized by using the procedure describing in our earlier work [14]. The purities of the peptides were 99.98, 99.99, 99.99, 99.95, 99.98, 99.98 % for D7, D9, Dag1, Dag2, Dag3 and Dag4, respectively, as assessed by the analytical HPLC and MALDI-TOF analyses.

2.2 NMR Measurements

The NMR experiments of D7, D9 and Dag1, Dag2, Dag3 and Dag4 peptides were measured on the spectrometer VARIAN 500-MHz Unity-Plus instrument in the Intercollegiate NMR Laboratory at the Technical University of Gdansk. The following 2D 1H–1H were recorded: DQF-COSY [17], TOCSY [18] (80 ms) and ROESY [19] (200 ms) at 303 K. The samples were dissolved in H2O/2H2O (9:1 by vol). The pH value of D7 and D9 peptides was about 7 and of all Dag peptides about 5.5. The concentration of each of the samples was ~5 mM. The spectra were processed by using VARIAN 4.3 software (Varian Instruments, Palo Alto, CA) and analyzed with the SPARKY program [20]. The spectra were calibrated against DSS (sodium 4,4-dimethyl-4-silapentane-1-sulfonate) signal [21]. Proton signals were assigned based on the TOCSY spectra. The ROE inter-proton cross-peaks of all peptides under study were derived from 2D 1H–1H NMR ROESY spectra. In the first step, the ROESY peak volumes were converted to upper distance bounds by using CALIBA [22] of the DYANA package [23]. In the next step, torsion angles, based on the Bystrov–Karplus equation [24], were generated by using the HABAS algorithm of the DYANA package [25]. The upper distance limits and torsional angles were used as restraints in MD calculations.

2.3 Molecular Dynamics Calculations with NMR-Derived Restraints

Molecular dynamics calculations (MD) were carried out by using the AMBER 11 program [26, 27] with the AMBER ff99SB force field at constant volume and temperature (the NVT scheme). All simulations were performed in a periodic box of TIP3P water [28]. The linear dimensions of the box were 40.94 Å × 59.4 Å × 41.1 Å with volume 99,836.397 Å3 for D7, 59.71 Å × 59.38 Å × 41.07 Å with volume 145,620.203 Å3 for D9, 59.714 Å × 40.602 Å × 41.07 Å with volume 99,576.006 Å3 for Dag1, Dag2, Dag3, and 59.71 Å × 59.34 Å × 41.07 Å with volume 145,620.203 Å3 for Dag4, respectively. The particle-mesh Ewald procedure [29, 30] was used for the calculations of long-range electrostatic interactions at T = 303 K. All simulations were performed in a periodic box of TIP3P water (The pH of D7, D9, Dag1, Dag2, Dag3 and Dag4 peptides solutions was 3.8, 4.2, 3.3, 4.3, 5.6, 5.9, respectively. This means that all Lys side-chain amino groups were protonated and that the Asp side-chain carboxyl group was deprotonated. The Berendsen thermostat was implemented to maintain constant temperature. The peptides studied were terminally-blocked and counter ions were added for both systems studied to neutralize the peptide charge. The total simulation time was 10 ns for each trajectory, and the integration time step was 2 fs. The time-averaged restraint method (TAV) [29, 31, 32] was used to include experimental values for the calculations, with interproton-distance restraints (calculated from the intensities of the ROE signals Fig. 2). In our earlier work [1214, 33, 34], we noticed that all-α-helical conformations are often formed if anti-ROE restraints are not imposed. Therefore, anti-ROEs between j ≥ i + 2 residue pairs (where i and j denote residue numbers) were included for those pairs of protons for which no ROE signals were observed. This means that the protons involved were assumed to be separated by more than 6 Å, which is justified because ROE signals would be observed if the distances were smaller than 6 Å.

Fig. 2
figure 2

The ROE effects and the values of the vicinal 3JHN/Hα coupling constants for D7 (a), D9 (b), Dag1 (c), Dag2 (d), Dag3 (e) and Dag4 (f) peptide

The numbers of interproton-distance restraints were: 69 for D7, 74 for D9, 72 for Dag1, 85 for Dag2, 59 for Dag3, 88 for Dag4 (including anti-ROE). The interproton distances for all three peptides were restrained with the force constant k d  = 20 kcal/(mol × Ǻ2), and the angles with k θ  = 2 kcal/(mol × rad2), respectively. The force constants corresponding to anti-ROE restraints were the same as those corresponding to ROE restraints.

5,000 conformations (the conformations were collected every 1,000 steps) were obtained for each peptide and the last 1,200 of those for D7 and D9 and last 800 for Dag1, Dag2, Dag3 and Dag4 were analyzed. The number 5,000 refers to the number of snapshots and not to the number of conformations. In production calculations, a single trajectory per peptide was run. Initially, for selected peptides we ran four independent trajectories and compared the resulting structures. They did not depend on trajectory. Moreover, to address the ergodicity issue, additional simulations at T = 400 K and T = 500 K, respectively, were run for D9 and Dag1. These simulations resulted in the same conformational families as those obtained at T = 300 K.

The set of the final conformations was clustered by using the MOLMOL program [35]. MOLMOL uses a hierarchical minimal spanning tree method [3639]. The rms-deviation cut-off values of 2.5 Å for D7 and 3.0 Å for D9, respectively, over the Thr1-Lys7, Tyr1-Tyr9 residues and of 2.5 Å for Dag1, Dag2, Dag3 and Dag4 over Asp1-Thr6 (Dag1, Dag2), Orn1-Thr6 (Dag3), Lys1-Thr6 (Dag4) residues was used for the clustering. These rmsd cut-off values are relatively large in view the small size of the molecules under study. However, because of high flexibility of the molecules under study, these relatively large cut-off values were required to dissect the sets of conformations into a reasonably small number of clusters.

3 Results

3.1 NMR

The assignments of the proton NMR spectra of the D7, D9 and Dag1, Dag2, Dag3 and Dag4 peptides were done according to the standard Wüthrich procedure [40] by using the DQF-COSY, TOCSY and ROESY spectra. The chemical shifts of the proton resonances for all six peptides are listed in Table 1a–f. The fingerprint region of the TOCSY spectra, with peak assignments of D7 (A), D9 (B), Dag1 (C), Dag2 (D), Dag3 (E) and Dag4 (F) are shown in Figure 1S (supplementary materials).

Table 1 A–F proton chemical shifts (ppm) at 303 K for D7 (A), D9 (B), Dag1 (C), Dag2 (D), Dag3 (E) and Dag4 (F)

The rotating-frame Overhauser enhancement (ROE) effects corresponding to the interproton distances and the 3JHN/Hα coupling constants of D7, D9, Dag1, Dag2, Dag3 and Dag4 are shown in Fig. 2 and |i − j| > 1 interactions (where i and j are the residue number) are summarized in Table 2.

Table 2 Hydrogen atoms of residues separated by at least 2 residues in sequence (|i − j| > 1) between which ROE peaks were found in D7, D9, Dag1, Dag2, Dag3 and Dag4 peptides

3.2 D7 and D9 Peptides

For D7 peptide the Hα(i)–HN (i + 1), Hβ(i)–HN(i + 1) and HN(i)–HN(i + 1) sequential ROE signals were observed (Fig. 2a). We also observed few short-range (i)–(i + 2) or (i)–(i + 3) interactions between residues located mainly in the middle part of the D7 sequence (see Fig. 2a).

The D9 peptide differs from the D7 peptide by two Tyr residues at both ends of the peptide chain, which certainly influence the structure of D9. We observed strong sequential Hα(i)–HN(i + 1) signals along the whole amino acid sequence (see Fig. 2b) in the ROESY spectrum. The HN(i)–HN(i + 1) were observed between residues Thr3-Ala4-Asp5-Gly6-Lys7 and Thr8 (Fig. 2b). We also observed long-range (i)–(i + 6) and (i)–(i + 7) interactions between Tyr1-Thr8 and Lys2-Thr8 residues, respectively.

3.3 Dag1, Dag2, Dag3 and Dag4 Peptides

We identified 56, 64, 60 and 67 ROE signals for Dag1, Dag2, Dag3 and Dag4, respectively. For each of the analyzed Dag peptides Hα(i)–HN(i + 1) and Hβ(i)–HN(i + 1) sequential ROE connectivities were always observed (see Fig. 2c–f). We also observed HN(i)–HN(i + 1) sequential ROE interactions for all Dag peptides, especially in the central part of the peptide chain.

4 Discussion

We found in the TOCSY spectra (Figure 1S) only one set of chemical resonances for all amino acid residues, what indicates that all peptides exist as a mixture of conformations with the tendency of the inter-conversion.

The HN(i)–HN(i + 1) sequential ROE interactions can provide either an α-helix or β-turn-like structure [40]. In the β-turn like structure, long-range interactions, which fasten together the two lines, should also be observed. Such signals are observed between Thr2-Asp4, Ala3-Gly5 and Thr2-Gly5 residues for D7 peptide (Fig. 2a; Table 2). This results suggests that the shape of the conformation of D7 is bent in the middle part of the sequence and stabilized by this particular interactions. In NMR spectra we did not observe any interactions, that could clip the two ends together. However extension of D7 sequence of one amino acid residue on the N and C-terminus clearly indicated the bent-conformation preferences of these sequence.

The observed long-range interactions for D9 indicates that the D9 peptide adopts a bent structure and has a U-shape, which is in good agreement with pKa values measured for these peptides in our earlier potentiometric study [15].

All observed sequential HN/HN interactions showed that the distance between the HN(i)–HN(i + 1) atoms of Ala3/Thr4/Lys5 in all Dag peptides never exceeds 3.8 Å. For the fully extended conformation, the distance between HN(i)–HN(i + 1) atoms is about 4.3 Å. This observation suggests that the main chains of Dag1, Dag2, Dag3 and Dag4 peptide have some tendency to bend in the middle part of the sequence. On the other hand, long-range interactions are not observed for theses peptides, which suggests that their ends are, on average, not close to each other. This fact can be explained in terms of increased flexibility of the ends of the peptides which might, in turn, be caused by the presence of structures with a sequence of turns directing the chain in opposite directions (such as, e.g., two consecutive turns of a helix). Only the second-neighbor interactions between Val2 and Thr4 residues (see Table 2) are observed for Dag2, Dag3, and Dag4. This observation is in clear contrast to the D7 and D9 peptides, for which long-range interactions were observed.

It should be also emphasized that both 1H-alpha chemical shifts and vicinal 3JHN/Hα coupling constants (with values ranging 5.9–8.3 Hz, see Fig. 2c–f) of all analyzed Dag peptides indeed indicate the random coil structure, however the conservative (i)–(i + 2) interaction observed in the Dag2, Dag3 and Dag4 peptides denote a strong inclination to have a turn-like structure. Except for the interaction mentioned above we also observed many (i)–(i + 1) connectivities between residues located in the middle part of the sequence in all analyzed Dag peptides. It has been demonstrated [41] that the Dag1 peptide tends to adopt a bent conformation due to its amino acid composition, especially at lower temperatures. Moreover, the HN(i)–HN(i + 1) ROE inter links appear in the Dag1 peptide along the entire amino acid sequence while, in Dag2, Dag3 and Dag4 peptides, such signals are observed only between residues 3–4, 4–5 and 5–6. This observation suggests that the main chain of Dag1 peptide is slightly more stable than that of Dag2, Dag3 and Dag4.

In summary, analysis of the NMR spectra clearly indicates the existence of bent shape conformation of the D9 peptide with a turn located between residues {Ala-Asp-Gly}. The presence of long-range connectivities in D9 suggests that this peptide has a quite well defined structure despite its relatively small size. For D7, only a short-range interaction between Ala3(αH) and Gly5(NH) is observed, which suggests the presence of a turn in this region. The presence of a turn in D7 (there are i − i + 2 and i − i + 3 interactions seen in Fig. 2a) is supported by NMR data. It should be stressed that both 1Hα chemical shifts and vicinal 3JHN/Hα coupling constants estimated for all six analyzed peptides indicate the statistical coil structure; however, the conservative (i)–(i + 2), (i)–(i + 3) interactions observed in the D7, Dag2, Dag3 and Dag4 peptides and (i)–(i + 6), (i)–(i + 7) interactions observed in the D9 peptide might denote a tendency to form a turn-like structures.

4.1 Molecular Dynamics

The main families of conformations of two peptides (two for each of them), obtained by MD simulations with TAV derived from NMR measurements and clustered by using the MOLMOL [35] program, are shown in Fig. 3. The two dominant families constitute 40.5 and 37 % of the conformational ensembles of D7 and 39 and 28 % of D9, respectively. The two families of D7 consist of slightly bent conformations and differ by the position of the turn, which occurs at {Ala3-Asp4-Gly5} for the first and at {Thr2-Ala3-Asp4} for the second family (Fig. 3a). The conformations of (Fig. 3b) D9 have more bent shape compared to those of D7 and only the ends are flexible while the central part seems to have a well-defined bent structure. It should be noted that D9 in contradiction to D7 has, in the sequence, two flanking hydrophobic residues which play important role in hairpin formation. As in D7, the conformations of D9 of the two dominant families differ by the position of the turn (Fig. 3b). To check if the presence of some of the observed structural features could result from applying the force field, for D9 we carried out a test run without using any restraints. It was found that the dominant conformations were completely different from those obtained with restraints; in particular, no hairpin-like structures appeared (data not shown). It can, therefore, be stated that the experimental information and not the force field used determined the obtained conformations.

Fig. 3
figure 3

The main family(s) clustered by using the MOLMOL program (hierarchical minimal spanning tree method) [3740] and the most representative conformations for D7 (a), D9 (b), Dag1 (c), Dag2 (d), Dag3 (e) and Dag4 (f) peptide

The average conformations of families 1 of D7 and D9 superpose well on the corresponding section of the native FBP28 WW domain (PDB: 1E0L) (Fig. 4a, b), with Cα root mean square deviation (rmsd) of 1.477 and 2.822 Å for D7 and D9, respectively. This observation suggest that the loop corresponding to the D7 and D9 sequence serves as the nucleation site in the folding of the FBP28 WW domain.

Fig. 4
figure 4

The structural alignment between the most representative conformation of a D7, b D9 peptides with the corresponding fragment of FBP28 WW structure and c Dag1, d Dag2, e Dag3, f Dag4 peptides with the corresponding fragment of 1IGD protein structure

The representatives of the main families of Dag1, Dag2, Dag3 and Dag4 peptides are shown in Fig. 4c–f. It can be seen (Fig. 4c, d) that Dag1 and Dag2 have clearly bent shapes, the bend being more pronounced for Dag1. It can be seen (Fig. 4c, d) that the conformations of these peptides superpose well on the corresponding section of the native IgG protein (PDB: 1IGD). In earlier NMR studies at low temperature [41], the interactions between Asp2 and Lys5 were observed for the Dag1 peptide. These studies have also suggested that the Dag1 fragment can be nucleation site of the IgG protein. Our study suggests that Dag2 also retains the nucleation propensity, which is, therefore not lost upon replacement of Asp2 with Val in Dag1.

Some tendency to form a chain reversal still persists for Dag3 (Fig. 4e) and, to a lesser extent, for Dag4 (Fig. 4f); however, the overall shape of Dag4 is already quite extended. Differences between shapes of the four peptides (Dag1–4) under studies are also reflected by the averaged value of the radius of gyration: 5.5, 6.0, 6.4, 7.0 Å for Dag1, Dag2, Dag3 and Dag4, respectively. This results are in harmony with our previous potentiometric studies [16], which showed that the pKa values of their basic side-chain groups are not much different from those of the reference compounds representing the isolated side chains. These peptides do not superpose that well on the corresponding section of IgG as Dag1 and Dag2 (Fig. 4e, f), which suggests that nucleation propensity is lost upon substituting a basic residue for Asp.

5 Conclusions

We carried out conformational studies by using the experimental (NMR) and theoretical methods of two peptides derived from the N-terminal β-hairpin of the form in binding protein (FPB28): KTADGKT-NH2 and YKTADGKTY-NH2, of the peptide with sequence of the immunoglobulin binding protein G from Streptococcus (DDATKT-NH2; Dag1) (the C-terminal β-hairpin of the B3 domain), of its variant, in which one aspartic acid (negatively charged) residue was replaced with a non-polar valine residue (DVATKT-NH2; Dag2), and of its two variants in which, in addition, the second aspartic acid residue was replaced with a basic (positively charged) residue (OVATKT-NH2; Dag3 and KVATKT-NH2; Dag4). It was found that both fragments of the FBP28 form bent conformations. However, the structure is more definite for the longer (D9) sequence. The D7 peptide is bent in the middle of the sequence but more flexible at the ends, in comparison with D9, because of the lack of two Tyr residues at both ends of the peptide chain, which certainly influence the structure of D9. It should be emphasized that no hydrophobic contacts between the terminal Tyr residues of D9 occur. This observation suggests that the presence of a pair of like-charged lysine residues could stabilizes chain reversal by forming a charged “coat” around the molecule.

The six-residue fragment of IgG (Dag1), and its derivatives, have some tendency to bend in the middle part of the sequence. However, the Dag3 and Dag4 peptides have clearly diminished propensity to forming chain reversals, compared with the Dag1 and Dag2 peptides. Thus, the side chains, which are oppositely charged, in Dag1 and in Dag2 have an influence on conformation to be more compact because of a higher probability of forming salt bridges in comparison with Dag3 and Dag4, while introduction of a positively charged amino-acid residue into the sequence leads to more extended conformations. It should be noted that the two positively charged residues are separated by three residues in Dag3 and Dag4 and by four residues in D7 and D9, which can be the reason why the presence of two positively charged residues disrupts chain reversal for Dag3 and Dag4, while it seems to stabilize it for D7 and D9. For shorter separation, the presence of a salt-bridge, which was observed in a previous study of Dag1 [41] seems to stabilize chain reversal. The mechanism of hairpin formation was investigated in several other studies [4, 5, 4143, 4648]. It was found that both local propensities that lead to the formation of a turn in a certain segment of a polypeptide chain and the formation of short-range hydrophobic contacts play a comparable role in hairpin formation. Hairpin formation thus starts from turn formation followed by zippering [4648]. In the present work we suggest that, apart from hydrophobic interactions, the screening by charged residues can also contribute to zippering.