Electronic polarization stabilizes tertiary structure prediction of HP-36

Duan, Li L.; Zhu, Tong; Zhang, Qing G.; Tang, Bo; Zhang, John Z. H.

doi:10.1007/s00894-014-2195-7

Electronic polarization stabilizes tertiary structure prediction of HP-36

Original Paper
Open access
Published: 09 April 2014

Volume 20, article number 2195, (2014)
Cite this article

Download PDF

You have full access to this open access article

Journal of Molecular Modeling Aims and scope Submit manuscript

Electronic polarization stabilizes tertiary structure prediction of HP-36

Download PDF

Li L. Duan^1,2,
Tong Zhu³,
Qing G. Zhang¹,
Bo Tang² &
…
John Z. H. Zhang^3,4

3146 Accesses
10 Citations
Explore all metrics

Abstract

Molecular dynamic (MD) simulations with both implicit and explicit solvent models have been carried out to study the folding dynamics of HP-36 protein. Starting from the extended conformation, the secondary structure of all three helices in HP-36 was formed in about 50 ns and remained stable in the remaining simulation. However, the formation of the tertiary structure was difficult. Although some intermediates were close to the native structure, the overall conformation was not stable. Further analysis revealed that the large structure fluctuation of loop and hydrophobic core regions was devoted mostly to the instability of the structure during MD simulation. The backbone root-mean-square deviation (RMSD) of the loop and hydrophobic core regions showed strong correlation with the backbone RMSD of the whole protein. The free energy landscape indicated that the distribution of main chain torsions in loop and turn regions was far away from the native state. Starting from an intermediate structure extracted from the initial AMBER simulation, HP-36 was found to generally fold to the native state under the dynamically adjusted polarized protein-specific charge (DPPC) simulation, while the peptide did not fold into the native structure when AMBER force filed was used. The two best folded structures were extracted and taken into further simulations in water employing AMBER03 charge and DPPC for 25 ns. Result showed that introducing polarization effect into interacting potential could stabilize the near-native protein structure.

Energetics of protein backbone hydrogen bonds and their local electrostatic environment

Article 05 November 2014

Correct folding of an α-helix and a β-hairpin using a polarized 2D torsional potential

Article Open access 03 June 2015

Development of SAAP3D force field and the application to replica-exchange Monte Carlo simulation for chignolin and C-peptide

Article 17 November 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Prediction of protein’s native structure from its sequence is still one of the greatest challenges in computational biology. Molecular dynamics (MD) simulation, serving as a complementary means to experimental probe, is capable of providing all kinetic and thermodynamic information on protein folding. Due to the limitations in computing power and accuracy of force field, applications of computational investigations of protein folding are restricted to small “fast folders” [1–3]. The villin headpiece subdomain, a 36 residues peptide (HP-36), has served as a benchmark system for the study of protein folding by all-atom molecular dynamics simulations [4–11]. It is one of the smallest proteins that can fold autonomously without the assistance of disulfide bonds, metal ions, or non-natural amino acids. It is thermally stable with the melting temperature over 70 °C in aqueous solution [12]. Its merits of small size, high stability, fast folding kinetics (10–100 μs) [4] make it an ideal prototype for both experimental and computational studies of protein folding [4–6, 13–33]. Several in silico folding and unfolding studies of HP-36 have been carried out [4–7, 21, 27, 34–45] employing either implicit or explicit solvent model. Duan and Kollman [4] performed a 1 μs folding simulation of HP-36 in explicit solvent using AMBER96 force field, from which they proposed a possible folding pathway for this protein. Shen and Freed [5] performed a 200 ns folding simulation for HP-36 using a simple and fast implicit solvent model and found a native-like structure. Hansmann and Wille [42] used a novel heuristic global optimization approach to predict the structure of HP-36 and found a 3D structure very close to the experimental one. Zagrovic et al. [6] used worldwide distributed computing techniques to simulate the folding of the villin headpiece in atomistic detail. Starting from an extended state, they obtained an ensemble of folded structures and the best Cα RMSD was close to 3.8 Å. They found that the folding mechanism of HP-36 was highly consistent with the hydrophobic collapse view of folding. Srinivas and Bagchi [34] performed Brownian dynamics simulation to study the qualitative folding features of HP-36, and they obtained funnel-like energy landscape for protein folding and found a stable native conformation. Fernández et al. [35] revealed three-body correlations in which hydrophobic residues positioned to protect amide-carbonyl hydrogen bonds from attack by water are important in guiding the folding process of HP-36. An ab initio HP-36 folding simulation at 400 K using AMBER force field and GB model for 15 ns by Jang et al. [26] found that the initial hydrophobic collapse and the rapid formation of helices played important roles in the early stage of folding. Also, in this folding simulation, they observed several native-like conformations with the backbone RMSDs from the NMR structure below 4 Å. The simulated free energy profiles indicated that HP-36 adopted two-state thermodynamic behavior. In a molecular dynamics simulation with Poisson-Boltzmann (PB) implicit solvent model and self-guiding force, Wen et al. found that HP-36 folding started with hydrophobic collapse, followed by the formation of helices [39]. Within these efforts, the best sampled conformation of HP36 with a 1.3 Å Cα RMSD for residues 9–32 had been obtained by Jayachandran et al. using worldwide distributed computing techniques [10]. Yang et al. [28] performed a large number of ab initio Monte Carlo folding simulations using a simple transferable all-atom potential [46] for HP-36, and found that the initial collapse was accompanied by the formation of both helix 2 and helix 3, while helix 1 formed at a slower rate. The transition state ensemble in their simulations contained structures with native-like conformations of helix 1 and helix 2. Best optimized AMBER03 force field with a simple correction to the backbone potential, which improved secondary structure balance [47]. With this force field, they found that the villin headpiece reached the folded state in 50 ns and fluctuated about its folded state after the initial folding [33]. Later, they showed that the combination of the optimized AMBER03 force field and TIP4P/2005 superior water model was capable of providing a more accurate description for the solvation of the unfolded state over a wide range of thermodynamics conditions [48]. The new model showed that predicted helical propensities were in better agreement with experiments and yielded a more realistic collapse of unfolded conformations with increasing temperature. Among those studies, the ground breaking folding simulation for 35 residues villin headpiece was first carried out by Shaw group [49], which extended to over 100 μs near the melting temperature in explicit water using the modified CHARMM force field by a specialized supercomputer Anton. Shaw et al. [49, 50] found that the most representative structure of the folded state fell within 2.0 Å RMSD of the native structure. HP-36 had also been used in experimental studies. Hansen et al. [51] used circular dichroism (CD) spectroscopy to monitor the photoinitiated folding of HP-36 in a nondenaturing environment. Wang et al. [16] measured the folding rate (on the order of 10⁴ s⁻¹) of HP-36 using dynamics NMR line-shape analysis method. The study found the protein folded on the time scale of 10 μs. A recent experimental study indicated that HP-36 was stabilized by tertiary interactions involving Phe47, Phe51, Phe58, and Val50, and there was significant residual structure in the denatured state of HP-36 which was not due simply to locally stabilized structure [15]. In 2005, experiments with equilibrium Fourier transform infrared (FTIR) and temperature jump (T-jump) IR spectroscopic techniques by Brewer et al. predicted the folding and unfolding times of HP-36 to be 3.34 μs and 6.97 μs respectively at 49.9 °C [17].

HP-36 consists of three short helices (referred to as helix 1, 2, and 3 hereafter) for residues 4 to 8, 15 to 18, and 23 to 30. In between are a loop (residues 9 to 14), a turn (residues 19 to 22), and a closely packed hydrophobic core [4, 12, 52]. In this work, MD simulations were performed to study the folding dynamics of HP-36 starting from an extended conformation in both implicit and explicit solvent models. AMBER03 force field was employed, and the simulations extended to 400 ns. Our results showed that the secondary structure formed on the 50 ns time scale while the tertiary structure was hard to fold in the whole MD simulation. Then the two best folded structures were used as the starting structures for MD simulations in explicit water for 25 ns employing two sets of atomic charges, i.e., AMBER03 charge and the dynamically adjusted polarized protein-specific charge (DPPC). The latter is derived from quantum mechanical calculation for protein using the molecular fractionation with conjugate caps approach [53] combined with the PB solvation model and is updated on-the-fly to account for the dynamic fluctuation of the structure [54, 55]. DPPC includes electronic polarization effect and can provide more accurate electrostatic interactions which has a significant impact on the structure and function of the protein [56]. It had been implemented in several studies, such as pKa of an inner residue in protein prediction [54], secondary structural maintenance [56, 57], NMR J-coupling calculation [58, 59], protein/ligand binding [60], and protein folding [55]. It has been well accepted that one of the limitations in traditional force fields is the lack of explicit electronic polarization, which may lead to failure in protein folding simulation. We found that the two best folded structures were more stable under DPPC than under AMBER charge, indicating the importance of explicit treatment of polarization effect in MD simulation. Our recent work [61], which included the polarization effect in MD simulation, demonstrated the electronic polarization energetically stabilized the helix structure.

Methods

Starting from linear structure generated by LEaP module in AMBER, two 400 ns MD simulations were carried out with implicit (GB^OBC (Onufriev, Bashford and Case model)) [62, 63] and explicit solvent models respectively. AMBER03 force field was employed for both simulations. In the simulation with implicit solvent model, the initial structure was first optimized with steepest descent minimization method for 10,000 steps, and then further relaxed with conjugate gradient method until convergence was reached. The minimized structure was then heated up to 300 K in 100 ps, followed by a MD simulation with a time step of 2 fs. Temperature was regulated using Langevin dynamics [64] with the collision frequency set to 1.0 ps⁻¹. The dielectric constants of the protein interior and of the solvent were respectively set to 1.0 and 78.5. Nonbonded interactions were fully counted without any truncations. All the bonds involving hydrogen atoms were constrained by SHAKE algorithm [65] and the salt concentration was set to 0.2 M. The trajectory was saved every 1 ps. In the simulation with explicit solvent model, the protein was placed in a truncated octahedral periodic TIP3P water box with the distance from the surfaces of the box to the closest atoms of the solutes no less than 10 Å. Counterions were added to neutralize the system. Then the system was relaxed in a standard procedure: firstly, only the solvent molecules were free to move, while protein atoms were constrained by an external force; secondly, the whole system was optimized until convergence was reached. After that, the system was heated from 0 to 300 K in 300 ps and then a MD simulation was performed in NPT ensemble to further relax the system without any restraints on solute atoms. Integral time step was set to 2 fs. The Langevin dynamics [64] with a collision frequency of 1.0 ps⁻¹ was applied to regulate the temperature. SHAKE algorithm [65] was employed to fix all bonds involving hydrogen atoms. The trajectory was saved every 1 ps and those snapshots were taken in production run for detailed analysis. All the simulations were carried out using AMBER 10 simulation package [66].

Starting from the intermediate structure (with the backbone RMSD of 6.5 Å) obtained in the implicit solvent simulation at 110 ns, another two simulations with both AMBER03 force field and DPPC were further performed using implicit solvent model for 25 ns. DPPC is initially taken from AMBER force field, but atomic charges are periodically updated during the simulation using PPC scheme [54]. The basic procedure for generating PPC can be described as follows: Firstly, to obtain initial electron density of the protein, gas phase quantum mechanical (QM) calculation of protein is performed with the MFCC approach [53]. Secondly, to calculate electrostatic potential of each residue, the restrained electrostatic potential (RESP) procedure is used to fit atomic charges. Thirdly, solution of PB equation is carried out to generate discrete induced charges on the cavity surface. Fourthly, surface charges which are used to mimic the solvation effect are taken as background charges to preform again QM calculation of protein fragment. Finally, the new atomic charges are used again to calculate new solvent induced charges, and the solute and solvent polarize each other until convergence is reached. All QM calculations are performed at B3LYP/6-31G* level. Because PPC is derived on the basis of a single protein structure, for the simulation with a large conformational change, a fixed PPC will bias the simulation. In the current work, we employ a dynamically adapted charge scheme. In principle, updating charges of all atoms at every MD step are indispensable, but it can be prohibitively demanding, due to large computational overhead of QM calculations. Force field parameters other than atomic charges were kept the same as those in AMBER03 simulations.

Then the two best folded structures (with the lowest backbone RMSD) extracted from the trajectories using implicit solvent model and explicit water model were further refined by simulations in explicit water. Both AMBER03 force field and DPPC were utilized. The simulation time was limited to 25 ns and charge updates were carried out every 100 ps in this work to maintain the balance between efficiency and accuracy. The free energy landscapes are determined by calculating the normalized probability employing the weighted histogram analysis method (WHAM) [67–69] from density of state P(X) = exp[−βW(X)]/Z where X is any set of reaction coordinates, P(X) is the probability at X, and Z is the partition function. The relative free energy can be easily expressed as G(X2)-G(X1) = −RTln[P(X2)/P(X1)] [70]. QM calculations were carried out by Gaussian 09 [71], and charge updates during the simulations were performed by AMBER 10 with in-house modifications.

Results and discussion

Simulation in implicit solvent

Folding of individual helix

The backbone RMSDs from the native structure (PDB ID: 1VII) [52] for the whole protein, and for helix 1, helix 2, and helix 3 are shown in Fig. 1. The formation of helix 1 and helix 2 was very fast, which took only 6.5 ns and 1.5 ns respectively. In the remaining time, these two helices remained generally stable with only small fluctuations in RMSD (< 1 Å). However, the formation of helix 3 was much slower (55 ns for the first folding event), but its structure was stable for the rest of the simulation, except that it underwent two unfolding and refolding processes at about 110 ns and 350 ns. RMSD of the whole protein generally fluctuated between 3.5 Å and 10 Å. The best folded structure occurred in 43.5 ns with the lowest RMSD of 2.6 Å, as shown in Fig. 1, which contained three folded helices and a distorted loop between helix 1 and helix 2. Some intermediate folders with the backbone RMSD around 3.5 Å were found in the trajectory from 15 to 50 ns, 175 to 185 ns, and 220 to 270 ns, but these intermediates were not energetically favorable, and the structure quickly drifted away. In contrast, formation of the tertiary structure was very difficult. Folded state, defined by RMSD below 3.5 Å, was rarely seen. Its occurrence was only around 0.45 %. The observation was in good agreement with the experimental observation that the secondary structure could form in the early stage of folding, while the formation of tertiary structure was a much later event [72].

The influence of loop and hydrophobic core

A previous study by Duan et al. [21] suggested that the hydrophobic core formed by LYS8, LEU35 and PHE36 played an important role in stabilizing the structure. In our simulation, this hydrophobic core had not been formed. Besides, we noticed that there was a strong correlation between the total RMSD and that of the loop plus hydrophobic core with the correlation coefficient 0.93, as shown in Fig. 2. Although the potential energy was still stable along the entire trajectory demonstrating a well-behaved MD simulation, the loop and hydrophobic core were not formed, which resulted in a violent fluctuation of the total RMSD. The representative conformation of the peptide from the most populated cluster has a backbone RMSD of 4.2 Å from the native structure. Both of them were shown in Fig. 2. It clearly demonstrated that the length of helix 2 was too long, which narrowed the space that could be visited by loop between helix 1 and helix 2, thus hindering the packing of helix 1 over the other two. This phenomenon was also observed by Rajan et al., who found that helix 1 and helix 2 almost merged together and the structure was very stable in the first 3 μs [73].

The distribution of backbone torsions in loop and turn regions

Then the free energy landscapes were constructed by using the main chain φ (C-N-Cα-C) and ψ (N-Cα-C-N) angles, it can be easily found that the distributions of main chain torsions of helix 1, helix 2, helix 3 under MD simulations and NMR structure were very close (see Fig. 3). While the distributions of main chain torsions in turn and loop regions were far away from the native state and the corresponding free energy landscape was shown in Fig. 4. Figure 4a showed the distribution of the main chain torsion of ALA9 located at (φ, ψ) = ((−72˚, −45˚), which was far away from that in the native structure (−79˚, 39˚). This mismatch also applied to residues from VAL10 to THR14 in the loop region (Fig. 4b–f). Those lowest free energy states were far away from those corresponding native states except that the torsion of PHE11 fell within the proximal region of the native state. The distributions of the torsions for ALA19 and ASN20 in turn region were also shown in Fig. 4g–h, of which the local minima were near (−69˚, −22˚) and (−75˚, −27˚). The lowest free energy structure shifted from the native state (−85˚, 28˚) and (−107˚, −15˚) significantly. The distributions of torsions of LEU21 and PRO22 in turn region agreed with the native state.

The effect of electrostatic polarization

To study the influence of polarization effect on the structure, we started from the intermediate structure (occurred in 110 ns with an RMSD of 6.5 Å) obtained from the above simulation and performed MD simulation with both AMBER03 force field and DPPC for 25 ns using implicit solvent model. The time development of the backbone RMSD was shown in Fig. 5a. Here we use the native structure as the reference structure in the calculation of the backbone RMSD. It can be seen that the peptide did not show evidence of folding toward the native structure under AMBER simulation. The RMSD fluctuation was dominated around 8.0 Å. However, in the DPPC simulation, the RMSD underwent a rapid fall from an initial value of 6.5 Å to around 3.5 Å at about 3 ns and then fluctuated around this value in subsequent MD simulations. This denoted that the protein stayed in the native-like conformation and this was supported by Fig. 5b, which plotted the intermediate structure and two sets of final structures of MD simulation using AMBER and DPPC respectively. In the intermediate structure, helix 2 was not formed and helix 3 was only partially formed. After DPPC simulation, those helices were completely formed and the tertiary structure was generally consistent with the native structure. In comparison, although those helices were formed in the final structure from AMBER simulation, the overall topology was far away from the native state. This result indicated that DPPC simulation can be applied to restore the intermediate structure toward native state of the protein.

It should be mentioned that the result based on single trajectory simulation of DPPC may not be sufficiently used to support our conclusion. So another trajectory was run using DPPC starting from the same intermediate structure but with a different random seed. This is shown in Fig. 5c in which the backbone RMSD was plotted as a function of the simulation time from two DPPC trajectories. As shown in this figure, the latter trajectory reached the folded structure in about 5.5 ns which was slightly longer than in the former DPPC simulation. Although there were some differences in the folding path and steps, the tertiary structure had formed and the folded structure was stable in each individual trajectory.

Simulation in explicit solvent

Folding of individual helix

In explicit solvent, the peptide was still in denatured states and the folded structure was scarcely found during the whole MD simulation (400 ns) using AMBER force field. The backbone RMSDs of the whole protein and helices 1 to 3 are shown in Fig. 6. Helix 1 began to form in about 7 ns and helices 2 and 3 were in about 15 ns, but their stabilities were very vulnerable with RMSD fluctuating around 1 ∼ 3 Å before 200 ns and then stabilized in subsequent MD simulation. It could be seen that the backbone RMSD fluctuated at about 7.5 Å and its occurrence was only 0.04 % for which the RMSD was below 3.5 Å, suggesting that the peptide was still completely denatured. It can be seen from the RMSD results that the formation of secondary structures dominated the early stage of folding and the tertiary contacts were still nonspecific, which was in excellent agreement with the experimental observation that nascent secondary structure elements could form in an early stage of folding [72]. It has been shown that simulations with implicit and explicit solvent models can produce different ensembles of structures [74, 75]. It seems that the AMBER03 force field works better with the implicit model for folding the HP-36 than the explicit water model.

The influence of loop and hydrophobic core

It was noteworthy that the variations of the backbone RMSD of all residues and the backbone RMSD of the loop plus the hydrophobic core, as shown in Fig. 7, were quite similar along the MD trajectory. There was a good correlation with the change of the RMSD of all residues and the loop and hydrophobic core regions. The correlation coefficient and the slope were 0.71 and 0.65 respectively. This indicated that the variations of the loop and hydrophobic core regions had an important impact on the variations of RMSD of all residues, which agreed with the finding in the simulation employing implicit model. The representative structure of the peptide with a backbone RMSD of 7.0 Å selected from the most populated cluster and the native structure were also shown in Fig. 7. Three helices had formed while the loop and hydrophobic core were far away from the native structure in Fig. 7.

The distribution of backbone torsions in loop and turn regions

Again, we constructed a two-dimensional free energy landscape with φ and ψ angles as reaction coordinates. The main chain torsions of helix 1, helix 2, and helix 3 were in excellent agreement with that in the native structure (Fig. 8). However, in the loop and turn regions, the main chain torsions in MD simulation were quite different from that in the native structure (shown in Fig. 9). The calculated free energy minima were far away from the native state, consistent with the implicit solvent model simulation.

The effect of electrostatic polarization

Another two simulations with explicit solvent model were performed employing both AMBER03 force field and DPPC starting from the best folded conformations obtained using implicit model (denoted simulation1) and explicit water model (denoted simulation2). The best folded structure in implicit model occurred in 43.5 ns with the lowest RMSD of 2.6 Å as shown in Fig. 1. A critical difference of the backbone RMSD for the whole protein (here the best folded structure was the reference structure) can be seen in Fig. 10. Staring from the best folded structure obtained from implicit solvent simulation, the RMSD fluctuated around 3.0 Å using DPPC demonstrating that the structure remained stable in MD simulation. In contrast, the RMSD based on AMBER03 force field was dominated by about 5.0 Å and the average value was 4.7 Å. It was apparent from Fig. 10 that under AMBER force field, the RMSD also visited the vicinity of the folded region with RMSD around 3.0 Å only before 1.5 ns and then it underwent a rapid rise to about 4.0 Å in 1.5 ns to 8 ns in which helices 1 and 3 had very large deformations. Then the RMSD continued to ascend to 5.0 Å at 10 ns in which the tertiary structure was entirely destroyed and helices were only partially formed. Finally the RMSD came to a much more stable stage in which the loop, turn and the hydrophobic core were all destroyed compared with the best folded structure. Figure 10 also plots the comparison between the best folded structure and two sets of final structures resulting from MD simulation using AMBER charge and DPPC, respectively. It was apparent that the structure of DPPC was very close to the best folded structure. However, those tertiary structures were disrupted and the secondary structures also underwent very large deformation in AMBER force field. The current result was another demonstration that the protein structure was stabilized by electrostatic polarization effect, which was in good agreement with previous studies [55, 56, 76]. The computational overhead for DPPC was more expensive than that of AMBER charge, but it was still acceptable. A 500 ps MD simulation took about 12 and 4 h for DPPC and AMBER, respectively, on quad core server.

In explicit solvent model, the best folded structure occurred in 130 ns with an RMSD of 3.1 Å and was also shown in Fig. 6. Then two 25 ns MD simulations were performed in water using this structure as the starting point employing AMBER03 and DPPC. The backbone RMSD for the whole protein along the simulation is shown in Fig. 11. As can be seen, the RMSD was more stable and only fluctuated near 3.0 Å under DPPC simulation. While under AMBER force field, the RMSD fluctuated wildly from 3.0 to 8.0 Å, demonstrating that the peptide was quite unstable. Although some structures were found very close to the native one (with RMSD dropped to 3.0 Å between 5.0 and 9.5 ns), they were not stable and quickly denatured indicating that the instantaneously formed structure was not energetically stable under the standard AMBER force field. The best folded structure and two sets of final structures resulting from MD simulation using AMBER and DPPC were plotted in Fig. 11. It clearly showed that the best folded states were generally stable using DPPC in water model. In AMBER simulation, helix 1 was in a denatured state and helices 2 and 3 retained its secondary structures, whereas the overall topology was far away from the best folded structure.

In order to investigate the motion of loop and hydrophobic core, the root-mean-square fluctuations (RMSFs) of the regions under AMBER charge and DPPC in simulation1 and simulation2 respectively were calculated (shown in Fig. 12). The RMSF reflects the mobility of residue around its average position, so it is also a tool of studying the stability of protein in MD simulation [77]. Obviously, the flexibility of the loop and hydrophobic core regions was higher using AMBER than DPPC in both simulation1 and simulation2. The mean RMSFs were 2.8 and 1.6 Å in simulation1, and they were 4.9 and 1.3 Å in simulation2 for AMBER and DPPC respectively. This suggested that the DPPC performed better in stabilizing the structure of the loop and hydrophobic core. Then, the final structures of two DPPC simulations and the NMR native structure were compared. In simulation1, the distance between LYS8 and LEU35 and that between LYS8 and PHE36 were 9.6 Å, 11.6 Å for the final structure and were 12.0 and 11.6 Å for NMR structure respectively. The final structure had much shorter distance between LYS8 and LEU35 (about 2.4 Å) than that of the native structure and the distance between LYS8 and PHE36 was close to the experimental value. The RMSD of the loop region from the native structure was 1.6 Å. In simulation2, those distances between LYS8 and LEU35, LYS8 and PHE36 were 8.9 and 7.2 Å for the final structure, which was more packed than that for the native state. The RMSD of the loop region from the native structure was 2.9 Å.

Then we further studied the stability of hydrogen bond and performed statistical analysis of the number of backbone H-bonds to examine the percentage occupancy of them from MD simulation. The percentage occupancy was calculated following the way in ref. [56]. Figure 13 plotted the comparison of H-bonds percentage occupation using a histogram for AMBER and DPPC in simulation1 and simulation2 respectively. Obviously, the distribution of high occupancy H-bonds with DPPC was higher than that with AMBER charge, in good agreement with a previous study [56]. We also noticed that the distributions based on simulation1 using AMBER and DPPC were both dominated by low occupancy H-bonds. Because the initial structure came from the best folded structure (the lowest RMSD) in which helix 3 was not completely formed using implicit model, in the subsequent 25 ns MD simulation in water for AMBER and DPPC, helix 3 was still only partially folded. A recent ab initio folding study for a 17-residues peptide (PDB ID 2I9M) showed that the folding of helix completed in about 80 ns in explicit water model using DPPC [76]. We believe that helix 3 can successfully fold to its native structure after long time DPPC simulation. Besides, our purpose is to examine the importance of polarization effect in stabilizing the near-native protein structure, and it will not affect our conclusion.

To further investigate the polarization effect on the stability of native structure, we carried out MD simulation using AMBER and PPC in explicit solvent model starting from the native structure for 150 ns respectively. In this work, we mainly focused on the dynamic properties of HP-36 near the vicinity of the native state. Thus, we applied PPC scheme in which charge fitting was based on the native structure and was fixed throughout simulation to save the computational expense. As shown in Fig. 14, the native conformation was not stable and the RMSD (from native state) rapidly increased after about 8 ns and fluctuated between 3.0 and 5.0 Å using AMBER. For comparison, the use of PPC was shown to give a more dynamically stable structure with an RMSD of around 1.5 Å. The above result indicated that the native structure was not energetically stable under the standard AMBER force field, which can drive the protein away from its native state. However, the native structure of the protein was stable after long time simulation under PPC. The study demonstrated the important effect of electronic polarization in stabilizing the native structure.

Conclusions

In this work, folding simulations of HP-36 protein were performed under AMBER03 force field with both implicit and explicit solvent models. Dynamic properties of intermediate structure and the two best folded structures obtained from these simulations were further studied using AMBER03 charge and the dynamically adjusted polarized protein-specific charge (DPPC), respectively. DPPC is derived from a molecular tailoring quantum mechanical calculation of protein in solvent in which the atomic charges of every residue are fitted periodically in the simulation. Our results showed the following features:

(1)
Under the MD simulation with AMBER03 force field, HP-36 showed ordered secondary structures in about 50 ns starting from an extended structure, while the tertiary structure was hard to reach, regardless of the employed solvent models. It was found that there was a strong correlation between the total RMSD and that of the loop and hydrophobic core regions under the two models.
(2)
The distributions of main chain torsions of helix 1, helix 2, and helix 3 under two MD simulations were very close to that of the NMR structure. However, in the loop and turn regions, the main chain torsions were quite different from those of the native structure.
(3)
Starting from an intermediate structure, HP-36 was found to generally fold into the native state under DPPC. In contrast, the peptide did not fold into the native structure with a high RMSD of around 8.0 Å when AMBER force field was used in the MD simulation.
(4)
When polarization effect was introduced, both of the best folded structures and the native structure were very stable with the low backbone RMSDs in MD simulation. While using AMBER03 force field their stabilities were very low with tertiary structure destroyed and some helices underwent incorrect deformation.

References

Kubelka J, Hofrichter J, Eaton WA (2004) The protein folding ‘speed limit’. Curr Opin Struct Biol 14(1):76–88
Article CAS Google Scholar
Wang JA, Zhu WL, Li GH, Hansmann UHE (2011) Velocity-scaling optimized replica exchange molecular dynamics of proteins in a hybrid explicit/implicit solvent. J Chem Phys 135(8):084115
Article Google Scholar
Best RB, Mittal J (2010) Balance between α and β structures in Ab initio protein folding. J Phys Chem B 114:8790–8798
Article CAS Google Scholar
Duan Y, Kollman PA (1998) Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. Science 282(5389):740–744
Article CAS Google Scholar
Shen MY, Freed KF (2002) All-atom fast protein folding simulations: the villin headpiece. Proteins Struct Funct Genet 49(4):439–445
Article CAS Google Scholar
Zagrovic B, Snow CD, Shirts MR, Pande VS (2002) Simulation of folding of a small alpha-helical protein in atomistic detail using worldwide-distributed computing. J Mol Biol 323(5):927–937
Article CAS Google Scholar
Pande VS, Baker I, Chapman J, Elmer SP, Khaliq S, Larson SM, Rhee YM, Shirts MR, Snow CD, Sorin EJ, Zagrovic B (2003) Atomistic protein folding simulations on the submillisecond time scale using worldwide distributed computing. Biopolymers 68:91–109
Article CAS Google Scholar
Lee I, Kim S, Lee J (2010) Dynamic folding pathway models of the villin headpiece subdomain (HP-36) structure. J Comput Chem 31(1):57–65
Article CAS Google Scholar
Sonavane UB, Ramadugu SK, Joshi RR (2008) Study of early events in the protein folding of villin headpiece using molecular dynamics simulation. J Biomol Struct Dyn 26(2):203–214
Article CAS Google Scholar
Jayachandran G, Vishal V, Pande VS (2006) Using massively parallel simulation and Markovian models to study protein folding: examining the dynamics of the villin headpiece. J Chem Phys 124(16):164902
Article Google Scholar
De Mori GMS, Colombo G, Micheletti C (2005) Investigating protein folding by combining coarse-grained Monte Carlo sampling and all-atom Molecular Dynamics: characterization of the Villin Headpiece folding mechanism. Proteins Struct Funct Bioinforma 58(2):459–471
Article Google Scholar
McKnight CJ, Doering DS, Matsudaira PT, Kim PS (1996) A Thermostable 35-residue subdomain within villin headpiece. J Mol Biol 260:126–134
Article CAS Google Scholar
Lee MR, Duan Y, Kollman PA (2000) Use of MM-PB/SA in estimating the free energies of proteins: application to native, intermediates, and unfolded villin headpiece. Proteins Struct Funct Genet 39(4):309–316
Article CAS Google Scholar
Kubelka J, Eaton WA, Hofrichter J (2003) Experimental tests of villin subdomain folding simulations. J Mol Biol 329(4):625–630
Article CAS Google Scholar
Tang YF, Rigotti DJ, Fairman R, Raleigh DP (2004) Peptide models provide evidence for significant structure in the denatured state of a rapidly folding protein: the villin headpiece subdomain. Biochemistry 43(11):3264–3272
Article CAS Google Scholar
Wang MH, Tang YF, Sato SS, Vugmeyster L, McKnight CJ, Raleigh DP (2003) Dynamic NMR line-shape analysis demonstrates that the villin headpiece subdomain folds on the microsecond time scale. J Am Chem Soc 125(20):6032–6033
Article CAS Google Scholar
Brewer SH, Vu DM, Tang YF, Li Y, Franzen S, Raleigh DP, Dyer RB (2005) Effect of modulating unfolded state structure on the folding kinetics of the villin headpiece subdomain. Proc Natl Acad Sci U S A 102(46):16662–16667
Article CAS Google Scholar
Havlin RH, Tycko R (2005) Probing site-specific conformational distributions in protein folding with solid-state NMR. Proc Natl Acad Sci U S A 102(9):3284–3289
Article CAS Google Scholar
Herges T, Wenzel W (2005) Free-energy landscape of the villin headpiece in an all-atom force field. Structure 13(4):661–668
Article CAS Google Scholar
Srinivasan R, Fleming PJ, Rose GD (2004) Ab initio protein folding using LINUS. Methods Enzymol 383:48–66
Article CAS Google Scholar
Duan Y, Wang L, Kollman PA (1998) The early stage of folding of villin headpiece subdomain observed in a 200-nanosecond fully solvated molecular dynamics simulation. Proc Natl Acad Sci U S A 95(17):9897–9902
Article CAS Google Scholar
Zagrovic B, Pande VS (2006) Simulated unfolded-state ensemble and the experimental NMR structures of villin headpiece yield similar wide-angle solution X-ray scattering profiles. J Am Chem Soc 128(36):11742–11743
Article CAS Google Scholar
Jayachandran G, Vishal V, Garcia AE, Pande VS (2007) Local structure formation in simulations of two small proteins. J Struct Biol 157(3):491–499
Article CAS Google Scholar
Colubri A, Jha AK, Shen MY, Sali A, Berry RS, Sosnick TR, Freed KF (2006) Minimalist representations nearest neighbor effects in and the importance of protein folding simulations. J Mol Biol 363(4):535–557
Article Google Scholar
Bandyopadhyay S, Chakraborty S, Bagchi B (2006) Coupling between hydration layer dynamics and unfolding kinetics of HP-36. J Chem Phys 125(8):084912
Article Google Scholar
Jang SM, Kim E, Shin S, Pak Y (2003) Ab initio folding of helix bundle proteins using molecular dynamics simulations. J Am Chem Soc 125(48):14841–14846
Article CAS Google Scholar
Lin CY, Hu CK, Hansmann UHE (2003) Parallel tempering simulations of HP-36. Proteins Struct Funct Genet 52(3):436–445
Article CAS Google Scholar
Yang J, Wallin S, Shakhnovich E (2008) Universality and diversity of folding mechanics for three-helix bundle proteins. Proc Natl Acad Sci U S A 105:895–900
Article CAS Google Scholar
Wei Y, Nadler W, Hansmann UHE (2008) Backbone and side-chain ordering in a small protein. J Chem Phys 128:025105
Article Google Scholar
Ripoll DR, Vila JA, Scheraga HA (2004) Folding of the villin headpiece subdomain from random structures. Analysis of the charge distribution as a function of pH. J Mol Biol 339(4):915–925
Article CAS Google Scholar
Chiu TK, Kubelka J, Herbst-Irmer R, Eaton WA, Hofrichter J, Davies DR (2005) High-resolution x-ray crystal structures of the villin headpiece subdomain, an ultrafast folding protein. Proc Natl Acad Sci U S A 102(21):7517–7522
Article CAS Google Scholar
Kubelka J, Chiu TK, Davies DR, Eaton WA, Hofrichter J (2006) Sub-microsecond protein folding. J Mol Biol 359(3):546–553
Article CAS Google Scholar
Mittal J, Best RB (2010) Tackling force-field bias in protein folding simulations: folding of villin HP35 and pin WW domains in explicit water. Biophys J 99:L26–L28
Article CAS Google Scholar
Srinivas G, Bagchi B (2002) Foldability and the funnel of HP-36 protein sequence: use of hydropathy scale in protein folding. J Chem Phys 116(19):8579–8587
Article CAS Google Scholar
Fernández A, Shen MY, Colubri A, Sosnick TR, Berry RS, Freed KF (2003) Large-scale context in protein folding: villin headpiece. Biochemistry 42(3):664–671
Article Google Scholar
van der Spoel D, Lindahl E (2003) Brute-force molecular dynamics simulations of Villin headpiece: comparison with NMR parameters. J Phys Chem B 107(40):11178–11187
Article Google Scholar
De Mori GMS, Micheletti C, Colombo G (2004) All-atom folding simulations of the villin headpiece from stochastically selected coarse-grained structures. J Phys Chem B 108(33):12267–12270
Article Google Scholar
Mukherjee A, Bagchi B (2004) Contact pair dynamics during folding of two small proteins: chicken villin head piece and the Alzheimer protein beta-amyloid. J Chem Phys 120(3):1602–1612
Article CAS Google Scholar
Wen EZ, Hsieh MJ, Kollman PA, Luo R (2004) Enhanced ab initio protein folding simulations in Poisson-Boltzmann molecular dynamics with self-guiding forces. J Mol Graph Model 22(5):415–424
Article CAS Google Scholar
Zagrovic B, Snow CD, Khaliq S, Shirts MR, Pande VS (2002) Native-like mean structure in the unfolded ensemble of small proteins. J Mol Biol 323(1):153–164
Article CAS Google Scholar
Jani V, Sonavane U, Joshi R (2011) Microsecond scale replica exchange molecular dynamic simulation of villin headpiece: an insight into the folding landscape. J Biomol Struct Dyn 28(6):845–860
Article CAS Google Scholar
Hansmann UHE, Wille LT (2002) Global optimization by energy landscape paving. Phys Rev Lett 88(6):068105
Article Google Scholar
Chakraborty S, Bandyopadhyay S (2008) Dynamics of water in the hydration layer of a partially unfolded structure of the protein HP-36. J Phys Chem B 112(20):6500–6507
Article CAS Google Scholar
Wickstrom L, Okur A, Song K, Hornak V, Raleigh D, Simmerling C (2006) The unfolded state of the villin headpiece helical subdomain: computational studies of the role of locally stabilized structure. J Mol Biol 360(5):1094–1107
Article CAS Google Scholar
Zhang C, Ma J (2010) Enhanced sampling and applications in protein folding in explicit solvent. J Chem Phys 132(24):244101
Article Google Scholar
Yang JS, Chen WW, Skolnick J, Shakhnovich EI (2007) All-atom ab initio folding of a diverse set of proteins. Structure 15:53–63
Article Google Scholar
Best RB, Hummer G (2009) Optimized molecular dynamics force fields applied to the helix-coil transition of polypeptides. J Phys Chem B 113(26):9004–9015
Article CAS Google Scholar
Best RB, Mittal J (2010) Protein simulations with an optimized water model: cooperative helix formation and temperature-induced unfolded state collapse. J Phys Chem B 114:14916–14923
Article CAS Google Scholar
Lindorff-Larsen K, Piana S, Dror RO, Shaw DE (2011) How fast-folding proteins fold. Science 334:517–520
Article CAS Google Scholar
Piana S, Lindorff-Larsen K, Shaw DE (2011) How robust are protein folding simulations with respect to force field parameterization? Biophys J 100:L47–L49
Article CAS Google Scholar
Hansen KC, Rock RS, Larsen RW, Chan SI (2000) A method for photoinitating protein folding in a nondenaturing environment. J Am Chem Soc 122:11567–11568
Article CAS Google Scholar
McKnight CJ, Matsudaira PT, Kim PS (1997) NMR structure of the 35-residue villin headpiece subdomain. Nat Struct Biol 4(3):180–184
Article CAS Google Scholar
Zhang DW, Zhang JZH (2003) Molecular fractionation with conjugate caps for full quantum mechanical calculation of protein-molecule interaction energy. J Chem Phys 119(7):3599–3605
Article CAS Google Scholar
Ji CG, Mei Y, Zhang JZH (2008) Developing polarized protein-specific charges for protein dynamics: MD free energy calculation of pK(a) shifts for Asp(26)/Asp(20) in thioredoxin. Biophys J 95(3):1080–1088
Article CAS Google Scholar
Duan LL, Mei Y, Zhang DW, Zhang QG, Zhang JZH (2010) Folding of a helix at room temperature is critically aided by electrostatic polarization of intraprotein hydrogen bonds. J Am Chem Soc 132(32):11159–11164
Article CAS Google Scholar
Duan LL, Mei Y, Zhang QG, Zhang JZH (2009) Intra-protein hydrogen bonding is dynamically stabilized by electronic polarization. J Chem Phys 130(11):115102
Article Google Scholar
Ji CG, Zhang JZH (2008) Protein polarization is critical to stabilizing AF-2 and helix-2′ domains in ligand binding to PPAR-gamma. J Am Chem Soc 130(50):17129–17133
Article CAS Google Scholar
Tong Y, Ji CG, Mei Y, Zhang JZH (2009) Simulation of NMR data reveals that proteins’ local structures are stabilized by electronic polarization. J Am Chem Soc 131(24):8636–8641
Article CAS Google Scholar
Ji CG, Zhang JZH (2009) NMR scalar coupling constant reveals that intraprotein hydrogen bonds are dynamically stabilized by electronic polarization. J Phys Chem B 113(42):13898–13900
Article CAS Google Scholar
Tong Y, Mei Y, Li YL, Ji CG, Zhang JZH (2010) Electrostatic polarization makes a substantial contribution to the free energy of Avidin-biotin binding. J Am Chem Soc 132(14):5137–5142
Article CAS Google Scholar
Gao Y, Lu XL, Duan LL, Zhang JZH, Mei Y (2012) Polarization of intraprotein hydrogen bond is critical to thermal stability of short helix. J Phys Chem B 116:549–554
Article CAS Google Scholar
Onufriev A, Bashford D, Case DA (2004) Exploring protein native states and large-scale conformational changes with a modified generalized born model. Proteins Struct Funct Bioinforma 55(2):383–394
Article CAS Google Scholar
Feig M, Onufriev A, Lee MS, Im W, Case DA, Brooks CL (2004) Performance comparison of generalized born and Poisson methods in the calculation of electrostatic solvation energies for protein structures. J Comput Chem 25(2):265–284
Article CAS Google Scholar
Pastor RW, Brooks BR, Szabo A (1988) An analysis of the accuracy of langevin and molecular dynamics algorithm. Mol Phys 65(6):1409–1419
Article Google Scholar
Ryckaert JP, Ciccotti G, Berendsen HJC (1977) Numerical integration of the cartesian equations of motion of a system with constrains: molecular dynamics of n-alkanes. J Comput Phys 23(3):327–341
Article CAS Google Scholar
Pearlman DA, Case DA, Caldwell JW, Ross WS, Cheatham TE III, DelBolt S, Ferguson D, Seibel G, Kollman P (1995) AMBER, A package of computer programs for applying molecular mechanics, Normal mode analysis, Molecular dynamics and free energy calculations to simulate the structural and energetic properties of molecules. Comput Phys Commun 91:1–41
Article CAS Google Scholar
Kumar S, Bouzida D, Swendsen RH, Kollman PA, Rosenberg JM (1992) The weighted histogram analysis method for free-energy calculations on biomolecules. I. The method. J Comput Chem 13:1011–1021
Article CAS Google Scholar
Kumar S, Rosenberg JM, Bouzida D, Swendsen RH, Kollman PA (1995) Multidimensional free-energy calculations using the weighted histogram analysis method. J Comput Chem 16:1339–1350
Article CAS Google Scholar
Roux B (1995) The calculation of the potential of mean force using computer simulations. Comput Phys Commun 91:275–282
Article CAS Google Scholar
Zhou RH (2003) Free energy landscape of protein folding in water: explicit vs. implicit solvent. Proteins Struct Funct Genet 53:148–161
Article CAS Google Scholar
Frisch MJ, Trucks GW, Schlegel HB, Scuseria GE, Robb MA, Cheeseman JR, Montgomery JAJ, Vreven T, Kudin KN, Burant JC, Millam JM, Iyengar SS, Tomasi J, Barone V, Mennucci B, Cossi M, Scalmani G, Rega N, Petersson GA, Nakatsuji H, Hada M, Ehara M, Toyota K, Fukuda R, Hasegawa J, Ishida M, Nakajima T, Honda Y, Kitao O, Nakai H, Klene M, Li X, Knox JE, Hratchian HP, Cross JB, Bakken V, Adamao C, Jaramillo J, Gomperts R, Stratmann RE, Yazyev O, Austin AJ, Cammi R, Pomelli C, Ochterski JW, Ayala PY, Morokuma K, Voth GA, Salvador P, Dannenberg JJ, Zakrzewski VG, Dapprich S, Daniels AD, Strain MC, Farkas O, Malick DK, Rabuck AD, Raghavachari K, Foresman JB, Ortiz JV, Cui Q, Baboul AG, Clifford S, Cioslowski J, Stefanov BB, Liu G, Liashenko A, Piskorz P, Komaromi I, Martin RL, Fox DJ, Keith T, Al-Laham MA, Peng CY, Nanayakkara A, Challacombe M, Gill PMW, Johnson B, Chen W, Wong MW, Gonzalez C, Pople JA (2010) Gaussian 09, revision B.01. Gaussian, Inc, Wallingford
Google Scholar
Gilmanshin R, Williams S, Callender RH, Woodruff WH, Dyer RB (1997) Fast events in protein folding: relaxation dynamics of secondary and tertiary structure in native apomyoglobin. Proc Natl Acad Sci U S A 94:3709–3713
Article CAS Google Scholar
Rajan A, Freddolino PL, Schulten K (2010) Going beyond clustering in MD trajectory analysis: an application to villin headpiece folding. PLoS ONE 5(4):e9890
Article Google Scholar
Nymeyer H, Garcia AE (2003) Simulation of the folding equilibrium of alpha-helical peptides: a comparison of the generalized born approximation with explicit solvent. Proc Natl Acad Sci U S A 100(24):13934–13939
Article CAS Google Scholar
Zhou RH, Berne BJ (2002) Can a continuum solvent model reproduce the free energy landscape of a beta-hairpin folding in water? Proc Natl Acad Sci U S A 99(20):12777–12782
Article CAS Google Scholar
Duan LL, Gao Y, Mei Y, Zhang QG, Tang B, Zhang JZH (2012) Folding of a helix is critically stabilized by polarization of backbone hydrogen bonds: study in explicit water. J Phys Chem B 116:3430–3435
Article CAS Google Scholar
Cheng WY, Chen JZ, Liang ZQ, Li GH, Yi CH, Wang W, Wang KY (2012) A computational analysis of interaction mechanisms of peptide and non-peptide inhibitors with MDMX based on molecular dynamics simulation. Comput Theor Chem 984:43–50
Article CAS Google Scholar

Download references

Acknowledgments

L.L.D. is grateful to the National Natural Science Foundation of China (Grants 11147026 and 31200545) and the Scientific Research Award Fund for the Excellent Middle-Aged and Young Scientists of Shandong Province (Grant BS2011SW046) for support. Q.G.Z. thanks the National Natural Science Foundation of China (Grant 11274206) for support. B.T. was supported by 973 Program (2013CB933800) and the National Natural Science Foundation of China (Grants 21035003 21390411 91313302 and 21227005). J.Z.H.Z. acknowledges financial support from the National Natural Science Foundation of China (Grants 10974054 and 20933002) and the Shanghai PuJiang program (Grant 09PJ1404000). We thank the Supercomputer Center of East China Normal University for CPU time support.

Author information

Authors and Affiliations

College of Physics and Electronics, Shandong Normal University, Jinan, 250014, China
Li L. Duan & Qing G. Zhang
College of Chemistry, Chemical Engineering and Materials Science, Collaborative Innovation Center of Functionalized Probes for Chemical Imaging, Key Laboratory of Molecular and Nano Probes, Ministry of Education, Shandong Normal University, Jinan, 250014, People’s Republic of China
Li L. Duan & Bo Tang
State Key Laboratory of Precision Spectroscopy and Department of Physics, Institute of Theoretical and Computational Science, East China Normal University, Shanghai, 200062, China
Tong Zhu & John Z. H. Zhang
NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, China, 200062
John Z. H. Zhang

Authors

Li L. Duan
View author publications
You can also search for this author in PubMed Google Scholar
Tong Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Qing G. Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Bo Tang
View author publications
You can also search for this author in PubMed Google Scholar
John Z. H. Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Bo Tang or John Z. H. Zhang.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Duan, L.L., Zhu, T., Zhang, Q.G. et al. Electronic polarization stabilizes tertiary structure prediction of HP-36. J Mol Model 20, 2195 (2014). https://doi.org/10.1007/s00894-014-2195-7

Download citation

Received: 04 September 2013
Accepted: 02 March 2014
Published: 09 April 2014
DOI: https://doi.org/10.1007/s00894-014-2195-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Electronic polarization stabilizes tertiary structure prediction of HP-36

Abstract

Similar content being viewed by others

Energetics of protein backbone hydrogen bonds and their local electrostatic environment

Correct folding of an α-helix and a β-hairpin using a polarized 2D torsional potential

Development of SAAP3D force field and the application to replica-exchange Monte Carlo simulation for chignolin and C-peptide

Introduction

Methods