Binding free energy, energy and entropy calculations using simple model systems

Lai, Balder; Oostenbrink, Chris

doi:10.1007/s00214-012-1272-1

Binding free energy, energy and entropy calculations using simple model systems

Regular Article
Open access
Published: 23 September 2012

Volume 131, article number 1272, (2012)
Cite this article

Download PDF

You have full access to this open access article

Theoretical Chemistry Accounts Aims and scope Submit manuscript

Binding free energy, energy and entropy calculations using simple model systems

Download PDF

Balder Lai¹ &
Chris Oostenbrink¹

6585 Accesses
28 Citations
1 Altmetric
Explore all metrics

Abstract

Free energy differences are calculated for a set of two model host molecules, binding acetone and methanol. Two active sites of different characteristics were constructed based on an artificially extended C60 fullerene molecule, possibly functionalised to include polar interactions in an otherwise apolar, spherical cavity. The model host systems minimise the necessary sampling of conformational space while still capturing key aspects of ligand binding. The estimates of the free energies are split up into energetic and entropic contributions, using three different approaches investigating the convergence behaviour. For these systems, a direct calculation of the total energy and entropy is more efficient than calculating the entropy from the temperature dependence of the free energy or from a direct thermodynamic integration formulation. Furthermore, the compensating surrounding–surrounding energies and entropies are split off by calculating reduced ligand-surrounding energies and entropies. These converge much more readily and lead to properties that are more straightforwardly interpreted in terms of molecular interactions and configurations. Even though not experimentally accessible, the reduced thermodynamic properties may prove highly relevant for computational drug design, as they may give direct insights into possibilities to further optimise ligand binding while optimisation in the surrounding–surrounding energy or entropy will exactly cancel and not lead to improved affinity.

Conformational energies of reference organic molecules: benchmarking of common efficient computational methods against coupled cluster theory

Article Open access 19 August 2023

Comprehensive evaluation of end-point free energy techniques in carboxylated-pillar[6]arene host–guest binding: I. Standard procedure

Article 22 September 2022

Detailed potential of mean force studies on host–guest systems from the SAMPL6 challenge

Article 24 August 2018

1 Introduction

Drug design (DD) often requires the binding affinity optimisation of lead compounds or known drugs, which is commonly achieved by the substitution of atoms or groups of atoms in the molecule or by restricting the conformational freedom of the molecules. These modifications should not affect the pharmacophoric features or the interactions with the binding pocket negatively in a significant way, but rather increase binding affinity, that is, induce a favourable change in binding free enthalpy (ΔG _bind). Therefore, improving ΔG _bind forms the main focus during the optimisation process. However, retrospective analyses [1] have shown that a rational modification often only leads to a moderate improvement in ΔG _bind, due to a compensation of the enthalpy (ΔH) and the entropy (ΔS). A shift from mainly entropically driven binding towards mainly enthalpically driven binding, or vice versa, is commonly observed [2–4]. This phenomenon is largely due to the current trend to optimise ligands for more enthalpic binding. It is common that the entropic contribution is dominant for a compound that is the first of its class, while further optimisations lead to stronger enthalpic binding in the best of its class [5].

In the last few years, experimental approaches that attempt to take into account all three mentioned thermodynamic terms, ΔG, ΔH and ΔS, have gained popularity [6, 7]. Examples are isothermal titration calorimetry (ITC) and surface plasmon resonance (SPR) techniques. These methods offer valuable insight into the effect of modifications in the molecular structure on the affinity and help to adjust design strategies in directions that improve either the enthalpic or the entropic contributions. An accurate estimation of ΔG, ΔH and ΔS, by computational means allows for focus of the design on ΔH or ΔS, depending on the specific aims for the ligand. However, estimating ΔH and ΔS in silico still proves to be a challenge and computationally expensive [8, 9]. Simplified host–guest systems offer an attractive tool to assess the accuracy and efficiency of free energy calculations [10, 11].

Here, an attempt is made to estimate ΔG, ΔH and ΔS of binding for two simple host models previously used to illustrate the efficiency of free energy methods [12]. The two host models, as shown in Fig. 1, are C60 fullerenes with carbon–carbon bonds extended to 0.2 nm, which can be considered to be representative for a mostly rigid hydrophobic binding pocket. The only difference between the first (C_APO) and second (C_HB) host model is that the latter has an acetamide group, –(C=O)NH₂, attached to one of the carbon atoms which introduces hydrogen bond forming capabilities. Correspondingly simple ligands, acetone and methanol, were chosen and kept inside the host models in all simulations. These simplifications result in a minimal computational system that allows for faster convergence of molecular interactions and characterisation of various methods to estimate enthalpic and entropic effects.

A popular approach to estimate entropic contributions to ligand binding is through the calculation of configurational entropies through heuristic [13] or quasi harmonic analysis [14] or variations thereof [15–17]. Relevant interpretations of experimental data have been possible with this approach [18, 19]. However, it is clear that entropic contributions due to the solvent may play significant roles [20]. Ideally, the applied methodology should not only consider the (favourable) enthalpic interaction between the protein and the ligand and the (unfavourable) loss of configurational entropy, but also include the enthalpic and entropic contributions of (partial) desolvation [21].

Various computational methods to estimate ΔH and ΔS were assessed for reliability and efficiency. Apart from calculating the full enthalpy and entropy, we also investigated reduced terms by excluding the compensation in enthalpic and entropic contributions due to changes in the interactions within the surroundings of the ligand. From solvation studies, the reduced terms are known as the solute–solvent enthalpy and entropy [22, 23], which we here generalise to a ligand-surrounding enthalpy and entropy. Solvation studies have also shown that the exactly compensating solvent–solvent contributions may obscure a proper interpretation of enthalpic and entropic contribution to the free energy [24]. Also, in DD, the interpretation of the enthalpic and entropic contribution in terms of molecular interactions is often complex and possibly not unambiguous due to a cancellation of effects [25]. The convergence and use of the generalised reduced thermodynamic terms will be investigated and discussed. The methods will be outlined in the following theory section, followed by a description of the applied simulation methodology and settings and by a discussion of the results and the main conclusions.

2 Theory

The free enthalpy, ΔG, enthalpy, ΔH, and entropy, ΔS are connected via the Gibbs equation,

$$ \Updelta G = \Updelta H - T\Updelta S $$

(1)

where T is the absolute temperature in Kelvin. For experiments and simulations at constant volume rather than constant pressure, we use the Helmholtz free energy (ΔA) and total energy (ΔE) to write,

$$ \Updelta A = \Updelta E - T\Updelta S $$

(2)

For ease of notation, and in line with the simulations performed in this work, we will restrict ourselves to the Helmholtz free energy below.

In the first approach, long molecular dynamics (MD) simulations at the end-states of a given process, for example, acetone or methanol in C_APO or C_HB, were used to estimate ΔE, while thermodynamic integration (TI) [26] was used to obtain ΔA using,

$$ \Updelta A = \int\limits_{0}^{1} {\left\langle {\frac{{\partial \mathcal{H}\left( \lambda \right)}}{\partial \lambda }} \right\rangle_{\lambda } {\text{d}}\lambda } , $$

(3)

where $ \mathcal{H} $ is the Hamiltonian of the system and λ is a coupling parameter that connects the initial state (λ = 0), the final state (λ = 1) and a series of intermediate states (0 < λ < 1). The angular brackets represent an ensemble average obtained from a simulation at a state corresponding to the indicated λ-value. The integral of the (ensemble) average of the derivative of the Hamiltonian with respect to λ gives ΔA. ΔS is subsequently calculated from the estimated ΔA and ΔE, using Eq. (2).

The second approach uses a different thermodynamic property which follows from Eq. (2):

$$ \Updelta S = - \frac{{{\text{d}}\Updelta A}}{{{\text{d}}T}} $$

(4)

Equation (4) implies that ΔS may be obtained from a linear regression over multiple ΔA estimates at different temperatures, obtained using, for example, TI.

The third approach estimates ΔS directly from TI [8], using Eq. (5):

$$ \Updelta S = \frac{1}{{k_{\text{B}} \cdot T^{2} }}\int\limits_{0}^{1} {\left\{ {\left\langle {\frac{{\partial \mathcal{H}\left( \lambda \right)}}{\partial \lambda }} \right\rangle_{\lambda } \left\langle {\mathcal{H}\left( \lambda \right)} \right\rangle_{\lambda } - \left\langle {\frac{{\partial \mathcal{H}\left( \lambda \right)}}{\partial \lambda }\mathcal{H}\left( \lambda \right)} \right\rangle_{\lambda } } \right\} \cdot {\text{d}}\lambda } , $$

(5)

where k _B is the Boltzmann constant. This equation is known to converge badly, because it involves correlations between the Hamiltonian and its derivative [8].

Next, we attempt to quantify the compensation of energetic and entropic contributions solely due to the surroundings of the ligand by defining reduced terms which stem from differences in interactions involving the ligand in two systems or states [22]. In solvation, such reduced terms were shown to converge more readily than the full energy and entropy differences [23]. We can generalise the approach by splitting the Hamiltonian ($ \mathcal{H} $) into a λ-dependent term for the ligand–surrounding interaction ($ \mathcal{H}_{\text{ls}} $) and a λ-independent term for the surrounding–surrounding energies ($ \mathcal{H}_{\text{ ss}} $),

$$ \mathcal{H}\left( {{\uplambda}} \right) = \mathcal{H}_{\text{ ls}} \left( {{\uplambda}} \right) + \mathcal{H}_{\text{ ss}} , $$

(6)

where $ \mathcal{H}_{\text{ ls}} $ is defined as the sum of all non-bonded and bonded energy terms specific to interactions between the ligand and its surrounding. The non-bonded energy terms include ligand–ligand, ligand–protein and ligand–solvent Van der Waals and electrostatic interaction energy terms. The bonded energy terms include contributions from the ligands bonds, angles, improper dihedrals and dihedrals. $ \mathcal{H}_{\text{ ss}} $ refers to the surrounding–surrounding energies, here, made up of the protein–protein, protein–solvent and solvent–solvent interaction energies. Accordingly, we can write the energy difference as

$$ \begin{aligned} \Updelta E & = \left\langle \mathcal{H} \right\rangle_{1} - \left\langle \mathcal{H} \right\rangle_{0} \\ & = \Updelta E_{\text{ls}} + \Updelta E_{\text{ss}} = \left\langle {\mathcal{H}_{\text{ls}} } \right\rangle_{1} - \left\langle {\mathcal{H}_{\text{ls}} } \right\rangle_{0} + \left\langle {\mathcal{H}_{\text{ss}} } \right\rangle_{1} - \left\langle {\mathcal{H}_{\text{ss}} } \right\rangle_{0} \\ \end{aligned} $$

(7)

Rewriting Eq. (5) while taking into account Eq. (6) now gives for the entropy difference

$$ \Updelta S = \, \frac{1}{{k_{\text{B}} \cdot T^{2} }}\int\limits_{0}^{1} {\left\{ {\left\langle {\frac{{\partial \mathcal{H}_{\text{ls}} \left( \lambda \right)}}{\partial \lambda }} \right\rangle_{\lambda } \left\langle {\mathcal{H}_{\text{ls}} \left( \lambda \right)} \right\rangle_{\lambda } - \left\langle {\frac{{\partial \mathcal{H}_{\text{ls}} \left( \lambda \right)}}{\partial \lambda }\mathcal{H}_{\text{ls}} \left( \lambda \right)} \right\rangle_{\lambda } + \left. {\left\langle {\frac{{\partial \mathcal{H}_{\text{ls}} \left( \lambda \right)}}{\partial \lambda }} \right\rangle_{\lambda } \left\langle {\mathcal{H}_{\text{ss}} } \right\rangle_{\lambda } - \left\langle {\frac{{\partial \mathcal{H}_{\text{ls}} \left( \lambda \right)}}{\partial \lambda }\mathcal{H}_{\text{ss}} } \right\rangle_{\lambda } } \right\} \cdot {\text{d}}\lambda } \right.} $$

(8)

We can also write the λ-derivative of the ensemble average of $ \mathcal{H}_{\text{ss}} $ as

$$ \begin{aligned} \frac{\text{d}}{{{\text{d}}\lambda }}\left\langle {\mathcal{H}_{\text{ss}} } \right\rangle_{\lambda } & = \frac{\text{d}}{{{\text{d}}\lambda }}\frac{{\iint {\mathcal{H}_{\text{ss}} {\text{e}}^{{ - \mathcal{H}\left( \lambda \right)/k_{\text{B}} T}} {\text{d}}{\mathbf{p}}{\text{d}}{\mathbf{r}}}}}{{\iint {{\text{e}}^{{ - \mathcal{H}\left( \lambda \right)/k_{\text{B}} T}} {\text{d}}{\mathbf{p}}{\text{d}}{\mathbf{r}}}}} = \iint {\mathcal{H}_{\text{ss}} \frac{\text{d}}{{{\text{d}}\lambda }}\frac{{{\text{e}}^{{ - \mathcal{H}\left( \lambda \right)/k_{\text{B}} T}} }}{{\iint {{\text{e}}^{{ - \mathcal{H}\left( \lambda \right)/k_{\text{B}} T}} }{\text{d}}{\mathbf{p}}{\text{d}}{\mathbf{r}}}}}{\text{ d}}{\mathbf{p}}{\text{d}}{\mathbf{r}} \\ & = \iint {\mathcal{H}_{\text{ss}} \left[ {\frac{ - 1}{{k_{\text{B}} T}}\frac{{\partial \mathcal{H}\left( \lambda \right)}}{\partial \lambda }\frac{{{\text{e}}^{{ - \mathcal{H}\left( \lambda \right)/k_{\text{B}} T}} }}{{\iint {{\text{e}}^{{ - \mathcal{H}\left( \lambda \right)/k_{\text{B}} T}} {\text{d}}{\mathbf{p}}{\text{d}}{\mathbf{r}}}}} - \frac{{{\text{e}}^{{ - \mathcal{H}\left( \lambda \right)/k_{\text{B}} T}} \iint {\frac{ - 1}{{k_{\text{B}} T}}\frac{{\partial \mathcal{H}\left( \lambda \right)}}{\partial \lambda }{\text{e}}^{{ - \mathcal{H}\left( \lambda \right)/k_{\text{B}} T}} {\text{d}}{\mathbf{p}}{\text{d}}{\mathbf{r}}}}}{{\left( {\iint {{\text{e}}^{{ - \mathcal{H}\left( \lambda \right)/k_{\text{B}} T}} {\text{d}}{\mathbf{p}}{\text{d}}{\mathbf{r}}}} \right)^{2} }}} \right]}{\text{d}}{\mathbf{p}}{\text{d}}{\mathbf{r}} \\ & = \frac{ - 1}{{k_{\text{B}} T}}\left[ {\left\langle {\mathcal{H}_{\text{ss}} \frac{{\partial \mathcal{H}_{\text{ls}} \left( \lambda \right)}}{\partial \lambda }} \right\rangle_{\lambda } - \left\langle {\mathcal{H}_{\text{ss}} } \right\rangle_{\lambda } \left\langle {\frac{{\partial \mathcal{H}_{\text{ls}} \left( \lambda \right)}}{\partial \lambda }} \right\rangle_{\lambda } } \right] \\ \end{aligned} $$

(9)

where we explicitly write the ensemble average as a normalised integral over all positions (r) and momenta (p). We can now rewrite Eq. (8) as

$$ \begin{aligned} \Updelta S & = \frac{1}{{k_{\text{B}} T^{2} }}\mathop \int \limits_{0}^{1} \left\{ {\left\langle {\frac{{\partial \mathcal{H}_{\text{ls}} \left( \lambda \right)}}{\partial \lambda }} \right\rangle_{\lambda } \left\langle {\mathcal{H}_{\text{ls}} \left( \lambda \right)} \right\rangle_{\lambda } - \left\langle {\frac{{\partial \mathcal{H}_{\text{ls}} \left( \lambda \right)}}{\partial \lambda }\mathcal{H}_{\text{ls}} \left( \lambda \right)} \right\rangle_{\lambda } } \right\} \cdot {\text{d}}\lambda + \frac{{\Updelta \left\langle {\mathcal{H}_{\text{ss}} } \right\rangle }}{T} \\ \, = \Updelta S_{\text{ls}} + \frac{{\Updelta \left\langle {\mathcal{H}_{\text{ss}} } \right\rangle }}{T}, \\ \end{aligned} $$

(10)

defining the ligand-surrounding entropy ΔS _ls as

$$ \Updelta S_{\text{ls}} = \frac{1}{{k_{\text{B}} T^{2} }}\mathop \int \limits_{0}^{1} \left\{ {\left\langle {\frac{{\partial \mathcal{H}_{\text{ls}} (\lambda )}}{\partial \lambda }} \right\rangle_{\lambda } \left\langle {\mathcal{H}_{\text{ls}} (\lambda )} \right\rangle_{\lambda } - \left\langle {\frac{{\partial \mathcal{H}_{\text{ls}} (\lambda )}}{\partial \lambda }\mathcal{H}_{\text{ls}} (\lambda )} \right\rangle_{\lambda } } \right\} \cdot {\text{d}}\lambda $$

(11)

Together with the ligand-surrounding energy differences, ΔE _ls, in Eq. (7), we can write,

$$ \Updelta A = \Updelta E - T\Updelta S = \Updelta E_{\text{ls}} - T\Updelta S_{\text{ls}} $$

(12)

From Eq. (12), we can see that the free energy is only defined by the λ-dependent energy and the λ-dependent entropy while the energetic and entropic contributions from the λ-independent part, $ \mathcal{H}_{\text{ss}} $, exactly cancel in the free energy.

As will be outlined below, harmonic distance restraints were applied to restrain non-interacting dummy particles to a given position during the simulations. The contribution of these distance restraints to the free energy (ΔA _r) and entropy (ΔS _r) was calculated using Eqs. (13) and (14):

$$ \Updelta A_{\text{r}} = - k_{\text{B}} T \cdot \ln \frac{{V}}{{\left( {\frac{{ 2 {{\uppi}}k_{\text{B}} T}}{{K_{\text{b}} }}} \right)^{\frac{3}{2}} }} $$

(13)

$$ \Updelta S_{\text{r}} = - k_{\text{B}} \cdot \ln \frac{V}{{\left( {\frac{{ 2 {{\uppi}}k_{\text{B}} T}}{{K_{\text{b}} }}} \right)^{\frac{3}{2}} }} - \frac{3}{2}k_{\text{B}} $$

(14)

where K _b is the force constant used during the simulation and V is the simulation box volume. Both equations are derived from comparing the partition functions of a three-dimensional harmonic oscillator with the partition function of a freely translating particle [27, 28].

3 Methods

3.1 Thermodynamic cycles

A direct assessment of ΔA, ΔE and TΔS for ligand–protein binding from simulations of the actual binding event is very demanding [29], if not impossible [9]. However, ΔA, ΔE and TΔS are state functions. Therefore, it is possible to estimate relative changes of terms utilising thermodynamic cycles. A total of nine thermodynamic cycles composed of fifteen TI legs, shown in Fig. 2, were devised to study ΔA at a defined temperature. This allows us to carefully assess the convergence of the calculations, by determining the total free energy change along closed cycles, which should be 0 kJ mol⁻¹ by definition. Moreover, we can determine the absolute and relative binding free energies of the two compounds or two host model systems by the appropriate combination of free energy terms. For instance, in a first TI, the acetone inside C_HB (H:A_q) changes into methanol (H:M_q) which yields ΔA _m(H:L_q). Performing similar simulations for the ligand-in-solvent [yielding ΔA _m(L_q)] allows for the calculation of the relative binding free energies (ΔΔA _b) as the difference in binding free energy of methanol [ΔA _b(H:M_q)] and acetone [ΔA _b(H:A_q)] to the host C_HB:

$$ \begin{aligned} \Updelta \Updelta A_{\text{b}} & = {{\Updelta}}A_{\text{b}} ( {\text{H:M}}_{\text{q}} )- {{\Updelta}}A_{\text{b}} ( {\text{H:A}}_{\text{q}} )\\ & = {{\Updelta}}A_{\text{m}} ( {\text{H:L}}_{\text{q}} )- {{\Updelta}}A_{\text{m}} ( {\text{L}}_{\text{q}} )\\ \end{aligned} $$

(14)

Similarly, from the starting state of acetone-in-C_HB (H:A_q), charges may be removed from the acetone molecule (leading to H:A_n), followed by removal of the Van der Waals interaction (leading to H:A_dr). The resulting molecule, a so-called ‘dummy’ molecule, does not interact with its environment anymore, but still has a mass and a distance restraint which is introduced during the process to prevent the non-interacting acetone molecule from drifting through the complete simulation box, requiring extremely long simulations in the final stages of this TI leg [30, 31]. Equation (13) is used to calculate the contribution [ΔA _r(H:A_dr)] of the distance restraint to reach the state H:A_d. The transfer of the non-interacting dummy molecule between solvent and the host system is not associated with a free energy change, that is, ΔA _b(H:A_d) ≡ 0. Repeating these calculations for methanol-in-C_HB, for the ligands-in-C_APO and the ligands-in-solvent now permits estimation of the absolute binding free energy (ΔA _b) of the free ligand to C_APO or C_HB, e.g.:

$$ \Updelta A_{\text{b}} ({\text{H:A}}_{\text{q}} ) = {{\Updelta}}A_{\text{el}} ( {\text{A}}_{\text{q}} )+ {{\Updelta}}A_{\text{vdw}} ( {\text{A}}_{\text{n}} )+ {{\Updelta}}A_{\text{b}} ( {\text{H:A}}_{\text{d}} )- {{\Updelta}}A_{\text{r}} ( {\text{H:A}}_{\text{dr}} )- {{\Updelta}}A_{\text{vdw}} ( {\text{H:A}}_{\text{n}} )- {{\Updelta}}A_{\text{el}} ( {\text{H:A}}_{\text{q}} ) { } $$

(15)

Note that the term absolute binding free energy, commonly used in the field, still refers to a free energy difference along the binding process [30, 32].

Thermodynamic cycles can be used to determine internal consistency independent from experimental data. Many more cycles may be derived from Fig. 2, and a successful ΔA cycle closure is required before proceeding to calculate other terms.

A similar approach was used to study cycle closure for ΔS and ΔS _ls where Eq. (14) was used instead of Eq. (13) for calculating the distance restraint contribution.

3.2 Simulation setup

A single topology representation of both ligands (Fig. 3) was placed inside C_APO and C_HB and solvated in a periodic cubic box containing 1781 simple point charge (SPC) water molecules [33]. A similar setup for ligand-in-C_HB requires 1792 SPC water molecules. Ligand-in-solvent simulations contained 1170 SPC water molecules. No counter ions were added. The GROMOS11 package for biomolecular simulations [34] was used for all simulations. Force field parameters were taken from the 54A7 united-atom force field [35]. Hard-coded SPC water parameters were used to speed up the simulations. The number of particles, the volume and the temperature were kept constant during all simulations. Solvent and solute degrees of freedom were coupled separately to two temperature baths with a relaxation time of 0.1 ps using the weak-coupling method [36]. We are aware of the fact that the weak-coupling does not result in energy fluctuations exactly corresponding to the canonical ensemble [37]. Therefore, the application of Eq. (5) may not lead to the exact entropy of the canonical ensemble. However, the aim of this study is not to establish the entropy for a (unphysical) host–guest model system, but to establish the convergence behaviour of Eq. (5). The mutation of acetone to methanol has been repeated using a Nosé–Hoover chains thermostat, leading to very comparable convergence behaviour (see Fig. S1 in supplementary material). The leap-frog algorithm [38] with a timestep of 2 fs was used. All bonds were constrained to their minimum energy values using the SHAKE algorithm [39]. Centre of mass translation was removed every 1,000 steps. All solute molecules were defined as separate energy groups and all solvent molecules defined as one energy group. Energies and free energy derivatives were written out every 50 steps in ligand-to-dummy simulations and every 100 steps in acetone to methanol simulations. In all TI simulations, non-bonded interactions involving ligand atoms are described by a Lennard–Jones soft-core parameter of 0.5 and a Coulomb-reaction-field soft-core parameter of 0.5 nm² [40].

Non-bonded interactions were calculated using a triple-range cut-off scheme. Interactions up to a short-range distance of 0.8 nm were calculated at every timestep from a pairlist that was updated every 5 steps. At pairlist construction [41], interactions up to an intermediate range of 1.4 nm were also calculated and kept constant between updates. A reaction field contribution [42] was added to the forces and energies to account for a dielectric continuum with relative permittivity of 61 beyond the cut-off sphere of 1.4 nm [43].

Velocities corresponding to an initial temperature of 60 K were randomly assigned to all atoms before the equilibration process of each simulation, during which systems were heated to the desired temperature through gradual increase of temperature (ΔT = 60 K) while simultaneously decreasing an imposed position restraint on all solute atoms from 2.5 × 10⁴ to 0 kJ mol⁻¹ nm⁻² in 5 discrete simulation steps of 20 ps each.

MD simulations of 100 ns were used to estimate ΔE and ΔE _ls between the states acetone-in-solvent (A_q), methanol-in-solvent (M_q), C_APO-in-solvent (A), C_HB-in-solvent (H), acetone-in-C_APO (A:A_q), methanol-in-C_APO (A:M_q), acetone-in-C_HB (H:A_q) and methanol-in-C_HB (H:M_q) at 300 K.

TI simulations were performed by adjusting the force field parameters provided in the supplementary material between λ = 0 and λ = 1 for the corresponding states and monitoring the value of $ \partial \mathcal{H}/\partial {{\uplambda}} $ according to the GROMOS functional form [44]. Note that in GROMOS, 1,2- and 1,3-neighbours are excluded from the non-bonded interactions and that the polar hydrogen atom does not have Van der Waals parameters. As the methyl groups in acetone have zero partial charges, the intramolecular non-bonded interactions amount to zero at all times. TI simulations were performed for the mutations ΔA _m(L_q), ΔA _m(A:L_q) and ΔA _m(H:L_q) in Fig. 2 using 51 equidistant λ-values. For the processes ΔA _VdW and ΔA _el, the λ-value was increased by 0.04 between λ = 0 and λ = 0.4 and by 0.02 between λ = 0.4 and λ = 1, yielding 41 separate simulations. Preliminary calculations showed that convergence was sufficient in these simulations even though a slightly coarser approach was used (data not shown). ΔA _m(L_q), ΔA _m(A:L_q) and ΔA _m(H:L_q) were calculated at 220, 250, 280, 290, 300, 310, 320, 350 and 380 K while ΔA _VdW and ΔA _el were only calculated at 280, 300 and 320 K. The simulations were performed for 10 ns at every λ-value at 300 K and 1.2 ns per λ-value at all other temperatures.

3.3 Accuracy and efficiency determination

The simulations described so far allow us to analyse the accuracy and precision of the various properties and approaches as a function of simulation time retrospectively. The total amount of simulation time was restricted to 100 ns while maintaining the most precise ΔA estimate in a non-automated manner. First, all λ-values that are believed to have minimal effect on the ΔA estimate were excluded. This was done by multiple iterations of plotting data, excluding λ-values at what seem to be linear regions and evaluating the influence of the excluded λ-values on ΔA. This is followed by the determination of the minimal simulation time required at each remaining λ-value in two rounds which was achieved by monitoring $ \partial \mathcal{H}/\partial {{\uplambda}} $ as a function of time followed by a careful consideration of the trade-off between accuracy and simulation time. This way, the total simulation time for each calculated value was initially reduced to 100 ns. The total simulation time was further reduced to 10 ns in a second round by reducing the lengths of the simulations by a factor 10.

A similar approach was applied to optimise the calculation of ΔS _ls with a given amount of overall simulation time. As ΔS is known to converge worse than ΔS _ls, the λ-values found to be optimal for ΔS _ls were also used for ΔS. Data reduction for ΔE and ΔE _ls was done by determining the minimal simulation time required per simulation.

Error estimates for the averages obtained from simulations were determined from block averaging and extrapolation to infinite block length [45]. Error estimates in the thermodynamic terms are subsequently obtained from standard propagation of the error estimates on the simulation averages [46].

4 Results

It is well-known that ΔA and ΔS converge differently [8]. Figure 4 shows the profiles of dA/dλ, dS/dλ and dS_ls/dλ for the acetone to methanol mutation in solvent (see Fig. S2 and S3 in supplementary material for the profiles of dA/dλ, dS/dλ and dS_ls/dλ in the host systems). As a minimal requirement for internal consistency, the thermodynamic cycle closure for ΔA was evaluated first. For the various thermodynamic cycles in Table 1, a cycle closure of maximally 2.5 kJ mol⁻¹ (k _B T) at 300 K was obtained. This observation also holds for the cycles studied at 320 K. However, lowering the temperature to 280 K noticeably affects cycle closures, with deviations up to 4.5 kJ mol⁻¹. Careful consideration of many possible factors that might affect the simulations at all three temperatures, including geometrical aspects of the ligand and its environment, various contributing energy terms and possible calculation errors, has led to the conclusion that the cycles are internally consistent. We will subsequently attempt to calculate TΔS and TΔS _ls from the same simulation data at 300 K. The cycles that do not close at 280 K most likely imply that due to reduced dynamics, additional sampling is still required for these systems at 280 K.

Table 1 Thermodynamic cycle closures for ΔA, TΔS and TΔS _ls in kJ mol⁻¹ (see Fig. 2 for explanations of abbreviations; x is a placeholder for q, n, dr or d)

Full size table

The situation is different for the cycles at 300 K for TΔS, as obtained using Eq. (5), which are also presented in Table 1. Although some cycles seem close to closing, the error estimates clearly show that these values are far from converged. The main issue here is that the estimated errors are several orders of magnitude larger than the estimated value itself. In sharp contrast, the TΔS _ls cycles calculated using Eq. (11) are comparable to the ΔA cycles. The statistical errors are still about 10-fold larger than the estimated values, but are substantially smaller when compared to the estimated errors from the TΔS cycles.

Table 2 presents estimates of ΔA, ΔE, TΔS and TΔS _ls from all three different approaches for each system. We denote the direct application of Eq. (2) as approach I, the utilisation of multiple simulations at different temperature (Eq. 4) as approach II and the application of the thermodynamic integration formula (Eq. 5) as approach III. If all available simulation data are taken into consideration, approaches I and II (Eqs. 2, 4) seem to yield a similar TΔS for ligand-in-solvent and ligand-in-C_HB, even though the estimated errors of the second approach are relatively large. Approach III is completely off, which was already observed for the thermodynamic cycle closures and confirms that TΔS estimation using Eq. (5) remains a challenge, even with 510 ns of total simulation time for a very simple process as the mutation in water or in a purely hydrophobic environment.

Table 2 ΔA, ΔE, TΔS and TΔS _ls in kJ mol⁻¹ from different approaches for the mutation of acetone to methanol

Full size table

Although ΔA estimation using TI is rather precise, using approach II to estimate TΔS does not yield the most precise values. As can be seen from the curves in Fig. 5, the slopes of the Van ’t Hoff plots are almost identical, independent of the simulation time invested. This indicates that the estimates of TΔS using this approach is robust and only slightly affected by data reduction. The relatively large error estimates are due to the error propagation over the linear regression. The large overall amount of simulation time is divided over many individual simulations with (reasonable) error estimates, which are mostly additive in the final error estimates. In the other approaches, the overall simulation time is divided over fewer, longer simulations, more efficiently reducing the error estimates.

For the current system, approach I seems to yield the most precise estimates of the entropy. However, we have to note that this may be different for more realistic systems, for example, for a large flexible host molecule, undergoing slow conformational motion ΔE may not converge to a sufficient level to apply this approach. Approach II was previously applied efficiently for systems involving a smaller alchemical modification [47, 48].

The lower half of Table 2 presents the results for the reduced terms ΔE _ls and TΔS _ls. It can be seen that both ΔE _ls and TΔS _ls converge substantially better than their full counterparts, ΔE and ΔS. The ligand-in-solvent simulation data in Fig. 6 shows that both reduced terms require about 5-fold less simulation time per λ-value to reach convergence and estimated errors for each term are substantially smaller. Ligand-in-C_APO and ligand-in-C_HB data (see Fig. S4 and S5 in supplementary material) follow a similar trend. The reduced noise for the ligand-surrounding energy and entropy indicates that the noise in the full energy and entropy estimates are mostly due to the surrounding–surrounding energy and entropy terms, which cancel exactly in the free energy, which hence converges more readily as well.

The values of TΔS _ls as calculated from approach III (Eq. 11) are consistently 2.4–4.2 kJ mol⁻¹ lower than the values calculated from approach II (Eq. 12). The discrepancy could be traced to the use of bond-length constraints in the simulation, that is, SHAKE, and a change of the C=O bond of 0.123 nm in acetone to a C–O bond of 0.153 nm in methanol. This leads to a slight change of the constraint forces as calculated in the SHAKE algorithm, which is included in the overall estimate of ΔA through the appropriate contribution to dA/dλ [49]. As, however, a constraint to a (modified) minimum energy value is not reflected in an energy change, it will occur neither in the estimate of ΔE nor in the estimate of TΔS using Eq. (5). The same holds for the calculations of the reduced terms ΔE _ls and TΔS _ls using Eq. (11). Indeed, the free energy difference between rigid rotors of lengths 0.123 nm and 0.153 nm amount to about 1 kJ mol⁻¹ [27]. From a calculation of ΔA _m(A_q) without applying SHAKE on the solute, a value of −21.8 kJ mol⁻¹ is obtained (−19.5 kJ mol⁻¹ with SHAKE), explaining the difference of 2.4 kJ mol⁻¹ for the ligand-in-solvent state. Note, however, that the differences in TΔS _ls largely cancel in the relative entropy changes TΔΔS _ls (Table 2). This suggests that both approaches II and III (Eqs. 11, 12) are suitable to estimate TΔΔS _ls consistently which in turn is interesting for computational DD. However, the full TΔΔS term does not seem to be directly comparable to the reduced TΔΔS _ls term.

The setup for calculating the values in Table 2 for each approach is quite inefficient, and different amounts of simulation time were used in the various approaches, possibly obscuring a fair comparison of their efficiencies. Therefore, the question arises whether the same is achievable using 100 ns overall simulation time per calculated TΔS or TΔS _ls. This allows for a fair comparison of the precision to be reached by the various approaches. The results from a careful reduction of simulation data are presented in Table 3 and show that similar trends are still observed. The error estimates increase slightly and the values vary somewhat, but remain very similar with the exception of ΔE from ligand-in-C_APO. A reduction of the simulation time seems to affect the full terms more than the reduced terms, and a 100 ns overall simulation time still seems adequate. Do keep in mind that this is not automated and biases may have been introduced during the manual data reduction process.

Table 3 ΔA, ΔE, TΔS and TΔS _ls in kJ mol⁻¹ calculated at 300 K using different methods when restricted to an overall simulation time of 100 ns

Full size table

Reducing the overall simulation time further to 10 ns per TΔS or TΔS _ls, results in the values presented in Table 4. Again, the full terms seem most affected while the reduced terms are less susceptible. The best convergence seems to be obtained for ΔA, closely followed by the reduced ΔE _ls and TΔS _ls terms, while the full ΔE and TΔS terms deviate more, due to insufficient sampling of the solvent–solvent degrees of freedom.

Table 4 ΔA, ΔE, TΔS and TΔS _ls in kJ mol⁻¹ calculated at 300 K using different methods when restricted to an overall simulation time of 10 ns

Full size table

5 Discussion

The mutation of acetone to methanol was simulated in different surrounding environments: pure solvent, bound to C_APO in water and bound to C_HB in water. The cavity in the first host model represents a relatively large hydrophobic cavity, while the cavity in the second host model has a more hydrophilic character and is smaller in size. In the current force field (parameter set 54A7, see supplementary material), methanol is more hydrophilic than acetone [50]. This is reflected by the negative value of ΔA _m(A_q) = −19.5 kJ mol⁻¹ in Table 2, which is the result of an energetic contribution of −6.3 kJ mol⁻¹ and an entropic contribution of 13.2 kJ mol⁻¹. Note that the intermolecular interaction energies amount to zero, such that no gas-phase corrections are needed. Comparing the values of ΔE and ΔE _ls or TΔS and TΔS _ls from approach I (Eq. 11) allows us to quantify the surrounding–surrounding contribution to the energy and entropy of the acetone to methanol mutation. ΔE is built up from −21.6 kJ mol⁻¹ (ΔE _ls) as a result of stronger interactions between methanol and the water molecules and a loss of 15.3 kJ mol⁻¹ (ΔE _ss) due to reduced solvent–solvent interactions between these water molecules. The favourable entropic contribution of 13.2 kJ mol⁻¹ predominantly stems from the solvent–solvent reorganisation of TΔS _ss = 15.3 kJ mol⁻¹ exactly cancelling the unfavourable energy contribution of ΔE _ss. What remains is a slightly unfavourable contribution of the ligand-surrounding entropy, TΔS _ls = −2.1 kJ mol⁻¹, probably due to the smaller, more spherical size of the solute.

In the hydrophobic C_APO cavity, the free energy associated with the same mutation is unfavourable by 11.7 kJ mol⁻¹, due to an unfavourable ΔE _ls = 22.2 kJ mol⁻¹ resulting from incomplete ‘solvation’ in a cavity that is too large for the methanol molecule, partly compensated by an increased ligand entropy. In the C_HB cavity with a smaller volume, the energy change is small ΔE _ls = −1.1 kJ mol⁻¹, while the increase in ligand-surrounding entropy TΔS _ls is of comparable size (11–14 kJ mol⁻¹), indicating that more relevant configurations are accessible for methanol than for acetone in both cavities.

Considering in more detail the relative binding free energy, ΔΔA _b of acetone and methanol in C_HB, we obtain a moderate value of 4.4 kJ mol⁻¹, which is built up from a small unfavourable energetic contribution ΔΔE = 1.8 kJ mol⁻¹ and a small unfavourable entropic contribution TΔΔS = −2.5 kJ mol⁻¹. It may be tempting to conclude from these numbers that the binding of the two compounds is governed by the same principles. However, the values of ΔΔE and TΔΔS are obscured by a large, exactly compensating value of ΔΔE _ss = TΔΔS _ss = −18.6 kJ mol⁻¹. Considering the reduced terms, which leave out the surrounding–surrounding energies and entropies, we see that ΔΔA _b is built up from a large ΔΔE _ls = 20.5 kJ mol⁻¹ (as the result of a significantly larger desolvation energy of methanol than of acetone) and a considerable TΔΔS _ls = 16.1 kJ mol⁻¹ (as the results of methanol having more space to move around the small cavity than acetone). So the small value of ΔΔA _b is the result of two distinct molecular features in which the two molecules differ. The above example nicely demonstrates how the surrounding–surrounding energy and entropy, which do not contribute to ΔΔA _b, may obscure a molecular interpretation of basic thermodynamic properties. Similar considerations may very well explain the observation of Biela et al. [25] where very similar thermodynamic profiles were obtained for two ligands with distinct binding poses.

The values for ΔE _ss and TΔS _ss range from −3.4 to 15.3 kJ mol⁻¹ and the corresponding relative surrounding–surrounding binding energies (ΔΔE _ss) and entropies (TΔΔS _ss) amount to −7.8 (C_APO) and −18.6 kJ mol⁻¹ (C_HB), respectively. Not unexpected for the host model molecules completely shielding the ligand from direct interactions with the solvent, the surrounding–surrounding energy entropy compensation is smaller in the C_APO and C_HB systems than free in solution, leading to negative values for ΔΔE _ss. This suggests that the more readily converging reduced terms cannot straightforwardly be used as a replacement for the full energetic and entropic terms and that the surrounding–surrounding contributions do depend strongly on the actual surrounding of the ligand and cannot be expected to cancel in the relative values. The fact that ΔΔE _ss and TΔΔS _ss are so different in the two host systems also shows that they should really be excluded from the interpretation of free energy differences in which they cancel. Inclusion of the surrounding–surrounding terms will obscure differences between the hosts while the reduced terms offer physical interpretations more relevant for drug design.

The reduced terms do not correspond to experimentally observable quantities and as such cannot be validated by experimental means. The decomposition of the energetic and entropic contributions in terms of a ligand and its surroundings is intuitive, but different choices can be made including fewer or more terms that are compensated in ΔE and TΔS. The observation that ΔE and TΔS contain exactly compensating terms allows one to argue that, even though not corresponding to experimental observations, the reduced ΔE _ls and TΔS _ls terms may be of more use in computational drug design than their full counterparts. After all, what use is an optimisation in terms of energy if a significant portion of it is compensated by a loss in entropy and the overall affinity is not improved?

More importantly, many of the optimisations either try to rigidify the ligand or address an additional ligand–surrounding interaction, which will be more easily quantified in terms of the well-converging reduced terms. Therefore, it may be advisable and also feasible to first characterise a lead compound and its affinity in terms of ΔE _ls and TΔS _ls and to rationally optimise these terms in silico in order to design a new compound with a higher affinity. Whether part of the full energy is subsequently compensated by the full entropy is irrelevant for the binding affinity.

6 Conclusion

The free energy difference between acetone and methanol in solution and when bound to two model host systems was calculated. Three approaches were taken to quantify the energetic and entropic contributions to the free energies. Moreover, these were described in terms of ligand-surrounding energies and entropies, effectively also quantifying the (exactly compensating) surrounding–surrounding energies and entropies. Internal consistency of the calculations was ensured by investigating multiple cycle closures for the state functions. The convergence of all thermodynamic properties was monitored.

The first approach, calculating the entropy as a difference between the free energy and the energy leads to the smallest statistical uncertainties for this highly simplified host model system. Quantifying the entropy from the temperature dependence of the free energy in the second approach leads to comparable values, but a proper propagation of the error estimates increases the statistical uncertainty significantly. The third approach, in which the entropy is directly estimated from thermodynamic integration, does not lead to converged results on the timescales investigated here. This does not hold for the reduced thermodynamic terms (ΔE _ls and TΔS _ls), for which the first and third approaches yield comparable estimates, except for a contribution due to modified bond-length constraints.

Although not corresponding to experimentally accessible quantities, the reduced terms can be readily calculated from molecular simulations and may prove very powerful in the thermodynamic optimisation of lead compounds in computational drug design, as the intrinsic energy–entropy compensation due to the surrounding is not included. We have described examples of how the surrounding–surrounding energy–entropy compensation obscures a proper molecular interpretation of the thermodynamic terms. Rather, we propose to use the reduced terms, opening the way to new design strategies.

References

Ladbury JE, Klebe G, Freire E (2010) Adding calorimetric data to decision making in lead discovery: a hot tip. Nat Rev Drug Discov 1:23–27. doi:10.1038/nrd3054
Article Google Scholar
Klebe G (2006) Virtual ligand screening: strategies, perspectives and limitations. Drug Discov Today 13–14:580–594. doi:10.1016/j.drudis.2006.05.012
Article Google Scholar
Leavitt S, Freire E (2001) Direct measurement of protein binding energetics by isothermal titration calorimetry. Curr Opin Struct Biol 5:560–566
Article Google Scholar
Lafont V, Armstrong AA, Ohtaka H, Kiso Y, Mario Amzel L, Freire E (2007) Compensating enthalpic and entropic changes hinder binding affinity optimization. Chem Biol Drug Des 6:413–422. doi:10.1111/j.1747-0285.2007.00519.x
Google Scholar
Freire E (2008) Do enthalpy and entropy distinguish first in class from best in class? Drug Discov Today 19–20:869–874. doi:10.1016/j.drudis.2008.07.005
Article Google Scholar
Freire E (2009) A thermodynamic approach to the affinity optimization of drug candidates. Chem Biol Drug Des 5:468–472. doi:10.1111/j.1747-0285.2009.00880.x
Google Scholar
Falconer RJ, Collins BM (2011) Survey of the year 2009: applications of isothermal titration calorimetry. J Mol Recogn 1:1–16. doi:10.1002/jmr.1073
Article Google Scholar
Peter C, Oostenbrink C, van Dorp A, van Gunsteren WF (2004) Estimating entropies from molecular dynamics simulations. J Chem Phys 6:2652–2661. doi:10.1063/1.1636153
Article Google Scholar
Reinhardt WP, Miller MA, Amon LM (2001) Why is it so difficult to simulate entropies, free energies, and their differences? Acc Chem Res 7:607–614
Article Google Scholar
Skillman AG (2012) SAMPL3: blinded prediction of host-guest binding affinities, hydration free energies, and trypsin inhibitors. J Comput Aided Mol Des 5:473–474. doi:10.1007/s10822-012-9580-z
Article Google Scholar
Muddana HS, Daniel Varnado C, Bielawski CW, Urbach AR, Isaacs L, Geballe MT et al (2012) Blind prediction of host-guest binding affinities: a new SAMPL3 challenge. J Comput Aided Mol Des 5:475–487. doi:10.1007/s10822-012-9554-1
Article Google Scholar
Oostenbrink C (2009) Efficient free energy calculations on small molecule host-guest systems—a combined linear interaction energy/one-step perturbation approach. J Comput Chem 2:212–221. doi:10.1002/jcc.21116
Article Google Scholar
Schlitter J (1993) Estimation of absolute and relative entropies of macromolecules using the covariance matrix. Chem Phys Lett 6:617–621. doi:10.1016/0009-2614(93)89366-P
Article Google Scholar
Karplus M, Kushick JN (1981) Method for estimating the configurational entropy of macromolecules. Macromolecules 2:325–332. doi:10.1021/ma50003a019
Article Google Scholar
Hensen U, Grubmüller H, Lange OF (2009) Adaptive anisotropic kernels for nonparametric estimation of absolute configurational entropies in high-dimensional configuration spaces. Phys Rev E 1:011913
Article Google Scholar
Hensen U, Lange OF, Grubmuller H (2010) Estimating absolute configurational entropies of macromolecules: the minimally coupled subspace approach. PLoS ONE 2:e9179. doi:10.1371/journal.pone.0009179
Article Google Scholar
Harpole KW, Sharp KA (2011) Calculation of configurational entropy with a Boltzmann–quasiharmonic model: the origin of high-affinity protein-ligand binding. J Phys Chem B 30:9461–9472. doi:10.1021/jp111176x
Article Google Scholar
Dolenc J, Baron R, Oostenbrink C, Koller J, van Gunsteren WF (2006) Configurational entropy change of netropsin and distamycin upon DNA minor-groove binding. Biophys J 4:1460–1470. doi:10.1529/biophysj.105.074617
Article Google Scholar
Lange JH, Venhorst J, van Dongen MJ, Frankena J, Bassissi F, de Bruin NM et al (2011) Biophysical and physicochemical methods differentiate highly ligand-efficient human D-amino acid oxidase inhibitors. Eur J Med Chem 10:4808–4819. doi:10.1016/j.ejmech.2011.04.023
Article Google Scholar
Baron R, Setny P, Andrew McCammon J (2010) Water in cavity—ligand recognition. J Am Chem Soc 34:12091–12097. doi:10.1021/ja1050082
Article Google Scholar
DeLorbe JE, Clements JH, Teresk MG, Benfield AP, Plake HR, Millspaugh LE et al (2009) Thermodynamic and structural effects of conformational constraints in protein-ligand interactions. Entropic paradoxy associated with ligand preorganization. J Am Chem Soc 46:16758–16770. doi:10.1021/ja904698q
Article Google Scholar
Ben-Naim A, Marcus Y (1984) Solvation thermodynamics of nonionic solutes. J Chem Phys 4:2016–2027
Article Google Scholar
van der Vegt NFA, van Gunsteren WF (2004) Entropic contributions in cosolvent binding to hydrophobic solutes in water. J Phys Chem B 3:1056–1064. doi:10.1021/jp030532c
Article Google Scholar
Ozal TA, van der Vegt NF (2006) Confusing cause and effect: energy-entropy compensation in the preferential solvation of a nonpolar solute in dimethyl sulfoxide/water mixtures. J Phys Chem B 24:12104–12112. doi:10.1021/jp061608i
Article Google Scholar
Biela A, Sielaff F, Terwesten F, Heine A, Steinmetzer T, Klebe G (2012) Ligand binding stepwise disrupts water network in thrombin: enthalpic and entropic changes reveal classical hydrophobic effect. J Med Chem. doi:10.1021/jm300337q
Google Scholar
Kirkwood JG (1935) Statistical mechanics of fluid mixtures. J Chem Phys 5:300–313. doi:10.1063/1.1749657
Article Google Scholar
McQuarrie DA (2000) Statistical mechanics, 1st edn. University Science Book, Mill Valley
Google Scholar
Hermans J, Shankar S (1986) The free-energy of xenon binding to myoglobin from molecular-dynamics simulation. Isr J Chem 2:225–227
Google Scholar
Buch I, Giorgino T, De Fabritiis G (2011) Complete reconstruction of an enzyme-inhibitor binding process by molecular dynamics simulations. Proc Natl Acad Sci USA 25:10184–10189. doi:10.1073/pnas.1103547108
Article Google Scholar
Boresch S, Tettinger F, Leitgeb M, Karplus M (2003) Absolute binding free energies: a quantitative approach for their calculation. J Phys Chem B 35:9535–9551. doi:10.1021/jp0217839
Article Google Scholar
Roux B, Nina M, Pomès R, Smith JC (1996) Thermodynamic stability of water molecules in the bacteriorhodopsin proton channel: a molecular dynamics free energy perturbation study. Biophys J 2:670–681. doi:10.1016/S0006-3495(96)79267-6
Article Google Scholar
Woo HJ, Roux B (2005) Calculation of absolute protein-ligand binding free energy from computer simulations. Proc Natl Acad Sci USA 19:6825–6830. doi:10.1073/pnas.0409005102
Article Google Scholar
Berendsen H, Postma J, van Gunsteren W, Hermans J (1981) In: Pullman B (ed) Intermolecular forces D. Reidel Publishing Company, Dordrecht
Google Scholar
Schmid N, Christ CD, Christen M, Eichenberger AP, van Gunsteren WF (2012) Architecture, implementation and parallelisation of the GROMOS software for biomolecular simulation. Comput Phys Commun 4:890–903. doi:10.1016/j.cpc.2011.12.014
Article Google Scholar
Schmid N, Eichenberger A, Choutko A, Riniker S, Winger M, Mark A et al (2011) Definition and testing of the GROMOS force-field versions 54A7 and 54B7. Eur Biophys J 7:843–856. doi:10.1007/s00249-011-0700-9
Article Google Scholar
Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola A, Haak JR (1984) Molecular dynamics with coupling to an external bath. J Chem Phys 8:3684–3690
Article Google Scholar
Berendsen HJC (2007) Simulating the physical world. Cambridge University Press, Cambridge, MA
Google Scholar
Hockney RW, Eastwood JW (1992) Computer simulation using particles. Institute of Physics
Ryckaert J, Ciccotti G, Berendsen HJC (1977) Numerical integration of the Cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J Comput Phys 3:327–341. doi:10.1016/0021-9991(77)90098-5
Article Google Scholar
Beutler TC, Mark AE, van Schaik RC, Gerber PR, van Gunsteren WF (1994) Avoiding singularities and numerical instabilities in free energy calculations based on molecular simulations. Chem Phys Lett 6:529–539. doi:10.1016/0009-2614(94)00397-1
Article Google Scholar
Heinz TN, Hünenberger PH (2004) A fast pairlist-construction algorithm for molecular simulations under periodic boundary conditions. J Comput Chem 12:1474–1486. doi:10.1002/jcc.20071
Article Google Scholar
Tironi IG, Sperb R, Smith PE, van Gunsteren WF (1995) A generalized reaction field method for molecular dynamics simulations. J Chem Phys 13:5451–5459. doi:10.1063/1.469273
Article Google Scholar
Heinz TN, van Gunsteren WF, Hünenberger PH (2001) Comparison of four methods to compute the dielectric permittivity of liquids from molecular dynamics simulations. J Chem Phys 3:1125–1136. doi:10.1063/1.1379764
Article Google Scholar
Riniker S, Christ CD, Hansen HS, Hünenberger PH, Oostenbrink C, Steiner D et al (2011) Calculation of relative free energies for ligand-protein binding, solvation, and conformational transitions using the GROMOS software. J Phys Chem B 46:13570–13577. doi:10.1021/jp204303a
Article Google Scholar
Allen MP, Tildesley DJ (1989) Computer simulation of liquids. Clarendon Press, Oxford
Google Scholar
Berendsen HJC (2011) A student’s guide to data and error analysis. Cambridge University Press, Cambridge
Book Google Scholar
Carlsson J, Aqvist J (2006) Calculations of solute and solvent entropies from molecular dynamics simulations. Phys Chem Chem Phys 46:5385–5395. doi:10.1039/b608486a
Article Google Scholar
Carlsson J, Aqvist J (2009) Absolute hydration entropies of alkali metal ions from molecular dynamics simulations. J Phys Chem B 30:10255–10260. doi:10.1021/jp900818z
Article Google Scholar
van Gunsteren WF, Beutler TC, Fraternali F, King PM, Mark AE, Smith PE (1993) In: van Gunsteren WF, Weiner PK, Wilkinson AJ (eds) Computer simulation of biomolecular systems, theoretical and experimental applications. Escom Science Publishers, Leiden
Google Scholar
Horta BAC, Fuchs PFJ, van Gunsteren WF, Hünenberger PH (2011) New interaction parameters for oxygen compounds in the GROMOS force field: improved pure-liquid and solvation properties for alcohols, ethers, aldehydes, ketones, carboxylic acids, and esters. J Chem Theory Comput 4:1016–1031. doi:10.1021/ct1006407
Article Google Scholar

Download references

Acknowledgments

Financial support from Grant No. LS08-QM3 of the Vienna Science and Technology Fund (WWTF), Grant No. 260408 of the European Research Council (ERC) and the PhD programme “BioToP—Biomolecular Technology of Proteins” (Austrian Science Funds, FWF Project W1224) is gratefully acknowledged.

Open Access

This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Author information

Authors and Affiliations

Institute of Molecular Modeling and Simulation, University of Natural Resources and Life Science Vienna, Muthgasse 18, Vienna, Austria
Balder Lai & Chris Oostenbrink

Authors

Balder Lai
View author publications
You can also search for this author in PubMed Google Scholar
Chris Oostenbrink
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chris Oostenbrink.

Electronic supplementary material

Below is the link to the electronic supplementary material.

214_2012_1272_MOESM1_ESM.pdf

Force field parameters for the ligands are available in the supplementary material. Also included are the profiles of dA/dλ, dS/dλ and dS_ls/dλ for the acetone to methanol mutation in C_APO and C_HB and the profiles of dA/dλ, dS/dλ and dS_ls/dλ for the acetone to methanol mutation in solvent using the Nosé–Hoover chain thermostat instead of the weak-coupling method, and figures that illustrate the convergence of the energy, entropy and free energy for ligand-in-C_APO and ligand-in-CHB thermodynamic integration simulations (PDF 636 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Lai, B., Oostenbrink, C. Binding free energy, energy and entropy calculations using simple model systems. Theor Chem Acc 131, 1272 (2012). https://doi.org/10.1007/s00214-012-1272-1

Download citation

Received: 10 July 2012
Accepted: 25 August 2012
Published: 23 September 2012
DOI: https://doi.org/10.1007/s00214-012-1272-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Binding free energy, energy and entropy calculations using simple model systems

Abstract

Similar content being viewed by others

Conformational energies of reference organic molecules: benchmarking of common efficient computational methods against coupled cluster theory

Comprehensive evaluation of end-point free energy techniques in carboxylated-pillar[6]arene host–guest binding: I. Standard procedure

Detailed potential of mean force studies on host–guest systems from the SAMPL6 challenge

1 Introduction

2 Theory