Abstract
The key to obtaining the modelfree description of the dynamics of a macromolecule is the optimisation of the modelfree and Brownian rotational diffusion parameters using the collected R _{1}, R _{2} and steadystate NOE relaxation data. The problem of optimising the chisquared value is often assumed to be trivial, however, the long chain of dependencies required for its calculation complicates the modelfree chisquared space. Convolutions are induced by the Lorentzian form of the spectral density functions, the linear recombinations of certain spectral density values to obtain the relaxation rates, the calculation of the NOE using the ratio of two of these rates, and finally the quadratic form of the chisquared equation itself. Two major topological features of the modelfree space complicate optimisation. The first is a long, shallow valley which commences at infinite correlation times and gradually approaches the minimum. The most severe convolution occurs for motions on two timescales in which the minimum is often located at the end of a long, deep, curved tunnel or multidimensional valley through the space. A large number of optimisation algorithms will be investigated and their performance compared to determine which techniques are suitable for use in modelfree analysis. Local optimisation algorithms will be shown to be sufficient for minimisation not only within the modelfree space but also for the minimisation of the Brownian rotational diffusion tensor. In addition the performance of the programs Modelfree and Dasha are investigated. A number of modelfree optimisation failures were identified: the inability to slide along the limits, the singular matrix failure of the Levenberg–Marquardt minimisation algorithm, the low precision of both programs, and a bug in Modelfree. Significantly, the singular matrix failure of the Levenberg–Marquardt algorithm occurs when internal correlation times are undefined and is greatly amplified in modelfree analysis by both the grid search and constraint algorithms. The program relax (http://www.nmrrelax.com) is also presented as a new software package designed for the analysis of macromolecular dynamics through the use of NMR relaxation data and which alleviates all of the problems inherent within modelfree analysis.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Detailed, atomic resolution information describing the dynamics of proteins or other macromolecules can be obtained experimentally through NMR by the measurement of the R _{1} and R _{2} heteronuclear relaxation rates and the steadystate heteronuclear NOE. These numbers by themselves are, however, difficult to interpret meaningfully and are therefore usually decomposed using the modelfree analysis of Lipari and Szabo, Lipari and Szabo (1982a, b) which separates the global rotational diffusion of the macromolecule from the internal motions of the molecule’s bond vectors. The relaxation values are translated into the parameters of Brownian rotational diffusion as well as those of the internal motions which consist of the generalised order parameter S ^{2} and the effective correlation time τ_{ e }. The order parameter S ^{2} is related to the amplitude of the internal motion. An S ^{2} value of zero indicates large amplitude motions whereas a value of one means that the bond vector is completely rigid. The effective correlation time τ_{ e } is correlated with the timescale of the internal motion of the bond vector albeit being dependent on the amplitude of that motion. The original Lipari and Szabo modelfree analysis was extended by Clore et al. (1990) to include internal motions on two timescales. The order parameter \(S^2_f\) and effective correlation time τ_{ f } describe the faster of the two motions whereas \(S^2_s\) and τ_{ s } are associated with the slower motion. The two order parameters are related by the equation \(S^2_f \cdot S^2_s = S^2.\)
By assuming certain order parameters or correlation times to be statistically negligible, either being one or zero respectively, a number of modelfree models can be constructed. An order parameter of one means that the motion is statistically insignificant whereas the correlation time of zero means that the motion is too fast for that parameter to be reliably extracted. In combining parametric restrictions, in which statistically insignificant parameters are dropped, together with the addition of a parameter accounting for chemical exchange relaxation a number of increasingly complex models of modelfree motions can be constructed (Fushman et al. 1997; Orekhov et al. 1999; Korzhnev et al. 2001; Zhuravleva et al. 2004). Extending this list the modelfree models will be labelled as
where R _{ ex } is the parameter accounting for chemical exchange relaxation which solely affects the R _{2} relaxation rate. The chemical exchange is assumed to be fast and is scaled quadratically with field strength. Model m0 corresponds to the special situation whereby no internal motions are statistically significant.
The simplest model of Brownian rotational diffusion is the tumbling of a sphere and is parameterised either by global correlation time τ_{ m } or the isotropic diffusion rate \({\mathfrak{D}}_{\rm iso}.\) For spherical diffusion all three eigenvalues of the diffusion tensor are equal, \({\mathfrak{D}}_x = {\mathfrak{D}}_y = {\mathfrak{D}}_z.\) When only two of the eigenvalues are equal the molecule diffuses as a spheroid. This model, which is synonymous with axially symmetric anisotropic diffusion, is parameterised by \({\mathfrak{D}}=\{{\mathfrak{D}}_{\scriptscriptstyle \perp}, {\mathfrak{D}}_{\scriptscriptstyle \parallel}, \theta, \phi\}\) where the geometric parameters \({\mathfrak{G}}\) are defined as \({\mathfrak{D}}_{\scriptscriptstyle \perp} \equiv {\mathfrak{D}}_x ={\mathfrak{D}}_y\) and \({\mathfrak{D}}_{\scriptscriptstyle \parallel} \equiv {\mathfrak{D}}_z\) (d’Auvergne and Gooley 2007). The orientational parameters \({\mathfrak{O}}=\{\theta, \phi\}\) are the azimuthal angle and the polar angle respectively. Alternatively spheroidal diffusion can be parameterised by \({\mathfrak{D}} = \{{\mathfrak{D}}_{\rm iso}, {\mathfrak{D}}_a, \theta, \phi\}\) where \({\mathfrak{D}}_a\) is the anisotropic diffusion parameter which when positive denotes a prolate spheroid and when negative denotes an oblate spheroid (d’Auvergne 2006). The most complex model of rotational diffusion is when all three eigenvalues are different and is the diffusion of an ellipsoid (Einstein 1905; Perrin 1934, 1936; Favro 1960; Woessner 1962). The ellipsoid can be parameterised by \({\mathfrak{D}} = \{{\mathfrak{D}}_x, {\mathfrak{D}}_y, {\mathfrak{D}}_z, \alpha, \beta, \gamma\}\) where the geometric parameters \({\mathfrak{G}}\) are the eigenvalues of the tensor and the orientational parameters \({\mathfrak{O}}\) are the Euler angles defined using the zyz notation. Alternatively the ellipsoid can be characterised by the parameters \({\mathfrak{D}}= \{{\mathfrak{D}}_{\rm iso}, {\mathfrak{D}}_a, {\mathfrak{D}}_r, \alpha, \beta, \gamma\}\) where \({\mathfrak{D}}_r\) is a measure of the rhombicity (d’Auvergne 2006).
Not only can the diffusion tensor be optimised as a global model affecting all residues of the protein but a set of modelfree models can be constructed in which each residue is assumed to diffuse independently. In these models a single local τ_{ m } parameter approximates the true, multiexponential description of the Brownian rotational diffusion of the molecule. Each residue of the macromolecule is treated independently. Another set of modelfree models which add the local τ_{ m } parameter to Models m0 to m9 can be created and will be labelled tm0 to tm9. These models are an extension of the ideas introduced in Barbato et al. (1992) and Schurr et al. (1994) whereby the model m2, the original Lipari and Szabo modelfree equation, is extended with the local τ_{ m } parameter to avoid the issues associated with inaccurate diffusion tensor approximations.
A number of computer programs have been written to minimise the χ ^{2} value in search of optimal parameter values of the modelfree models. The majority of these programs implement solely local minimisers as it is assumed that only a single minimum exists. In certain situations two minima within a single modelfree space have been identified (Orekhov et al. 1995b). Nevertheless, global minimisers such as simulated annealing or genetic algorithms are generally considered superfluous for modelfree analysis. The most commonly used program in the literature is the Modelfree program (Palmer et al. 1991; Mandel et al. 1995), the current version of which will be labelled as Modelfree4. Another widely used modelfree optimisation program is Dasha (Orekhov et al. 1995a) in which two local optimisation algorithms are available. Less widely used are the programs DYNAMICS (Fushman et al. 1997) and Tensor 2 (Blackledge et al. 1998; Cordier et al. 1998; Dosset et al. 2000; Tsan et al. 2000).
This article also presents a new software package named ‘relax’ which is designed for the analysis of NMR relaxation data. The program is open source and can be freely downloaded from http://www.nmrrelax.com. All aspects of data analysis starting from the initial peak intensities and ending with the visualisation of the final results are implemented within the program. The software is highly flexible as relax has been written as a modular collection of data analysis tools. The advantages of relax include a high precision of optimisation; an advanced and efficient constraint algorithm; the ability to avoid the Levenberg–Marquardt minimisation algorithm which tends to be problematic for modelfree analysis; builtin support for both model elimination (d’Auvergne and Gooley 2006) and numerous model selection techniques (d’Auvergne and Gooley 2003); support for all types of rotational diffusion tensor; the use of the numerically stabilised modelfree spectral density functions; simplified diffusion equations (d’Auvergne 2006); and a documented collapse of the diffusion tensor symmetries (d’Auvergne 2006). The program can also create the input files for both Modelfree (Palmer et al. 1991; Mandel et al. 1995) and Dasha (Orekhov et al. 1995a), execute both programs inline, and then read their output. This means that both programs can be used as optimisation engines replacing the minimisation algorithms built into relax. A number of other data analysis techniques involving NMR relaxation data are also supported by relax including the calculation of the steadystate NOE and its error, exponential curvefitting to determine the R _{1} and R _{2} relaxation rates through optimisation and error propagation through Monte Carlo (MC) simulation, and reduced spectral density mapping (Lefevre et al. 1996) using MC simulation to translate the uncertainties.
Theory and methods
Modelfree theory
The original modelfree spectral density functions presented in Lipari and Szabo (1982a) and Clore et al. (1990) are not the most numerically stable form of these equations. An important problem encountered in optimisation is roundoff error in which machine precision influences the result of mathematical operations. The double reciprocal \(\tau^{1}=\tau_m^{1} + \tau_e^{1}\) used in the equations are operations which are particularly susceptible to roundoff error, especially when \(\tau_e \ll \tau_m.\) By incorporating these reciprocals into the modelfree spectral density functions and then simplifying the equations this source of roundoff error can be eliminated. The new, numerically stabilised form of the modelfree equations can be expressed as
The perfect, noisefree grids
To thoroughly test the optimisation algorithms across a large range of modelfree motions, two grids covering the entire modelfree space of models m4 (1.4) and m5 (1.5) were constructed. The parameter values at each grid point were used to create a synthetic data set consisting of the R _{1}, R _{2} and NOE at both 600 and 500 MHz. Each grid point was treated as a separate, independent residue of a large protein. These two grids are the same as presented in d’Auvergne and Gooley (2003, 2006) and will be labelled the R _{ ex } Grid (RG) for the data of model m4 and the Double Motion Grid (DMG) for the data of model m5.
Optimisation
The modelfree parameters
Prior to optimisation using relax a coarse grid search of 11 increments per dimension was carried out to find the initial parameter vector, with 0 ≤ S ^{2} ≤ 1, 0 ≤ τ_{ i } ≤ 10 ns and \(0 {\le}R_{ex} {\le}20\) s^{−1}. This grid search, which is a ubiquitous optimisation technique, is in no way related to the RG and DMG described above which were simply used to sample different modelfree motions. A grid search over the same range was carried out for Modelfree4 but with 20 increments per dimension, the granularity of which reflects the values presented in the examples section of the Modelfree4 manual and hence would be similar to that used by much of the modelfree literature. Although the steps of 526.316 ps in the correlation time dimension are coarse compared to the values extracted in an analysis, this is finer than the 1,000 ps steps used with relax for the comparison. If optimisation is unsuccessful, then the granularity of the grid search will have a large effect on the final results. The numerous optimisation algorithms tested, the precision of optimisation and issue of parameter deconvolution are all addressed in the supplementary material.
The diffusion tensor parameters
All optimisations of the diffusion parameters presented within this paper employed Newton minimisation in conjunction with the backtracking line search algorithm (Nocedal and Wright 1999) and the Gill, Murray and Wright modified Cholesky Hessian modification algorithm (Gill et al. 1981). For the initial grid search of the diffusion tensor parameters all searches were carried out between the values 1 ns \(\leq \tau_m \leq\) 12 ns, \(0 \leq {\mathfrak{D}}_r \leq 1,\) and \(0 \leq \theta, \phi, \alpha, \beta, \gamma \leq \pi.\) For the prolate spheroid and the ellipsoid the parameter \({\mathfrak{D}}_a\) ranged from 0 to 1e^{7}. For the oblate spheroid the parameter ranged from −1e^{7} to 0.
Mapping of the modelfree space
A third type of grid was used for the mapping of the modelfree and diffusion tensor χ ^{2} spaces. By laying out a grid over select parameter values the chisquared value was calculated at each position. These values were then fed into the OpenDX program (Open Visualization Data Explorer 4.3.2, http://www.opendx.org) to create images of the modelfree spaces. The spaces were mapped to a resolution of 100 points per dimension.
Analysis of the relaxation data of cytochrome c_{2}
The R _{1} and R _{2} relaxation rates for the 44 helical residues of cytochrome c_{2} from Rhodobacter capsulatus were obtained from the supplementary material of Blackledge et al. (1998). For consistency and to enable a direct comparison with the original publication a PDB file was created from the amide NH coordinates from the same supplementary material. For optimisation modelfree model m1 (1.1) was selected for all residues. Rather than using a grid search for finding the optimal parameter values to use as an initial position for minimisation, this position was chosen to be where the S ^{2} values of all residues are 0.8. The diffusion tensor parameters were set approximately to the values in the supplementary material of Blackledge et al. (1998). For the constants used in minimisation the values from Cordier et al. (1998) were used. The NH bond length was set to 1.01 Å, the CSA value set to −170 ppm, and the field strength of the relaxation data was assumed to be exactly 600 MHz.
Results and discussion
The R _{ ex } and DMG
All grid points of both the R _{ ex } Grid and DMG were optimised using three different programs: Modelfree4, Dasha and relax. The program relax (http://www.nmrrelax.com) implements all of the optimisation algorithms utilised in this paper. The exact versions of the programs used are version 4.15 of Modelfree, version 3.48c of Dasha, and various versions of relax (from 1.0 to 1.2). To visualise the results the difference between each optimised parameter and the true value was calculated. For each parameter the difference for every grid point was joined to create a surface using OpenDX. If the minimum has been successfully located the parameter difference should be zero for each point and hence the surface should be perfectly flat with a height of zero. The figures for Modelfree4, Dasha and relax of the S ^{2}, τ_{ e } and R _{ ex } difference surfaces of the RG and S ^{2}, \(S^2_f\) and τ_{ s } of the DMG are presented in the supplementary material (Figs. S1–S18). A select few of these surfaces are presented in Fig. 1.
Comparison of optimisation algorithms
The curvature of the single motion modelfree space
For a large majority of the single timescale motions of models m1 (1.1) to m4 (1.4) represented by the R _{ ex } Grid, optimisation is relatively simple requiring only a small number of iterations for most algorithms to find the minimum. Nevertheless, minimisation can be complicated by being drawn towards and then being forced to navigate along shallow, long, curved valleys which originate at infinite correlation times but slowly head to the minimum (for example see Fig. 2). This is the same topological feature which can induce model failure due to distortion by experimental noise—the true local minimum is lost while a new minimum appears at infinite τ_{ e } values (d’Auvergne and Gooley 2006). Despite this topology, in which most algorithms can navigate without problem, the curvature in the vicinity of the minimum is relatively quadratic.
The tunnels of the double motion modelfree space
For the models representing motions on two timescales, models m5 to m8 (1.5–1.8), the curvature of the space can be much more convoluted. In certain situations to reach the single local minimum within the modelfree space the optimisation algorithms must pass through a long, curved and deep tunnel or multidimensional valley in which minimisation can be quite complex. The closer the timescales of the two motions, τ_{ f } and τ_{ s }, the more drawn out, crescentshaped and finer the tunnel becomes and therefore the more difficult it is to reach the local minimum. Examples of this type of curvature are shown in Fig. 3 (as well as Figs. S19, S21 and S22 of the supplementary material).
The best optimisation techniques
As discussed in the supplementary material, of the numerous optimisation algorithms tested the Newton line search algorithm combined with the backtracking line search (Nocedal and Wright 1999) and GMW Hessian modification (Gill et al. 1981) was found to be the best combination for modelfree analysis. The use of parameter deconvolution in the double motion models, by optimising the parameters \(\{S^2,\, S^2_f,\, \tau_s\}\) rather than \(\{S^2_f,\, S^2_s, \tau_s\},\) is also important for simplifying the modelfree optimisation problem.
Failure of optimisation
By investigating the results of minimisation of Modelfree4, Dasha and relax across the grids spanning the modelfree space a number of points of failure have been identified. These are described in detail below and include: the failure of the limits algorithm within Modelfree4; the singular matrix failure of the Levenberg–Marquardt optimisation algorithm in all programs; the low precision of both Modelfree4 and Dasha causing incomplete optimisation; and a bug within the Levenberg–Marquardt algorithm of Modelfree4 causing early termination of minimisation.
Failure of the limits algorithm
The type of curvature which complicates modelfree optimisation with boundaries is illustrated in Fig. 2 by the grid point S ^{2} = 0.970, τ_{ e } = 2048 ps and R _{ ex } = 0.149 s^{−1} where the shallow valley parallel to the correlation time dimension is clearly visible. The results of optimisation using Modelfree4, Dasha and relax are presented in Table S6 in the supplementary material, the corresponding positions in the modelfree space being indicated by spheres in the figure. This example was chosen because not only does it illustrate the failure of the constraints algorithm in Modelfree4 but it also demonstrates a failure of the Levenberg–Marquardt algorithm in Dasha which is due either to the singular matrix failure of the algorithm or to Dasha’s constraint algorithm. The results of Dasha using the combined Newton–Raphson/conjugate gradient optimisation are close to the truth yet low precision prevents the exact values from being found. relax is able to find the exact values to within machine precision.
Within Modelfree4 when a trajectory step of the Levenberg–Marquardt algorithm exceeds a constraint, that step terminates flush at the boundary. If the negative of the gradient at that position points back into the box then optimisation will continue, otherwise optimisation terminates. In Fig. 2 the downhill gradient at the point where Modelfree4 terminates points outside of the box, hence the minimum is not found. In Dasha constraints are implemented by imposing a penalty on steps outside the limit. The penalty is added to the chisquared value as the distance outside the constraint to the sixth power (Orekhov et al. 1995a). The program relax implements the Method of Multipliers algorithm (Nocedal and Wright 1999), also known as the Augmented Lagrangian, which allows optimisation to be successful in this case. As a penalty is added to the chisquared value when a constraint is violated the optimisation trajectory can exit the box in which minimisation is constrained. During the progression of the Method of Multipliers algorithm the penalty is increased. The end result is that optimisation will slide along the constraint face, albeit just outside the box, allowing the local minimum to be found.
Incomplete minimisation due to low precision
A number of internal program settings influence the precision of optimisation. These include the maximum number of iterations of the minimisation algorithm, the function tolerance, the gradient tolerance and a step length tolerance. For a simple space low optimisation precision will have a negligible effect, however the modelfree space is highly convoluted. Both the curved, shallow valley parallel to the correlation time dimensions as well as the curved, deep tunnels through the double motion spaces are particularly sensitive to optimisation precision.
The grid point \(S^2_f=0.952,\, S^2_s=0.582,\) and τ_{ s } = 32 ps perfectly illustrates the results of the different optimisation precisions of Modelfree4, Dasha and relax. It was chosen due to its optimisation complexity and hence its ability to thoroughly test the precision of optimisation. The space close to the minimum demonstrates the classic curved and deep tunnel of modelfree double motion spaces (Fig. 3). Despite both the chisquared difference between iterations and the length of each individual step being quite small the addition of numerous small steps results in a large change in parameter values and a large overall decrease in the chisquared value.
The optimisation precisions of Modelfree4, Dasha and relax are compared in Table 1. From the low number of iterations as well as the relatively high cutoff tolerances, of all programs, Modelfree4 by far exhibits the lowest precision. This is reflected in the optimisation results where Modelfree4 terminates optimisation at the very start of the tunnel, the result being that the \(S^2_f\) and \(S^2_s\) parameters appear to be swapped and the correlation time is overestimated by an order of magnitude. The chisquared difference between the results and the true minimum is 3.47, a statistically significant value which strongly biases model selection away from model m5. The relax cutoffs allow the true parameter values to be found regardless of the curvature of the modelfree space thereby facilitating unbiased model selection. The precision of both Modelfree4 and Dasha can be increased to the high default values of relax by modification and recompilation of the source code. The high precision modelfree results of both programs are presented in Table S8 of the supplementary material.
Singular matrix failure of Levenberg–Marquardt minimisation
The Levenberg–Marquardt optimisation algorithm is a widely used technique which, for most problems, performs extremely well. However this method is known to, at times, experience outright failure. Its Achilles heel is when the Levenberg–Marquardt matrix is singular which, when encountered, causes the algorithm necessarily terminate at the current position. The failure is linked to the curvature of the space. The modelfree space is particularly prone to the zero pivot or singular matrix failure of the Levenberg–Marquardt technique. This is due to the Lorentzian form of the modelfree spectral density functions and occurs whenever one of the internal correlation times is undefined due to the corresponding order parameter being equal to one. When both order parameters \(S^2_f\) and S ^{2} of the extended modelfree spectral density function are equal to zero the τ_{ s } parameter is additionally undefined.
Two features of modelfree optimisation tend to amplify the problem, the initial grid search prior to optimisation and the constraints used during optimisation. When searching through the order parameter dimensions the grid searches have upper and lower limits of one and zero respectively. This results in a large number of initial starting positions whereby the first iteration of the Levenberg–Marquardt algorithm is preconditioned to fail. If constraints are used during optimisation the order parameters are limited to being between zero and one, again accentuating the problem. The number of failures encountered by Modelfree4 are given in Table 2. The percentages from both relax and Dasha should be lower due to the constraint algorithm. The result of these failures is that the modelfree parameter values are meaningless. The convolution of the space ensures that proximity to the minimum cannot be guaranteed and the chisquared value is significantly overestimated.
The Modelfree4 bug
One final issue affecting the results of modelfree analysis was identified in the Modelfree program and has been traced back to a bug. The result is that optimisation is terminated prematurely. In many cases optimisation stops on the very first step—the final modelfree parameters are those found by the initial grid search. To demonstrate the failure the grid point S ^{2} = 0.388, τ_{ e } = 128 ps and R _{ ex } = 0.223 s^{−1} was chosen, the simple curvature of which is demonstrated in Fig. 4. The gradient at the black sphere, which is the position of the Modelfree4 results, is perpendicular to the isosurface passing through it hence failure is clearly not due to optimisation complexity. The results of optimisation using Modelfree4, Dasha and relax are presented in Table S9 of the supplementary material. The number of failures for both grids are presented in Table 3. This bug has been fixed in Modelfree versions 4.20 and above.
The double minima of the modelfree space
Local versus global optimisation
In the modelfree optimisation of models m0 to m9 (1.0–1.9), models tm0 to tm9, the Brownian rotational diffusion tensor parameters, or the totality of all modelfree and diffusion parameters one fundamental assumption about the chisquared space is made. The programs Modelfree4, Dasha, DYNAMICS and relax all use local rather than global optimisation algorithms. Their success is reliant upon the presence of only one minimum within the space being searched. If this is indeed true, finding the local minimum is synonymous with finding the global minimum. However it has been noted that there are cases where this basic premise collapses.
The two minima
In Orekhov et al. (1995b), and subsequently discussed in Korzhnev et al. (1997), it was demonstrated that there are situations in which two minima exist within a single space. This occurred when motions were occurring on two timescales but the fitted model only represented a single internal motion. The example presented was composed of synthetic data representing m5 (Model 5) where the parameter values were \(S^2_f=0.9,\, S^2_s = 0.8,\) and τ_{ s } = 0.8 ns (ibid.). The global correlation time was set to 7 ns and, as there is only a single residue, this model is identical to tm5. To approximately replicate the example the CSA and bond length were set to −170 ppm and 1.02 Å, respectively. In Fig. 5a the two minima of the space are clearly distinguishable. When using noisefree synthetic data, as in Orekhov et al. (1995b), the first and broadest of the minima is located at the position S ^{2} = 0.749, τ_{ e } = 0.89 ns and τ_{ m } = 7.1 ns, its chisquared value being 8.27. The second minimum, which is narrower but is the global minimum, is at S ^{2} = 0.790, τ_{ e } = 0.10 ns and τ_{ m } = 6.5 ns. Its chisquared value is 1e^{−29} and, if found by the local minimiser instead of the broad minimum, AIC model selection (Akaike 1973; d’Auvergne and Gooley 2003) will pick tm2 over the true tm5 model.
Avoidance of the problem
The selection of the simplistic and incorrect double minima model can be avoided in a number of indirect ways. The first is implicit within experimental data as the addition of noise to the synthetic test case of Orekhov et al. (1995b) differentially shifts the chisquared values of all the models and, in most cases, tilts the balance towards model tm5 (Fig. 5b). For the first and second minima of the tm2 space the average chisquared values shifted to 28.0 ± 10.5 and 8.5 ± 5.2, respectively. The chisquared value of model tm5 shifted from zero to an average of 2.0 ± 2.0 and, in using AIC model selection, this model is selected 84% of the time over the global minimum of model tm2 (assuming that this minimum is found 100% of the time). Therefore for most experimental data AIC model selection will discard the double minima model.
Second, the double minima phenomenon is only a tiny subset of a much bigger problem—the hiding of nanosecond motions when the whole protein exhibits motions on these timescales. This larger problem will be discussed in detail in the next paper of this series, Paper II, and can be avoided by using the new modelfree optimisation protocol which is also presented in that paper. Finally, if all else fails, collection of data at another field strength will solve the problem (Orekhov et al. 1995b). The addition of 800 MHz data to the above synthetic, noisefree example causes model tm5, which now has a minimised chisquared value of 3.4e^{−10}, to be chosen by AIC model selection over model tm2 which has a new global minimum of 6.82. The chisquared value of the broader minimum is 16.98.
Cytochrome c_{2}: optimisation of the diffusion tensor
When solving the modelfree puzzle in order to obtain the dynamic description of the system, the optimisation of models m0 to m9 is only one part of a multifaceted enigma. A second minimisation problem needs to be resolved and that is the optimisation of the Brownian rotational diffusion tensor.
Two different approaches: global verses local optimisation
An ideal, real protein test case for the optimisation of the rotational diffusion tensor is that of the ferrocytochrome c_{2} molecule from the photosynthetic alphaproteobacterium Rhodobacter capsulatus (Blackledge et al. 1998; Cordier et al. 1998). This system, a proven ellipsoid, is perfect for testing the local optimisation of the diffusion tensor parameters using the new modelfree optimisation protocol (Paper II). In the analysis of Blackledge et al. (1998) the optimisation algorithm utilised was not a local but a global minimiser—simulated annealing combined with a fuzzy logic algorithm utilising the Levenberg–Marquardt local minimiser as a subalgorithm. The R _{1} and R _{2} relaxation rates collected at 600 MHz were used to characterise the diffusion tensor of cytochrome c_{2} using the program Tensor. This approach employed the original ideas of Kay et al. (1989) whereby the R _{2}/R _{1} ratio is used to find the diffusion tensor by chisquared optimisation. To replicate as closely as possible the results of Blackledge et al. (1998) and Cordier et al. (1998) and to allow for comparison of the global and local optimisation techniques modelfree model m1 (Model 1) was chosen for all residues. The use of this model is important as the chisquared equation used in Tensor and that used in relax are defined differently.
Although the global and local optimisation algorithms are operating in χ ^{2} spaces with different dimensions and dimensionalities the final diffusion tensors should nevertheless be the same. This is demonstrated in Table S10 of the supplementary material where the original results from the program Tensor (Blackledge et al. 1998; Cordier et al. 1998; Dosset et al. 2000; Tsan et al. 2000) are contrasted with the results from the program relax. Comparing the eigenvalues of the various diffusion tensors both methodologies can be seen to return the same geometric picture.
The glide reflection and translational symmetries
Figure 6 shows a subset of the chisquared space which is searched in the optimisation of both the prolate and oblate spheroid diffusion tensors. Four distinct minima are clearly visible—two correspond to the prolate spheroid, two to the oblate spheroid. The duplication of minima within each of the spheroid types is due to the glide reflection symmetry of the orientational parameter space \({\mathfrak{O}}\) (d’Auvergne 2006). The paired minima are identical and are related by {θ,ϕ} = {π − θ, ϕ − π}. This figure is very similar to Fig. 1 of Blackledge et al. (1998) except \({\mathfrak{D}}_{\rm ratio}\) has been swapped for \({\mathfrak{D}}_a\) and the glide reflection has been shown.
The map of the subspace of ellipsoid orientational parameters \({\mathfrak{O}} \in {\mathfrak{S}}\) (d’Auvergne and Gooley 2007) shown in Fig. S24 of the supplementary material was constructed similarly to that of the spheroid subspace. The geometric diffusion parameters \({\mathfrak{G}}\) and all S ^{2} values were fixed to those of the minimised diffusion tensor while the Euler angles of \({\mathfrak{O}}\) were varied. The figure clearly demonstrates the four identical minima caused by the glide reflection and translational symmetries of the space \({\mathfrak{O}}.\)
The adequacy of local minimisers
For each of the diffusion models the diffusion parameter space \({\mathfrak{D}}\) contains only a single local minimum because of three restrictions: the eigenvalue permutation restriction of the geometric parameter subspace \({\mathfrak{G}}\) by \({\mathfrak{D}}_x \leq {\mathfrak{D}}_y \leq {\mathfrak{D}}_z;\) the collapse of the glide reflection and translational symmetries of the subspace \({\mathfrak{O}}\) by α = α − π, {α, β, γ} = {π − α, π − β, γ − π}, {α, β} = {π − α, β − π} and {θ, ϕ} = {π − θ, ϕ − π}; and the isolation of the prolate and oblate subspaces by the restrictions \({\mathfrak{D}}_a \geq 0\) and \({\mathfrak{D}}_a \leq 0\) respectively (d’Auvergne 2006). The smoothness of the space and the presence of only a single local and, therefore, global minimum in either the spheroid subspaces or full ellipsoid space is demonstrated in both Figs. 6 and S24. Because of these two factors Tensor’s global minimiser, which is based on simulated annealing combined with a fuzzy logic algorithm, is not essential for optimising the diffusion tensor.
Conclusion
Two topological features convolute the modelfree space causing optimisation to be a nontrivial problem. The best local optimisation algorithm within the modelfree space was found to be Newton minimisation in conjunction with the backtracking line search and the Gill, Murray and Wright (GMW) Hessian modification. In using the cytochrome c_{2} relaxation data (Blackledge et al. 1998; Cordier et al. 1998), local optimisation algorithms were shown to be sufficient for optimising the diffusion tensor parameters.
Four problems have been identified in the currently used modelfree optimisation software. The singular matrix failure of the Levenberg–Marquardt minimisation algorithm is a problem which occurs when correlation times are undefined, a phenomenon associated with order parameters of one. Modelfree analysis is highly prone to this failure as both the initial coarse grid search and constraint algorithms can place the order parameter at one. This common algorithm should be avoided for modelfree analysis. The second problem is low precision causing the final values to be distant from the local minimum, affecting both Modelfree and Dasha. To find the minimum in the twisted modelfree space high precision optimisation with fine cutoffs and a large number of iterations is essential. The third failure is due to Modelfree4’s constraint algorithm in which sliding along the limits is never allowed. Finally, Modelfree4 is heavily affected by a bug which causes early termination of the Levenberg–Marquardt algorithm. The result of these four failures of optimisation is that minimisation is terminated early, often far from the true modelfree parameter values.
All the steps of modelfree analysis from the initial spectral peak intensities to the final modelfree description of the dynamics of the system are implemented by the program relax. The model selection methodologies described in d’Auvergne and Gooley (2003), relaxation curvefitting, NOE calculation, reduced spectral density mapping and modelfree analysis are all available within relax. This software solves all of the optimisation issues described herein.
Abbreviations
 AIC:

Akaike’s Information Criteria
 BFGS:

Broyden, Fletcher, Goldfarb, and Shanno quasiNewton optimisation
 CD:

Coordinate descent optimisation
 CG:

Conjugate gradient
 CSA:

Chemical Shift Anisotropy
 \({\mathfrak{D}}\) :

set of diffusion tensor parameters
 DMG:

Double Motion Grid
 \({\mathfrak{G}}\) :

set of geometric diffusion parameters
 GMW:

Gill, Murray, and Wright
 LM:

Levenberg–Marquardt optimisation
 MT:

Moré and Thuente steplength selection algorithm
 NR:

the combined Newton–Raphson/conjugate gradient minimisation of Dasha
 \({\mathfrak{O}}\) :

set of orientational diffusion parameters
 RG:

R _{ ex } Grid
 \({\mathfrak{S}}\) :

the global model, space, or universe
 XH bond:

heteronucleusproton bond
References
Akaike H (1973) Information theory and an extension of the maximum likelihood principle. In: Petrov BN, Csaki F (eds) Proceedings of the second international symposium on information theory. Budapest, Akademia Kiado, pp 267–281
Barbato G, Ikura M, Kay LE, Pastor RW, Bax A (1992) Backbone dynamics of calmodulin studied by 15N relaxation using inverse detected twodimensional NMR spectroscopy: the central helix is flexible. Biochemistry 31(23):5269–5278
Blackledge M., Cordier F, Dosset P, Marion D (1998) Precision and uncertainty in the characterization of anisotropic rotational diffusion by N15 relaxation. J Am Chem Soc 120(18):4538–4539
Broyden CG (1970) The Convergence of a Class of Doublerank Minimization Algorithms 1. General Considerations. J Inst Math Appl 6(1):76–90
Clore GM, Szabo A, Bax A, Kay LE, Driscoll PC, Gronenborn AM (1990) Deviations from the simple 2parameter modelfree approach to the interpretation of N15 nuclear magneticrelaxation of proteins. J Am Chem Soc 112(12):4989–4991
Cordier F, Caffrey M, Brutscher B, Cusanovich MA, Marion D, Blackledge M (1998) Solution structure, rotational diffusion anisotropy and local backbone dynamics of Rhodobacter capsulatus cytochrome c2. J Mol Biol 281(2):341–361
d’Auvergne EJ (2006) Protein dynamics: a study of the modelfree analysis of NMR relaxation data. Ph.D. thesis, Biochemistry and Molecular Biology, University of Melbourne. http://eprints.infodiv.unimelb.edu.au/archive/00002799/
d’Auvergne EJ, Gooley PR (2003) The use of model selection in the modelfree analysis of protein dynamics. J Biomol NMR 25(1):25–39
d’Auvergne EJ, Gooley PR (2006) Modelfree model elimination: a new step in the modelfree dynamic analysis of NMR relaxation data. J Biomol NMR 35(2):117–135
d’Auvergne EJ, Gooley PR (2007) Set theory formulation of the modelfree problem and the diffusion seeded modelfree paradigm. Mol Biosyst 3(7):483–494
Dosset P, Hus JC, Blackledge M, Marion D (2000) Efficient analysis of macromolecular rotational diffusion from heteronuclear relaxation data. J Biomol NMR 16(1):23–28
Einstein A (1905) Über die von der molekularkinetischen Theorie der Wärme geforderte Bewegung von in ruhenden Flüssigkeiten suspendierten Teilchen (The motion of elements suspended in static liquids as claimed in the molecular kinetic theory of heat). Ann Physik 17(8):549–560
Favro LD (1960) Theory of the rotational brownian motion of a free rigid body. Phys Rev 119(1):53–62
Fletcher R (1970) A new approach to variable metric algorithms. Comput J 13(3):317–322
Fletcher R, Reeves CM (1964) Function minimization by conjugate gradients. Comp J 7(2):149–154
Fushman D, Cahill S, Cowburn D (1997) The mainchain dynamics of the dynamin pleckstrin homology (PH) domain in solution: analysis of 15N relaxation with monomer/dimer equilibration. J Mol Biol 266(1):173–194
Gill PE, Murray W, Wright MH (1981) Practical optimization. Academic Press
Goldfarb D (1970) A family of variablemetric methods derived by variational means. Math Comp 24(109):23–26
Hestenes MR, Stiefel E (1952) Methods of conjugate gradients for solving linear systems. J Res Nat Bur Stand 49(6):409–436
Kay LE, Torchia DA, Bax A (1989) Backbone dynamics of proteins as studied by 15N inverse detected heteronuclear NMR spectroscopy: application to staphylococcal nuclease. Biochemistry 28(23):8972–8979
Korzhnev DM, Billeter M, Arseniev AS, Orekhov VY (2001) NMR studies of Brownian tumbling and internal motions in proteins. Prog NMR Spectrosc 38(3):197–266
Korzhnev DM, Orekhov VY, Arseniev AS (1997) Modelfree approach beyond the borders of its applicability. J Magn Reson 127(2):184–191
Lefevre JF, Dayie KT, Peng JW, Wagner G (1996) Internal mobility in the partially folded DNA binding and dimerization domains of GAL4: NMR analysis of the N–H spectral density functions. Biochemistry 35(8):2674–2686
Levenberg K (1944) A method for the solution of certain nonlinear problems in least squares. Quart Appl Math 2:164–168
Lipari G, Szabo A (1982a) Modelfree approach to the interpretation of nuclear magneticresonance relaxation in macromolecules I. Theory and range of validity. J Am Chem Soc 104(17):4546–4559
Lipari G, Szabo A (1982b) Modelfree approach to the interpretation of nuclear magneticresonance relaxation in macromolecules II. Analysis of experimental results. J Am Chem Soc 104(17):4559–4570
Mandel AM, Akke M, Palmer AG 3rd (1995) Backbone dynamics of Escherichia coli ribonuclease HI: correlations with structure and function in an active enzyme. J Mol Biol 246(1):144–163
Marquardt DW (1963) An algorithm for least squares estimation of nonlinear parameters. SIAM J 11:431–441
Moré JJ, Thuente DJ (1994) Line search algorithms with guaranteed sufficient decrease. ACM Trans Maths Software 20(3):286–307
Nocedal J, Wright SJ (1999) Numerical optimization, Springer series in operations research. New York, SpringerVerlag
Orekhov VY, Korzhnev DM, Diercks T, Kessler H, Arseniev AS (1999) H1N15 NMR dynamic study of an isolated alphahelical peptide (136) bacteriorhodopsin reveals the equilibrium helixcoil transitions. J Biomol NMR 14(4):345–356
Orekhov VY, Nolde DE, Golovanov AP, Korzhnev DM, Arseniev AS (1995a) Processing of heteronuclear NMR relaxation data with the new software DASHA.Appl Magn Reson 9(4):581–588
Orekhov VY, Pervushin KV, Korzhnev DM, Arseniev AS (1995b) Backbone dynamics of (171)Bacterioopsin and (136)Bacterioopsin studied by 2dimensional H1N15 NMRspectroscopy. J Biomol NMR 6(2):113–122
Palmer AG, Rance M, Wright PE (1991) Intramolecular motions of a zinc finger DNAbinding domain from Xfin characterized by protondetected natural abundance C12 heteronuclear NMRspectroscopy. J Am Chem Soc 113(12):4371–4380
Perrin F (1934) Mouvement Brownien d’un ellipsoïde (I). Dispersion diéletrique pour des molécules ellipsoïdales. J Phys Radium 5:497–511
Perrin F (1936) Mouvement Brownien d’un ellipsoïde (II). Rotation libre et dépolarisation des fluorescences. Translation et diffusion de molécules ellipsoïdales. J Phys Radium 7:1–11
Polak E, Ribière G (1969) Note sur la convergence de méthodes de directions conjuguées. Revue Française d’Informatique et de Recherche Opérationnelle 16:35–43
Schurr JM, Babcock HP, Fujimoto BS (1994) A test of the modelfree formulas. Effects of anisotropic rotational diffusion and dimerization. J Magn Reson B 105(3):211–224
Shanno DF (1970) Conditioning of quasiNewton methods for function minimization. Math Comp 24(111):647–656
Steihaug T (1983) The conjugate gradient method and trust regions in large scale optimization. SIAM J Numer Anal 20(3):626–637
Tsan P., Hus JC, Caffrey M, Marion D, Blackledge M (2000) Rotational diffusion anisotropy and local backbone dynamics of carbon monoxidebound Rhodobacter capsulatus cytochrome c2. J Am Chem Soc 122(23):5603–5612
Woessner DE (1962) Nuclear spin relaxation in ellipsoids undergoing rotational brownian motion. J Chem Phys 37(3):647–654
Zhuravleva AV, Korzhnev DM, Kupce E, Arseniev AS, Billeter M, Orekhov VY (2004) Gated electron transfers and electron pathways in azurin: a NMR dynamic study at multiple fields and temperatures. J Mol Biol 342(5):1599–1611
Acknowledgements
We thank Andrew J. Perry for discussions on both the science and programming aspects behind relax. A number of relax’s design features, usability and core layout ideas originated from those discussions. This includes the concepts of the promptbased interface with the help system and tab completion, the automatic generation of the documentation on the user functions by parsing their docstrings, and the powerful Python scripting available to the user. Haydyn D. T. Mertens and James D. Swarbrick are acknowledged for helpful discussions. We are indebted to Chris A. MacRaild for many code contributions quashing a large number of bugs, to Gary S. Thompson for both improvements to the exponential relaxation curvefitting C modules and uncovering problems in relax, and to James Horne and Horst J. Schirra for help in tracking down bugs throughout the program.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License ( https://creativecommons.org/licenses/bync/2.0 ), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
About this article
Cite this article
d’Auvergne, E.J., Gooley, P.R. Optimisation of NMR dynamic models I. Minimisation algorithms and their performance within the modelfree and Brownian rotational diffusion spaces. J Biomol NMR 40, 107–119 (2008). https://doi.org/10.1007/s1085800792142
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s1085800792142