In the secular approximation, a dipolar coupling D
kl
between two nuclear spins k and l with distance vector r
kl
has a magnitude of
$$ D_{kl} = \mu _{kl} \user2{r}_{kl}^T \user2{Sr}_{kl} /r_{kl}^5 $$
(1)
where S is the Saupe order matrix and
\( \mu _{kl} = - \mu _0 \gamma _k \gamma _l h/8\pi ^3 \) (Saupe and Englert 1963). This relation is strictly valid only if the molecule is rigid and undergoes rotational diffusion, which we will assume in the following. The Saupe tensor describes the average orientation of the molecule as well as the degree of alignment. It is determined by several factors such as the solvent medium, its concentration, the molecule’s shape and electrostatic properties (Zweckstetter and Bax 2000; Zweckstetter et al. 2004; Zweckstetter 2006). The alignment tensor is symmetric and traceless and can be parameterized with five independent elements s
1,..., s
5:
$$ S = \left( {\begin{array}{*{20}c} {s_1 - s_2 } & {s_3 } & {s_4 } \\ {s_3 } & { - s_1 - s_2 } & {s_5 } \\ {s_4 } & {s_5 } & {2s_2 } \\ \end{array} } \right) $$
(2)
Using this parameterization, a dipolar coupling can be written as the scalar product between two five-dimensional vectors:
$$ D_{kl} = \mu _{kl} \user2{s}^T \user2{a}(\user2{r}_{kl} ) $$
(3)
where
$$\begin{aligned} \user2{s}^T = (s_1 ,s_2 ,s_3 ,s_4 ,s_5), \\{\user2{a}(\user2{r})^T} = (x^2 - y^2 ,\,3z^2 - r^2 ,\,2xy,\,2xz,\,2yz)/r^5 , \end{aligned}$$
(4)
for an internuclear vector r with Cartesian coordinates x, y, z and length r. Equation (3) reveals that dipolar couplings depend linearly on the tensor elements, which allows us to treat them similarly to the Karplus parameters which also enter linearly into the Karplus relation (Karplus 1963).
In analogy to our treatment of scalar coupling constants (Habeck et al. 2005a), we model the observation of a single dipolar coupling with a Gaussian error distribution with an unknown error σ. The likelihood function, i.e. the probability of a data set comprising n measurements, is (Habeck et al. 2006)
$$ L(\theta ,s,\sigma ) = (2\pi \sigma ^2 )^{ - n/2} \exp \left\{ { - \frac{1} {{2\sigma ^2 }}\chi ^2 (\theta ,s)} \right\} $$
(5)
where θ are the conformational degrees of freedom of the molecule. The residual of the fit between observed and calculated dipolar couplings resulting from a Gaussian likelihood is
$$ \chi ^2 (\theta ,s) = \sum\limits_{(k,l)} {\left[ {D_{kl} - \mu _{kl} \user2{s}^T \user2{a}(\user2{r}_{kl} )} \right]^2 } $$
(6)
where the sum runs over all pairs of atoms for which a dipolar coupling has been measured. The likelihood function (5) is not a probability for θ, s, and σ in a strict sense, because it is normalized with respect to the data. However, similar to a probability the likelihood function quantifies how consistent settings for θ, s, and σ are with the observations and therefore ranks parameter values according to their ability to explain the data.
Most of the existing methods for structure calculation from dipolar couplings minimize the residual defined in Eq. (6) with respect to the conformational degrees of freedom; during this minimization, the alignment tensor remains fixed to some empirical estimate. Using our probabilistic framework, we are able to determine all unknowns simultaneously, including the conformational degrees of freedom, the five elements of the alignment tensor, and the error of the couplings. The estimation is based on the joint posterior probability distribution
$$ p(\theta ,\user2{s},\sigma ) \propto L(\theta ,\user2{s},\sigma )\,\pi (\theta ,\user2{s},\sigma ) $$
(7)
obtained from Bayes’ theorem (Jaynes 2003). Bayes’ theorem requires a prior probability π(θ,
s
,σ) that quantifies our background knowledge about the unknown parameters. In most situations we dispose of little a priori information about the tensor elements and therefore choose a uniform prior distribution for them.Footnote 1 We also have little knowledge about the error, except that it is a scale parameter (Habeck et al. 2006) leading to π(σ) = 1/σ (Jeffreys 1946). The prior probability for the atomic coordinates is a canonical ensemble at inverse temperature β and is based on a standard molecular force field E(θ) (Rieping et al. 2005a; Habeck et al. 2005b).
Application of Bayes’ theorem results in the posterior distribution
$$ p(\theta ,\user2{s},\sigma ) \propto \sigma ^{ - (n + 1)} \,\exp \left\{ { - \frac{1} {{2\sigma ^2 }}\chi ^2 (\theta ,\user2{s}) - \beta E(\theta )} \right\} $$
(8)
This distribution is a joint probability for all unknown parameters. We make practical use of the posterior distribution by generating a sequence of statistical samples from it. These samples approximate the posterior distribution and can be utilized to estimate θ, s, and σ or to compute an integral such as an expected value.
It is possible to eliminate uninteresting parameters by integrating them out (marginalization (Jaynes 2003; Habeck et al. 2005b)). If, for example, we are not interested in the alignment tensor we can use the marginal posterior distribution
$$ p(\theta ,\sigma ) = \int {{\text{d}}\user2{s}} \;p(\theta ,\user2{s},\sigma ) $$
(9)
to determine the conformational degrees of freedom and the error of the measurements without explicit knowledge of the alignment tensor. In some cases it is possible to solve marginalization integrals analytically. In general, however, we need to integrate numerically using statistical sampling techniques.
A parameterization of the Saupe tensor in terms of five independent matrix elements exhibits several invariances that may complicate the parameter estimation. For example, a reflection of the coordinates along the x-axis can be compensated by changing the signs of s
3
and s
4
. We can reparameterize the alignment tensor using its spectral decomposition
$$\user2{S} = \user2{U}\varvec{\Lambda}\user2{U}^{T}$$
(10)
where U is a rotation matrix and Λ the diagonal matrix of eigenvalues λ
i
which are numbered such that |λ
1| < |λ
2| < |λ
3|; because S is traceless, λ
1 + λ
2 + λ
3 = 0. The rotation matrix U describes the average orientation of the molecule. We define the magnitude A and the rhombicity R of the alignment tensor as
$$ A = \lambda _3 - (\lambda _1 + \lambda _2 )/2 = \lambda _3 /2,\quad R = 2(\lambda _1 - \lambda _2 )/3\lambda _3 $$
(11)
That is, A is related to the size of the largest principal axis and R measures the asymmetry of the alignment tensor along this axis. The strength of a dipolar coupling in the molecular reference frame defined by U is:
$$ D_{kl} = \mu _{kl} A\;\left[ {3\cos ^2 \theta _{kl} - 1 + 3R\sin ^2 \theta _{kl} \cos (2\varphi _{kl} )/2} \right] $$
(12)
where φ
kl
and θ
kl
are the azimuthal and the polar angle of the internuclear vector r
kl
in the principal axis system.
If we describe U with Euler angles α, β, γ and replace the tensor elements s
1,..., s
5 with the new parameters A, R, α, β, γ, we obtain posterior probabilities for the new parameters. The new parameterization has the advantage that it is less degenerate, but the reparameterized posterior distributions become more complicated: R is confined to values between 0 and 2/3, the distribution of the Euler angles is not of a standard form. We therefore use the parametrization based on s
1,..., s
5.