Optimized RDC-based Iterative and Unified Model-free analysis (ORIUM) consists of three principal stages for the extraction of RDC order parameters \( \left( {S_{RDC}^{2} } \right) \) from data measured in multiple alignment media (see Fig. 2 for schematic diagram). First, the matrix formalism introduced by Tolman in the DIDC approach is utilized to calculate refined structural coordinates from the alignment tensors (Tolman 2002). From here, each refined vector is put into a local axis system in order to determine the vector specific structural and dynamic information (Meirovitch et al. 2012; Meiler et al. 2001; Peti et al. 2002). Finally, the resulting Euler angles are used as structural input to restart the calculation in an iterative fashion, similar to SCRM (Lakomek et al. 2008). ORIUM continues until the variation in \( \left( {S_{RDC}^{2} } \right) \) for the entire dataset falls below a certain threshold.
Alignment tensor calculation
For two nuclear spins, the observed resonance splitting (Hz) resulting from the partial alignment of a protein emanate from the secular part of the magnetic dipole interaction
$$ D_{k}^{exp} = D_{ij}^{max} \left\langle {{{\left( {3 \cos^{2} \theta_{k} - 1} \right)} \mathord{\left/ {\vphantom {{\left( {3 \cos^{2} \theta_{k} - 1} \right)} 2}} \right. \kern-0pt} 2}} \right\rangle $$
(1)
$$ D_{ij}^{max} = - \frac{{\mu_{0} \gamma_{i} \gamma_{j} \hbar }}{{4\pi^{2} r_{ij}^{3} }} $$
(1a)
where \( \mu_{0} \) is the permeability of vacuum, \( \gamma_{X} \) is the gyromagnetic ratio of spin X, ℏ is Planck’s constant, r
ij
is the distance between nuclei i and j (assumed to be fixed at 1.02 Å for the N–HN and 1.095 Å for the Cα–Hα vectors), and θ
k
is the angle between the inter-nuclear vector formed by nuclear spin pair k and the magnetic field (B
0). The angular brackets denote ensemble averaging. As Eq. (1) explicitly illustrates, the magnitude of \( D_{k}^{exp} \) depends on \( \left\langle {(3\cos^{2} \theta_{k} - 1)/2} \right\rangle \). By definition, the term cos θ
k
is the scalar product between an inter-nuclear vector and the vector parallel to B
0.
When considering a rigid molecule, the coordinates of an inter-nuclear vector can be described within an arbitrary reference frame, termed the molecular frame (MF), and defined by three angles, β
x
, β
y
, and β
z
, between the vector and the respective MF axes. In a similar fashion, the vector parallel to B
0 can be expressed by three angles representing the instantaneous orientation of B
0 relative to the MF axes, α
x
, α
y
, and α
z
. Within the MF, \( D_{k}^{exp} \) can be recast as
$$ D_{k}^{exp} = D_{ij}^{max} \left\langle {B \cdot A} \right\rangle $$
(2)
where \( \left\langle {B \cdot A} \right\rangle \) is the scalar product of two vectors representing the inter-nuclear orientations (B) and the B
0 orientations (A). Here, A is the alignment tensor and B is the inter-nuclear vector tensor. Both A and B contain five independent terms and are related to a 3 × 3 second rank Cartesian order tensor as follows (Saupe 1964, 1968; Snyder 1965)
$$ A = \left[ {a_{zz} , \frac{1}{\sqrt 3 }\left( {a_{xx} - a_{yy} } \right),\;\frac{2}{\sqrt 3 }a_{xz} ,\;\frac{2}{\sqrt 3 }a_{yz} ,\;\frac{2}{\sqrt 3 }a_{xy} } \right]_{l} $$
(3)
where the orientation of B
0 in the MF is given by
$$ a_{mn} = \left\langle {\frac{1}{2}\left( {3\cos \alpha_{m} \cos \alpha_{n} - \delta_{mn} } \right)} \right\rangle_{l} $$
(3a)
and
$$ B = \left[ {b_{zz}^{k}, \frac{1}{\sqrt 3 }\left( {b_{xx}^{k} - b_{yy}^{k} } \right), \frac{2}{\sqrt 3 }b_{xz}^{k}, \frac{2}{\sqrt 3 }b_{yz}^{k}, \frac{2}{\sqrt 3 }b_{xy}^{k} } \right] $$
(4)
where the orientation of the inter-nuclear vector in the MF is described by
$$ b_{mn}^{k} = \left\langle {\frac{1}{2}\left( {3\cos \beta_{m}^{k} \cos \beta_{n}^{k} - \delta_{mn} } \right)} \right\rangle . $$
(4a)
The term δ
mn
represents the Kronecker delta function, l is the alignment condition, and m, n = x, y, z.
A matrix formalism is introduced to render analysis of the RDC data in a more intuitive manner (Tolman 2002). When K RDCs are measured under L alignments, then Eq. (2) becomes
$$ {\mathbf{D}} = \left\langle {\mathbf{B}} \right\rangle \left\langle {\mathbf{A}} \right\rangle $$
(5)
where D is a K × L matrix, B is a K × 5 matrix, and A is a 5 × L matrix. In Eq. (5), the term \( D_{ij}^{max} \) is included in \( \left\langle {\mathbf{A}} \right\rangle \). The rows of B are defined by Eq. (4) and the columns of A are given by Eq. (3). An inherent assumption in the present analysis is that inter-nuclear dynamics are uncorrelated with the alignment process; hence the averages of \( \left\langle \bf{B} \right\rangle \) and \( \left\langle \bf{A} \right\rangle \) are independent of each other. This assumption can be tested with the SECONDA analysis (Hus and Brüschweiler 2002; Hus et al. 2003). When the structure of the molecule is known and RDCs for at least five linearly independent inter-nuclear vectors are measured, the matrix B (input from the rigid structure or random structural coordinates) and the measured RDCs are used to calculate \( \left\langle \bf{A} \right\rangle \)
$$ \left\langle {\mathbf{A}} \right\rangle = {\mathbf{B}}^{ + } {\mathbf{D}} $$
(6)
where B
+ is the pseudo-inverse of B. It should be noted that a single alignment tensor per alignment medium is necessary for the successful application of the following protocols. Intrinsically disordered proteins (see Bertoncini et al. 2005; Bernadó et al. 2005) and multiple domain proteins (see Bertini et al. 2004; Rodriguez-Castañeda et al. 2006) will have to be described by several alignment tensors per alignment medium and will not be amenable to the present analysis.
Each column of \( \left\langle \bf{A} \right\rangle \), given by Eq. (3), can be recast into L symmetric 3 × 3 second rank Cartesian order tensors, \( (A_{l}^{(2)} ) \). These order matrices are then redefined in a principal axis system (PAS), termed the alignment frame (AF), where Eq. (1) becomes (Bax et al. 2001)
$$ D_{k,l}^{exp} = D_{a,l} \left[ {\left\langle {3\cos^{2} \theta_{k,l}^{AF} - 1} \right\rangle + \frac{3}{2}R_{l} \left\langle {\sin^{2} \theta_{k,l}^{AF} \cos 2\phi_{k,l}^{AF} } \right\rangle } \right]. $$
(7)
In Eq. (7), the magnitude of the alignment tensor is \( D_{a,l} = \frac{1}{2}D_{ij}^{max} *A_{zz,l}^{PAS} \), the rhombicity is \( R_{l} = \frac{{2\left({A_{xx,l}^{PAS} - A_{yy,l}^{PAS}} \right)}}{{3*A_{zz,l}^{PAS}}},\,(\theta_{k,l}^{AF},\phi_{k,l}^{AF}) \) are the polar angles defining the inter-nuclear vector in the AF, and \( A_{(mm,l)}^{PAS} \) are the eigenvalues resulting from the diagonalization of \( A_{l}^{(2)} \). From the eigenvectors \( \left( {A_{mn,l}^{EV} } \right) \), the Euler angles describing the rotation of \( A_{l}^{(2)} \) into the PAS are defined
$$ \begin{aligned} \alpha_{l} &= { \arctan }\left[ {A_{xz,l}^{EV} ,A_{yz,l}^{EV} } \right],\quad \beta_{l} = { \arccos }\left[ {A_{zz,l}^{EV} } \right],\\ \gamma_{l} &= \arctan \begin{array}{*{20}c} {\left[ { - A_{zx,l}^{EV} ,A_{zy,l}^{EV} } \right].} \\ \end{array} \end{aligned}$$
(8)
Model free analysis
With the MFA (Meiler et al. 2001; Peti et al. 2002), the five parameters describing each alignment tensor in the PAS, \( \left\{ {A_{zz}^{PAS} , R, \alpha , \beta , \gamma } \right\}_{l} \), are used to construct the F matrix which is needed to derive the five dynamically averaged second order spherical harmonics
$$ \left\langle {Y_{2,0} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle = \sqrt {\frac{5}{16\pi }} \left\langle {3\cos^{2} \theta_{k}^{MF} - 1} \right\rangle $$
(9a)
$$ \left\langle {Y_{2, \pm 1} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle = \mp \sqrt {\frac{15}{8\pi }} \left\langle {e^{{ \pm i{\phi_{k}^{MF} }}} \cos \theta_{k}^{MF} \sin \theta_{k}^{MF} } \right\rangle $$
(9b)
$$ \left\langle {Y_{2, \pm 2} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle = \sqrt {\frac{15}{32\pi }} \left\langle {e^{{ \pm 2i\phi_{k}^{MF} }} \sin^{2} \theta_{k}^{MF} } \right\rangle . $$
(9c)
Equation (7) can be recast in terms of dynamically averaged second order spherical harmonics
$$ D_{k,l}^{exp} = A_{zz,l}^{PAS} \sqrt {\frac{4\pi }{5}} \left[ {\left\langle {Y_{2,0} \left( {\theta_{k,l}^{AF} ,\phi_{k,l}^{AF} } \right)} \right\rangle + \sqrt \frac{3}{8} R_{l} \left( {\left\langle {Y_{2,2} \left( {\theta_{k,l}^{AF} ,\phi_{k,l}^{AF} } \right)} \right\rangle + \left\langle {Y_{2, - 2} \left( {\theta_{k,l}^{AF} ,\phi_{k,l}^{AF} } \right)} \right\rangle } \right) } \right] $$
(10)
where
$$ \left\langle {Y_{2,0} \left( {\theta_{k,l}^{AF} ,\phi_{k,l}^{AF} } \right)} \right\rangle = \sqrt {\frac{5}{16\pi }} \left\langle {3\cos^{2} \theta_{k,l}^{AF} - 1} \right\rangle $$
(10a)
$$ \left\langle {Y_{2,2} \left( {\theta_{k,l}^{AF} ,\phi_{k,l}^{AF} } \right)} \right\rangle + \left\langle {Y_{2, - 2} \left( {\theta_{k,l}^{AF} ,\phi_{k,l}^{AF} } \right)} \right\rangle = 2\sqrt {\frac{15}{32\pi }} \left\langle {\sin^{2} \theta_{k,l}^{AF} \cos 2\phi_{k,l}^{AF} } \right\rangle . $$
(10b)
The F matrix relates the measured RDCs to the spherical harmonics defined in the MF by a Wigner rotation from the MF to the AF
$$ \frac{{D_{k,l}^{exp} }}{{A_{zz,l}^{PAS} }} = \mathop \sum \limits_{M = - 2}^{2} F_{l,M} \left\langle {Y_{2,M} (\theta_{k}^{MF} ,\phi_{k}^{MF} )} \right\rangle $$
(11)
with
$$ \begin{aligned} F_{l,M} & = \sqrt {\frac{4\pi }{5}} \left[ {D_{M0}^{2} (\alpha_{l} ,\beta_{l} ,\gamma_{l} ) + \sqrt \frac{3}{8} R_{l} \left( {D_{M2}^{2} (\alpha_{l} ,\beta_{l} ,\gamma_{l} ) + D_{M - 2}^{2} (\alpha_{l} ,\beta_{l} ,\gamma_{l} )} \right) } \right] \\ & = \sqrt {\frac{4\pi }{5}} \left[ {e^{{ - iM\alpha_{l} }} d_{M0}^{2} (\beta_{l} ) + \sqrt \frac{3}{8} R_{l} \left( {e^{{ - iM\alpha_{l} }} d_{M2}^{2} (\beta_{l} )e^{{ - i2\gamma_{l} }} + e^{{ - iM\alpha_{l} }} d_{M - 2}^{2} (\beta_{l} )e^{{i2\gamma_{l} }} } \right) } \right]. \\ \end{aligned} $$
(12)
In analogy to the component definition from Eq. (5), Y is a K × 5 matrix containing the dynamically averaged spherical harmonics in the MF and F is a 5 × L matrix containing the alignment tensor information. The \( \left\langle{\mathbf{Y}}\right\rangle_{\mathbf{refined}} \) matrix is determined in direct correspondence to Eq. (6)
$$ \left\langle {\mathbf{Y}} \right\rangle_{{{\mathbf{refined}}}} = {\mathbf{D}}_{{{\mathbf{normalized}}}} \left\langle {\mathbf{F}} \right\rangle^{ + } . $$
(13)
Here, \( {\mathbf{D}}_{{{\mathbf{normalized}}}} \) represents \( \frac{D_{k,l}^{exp}}{A_{zz,l}^{PAS}} \) in order to normalize the contributions of each alignment condition to the calculation of refined structural coordinates. Each row of \( \left\langle{\mathbf{Y}}\right\rangle_{\mathbf{refined}} \) is used to determine \( S_{RDC,k}^{2} \)
$$ S_{{RDC,k}}^{2} = \frac{{4\pi }}{5}\sum\limits_{{M = - 2}}^{2} {\left\langle {Y_{{2,M}} \left( {\theta _{k}^{{MF}} ,\phi _{k}^{{MF}} } \right)} \right\rangle } \left\langle {Y_{{2,M}}^{*} \left( {\theta _{k}^{{MF}} ,\phi _{k}^{{MF}} } \right)} \right\rangle . $$
(14)
From the dynamically averaged spherical harmonics, the dynamically averaged orientations for each inter-nuclear vector, \( \left( {\theta_{avg,k}^{MF} , \phi_{avg,k}^{MF} } \right) \), can be obtained. Maximizing \( \left\langle {Y_{ 2,0} \left( {\theta_{k}^{VF} , \phi_{k}^{VF} } \right)} \right\rangle \) places the z axis of the vector’s axis system, termed the vector frame (VF), in the center of the inter-nuclear vector’s orientational distribution,
$$ \begin{aligned} { \hbox{max} }\left\langle {Y_{2,0} (\theta_{k}^{VF} ,\phi_{k}^{VF} )} \right\rangle & = \mathop \sum \limits_{M = - 2}^{2} D_{M,0} \left( {\phi_{avg,k}^{MF} ,\theta_{avg,k}^{MF} ,0} \right)\left\langle {Y_{2,M} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle \\ & = \sqrt {\frac{4\pi }{5}} \sum\limits_{M = - 2}^{2} {Y_{2, - M} \left( {\theta_{avg,k}^{MF} ,\varphi_{avg,k}^{MF} } \right)\left\langle {Y_{2,M} (\theta_{k}^{MF} ,\phi_{k}^{MF} )} \right\rangle } . \\ \end{aligned} $$
(15)
The terms \( \left\langle {Y_{ 2, \pm 1} \left( {\theta_{k}^{MF} , \phi_{k}^{MF} } \right)} \right\rangle \) vanish in the VF and \( \left\langle {Y_{ 2, \pm 2} (\theta_{k}^{MF} , \phi_{k}^{MF} )} \right\rangle \) possesses information on the amplitude of anisotropy, η
k
, and the orientation of anisotropic motions, \( \phi_{k}^{\prime} \)
$$ \eta_{k} = \sqrt {\frac{{\mathop \sum \nolimits_{M = - 2,2} \left\langle {Y_{2,M} (\theta_{k}^{VF} ,\phi_{k}^{VF} )} \right\rangle \left\langle {Y_{2, - M} (\theta_{k}^{VF} ,\phi_{k}^{VF} )} \right\rangle }}{{\mathop \sum \nolimits_{M = - 2}^{2} \left\langle {Y_{2,M} (\theta_{k}^{VF} ,\phi_{k}^{VF} )} \right\rangle \left\langle {Y_{2, - M} (\theta_{k}^{VF} ,\phi_{k}^{VF} )} \right\rangle }}} $$
(16)
$$ \phi_{k}^{'} = \frac{1}{2}\arctan \frac{{\left\langle {Y_{2,2} (\theta_{k}^{VF} ,\phi_{k}^{VF} )} \right\rangle - \left\langle {Y_{2, - 2} (\theta_{k}^{VF} ,\phi_{k}^{VF} )} \right\rangle }}{{i\left( {\left\langle {Y_{2,2} \left( {\theta_{k}^{VF} ,\phi_{k}^{VF} } \right)} \right\rangle + \left\langle {Y_{2, - 2} \left( {\theta_{k}^{VF} ,\phi_{k}^{VF} } \right)} \right\rangle } \right)}}. $$
(17)
It should be noted that \( S_{RDC,k}^{ 2} \) is the same in any frame, thus
$$ S_{RDC,k}^{2} = \frac{4\pi }{5}\mathop \sum \limits_{M = - 2}^{2} \left\langle {Y_{2,M} (\theta_{k}^{VF} ,\phi_{k}^{VF} )} \right\rangle \left\langle {Y_{2,M}^{*} (\theta_{k}^{VF} ,\phi_{k}^{VF} )} \right\rangle , $$
(18)
which is equivalent to Eq. (14).
Standard tensorial analysis
Recalling Eq. (4), the following relationships are established in order to construct \( B_{k}^{(2)} \) (Snyder 1965)
$$ b_{zz,k} = \sqrt {\frac{4\pi }{5}} \left\langle {Y_{2,0} (\theta_{k}^{MF} ,\phi_{k}^{MF} )} \right\rangle $$
(19a)
$$\begin{aligned} \frac{1}{\sqrt 3 }\left( {b_{xx,k} - b_{yy,k} } \right) &= \sqrt \frac{1}{2} \sqrt {\frac{4\pi }{5}} \left[ \left\langle {Y_{2, - 2} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle \right.\\ &\quad +\left. \left\langle {Y_{2,2} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle \right] \end{aligned} $$
(19b)
$$ \begin{aligned} \frac{2}{\sqrt 3 }b_{xz,k} &= \sqrt \frac{1}{2} \sqrt {\frac{4\pi }{5}} \left[ \left\langle {Y_{2, - 1} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle \right.\\ &\quad -\left. \left\langle {Y_{2,1} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle \right]\end{aligned} $$
(19c)
$$\begin{aligned} \frac{2}{\sqrt 3 }b_{yz,k} &= i\sqrt \frac{1}{2} \sqrt {\frac{4\pi }{5}} \left[ \left\langle {Y_{2, - 1} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle\right. \\ &\quad +\left. \left\langle {Y_{2,1} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle \right] \end{aligned} $$
(19d)
$$ \frac{2}{\sqrt 3 }b_{xy,k} = i\sqrt \frac{1}{2} \sqrt {\frac{4\pi }{5}} \left[ {\left\langle {Y_{2, - 2} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle - \left\langle {Y_{2,2} \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right)} \right\rangle } \right]. $$
(19e)
The resulting eigenvalues (\( B_{mm,k}^{PAS} ) \) contain the dynamic information for each vector \( \left( {S_{RDC,k}^{ 2} , \eta_{k} } \right) \), while the eigenvectors, \( \left( {B_{mn,k}^{EV} } \right) \), encompass the bond orientations \( \left({\theta_{k}^{MF}, \phi_{k}^{MF}}\right) \) and the direction of the anisotropic local motion \( \left( {\phi_{k}^{'} } \right) \). The following equations detail how the dynamic parameters are calculated from \( B_{mm,k}^{PAS} \). The Saupe order parameters are defined as
$$ S_{0,k}^{2} = B_{zz,k}^{PAS} = \sqrt {\frac{4\pi }{5}} \left\langle {Y_{2,0} (\theta_{k}^{PAS} ,\phi_{k}^{PAS} )} \right\rangle $$
(20a)
$$ S_{2,k}^{2} = \sqrt \frac{2}{3} (B_{xx,k}^{PAS} - B_{yy,k}^{PAS} ) = \sqrt {\frac{4\pi }{5}} \left[ {\left\langle {Y_{2,2} \left( {\theta_{k}^{PAS} ,\phi_{k}^{PAS} } \right)} \right\rangle + \left\langle {Y_{2, - 2} \left( {\theta_{k}^{PAS} ,\phi_{k}^{PAS} } \right)} \right\rangle } \right]. $$
(20b)
$$ S_{RDC,k}^{2} = \left( {B_{zz,k}^{PAS} } \right)^{2} + \frac{1}{3}\left( {B_{xx,k}^{PAS} - B_{yy,k}^{PAS} } \right)^{2} = \left( {S_{0,k}^{2} } \right)^{2} + \frac{1}{2}\left( {S_{2,k}^{2} } \right)^{2} $$
(21)
$$ \eta_{k}^{PAS} = \sqrt {\frac{{\frac{1}{3}\left( {B_{xx,k}^{PAS} - B_{yy,k}^{PAS} } \right)^{2} }}{{S_{RDC,k}^{2} }}} = \sqrt {\frac{{\frac{1}{2}\left( {S_{2,k}^{2} } \right)^{2} }}{{S_{RDC,k}^{2} }}} . $$
(22)
For each inter-nuclear vector, \( \left( {\theta_{k}^{MF} ,\phi_{k}^{MF} } \right) \) and \( \phi_{k}^{\prime } \) are extracted from the transpose of the resulting \( B_{mn,k}^{EV} \) matrix
$$ \begin{aligned} \phi_{k}^{MF} &= { \arctan }\left[ {B_{xz,k}^{EV} ,B_{yz,k}^{EV} } \right]],\quad \theta_{k}^{MF} = { \arccos }\left[ {B_{zz,k}^{EV} } \right],\\ \phi_{k}^{\prime } &= { \arctan }\left[ { - B_{zx,k}^{EV} ,B_{zy,k}^{EV} } \right].\end{aligned} $$
(23)
Direct interpretation of dipolar couplings
With DIDC, once \( \left\langle{\mathbf{A}}\right\rangle \) is determined from Eq. (6), \( \left\langle {\mathbf{A}} \right\rangle \) is used to directly calculate a new set of dynamically averaged coordinates, \( {\mathbf{B}}_{{{\mathbf{refined}}}} \), without extracting each set of \( \left\{ {A_{zz}^{PAS} ,R,\alpha ,\beta ,\gamma } \right\}_{l} \), according to
$$ \left\langle {\mathbf{B}} \right\rangle_{{{\mathbf{refined}}}} = {\mathbf{D}}\left\langle {\mathbf{A}} \right\rangle^{ + }\,+\,{\mathbf{B}}\left[ {1 - \left\langle {\mathbf{A}} \right\rangle \left\langle {\mathbf{A}} \right\rangle^{ + } } \right]. $$
(24)
This formula leaves the information for \( \left\langle{\mathbf{A}}\right\rangle \) in the MF. It should be noted that the previous implementations of DIDC did not scale the RDCs by \( A_{zz,l}^{PAS} \) as in the MFA (see Eq. 13), which is necessary to normalize the contributions of each alignment condition for the calculation of refined structural coordinates. Therefore, we have modified Eq. (24) as follows
$$ \left\langle {\mathbf{B}} \right\rangle_{{{\mathbf{refined}}}} = {\mathbf{D}}_{{{\mathbf{normalized}}}} \left\langle {\mathbf{A}} \right\rangle_{{{\mathbf{normalized}}}}^{ + }\,+\,{\mathbf{B}}\left[ {1 - \left\langle {\mathbf{A}} \right\rangle \left\langle {\mathbf{A}} \right\rangle^{ + } } \right], $$
(25)
where \( {\mathbf{D}}_{{{\mathbf{normalized}}}} \) and \( \left\langle {\mathbf{A}} \right\rangle_{{{\mathbf{normalized}}}} \) represent the RDCs and alignment tensors divided by \( A_{zz,l}^{PAS} \).
As described by Tolman, the first term in Eqs. (24) and (25) encompasses the contribution of the measured RDCs to determining \( \left\langle {\mathbf{B}} \right\rangle_{{{\mathbf{refined}}}} \) (Tolman 2002). When the rank of \( \left\langle {\mathbf{A}} \right\rangle \) is smaller than 5, then the second term accounts for the degeneracy in the possible solutions that results from B. Otherwise, this term will equal zero for data representing more than five alignment media. With \( \left\langle{\mathbf{B}}\right\rangle_{{{\mathbf{refined}}}} \), the 3 × 3 second rank Cartesian tensor, \( B_{k}^{(2)} \), for each inter-nuclear vector is constructed, diagonalized into the VF, and Eqs. (21), (22), and (23) calculate each set of \( \left\{ {S_{RDC}^{2} ,\eta ,\theta_{avg}^{MF} ,\phi_{avg}^{MF} ,\phi^{\prime } } \right\}_{k} \).
ORIUM procedure
Optimized RDC-based Iterative and Unified Model-free analysis (ORIUM) is an iterative approach (see Fig. 2) and is related to but different from the SCRM and iterative DIDC approaches as discussed in this section. The scheme is summarized as follows. First, alignment tensors, \( \left\langle{\mathbf{A}}\right\rangle \), are calculated with Eq. (6) and are used to determine \( \left\langle{\mathbf{B}}\right\rangle_{\mathbf{refined}} \). A comparison of the effects of scaling the RDCs with \( A_{zz,l}^{PAS} \) in the determination of \( \left\langle{\mathbf{B}}\right\rangle_{{{\mathbf{refined}}}} \) will be presented in the Applications section below [see Eqs. (24) and (25)]. Based on \( \left\langle{\mathbf{B}}\right\rangle_{{{\mathbf{refined}}}} \), the 3 × 3 symmetric Saupe matrix is constructed for the inter-nuclear vectors using expressions defined in Eqs. (19a)–(19e), and \( B_{k}^{(2)} \) is put into the local PAS. Utilizing Eqs. (21), (22), and (23), each set of \( \{ S_{RDC}^{ 2} , \eta , \theta^{MF} , \phi^{MF} , \phi^{\prime } \}_{k} \) is extracted. These refined angles (θ
MF
k
, ϕ
MF
k
) are used as input for the next cycle of ORIUM. The cycle is finished when the convergence of order parameter is achieved using the relationship
$$ \frac{1}{K}\mathop \sum \limits_{k = 1}^{K} \left| {S_{RDC,k}^{2} \left( r \right) - S_{RDC,k}^{2} \left( {r - 1} \right)} \right| \le 0.0001 $$
(26)
where r is a cycle of iteration.
The ORIUM approach differs from the SCRM method as follows. There is a minor difference: with SCRM, the inter-nuclear vector coordinates are defined in terms of spherical harmonics, while ORIUM utilizes Cartesian coordinates. The relationship between the spherical harmonics and the Cartesian coordinates are give by Eqs. (19a)–(19e). A key difference is that SCRM requires the five alignment tensor parameters \( \left\{ {A_{zz}^{PAS},R,\alpha ,\beta ,\gamma } \right\}_{l} \) to construct the F matrix for the determination of \( \left\langle{\mathbf{Y}}\right\rangle_{{{\mathbf{refined}}}} \). DIDC and ORIUM calculate \( \left\langle{\mathbf{B}}\right\rangle_{{{\mathbf{refined}}}} \) directly from \( \left\langle{\mathbf{A}}\right\rangle_{{{\mathbf{normalized}}}} \). Finally, SCRM maximizes \( \left\langle {Y_{ 2,0} (\theta_{k}^{VF} , \phi_{k}^{VF} )} \right\rangle \), whereas ORIUM places \( B_{k}^{(2)} \) into a local axis system by diagonalization of the symmetric 3 × 3 second rank Cartesian tensor.
There are three key differences between ORIUM and the iterative DIDC approach. First, a grid search is implemented with the iterative DIDC scheme which minimizes the difference between the vector’s coordinates obtained from \( \left\langle{\mathbf{B}}\right\rangle_{{{\mathbf{refined}}}} \) and an exhaustive list of \( (\theta ,\phi ) \) combinations to find dynamically averaged coordinates. As stated above, ORIUM diagonalizes \( B_{k}^{(2)} \) into a local axis system in order to extract this information. The second key difference is that with the iterative DIDC scheme each inter-nuclear vector is constrained to be rigid \( \left( {S_{RDC}^{ 2} = 1} \right) \). Only during the final iterative run is the \( S_{RDC}^{ 2} = 1 \) constraint removed. ORIUM never constrains the dynamics of the inter-nuclear vectors during the iterative procedure. A final divergence between the two procedures is how flexible inter-nuclear vectors are removed from the calculation of the alignment tensors. In the iterative DIDC procedure, RDC data for an individual inter-nuclear vector is removed from the calculation of the alignment tensors if the error in the experimental and back-calculated RDCs is greater than a factor of 2. The calculation is restarted and RDC data for the next inter-nuclear vector is once again removed from the calculation if the deviation is greater than a factor of 2. This procedure is repeated until all inter-nuclear vectors fulfill the threshold for the error in experimental and back-calculated RDCs. At this point, the \( S_{RDC}^{2} = 1 \) constraint is removed and a final iteration is performed. ORIUM removes the most flexible residues \( S_{RDC}^{ 2} \le 0. 9 5 \), after Eq. (26) has been satisfied (see below) and then ORIUM is restarted until Eq. (26) is once again fulfilled.
As with the RDC-based model free analysis, the fundamental assumption is that the internal protein dynamics for each inter-nuclear vector is uncorrelated with fluctuations with the alignment tensor. Thus, a single average alignment tensor can be utilized for each medium. Molecular dynamics simulations indicate that this assumption is true for secondary structural elements, however \( \left\langle {\mathbf{B}} \right\rangle \) and \( \left\langle {\mathbf{A}} \right\rangle \) dynamics may be correlated for the most mobile regions of a protein (Louhivuori et al. 2006; Salvatella et al. 2008). To circumvent this potential inseparability of mobile inter-nuclear vectors and the alignment tensor fluctuations, the approach outlined in the SCRM procedure is followed (Lakomek et al. 2008). After convergence is achieved with Eq. (26), the residues that are the most mobile, as determined by fulfilling the relationship \( S_{RDC}^{2} \le 0.95 \), are removed from the \( \left\langle {\mathbf{A}} \right\rangle \) calculation and ORIUM is restarted with \( \left\langle {\mathbf{B}} \right\rangle_{{{\mathbf{refined}}}} \) from the previous iteration until Eq. (26) is once again satisfied.
The validity of ORIUM was accessed with synthetic RDC data containing a measurement error (0.3 Hz) for the 36 alignment media, which was generated using the RDC refined ubiquitin ensemble ERNST (PDB:2KOX) (Fenwick et al. 2011). The corresponding dynamic parameters (\( S_{RDC}^{2} \) and η) were also calculated using the same ensemble. Using these synthetic RDC data, ORIUM was conducted and the resulting dynamic parameters have been compared with those calculated from the ensemble. The Pearson correlations of \( S_{RDC}^{2} \) and η are 0.97 and 0.93, respectively.
It should be noted that the local PAS differs from the VF when \( B_{zz,k}^{{}} \) is a negative value, although the local PAS is usually the VF. In this case, the averaged vector orientation is actually orthogonal to the z axis of PAS. This issue can be alleviated by choosing a new axis system referred to the vector frame system (VFS), with eigenvalues ordered \( B_{zz,k} \ge B_{xx,k} \ge B_{yy,k} \) instead of \( | {B_{zz,k}^{{}} }| \ge | {B_{xx,k}^{{}} }| \ge | {B_{yy,k}^{{}} }| \). It should also be noted that \( \eta \) from the VFS and the PAS can be significantly different in the case that \( B_{zz,k}^{{}} \) has a negative value. ORIUM utilizes the VFS after removal of residues with \( S_{RDC}^{2} \le 0.95 \) to obtain dynamically averaged angles of the bond vector distribution.
Determination of \( S_{RDC}^{2} \) scaling factor: \( S_{overall} \)
An inherent complication when calculating \( S_{RDC}^{2} \) from experimental RDCs is that dynamic averaging will reduce the actual magnitude of \( A_{zz,l}^{PAS} \) or \( D_{a,l} \). This reduced magnitude will result in some \( S_{RDC}^{2} \) parameters over 1, and thus \( S_{RDC}^{2} \) can only be considered as relative gauge of the actual amplitudes of motion, defined as \( S_{RDC,unscaled}^{2} \) (Lakomek et al. 2006; Meiler et al. 2001). It should be noted that the alignment tensor parameters \( \left\{ {R,\alpha ,\beta ,\gamma } \right\}_{l} \) are unaffected by the reduction in the magnitude of \( A_{zz,l}^{PAS} \) or \( D_{a,l} \). Two avenues to circumvent this complication have been developed. Either all the order parameters are scaled relative to the largest \( S_{RDC, unscaled}^{2} \) leaving one order parameter equal to one (iterative DIDC approach) (Tolman 2002; Yao et al. 2008), or \( S_{RDC, unscaled}^{2} \) is scaled relative to the Lipari-Szabo order parameters (\( S_{LS}^{2} \)) calculated for each residue (MFA/SCRM approach) (Lakomek et al. 2006, 2008). The problem with the first approach is that the resulting \( S_{RDC}^{2} \) parameters will underestimate the amplitude of motion for each inter-nuclear vector. Overestimation can only occur if the largest \( S_{RDC, unscaled}^{2} \) parameter has a large experimental error, leading to an artificially greater value for this parameter than in reality. Sub- and supra-τc motion happening for each vector equally will not be picked up by this approach, underestimating the motion except for the mentioned case. As for the second approach, \( S_{LS}^{2} \) are required which may not be available for the vectors being analyzed. While this approach has been successfully applied, it may also underestimate motion since a general supra-τc motion affecting all the nuclei will not be picked up by this approach. Comparison of the \( S_{overall} \) derived in Lakomek et al. 2008 and Lange et al. 2008 with the average order parameter from solid state data (Schanda et al. 2010) shows that the solid state NMR derived average order parameter is smaller than the one derived by this second approach suggesting that supra-τc motion affecting all nuclei is seen by solid state NMR but not the \( S_{LS}^{2} \) versus \( S_{RDC}^{2} \) approach.
Here, we present a new method for determining \( S_{overall} \) without the requirement of additional information, such as \( S_{LS}^{2} \), which may not be available for the inter-nuclear vectors under investigation. The scaling procedure separates an inter-nuclear vector’s motion into its principal axes in Cartesian space and leads to parameters that have a more straightforward physical interpretation. The inter-nuclear vector’s motional variance is directly related to the resulting eigenvalues calculated from diagonalization of \( B_{k}^{(2)} \) into a local axis system. The methodology outlined below exploits the fact that variance cannot be negative by definition. Therefore, a uniform scaling parameter, \( S_{overall} \), is necessary to insure that the variance for each inter-nuclear vector about each of the three principal axes is positive. In the following, we present a brief outline for the derivation of bond vector motional variance for the determination of \( S_{overall} \).
For each vector, the following relationships between the dynamically averaged Eigenvalues and the unit vector coordinates (x, y, z) within the VF, as shown in Eq. (4a), are as follows
$$ B_{zz} = \frac{{3\left\langle {z^{2} } \right\rangle - 1}}{2},\quad B_{xx} = \frac{{3\left\langle {x^{2} } \right\rangle - 1}}{2},\quad B_{yy} = \frac{{3\left\langle {y^{2} } \right\rangle - 1}}{2}. $$
(27)
The normalization condition sets \( x^{2} + y^{2} + z^{2} = 1 \), which also implies \( \left\langle {x^{2} } \right\rangle + \left\langle {y^{2} } \right\rangle + \left\langle {z^{2} } \right\rangle = 1 \). Therefore, \( B_{zz} \) can be recast as
$$ B_{zz} = \frac{{2 - 3\left\langle {x^{2} } \right\rangle - 3\left\langle {y^{2} } \right\rangle }}{2}. $$
(28)
Utilizing the definitions of \( S_{RDC}^{2} \) and η [Eqs. (21), and (22)], we can now reformulate \( S_{RDC}^{2} \) and η in terms of the Cartesian coordinates defined within the VF
$$ S_{RDC}^{2} = 1 - 3\left\langle {x^{2} } \right\rangle + 3\left\langle {x^{2} } \right\rangle^{2} - 3\left\langle {y^{2} } \right\rangle + 3\left\langle {x^{2} } \right\rangle \left\langle {y^{2} } \right\rangle + 3\left\langle {y^{2} } \right\rangle^{2} $$
(29)
$$ \eta = \frac{{\sqrt 3 \left( {\left\langle {x^{2} } \right\rangle - \left\langle {y^{2} } \right\rangle } \right)}}{{2\sqrt {S_{RDC}^{2} } }}. $$
(30)
The definition of variance is \( \sigma_{k}^{2} = \left\langle {\left( {k - \overline{k} } \right)^{2} } \right\rangle \), where k = x, y. Therefore, \( \sigma_{k}^{2} \) can be substituted for \( \left\langle {k^{2} } \right\rangle \). Now, \( S_{RDC}^{2} \) and η are defined in terms of variance
$$ S_{RDC}^{2} = 1 - 3\sigma_{x}^{2} + 3\left( {\sigma_{x}^{2} } \right)^{2} - 3\sigma_{y}^{2} + 3\sigma_{x}^{2} \sigma_{y}^{2} + 3\left( {\sigma_{y}^{2} } \right)^{2} $$
(31)
$$ \eta = \frac{{\sqrt 3 (\sigma_{x}^{2} - \sigma_{y}^{2} )}}{{2\sqrt {S_{RDC}^{2} } }}. $$
(32)
Solving the system of equations gives the inverse relationships
$$ \sigma_{x}^{2} = \frac{{1 - \sqrt {\left( {1 - \eta^{2} } \right)S_{RDC}^{2} } + \eta \sqrt {3S_{RDC}^{2} } }}{3} $$
(33)
$$ \sigma_{y}^{2} = \frac{{1 - \sqrt {\left( {1 - \eta^{2} } \right)S_{RDC}^{2} } - \eta \sqrt {3S_{RDC}^{2} } }}{3} $$
(34)
A graphical depiction of the mapping between these parameters is shown in Figure S1. Using the relationship (\( S_{RDC}^{2} = S_{overall}^{2} S_{RDC, unscaled}^{2} \)), these equations can be written as
$$ \sigma_{x}^{2} = \frac{{1 - S_{overall} \sqrt {\left( {1 - \eta^{2} } \right)S_{RDC,unscaled}^{2} } + S_{overall} \eta \sqrt {3S_{RDC,unscaled}^{2} } }}{3} $$
(35)
$$ \sigma_{y}^{2} = \frac{{1 - S_{overall} \sqrt {\left( {1 - \eta^{2} } \right)S_{RDC,unscaled}^{2} } - S_{overall} \eta \sqrt {3S_{RDC,unscaled}^{2} } }}{3} $$
(36)
Since the variance must always be positive, the axis with the least variance (\( (\sigma_{y}^{2} ) \) should also be positive. Thus, the following inequalities are derived relating \( S_{RDC}^{2} \) and η to \( \sigma_{y}^{2} \)
$$ \sigma_{y}^{2} = \frac{{1 - S_{overall} \sqrt {\left( {1 - \eta^{2} } \right)S_{RDC,unscaled}^{2} } - S_{overall} \eta \sqrt {3S_{RDC,unscaled}^{2} } }}{3} \ge 0 $$
(37)
$$ \begin{aligned} S_{overall} &\le \frac{1}{{\sqrt {\left( {1 - \eta^{2} } \right)S_{RDC,unscaled}^{2} } + \eta \sqrt {3S_{RDC,unscaled}^{2} } }} \\ &= - \frac{1}{{2B_{yy}^{unscaled} }} = S_{overall}^{max} .\end{aligned} $$
(38)
Using Eq. (38), residue-specific \( S_{overall}^{max} \) can be obtained using \( S_{RDC}^{2} \) and η, or from the lowest eigenvalue. The eigenvalue definition of \( S_{overall}^{max} \) follows directly from Eq. (27).
Since the reduction of magnitude in the alignment due to dynamic averaging is a global effect throughout all residues, the least residue-specific \( S_{overall}^{max} \) may be utilized as the scaling factor, if there is no experimental error. The previous method in which all order parameters are scaled relative to the largest \( S_{RDC, unscaled}^{2} \), leaving one order parameter equal to one (iterative DIDC approach) (Tolman 2002; Yao et al. 2008) is related to this new approach. If bond vector anisotropy is assumed to be axially symmetric (η = 0), \( S_{overall}^{max} \) turns into \( 1/S_{RDC,unscaled} \). This is identical to scaling all inter-nuclear vectors such that the largest is 1 (Figure S1).
This scaling approach using the lowest residue-specific \( S_{overall}^{max} \) may introduce a systematic bias due to the fact that experimental data contain errors. In order to alleviate the systematic bias, we used a statistical procedure accounting for the effect of experimental noise on \( S_{overall} \) without any knowledge of \( S_{LS}^{2} \) unlike the SCRM approach. First, scaling factors were calculated from the original data as well as datasets with noise added equivalent to the experimental error. These scaling factors were used to determine a value (which we term \( S_{overall}^{95\,\% } \)), below which there was a 95 % chance that the true \( S_{overall} \) would fall. Given the maximum scaling factor from the original data that fulfills the constraint equation for all inter-nuclear vectors, \( S_{overall}^{max} \), and corresponding set of values from noise added data (NAD), \( S_{overall,NAD}^{max} \), the \( S_{overall}^{95\,\% } \) value can be calculated as follows:
$$ S_{overall}^{95\,\% } = \frac{{S_{overall}^{max} }}{{\left\langle {S_{overall,NAD}^{max} } \right\rangle }}quantile(S_{overall,NAD}^{max} ,95\,\% ) $$
(39)
where the quantile function returns the given quantile of the set. The quantile prefactor compensates for systematic shifts resulting from the addition of experimental error. With the previous study (Lakomek et al. 2008), the determination of \( S_{overall} \) was conservative in order to circumvent the chance for over-estimating the supra-τc motion, reflected in the reported \( S_{RDC}^{2} \). Here, the criterion for scaling is that \( \sigma_{y}^{2} \) should be positive, which possesses no time-scale bias. Yet, it should be noted that this overall order parameter is an upper limit for \( S_{overall} \) since it could underestimate motion if there is a uniform sub- or supra-τc motion affecting all vectors. This is summarized in Table 1.
Table 1 Summary of the methods for determining \( S_{overall} \)