# Relaxation mode analysis for molecular dynamics simulations of proteins

## Abstract

Molecular dynamics simulation is a powerful method for investigating the structural stability, dynamics, and function of biopolymers at the atomic level. In recent years, it has become possible to perform simulations on time scales of the order of milliseconds using special hardware. However, it is necessary to derive the important factors contributing to structural change or function from the complicated movements of biopolymers obtained from long simulations. Although some analysis methods for protein systems have been developed using increasing simulation times, many of these methods are static in nature (i.e., no information on time). In recent years, dynamic analysis methods have been developed, such as the Markov state model and relaxation mode analysis (RMA), which was introduced based on spin and homopolymer systems. The RMA method approximately extracts slow relaxation modes and rates from trajectories and decomposes the structural fluctuations into slow relaxation modes, which characterize the slow relaxation dynamics of the system. Recently, this method has been applied to biomolecular systems. In this article, we review RMA and its improved versions for protein systems.

## Keywords

Protein Simulation Analysis Dynamics## Introduction

Molecular dynamics simulation is widely used for protein research. In general, the focus of this research is to extract information on the physical properties of individual proteins. The results from such simulations are then often compared with experimental results. Since these experiments are generally conducted in solvents, it is necessary to simulate protein and water molecular systems, which are complicated systems. These simulations are conducted for a variety of purposes such as to analyze the stability and dynamics of the structures around crystal structures and to determine folding from an extended structure into a native structure. There are three difficulties in current approaches for protein simulations (Freddolino et al. 2010). The first is the potential function of the protein systems. In recent years, it has become possible to evaluate the molecular force field by improving the sampling, and accuracy has consequently improved. The second problem is related to the sampling. With respect to the folding mechanism, simulation at the millisecond scale is necessary. Recently, it has become possible to perform simulations at the millisecond scale by using special hardware such as Anton (Lindorff-Larsen et al. 2011, 2012; Dror et al. 2012, Lane et al. 2013), but sampling problems still exist for complex systems such as ligand-binding systems and other even more complex systems. The third issue is related to the analysis methods. It is important to extract the characteristic degrees of freedom (order parameters) from the complex protein movements obtained from simulations, which are good indicators for analyzing trajectories.

In normal mode analysis, the normal mode near the minimum point of the potential energy of the protein molecule is obtained (Go et al. 1983; Brooks and Karplus 1983; Levitt et al. 1985). Langevin mode analysis investigates modes around the native structure, including the water effect (Lamm and Szabo 1986; Kottalam and Case 1990; Kitao et al. 1991; Hayward et al. 1993). An elastic network model and Gaussian network model approximately estimate normal modes with large amplitudes by using the harmonic potential of coarse-grained models (Tirion 1996; Baher et al. 1997; Tama and Sanejouand 2001; Cui and Bahar 2005; Miyashita and Tama 2008). This method extracts collective modes with large amplitudes in the case of huge protein systems such as viruses, because huge proteins have rigid-like motions (Tama and Brooks III 2002).

Principal component analysis (PCA), also called quasiharmonic analysis or the essential dynamics method (Levy et al. 1984; Ichiye and Karplus 1991; Abagyan and Argos 1992; Garcia 1992; Hayward et al. 1993; Amadei et al. 1993; Kitao and Go 1999), is one of the most popular methods adopted for analyzing the structural fluctuations around the average structure. The modes with large structure fluctuations are extracted and are regarded as cooperative movement, and the relation of these fluctuations with function has been widely investigated. The obtained modes are also used as the axis of the free-energy surface. Moreover, various other analysis methods have been proposed, such as full correlation analysis (Lange and Grubmüller 2007), subspace joint approximate diagonalization of eigenmatrices (Sakuraba et al. 2010), and wavelet analysis (Kamada et al. 2011), among others (Moritsugu et al. 2015; Matsunaga et al. 2015).

In recent years, it has become possible to perform an extensively long simulation; thus, development of dynamic analysis methods to identify the local minimum-energy states and analyze the transitions between them is required. Accordingly, many methods to analyze the dynamics and kinetics of protein simulations have been developed (Zuckerman 2010; Komatsuzaki et al. 2011; Bowman et al. 2014). In particular, the Markov state model has been presented and applied to many protein systems (Schütte et al. 1999; Swope et al. 2004; Singhal et al. 2004; Chodera et al. 2006, 2007; Chodera and Noé 2014; Noé et al. 2007; Noé and Fischer 2008; Noé and Clementi 2017 Buchete and Hummer 2008; Prinz et al. 2011; Pérez-Hernández et al. 2013; Schwantes and Pande 2013; Schwantes et al. 2014; Bowman et al. 2014; Wu et al. 2017). The Markov state model can analyze transitions between local minimum-energy states, which are identified from clustering analysis methods. This is a powerful method for analyzing dynamics in the context of both long and short simulations of proteins.

Relaxation mode analysis (RMA) was developed to investigate the “dynamic” properties of spin systems (Takano and Miyashita 1995) and homopolymer systems for Monte Carlo (Koseki et al. 1997) and molecular dynamics (Hirao et al. 1997) analyses, and has been applied to various polymer systems (Hagita and Takano 2002; Saka and Takano 2008; Iwaoka et al. 2015; Natori and Takano 2017) to investigate their slow relaxation dynamics (de Gennes 1984; Doi and Edwards 1986). Recently, RMA has also been applied to biomolecular systems (Mitsutake et al. 2011; Mitsutake et al. 2005; Mitsutake and Takano 2015; Nagai et al. 2009, 2013). RMA approximately estimates slow relaxation modes and rates from trajectories obtained from simulations.

*X*

_{ p }} satisfy

*A*(

*t*)

*B*(0)〉 denotes the equilibrium correlation of

*A*at time

*t*and

*B*at time 0:

*T*

_{ t }(

*Q*|

*Q*

^{′}) is the conditional probability that the system is in state

*Q*at time

*t*given that it is in state

*Q*

^{′}at time

*t*= 0. Further,

*P*

_{eq}(

*Q*

^{′}) denotes the probability that the system is in state

*Q*

^{′}at equilibrium. The relaxation rate of

*X*

_{ p }is denoted by

*λ*

_{ p }. The relaxation time is given by 1/

*λ*

_{ p }. Note that the relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively, from the viewpoint of the statistical mechanics (Hirao et al. 1997; Koseki et al. 1997; Mitsutake and Takano 2015) (see the “Relaxation modes {

*X*

_{ p }} and rates

*λ*

_{ p }” section). The point of RMA is that we consider the variational problem, which is equivalent to the eigenvalue problem of the time evolution operator, and choose an appropriate trial function to estimate the slow relaxation modes and rates in the system (see the “RMA” section). From these processes, we obtain the generalized eigenvalue problem of the time correlation matrices for two different times. From the eigenvectors and eigenvalues, we approximately estimate slow relaxation modes and rates.

Conventional RMA approximately estimates slow relaxation modes by solving the generalized eigenvalue problem of the time correlation matrices of coordinates for two different times, *C*(*τ* + *t*_{0}) and *C*(*t*_{0}), which are calculated from the trajectory. Recently, dynamical analysis methods for molecular simulations of biopolymer systems have been developed to investigate slow dynamics. In these techniques such as time structure-based independent component analysis (tICA) (Naritomi and Fuchigami 2011, 2013), time-lagged independent component analysis (TICA) (Pérez-Hernández et al. 2013; Schwantes and Pande 2013), and dynamic component analysis (DCA) (Mori et al. 2015, 2016), time correlation matrices of certain physical quantities or states are used. (Note that tICA is a special case of RMA with *t*_{0} = 0. See Mitsutake et al. (2011) and Naritomi and Fuchigami (2011) for more details on the differences between tICA and RMA.) In tICA, TICA, and DCA, the time correlation functions C(*τ*) and C(0) are used, whereas C(*τ* + *t*_{0}) and C(*t*_{0}) are used in RMA. The relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively. From this point of view, RMA is related to Markov state models. (The relationship among the Markov state model, tICA, and TICA is explained in Pérez-Hernández et al. (2013), Schwantes and Pande (2013), and Mitsutake and Takano (2015).) The combination method of tICA and a Markov state model was also proposed (Pérez-Hernández et al. 2013; Schwantes and Pande 2013). A Markov state model was constructed from clustering in the subspace determined by tICA.

In this review, we first provide a definition of relaxation modes and rates from the viewpoint of the statistical mechanics in the “Relaxation modes {*X*_{ p }} and rates *λ*_{ p }” section. The “RMA” section explains the original RMA (RMA with a single evolution time) and the process of RMA using coordinates for the trial function in detail. The “Improvement of RMA” section explains the improved versions of RMA, including RMA with multiple evolution times, principal component RMA (PCRMA), two-step RMA, and Markov-state RMA (MSRMA). Finally, in the “Application of RMA to a system with large conformational changes” section, we present results from studies in which RMA was applied to a system with large conformational changes. The “Conclusions” section provides conclusions and perspectives on the state of the field.

## Relaxation modes {*X*_{p}} and rates *λ*_{p}

In this section, we provide the definition of relaxation modes and rates from the viewpoint of the statistical mechanics (Risken 1989; Zwanzig 2001). The relaxation modes {*X*_{ p }} satisfy Eq. 1. The relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively. We first explain the relation in three types of simulations satisfying the detailed balance condition.

*P*(

*Q*;

*t*) that the biomolecule is in a state \(Q= \left (\boldsymbol {r}_{1}^{\mathrm {T}},\boldsymbol {r}_{2}^{\mathrm {T}},\right .\) \(\left .\cdots ,\boldsymbol {r}_{N}^{\mathrm {T}}\right )^{\mathrm {T}}\) at time

*t*is described by a master equation:

*Q*|

*Q*

^{′}) denotes the (

*Q*,

*Q*

^{′})-component of the time evolution matrix Γ, and \({\sum }_{Q^{\prime }}\) denotes the summation over all possible states. Γ(

*Q*|

*Q*

^{′}) is also chosen so that the detailed balance for the equilibrium distribution function

*P*

_{eq}(

*Q*) is satisfied:

**r**_{ i },(

*i*= 1,⋯ ,

*N*) is given by the Langevin equation for a biomolecule with

*N*atoms:

**r**_{ i }(

*t*) denotes the position of the

*i*th atom at time

*t*, and

*ζ*is the friction constant. The interaction between atoms is described by the potential

*U*({

**r**_{ i }}) =

*U*(

**r**_{1},…,

**r**_{ N }). The random force

**w**_{ i }(

*t*) acting on the

*i*th atom is a Gaussian white stochastic process and satisfies

*w*

_{i, α},

*k*

_{ B }, and

*T*denote the

*α*-component of

**w**_{ i }(

*α*=

*x*,

*y*, or

*z*), the Boltzmann constant, and the temperature of the system, respectively. The Smoluchowski equation equivalent to Eq. 5 can be written as

*Q*= {

**r**_{1},…,

**r**_{ N }} denotes a point in the phase space of the system, and

*P*(

*Q*,

*t*)

*d*

*Q*denotes the probability that the system is found at time

*t*in an infinitesimal volume

*d*

*Q*at point

*Q*in the phase space. The time evolution operator Γ satisfies the detailed balance condition (Risken 1989):

*Q*)

*δ*(

*Q*−

*Q*

^{′}) and the adjoint operator Γ

^{‡}(

*Q*)

*δ*(

*Q*−

*Q*

^{′}) act only on

*Q*in

*δ*(

*Q*−

*Q*

^{′}). In the matrix representation, so that Γ(

*Q*)

*δ*(

*Q*−

*Q*

^{′}) = Γ(

*Q*|

*Q*

^{′}) and Γ

^{‡}(

*Q*)

*δ*(

*Q*−

*Q*

^{′}) = Γ(

*Q*

^{′}|

*Q*), the detailed balance condition is the same as that in Eq. 4.

**r**_{ i },(

*i*= 1,⋯ ,

*N*) is given by the Langevin equation for a biomolecule with

*N*atoms:

**r**_{ i }(

*t*) and

**v**_{ i }(

*t*) denote the position and the velocity of the

*i*th atom at time

*t*, respectively. The mass of the

*i*th atom is denoted by

*m*

_{ i }and

*ζ*is the friction constant.

*Q*= {

**r**_{1},…,

**r**_{ N },

**v**_{1},…,

**v**_{ N }} denotes a point in the phase space of the system. The time evolution operator Γ satisfies the detailed balance condition:

*P*

_{eq}(

*Q*) =

*P*

_{eq}(

*𝜖*

*Q*). Here,

*𝜖*

*Q*denotes the time-reversed state of the state

*Q*, namely,

*𝜖*

*Q*= {

*𝜖*

_{1}

**r**_{1},…,

*𝜖*

_{ N }

**r**_{ N },.

*𝜖*

_{N+ 1}

**v**_{1},…,

*𝜖*

_{2N}

**v**_{ N }} with

The time evolution equation of *P*(*Q*;*t*) of Eqs. 7 and 11 corresponds to Eq. 3 in the matrix representation. In Monte Carlo and Brownian dynamics, because only coordinates are the degrees of freedom in the system, *𝜖**Q* = *Q*, the detailed balance condition in all three cases is given by Eq. 14.

*Q*|

*Q*

^{′}) of the master equation:

*ϕ*

_{ n }(

*Q*) and

*ψ*

_{ n }(

*Q*) are the left and right eigenfunctions of the time evolution operator Γ with eigenvalue

*λ*

_{ n }, respectively. When we define a quantity \(\hat {\phi }_{n}(Q)\) through

*ϕ*

_{ n }(

*Q*) and \(\hat {\phi }_{m}(Q)\) is given by the following:

*T*

_{ t }(

*Q*|

*Q*

^{′}) =

*e*

^{−Γτ}(

*Q*|

*Q*

^{′}) is the conditional probability that the system is found at time

*t*at

*Q*given that the system is at

*Q*

^{′}at time 0.

*A*(

*Q*) and

*B*(

*Q*) are expanded as

*A*and

*B*in the equilibrium state is given by

*ϕ*

_{ n }(

*Q*) and \(\hat {\phi }_{n}(Q)\), the correlation function 〈

*A*(

*t*)

*B*(0)〉 is decomposed into a sum of exponentially relaxing contributions. Therefore, we use two sets of functions, {

*ϕ*

_{ n }(

*Q*)} and \(\{ \hat {\phi }_{n}(Q) \}\), as relaxation modes, and refer to {

*λ*

_{ n }} as their relaxation rates. The relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively.

## RMA

### RMA with a single evolution time, *t*_{0}

RMA approximately estimates slow relaxation modes and rates from trajectories obtained from simulations. Herein, we explain how to obtain the slow relaxation modes and rates. The point of this method is that we consider the variational problem, which is equivalent to the eigenvalue problem of the time evolution operator, and choose an appropriate trial function in order to estimate the slow relaxation modes and rates in the system.

*λ*

_{ n }

*τ*). RMA treats the variational problem of Eqs. 24 and 25 using trial functions instead of the eigenvalue problem of Eqs. 22 and 23. To choose the trial function given by a linear combination of important relevant quantities, we can evaluate the relaxation modes and rates from simulation data.

*N*atoms and only treat the coordinates, because the velocities have faster relaxations (∼ picosecond order) than coordinates in protein systems. We assume that

*is a 3*

**R***N*-dimensional column vector that consists of a set of atomic coordinates relative to their average coordinates

**r**_{ i }is the coordinate of the

*i*th atom of the biopolymer in the center-of-mass coordinate system, and 〈

**r**_{ i }〉 is its average. Note that because we consider the coordinates only, \(\hat {\phi }_{n}(Q)=\phi _{n}(\epsilon Q)=\phi _{n}(Q)\) holds.

*R*

_{ i }(

*Q*) is the

*i*th component of

*. The quantity*

**R***R*

_{ i }(

*t*;

*Q*) is the expectation value of

*R*

_{ i }after a period

*t*starting from a state

*Q*and satisfies

*R*

_{ i }(

*t*;

*Q*)|

_{t= 0}=

*R*

_{ i }(

*Q*). The parameter

*t*

_{0}is introduced in order to reduce the relative weight of the faster modes contained in

*, and it is expected that Eq. 28 becomes a better approximation as*

**R***t*

_{0}becomes larger.

*C*

_{i, j}(

*t*) is a component of a 3

*N*× 3

*N*symmetric matrix

*C*(

*t*) defined by

*X*

_{ p }is written as

*λ*

_{ p }and the corresponding relaxation modes

*f*

_{p, i}. We chose the indices of

*λ*

_{ p }so that 0 <

*λ*

_{1}≤

*λ*

_{2}≤⋯ holds. Here, the relation

*𝜖*

*Q*=

*Q*, and the Markovian property

*R*

_{ i }are reproduced by

*t*≥

*t*

_{0}. Here,

Because we are considering position coordinates only, the detailed balance condition yields the following consequences: *C*(*t*) is a symmetric matrix, *C*_{i, j}(*t*) = *C*_{j, i}(*t*); {*λ*_{ p }} are real and positive, which corresponds to pure relaxation. We refer to this method as the “RMA method with a single evolution time,” which is *t*_{0}/2.

In practice, the time correlation matrices for the two different times are calculated through simulations. Then, by solving the generalized eigenvalue problem, {*λ*_{ p }} and {*X*_{ p }} are obtained from the eigenvalues and eigenvectors, respectively. To examine the validity of the present analysis, the autocorrelation functions *C*_{i, i}(*t*) are reconstructed from the estimated eigenvalues and eigenvectors and are compared with those directly calculated via simulation.

Herein, we comment on the trial function. When RMA was first introduced to a spin system, states of spins on a lattice were used as the trial function (Takano and Miyashita (1995)). When RMA was first introduced to polymer systems, the coordinates of polymers were used as the trial function (Koseki et al. (1997); Hirao et al. (1997)). In polymer systems, the Rouse modes, which were derived from the theory of polymer physics (Doi and Edwards 1986), correspond to the relaxation modes. Rouse modes are given as linear combinations of coordinates. Thus, when RMA was applied to polymer systems, the modes obtained by RMA were compared with the Rouse modes. In protein systems, PCA using coordinates has been widely used. In PCA, the eigenvalue problem of the covariance matrix of coordinates is solved. Therefore, when we first applied RMA to a hetero polymer system (protein system), it seemed to be better to use coordinates as trial functions. The results of RMA and PCA were directly compared with each other. Recently, we have proposed to use physical quantities with slow motions as the trial functions and PCRMA and two-step RMA have been introduced (see the “Improvement of RMA” section). However, RMA using coordinates as the trial functions has an advantage that we can easily convert the information on the slow relaxation modes to the information in coordinate space.

### RMA for protein systems

In homopolymer systems, relaxation of the positions of a polymer relative to the center of the mass is investigated. This means that the translational degrees of freedom are removed from the coordinates of the polymer. Because the rotational degrees of freedom remain, the rotational relaxation of the polymer is observed as slow relaxations. In protein systems, it is of interest to evaluate fluctuations of the conformations of a biomolecule around its average conformation. Thus, the translational and rotational degrees of freedom are removed from the sampled conformations of a biomolecule. In practice, treatment of the generalized-eigenvalue problem for removing the translational degrees of freedom in the homopolymer system was given by Koseki et al. (1997). Herein, we explain how to treat the generalized eigenvalue problem for removing the translational and rotational degrees of freedom when using the coordinates for the trial function (Mitsutake et al. 2011). The point of this process is that the generalized eigenvalue problem for real symmetric matrices can be easily solved numerically if the matrices are positive definite. Therefore, we shift the zero eigenvalues to finite positive values without changing the other eigenvalues and the corresponding eigenvectors.

**r**_{ i }〉 with

*i*= 1,…,

*N*, and the axes of the coordinate system are chosen to be the principal axes of the moment of the inertia tensor of the average positions. We calculate \(C_{i,j}(t)= \frac {C_{i,j}(t)+C_{j,i}(t)}{2}\) and

*C*

^{′}(

*t*):

*α*,

*β*=

*x*,

*y*,

*z*and

*a*,

*b*= tr,rot. Then, we solve the generalized eigenvalue problem for

*C*

^{′}(

*t*

_{0}+

*τ*) and

*C*

^{′}(

*t*

_{0}), \( C^{\prime }(t_{0}+\tau ) {\boldsymbol v}^{\prime }_{p} = \exp (-\lambda ^{\prime }_{p} \tau ) C^{\prime }(t_{0}) {\boldsymbol v}^{\prime }_{p} \), with the orthonormal condition \( {{\boldsymbol v}^{\prime }_{p}}^{\mathrm {T}} C^{\prime }(t_{0}) {\boldsymbol v}^{\prime }_{q} = \delta _{p,q} \). The unit vectors \({\boldsymbol d}^{a}_{\alpha }\) are eigenvectors of this generalized eigenvalue problem with eigenvalues \(\exp (-\lambda _{\alpha }^{a}\tau )\). We denote \({\boldsymbol f}^{\prime }_{p}\) as the eigenvectors other than \({\boldsymbol d}_{\alpha }^{a}\). Because \( {{\boldsymbol d}_{\alpha }^{a}}^{\mathrm {T}}C^{\prime }(t){\boldsymbol f}^{\prime }_{p} = \exp (-\lambda ^{a}_{\alpha }(t-t_{0})) {{\boldsymbol d}_{\alpha }^{a}}^{\mathrm {T}}{\boldsymbol f}^{\prime }_{p} = 0 \) , \( C^{\prime }(t){\boldsymbol f}^{\prime }_{p} = C(t) {\boldsymbol f}^{\prime }_{p} \) holds. Therefore, \({\boldsymbol f}^{\prime }_{p}\) are identical with the eigenvectors

**f**_{ p }= (

*f*

_{p,1},

*f*

_{p,2},…,

*f*

_{p,3N})

^{T}of the generalized-eigenvalue problem for

*C*(

*t*

_{0}+

*τ*) and

*C*(

*t*

_{0}) with the same eigenvalues exp(−

*λ*

_{ p }

*τ*). Thus,

**f**_{ p }and exp(−

*λ*

_{ p }

*τ*) can be obtained by solving the generalized eigenvalue problem for

*C*

^{′}(

*t*

_{0}+

*τ*) and

*C*

^{′}(

*t*

_{0}), which are real symmetric positive definite matrices.

After obtaining relaxation modes and rates, we confirm whether or not the slow relaxation modes and rates obtained using *τ* and *t*_{0} are appropriate. For this purpose, the convergences of slow relaxation times as a function of *τ* are examined. The autocorrelation functions *C*_{i, i}(*t*) are reconstructed from the estimated eigenvalues and eigenvectors and are compared with those directly calculated via simulation (especially the slow relaxation behavior). After examining the validity, we use the obtained relaxation modes and rates for analysis.

## Improvement of RMA

### Selection of *τ* and *t*_{0} and relevant quantities for the trial function

*λ*

_{ p }} and the {

*X*

_{ p }} obtained via RMA depend on the manner in which

*t*

_{0}and

*τ*are selected in practice. For simplification, we here consider the case of one physical quantity,

*R*. From the variational problem of Eqs. 24 and 25, the relaxation time 1/

*λ*is obtained from the gradient of the straight line connecting two points at

*t*=

*t*

_{0}and

*t*=

*t*

_{0}+

*τ*in the semi-log plot of the correlation function

*C*(

*t*) = 〈

*R*(

*t*)

*R*(0)〉−〈

*R*〉

^{2}versus

*t*, as shown in Fig. 2a. If the time correlation function of the physical quantity contains several {1/

*λ*

_{ p }}, and if we choose

*t*

_{0}= 0 (tICA case) or a small

*t*

_{0}and small

*τ*, as shown in Fig. 2a (green line), the obtained 1/

*λ*does not correspond to the slow relaxation behavior of log

*C*(

*t*) at long times. To investigate the slow relaxation, we wish to choose values of

*t*

_{0}and

*τ*that are as large as possible, as shown in Fig. 2a (blue line). However, the choice of a longer

*t*

_{0}and

*τ*is also limited, because of the decreasing accuracy of the time correlation function over long time periods. Therefore, we must choose the appropriate

*t*

_{0}and

*τ*.

We can improve the RMA explained above by using two different approaches: introduction of multiple evolution times and using the different relevant physical quantities obtained from coordinates (and velocities) for the trial function. For the first improvement, we describe two types of methods with multiple evolution times, as shown in Fig. 2b, c. (The detailed descriptions are given by Nagai et al. (2013), Natori and Takano (2017), and Karasawa et al. (2017).) For the second improvement, we describe the PCRMA (Nagai et al. 2013), in which the relevant physical quantities for the trial function are given by the PC modes with large structural fluctuations and the two-step RMA (Natori and Takano 2017; Karasawa et al. 2017), which are in turn given by the slowest relaxation modes roughly obtained by RMA. Moreover, the MSRMA (Mitsutake and Takano 2015) is also proposed. We will describe these two improved RMAs in detail below.

### RMA with multiple evolution times

#### RMA with multiple evolution times *t*_{1} and *t*_{2}(1)

*t*

_{1}/2 and

*t*

_{2}/2, are used instead of a single evolution time,

*t*

_{0}/2. Because the contributions of faster modes in

*time-evolved for*

**R***t*

_{1}/2 and those for

*t*

_{2}/2 are different, the approximate relaxation modes can extract the faster modes, which cannot be extracted by the approximate relaxation modes using a single evolution time (see Fig. 2b). Using Eq. 45 as a trial function for the variational problem, the following generalized eigenvalue problem is obtained:

*C*(

*t*) is a 6

*N*× 6

*N*matrix defined by

*N*× 3

*N*matrix defined by

*μ*

_{1},

*μ*

_{2}= 1 or 2. The orthonormal condition is written as

*R*

_{ i }are reproduced by

#### RMA with multiple evolution times *t*_{i} (2)

*in the trial function exhibit different relaxations, it is preferable to use different evolution times for the different physical quantities, as shown in Fig. 1c. That is, if we know the characteristic time scales of the relevant physical quantities, we can choose a specific evolution time*

**R***t*

_{ i }for each relevant physical quantity

*R*

_{ i }based on its characteristic time scale. This RMA method is referred to as “RMA with multiple evolution times {

*t*

_{ i }/2}.” In this method, we use the following trial function:

*t*

_{ i }is introduced in order to reduce the relative weight of the faster modes contained in

*R*

_{ i }. Further, it is expected that Eq. 54 would yield a superior approximation for larger

*t*

_{ i }values.

*C*

_{i, j}(

*t*) = 〈

*R*

_{ i }(

*t*)

*R*

_{ j }(0)〉 and the orthonormal condition for

*X*

_{ p }is expressed as

*λ*

_{ p }and the corresponding relaxation modes. We chose the indices of

*λ*

_{ p }such that 0 <

*λ*

_{1}≤

*λ*

_{2}≤⋯ holds. The inverse transformation of Eq. 54 is given by

*R*

_{ i }are given by

*t*≥ (

*t*

_{ i }+

*t*

_{ j })/2. Here,

### RMAs to automatically reduce the degrees of freedom of relevant quantities for the trial function

RMA requires relatively high statistical precision of the time correlation matrices because of treatment for the generalized eigenvalue problem; thus, it is difficult for RMA to handle a large number of degrees of freedom directly. We must therefore reduce the number of degrees of freedom automatically.

*t*

_{0}and

*τ*. (For the Markov state model, the dependence of relaxation times on the selection of states is discussed in Swope et al. (2004) and Pérez-Hernández et al. (2013).) It is better to use the relevant quantities that include the slow behavior. For the second improvement, we describe the PCRMA in which the relevant quantities are given by the PC modes with large structural fluctuations, and the two-step RMA in which the quantities are given by the slowest relaxation modes roughly obtained by the first RMA. A schematic illustration of PCRMA and two-step RMA is given in Fig. 3.

#### PCRMA

**Φ**=(\({\Phi }_{1},{\Phi }_{2},\cdots ,{\Phi }_{N_{c}})^{T}\)). We use the following function as an approximate relaxation mode:

*N*

_{ c }and the relevant quantities with large variance tend to have slow relaxations, the slow relaxation times can be estimated by setting

*t*

_{0}and

*τ*as large values. Note that because the selected principal components also contain faster relaxation modes, as shown in Fig. 4, Nagai et al. (2013) also combined PCRMA with the RMA using multiple evolution times (1) explained above. Note that in PCRMA, if the

*N*

_{ c }th or more PC modes (with relatively small fluctuations) have slow relaxation, the slow behaviors may not be extracted; thus, there is a possibility that the slow relaxations would not be estimated with small structural fluctuations.

#### Two-step RMA

*X*

_{ p }} obtained from the conventional RMA with small

*t*

_{0}and

*τ*contains the true slow {

*X*

_{ p }} (Mitsutake et al. 2011), although the {1/

*λ*

_{ p }} values are underestimated. The slow relaxation modes obtained by the first RMA may contain the true slow relaxation modes. Thus, we use the slow relaxation modes roughly obtained from the first RMA as the relevant quantities for the trial function. In this technique, RMA with a single evolution time using small

*t*

_{0}and

*τ*is implemented first, and {

*X*

_{ p }} and {

*λ*

_{ p }} are roughly estimated. We then apply the second RMA to a small number of the obtained slowest {

*X*

_{ p }}. We denote the number of {

*X*

_{ p }} used in the second RMA as

*N*

_{m}. In the second RMA, we also use the previously presented technique of RMA with multiple evolution times (2), because the characteristic time scales of the {

*X*

_{ p }} obtained from the first RMA are roughly given by the relaxation times {1/

*λ*

_{ p }}. In the second RMA, we use the following trial function:

*X*

_{ p }(

*Q*) is the relaxation mode obtained from the first RMA and \(t_{p}^{\prime }\) is determined from 1/

*λ*

_{ p }. A detailed explanation is given by Natori and Takano (2017) and Karasawa et al. (2017).

In the second RMA, the time interval *τ* can be chosen to be large, because the number of degrees of freedom is reduced and the physical quantities {*X*_{ p }} exhibit slow relaxations. Using the second RMA, the estimation accuracy of the relaxation modes and times can be improved.

### Markov state RMA

As mentioned above, in RMA, the relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively. From this point of view, RMA is related to Markov state models. Herein, we consider the relation between RMA and Markov state models and propose the new method of MSRMA.

*S*

_{ i },

*i*= 1,…,

*n*. First, the joint probability \(\bar {P}_{i,j}(\tau )=P(Q \in S_{i}, \tau ; Q \in S_{j} , 0)\) that the state of the system

*Q*is in the

*j*th cluster at time 0 and is in the

*i*th cluster at time

*τ*> 0 is calculated in a simulation. Second, the transition probability \(\bar {T}_{i,j}(\tau )\) that the state of the system is found in the

*i*th cluster after time

*τ*starting from a state in the

*j*th cluster is calculated by

*j*th cluster, which is estimated in the simulation. Then, by solving the eigenvalue problem

*p*th eigenvector \(\bar {\boldsymbol f}_{p}\) and its eigenvalue \(\bar {{\Lambda }}_{p}\) are obtained. The eigenvector \(\bar {\boldsymbol f}_{1} \propto (1,1,\ldots ,1)^{\mathrm {T}}\) corresponds to the equilibrium state and its eigenvalue \(\bar {{\Lambda }}_{1} = 1\). Other eigenvectors \(\bar {\boldsymbol f}_{p}\) represent structural transitions and the corresponding eigenvalues \(\bar {{\Lambda }}_{p}\) give their relaxation time scales \(\bar {\tau }_{p}\) as

*τ*should also be chosen carefully. In other words, for the Markov description to work, the time interval of the transition matrix

*τ*must be chosen appropriately so that it is as large as the slowest relaxation time of the states. When plotting \({\bar \tau }_{p}\) as a function of

*τ*, \({\bar \tau }_{p}\) slowly converges to the appropriate time scale when

*τ*is increased. In addition, when a much longer

*τ*than the slowest relaxation time of the states is used, the Markov state model is not expected to be accurate. Thus, we usually set the time interval

*τ*to the value when the variation of

*τ*

_{ p }is sufficiently flat (Swope et al. 2004; Pérez-Hernández et al. 2013).

*δ*

_{ i }(

*t*;

*Q*) is defined in the same way as

*R*

_{ i }(

*t*;

*Q*) in Eq. 29 from

*δ*

_{ i }(

*Q*) given as a function of the state

*Q*of the system by

*δ*

_{ i }(

*Q*), it follows that \(\bar {C}_{i,j}(t)\) is the joint probability \(\bar {P}_{i,j}(t)\).

If we set *t*_{0} = 0, the generalized eigenvalue problem (68) becomes the eigenvalue problem (64) with \(\bar {{\Lambda }}_{p} = \mathrm {e}^{-\bar {\lambda }_{p} \tau }\) or \(\bar {\tau }_{p} = 1/\bar {\lambda }_{p}\), because \(\bar {C}(0)=\text {diag}(\bar {p}_{1},\ldots ,\bar {p}_{n})\) and \(\bar {C}(\tau )\bar {C}(0)^{-1} = \bar {T}(\tau )\). Thus, the Markov state model is a special case of MSRMA with *t*_{0} = 0.

Because *δ*_{ i }(*t*_{0}/2;*Q*) in Eq. 66 reduces the contributions of faster modes in *δ*_{ i }(*Q*), the solutions of the generalized eigenvalue problem (68) provides better approximations to the slow relaxation modes and rates as *t*_{0} becomes larger. Therefore, the relaxation times \(\bar {\tau }_{p}\) obtained by the Markov state model are expected to be improved by solving Eq. 68 with *t*_{0} > 0 rather than Eq. 64.

## Application of RMA to a system with large conformational changes

In this section, we apply RMA to a protein system simulation to show the effectiveness of RMA. The selection of order parameters in simulations is important to analyze the trajectory. PCA, which is a static analysis method, extracts large structural fluctuations from simulations, and the obtained PC mode is used to obtain the order parameters. Moreover, it has now become possible to perform long simulations such as those of unfolded and folded protein structures, and when the simulation involves large structural changes, the difference between local minimum-energy states is relatively small compared with that between the folded and unfolded states. In this case, it is difficult for PCA to extract the effective modes or order parameters to accurately identify the local minimum-energy states. By contrast, RMA extracts slow relaxation modes. It is thought that the local minimum-energy states are usually stable so that the system remains in this state for a long time during a simulation. The order parameters with slow relaxation may correspond to the directions between local minimum-energy states. Thus, slow relaxation modes may be suitable order parameters to identify local minimum-energy states and the transitions between them. To validate this concept, we applied RMA to the 10-residue peptide, chignolin in water near its folding transition temperature.

*β*-hairpin turn structure (Honda et al. 2004). Several simulations of chignolin have been reported to date (Satoh et al. 2006; Suenaga et al. 2007; Harada and Kitao 2011; Kührova et al. 2012; Okumura 2012). Previous research has shown that chignolin has a stable (native) and a misfolded state, which are both found as hairpin-like structures (see Fig. 5c). These two states have a common turn structure from Asp3 to Glu5 but slightly different hydrogen bond patterns. RMA requires a relatively high level of statistical precision for the time correlation matrices and therefore requires a long simulation where many transitions between local minimum-energy states occur. In addition, we sought to analyze the system with large conformational changes. Thus, we performed a 750-ns molecular dynamics simulation of chignolin in aqueous solution near the transition temperature from an extended structure (Case et al. 2014). We observed many transitions among structures, including the native, misfolded, and unfolded states, by performing the simulation at 450 K. We used the coordinates of C

_{ α }atoms on the backbone as coordinates so that the degrees of freedom were 30. After removing the translational and rotational motions from the coordinates of C

_{ α }atoms, PCA and RMA were carried out on the coordinates of C

_{ α }atoms (see Fig. 1). For RMA, we set

*t*

_{0}and

*τ*to 10.0 and 20.0 ps, respectively.

Figure 5 shows the free-energy surfaces obtained from PCA (a) and RMA (b). From the free-energy surface of PCA, the native and misfolded states were not distinguished because the conformational difference between them is much smaller than the conformational fluctuations of the system (the third PC mode distinguished the native and misfolded states). By contrast, in RMA, the transition between the native and misfolded structures is slow, and the slowest relaxation mode was found to be the axis distinguishing them. This analysis showed that the slow relaxation mode is a good order parameter to distinguish the native and misfolded structure. Interestingly, we could also identify the intermediate structure. By extracting the structures in the center part of the free-energy surface shown in Fig. 5b, the cluster was formed with a turn structure common to the native and misfolded structures. Because the structures at both terminals fluctuate, a cluster of intermediate structures forming a turn is also obtained, while ignoring the fast relaxing movement of both terminals. The upper part of the free-energy surface shown in Fig. 5b corresponded to the extended structure. Figure 5c shows the characteristic structures for the four states. When plotting the points for the obtained intermediate structure on the free-energy surface of PCA in Fig 5d, the points were distributed widely because both terminals fluctuate. Thus, RMA can identify the characteristic structure, even when it is only partially formed. From the free-energy surface obtained by RMA, it is clarified that chignolin folds to the native or misfolded structures through the intermediate (turn) structure from the extended structures.

Because the structures were classified into a smaller number of states using the free-energy surface obtained by RMA, we then applied the Markov state model and MSRMA to analyze these four states: native, misfolded, intermediate, and unfolded states. Figure 5e shows the relaxation time *τ*_{ p } = 1/*λ*_{ p } obtained by MSRMA as a function of *τ* when *t*_{0} = 0,10,50,100,200, and 500 ps. Because the first eigenvector corresponds to the steady state with infinite relaxation time *τ*_{1} = *∞*, we show the second slowest relaxation times. The line of *t*_{0} = 0 corresponds to the results of a simple Markov state model. In the case of *t*_{0} = 0, the *τ*_{ p } values slowly approach the appropriate time scale, i.e., the values for plateau regions or peak values of the solid lines, when *τ* is increased. For the lines of *t*_{0} > 0, the values of *τ*_{ p } quickly approach the appropriate time scale, i.e., those corresponding to the values for plateau regions or peak values. Thus, the slow relaxation times can be improved when applying MSRMA with *t*_{0} > 0, which is introduced to reduce the relative weight of the faster modes.

Overall, RMA can be used to effectively analyze long simulations at room temperature and is also useful for investigating systems with large conformational changes, such as intrinsically disordered proteins and protein folding.

## Conclusions

In this paper, we have reviewed the method and application of RMA, a dynamic analysis method for protein simulations. We described the definition of relaxation modes and rates, which correspond to the left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively. After providing the definition, we explained how to estimate the slow relaxation modes and rates from simulation data. We also summarized several new RMAs proposed, including RMA with multiple evolution times, PCRMA, two-step RMA, and MSRMA. Finally, to demonstrate the effectiveness of RMA, we briefly presented the analysis results of the unfolding/folding simulation of the 10-residue peptide chignolin detected near the transition temperature. The simulation results showed that the relaxation mode is a good order parameter for not only extracting the transition between the native state and misfolded state but also for identifying the intermediate state, which is partially folded. This suggests that RMA is suitable to investigate a system with large structural changes and naturally denatured protein systems. Although RMA is efficient for a longer simulation than the longest relaxation time of the system, it can also extract rare events in a finite-time simulation such as that conducted at the microsecond scale. By examining the extent to which the correlation function can be reconstructed, we can clarify the information that can be obtained on dynamics using the obtained relaxation modes and rates. Theoretical studies to compare data of the Markov state model with experimental data from nuclear magnetic resonance and neutron scattering analyses have emerged recently (Xia et al. 2013; Lindner et al. 2013; Zheng et al. 2013; Bowman et al. 2014). In the future, it will also be important to interpret the theoretical relationships in light of experimental data.

## Notes

### Acknowledgements

The authors would like to thank Mr. Toshiki Nagai, Mr. Taku Yamamoto, Mr. Yuta Koizumi, Mr. Satoshi Natori, and Mr. Naoyuki Karasawa at Keio University for fruitful discussions.

## Compliance with ethical standards

## Conflict of Interests

Ayori Mitsutake declares that he has no conflicts of interest. Hiroshi Takano declares that he has no conflicts of interest.

## Ethical approval

This article does not contain any studies with human participants or animals performed by any og the authors.

## References

- Abagyan R, Argos P (1992) Optimal protocol and trajectory visualization for conformational searches of peptides and proteins. J Mol Biol 225:519–532CrossRefPubMedGoogle Scholar
- Amadei A, Linssen ABM, Berendsen HJC (1993) Essential dynamics of proteins. Proteins Struct Funct Genet 17:412–425CrossRefPubMedGoogle Scholar
- Baher I, Atilgan AR, Erman B (1997) Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. Fold Des 2:173CrossRefGoogle Scholar
- Bowman G R, Pande V S, Noé F (eds) (2014) An introduction to Markov state models and their application to long timescale molecular simulation. Springer, DordrechtGoogle Scholar
- Brooks B, Karplus M (1983) Harmonic dynamics of proteins: normal modes and fluctuations in bovine pancreatic trypsin inhibitor. Proc Natl Acad Sci USA 80:6571CrossRefPubMedPubMedCentralGoogle Scholar
- Buchete N, Hummer G (2008) Coarse master equations for peptide folding dynamics. J Phys Chem B 112:6057CrossRefPubMedGoogle Scholar
- Case DA, Babin V, Betz RM, Cai Q, Cerutti DS, Cheatham IIITE, Darden TA, Duke RE, Gohlke H, Götz A W, Gusarov S, Homeyer N, Janowski P, Kaus J, Kolossváry I, Kovalenko A, Lee TS, Le Grand S, Luchko T, Luo R, Madej B, Merz KM, Paesani F, Roe DR, Roitberg A, Sagui C, Salomon-Ferrer R, Seabra G, Simmerling CL, Smith W, Swails J, Walker RC, Wang J, Wolf RM, Wu X, Kollman PA (2014) AMBER 14. University of California, San FranciscoGoogle Scholar
- Chodera JD, Swope WC, Pitera JW, Dill KA (2006) Long-time protein folding dynamics from short-time molecular dynamics simulations. Multiscale Model Simul 5:1214CrossRefGoogle Scholar
- Chodera JD, Singhal N, Vande VS, Dill KA, Swope WC (2007) Automatic discovery of metastable states for the construction of Markov models of macromolecular conformational dynamics. J Chem Phys 126:155101CrossRefPubMedGoogle Scholar
- Chodera JD, Noé F (2014) Markov state models of biomolecular conformational dynamics. Curr Opin Struct Biol 25:135CrossRefPubMedPubMedCentralGoogle Scholar
- Cui Q, Bahar I (eds) (2005) Normal mode analysis: theory and applications to biological and chemical systems. Chapman & Hall/CRC, LondonGoogle Scholar
- Dror RO, Dirks RM, Grossman JP, Xu H, Shaw DE (2012) Biomolecular simulation: a computational microscope for molecular biology. Annu Rev Biophys 41:429CrossRefPubMedGoogle Scholar
- de Gennes PG (1984) Scaling concepts in polymer physics. Cornell University Press, IthacaGoogle Scholar
- Doi M, Edwards SF (1986) The theory of polymer dynamics. Oxford University Press, OxfordGoogle Scholar
- Eckart C (1935) Some studies concerning rotating axes and polyatomic molecules. Phys Rev 47:552–558CrossRefGoogle Scholar
- Freddolino PL, Harrison CB, Liu Y, Schulten K (2010) Challenges in protein folding simulations: timescale, representation, and analysis. Nat Phys 6:751CrossRefPubMedPubMedCentralGoogle Scholar
- Garcia AE (1992) Large-amplitude nonlinear motions in proteins. Phys Rev Lett 68:2696–2699CrossRefPubMedGoogle Scholar
- Go N, Noguti T, Nishikawa T (1983) Dynamics of a small globular protein in terms of low-frequency vibrational modes. Proc Natl Acad Sci USA 80:3696CrossRefPubMedPubMedCentralGoogle Scholar
- Hagita K, Takano H (2002) Relaxation mode analysis of a single polymer chain in a melt. J Phys Soc Jpn 71:673–676CrossRefGoogle Scholar
- Harada R, Kitao A (2011) Exploring the folding free energy landscape of a
*β*-hairpin miniprotein, chignolin, using multiscale free energy landscape calculation method. J Phys Chem B 115:8806CrossRefPubMedGoogle Scholar - Hayward S, Kitao A, Hirata F, Go N (1993) Effect of solvent on collective motions in globular protein. J Mol Biol 234:1207– 1217CrossRefPubMedGoogle Scholar
- Hirao H, Koseki S, Takano H (1997) Molecular dynamics study of relaxation modes of a single polymer chain. J Phys Soc Jpn 66:3399–3405CrossRefGoogle Scholar
- Honda S, Yamasaki K, Sawada Y, Morii H (2004) 10 residue folded peptide designed by segment statistics. Structure 12:1507CrossRefPubMedGoogle Scholar
- Ichiye T, Karplus M (1991) Collective motions in proteins: a covariance analysis of atomic fluctuations in molecular dynamics and normal mode simulations. Protein 11:205–217CrossRefGoogle Scholar
- Iwaoka N, Hagita K, Takano H (2015) Estimation of relaxation modulus of polymer melts by molecular dynamics simulations: application of relaxation mode analysis. J Phys Soc Jpn 84:044801. and references thereinCrossRefGoogle Scholar
- Kamada M, Toda M, Sekijima M, Takata M, Joe J (2011) Analysis of motion features for molecular dynamics simulation of proteins. Chem Phys Lett 502:241CrossRefGoogle Scholar
- Karasawa N, Mitsutake A, Takano H (2017) Two-step relaxation mode analysis with multiple evolution times applied to all-atom molecular dynamics protein simulation. Phys Rev E 96:062408CrossRefPubMedGoogle Scholar
- Kitao A, Hirata F, Go N (1991) The effects of solvent on the conformation and the collective motions of protein: normal mode analysis and molecular dynamics simulations of melittin in water and in vacuum. Chem Phys 158:447CrossRefGoogle Scholar
- Kitao A, Go N (1999) Investigating protein dynamics in collective coordinate space. Curr Opin Struct Biol 9:164CrossRefPubMedGoogle Scholar
- Komatsuzaki T, Berry RS, Leitner DM (2011) Advancing theory for kinetics and dynamics of complex, many-dimensional systems. Wiley, CanadaCrossRefGoogle Scholar
- Kottalam J, Case DA (1990) Langevin modes of macromolecules: applications to crambin and DNA hexamers. Biopolymers 29:1409CrossRefPubMedGoogle Scholar
- Koseki S, Hirao H, Takano H (1997) Monte Carlo study of relaxation modes of a single polymer chain. J Phys Soc Jpn 66:1631–1637CrossRefGoogle Scholar
- Kührova P, Simone AD, Otyepka M, Best RB (2012) Force-field dependence of chignolin folding and misfolding: comparison with experiment and redesign. Biophys J 102:1897CrossRefPubMedPubMedCentralGoogle Scholar
- Lamm G, Szabo A (1986) Langevin modes of macromolecules. J Chem Phys 85:7334CrossRefGoogle Scholar
- Lane TJ, Shukla D, Beauchamp KA, Pande VS (2013) To milliseconds and beyond: challenges in the simulation of protein folding. Curr Opin 23:58Google Scholar
- Lange OF, Grubmüller H (2007) Full correlation analysis of conformational protein dynamics. Proteins 70:1294CrossRefGoogle Scholar
- Levy RM, Srinivasan AR, Olson WK, McCammon JA (1984) Quasi-harmonic method for studying very low frequency modes in proteins. Biopolymers 23:1099–1112CrossRefPubMedGoogle Scholar
- Levitt M, Sander C, Stern PS (1985) Protein normal-mode dynamics: trypsin inhibitor, crambin, ribonuclease and lysozyme. J Mol Biol 181:423CrossRefPubMedGoogle Scholar
- Lindorff-Larsen K, Piana S, Dror RO, Shaw DE (2011) How fast-folding proteins fold. Science 334:517CrossRefPubMedGoogle Scholar
- Lindorff-Larsen K, Margakis P, Piana S, Eastwood MP, Dror RO, Shaw DE (2012) Systematic validation of protein force fields against experimental data. PLos ONE 7:e32131CrossRefPubMedPubMedCentralGoogle Scholar
- Lindner B, Yi Z, Prinz JH, Smith J, Noé F (2013) Dynamic neutron scattering from conformational dynamics I: theory and Markov models. J Chem Phys 139:175101CrossRefPubMedGoogle Scholar
- Matsunaga Y, Kidera A, Sugita Y (2015) Sequential data assimilation for single-molecule FRET photon-counting data. J Chem Phys 142:214115CrossRefPubMedGoogle Scholar
- McLachlan AD (1979) Gene duplications in the structural evolution of chymotrypsin. J Mol Biol 128:49–79CrossRefPubMedGoogle Scholar
- Mitsutake A, Iijima H, Takano H (2005) Principal component analysis and relaxation mode analysis of a peptide. Biophysics. 45: Supplement S214. Abstracts for the 43th Annual Meeting, The Biophysical Society of Japan. (in Japanese)Google Scholar
- Mitsutake A, Iijima H, Takano H (2011) Relaxation mode analysis of a peptide system: comparison with principal component analysis. J Chem Phys 135:164102CrossRefPubMedGoogle Scholar
- Mitsutake A, Takano H (2015) Relaxation mode analysis and Markov state relaxation mode analysis for chignolin in aqueous solution near a transition temperature. J Chem Phys 143:124111CrossRefPubMedGoogle Scholar
- Miyashita O, Tama F (2008) Coarse-graining of condensed phase and biomolecular systems. CRC Press, Boca Raton, p 267Google Scholar
- Moritsugu K, Koike R, Yamada K, Kato H, Kidera A (2015) Motion tree delineates hierarchical structure of protein dynamics observed in molecular dynamics simulation. PLoS ONE 10:e0131583CrossRefPubMedPubMedCentralGoogle Scholar
- Mori T, Saito S (2015) Dynamic heterogeneity in the folding/unfolding transitions of FiP35. J Chem Phys 142:135101CrossRefPubMedGoogle Scholar
- Mori T, Saito S (2016) Molecular mechanism behind the fast folding/unfolding transitions of villin headpiece subdomain: hierarchy and heterogeneity. J Phys Chem B 120:11683CrossRefPubMedGoogle Scholar
- Nagai T, Mitsutake A, Takano H (2013) Principal component relaxation mode analysis of an all-atom molecular dynamics simulation of human lysozyme. J Phys Soc Jpn 82:023803CrossRefGoogle Scholar
- Nagai T, Mitsutake A, Takano H (2009) Relaxation mode analysis of a biopolymer system by molecular dynamics. Biophysics. 49 Supplement S75. (Abstracts for the 47th Annual Meeting, The Biophysical Society of JapanGoogle Scholar
- Naritomi Y, Fuchigami S (2011) Slow dynamics in protein fluctuations revealed by time-structure based independent component analysis: the case of domain motions. J Chem Phys 134:065101CrossRefPubMedGoogle Scholar
- Naritomi Y, Fuchigami S (2013) Slow dynamics of a protein backbone in molecular dynamics simulation revealed by time-structure based independent component analysis. J Chem Phys 139:215102CrossRefPubMedGoogle Scholar
- Natori S, Takano H (2017) Two-step relaxation mode analysis with multiple evolution times: application to a single [n]polycatenane. J Phys Soc Jpn 86:43003CrossRefGoogle Scholar
- Noé F, Horenko I, Schütte C, Smith JC (2007) Hierarchical analysis of conformational dynamics in biomolecules: transition networks of metastable states. J Chem Phys 126:155102CrossRefPubMedGoogle Scholar
- Noé F, Fischer S (2008) Transition networks for modeling the kinetics of conformational change in macromolecules. Curr Opin Struct Biol 18:154CrossRefPubMedGoogle Scholar
- Noé F, Clementi C (2017) Collective variables for the study of long-time kinetics from molecular trajectories: theory and methods. Curr Opin Struct Biol 43:141CrossRefPubMedGoogle Scholar
- Okumura H (2012) Temperature and pressure denaturation of chignolin: folding and unfolding simulation by multibaric-multithermal molecular dynamics method. Proteins 80:2397CrossRefPubMedGoogle Scholar
- Pérez-Hernández G, Paul F, Giorgino TG, Fabritiis D, Noé F (2013) Identification of slow molecular order parameters for Markov model construction. J Chem Phys 139:015102CrossRefPubMedGoogle Scholar
- Prinz J, Wu H, Sarich M, Keller B, Senne M, Held M, Chodera JD, Schütte C, Noé F (2011) Markov models of molecular kinetics: generation and validation. J Chem Phys 134:174105CrossRefPubMedGoogle Scholar
- Risken H (1989) The Fokker-Planck equation: methods of solution and applications 2nd Ed. Springer-Verlag. Berlin, HeidelbergCrossRefGoogle Scholar
- Zwanzig R (2001) Nonequilibrium statistical mechanics. Oxford university press, New YorkGoogle Scholar
- Saka S, Takano H (2008) Relaxation of a single knotted ring polymer. J Phys Soc Jpn 77:034001CrossRefGoogle Scholar
- Sakuraba S, Joti Y, Kitao A (2010) Detecting coupled collective motions in protein by independent subspace analysis. J Chem Phys 133:185102CrossRefPubMedGoogle Scholar
- Satoh D, Shimizu K, Nakamura S, Terada T (2006) Folding free-energy landscape of a 10-residue mini-protein, chignolin. FEBS Letters 580:3422CrossRefPubMedGoogle Scholar
- Schütte C, Fischer A, Huisinga W, Deuflhard P (1999) A direct approach to conformational dynamics based on hybrid Monte Carlo. J Comput Phys 151:146CrossRefGoogle Scholar
- Schwantes CR, Pande VS (2013) Improvements in Markov state model construction reveal many non-native interactions in the folding of NTL9. J Chem Theor Comput 9:2000CrossRefGoogle Scholar
- Schwantes CR, McGibbon RT, Pande VS (2014) Perspective: Markov models for long-timescale biomolecular dynamics. J Chem Phys 141:090901CrossRefPubMedPubMedCentralGoogle Scholar
- Singhal N, Snow CD, Pande VS (2004) Using path sampling to build better Markovian state models: predicting the folding rate and mechanism of a tryptophan zipper beta hairpin. J Chem Phys 121:415CrossRefPubMedGoogle Scholar
- Suenaga A, Narumi T, Futatsugi N, Yanai R, Ohno Y, Okimoto N, Taiji M (2007) Folding dynamics of 10-residue beta-hairpin peptide chignolin. Chem Asian J 2:591CrossRefPubMedGoogle Scholar
- Swope WC, Pitera JW, Suits F (2004) Describing protein folding kinetics by molecular dynamics simulations. 1. Theory J Phys Chem B 108:6571CrossRefGoogle Scholar
- Takano H, Miyashita S (1995) Relaxation modes in random spin systems. J Phys Soc Jpn 64:3688–3698CrossRefGoogle Scholar
- Tama F, Sanejouand YH (2001) Conformational change of proteins arising from normal mode calculations. Protein Engin 14:1CrossRefGoogle Scholar
- Tama F, Brooks III CL (2002) The mechanism and pathway of pH induced swelling in Cowpea chlorotic mottle virus. J Mol Biol 318:733CrossRefPubMedGoogle Scholar
- Tirion MM (1996) Large amplitude elastic motions in proteins from a single-parameter, atomic analysis. Phys Lev Lett 77:1905CrossRefGoogle Scholar
- Wu H, Nüske F, Paul F, Klus S, Koltai P, Noé F (2017) Variational Koopman models: slow collective variables and molecular kinetics from short off-equilibrium simulations. J Chem Phys 146: 154104CrossRefPubMedGoogle Scholar
- Xia J, Deng JN, Levy RM (2013) NMR relaxation in proteins with fast internal motions and slow conformational exchange: model-free framework and Markov state simulations. J Phys Chem B 117:6625CrossRefPubMedPubMedCentralGoogle Scholar
- Zheng Y, Lindner B, Prinz JH, Noé F, Smith J (2013) Dynamic neutron scattering from conformational dynamics II: application using molecular dynamics simulation and Markov modeling. J Chem Phys 139:175102CrossRefGoogle Scholar
- Zuckerman DM (2010) Statistical physics of biomolecules: an introduction. CRC Press, New YorkGoogle Scholar

## Copyright information

**Open Access**This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.