An Online Approach for Mining Collective Behaviors from Molecular Dynamics Simulations
Collective behavior involving distally separate regions in a protein is known to widely affect its function. In this paper, we present an online approach to study and characterize collective behavior in proteins as molecular dynamics simulations progress. Our representation of MD simulations as a stream of continuously evolving data allows us to succinctly capture spatial and temporal dependencies that may exist and analyze them efficiently using data mining techniques. By using multi-way analysis we identify (a) parts of the protein that are dynamically coupled, (b) constrained residues/ hinge sites that may potentially affect protein function and (c) time-points during the simulation where significant deviation in collective behavior occurred. We demonstrate the applicability of this method on two different protein simulations for barnase and cyclophilin A. For both these proteins we were able to identify constrained/ flexible regions, showing good agreement with experimental results and prior computational work. Similarly, for the two simulations, we were able to identify time windows where there were significant structural deviations. Of these time-windows, for both proteins, over 70% show collective displacements in two or more functionally relevant regions. Taken together, our results indicate that multi-way analysis techniques can be used to analyze protein dynamics and may be an attractive means to automatically track and monitor molecular dynamics simulations.
KeywordsMolecular Dynamic Simulation Reconstruction Error Collective Behavior Protein Dynamic Molecular Dynamic Trajectory
Unable to display preview. Download preview PDF.
- 3.Agarwal, P.K.: Enzymes: An integrated view of structure, dynamics and function. Microbial Cell Factories 5 (2006)Google Scholar
- 10.Bahar, I., Cui, Q.: Normal Mode Analysis: Theory and Applications to Biological and Chemical Systems. Mathematical and Computational Biology Series. Chapman and Hall/ CRC, New York (2003)Google Scholar
- 12.Beazley, D.M., Lomdahl, P.S.: Lightweight computational steering of very large scale molecular dynamics simulations. In: Supercomputing 1996 proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM), Washington, DC, USA, p. 50. IEEE Computer Society Press, Los Alamitos (1996)Google Scholar
- 15.Bowers, K.J., Chow, E., Xu, H., Dror, R.O., Eastwood, M.P., Gregersen, B.A., Klepeis, J.L., Kolossvary, I., Moraes, M.A., Sacerdoti, F.D., Salmon, J.K., Shan, Y., Shaw, D.E.: Scalable algorithms for molecular dynamics simulations on commodity clusters. In: SC Conference, p. 43 (2006)Google Scholar
- 17.DeLano, W.L.: The pymol molecular graphics system (2003)Google Scholar
- 23.Gu, W., Eisenhauer, G., Kraemer, E., Schwan, K., Stasko, J., Vetter, J., Mallavarupu, N.: Falcon: on-line monitoring and steering of large-scale parallel programs. In: Symposium on the Frontiers of Massively Parallel Processing, p. 422 (1995)Google Scholar
- 29.Jolliffe, I.T.: Principal Component Analysis. Springer, Heidelberg (2002)Google Scholar
- 32.Kolda, T.G., Bader, B.W.: Tensor decompositions and applications. Technical report, Sandia National Laboratories (2007)Google Scholar
- 40.Staykova, D., Fredriksson, J., Bermel, W., Billeter, M.A: ssignment of protein nmr spectra based on projections, multi-way decomposition and a fast correlation approach. Journal of Biomolecular NMR (2008)Google Scholar
- 42.Sun, J., Tao, D., Faloutsos, C.: Beyond streams and graphs: Dynamic tensor analysis (2006)Google Scholar
- 44.Whiteley, W.: Rigidity of Molecular structures: generic and geometric analysis. In: Rigidity Theory and Applications. Kluwer Academic/ Plenum, New York (1999)Google Scholar