Abstract
We report an automated procedure for high-throughput NMR resonance assignment for a protein of known structure, or of an homologous structure. Our algorithm performs Nuclear Vector Replacement (NVR) by Expectation/Maximization (EM) to compute assignments. NVR correlates experimentally-measured NH residual dipolar couplings (RDCs) and chemical shifts to a given a priori whole-protein 3D structural model. The algorithm requires only uniform 15N-labelling of the protein, and processes unassigned HN-15N HSQC spectra, HN-15N RDCs, and sparse HN-HN NOE's (d NNs). NVR runs in minutes and efficiently assigns the (HN,15N) backbone resonances as well as the sparse d NNs from the 3D 15N-NOESY spectrum, in O(n 3) time. The algorithm is demonstrated on NMR data from a 76-residue protein, human ubiquitin, matched to four structures, including one mutant (homolog), determined either by X-ray crystallography or by different NMR experiments (without RDCs). NVR achieves an average assignment accuracy of over 99%. We further demonstrate the feasibility of our algorithm for different and larger proteins, using different combinations of real and simulated NMR data for hen lysozyme (129 residues) and streptococcal protein G (56 residues), matched to a variety of 3D structural models. Abbreviations: NMR, nuclear magnetic resonance; NVR, nuclear vector replacement; RDC, residual dipolar coupling; 3D, three-dimensional; HSQC, heteronuclear single-quantum coherence; HN, amide proton; NOE, nuclear Overhauser effect; NOESY, nuclear Overhauser effect spectroscopy; d NN, nuclear Overhauser effect between two amide protons; MR, molecular replacement; SAR, structure activity relation; DOF, degrees of freedom; nt., nucleotides; SPG, Streptococcal protein G; SO(3), special orthogonal (rotation) group in 3D; EM, Expectation/Maximization; SVD, singular value decomposition.
Similar content being viewed by others
References
Al-Hashimi, H., Gorin, A., Majumdar, A., Gosser, Y. and Patel, D. (2002) J. Mol. Biol., 318, 637–649.
Al-Hashimi, H. and Patel, D. (2002) J. Biomol. NMR, 22, 1–8.
Altschul, S., Gish, W., Miller, W., Myers, E. and Lipman, D. (1990) J. Mol. Biol., 215, 403–410.
Andrec, M., Du, P. and Levy, R. (2001) J. Biomol. NMR, 21, 335–347.
Annila, A., Aitio, H., Thulin, E. and T., D. (1999) J. Biomol. NMR, 14, 223–230.
Artymiuk, P.J., Blake, C.C.F., Rice, D.W. and Wilson, K.S. (1982) Acta Crystallogr. B Biol. Crystallogr., 38, 778.
Babu, C. R., Flynn, P.F. and Wand, A.J. (2001) J. Am. Chem. Soc., 123, 2691.
Bailey-Kellogg, C., Widge, A., Kelley III, J.J., Berardi, M., Bushweller, J. and Donald, B. (2000) J. Comput. Biol., 7, 537–558.
Berman, H., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T., Weissig, H., Shindyalov, I. and Bourne, P. (2000) Nucl. Acids Res., 28, 235–242.
Blundell, T., Sibanda, B., Sternberg, M. and Thornton, J. (1987) Nature, 326, 347–352.
Chen, Y., Reizer, J., Saier Jr., M.H., Fairbrother, W.J. and Wright, P.E. (1993) Biochemistry, 32, 32–37.
Chou, J., Li, S. and Bax, A. (2000) J. Biomol. NMR, 18, 217–227.
Cormen, T.H., Leiserson, C.E., Rivest, R.L. and Stein, C. (2001) Introduction to Algorithms, 2nd edn., The MIT Press, Cambridge, MA, chapter 5, pp. 109–110.
Cornilescu, G., Marquardt, J.L., Ottiger, M. and Bax, A. (1998) J. Am. Chem. Soc., 120, 6836–6837.
Cross, A.D.J. and Hancock, E.R. (1998) IEEE Trans. Pattern Anal. Machine Intell., 20, 1236–1253.
Delaglio, F., Kontaxis, G. and Bax, A. (2000) J. Am. Chem. Soc., 122, 2142–2143.
Dempster, A., Laird, N. and Rubin, D. (1977) J. Roy. Stat. Soc., Ser. B, 39, 1–38.
Diamond, R. (1974) J Mol. Biol., 82, 371–391.
Fejzo, J., Lepre, C., Peng, J., Bemis, G., Ajay, Murcko, M. and Moore, J. (1999) Chem. Biol., 6, 755–769.
Fetrow, J. and Bryant, S. (1993) Bio/Technology, 11, 479–484.
Fiaux, J., Bertelsen, E.B., Horwich, A.L. and Wüthrich, K. (2002) Nature, 418, 207–211.
Fowler, C., Tian, F., Al-Hashimi, H.M. and Prestegard, J.H. (2000) J. Mol. Biol., 304, 447–460.
Gallagher, T., Alexander, P., Bryan, P. and Gilliland, G.L. (1994) Biochemistry, 33, 4721–4729.
Gemmecker, G., Jahnke, W. and Kessler, H. (1993) J. Am. Chem. Soc., 115, 11620–11621.
Girard, E., Chantalat, L., Vicat, J. and Kahn, R. (2001) Acta Crystallogr. D Biol. Crystallogr., 58, 1–9.
Gnu (2002) The GNU General Public License, http://www.gnu.org/licenses/licenses.html
Greer, J. (1991) Meth. Enzymol., 202, 239–252.
Grishaev, A. and Llinas, M. (2002) PNAS, 99, 6707–6712.
Gronenborn, A.M., Filpula, D.R., Essig, N.Z., Achari, A., Whitlow, M., Wingfield, P.T. and Clore, G.M. (1991) Science, 253, 657.
Grzesiek, S. and Bax, A. (1993) J. Biomol. NMR, 3, 627–638.
Harris, R. (2002) The Ubiquitin NMR Resource Page, BBSRC Bloomsbury Center for Structural Biology http://www.biochem.ucl.ac.uk/bsm/nmr/ubq/index.html
Hoch, J., Burns, M. M. and Redfield, C. (1990) In Frontiers of NMR in Molecular Biology, Alan R. Liss, Inc., NY, pp. 167–175.
Hus, J., Marion, D. and Blackledge, M. (2000) J. Mol. Biol., 298, 927–936.
Hus, J., Prompers, J. and Bruschweiler, R. (2002) J. Magn. Reson., 157, 119–125.
Johnson, E., Lazar, G.A., Desjarlais, J.R. and Handel, T.M. (1999) Struct. Fold Des., 7, 967–976.
Johnson, M., Srinivasan, N., Sowdhamini, R. and Blundell, T. (1994) Mol. Biochem. 29, 1–68.
Koradi, R., Billeter, M. and Wüthrich, K. (1996) J. Mol. Graph., 14, 51–55.
Kuhn, H. (1955) Nav. Res. Logist. Quart., 2, 83–97.
Kurinov, I.V. and Harrison, R.W. (1995) Acta Crystallogr. D Biol. Crystallogr., 51, 98–109.
Kuszewski, J., Gronenborn, A.M. and Clore, G.M. (1999) J. Am. Chem. Soc., 121, 2337–2338.
Langmead, C.J. and Donald, B.R. (2003) In Proceedings of the IEEE Computer Society Bioinformatics Conference (CSB), Stanford University, Palo Alto, CA (August 11–14), pp. 209–217.
Langmead, C.J., Yan, A.K., Wang, L., Lilien, R.H. and Donald, B.R. (2003) In Proceedings of the 7th Ann. Intl. Conf. on Research in Comput. Biol. (RECOMB) Berlin, Germany, April 10–13, pp. 176–187.
Lathrop, R. and Smith, T. (1996) J. Mol. Biol., 255, 641–665.
Lim, K., Nadarajah, A., Forsythe, E.L. and Pusey, M.L. (1998) Acta Crystallogr. D Biol. Crystallogr., 54, 899–904.
Losonczi, J., Andrec, M., Fischer, W. and J.H., P. (1999) J Magn. Reson., 138, 334–342.
Meiler, J., Peti, W. and Griesinger, C. (2000) J. Biomol. NMR, 17, 283–294.
Mueller, G., Choy, W., Yang, D., Forman-Kay, J., Venters, R. and Kay, L. (2000) J. Mol. Biol., 300, 197–212.
Neal, S., Nip, A.M., Zhang, H. and Wishart, D.S. (2003) J. Biomol. NMR, 26, 215–240.
Oki, H., Matsuura, Y., Komatsu, H. and Chernov, A. A. (1999) Acta Crystallogr. D Biol. Crystallogr., 55, 114.
Ottiger, M. and Bax, A. (1998) J. Am. Chem. Soc., 120, 12334–12341.
Palmer, A.G. (1997) Curr. Opin. Struct. Biol., 7, 732–737.
Ramage, R., Green, J., Muir, T.W., Ogunjobi, O.M., Love, S. and Shaw, K. (1994) J. Biochem., 299, 151–158.
Redfield, C., Hoch, J. and Dobson, C. (1983) FEBS Lett., 159, 132–136.
Rohl, C. and Baker, D. (2002) J. Am. Chem. Soc., 124, 2723–2729.
Rossman, M. and Blow, D. (1962) Acta Crystallogr., 15, 24–31.
Sali, A., Overington, J., Johnson, M. and Blundell, T. (1990) Trends Biochem. Sci., 15, 235–240.
Saupe, A. (1968) Angew. Chem., 7, 97–112.
Schneider, D., Dellwo, M. and Wand, A.J. (1992) Biochemistry, 31, 3645–3652.
Schwalbe, H., Grimshaw, S.B., Spencer, A., Buck, M., Boyd, J., Dobson, C.M., Redfield, C. and Smith, L.J. (2001) Protein Sci., 10, 677–688.
Seavey, B., Farr, E., Westler, W. and Markley, J. (1991) J. Biomol. NMR, 1, 217–236.
Shuker, S.B., Hajduk, P.J., Meadows, R.P. and Fesik, S.W. (1996) Science, 274, 1531–1534.
Tian, F., Valafar, H. and Prestegard, J. H. (2001) J. Am. Chem. Soc., 123, 11791–11796.
Tjandra, N. and Bax, A. (1997) Science, 278, 1111–1114.
Tolman, J.R., Flanagan, J.M., Kennedy, M.A. and Prestegard, J.H. (1995) Proc. Natl. Acad. Sci. USA, 92, 9279–9283.
Vaney, M.C., Maignan, S., Ries-Kautt, M. and Ducruix, A. (1996) Acta Crystallogr. D Biol. Crystallogr., 52, 505–517.
Vijay-Kumar, S., Bugg, C.E. and Cook, W.J. (1987) J. Mol. Biol., 194, 531–544.
Wang, L. and Donald, B.R. (2004) J. Biomol. NMR, in press.
Weber, P.L., Brown, S.C. and Mueller, L. (1987) Biochemistry, 26, 7282–7290.
Wedemeyer, W.J., Rohl, C.A. and Scheraga, H.A. (2002) J. Biomol. NMR, 22, 137–151.
Xu, X. and Case, D. (2001) J. Biomol. NMR, 21, 321–333.
Xu, Y., Xu, D., Crawford, O.H., Einstein, J.R. and Serpersu, E. (2000) Proc. RECOMB, pp. 299–307.
Yan, A., Langwead, C. and Donald, B.R. (2003) A Probability-Based Similarity Measure for Saupe Alignment Tensors with Applications to Residual Dipolar Couplings in NMR Structural Biology. Technical Report No. TR2003-474, Dartmouth Computer Science Department, http://www.cs.dartmouth.edu/reports/abstracts/TR2003-474/
Zweckstetter, M. (2003) J. Biomol. NMR, 27, 41–56.
Zweckstetter, M. and Bax, A. (2000) J. Am. Chem. Soc., 122, 3791–3792.
Zweckstetter, M. and Bax, A. (2001) J. Am. Chem. Soc., 123, 9490–9491.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is supported by grants to B.R.D. from the National Institutes of Health (GM-65982) and the National Science Foundation (IIS-9906790, EIA-0305444, and EIA-9802068)
Rights and permissions
About this article
Cite this article
Langmead, C.J., Donald, B.R. An expectation/maximization nuclear vector replacement algorithm for automated NMR resonance assignments. J Biomol NMR 29, 111–138 (2004). https://doi.org/10.1023/B:JNMR.0000019247.89110.e6
Issue Date:
DOI: https://doi.org/10.1023/B:JNMR.0000019247.89110.e6