Skip to main content

SAGA: rapid automatic mainchain NMR assignment for large proteins

Abstract

Here we describe a new algorithm for automatically determining the mainchain sequential assignment of NMR spectra for proteins. Using only the customary triple resonance experiments, assignments can be quickly found for not only small proteins having rather complete data, but also for large proteins, even when only half the residues can be assigned. The result of the calculation is not the single best assignment according to some criterion, but rather a large number of satisfactory assignments that are summarized in such a way as to help the user identify portions of the sequence that are assigned with confidence, vs. other portions where the assignment has some correlated alternatives. Thus very imperfect initial data can be used to suggest future experiments.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

References

  • Atreya HS, Szyperski T (2004) G-matrix fourier transform NMR spectroscopy for complete protein resonance assignment. Proc Natl Acad Sci USA 101:9642–9647

    Article  ADS  Google Scholar 

  • Baldus M (2007) ICMRBS founder’s medal 2006: biological solid-state NMR, methods and applications. J Biomol NMR 39:73–86

    Article  Google Scholar 

  • Bartels C, Xia T, Billeter M, Güntert P, Wüthrich K (1995) The program XEASY for computer-supported NMR spectral-analysis of biological macromolecules. J Biomol NMR 6:1–10

    Article  Google Scholar 

  • Benod C, Delsuc M-A, Pons J-L (2006) CRAACK: consensus program for NMR amino acid type assignment. J Chem Inf Model 46:1517–1522

    Article  Google Scholar 

  • Billeter M, Wagner G, Wüthrich K (2008) Solution NMR structure determination of proteins revisited. J Biomol NMR 42(3):155–158

    Article  Google Scholar 

  • Bron C, Kerbosch J (1973) Finding all cliques of an undirected graph. Comm ACM 16:575–577

    MATH  Article  Google Scholar 

  • Cavanagh J, Fairbrother WJ, Palmer AG, Rance M, Skelton NJ (2007) Protein NMR spectroscopy, 2nd edn. Academic Press, Amsterdam

    Google Scholar 

  • Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, Bax A (1995) NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J Biomol NMR 6:277–293

    Article  Google Scholar 

  • Eghbalnia HR, Bahrami A, Wang L, Assadid A, Markley JL (2005) Probabilistic identification of spin systems and their assignments including coil–helix inference as output (PISTACHIO). J Biomol NMR 32:219–233

    Article  Google Scholar 

  • Friedrichs MS, Mueller L, Wittekind M (1994) An automated procedure for the assignment of protein 1HN, 15N, 13Cα, 1Hα, 13Cβ and 1Hβ resonances. J Biomol NMR 4:703–726

    Article  Google Scholar 

  • Frueh DP, Arthanari H, Koglin A, Walsh CT, Wagner G (2009) A double TROSY hNCAnH experiment for efficient assignment of large and challenging proteins. J Am Chem Soc 131:12880–12881

    Article  Google Scholar 

  • Goddard TD, Kneller DG (2003) Sparky–NMR assignment and integration software. University of California, San Francisco

    Google Scholar 

  • Güntert P (2009) Automated structure determination from NMR spectra. Eur Biophys J 38(2):129–143

    Article  Google Scholar 

  • Hare BJ, Prestegard JH (1994) Application of neural networks to automated assignment of NMR spectra of proteins. J Biomol NMR 4:35–46

    Article  Google Scholar 

  • Hopcroft JE, Karp RM (1973) An n5/2 algorithm for maximum matchings in bipartite graphs. SIAM J Comput 2:225–231

    MATH  Article  MathSciNet  Google Scholar 

  • Hyberts SG, Wagner G (2003) IBIS–A tool for automated sequential assignment of protein spectra from triple resonance experiments. J Biomol NMR 26:335–344

    Article  Google Scholar 

  • Kao M-Y, Lam T-W, Sung W–K, Ting H–F (2001) A decomposition theorem for maximum weight bipartite matchings. SIAM J Comput 31:18–26

    MATH  Article  MathSciNet  Google Scholar 

  • Kuhl FS, Crippen GM, Friesen DK (1984) A combinatorial algorithm for calculating ligand binding. J Comput Chem 5:24–34

    Article  Google Scholar 

  • Kuhn HW (1955) Variants of the hungarian method for assignment problems. Naval Research Logistics Quarterly 2:83–97

    Article  MathSciNet  Google Scholar 

  • Lemak A, Steren CA, Arrowsmith CH, Llinás M (2008) Sequence specific resonance assignment via multicanonical Monte Carlo search using an ABACUS approach. J Biomol NMR 41:29–41

    Article  Google Scholar 

  • Leutner M, Gschwind RM, Liermann J, Schwarz C, Gemmecker G, Kessler H (1998) Automated backbone assignment of labeled proteins using the threshold accepting algorithm. J Biomol NMR 11:31–43

    Article  Google Scholar 

  • Lin G, Wan X, Tegos T, Li Y (2006) Statistical evaluation of NMR backbone resonance assignment. Int J Bioinf Res App 2:147–160

    Google Scholar 

  • Lukin JA, Gove AP, Talukdar SN, Ho C (1997) Automated probabilistic method for assigning backbone resonances of (13C, 15N)-labeled proteins. J Biomol NMR 9:151–166

    Article  Google Scholar 

  • Meadows RP, Olejniczak ET, Fesik SW (1994) A computer-based protocol for semiautomated assignments and 3D structure determination of proteins. J Biomol NMR 4:79–96

    Article  Google Scholar 

  • Moseley HNB, Monleon D, Montelione GT (2001) Automatic determination of protein backbone resonance assignments from triple resonance nuclear magnetic resonance data. Meth in Enzymol 339:91–108

    Article  Google Scholar 

  • Nakada S, Sakakura M, Takahashi H, Tokuda H, Shimada I (2007) Backbone resonance assignment for the outer membrane lipoprotein receptor LolB from Escherichia coli. Biomol NMR Assign 1:121–123

    Article  Google Scholar 

  • Olson JB Jr, Markley JL (1994) Evaluation of an algorithm for the automated sequential assignment of protein backbone resonances: A demonstration of the connectivity tracing assignment tools (CONTRAST) software package. J Biomol NMR 4:385–410

    Article  Google Scholar 

  • Oschkinat H, Croft D (1994) Automated assignment of multidimensional nuclear magnetic resonance spectra. Meth Enzymol 239:308–318

    Article  Google Scholar 

  • Parker RG, Rardin RL (1988) Discrete Optimization. Academic Press, New York

    MATH  Google Scholar 

  • Revington M, Zuiderweg ERP (2004) TROSY-driven NMR backbone assignments of the 381-residue nucleotide-binding domain of the Thermus Thermophilus DnaK molecular chaperone. J Biomol NMR 30:113–114

    Article  Google Scholar 

  • Revington M, Zhang Y, Yip GN, Kurochkin AV, Zuiderweg ERP (2005) NMR investigations of allosteric processes in a two-domain Thermus thermophilus Hsp70 molecular chaperone. J Mol Biol 349:163–183

    Article  Google Scholar 

  • Seavey BR, Farr EA, Westler WM, Markley J (1991) A relational database for sequence-specific protein NMR data. J Biomol NMR 1:217–236

    Article  Google Scholar 

  • Stratmann D, Guittet E, van Heijenoort C (2010) Robust structure-based resonance assignment for functional protein studies by NMR. J Biomol NMR 46:157–173

    Article  Google Scholar 

  • Tugarinov V, Muhandiram R, Ayed A, Kay LE (2002) Four-dimensional NMR spectroscopy of a 723-residue protein: chemical shift assignments and secondary structure of malate synthase G. J Am Chem Soc 124:10025–10035

    Article  Google Scholar 

  • Vitek O, Bailey-Kellogg C, Craig B, Kuliniewicz P, Vitek J (2005) Reconsidering complete search algorithms for protein backbone NMR assignment. Bioinformatics 21:230–236

    Article  Google Scholar 

  • Wan X, Lin G (2007) GASA: A graph-based automated NMR backbone resonance sequential assignment program. J Bioinf Comput Biol 5:313–333

    Article  Google Scholar 

  • Wang J, Wang T, Zuiderweg ERP, Crippen GM (2005) CASA: An efficient automated assignment of protein mainchain NMR data using an ordered tree search algorithm. J Biomol NMR 33:261–279

    Article  Google Scholar 

  • Williamson M, Craven C (2009) Automated protein structure calculation from NMR data. J Biomol NMR 43:131–143

    Article  Google Scholar 

  • Xiong F, Pandurangan G, Bailey-Kellogg C (2008) Contact replacement for NMR resonance assignment. Bioinformatics 24:205–213

    Article  Google Scholar 

  • Zimmerman DE, Montelione GT (1995) Automated analysis of nuclear magnetic resonance assignments for proteins. Curr Opin Struct Biol 5:664–673

    Article  Google Scholar 

  • Zimmerman DE, Kulikowski CA, Montelione GT (1993) A constraint reasoning system for automating sequence-specific resonance assignments from multidimensional protein NMR spectra. Proc Int Conf Intell Syst Mol Biol 1:447–455

    Google Scholar 

  • Zimmerman D, Kulikowski C, Wang L, Lyons B, Montelione GT (1994) Automated sequencing of amino acid spin systems in proteins using multidimensional HCC(CO)NH-TOCSY spectroscopy and constraint propagation methods from artificial intelligence. J Biomol NMR 4:241–256

    Article  Google Scholar 

  • Zimmerman DE, Kulikowski CA, Huang Y, Feng W, Tashiro M, Shimotakahara S, Chien CY, Powers R, Montelione GT (1997) Automated analysis of protein NMR assignments using methods from artificial intelligence. J Mol Biol 269:592–610

    Article  Google Scholar 

Download references

Acknowledgments

E.R.P.Z. acknowledges support from NIH grants GM063027 and GM063027-08S1 (E.R.P.Z., PI) and NS059690 (J.E. Gestwicki, PI). We thank Drs. A.V. Kurochkin and D.S. Weaver for the preparation of the Hsc70 NMR samples.

Author information

Affiliations

Authors

Corresponding authors

Correspondence to Gordon M. Crippen or Erik R. P. Zuiderweg.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 1971 kb)

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Crippen, G.M., Rousaki, A., Revington, M. et al. SAGA: rapid automatic mainchain NMR assignment for large proteins. J Biomol NMR 46, 281–298 (2010). https://doi.org/10.1007/s10858-010-9403-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10858-010-9403-2

Keywords

  • Automatic assignment
  • Generic spin system
  • Triple resonance
  • Large proteins