CheckShift improved: fast chemical shift reference correction with high accuracy
- 121 Downloads
The construction of a consistent protein chemical shift database is an important step toward making more extensive use of this data in structural studies. Unfortunately, progress in this direction has been hampered by the quality of the available data, particularly with respect to chemical shift referencing, which is often either inaccurate or inconsistently annotated. Preprocessing of the data is therefore required to detect and correct referencing errors. In an earlier study we developed CheckShift, a program for performing this task automatically. Now we spent substantial effort in improving the running time of the CheckShift algorithm, which resulted in an running time decrease of 90%, thereby achieving equivalent quality to the former version of CheckShift. The reason for the running time decrease is twofold. Firstly we improved the search for the optimal re-referencing offset considerably. Secondly, as CheckShift is based on a secondary structure prediction from the amino acid sequence (formally PsiPred was used), we evaluated a wide range of available secondary structure prediction programs focusing on the special needs of the CheckShift algorithm. The results of this evaluation prove empirically that we can use faster secondary structure prediction programs than PsiPred without sacrificing CheckShift’s accuracy. Very recently Wang and Markley (2009) gave a small list of extreme outliers of the former version of the CheckShift web-server. Those were due to the empirical reduction of the search space implemented in the old version. The new version of CheckShift now gives very similar results to RefDB and LACS for all outliers mentioned in Table 1 of Wang and Markley (2009).
KeywordsChemical shifts Re-referencing CheckShift
We thank Murray Coles, MPI for Developmental Biology, Tübingen, for providing, the experimental data used in our evaluations. Thanks also go to Christian Weichenberger from the research group of Manfred Sippl, University of Salzburg, who compiled the list for our smaller secondary structure content prediction benchmark.
- Berjanskii MV, Neal S, Wishart DS (2006) Preditor: a web server for predicting protein torsion angle restraints. Nucleic Acids Res 34(Web Server issue):W63–W69. doi: 10.1093/nar/gkl341
- Coles M, Diercks T, Liermann J, Groger A, Rockel B, Baumeister W, Koretke KK, Lupas A, Peters J, Kessler H (1999) The solution structure of VAT-N reveals a ’missing link’ in the evolution of complex enzymes from a simple βαββ element. Curr Biol 9(20):1158–1168. doi: 10.1016/S0960-9822(00)80017-2 Google Scholar
- Ginzinger SW, Gerick F, Coles M, Heun V (2007a) Checkshift: automatic correction of inconsistenly referenced chemical shift data. J Biomol NMR. doi: 10.1007/s10858-007-9191-5
- Haupt M, Bramkamp M, Heller M, Coles M, Deckers-Hebestreit G, Herkenhoff-Hesselmann B, Altendorf K, Kessler H (2006) The holo-form of the nucleotide binding domain of the KdpFABC complex from Escherichia coli reveals a new binding mode. J Biol Chem 281(14):9641–9649. doi: 10.1074/jbc.M508290200 Google Scholar
- Heinig M, Frishman D (2004) STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res 32(Web Server issue):W500–W502. doi: 10.1093/nar/gkh429
- Rost B, Yachdav G, Liu J (2004) The predictprotein server. Nucleic Acids Res 32(Web Server issue):W321–W326. doi: 10.1093/nar/gkh377
- Skočibušić M (2008) Improving the CheckShift algorithm. Diploma thesis, Lugwig-Maximilians Universität MünchenGoogle Scholar
- Wang L, Eghbalnia HR, Bahrami A, Markley JL (2005) Linear analysis of carbon-13 chemical shift differences and its application to the detection and correction of errors in referencing and spin system identifications. J Biomol NMR 32(1):13–22. doi: 10.1007/s10858-005-1717-0