Automated protein motif generation in the structure-based protein function prediction tool ProMOL
ProMOL, a plugin for the PyMOL molecular graphics system, is a structure-based protein function prediction tool. ProMOL includes a set of routines for building motif templates that are used for screening query structures for enzyme active sites. Previously, each motif template was generated manually and required supervision in the optimization of parameters for sensitivity and selectivity. We developed an algorithm and workflow for the automation of motif building and testing routines in ProMOL. The algorithm uses a set of empirically derived parameters for optimization and requires little user intervention. The automated motif generation algorithm was first tested in a performance comparison with a set of manually generated motifs based on identical active sites from the same 112 PDB entries. The two sets of motifs were equally effective in identifying alignments with homologs and in rejecting alignments with unrelated structures. A second set of 296 active site motifs was then generated automatically, based on Catalytic Site Atlas entries with literature citations, as an expansion of the library of existing manually generated motif templates. The new motif templates performed comparably to the existing ones in terms of hit rates against native structures, homologs with the same EC and Pfam designations, and randomly selected unrelated structures that differ at the first EC digit, as well as in terms of RMSD values obtained from local structural alignments of motifs and query structures. This research is supported by NIH grant GM078077.
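As a rough illustration of the RMSD measure mentioned above, the following minimal sketch computes the root-mean-square deviation between paired motif and query atom positions. It is not ProMOL's code: the function name `rmsd`, the coordinate values, and the assumption that the two atom sets have already been paired and superposed are all hypothetical choices made for this example.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equal-length lists of (x, y, z) points.

    Assumes the points are already paired one-to-one and superposed;
    no fitting or alignment is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
        for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Hypothetical coordinates in angstroms, invented for illustration only:
# every query atom is shifted 1.0 along z relative to its motif partner.
motif_atoms = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (0.0, 1.5, 0.0)]
query_atoms = [(0.0, 0.0, 1.0), (1.5, 0.0, 1.0), (0.0, 1.5, 1.0)]

print(round(rmsd(motif_atoms, query_atoms), 3))  # prints 1.0
```

A lower value indicates that the query residues sit closer to the template geometry; the thresholds ProMOL actually applies are not given in this abstract.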
Keywords: Template-based alignment, ProMOL, PyMOL, Molecular visualization, Structural bioinformatics, Enzyme, Catalytic site motif
The authors thank Frances C. Bernstein and the current and former students who have worked on the SBEVSL project at Dowling College and at RIT for their invaluable assistance. Funding for the project has been provided by NIH GM078077, NSF-DUE 0402408, Rochester Institute of Technology and Dowling College.