Evaluation of DOCK 6 as a pose generation and database enrichment tool
- 946 Downloads
In conjunction with the recent American Chemical Society symposium titled “Docking and Scoring: A Review of Docking Programs” the performance of the DOCK6 program was evaluated through (1) pose reproduction and (2) database enrichment calculations on a common set of organizer-specified systems and datasets (ASTEX, DUD, WOMBAT). Representative baseline grid score results averaged over five docking runs yield a relatively high pose identification success rate of 72.5 % (symmetry corrected rmsd) and sampling rate of 91.9 % for the multi site ASTEX set (N = 147) using organizer-supplied structures. Numerous additional docking experiments showed that ligand starting conditions, symmetry, multiple binding sites, clustering, and receptor preparation protocols all affect success. Encouragingly, in some cases, use of more sophisticated scoring and sampling methods yielded results which were comparable (Amber score ligand movable protocol) or exceeded (LMOD score) analogous baseline grid-score results. The analysis highlights the potential benefit and challenges associated with including receptor flexibility and indicates that different scoring functions have system dependent strengths and weaknesses. Enrichment studies with the DUD database prepared using the SB2010 preparation protocol and native ligand pairings yielded individual area under the curve (AUC) values derived from receiver operating characteristic curve analysis ranging from 0.29 (bad enrichment) to 0.96 (good enrichment) with an average value of 0.60 (27/38 have AUC ≥ 0.5). Strong early enrichment was also observed in the critically important 1.0–2.0 % region. Somewhat surprisingly, an alternative receptor preparation protocol yielded comparable results. As expected, semi-random pairings yielded poorer enrichments, in particular, for unrelated receptors. Overall, the breadth and number of experiments performed provide a useful snapshot of current capabilities of DOCK6 as well as starting points to guide future development efforts to further improve sampling and scoring.
KeywordsPose identification Pose rescoring Docking Virtual screening Enrichment ROC curves Scoring Sampling Rmsd Symmetry
Greg Warren, Neysa Nevins, and Georgia McGauhey are thanked for organizing the special Docking and Scoring symposium. William J. Allen and Jiangyang Liu are thanked for code development and Steve Skiena is thanked for helpful discussions regarding implementation of symmetry corrected rmsd using the Hungarian matching algorithm. This work was supported in part by NIH grants GM57513 (D.A.C.), R01GM083669 (R.C.R.), and F31CA134201 (T.E.B.), as well as the Stony Brook University Office of the Vice President for Research and the New York State Office of Science Technology and Academic Research (NYSTAR). S.R.B. gratefully acknowledges the use of computational facilities at the Ohio Supercomputer Center and thanks OpenEye Scientific Software for an academic license. This work also used resources at the New York Center for Computational Sciences at Stony Brook University/Brookhaven National Laboratory supported by the US Department of Energy under Contract No. DE-AC02-98CH10886 and by the State of New York. Molecular graphics and analyses were performed with the UCSF Chimera package. Chimera is developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from the National Institutes of Health (National Center for Research Resources grant 2P41RR001081, National Institute of General Medical Sciences grant 9P41GM103311).
- 22.SBU DOCK Tutorials. http://ringo.ams.sunysb.edu/index.php/DOCK_Tutorials. Last accessed Mar 01, 2012
- 23.UCSF DOCK Tutorials. http://dock.compbio.ucsf.edu/DOCK_6/tutorials/index.htm. Last accessed Mar 01, 2012
- 33.Kollman PA, Massova I, Reyes C, Kuhn B, Huo S, Chong L, Lee M, Lee T, Duan Y, Wang W, Donini O, Cieplak P, Srinivasan J, Case DA, Cheatham TE (2000) Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. Acc Chem Res 33(12):889–897CrossRefGoogle Scholar
- 34.Rastelli G, Rio AD, Degliesposti G, Sgobba M (2010) Fast and accurate predictions of binding free energies using MM-PBSA and MM-GBSA. J Comput Chem 31(4):797–810Google Scholar