Blind trials of computer-assisted structure elucidation software
One of the largest challenges in chemistry today remains that of efficiently mining through vast amounts of data in order to elucidate the chemical structure for an unknown compound. The elucidated candidate compound must be fully consistent with the data and any other competing candidates efficiently eliminated without doubt by using additional data if necessary. It has become increasingly necessary to incorporate an in silico structure generation and verification tool to facilitate this elucidation process. An effective structure elucidation software technology aims to mimic the skills of a human in interpreting the complex nature of spectral data while producing a solution within a reasonable amount of time. This type of software is known as computer-assisted structure elucidation or CASE software. A systematic trial of the ACD/Structure Elucidator CASE software was conducted over an extended period of time by analysing a set of single and double-blind trials submitted by a global audience of scientists. The purpose of the blind trials was to reduce subjective bias. Double-blind trials comprised of data where the candidate compound was unknown to both the submitting scientist and the analyst. The level of expertise of the submitting scientist ranged from novice to expert structure elucidation specialists with experience in pharmaceutical, industrial, government and academic environments.
Beginning in 2003, and for the following nine years, the algorithms and software technology contained within ACD/Structure Elucidator have been tested against 112 data sets; many of these were unique challenges. Of these challenges 9% were double-blind trials. The results of eighteen of the single-blind trials were investigated in detail and included problems of a diverse nature with many of the specific challenges associated with algorithmic structure elucidation such as deficiency in protons, structure symmetry, a large number of heteroatoms and poor quality spectral data.
When applied to a complex set of blind trials, ACD/Structure Elucidator was shown to be a very useful tool in advancing the computer's contribution to elucidating a candidate structure from a set of spectral data (NMR and MS) for an unknown. The synergistic interaction between humans and computers can be highly beneficial in terms of less biased approaches to elucidation as well as dramatic improvements in speed and throughput. In those cases where multiple candidate structures exist, ACD/Structure Elucidator is equipped to validate the correct structure and eliminate inconsistent candidates. Full elucidation can generally be performed in less than two hours; this includes the average spectral data processing time and data input.
- Elyashberg, ME, Williams, AJ, Blinov, KA (2012) Contemporary Computer-Assisted Approaches to Molecular Structure Elucidation. Royal Society of Chemistry, Cambridge, UK
- Smurnyy, YD, Blinov, KA, Churanova, T, Elyashberg, ME, Williams, AJ (2008) Toward more reliable13C and1H chemical shift prediction: a systematic comparison of neural-network and least-squares regression based approaches. J Chem Inf Model 48: pp. 128-134 CrossRef
- Steinbeck, C (2004) Recent developments in automated structure elucidation of natural products. Nat Prod Rep 21: pp. 512-518 CrossRef
- Elyashberg, ME, Blinov, KA, Williams, AJ, Martirosian, ER, Molodtsov, SG (2002) Application of a new expert system for the structure elucidation of natural products from their 1D and 2D NMR data. J Nat Prod 65: pp. 693-703 CrossRef
- Elyashberg, ME, Blinov, KA, Molodtsov, SG, Williams, AJ, Martin, GE, Martirosian, ER (2004) Structure Elucidator: a versatile expert system for molecular structure elucidation from 1D and 2D NMR data and molecular fragments. J Chem Inf Comput Sci 44: pp. 771-792 CrossRef
- Williams AJ: Collaboration, Community and Quality in chemistry databases. [http://www.chemspider.com/blog/collaboration-community-and-quality-in-chemistry-databases.html] Blog Post accessed November 26th 2011
- Shelley, CA, Munk, ME (1981) CASE, a computer model of the structure elucidation process. Anal Chim Acta 133: pp. 507-516 CrossRef
- Structure Elucidator Challenge [http://www.acdlabs.com/products/com_iden/elucidation/struc_eluc/challenge.php]
- Elyashberg, ME, Blinov, KA, Molodtsov, SG, Smurnyy, YD, Williams, AJ, Churanova, T (2009) Computer-assisted methods for molecular structure elucidation: realizing a spectroscopist's dream. J Cheminf 1: pp. 3 CrossRef
- Blinov, KA, Elyashberg, ME, Molodtsov, SG, Williams, AJ, Martirosian, ER (2001) An expert system for automated structure elucidation utilizing1H-1H,13C-1H and 15N-1H 2D NMR correlations. Fresenius J Anal Chem 369: pp. 709-714 CrossRef
- Moser A: The Facets of Structure Elucidation. [http://acdlabs.typepad.com/elucidation/2009/01/the-facets-of-structure-elucidation.html] Blog post accessed November 26th 2011
- Blinov, KA, Carlson, DV, Elyashberg, ME, Martin, GE, Martirosian, ER, Molodtsov, SG, Williams, AJ (2003) Computer-assisted structure elucidation of natural products with limited 2D NMR data: application of the StrucEluc system. Magn Reson Chem 41: pp. 359-372 CrossRef
- Wang, Y, Xiao, J, Suzek, TO, Zhang, J, Wang, J, Bryant, SH (2009) PubChem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Research 37: pp. W623-633 CrossRef
- Elyashberg, ME, Blinov, KA, Williams, AJ, Molodtsov, SG, Martin, GE (2006) Are Deterministic Expert Systems for Computer-Assisted Structure Elucidation Obsolete?. J Chem Inf Comp Sci 42: pp. 241-248
- Dong, L, Schill, H, Grange, RL, Porzelle, A, Johns, JP, Parsons, PG, Gordon, VA, Reddell, PW, Williams, CM (2009) Anticancer Agents from the Australian Tropical Rainforest: Spiroacetals EBC-23, 24, 25, 72, 73, 75 and 76. Chem Eur J 15: pp. 11307-11318 CrossRef
- Stierle, AA, Stierle, DB, Kelly, K (2006) Berkelic Acid, A Novel Spiroketal with Selective Anticancer Activity from an Acid Mine Waste Fungal Extremophile. J Org Chem 71: pp. 5357-5360 CrossRef
- Elyashberg, ME, Williams, AJ, Martin, GE (2008) Computer-assisted structure verification and elucidation tools in NMR-based structure elucidation. Prog NMR Spectrosc 53: pp. 1-104 CrossRef
- Bremser, W (1978) HOSE - A novel substructure code. Anal Chim Acta 103: pp. 355-365 CrossRef
- Elyashberg, ME, Blinov, KA, Williams, AJ (2009) A systematic approach for the generation and verification of structural hypotheses. Magn Reson Chem 47: pp. 371-389 CrossRef
- Tsuda, M, Takahashi, Y, Fromont, J, Mikami, Y, Kobayashi, J (2005) Dendridine A, a Bis-indole Alkaloid from a Marine Sponge Dictyodendrilla Species. J Nat Prod 68: pp. 1277-1278 CrossRef
- ACD/CNMR Predictor, Version 12.0 Advanced Chemistry Development, Inc., Ontario, Canada; 2011.
- Houssen, WH (2006) Chemical, biological and molecular approaches toward drug discovery from marine organisms. Aberdeen University, Aberdeen, Scotland.
- Zain, ME, Awaad, Amani S, Razak, AA, Maitland, DJ, Khamis, NE, Sakhawy, MA (2009) Secondary Metabolites of Aureobasidium Pullulans Isolated from Egyptian Soil and Their Biological Activity. J Appl Sci Res 5: pp. 1582-1591
- Cichewicz, RH, Valeriote, FA, Crews, P (2004) Psymberin, A Potent Sponge-Derived Cytotoxin from Psammocinia Distantly Related to the Pederin Family. Org Lett 6: pp. 1951-1954 CrossRef
- Baker, LA, Williams, CM, Bernhardta, PV, Yanikb, GW (2006) Azedaralide: total synthesis, relative and absolute stereochemical assignment. Tetrahedron 62: pp. 7355-7360 CrossRef
- Joyner, PM, Liu, J, Zhang, Z, Merritt, J, Qi, F, Cichewicz, RH (2010) Mutanobactin A from the human oral pathogen Streptococcus mutans is a cross-kingdom regulator of the yeast-mycelium transition. Org & Biomol Chem 8: pp. 5461-5702 CrossRef
- Lambert, JB, Mazzola, E (2004) Nuclear Magnetic Resonance Spectroscopy. Pearson Prentice Hall, Upper Saddle River, N.J
- Pilgrim, S, Kociok-Kohn, G, Lloyd, MD, Lewis, SE (2011) ''Inosaminoacids'': novel inositol-amino acid hybrid structures accessed by microbial arene oxidation. Chem Commun 47: pp. 4799-4801 CrossRef
- Blind trials of computer-assisted structure elucidation software
- Open Access
- Available under Open Access This content is freely available online to anyone, anywhere at any time.
Journal of Cheminformatics
- Online Date
- February 2012
- Online ISSN
- Chemistry Central
- Additional Links
- Author Affiliations
- 1. Toronto Department, Advanced Chemistry Development, 110 Yonge Street, 14th floor, Toronto, Ontario, M5C 1T4, Canada
- 2. Moscow Department, Advanced Chemistry Development, 6 Akademik Bakulev Street, Moscow, 117513, Russian Federation
- 3. Royal Society of Chemistry, 904 Tamaras Circle, Wake Forest, NC, 27587, USA