Blind trials of computer-assisted structure elucidation software
One of the largest challenges in chemistry today remains that of efficiently mining vast amounts of data in order to elucidate the chemical structure of an unknown compound. The elucidated candidate compound must be fully consistent with the data, and any competing candidates must be efficiently eliminated without doubt, using additional data if necessary. It has become increasingly necessary to incorporate an in silico structure generation and verification tool to facilitate this elucidation process. Effective structure elucidation software aims to mimic the skills of a human in interpreting the complex nature of spectral data while producing a solution within a reasonable amount of time. This type of software is known as computer-assisted structure elucidation (CASE) software. A systematic trial of the ACD/Structure Elucidator CASE software was conducted over an extended period of time by analysing a set of single- and double-blind trials submitted by a global audience of scientists. The purpose of the blind trials was to reduce subjective bias. Double-blind trials comprised data where the candidate compound was unknown to both the submitting scientist and the analyst. The submitting scientists ranged in expertise from novices to expert structure elucidation specialists, with experience in pharmaceutical, industrial, government and academic environments.
Beginning in 2003, and for the following nine years, the algorithms and software technology contained within ACD/Structure Elucidator were tested against 112 data sets; many of these were unique challenges. Of these challenges, 9% were double-blind trials. The results of eighteen of the single-blind trials were investigated in detail; they included problems of a diverse nature, with many of the specific challenges associated with algorithmic structure elucidation, such as deficiency in protons, structural symmetry, a large number of heteroatoms and poor-quality spectral data.
When applied to a complex set of blind trials, ACD/Structure Elucidator was shown to be a very useful tool in advancing the computer's contribution to elucidating a candidate structure from a set of spectral data (NMR and MS) for an unknown compound. The synergistic interaction between humans and computers can be highly beneficial in terms of less biased approaches to elucidation as well as dramatic improvements in speed and throughput. In those cases where multiple candidate structures exist, ACD/Structure Elucidator is equipped to validate the correct structure and eliminate inconsistent candidates. Full elucidation can generally be performed in less than two hours; this includes the average spectral data processing time and data input.
Journal of Cheminformatics
- Online Date: February 2012
- Chemistry Central
- Author Affiliations
- 1. Toronto Department, Advanced Chemistry Development, 110 Yonge Street, 14th floor, Toronto, Ontario, M5C 1T4, Canada
- 2. Moscow Department, Advanced Chemistry Development, 6 Akademik Bakulev Street, Moscow, 117513, Russian Federation
- 3. Royal Society of Chemistry, 904 Tamaras Circle, Wake Forest, NC, 27587, USA