Tautomerism in chemical information management systems

  • Wendy A. Warr


Tautomerism has an impact on many of the processes in chemical information management systems including novelty checking during registration into chemical structure databases; storage of structures; exact and substructure searching in chemical structure databases; and depiction of structures retrieved by a search. The approaches taken by 27 different software vendors and database producers are compared. It is hoped that this comparison will act as a discussion document that could ultimately improve databases and software for researchers in the future.


Tautomer; Tautomerism; Chemical structure representation; Chemical information management; Registration; Substructure search; Software vendors; Database producers



I am indebted to the many people who supplied data for this article. In particular, I would like to thank Jonathan Brecher of CambridgeSoft and Geoff Skillman of OpenEye for helpful discussions, Russ Hillard and Keith Taylor of Symyx for drafting two figures, and Thomson Reuters for some careful proofreading.



Copyright information

© Springer Science+Business Media B.V. 2010

Authors and Affiliations

  1. Wendy Warr & Associates, Cheshire, England, UK
