Journal of Computer-Aided Molecular Design, Volume 28, Issue 10, pp 1023–1030

The Royal Society of Chemistry and the delivery of chemistry data repositories for the community

  • Antony Williams
  • Valery Tkachenko


Since 2009 the Royal Society of Chemistry (RSC) has been delivering access to chemistry data and cheminformatics tools via the ChemSpider database, and has garnered a significant community following in terms of both usage of and contribution to the platform. ChemSpider has focused only on those chemical entities that can be represented as molecular connection tables or, more specifically, those for which an InChI can be generated from the input structure. As a structure-centric hub, ChemSpider is built around the molecular structure, with other data and links associated with that structure. As a result, the platform has been limited in the types of data it can manage and in the flexibility of its searches, and it is constrained by its data model. New technologies and approaches, specifically the shift from relational to NoSQL databases and the growing importance of the semantic web, have motivated RSC to rearchitect the platform and create a more generic data repository utilizing these technologies. This article provides an overview of our activities in delivering data sharing platforms for the chemistry community, including the development of the new data repository, which expands into more extensive domains of chemistry data.


Keywords: Data repository, ChemSpider, Crowdsourcing, InChI, NoSQL



ChemSpider is the result of the aggregate work of many contributors extending beyond our own team. Our RSC platforms are supported by a dedicated team of IT specialists. The authors acknowledge the support of the Open Source community, the commercial software vendors (specifically Accelrys, ACD/Labs, GGA Software, OpenEye Scientific Software and Dotmatics), and the many data providers, curators and users for their contributions to the development of the data content in terms of breadth and quality.



Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  1. Royal Society of Chemistry, Wake Forest, USA
