The Royal Society of Chemistry and the delivery of chemistry data repositories for the community
Since 2009 the Royal Society of Chemistry (RSC) has been delivering access to chemistry data and cheminformatics tools via the ChemSpider database and has garnered a significant community following in terms of usage and contribution to the platform. ChemSpider has focused only on those chemical entities that can be represented as molecular connection tables or, to be more specific, the ability to generate an InChI from the input structure. As a structure centric hub ChemSpider is built around the molecular structure with other data and links being associated with this structure. As a result the platform has been limited in terms of the types of data that can be managed, and the flexibility of its searches, and it is constrained by the data model. New technologies and approaches, specifically taking into account a shift from relational to NoSQL databases, and the growing importance of the semantic web, has motivated RSC to rearchitect and create a more generic data repository utilizing these new technologies. This article will provide an overview of our activities in delivering data sharing platforms for the chemistry community including the development of the new data repository expanding into more extensive domains of chemistry data.
KeywordsData repository ChemSpider Crowdsourcing InChI NoSQL
ChemSpider is the result of the aggregate work of many contributors extending outside of our own team. Our RSC platforms are supported by a dedicated team of IT specialists. The authors acknowledge the support of the Open Source community, the commercial software vendors (specifically Accelrys, ACD/Labs, GGA Software, OpenEye Scientific Software, Dotmatics and many data providers, curators and users for their contributions to the development of the data content in terms of breadth and quality.
- 3.Williams AJ (2010) ChemSpider: integrating structure-based resources distributed across the internet. In: Belford R, Moore JW, Pence HE (eds) Enhancing learning with online resources, social networking, and digital libraries, vol 1060. American Chemical Society, Washington, p 23CrossRefGoogle Scholar
- 4.The IUPAC International Chemical Identifier (InChI). http://www.iupac.org/inchi/. Accessed 16 April 2014
- 5.PubMed. http://www.ncbi.nlm.nih.gov/pubmed/. Accessed 16 April 2014
- 6.Google Scholar. http://scholar.google.com/. Accessed 16 April 2014
- 7.Google Patents. http://www.google.com/patents. Accessed 16 April 2014
- 8.Published JCAMP-DX Protocols. http://www.jcamp-dx.org/protocols.html. Accessed 16 April 2014
- 12.PharmaSea. http://www.pharma-sea.eu/. Accessed 16 April 2014
- 13.Chemical Database Service. http://cds.rsc.org. Accessed 16 April 2014
- 14.ChemSpider Synthetic Pages. http://cssp.chemspider.com. Accessed 16 April 2014
- 15.BaseX: The XML Database. http://www.basex.org. Accessed 16 April 2014
- 16.MongoDB. http://www.mongodb.org/. Accessed 16 April 2014