Journal of Molecular Modeling

, Volume 11, Issue 6, pp 532–541 | Cite as

A global resource for computational chemistry

  • Peter Murray-Rust
  • Henry S. Rzepa
  • James J. P. Stewart
  • Yong Zhang
Original Paper

Abstract

A modular distributable system has been built for high-throughput computation of molecular structures and properties. It has been used to process 250,000 compounds from the NCI database and to make the results searchable by structures and properties. The IUPAC/NIST InChI specification and algorithm has been used to index the structures and enforce integrity during computation. A number of novel features of the PM5 Hamiltonian were identified as a result of the high-throughput approach. The system and the data can be redistributed and reused and promote the value of computed data as a primary chemical resource

Figure Workflow schematic for conversion of an SDF file to a Mopac input file

.

Keywords

Workflow InChI NCI Mopac XML-CML 

Supplementary material

894_2005_278_MOESM1_ESM.zip (2.9 mb)
List of molecules in the NCI database (zip-file 3,1 MB)

References

  1. 1.
    Beck B, Horn A, Carpenter JE, Clark T (1998) J Chem Inf Comp Sci 38:1214–1217Google Scholar
  2. 2.
    Beck B (2000) Abstracts of papers, 220th ACS National Meeting, Washington DC, USA, August 20–24, COMP-151Google Scholar
  3. 3.
    Pear M, Berstein J, Li C, McDonald R (1996) Book of abstracts, 212th ACS National Meeting, Orlando, FL, August 25–29, CINF-028Google Scholar
  4. 4.
    Brown RD, Guner OF, Hahn M, Li H (1998) Book of abstracts, 216th ACS National Meeting, Boston, August 23–27, CINF-051Google Scholar
  5. 5.
    Richards WG (2002) Innovation: nature reviews drug discovery 1:551–555CrossRefGoogle Scholar
  6. 6.
    Hawick KA, Grove DA, Coddington PD, Buntine MA (2000) Int J Chem article 4Google Scholar
  7. 7.
    Hela M, Yousef YA, Afeneh AT (2002) J Comp Chem 23:966–976CrossRefGoogle Scholar
  8. 8.
    Zhang Y, Glen RC, Murray-Rust P, Rzepa HS, Townsend JA (2004) Abstracts of papers, 227th ACS National Meeting, Anaheim, CA, USA, March 28–April 1, CINF-032Google Scholar
  9. 9.
    Murray-Rust P, Rzepa HS, Zhang Y (2004) Org Biomol Chem 2:3192–3203PubMedCrossRefGoogle Scholar
  10. 10.
    Murray-Rust P, Rzepa HS, Williamson MJ, Willighagen EL (2004) J Chem Inf Comp Sci 44:462–469CrossRefGoogle Scholar
  11. 11.
    Jakalian A, Jack D, Bayly CI (2000) Abstracts of papers. Am Chem Soc COMP-001Google Scholar
  12. 12.
    Stewart JJP (2004) Abstracts of papers. Am Chem Soc, COMP-004Google Scholar
  13. 13.
    Murray-Rust P, Rzepa HS (1999) J Chem Inf Comp Sci 39:928–942CrossRefGoogle Scholar
  14. 14.
    Murray-Rust P, Rzepa HS (2003) J Chem Inf Comp Sci 43:757–772CrossRefGoogle Scholar
  15. 15.
    Sadowski J, Gasteiger J (1993) Chem Rev 93:2567–2581CrossRefGoogle Scholar
  16. 16.
    Linstrom PJ, Tchekhovskoi DV (2003) Abstracts of papers. Am Chem Soc, CINF-085Google Scholar
  17. 17.
    Stein SE, Heller SR, Tchekhovski D (2003) Nimes International Chemical Information Conference Proceedings, pp 131–143Google Scholar
  18. 18.
    Clark T, Martin S, Liefeld T (2004) Brief Bioinform 5:59–70PubMedCrossRefGoogle Scholar
  19. 19.
    Godden JW, Stahura FL, Bajorath J (2000) J Chem Inf Comp Sci 40:796–800CrossRefGoogle Scholar
  20. 20.
  21. 21.
    Tannenbaum T, Wright D, Miller K, Livny M (2002) Condor—a distributed job scheduler. In: Sterling T (ed) Beowulf cluster computing with linux. MIT Press, Cambridge, MAGoogle Scholar
  22. 22.
    Thain D, Tannenbaum T, and Livny M (2003) “Condor and the grid”. In: Fran B, Anthony JGH, Geoffrey F (eds) Grid computing: making the global infrastructure a reality. Wiley, New YorkGoogle Scholar
  23. 23.
  24. 24.
    For details of CMLComp and other evolving schemas, see http://cml.sourceforge.net/
  25. 25.
    Frey JG (2004) Abstracts of papers, 227th ACS National Meeting, CINF-025Google Scholar
  26. 26.
    Hughes G, Mills H, De Roure D, Frey JG, Moreau L, Schraefel MC, Smith G, Zaluska E (2004) Org Biomol Chem 2:3284–3293PubMedCrossRefGoogle Scholar
  27. 27.
    Berners-Lee T, Hendler J (2001) Nature 410:1023–1024CrossRefPubMedGoogle Scholar

Copyright information

© Springer-Verlag 2005

Authors and Affiliations

  • Peter Murray-Rust
    • 1
  • Henry S. Rzepa
    • 2
  • James J. P. Stewart
    • 3
  • Yong Zhang
    • 1
  1. 1.Department of ChemistryUnilever Centre for Molecular Informatics, University of CambridgeCambridgeUK
  2. 2.Department of ChemistryImperial CollegeLondonUK
  3. 3.Stewart Computational ChemistryColorado springsUSA

Personalised recommendations