Abstract
WISDOM is an international initiative to enable a virtual screening pipeline on a Grid infrastructure. Its first attempt was to deploy large scale in silico docking on a public Grid infrastructure. Protein–ligand docking is about computing the binding energy of a protein target to a library of potential drugs using a scoring algorithm. Previous deployments were either limited to one cluster, to Grids of clusters in the tightly protected environment of a pharmaceutical laboratory or to desktop Grids. The first large scale docking experiment ran on the EGEE Grid production service from 11 July 2005 to 19 August 2005 against targets relevant to research on malaria and saw over 41 million compounds docked for the equivalent of 80 years of CPU time. Up to 1,700 computers were simultaneously used in 15 countries around the world. Issues related to the deployment and the monitoring of the in silico docking experiment as well as experience with Grid operation and services are reported in the paper. The main problem encountered for such a large scale deployment was the Grid infrastructure stability. Although the overall success rate was above 80%, a lot of monitoring and supervision was still required at the application level to resubmit the jobs that failed. But the experiment demonstrated how Grid infrastructures have a tremendous capacity to mobilize very large CPU resources for well targeted goals during a significant period of time. This success leads to a second computing challenge targeting avian flu neuraminidase N1.
Similar content being viewed by others
References
Spencer, R.W.: High throughput virtual screening of historic collections on the file size, biological targets, and file diversity. Biotechnol. Bioeng. 61, 61–67 (1998)
Anderson, A.C.: The process of structure-based drug design. Chem. Biol. 10, 787–797 (2003)
Lyne, P.D.: Structure-based virtual screening: an overview. Drug Discov. Today 7, 1047–1055 (2002)
Buyya, R., Branson, K., Giddy, J., Abramson, D.: The Virtual Laboratory: a toolset to enable distributed molecular modeling for drug design on the World-Wide Grid. Concurrency Computat. Pract. Exper. 15, 1–25 (2003)
Chien, A., Foster, I., Goddette, D.: Grid technologies empowering drug discovery. Drug Discov. Today 7 Suppl 20, 176–180 (2002)
Garcia Aristegui, D.J., Mendez Lorenzo, P., Valverde, J.R.: GROCK: high-throughput docking using LCG Grid tools. In: The 6th IEEE/ACM International Workshop on Grid Computing, 85–90 (2005)
Sudholt, W., Baldridge, K.K., Abramson, D., Enticott, C., Garic, S., Kondric, C., Nguyen, D.: Application of Grid computing to parameter sweeps and optimizations in molecular modelling. Future Gener. Comput. Syst. 21, 27–35 (2005)
Graham Richards, W.: Virtual screening using Grid computing: the screensaver project. Nat. Rev. Drug Discov. 1, 551–555 (2002)
Oram, A. (ed.) Peer-to-peer: harnessing the Power of Disruptive Technologies. O’Reilly Press, CA (2001)
Loewe, L.: Global computing for bioinformatics. Brief. Bioinform. 3, 377–388 (2002)
Peitsch, M. C., et al.: Informatics and knowledge management at the Novartis Institutes for BioMedical Research. SCIP 46, 1–4 (2004)
Jacq, N., et al.: Demonstration of In Silico Docking at a Large Scale on Grid Infrastructure. Studies in Health Technology and Informatics 120 155–157. http://wisdom.healthGrid.org/ (2006)
Solomonides, T., McClatchey, R., Breton, V., Legré, Y., Nørager, S.: White paper HealthGrid. In: Proceedings of HealthGrid 2005, IOS Press, 112 (2005)
Breton, V., Hofmann, M., Jacq, N.: Grid added value to address malaria. In: Proceedings of the 6th IEEE/ACM CCGrid Conference (2006)
Jacq, N., Blanchet, C., Combet, C., Cornillot, E., Duret, L., Kurata, K., Nakamura, H., Silvestre, T., Breton, V.: Grid as a bioinformatics tool. Parallel Comput. 30, 1093–1107 (2004)
Campana, S., et al.: Analysis of the ATLAS Rome Production Experience on the LHC computing Grid. In: IEEE International Conference on e-Science and Grid Computing (2005)
Bird, I., et al.: Operating the LCG and EGEE production Grids for HEP. In: Proceedings of the CHEP‘04 Conference (2004)
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990)
Sulakhe, D., et al.: GNARE: an environment for Grid-based high-throughput genome analysis. In: CCGrid 2005 BioGrid Workshop (2005)
Taufer, M., et al.: Predictor@home: a “Protein Structure Prediction Supercomputer” based on public-resource computing. IEEE Trans. Parallel Distrib. Syst. 17(8), 786–796 (2006)
Stefan, M., et al.: Folding@Home and Genome@Home: using distributed computing to tackle previously intractable problems in computational biology. citeseer.ist.psu.edu/589744.html (2002)
Chien, A., et al.: Grid technologies empowering drug discovery. Drug Discov. Today 7(20), 176–180 (2002)
Ziegler, R., Pharma GRIDs: Key to pharmaceutical innovation? In: Proceedings of the HealthGrid conference 2004 (2004).
Weisner, J., Ortmann, R., Jomaa, H., Schlitzer, M.: New Antimalarial drugs. Angew. Chem. Int. 42, 5274–5529 (2003)
Francis, S. E., Sullivan, D. J. Jr., Goldberg, D.E.: Hemoglobin metabolism in the malaria parasite plasmodium falciparum. Annu. Rev. Microbiol. 51, 97–123 (1997)
Coombs, G.H., Goldberg, D.E., Klemba, M., Berry, C., Kay, J., Mottram, J.C.: Aspartic proteases of plasmodium falciparum and other protozoa as drug targets. Trends Parasitol. 17, 532–537 (2001)
Silva, A.M., Lee, A.Y., Gulnik, S.V., Majer, P., Collins, J., Bhat, T.N., Collins, P.J., Cachau, R.E., Luker, K.E., Gluzman, I.Y., Francis, S.E., Oksman, A., Goldberg, D.E., Erickson, J.W.: Structure and inhibition of plasmepsin II, A haemoglobin degrading enzyme from Plasmodium falciparum. Proc. Natl. Acad. Sci. USA 93, 10034–10039 (1996)
Gagliardi, F., Jones, B., Grey, F., Bégin, M.E., Heikkurinen, M.: Building an infrastructure for scientific Grid computing: status and goals of the EGEE project, Philos. Trans. Royal Soc. Math. Phys. Eng. Sci. 363, 1729–1742 (2005)
The LCG Editorial Board: LHC Computing Grid Technical Design Report. CERN-LHCC-2005-024. LCG, France (2005)
Raman, R., Livny, M., Solomon, M.: Matchmaking: distributed resource management for high throughput computing. In: Proceedings of the 12th IEEE International Symposium on High-performance Distributed Computing (2003)
Rarey, M., Kramer, B., Lengauer, T., Klebe, G.: Predicting Receptor–Ligand interactions by an incremental construction algorithm. J. Mol. Biol. 261, 470–489 (1996)
Morris, G.M., Goodsell, D.S., Halliday, R.S., Huey, R., Hart, W.E., Belew, R.K., Olson, A.J.: Automated docking using a Lamarckian genetic algorithm and empirical binding free energy function. J. Computat. Chem. 19, 1639–1662 (1998)
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The protein data bank. Nucleic Acids Res. 28, 235–242 (2000)
Irwin, J.J., Shoichet, B.K.: ZINC – a free database of commercially available compounds for virtual screening. J. Chem. Inf. Model. 45(1), 177–182 (2005)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jacq, N., Salzemann, J., Jacq, F. et al. Grid-enabled Virtual Screening Against Malaria. J Grid Computing 6, 29–43 (2008). https://doi.org/10.1007/s10723-007-9085-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10723-007-9085-5