Journal of Grid Computing

, Volume 4, Issue 2, pp 209–222 | Cite as

GeneGrid: Architecture, Implementation and Application

  • P. V. Jithesh
  • P. Donachy
  • T. Harmer
  • N. Kelly
  • R. Perrott
  • S. Wasnik
  • J. Johnston
  • M. McCurley
  • M. Townsley
  • S. McKee


The emergence of Grid computing technology has opened up an unprecedented opportunity for biologists to share and access data, resources and tools in an integrated environment leading to a greater chance of knowledge discovery. GeneGrid is a Grid computing framework that seamlessly integrates a myriad of heterogeneous resources spanning multiple administrative domains and locations. It provides scientists an integrated environment for the streamlined access of a number of bioinformatics programs and databases through a simple and intuitive interface. It acts as a virtual bioinformatics laboratory by allowing scientists to create, execute and manage workflows that represent bioinformatics experiments. A number of cooperating Grid services interact in an orchestrated manner to provide this functionality. This paper gives insight into the details of the architecture, components and implementation of GeneGrid.

Key words

Bioinformatics GeneGrid Globus Grid computing Virtual Bioinformatics Laboratory 



open Grid services architecture


service oriented architecture


open Grid services infrastructure


simple object access protocol


GeneGrid application manager service factory


GeneGrid application manager service


open Grid services architecture-database access and integration


GeneGrid data manager service factory


GeneGrid data manager service


GeneGrid workflow definition database


GeneGrid status tracking, results & input parameters database


GeneGrid workflow manager service factory


GeneGrid workflow manager service


GeneGrid node monitor


GeneGrid application and resources registry


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Genomes OnLine Database, see website
  2. 2.
    Foster, I., Kesselman, C., Tuecke, S.: The anatomy of the Grid: Enabling scalable virtual organisations. Int. J. Supercomput. Appl. 15(3) (2003)Google Scholar
  3. 3.
    Foster, I., Kesselman, C., Nick, J., Tuecke, S.: The physiology of the Grid: An open Grid services architecture for distributed systems integration. Open Grid Service Infrastructure WG, Global Grid Forum, June 22 (2002)Google Scholar
  4. 4.
    Foster, I.: Service-oriented science. Science 308, 814–817 (6 May 2005)CrossRefGoogle Scholar
  5. 5.
    Foster, I., Kesselman, C.: Globus: A metacomputing infrastructure toolkit. Int. J. Supercomput. Appl. 11, 115–128 (1997)CrossRefGoogle Scholar
  6. 6.
    Tuecke, S., Czajkowski, K., Foster, I., Frey, J., Graham, S., Kesselman, C., Maguire, T., Sandholm, T., Vanderbilt, P., Snelling, D.: Open Grid services infrastructure (OGSI) Version 1.0. Global Grid Forum Draft Recommendation, 6/27/2003Google Scholar
  7. 7.
    Biomedical Informatics Research Network, see website
  8. 8.
    Cancer Biomedical Informatics Grid, see website
  9. 9.
    myGrid, see website
  10. 10.
    North Carolina BioGrid, see website
  11. 11.
    Bio-GRID, see website
  12. 12.
    Donachy, P., Harmer, T.J., Perrott, R.H., et al.: Grid based virtual bioinformatics laboratory. In: Proceedings of the UK e-Science All Hands Meeting, Nottingham, pp. 111–116, 2003Google Scholar
  13. 13.
    Joseph, J., Ernest, M., Fellenstein, C.: Evolution of Grid computing architecture and Grid adoption models. IBM Syst. J. 43, 624–645 (2004)CrossRefGoogle Scholar
  14. 14.
    Jithesh, P.V., Kelly, N., Simpson, D.R., et al.: Bioinformatics application integration and management in genegrid: Experiments and experiences. In: Proceedings of UK e-Science All Hands Meeting, Nottingham, pp. 563–570, 2004Google Scholar
  15. 15.
    Altschul, S.F., et al.: Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (Sep 1. 1997)CrossRefGoogle Scholar
  16. 16.
    Thompson, J.D., Higgins, D.G., Gibson, T.J.: CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673–4680 (Nov 11. 1994)CrossRefGoogle Scholar
  17. 17.
    Eddy, S.R.: Profile hidden Markov models. Bioinformatics 14, 755–763 (1998)CrossRefGoogle Scholar
  18. 18.
    Rice, P., Longden, I., Bleasby, A.: EMBOSS: The European molecular biology open software suite. Trends Genet. 16, 276–277 (2000)CrossRefGoogle Scholar
  19. 19.
    Krogh, A., et al.: Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. J. Mol. Biol. 305(3), 567–580 (January 2001)CrossRefGoogle Scholar
  20. 20.
    Bendtsen, J.D., Nielsen, H., von Heijne, G., Brunak, S.: Improved prediction of signal peptides: SignalP 3.0. J. Mol. Biol. 340, 783–795 (Jul 16. 2004)CrossRefGoogle Scholar
  21. 21.
    Darling, A., Carey, L., Feng, W.: The design, implementation, and evaluation of mpiBLAST. In: ClusterWorld Conference & Expo in conjunction with the 4th International Conference on Linux Clusters: The HPC Revolution 2003, San Jose, CA, June 2003Google Scholar
  22. 22.
    Stajich, J.E., et al.: The bioperl toolkit: Perl modules for the life sciences. Genome Res. 12, 1611–1618 (October 2002)CrossRefGoogle Scholar
  23. 23.
    OGSA-DAI Project, see website
  24. 24.
    Kanz, C., Aldebert, P., Althorpe, N., et al.: The EMBL nucleotide sequence database. Nucleic Acids Res. 33 Database Issue, D29–D33 (Jan 1. 2005)CrossRefGoogle Scholar
  25. 25.
    Apweiler, R., Bairoch, A., Wu, C.H., et al.: UniProt: The universal protein knowledgebase. Nucleic Acids Res. 32, D115–D119 (Jan 1. 2004)CrossRefGoogle Scholar
  26. 26.
    The Gene Ontology Consortium. Gene ontology: Tool for the unification of biology. Nat. Genet. 25, 25–29 (2000)CrossRefGoogle Scholar
  27. 27.
    Wolfgang Meier. eXist: An open source native XML database. In: Chaudri, A.B., Jeckle, M., Rahm, E., Unland, R. (eds.) Web, Web-Services, and Database Systems. NODe 2002 Web- and Database-Related Workshops, Erfurt, Germany, October 2002Google Scholar
  28. 28.
    MySQL, see website
  29. 29.
    Novotny, J., Russell, M., Wehrens, O.: GridSphere: An advanced portal framework. In: Proceedings of EuroMicro Conference, pp. 412–419, 2004Google Scholar

Copyright information

© Springer Science + Business Media B.V. 2006

Authors and Affiliations

  • P. V. Jithesh
    • 1
  • P. Donachy
    • 1
  • T. Harmer
    • 1
  • N. Kelly
    • 1
  • R. Perrott
    • 1
  • S. Wasnik
    • 1
  • J. Johnston
    • 2
  • M. McCurley
    • 2
  • M. Townsley
    • 2
  • S. McKee
    • 3
  1. 1.Belfast e-Science Centre, Computer ScienceThe Queen’s University of BelfastBelfastUK
  2. 2.Fusion Antibodies Ltd.BelfastUK
  3. 3.Amtec Medical Ltd.AntrimUK

Personalised recommendations