Advertisement

GOCDB4, a New Architecture for the European Grid Infrastructure

  • Gilles Mathieu
  • John Casson
Conference paper

Abstract

The tool known as GOCDB, or Grid Operations Centre Data Base, consists of a central authoritative database which contains static Grid resource and topology related data. It stores information about regions, countries, sites, nodes, services and users, and links this information together in a logical way. Within the past years, GOCDB has imposed itself as a central authoritative repository for topology and site information within EGEE and WLCG. As for all other operational tools, the dramatic evolution of EGEE in order to prepare for a sustainable European Grid Infrastructure imposed many changes on GOCDB architecture. One of these changes is the requirement for a distributed architecture, where a central system can collect and display information maintained by regional instances of the system: in May 2010, GOCDB will become the official central topology repository for the European Grid Initiative (EGI), and will propose a regionalised model that will allow National Grids (NGI) to run their own instance of the system while keeping synchronised with the central EGI repository. These new requirements along with the limitations of GOCDB old model (known as GOCDB-3) led us to adopt an innovative database design based on a pseudo object database model (PROM). In this model, constraints and relations are built using meta-data. This allows for a large flexibility in the database schema, thus enabling different instances of the same tool to store different schemas while remaining interoperable. On top of this, a PHP-written input/output module gets and retrieves data in XML format, making the whole system as standard and configurable as possible. After reviewing GOCDB-3 architecture and explaining its limitations and the reasons for changing, the paper will describe GOCDB-4 inner architecture from general system overview down to technical details on database design, application level and standard interfaces. It will show how flexibility is achieved through the use of XML configuration files. Pros and cons of adopted model will also be assessed. The paper will finally review the overall distributed architecture and distribution scenarios, as well as interactions between GOCDB-4 and similar tools.

Keywords

Database Schema Grid Infrastructure Regional Instance Central Instance European Grid 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Laure E, Jones R, “Enabling Grids for e-Science: The EGEE Project”. EGEE-PUB-2009-001, http://cdsweb.cern.ch/record/1128647
  2. 2.
    The Worldwide LHC Computing Grid (WLCG), http://lcg.web.cern.ch/LCG
  3. 3.
    Mathieu G, Richards A, Gordon J, Del Cano Novales C, Colclough P, Viljoen M:“GOCDB, a Topology Repository for a Worldwide Grid Infrastructure”, in Proceedings of Computing in High Energy and Nuclear Physics (CHEP09), Prague, Czech Republic, March 2009Google Scholar
  4. 4.
    Berkeley Database Information Index (BDII), 12https://twiki.cern.ch/twiki/bin/view/EGEE/BDII
  5. 5.
    Czajkowski K, Fitzgerald S, Foster I and Kesselman, C, “Grid information services for distributed resource sharing” in Proceedings of the 10th IEEE Int. Symposium on High performance distributed computing (HPDC-10), San Francisco, CA, 7–9 August 2001Google Scholar
  6. 6.
    OSG Information management System (OIM), http://oim.grid.iu.edu
  7. 7.
    The Open Science Grid Consortium (OSG), http://www.opensciencegrid.org
  8. 8.
    Aidel O, Cavalli A, Cordier H, L’Orphelin C, Mathieu G, Pagano A, Reynaud S, “CIC portal: a Collaborative and Scalable Integration Platform for High Availability Grid Operations”, in Proceedings of The 8th IEEE/ACM International Conference on Grid Computing (Grid 2007), Austin TX, US, Sept. 2007Google Scholar
  9. 9.
    Hierarchical Grid Site Management (HGSM), http://hgsm.sourceforge.net
  10. 10.
    South Eastern European Grid Infrastructure Development (SEE-GRID),http://www.see-grid.org
  11. 11.
    Cordier H, Mathieu G, Schaer F, Novak J, Nyczyk P, Schulz M and Tsai MH, “Grid Operations: the evolution of operational model over the first year”, In Proceedings of Computing in High Energy and Nuclear Physics (CHEP06), Mumbai, India, 2006.Google Scholar
  12. 12.
    The European Grid Initiative (EGI) design study, http://web.eu-egi.eu
  13. 13.
    Casey J et al, “Operations Automation Strategy”, https://edms.cern.ch/document/927171
  14. 14.
    Colclough P and Mathieu G, “A pseudo object database model and its applications on a highly complex distributed architecture”, In Proceedings of the IARA 1st international conference on advances in databases, knowledge and data applications (DBKDA 2009), Cancun, Mexico, March 2009Google Scholar
  15. 15.
  16. 16.
    Representational State Transfer (REST) architecture, http://en.wikipedia.org/wiki/REST
  17. 17.
  18. 18.
  19. 19.
    Apache ActiveMQ, http://activemq.apache.org
  20. 20.
    Casey J et al, “MSG - A messaging system for efficient and scalable grid monitoring”, EGEE User Forum, March 09.Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  • Gilles Mathieu
    • 1
  • John Casson
    • 1
  1. 1.STFC, Rutherford Appleton LaboratoryOxfordshireUK

Personalised recommendations