Merging File Systems and Data Bases to Fit the Grid

  • Yves Denneulin
  • Cyril Labbé
  • Laurent d’Orazio
  • Claudia Roncancio
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6265)

Abstract

Grids are widely used by CPU-intensive applications that need to access data both through high-level queries and in a file-based manner. Their requirements include accessing data through metadata of different kinds, whether system-level or application-level. In addition, grids provide large storage capabilities and support cooperation between sites. However, these solutions are relevant only if they deliver good performance. This paper presents Gedeon, a middleware that proposes a hybrid approach to scientific data management for grid infrastructures. This hybrid approach merges the functionalities of distributed file systems and distributed databases, thus offering semantically enriched data management while preserving ease of use and deployment. Taking advantage of this hybrid approach, advanced cache strategies are deployed at different levels to provide efficiency. Gedeon has been implemented, tested, and used in the bioinformatics field.

Keywords

Cloud Computing, Virtual Node, Globus Toolkit, Legacy Application, Synthetic Workload
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Yves Denneulin (1)
  • Cyril Labbé (1, 3)
  • Laurent d’Orazio (2)
  • Claudia Roncancio (1)

  1. LIG, Grenoble University, France
  2. LIMOS, Blaise Pascal University, France
  3. DSSE, Monash University, Melbourne, Australia
