Scalable Repositories for Virtual Clusters

  • Paolo Anedda
  • Simone Leo
  • Massimo Gaggero
  • Gianluigi Zanetti
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6043)

Abstract

For a large class of scientific data analysis applications it is becoming important, due to the sheer size of datasets, to have the option to perform the analysis directly where the data are stored, rather than on remote computational clusters. A possible strategy is the use of virtual clusters, thus guaranteeing a high degree of isolation from the underlying physical computational structure, and a very compact initial description. Deploying, saving and restoring HPC dedicated virtual clusters introduces, however, a different class of requirements on the virtual machines managing infrastructure, in particular for what concerns storage I/O requirements, whose scalability boundaries are easily reached. Here we discuss an alternative approach based on a storage model that leverages the WORM (write once, read many) character of the data used by VM management to increase, in a scalable way, the aggregate data bandwidth available to virtual cluster level operations and provide preliminary results indicating that it is a viable solution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Paolo Anedda
    • 1
  • Simone Leo
    • 1
  • Massimo Gaggero
    • 1
  • Gianluigi Zanetti
    • 1
  1. 1.CRS4 Distributed Computing GroupPulaItaly

Personalised recommendations