Parallelization and Distribution Strategies of Large Bioinformatics Requests over the Grid

  • Eddy Caron
  • Frédéric Desprez
  • Gaël Le Mahec
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5022)


This paper focuses on simultaneous scheduling of computation and data replication for life science applications on the grid. We present an adaptive algorithm based on the SRA algorithm (Static Joint Replication and Scheduling) [4] with more dynamicity for the jobs frequencies. The use of a linear program giving a databases mapping on the nodes and a jobs distribution schema, ensures us that our data placement and jobs distribution will be near the optimal solution, as long as the informations about the jobs frequencies are right. We validate our results with large jobs submissions simulations on a realistic platform.


High Performance Computing Execution Time Average Data Replication Data Placement Wait Time Average 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Bolze, R., Cappello, F., Caron, E., Daydé, M., Desprez, F., Jeannot, E., Jégou, Y., Lanteri, S., Leduc, J., Melab, N., Mornet, G., Namyst, R., Primet, P., Quetier, B., Richard, O., Talbi, E.-G., Touché, I.: Grid 5000: A Large Scale and Highly Reconfigurable Experimental Grid Testbed. International Journal of High Performance Computing Applications 20(4), 481–494 (2006)CrossRefGoogle Scholar
  2. 2.
    Cameron, D.G., Carvajal-Schiaffino, R., Millar, A.P., Nicholson, C., Stockinger, K., Zini, F.: Evaluating scheduling and replica optimisation strategies in OptorSim. In: Proc. Fourth International Workshop on Grid Computing, 2003, pp. 52–59 (2003)Google Scholar
  3. 3.
    Caron, E., Desprez, F.: Diet: A Scalable Toolbox to Build Network Enabled Servers on the Grid. International Journal of High Performance Computing Applications 20(3), 335 (2006)CrossRefGoogle Scholar
  4. 4.
    Desprez, F., Vernois, A.: Simultaneous Scheduling of Replication and Computation for Data-Intensive Applications on the Grid. J. of Grid Computing 4(1), 19–31 (2006)CrossRefGoogle Scholar
  5. 5.
    Donno, F., Gaido, L., Ghiselli, A., Prelz, F., Sgaravatto, M.: Datagrid prototype 1. In: TERENA Networking conference (June 2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Eddy Caron
    • 1
  • Frédéric Desprez
    • 2
  • Gaël Le Mahec
    • 3
  1. 1.University of Lyon, ENS Lyon, INRIA 
  2. 2.LIP. UMR 5668, ENS Lyon, INRIA, CNRS, UCBLFrance
  3. 3.LPC de Clermont-Ferrand, CNRS, IN2P3UBP Université Blaise PascalFrance

Personalised recommendations