Study of Scheduling Strategies in a Dynamic Data Grid Environment

  • R. A. Dheepak
  • Shakeb Ali
  • Shubhashis Sengupta
  • Anirban Chakrabarti
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3326)

Abstract

Data grids seek to harness geographically distributed resources for large-scale data-intensive problems. Such problems involve loosely coupled jobs and large data sets mostly distributed geographically. Data grids have found applications in scientific research, in the field of high-energy Physics, Life Sciences etc. The issues that need to be considered in the data grid research area include: resource management including computation management and data management. Computation management include scheduling of jobs, scalability, response time involved in such scheduling, while data management include data replication in selected sited, data movement when required. Therefore, scheduling and replication assumes great importance in a data grid environment. In this paper, we have developed several scheduling strategies based on a developed replication strategy. The scheduling strategies are called Matching based Scheduling (MJS), Cost base Scheduling (CJS) and Latency based Scheduling (LJS). Among these, LJS and CJS perform similarly and MJS performs worse than both of them.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Chervenak, Foster, I., Kesselman, C., Salisbury, C., Tuecke, S.: The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets. Journal of Network and Computer Applications 23, 187–200 (2001)CrossRefGoogle Scholar
  2. 2.
    Foster, Kesselman, C.: The Globus Project: A Status Report. In: Proc. IPPS/SPDP 1998 Heterogeneous Computing Workshop, pp. 4–18 (1998)Google Scholar
  3. 3.
    Casanova, H., Obertelli, G., Berman, F., Wolski, R.: The AppLeS Parameter Sweep Template: User-Level Middleware for the Grid. In: Proceedings of SuperComputing 2000 (2000)Google Scholar
  4. 4.
    Alhusaini, A.H., Prasanna, V.K., Raghavendra, C.S.: A Unified Resource Scheduling Framework for Heterogeneous Computing Environments. In: Eighth Heterogeneous Computing Workshop (1999)Google Scholar
  5. 5.
    Ranganathan, K., Foster, I.: Identifying dynamic replication strategies for a high-performance data grid. In: Lee, C.A. (ed.) GRID 2001. LNCS, vol. 2242, p. 75. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  6. 6.
    Chakrabarti, A., Dheepak, R.A., Sengupta, S.: Integration of Scheduling and Replication in Data Grids. In: Bougé, L., Prasanna, V.K. (eds.) HiPC 2004. LNCS, vol. 3296, pp. 375–385. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  7. 7.
    Bell, W.H., Cameron, D.G., Capozza, L., Millar, A.P., Stockinger, K., Zini, F.: Simulation of dynamic grid replication strategies in optorSim. In: Parashar, M. (ed.) GRID 2002. LNCS, vol. 2536, pp. 46–57. Springer, Heidelberg (2002)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • R. A. Dheepak
    • 1
  • Shakeb Ali
    • 1
  • Shubhashis Sengupta
    • 1
  • Anirban Chakrabarti
    • 1
  1. 1.Software Engineering and Technology LaboratoryInfosys Technologies Ltd.BangaloreIndia

Personalised recommendations