Skip to main content

Availability Management in Data Grid

  • Conference paper
  • First Online:
IT Convergence and Services

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 107))

Abstract

The data grids are highly distributed environments where nodes are geographically distributed across the globe and shared data are generally very large. The use of replication techniques ensure better availability and easy access to data handled in the grids. In this article, we propose a dynamic replication strategy based on availability and popularity, this replication takes into account failures in the system. The minimum degree of replication is specified by a certain probability of availability and the maximum degree is controlled by the popularity of the data, we introduced also the concept of dynamic primary replica that is used to ensure availability without increasing recovery time. We show in this article that the proposed strategy improves the availability of data according to its popularity and at the same time it improves system performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Lei M, Vrbsky S (2008) An on-line replication strategy to increase availability in data grids. Future Gener Comput Syst 24(2):85–98

    Article  MATH  Google Scholar 

  2. Dabek F, Kaashoek MF, Karger D, Morris R, Stoica I (2001) Wide-area cooperative storage with CFS. In: Proceedings of the 18th ACM symposium on operating systems principles, Banff, Canada, Oct 2001, pp 202–215

    Google Scholar 

  3. Haeberlen A, Mislove A, Druschel P (2005) Glacier: Highly durable, decentralized storage despite massive correlated failures. In: Proceedings of the Second USENIX symposium on networked systems design and implementation, Boston, May 2005, pp 143–158

    Google Scholar 

  4. Ghemawat S, Gobioff H, Leung ST (2003) The google file system. In: Proceedings of the 19th ACM symposium on operating systems principles, Bolton Landing, NY, Oct 2003, pp 29–43

    Google Scholar 

  5. Nath S, Yu H, Gibbons PB, Seshan S (2006) Subtleties in tolerating correlated failures in wide-area storage systems. In: Proceedings of the third USENIX symposium on networked systems design and implementation, San Jose, CA, May 2006, pp 225–238

    Google Scholar 

  6. Yu H, Gibbons PB, Nath S (2006) Availability of multi-object operations. In: Proceedings of the third USENIX symposium on networked systems design and implementation, San Jose, CA, May 2006, pp 211–224

    Google Scholar 

  7. Min Park S, Kim J-H, Ko Y-B, Yoon W-S (2003) Dynamic data grid replication strategy based on internet hierarchy. Second international workshop on grid and cooperative computing (GCC’2003) Shanghai, China, Dec

    Google Scholar 

  8. Madi KM, Hassan S (2008) Dynamic replication algorithm in data grid: survey. In: International conference on network applications, protocols and services 2008 (NetApps2008), ISBN 978-983-2078-33-3, on 21–22 Nov 2008

    Google Scholar 

  9. Douceur JR, Wattenhofer RP (2001) Competitive hill-climbing strategies for replica placement in a distributed file system. In: Proceedings of the 15th international symposium on distributed computing, Lisboa, Portugal, Oct 2001, pp 48–62

    Google Scholar 

  10. Huu T, Segarra M-T, Gilliot J-M (2008) Un système adaptatif de placement de données, In: CFSE’6, Fribourg, Switzerland, 11–13 Feb 2008

    Google Scholar 

  11. Lamhamedi H, Szymansky B, Shentu Z, Deelman E. (2002) Data replication strategies in grid environments. In: Proceedings of the 5th international conference on algorithms and architectures for parallel processing (ICA3PP’02) IEEE CS Press, Los Alamitos

    Google Scholar 

  12. Meroufel B (2011) Fault tolerance in data grid. These Master. University of Oran, Alegria, March

    Google Scholar 

  13. Lei M, Vrbsky S (2006) A data replication strategy to increase availability in data Grids. In: Grid computing and applications, Las Vegas, NV, pp. 221–227

    Google Scholar 

  14. Foster I (2002) The grid: a new infrastructure for 21st century science. Phys Today 55(2):42–47

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ghalem Belalem .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer Science+Business Media B.V.

About this paper

Cite this paper

Meroufel, B., Belalem, G. (2011). Availability Management in Data Grid. In: Park, J., Arabnia, H., Chang, HB., Shon, T. (eds) IT Convergence and Services. Lecture Notes in Electrical Engineering, vol 107. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2598-0_5

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-2598-0_5

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-2597-3

  • Online ISBN: 978-94-007-2598-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics