Brief Announcement: A Concurrent Partial Snapshot Algorithm for Large-Scale and Dynamic Distributed Systems

  • Yonghwan Kim
  • Tadashi Araragi
  • Junya Nakamura
  • Toshimitsu Masuzawa
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6976)

Abstract

We propose a concurrent partial snapshot algorithm (CSS algorithm) that extends a previously proposed sub-snapshot algorithm (SSS algorithm) [2] with a method for merging multiple snapshots initiated concurrently by different nodes. Efficient merging algorithms have already been introduced for the Chandy-Lamport snapshot algorithm (CL algorithm) [1] in earlier work [5,6]. The main issue for our merging algorithm, by contrast, is coping with dynamic situations on the basis of the SSS algorithm. The SSS algorithm extends the CL algorithm to allow snapshots in large-scale and dynamic situations. A dynamic situation means that nodes can join and leave freely during the execution of a snapshot algorithm. A snapshot algorithm for the dynamic situation has also been proposed [4]; however, in that algorithm, nodes must stop sending application messages while the snapshot algorithm is executing. Moreover, when snapshots are initiated concurrently, it has to cancel some of the snapshot executions. Our algorithm removes both of these restrictions.
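For readers unfamiliar with the CL algorithm [1] that the abstract builds on, the following is a minimal sketch of its marker rule. It is not the SSS or CSS algorithm: it assumes a static, fully connected network with FIFO channels and a single initiator, so it handles neither joining and leaving nodes nor concurrent initiators. The names `Node`, `Network`, and `snapshot_total` are hypothetical and chosen only for illustration.

```python
from collections import deque

MARKER = "MARKER"  # control message, distinct from integer application payloads


class Node:
    def __init__(self, name, balance):
        self.name = name
        self.balance = balance
        self.recorded_state = None  # local state captured by the snapshot
        self.channel_state = {}     # sender -> messages recorded as in transit
        self.recording = set()      # incoming channels still being recorded


class Network:
    """Assumed setting: static nodes, one FIFO channel per ordered pair."""

    def __init__(self, balances):
        self.nodes = {n: Node(n, b) for n, b in balances.items()}
        self.channels = {(s, d): deque()
                         for s in self.nodes for d in self.nodes if s != d}

    def send(self, src, dst, amount):
        # Application message: move `amount` units from src to dst.
        self.nodes[src].balance -= amount
        self.channels[(src, dst)].append(amount)

    def _record_and_send_markers(self, name):
        # Record the local state, start recording every incoming channel,
        # and send a marker on every outgoing channel.
        node = self.nodes[name]
        node.recorded_state = node.balance
        node.recording = {s for s in self.nodes if s != name}
        node.channel_state = {s: [] for s in node.recording}
        for dst in self.nodes:
            if dst != name:
                self.channels[(name, dst)].append(MARKER)

    def initiate(self, name):
        self._record_and_send_markers(name)

    def deliver(self, src, dst):
        # Deliver the oldest message on channel src -> dst.
        msg = self.channels[(src, dst)].popleft()
        node = self.nodes[dst]
        if msg == MARKER:
            if node.recorded_state is None:
                self._record_and_send_markers(dst)
            # A marker closes the recording of the channel it arrived on.
            node.recording.discard(src)
        else:
            node.balance += msg
            if src in node.recording:
                node.channel_state[src].append(msg)

    def run_to_completion(self):
        # Keep delivering until every channel is empty.
        progressed = True
        while progressed:
            progressed = False
            for (src, dst), queue in self.channels.items():
                if queue:
                    self.deliver(src, dst)
                    progressed = True

    def snapshot_total(self):
        # Units held in recorded local states plus units recorded in channels.
        local = sum(n.recorded_state for n in self.nodes.values())
        in_transit = sum(sum(msgs) for n in self.nodes.values()
                         for msgs in n.channel_state.values())
        return local + in_transit


net = Network({"p": 10, "q": 20, "r": 30})
net.send("p", "q", 5)   # already in flight when the snapshot starts
net.initiate("p")
net.send("q", "r", 7)   # sent while the snapshot is still running
net.run_to_completion()
print(net.snapshot_total())  # equals the initial total of 60
```

Application messages keep flowing while the snapshot runs, and the recorded local and channel states still add up to the initial total, which is the consistency property a snapshot has to preserve.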

References

  1. Chandy, K., Lamport, L.: Distributed snapshots: Determining global states of distributed systems. ACM Trans. Computer Systems 3(1), 63–75 (1985)
  2. Moriya, S., Araragi, T.: Dynamic Snapshot Algorithm and Partial Rollback Algorithm for Internet Agents. In: DISC 2001 Brief Announcements, pp. 23–28 (2001)
  3. Moriya, S., Araragi, T.: Dynamic Snapshot Algorithm and Partial Rollback Algorithm for Internet Agents. IEICE Transactions on Information and Systems J86-D-I, 301–317 (2003) (in Japanese)
  4. Koo, R., Toueg, S.: Checkpointing and Roll-back Recovery for Distributed Systems. IEEE Transactions on Software Engineering 13(1), 23–31 (1987)
  5. Spezialetti, M., Kearns, P.: Efficient Distributed Snapshots. In: Proceedings of the 6th International Conference on Distributed Computing Systems, pp. 382–388 (1986)
  6. Prakash, R., Singhal, M.: Maximal Global Snapshot with Concurrent Initiators. In: Proceedings of the 6th IEEE Symposium of Parallel and Distributed Processing, pp. 334–351 (1994)

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Yonghwan Kim (1)
  • Tadashi Araragi (2)
  • Junya Nakamura (1)
  • Toshimitsu Masuzawa (1)

  1. Graduate School of Information Science and Technology, Osaka University, Japan
  2. NTT Communication Science Laboratories, Kyoto, Japan
