Abstract
In distributed computing systems, processes on different hosts take checkpoints to survive failures. Mobile computing systems have new characteristics that require conventional distributed checkpointing schemes to be reconsidered. In this paper, we propose a low-cost coordinated checkpointing algorithm. During normal computation, as messages are transmitted, the checkpoint dependency information among mobile hosts is recorded in the corresponding mobile support stations. When a checkpointing procedure begins, the initiator informs the relevant mobile hosts concurrently, which minimizes the time needed to identify them. Moreover, compared with existing coordinated checkpointing schemes, our algorithm blocks the minimum number of mobile support stations during the identifying procedure. Simulation results show that the proposed algorithm outperforms other coordinated checkpointing schemes and can provide better system performance for mobile computing systems.
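The abstract does not specify the data structures involved, so the following is only an illustrative sketch of the general idea, not the authors' algorithm. It assumes each mobile support station keeps a map from a receiving host to the hosts that sent it messages, and that the initiator takes the transitive closure of those dependencies to find the hosts that must checkpoint. The names `SupportStation`, `record_message`, and `relevant_hosts` are hypothetical.

```python
from collections import defaultdict


class SupportStation:
    """Hypothetical mobile support station (MSS) that records
    checkpoint dependencies for the mobile hosts it relays to."""

    def __init__(self, name):
        self.name = name
        # receiver host -> set of hosts it has received messages from
        self.deps = defaultdict(set)

    def record_message(self, sender, receiver):
        # Assumed to be called when the MSS relays a computation
        # message to `receiver` during normal operation.
        self.deps[receiver].add(sender)


def relevant_hosts(initiator, stations):
    """Return the initiator plus every host it transitively depends on."""
    # Merge the per-station dependency maps into one view.
    merged = defaultdict(set)
    for mss in stations:
        for host, senders in mss.deps.items():
            merged[host] |= senders

    # Depth-first traversal over the dependency edges.
    seen = {initiator}
    frontier = [initiator]
    while frontier:
        host = frontier.pop()
        for sender in merged.get(host, ()):
            if sender not in seen:
                seen.add(sender)
                frontier.append(sender)
    return seen


mss_a = SupportStation("mss_a")
mss_b = SupportStation("mss_b")
mss_a.record_message("h2", "h1")  # h1 depends on h2
mss_b.record_message("h3", "h2")  # h2 depends on h3
mss_b.record_message("h5", "h4")  # unrelated to h1

print(sorted(relevant_hosts("h1", [mss_a, mss_b])))  # ['h1', 'h2', 'h3']
```

Because the dependency information is already held at the support stations, the full set of relevant hosts can be computed up front in this sketch, and checkpoint requests could then be sent to all of them at once instead of being forwarded hop by hop.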
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, G., Wang, H., Chen, J. (2004). A Low-Cost Checkpointing Scheme for Mobile Computing Systems. In: Li, Q., Wang, G., Feng, L. (eds) Advances in Web-Age Information Management. WAIM 2004. Lecture Notes in Computer Science, vol 3129. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27772-9_11
DOI: https://doi.org/10.1007/978-3-540-27772-9_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22418-1
Online ISBN: 978-3-540-27772-9
eBook Packages: Springer Book Archive