Abstract
Replication systems require a state-transfer mechanism in order to recover crashed replicas and to integrate new ones into replication groups. This paper presents and evaluates efficient techniques for parallel state transfer in such systems. These techniques enable a faster integration of replicas and improve overall service availability. On the basis of previous work on distributed download in client-server and peer-to-peer systems, we obtain parallel state-transfer mechanisms for replicated objects. Our algorithms support static and dynamic distributed download of state without a priori knowledge about the state size. A non-blocking transfer minimises the time of service unavailability during state transfer. In addition, partial state capturing is presented as an additional technique that improves the parallel transfer of large states.
Chapter PDF
References
Peña Cabañas, L., Pavón Mestras, J.: PODDP 2000 and DDEP 2000. LNCS, vol. 2023. Springer, Heidelberg (2000)
Parrington, G.D., Shrivastava, S.K., Wheater, S.M., Little, M.C.: The Design and Implementation of Arjuna. Computing Systems 8(2), 255–308 (1995)
Maffeis, S.: Adding Group Communication and Fault-Tolerance to CORBA. In: Proc. of the Conf. on Object-Oriented Technologies (Monterey, CA) USENIX, pp. 135–146 (1995)
Malloth, C.P.: Conception and implementation of a toolkit for building fault-tolerant distributed applications in large scale networks. PhD thesis, EPFL (1996)
Birman, K.: Building secure and reliable network applications. Manning Publications Co., Greenwich (1997)
Ban, B.: Design and implementation of a reliable group communication toolkit for Java. Technical report, Dept. of Computer Science, Cornell University (1998)
Narasimhan, P., Moser, L., Melliar-Smith, P.M.: State Synchronization and Recovery for Strongly Consistent Replicated CORBA Objects. In: DSN, pp. 261–270 (2001)
Mishra, S., Peterson, L., Schlichting, R.: Consul: a communication substrate for fault-tolerant distributed programs. Distributed Systems Engineering 1(2), 87–103 (1993)
Castro, M.: Practical Byzantine Fault Tolerance. Ph.D., MIT, January 2001, Also as Technical Report MIT-LCS-TR-817 (2001)
Kemme, B., Bartoli, A., Babaoglu, Ö.: Online Reconfiguration in Replicated Databases Based on Group Communication. In: DSN ’01. Proc. of the 2001 Int. Conf. on Dependable Systems and Networks, pp. 117–130. IEEE Computer Society Press, Washington, DC (2001)
Jiménez-Peris, R., Patiño-Martínez, M., Alonso, G.: Non-Intrusive, Parallel Recovery of Replicated Data. In: SRDS ’02. Proc. of the 21st IEEE Symp. on Reliable Distributed Systems (SRDS’02), p. 150. IEEE Computer Society Press, Washington, DC (2002)
Rodriguez, P., Kirpal, A., Biersack, E.W.: Parallel-access for mirror sites in the Internet. In: INFOCOM 2000. Nineteenth Annual Joint Conf. of the IEEE Computer and Communications Societies. Proc. IEEE, vol. 2, pp. 864–873 (2000)
Rodriguez, P., Biersack, E.W.: Dynamic parallel access to replicated content in the internet. IEEE/ACM Trans. Netw. 10(4), 455–465 (2002)
Vazhkudai, S.: Distributed Downloads of Bulk, Replicated Grid Data. J. Grid Comput. 2(1), 31–42 (2004)
Xu, Z., Xianliang, L., Mengshu, H., Chuan, Z.: A speed-based adaptive dynamic parallel downloading technique. SIGOPS Oper. Syst. Rev. 39(1), 63–69 (2005)
Gkantsidis, C., Ammar, M., Zegura, E.: On the Effect of Large-Scale Deployment of Parallel Downloading. In: WIAPP ’03. Proc. of the The Third IEEE Workshop on Internet Applications, pp. 79–89. IEEE Computer Society Press, Washington, DC (2003)
Reiser, H.P., Kapitza, R., Domaschka, J., Hauck, F.J.: Fault-tolerant replication based on fragmented objects. In: Proc. of the 6th IFIP Int. Conf. on Distributed Applications and Interoperable Systems (DAIS 2006) (2006)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 IFIP International Federation for Information Processing
About this paper
Cite this paper
Kapitza, R., Zeman, T., Hauck, F.J., Reiser, H.P. (2007). Parallel State Transfer in Object Replication Systems. In: Indulska, J., Raymond, K. (eds) Distributed Applications and Interoperable Systems. DAIS 2007. Lecture Notes in Computer Science, vol 4531. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72883-2_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-72883-2_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72881-8
Online ISBN: 978-3-540-72883-2
eBook Packages: Computer ScienceComputer Science (R0)
