Abstract
Fault tolerance is an important design criterion for reliable and robust video-on-demand systems. Conventional fault-tolerant designs use either a primary backup or an active replication method to provide system fault tolerance. However, these approaches suffer from low utilization of the backup or replication system. In this paper we propose two playback-recovery schemes for distributed video-on-demand systems called the forward playback-recovery scheme and the backward playback-recovery scheme. Unlike conventional fault-tolerant designs, our schemes use existing playback resources to recover faulty playbacks without allocating new resources, significantly reducing recovery overhead. To use the schemes effectively, we developed a distributed algorithm for determining the order and gap information between the playbacks on the distributed video-on-demand servers so that overhead for recovering from a server failure can be minimized. This algorithm achieves N − 1 fault-tolerant resiliency for N-server video-on-demand systems. In addition, three server-recovery policies are also presented to guide surviving servers in applying the proper scheme to recover faulty playbacks, thus reducing overall recovery costs. Simulation results show that the proposed recovery schemes are effective and useful in designing fault-tolerant multiple-server video-on-demand systems.
Similar content being viewed by others
References
J.W. Byun and T.T. Lee, “The design and analysis of an ATMmulticast switch with adaptive traffic controller,” IEEE/ACM Trans. on Networking, Vol. 2, No. 3, pp. 288–298, 1994.
M.S. Chen, D.D. Kandlur, and P.S. Yu, “Storage and retrieval methods to support fully interactive playout in a disk-array-based video server,” Multimedia Systems, Vol. 3, pp. 126–135, 1995.
F. Cristian, “Understanding fault-tolerant distributed systems,” Comm. of the ACM, Vol. 34, pp. 56–78, 1991.
A. Dan, D. Sitaram, and P. Shahabuddin, “Dynamic batching policies for an on-demand video server,” Multimedia Systems, Vol. 4, No. 3, pp. 112–121, 1996.
W. Effelsberg and T. Haerder, “Principles of database buffer management,” ACMTrans. on Database Systems, Vol. 9, No. 4, pp. 560–595, 1984.
C. Federighi and L.A. Rowe, “A distributed hierarchical storage manager for a video-on-demand system,” in Proc. of IS&T/SPIE, San Jose, CA, 1994.
E.A. Fox, “The coming revolution of interactive digital video,” Comm. of the ACM, Vol. 32, pp. 794–801, 1989.
E.A. Fox, “Standards and emergence of digital multimedia systems,” Comm. of the ACM, Vol. 34, pp. 26–30, 1991.
B. Furht, “Multimedia systems: An overview,” IEEE Multimedia, pp. 47–59, 1994.
D. Le Gall, “MPEG:Avideo compression standard for multimedia applications,” Comm. of theACM,Vol. 34, No. 4, pp. 46–58, 1991.
E. Gelenbe, D. Finkel, and S. Tripathi, “Availability of a distributed computer system with failures,” Acta Informatica, Vol. 23, pp. 643–655, 1986.
D.J. Gemmell, “Multimedia storage servers: A tutorial,” IEEE Multimedia, pp. 40–49, 1995.
L. Golubchik, C.S. Lui, and R. Muntz, “Adaptive piggybacking: A novel technique for data sharing in video-on-demand storage servers,” Multimedia Systems, Vol. 4, No. 3, pp. 140–155, 1996.
J. Huang and L.A. Stankovic, “Buffer management in real-time database,” Department of Computer Science, University of Massachusetts, Technique Report, pp. 90–65, July 1990.
Y. Huang and S.K. Tripathi, “Resource allocation for primary-site fault-tolerant systems,” IEEE Trans. on Software Engineering, Vol. 19, No. 2, 1993.
J.C. Laprie, J. Arlat, and C. Beounes, “Definition and analysis of hardware-and software-fault-tolerant architectures,” IEEE Computer, Vol. 23, pp. 39–51, 1990.
T.D.C. Little and D. Venkatesh, “Popularity-based assignment of movies to storage devices in a video-ondemand system,” Multimedia Systems, Vol. 2, No. 6, pp. 280–2987, 1995.
K. Nahrstedt, “Resource management in networked multimedia systems,” IEEE Multimedia, pp. 52–63, 1995.
V.P. Nelson, “Fault-tolerant computing: Fundamental Concepts,” IEEE Computer, Vol. 23, pp. 19–25, 1990.
K.K. Ramakrishnan, “Operating system support for a video-on-demand file service,” Multimedia Systems, Vol. 3, pp. 53–63, 1995.
P.V. Rangan, H.M. Vin, and S. Ramanathan, “Designing an on-demand multimedia service,” IEEE Communications, Vol. 30, pp. 56–64, 1992.
D. Rotem and J.L. Zhao, “Buffer management for video database systems,” IEEE Intl. Conf. on Data Engineering, 1995, pp. 439–448.
R.D. Schlichting and F.B. Schneider, “Fail-stop processors: Anapproach to designing fault-tolerant computing systems,” ACM Transactions on Computing Systems, Vol. 1, pp. 222–238, 1993.
F.B. Schneider, “Byzantine generals in action: Implementing fail-stop processors,”ACMTrans. on Computing Systems, Vol. 2, pp. 145–154, 1984.
D.P. Siewiorek, “Fault tolerance in commercial computers,” IEEE Computer, Vol. 23, pp. 19–25, 1990.
G.K. Zipf, Human Behavior and the Principles of Least Effort, Addison-Wesley: Reading, MA, 1949.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Shyu, IJ., Shieh, SP. A Distributed Fault-Tolerant Design for Multiple-Server VOD Systems. Multimedia Tools and Applications 8, 219–247 (1999). https://doi.org/10.1023/A:1009685918587
Issue Date:
DOI: https://doi.org/10.1023/A:1009685918587