Unreliable Error Correction in Dynamic Systems
This chapter focuses on constructing reliable dynamic systems exclusively out of unreliable components, including unreliable components in the error-correcting mechanism. At each time step, a particular component can suffer a transient fault with a probability that is bounded by a constant. Faults between different components and between different time steps are treated as independent. Essentially, the chapter considers an extension of the techniques described in Chapter 2 to a dynamic system setting. Since dynamic systems evolve in time according to their internal state, the major task is to effectively deal with the effects of error propagation, i.e., the effects of errors that corrupt the system state.
KeywordsFault Tolerance Linear Code LDPC Code Parity Check Propagation Failure
Unable to display preview. Download preview PDF.
- Avizienis, A. (1981). Fault-tolerance by means of external monitoring of computer systems. In Proceedings of the 1981 National Computational Conference, pages 27–40.Google Scholar
- Gallager, R. G. (1963). Low-Density Parity Check Codes. MIT Press, Cambridge, Massachusetts.Google Scholar
- Hadjicostis, C. N. (1999). Coding Approaches to Fault Tolerance in Dynamic Systems. PhD thesis, EECS Department, Massachusetts Institute of Technology, Cambridge, Massachusetts.Google Scholar
- Hadjicostis, C. N. (2000). Fault-tolerant dynamic systems. In Proceedings of ISIT 2000, the Int. Symp. on Information Theory, page 444.Google Scholar
- Hadjicostis, C. N. and Verghese, G. C. (1999). Fault-tolerant linear finite state machines. In Proceedings of the 6th IEEE Int. Conf. on Electronics, Circuits and Systems, pages 1085–1088.Google Scholar
- Johnson, B. (1989). Design and Analysis of Fault-Tolerant Digital Systems. Addison-Wesley, Reading, Massachusetts.Google Scholar
- Parekhji, R. A., Venkatesh, G., and Sherlekar, S. D. (1991). A methodology for designing optimal self-checking sequential circuits. In Proceedings of the Int. Conf. VLSI Design, pages 283–291. IEEE CS Press.Google Scholar
- Pippenger, N. (1990). Developments in the synthesis of reliable organisms from unreliable components. In Proceedings of Symposia in Pure Mathematics, volume 50, pages 311–324.Google Scholar
- Pradhan, D. K. (1996). Fault-Tolerant Computer System Design. Prentice Hall, Englewood Cliffs, New Jersey.Google Scholar
- Robinson, S. H. and Shen, J. P. (1992). Direct methods for synthesis of self-monitoring state machines. In Proceedings of 22nd Fault-Tolerant Computing Symp., pages 306–315. IEEE CS Press.Google Scholar
- Spielman, D. A. (1996a). Highly fault-tolerant parallel computation. In Proceedings of the Annual Symp. on Foundations of Computer Science, volume 37, pages 154–160.Google Scholar
- Taylor, M. G. (1968a). Reliable computation in computing systems designed from unreliable components. The Bell System Journal, 47(10):2239–2366.Google Scholar