On fault tolerant matrix decomposition

  • Patrick Fitzpatrick
Article

Abstract

We present a fault tolerant algorithm for matrix factorization in the presence of multiple hardware faults which can be used for solving the linear systemAx=b without determining the correctZU decomposition ofA. HereZ is eitherL for ordinary Gaussian decomposition with partial pivoting,X for pairwise or neighbor pivoting (motivated by the Gentleman-Kung systolic array structure), orQ for the usualQR decomposition. Our algorithm generalizes that of Luk and Park whose method allows for the correction of a single error in a single iterate of the matrixU. Using ideas from the theory of error correcting codes we prove that the algorithm of Luk and Park can in fact tolerate multiple errors in multiple iterates ofU provided these are all confined to a single column. We then generalize the algorithm to one that tolerates multiple errors in multiple iterates ofU provided they are confined to two columns. Our procedure for identifying the erroneous columns is based on the extended Euclidean algorithm and it analogous to the decoding algorithms for BCH codes. We indicate how our methods may be adapted to apply to any number of columns and finally we show how to compute a correct factorization ofA.

Keywords

Parity Check Systolic Array Parity Check Matrix Single Error Multiple Error 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    J.M. Speiser and H.J. Whitehouse, Signal processing computational needs: An update,in Mathematics in Signal Processing (J.G. McWhirter, ed.), OUP, 1990, pp. 633–664.Google Scholar
  2. 2.
    F.T. Luk and H. Park, Fault tolerant matrix triangularization on systolic arraysIEEE Trans. Comp., Vol. C-37 (1988) pp. 1434–1438.MathSciNetCrossRefGoogle Scholar
  3. 3.
    W.M. Gentleman and H.T. Kung, Matrix triangularization by systolic arrays, inReal Time Signal Processing TV, Proc. SPIE (T.F. Tao, ed.), Vol. 298 (1981) pp. 19–26.Google Scholar
  4. 4.
    K.H. Huang and J.A. Abraham, Algorithm-based fault tolerance for matrix operations,IEEE Trans. Comp., Vol. C-33 (1984) pp. 518–528.CrossRefGoogle Scholar
  5. 5.
    J-Y. Jou and J.A. Abraham, Fault-tolerant matrix arithmetic and signal processing on highly concurrent computing structures,Proc. IEEE, Vol. 74 (1986) pp. 732–741.CrossRefGoogle Scholar
  6. 6.
    P. Fitzpatrick and C.C. Murphy, Solution of linear systems of equations in the presence of two transient hardware faults,IEE Proc. Pt. E., Vol. 140 (1993) pp. 247–254.MATHGoogle Scholar
  7. 7.
    C.J. Anfinson and F.T. Luk, A linear algebraic model of algorithm-based fault tolerance,IEEE Trans. Comp., Vol. C-37 (1988) pp. 1599–1604.MathSciNetCrossRefGoogle Scholar
  8. 8.
    R.P. Brent, F.T. Luk, and C.J. Anfinson, Checksum schemes for fault tolerant systolic computing,in Mathematics in Signal Processing (J.G. McWhirter, ed.), OUP, 1990, pp. 791–804.Google Scholar
  9. 9.
    D.L. Boley, R.P. Brent, G.H. Golub, and F.T. Luk, Error correction via the Lanczos process,Tech. Rep. EE-CEG-91-1, School of Elect. Eng., Cornell Univ. (Jan. 1991).Google Scholar
  10. 10.
    T. Fuja, C. Heegard, and R. Goodman, Linear sum codes for random access memories,IEEE Trans. Comp., Vol. C-37 (1988) pp. 1030–1042.MathSciNetCrossRefGoogle Scholar
  11. 11.
    B.W. Johnson,Design and Analysis of Fault Tolerant Digital Systems, Addison-Wesley, 1989.Google Scholar
  12. 12.
    T.R.N. Rao and E. Fujiwara,Error-control coding for Computer Systems, Prentice-Hall, 1989.Google Scholar
  13. 13.
    R.J. McEliece,The Theory of Information and Coding, Addison-Wesley, Reading, Mass. 1977.MATHGoogle Scholar
  14. 14.
    H. Golub and C.F. Van Loan,Matrix Computations, Johns Hopkins University Press, Baltimore, Maryland, 1983.MATHGoogle Scholar
  15. 15.
    G. Pólya and G. Szegö,Aufgaben und Lehräsätze aus der Analysis, Vol. 2 (second edition), Springer, Berlin, 1954.CrossRefGoogle Scholar

Copyright information

© Kluwer Academic Publishers 1994

Authors and Affiliations

  • Patrick Fitzpatrick
    • 1
  1. 1.IFI Institute of Advanced Microelectronics, National Microelectronics Research CentreUniversity CollegeCorkIreland

Personalised recommendations