Measuring Fault Tolerance with the FTAPE fault injection tool
This paper describes FTAPE (Fault Tolerance And Performance Evaluator), a tool that can be used to compare fault-tolerant computers. The major parts of the tool include a system-wide fault injector, a workload generator, and a workload activity measurement tool. The workload creates high stress conditions on the machine. Using stress-based injection, the fault injector is able to utilize knowledge of the workload activity to ensure a high level of fault propagation. The errors/fault ratio, performance degradation, and number of system crashes are presented as measures of fault tolerance.
Keywords: fault injection, workload generator, fault tolerance measurement, stress-based injection
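The abstract names stress-based injection and three measures but does not define them, so the following is only an illustrative sketch and not FTAPE's implementation. It assumes that stress-based injection means targeting the component with the highest measured workload activity, that the errors/fault ratio is errors detected divided by faults injected, and that performance degradation is the relative increase in run time over a fault-free baseline. All names (`RunResult`, `pick_injection_target`, `summarize`) are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RunResult:
    """Outcome of one injection run (hypothetical record layout)."""
    faults_injected: int
    errors_detected: int
    crashed: bool
    run_time_s: float


def pick_injection_target(activity):
    """Assumed stress-based policy: inject where workload activity is highest."""
    return max(activity, key=activity.get)


def summarize(runs, baseline_time_s):
    """Compute the three measures under the assumed definitions."""
    total_faults = sum(r.faults_injected for r in runs)
    total_errors = sum(r.errors_detected for r in runs)
    # Only runs that finished contribute to the timing comparison.
    completed = [r for r in runs if not r.crashed]
    if completed:
        mean_time = sum(r.run_time_s for r in completed) / len(completed)
        degradation = (mean_time - baseline_time_s) / baseline_time_s
    else:
        degradation = None
    return {
        "errors_per_fault": total_errors / total_faults if total_faults else 0.0,
        "performance_degradation": degradation,
        "crashes": sum(1 for r in runs if r.crashed),
    }
```

A caller would pass per-component activity readings (for example CPU, memory, and I/O utilization) to `pick_injection_target`, run the workload with faults injected there, and feed the collected `RunResult` records to `summarize` together with the fault-free run time.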