Assessing Error Detection Coverage by Simulated Fault Injection

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1667))

Abstract

Server dependability is of increasing importance as more critical applications rely on the client-server computing model. As a consequence, complex fault/error handling mechanisms are becoming common features of today's servers. This paper presents a new simulated fault injection method, which allows the effectiveness of error detection mechanisms to be assessed without using expensive test circuits. Fault injection was performed in two stages. In the first stage, physical fault injection was performed on a prototype server: transient faults were injected into randomly selected signals, and traces of the signals sensitive to transients were captured. A complex protocol checker was then devised to increase error detection. In the second stage, the new detection circuitry was simulated, using the signal traces, injected with transient faults, as inputs to the simulation. The error detection coverage and latency were derived from the simulation results. Fault injection also showed that the coverage probability was a function of fault duration.
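
The second stage described above can be illustrated with a minimal sketch. It is not the paper's protocol checker or simulator: the golden trace, the sampling checker in `detect`, and all function and parameter names are hypothetical stand-ins chosen for illustration. A transient fault is modeled as a bit inversion over a window of cycles, and the toy checker compares the faulty trace against the golden one only every `period` cycles, which is enough to show why coverage can depend on fault duration.

```python
import random

def inject_transient(trace, start, duration):
    """Return a copy of `trace` with bits inverted in [start, start + duration)."""
    faulty = list(trace)
    for i in range(start, min(start + duration, len(faulty))):
        faulty[i] ^= 1
    return faulty

def detect(golden, faulty, period):
    """Toy checker (an assumption, not the paper's circuit): compares the two
    traces only every `period` cycles. Returns the first detecting cycle or None."""
    for cycle in range(0, len(golden), period):
        if golden[cycle] != faulty[cycle]:
            return cycle
    return None

def estimate_coverage(golden, duration, period, n_injections, rng):
    """Inject `n_injections` transients at random start cycles and return
    (coverage estimate, mean detection latency in cycles or None)."""
    detected = 0
    latencies = []
    for _ in range(n_injections):
        start = rng.randrange(0, len(golden) - duration + 1)
        faulty = inject_transient(golden, start, duration)
        hit = detect(golden, faulty, period)
        if hit is not None:
            detected += 1
            latencies.append(hit - start)
    coverage = detected / n_injections
    mean_latency = sum(latencies) / len(latencies) if latencies else None
    return coverage, mean_latency

if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic golden trace; a real experiment would use captured signal traces.
    golden = [rng.randint(0, 1) for _ in range(1000)]
    for duration in (1, 2, 4, 8):
        coverage, latency = estimate_coverage(golden, duration, 8, 2000, rng)
        print(f"duration={duration} coverage={coverage:.3f} mean_latency={latency}")
```

In this toy setting, a fault lasting at least `period` cycles always overlaps a checked cycle, so every such fault is detected, while shorter faults can fall between checks and go unnoticed. The numbers it prints say nothing about the coverage or latency values reported in the paper.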

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Constantinescu, C. (1999). Assessing Error Detection Coverage by Simulated Fault Injection. In: Hlavička, J., Maehle, E., Pataricza, A. (eds) Dependable Computing — EDCC-3. EDCC 1999. Lecture Notes in Computer Science, vol 1667. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48254-7_12

  • DOI: https://doi.org/10.1007/3-540-48254-7_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66483-3

  • Online ISBN: 978-3-540-48254-3

  • eBook Packages: Springer Book Archive
