Skip to main content

Fundamentals of Dependability

  • Chapter
  • First Online:
Fault-Tolerant Design

Abstract

The ultimate goal of fault tolerance is the development of a dependable system. In a broad term, dependability is the ability of a system to deliver its intended level of service to its users [16]. As computing becomes ubiquitous and penetrates our everyday lives on all scales, dependability becomes important not only for the traditional safety-, mission-, and business-critical applications, but also for our society as a whole. In this chapter, we study three fundamental characteristics of dependability: attributes, impairment, and means. Dependability attributes describe the properties which are required of a system. Dependability impairments express the reasons for a system to cease to perform its function or, in other words, the threats to dependability. Dependability means are the methods and techniques enabling the development of a dependable system, such as fault prevention, fault tolerance, fault removal, and fault forecasting.

“Ah, this is obviously some strange usage of the word ‘safe’ that I wasn’t previously aware of.” Douglas Adams, The Hitchhikers Guide to the Galaxy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 99.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Abramovici, M., Breuer, M.A., Frideman, A.D.: Digital system testing and testable design. Computer Science Press, New York (1995)

    Google Scholar 

  2. Akamai: Akamai reveals 2 seconds as the new threshold of acceptability for ecommerce web page response times (2000). http://www.akamai.com/html/about/press/releases/2009/press_091409.html

  3. Avižienis, A.: Fault-tolerant systems. IEEE Trans. Comput. 25(12), 1304–1312 (1976)

    Article  MATH  Google Scholar 

  4. Avižienis, A.: The four-universe information system model for the study of fault-tolerance. In: Proceedings of the 12th Annual International Symposium on Fault-Tolerant Computing, FTCS’82, IEEE Press, pp. 6–13 (1982)

    Google Scholar 

  5. Avižienis, A.: Design diversity: An approach to fault tolerance of design faults. In: Proceedings of the National Computer Conference and Exposition, pp. 163–171 (1984)

    Google Scholar 

  6. Avizienis, A., Laprie, J.C., Randell, B., Landwehr, C.: Basic concepts and taxonomy of dependable and secure computing. IEEE Trans. Dependable Secur. Comput. 1(1), 11–33 (2004)

    Article  Google Scholar 

  7. Berry, J.M.:\(\,\$32\) billion overdraft resulted from snafu (1985). http://catless.ncl.ac.uk/Risks/1.31.html#subj4

  8. Bowen, J., Stravridou, V.: Safety-critical systems, formal methods and standards. IEE/BCS Softw. Eng. J. 8(4), 189–209 (1993)

    Article  Google Scholar 

  9. Deverell, E.: The 2001 Kista Blackout: Corporate Crisis and Urban Contingency. The Swedish National Defence College, Stockholm (2003)

    Google Scholar 

  10. Gray, J.: A census of TANDEM system availability between 1985 and 1990. IEEE Trans. Reliab. 39(4), 409–418 (1990)

    Article  Google Scholar 

  11. Hayes, J.: Fault modeling for digital MOS integrated circuits. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 3(3), 200–208 (1984)

    Article  Google Scholar 

  12. IAEA: Frequently asked Chernobyl questions (2005). http://www.iaea.org/newscenter/features/chernobyl-15/cherno-faq.shtml

  13. Joch, A.: How software doesn’t work: nine ways to make your code reliable (1995). http://www.welchco.com/02/14/01/60/95/12/0102.HTM

  14. Johnson, B.W.: The Design and Analysis of Fault Tolerant Digital Systems. Addison-Wesley, New York (1989)

    Google Scholar 

  15. Karlsson, I.: Utvärdering av birka energi (Birka Energi’s Evaluation), Sweden (2001)

    Google Scholar 

  16. Laprie, J.C.: Dependable computing and fault tolerance: Concepts and terminology. In: Proceedings of 15th International Symposium on Fault-Tolerant Computing (FTSC-15), IEEE Computer Society, pp. 2–11 (1985)

    Google Scholar 

  17. Lions, J.L.: Ariane 5 flight 501 failure, report by the inquiry board (1996). http://www.esrin.esa.it/htdocs/tidc/Press/Press96/ariane5rep.html

  18. Lyu, M.R.: Introduction. In: Lyu, M.R. (ed.) Handbook of Software Reliability, pp. 3–25. McGraw-Hill, New York (1996)

    Google Scholar 

  19. Mills, H., Dyer, M., Linger, R.: Cleanroom software engineering. IEEE Softw. 4(5), 19–25 (1987)

    Article  Google Scholar 

  20. NASA: The Role of Small Satellites in NASA and NOAA Earth Observation Programs. Space Studies Board, National Research Council, National Academy of Sciences, Washington, USA (2000)

    Google Scholar 

  21. Nelson, V.P.: Fault-tolerant computing: fundamental concepts. IEEE Comput. 23(7), 19–25 (1990)

    Article  Google Scholar 

  22. Randell, B.: System structure for software fault tolerance. In: Proceedings of the International Conference on Reliable Software, pp. 437–449 (1975)

    Google Scholar 

  23. Saleh, R., Wilton, S., Mirabbasi, S., Hu, A., Greenstreet, M., Lemieux, G., Pande, P., Grecu, C., Ivanov, A.: System-on-chip: Reuse and integration. Proc. IEEE 94(6) (2006)

    Google Scholar 

  24. Smith, M.: RAM reliability: Soft errors (1998). http://www.crystallineconcepts.com/ram/ram-soft.html

  25. Smith, M.D.J., Simpson, K.G.: Safety Critical Systems Handbook, 3rd edn. Elsevier Ltd., New York (2011)

    Google Scholar 

  26. Tezzaron Semiconductor: Soft errors in electronic memory (2004). http://www.tezzaron.com/about/papers/papers.html

  27. Tumer, I.Y.: Design methods and practises for fault prevention and management in spacecraft. Tech. Rep. 20060022566, NASA (2005)

    Google Scholar 

  28. Pratt, V.: Anatomy of the pentium bug. In: Mosses, P.D., Nielsen, M., Schwartzbach, M.I. (eds.) TAPSOFT’95: Theory and Practice of Software Development, vol. 915, pp. 97–107. Springer, Verlag (1995)

    Google Scholar 

  29. Yu, W.D.: A software fault prevention approach in coding and root cause analysis. Bell Labs Tech. J. 3(2), 3–21 (1998)

    Article  Google Scholar 

  30. Ziegler, J.F.: Terrestrial cosmic rays and soft errors. IBM J. Res. Dev. 40(1), 19–41 (1996)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Elena Dubrova .

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media New York

About this chapter

Cite this chapter

Dubrova, E. (2013). Fundamentals of Dependability. In: Fault-Tolerant Design. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-2113-9_2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-2113-9_2

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-2112-2

  • Online ISBN: 978-1-4614-2113-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics