Abstract
The ultimate goal of fault tolerance is the development of a dependable system. In a broad term, dependability is the ability of a system to deliver its intended level of service to its users [16]. As computing becomes ubiquitous and penetrates our everyday lives on all scales, dependability becomes important not only for the traditional safety-, mission-, and business-critical applications, but also for our society as a whole. In this chapter, we study three fundamental characteristics of dependability: attributes, impairment, and means. Dependability attributes describe the properties which are required of a system. Dependability impairments express the reasons for a system to cease to perform its function or, in other words, the threats to dependability. Dependability means are the methods and techniques enabling the development of a dependable system, such as fault prevention, fault tolerance, fault removal, and fault forecasting.
“Ah, this is obviously some strange usage of the word ‘safe’ that I wasn’t previously aware of.” Douglas Adams, The Hitchhikers Guide to the Galaxy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abramovici, M., Breuer, M.A., Frideman, A.D.: Digital system testing and testable design. Computer Science Press, New York (1995)
Akamai: Akamai reveals 2 seconds as the new threshold of acceptability for ecommerce web page response times (2000). http://www.akamai.com/html/about/press/releases/2009/press_091409.html
Avižienis, A.: Fault-tolerant systems. IEEE Trans. Comput. 25(12), 1304–1312 (1976)
Avižienis, A.: The four-universe information system model for the study of fault-tolerance. In: Proceedings of the 12th Annual International Symposium on Fault-Tolerant Computing, FTCS’82, IEEE Press, pp. 6–13 (1982)
Avižienis, A.: Design diversity: An approach to fault tolerance of design faults. In: Proceedings of the National Computer Conference and Exposition, pp. 163–171 (1984)
Avizienis, A., Laprie, J.C., Randell, B., Landwehr, C.: Basic concepts and taxonomy of dependable and secure computing. IEEE Trans. Dependable Secur. Comput. 1(1), 11–33 (2004)
Berry, J.M.:\(\,\$32\) billion overdraft resulted from snafu (1985). http://catless.ncl.ac.uk/Risks/1.31.html#subj4
Bowen, J., Stravridou, V.: Safety-critical systems, formal methods and standards. IEE/BCS Softw. Eng. J. 8(4), 189–209 (1993)
Deverell, E.: The 2001 Kista Blackout: Corporate Crisis and Urban Contingency. The Swedish National Defence College, Stockholm (2003)
Gray, J.: A census of TANDEM system availability between 1985 and 1990. IEEE Trans. Reliab. 39(4), 409–418 (1990)
Hayes, J.: Fault modeling for digital MOS integrated circuits. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 3(3), 200–208 (1984)
IAEA: Frequently asked Chernobyl questions (2005). http://www.iaea.org/newscenter/features/chernobyl-15/cherno-faq.shtml
Joch, A.: How software doesn’t work: nine ways to make your code reliable (1995). http://www.welchco.com/02/14/01/60/95/12/0102.HTM
Johnson, B.W.: The Design and Analysis of Fault Tolerant Digital Systems. Addison-Wesley, New York (1989)
Karlsson, I.: Utvärdering av birka energi (Birka Energi’s Evaluation), Sweden (2001)
Laprie, J.C.: Dependable computing and fault tolerance: Concepts and terminology. In: Proceedings of 15th International Symposium on Fault-Tolerant Computing (FTSC-15), IEEE Computer Society, pp. 2–11 (1985)
Lions, J.L.: Ariane 5 flight 501 failure, report by the inquiry board (1996). http://www.esrin.esa.it/htdocs/tidc/Press/Press96/ariane5rep.html
Lyu, M.R.: Introduction. In: Lyu, M.R. (ed.) Handbook of Software Reliability, pp. 3–25. McGraw-Hill, New York (1996)
Mills, H., Dyer, M., Linger, R.: Cleanroom software engineering. IEEE Softw. 4(5), 19–25 (1987)
NASA: The Role of Small Satellites in NASA and NOAA Earth Observation Programs. Space Studies Board, National Research Council, National Academy of Sciences, Washington, USA (2000)
Nelson, V.P.: Fault-tolerant computing: fundamental concepts. IEEE Comput. 23(7), 19–25 (1990)
Randell, B.: System structure for software fault tolerance. In: Proceedings of the International Conference on Reliable Software, pp. 437–449 (1975)
Saleh, R., Wilton, S., Mirabbasi, S., Hu, A., Greenstreet, M., Lemieux, G., Pande, P., Grecu, C., Ivanov, A.: System-on-chip: Reuse and integration. Proc. IEEE 94(6) (2006)
Smith, M.: RAM reliability: Soft errors (1998). http://www.crystallineconcepts.com/ram/ram-soft.html
Smith, M.D.J., Simpson, K.G.: Safety Critical Systems Handbook, 3rd edn. Elsevier Ltd., New York (2011)
Tezzaron Semiconductor: Soft errors in electronic memory (2004). http://www.tezzaron.com/about/papers/papers.html
Tumer, I.Y.: Design methods and practises for fault prevention and management in spacecraft. Tech. Rep. 20060022566, NASA (2005)
Pratt, V.: Anatomy of the pentium bug. In: Mosses, P.D., Nielsen, M., Schwartzbach, M.I. (eds.) TAPSOFT’95: Theory and Practice of Software Development, vol. 915, pp. 97–107. Springer, Verlag (1995)
Yu, W.D.: A software fault prevention approach in coding and root cause analysis. Bell Labs Tech. J. 3(2), 3–21 (1998)
Ziegler, J.F.: Terrestrial cosmic rays and soft errors. IBM J. Res. Dev. 40(1), 19–41 (1996)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media New York
About this chapter
Cite this chapter
Dubrova, E. (2013). Fundamentals of Dependability. In: Fault-Tolerant Design. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-2113-9_2
Download citation
DOI: https://doi.org/10.1007/978-1-4614-2113-9_2
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-2112-2
Online ISBN: 978-1-4614-2113-9
eBook Packages: EngineeringEngineering (R0)