Online Fault Localization and Health Monitoring for Software Systems
Software systems used in the industry are often large and complex. Even with an extensive validation phase, it is impossible to ensure that a software system is fault-free and will remain so all along its evolution. When a failure happens in operation, the time to solve the fault should be minimized. The major challenge in this realm is the localization of a fault in one of the constituent components of the overall system. We strive at simplifying both the detection of failures and the localization of the fault that led to this failure by adapting existing techniques to the online context. This chapter first presents the Spectrum-based Fault Localization (SFL) method. It then explores the specificities of SFL for online fault localization and health monitoring. Its applicability to actual systems is evaluated through simulation of online failure scenarios, and through implementation in a demonstration surveillance system. The results of the studies performed confirm that applying SFL online, using monitoring, can successfully provide health information and locate problematic components, so that a software failure can be addressed adequately and timely.
KeywordsSimilarity Coefficient Fault Localization Software Fault Current Spectrum Performance Overhead
This research has been carried out as a part of the Poseidon project at Thales under the responsibilities of the Embedded Systems Institute (ESI). This project is partially supported by the Dutch Ministry of Economic Affairs under the BSIK program.
- 1.Abreu R, Zoeteweij P, van Gemund AJC (2007) On the accuracy of spectrum-based fault localization. In Proceedings of the testing: academic and industrial conference practice and research techniques – MUTATION, Washington, DC, USA, Aug 2007. IEEE Computer Society, pp 89–98Google Scholar
- 2.Abreu R, Zoeteweij P, van Gemund AJC (2009) Spectrum-based multiple fault localization. In ASE ’09: proceedings of the 2009 IEEE/ACM international conference on automated software engineering, Washington, DC, USA. IEEE Computer Society, pp 88–99Google Scholar
- 3.Cleve H. Zeller A (2005) Locating causes of program failures. In ICSE ’05: proceedings of the 27th international conference on software engineering, St. Louis, MO, USA, May 2005. ACM Press, pp 342–351Google Scholar
- 4.International Telecommunications Union (2001) Technical characteristics for a universal shipborne Automatic Identification System using time division multiple access in the VHF maritime mobile band. Recommendation ITU-R M.1371-1Google Scholar
- 5.Jones JA, Harrold MJ, Stasko J (2002) Visualization of test information to assist fault localization. In ICSE ’02: proceedings of the 24th international conference on software engineering, Orlando, FL, USA, May 2002. ACM, pp 467–477Google Scholar
- 6.Kiczales G, Lamping J, Menhdhekar A, Maeda C, Lopes C, Loingtier J-M, Irwin J (1997) Aspect-oriented programming. In: Akşit M, Matsuoka S (eds) Proceedings European conference on object-oriented programming, vol 1241. Springer, Berlin/Heidelberg/New York, pp 220–242Google Scholar
- 8.Piel E, González-Sánchez A, Gross H-G, van Gemund AJC (2011) Spectrum-based health monitoring for self-adaptive systems. In 5th IEEE international conference on self-adaptive and self-organizing systems (SASO’11). IEEE Computer Society, Washington, DC, USA, pp 99–108Google Scholar
- 9.Slane D (2009) Fault localization in in vivo software testing. Master’s thesis, Bard College, Massachusetts, USAGoogle Scholar
- 10.Williams BC, Ingham MD, Chung SH, Elliott PH (2003) Model-based programming of intelligent embedded systems and robotic space explorers. In IEEE special issue on modeling and design of embedded software, IEEE Computer Society, Washington, DC, USA, pp 212–237Google Scholar
- 11.Zoeteweij P, Abreu R, Golsteijn R, Arjan JC van Gemund (2009) Spectrum-based fault localization in practice. In Mathijsen R (ed) Trader: reliability of high-volume consumer products, Eindhoven, The Netherlands. Embedded Systems Institute, pp 113–124Google Scholar