Advertisement

Software Design for Resilient Computer Systems

  • Igor Schagaev
  • Thomas Kaegi-Trachsel

Table of contents

  1. Front Matter
    Pages i-xiv
  2. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 1-6
  3. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 7-10
  4. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 11-22
  5. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 23-43
  6. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 45-55
  7. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 57-66
  8. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 67-104
  9. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 105-134
  10. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 135-145
  11. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 147-157
  12. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 159-182
  13. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 183-184
  14. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 185-187
  15. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 189-197
  16. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 199-205
  17. Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 207-211
  18. Back Matter
    Pages 213-214

About this book

Introduction

This book addresses the question of how system software should be designed to account for faults, and which fault tolerance features it should provide for highest reliability. The authors first show how the system software interacts with the hardware to tolerate faults. They analyze and further develop the theory of fault tolerance to understand the different ways to increase the reliability of a system, with special attention on the role of system software in this process. They further develop the general algorithm of fault tolerance (GAFT) with its three main processes: hardware checking, preparation for recovery, and the recovery procedure. For each of the three processes, they analyze the requirements and properties theoretically and give possible implementation scenarios and system software support required. Based on the theoretical results, the authors derive an Oberon-based programming language with direct support of the three processes of GAFT.  In the last part of this book, they introduce a simulator, using it as a proof of concept implementation of a novel fault tolerant processor architecture (ERRIC) and its newly developed runtime system feature-wise and performance-wise. The content applies to industries such as military, aviation, intensive health care, industrial control, space exploration, etc. 

·         Outlines potential critical faults in the modern computer systems and what is required to change them

·         Explains how to design and re-design system software for the next generation of computers for wider application domains and greater efficiency and reliability

·         Presents how implemented system software support makes maintenance of computer systems much easier, while reliability and performance increases

Keywords

ERRIC architecture Extreme reliability Fault tolerance Hardware and software reliability Hardware and software resilience Hardware deficiency Hardware faults Reliability engineering Software for hardware efficiency

Authors and affiliations

  • Igor Schagaev
    • 1
  • Thomas Kaegi-Trachsel
    • 2
  1. 1.IT-ACS LtdStevenageUnited Kingdom
  2. 2.IT-ACS LtdStevenageUnited Kingdom

Bibliographic information

  • DOI https://doi.org/10.1007/978-3-319-29465-0
  • Copyright Information Springer International Publishing Switzerland 2016
  • Publisher Name Springer, Cham
  • eBook Packages Engineering
  • Print ISBN 978-3-319-29463-6
  • Online ISBN 978-3-319-29465-0
  • Buy this book on publisher's site