Software Design for Resilient Computer Systems
Springer International Publishing (Verlag)
978-3-030-21243-8 (ISBN)
- Titel erscheint in neuer Auflage
- Artikel merken
This book addresses the question of how system software should be designed to account for faults, and which fault tolerance features it should provide for highest reliability. With this second edition of Software Design for Resilient Computer Systems the book is thoroughly updated to contain the newest advice regarding software resilience. With additional chapters on computer system performance and system resilience, as well as online resources, the new edition is ideal for researchers and industry professionals.
The authors first show how the system software interacts with the hardware to tolerate faults. They analyze and further develop the theory of fault tolerance to understand the different ways to increase the reliability of a system, with special attention on the role of system software in this process. They further develop the general algorithm of fault tolerance (GAFT) with its three main processes: hardware checking, preparation for recovery, andthe recovery procedure. For each of the three processes, they analyze the requirements and properties theoretically and give possible implementation scenarios and system software support required. Based on the theoretical results, the authors derive an Oberon-based programming language with direct support of the three processes of GAFT. In the last part of this book, they introduce a simulator, using it as a proof of concept implementation of a novel fault tolerant processor architecture (ERRIC) and its newly developed runtime system feature-wise and performance-wise.Due to the wide reaching nature of the content, this book applies to a host of industries and research areas, including military, aviation, intensive health care, industrial control, and space exploration.Dr. Igor Schagaev is a Professor and Director of IT-ACS Ltd (UK). He is a Fellow of the Institute of Analyst and Programmers (UK), Fellow of British Computer Society (UK). His career has started as an Electromechanical Engineer at the Smolensk aviation factory, USSR, a Senior Programmer and Design Engineer at the Institute of Advanced Computations, Central Bureau, Smolensk Branch, and a Senior Design Engineer and System Programmer for Avionics. Completed PhD in Russian Academy of Science (Institute of Control Science) and involvement in projects of hardware and software for submarines, satellites and aircrafts enables Igor to absorb an experience which he share with Boeing in 98-99. In 1994 Igor has established ATLAB Ltd Bristol now transformed into IT-ACS Ltd. He has published 7 books in three languages, over 60 papers, since 2006 holds international patent on New Active System Control and supportive mathematical models. Professor Schagaev has been honoured with several industry awards, achievements, and grants.
Introduction.- Hardware Faults.- Fault Tolerance: Theory and Concepts.- Generalized Algorithm of Fault Tolerance (GAFT).- GAFT Generalization: A Principle and Model of Active System Safety.- System Software Support for Hardware Deficiency: Function and Features.- Testing and Checking.- Recovery Preparation.- Recovery: Searching and Monitoring of Correct Software States.- Recovery Algorithms: An Analysis.- Programming Language for Safety Critical Systems.- Proposed Runtime System Structure.- Proposed Runtime System vs. Existing Approaches.- Hardware: The ERRIC Architecture.- Architecture Comparison and Evaluation.- Reliability of ERRIC .- Performance of ERRIC.- ERRIC Software.- How about resilience at large.- Map of Resilience.
Erscheinungsdatum | 13.10.2019 |
---|---|
Zusatzinfo | XVIII, 308 p. 175 illus., 133 illus. in color. |
Verlagsort | Cham |
Sprache | englisch |
Maße | 155 x 235 mm |
Gewicht | 613 g |
Themenwelt | Technik ► Elektrotechnik / Energietechnik |
Technik ► Nachrichtentechnik | |
Schlagworte | ERRIC architecture • Extreme Reliability • fault tolerance • Hardware and Software Reliability • Hardware and Software Resilience • Hardware deficiency • Hardware faults • Quality Control, Reliability, Safety and Risk • Reliability Engineering • Software for hardware efficiency |
ISBN-10 | 3-030-21243-2 / 3030212432 |
ISBN-13 | 978-3-030-21243-8 / 9783030212438 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
aus dem Bereich