On the Quality of Service of Crash-Recovery Failure Detectors

Research output: Contribution to journal › Article › peer-review

Abstract

We model the probabilistic behavior of a system comprising a failure detector and a monitored crash-recovery target. We extend failure detectors to take account of failure recovery in the target system. This involves extending QoS measures to include the recovery detection speed and the proportion of failures detected. We also extend the method for estimating the failure detector parameters that achieve a required QoS so that it can be used to configure the crash-recovery failure detector. We investigate the impact of the dependability of the monitored process on the QoS of our failure detector. Our analysis indicates that variation in the MTTF and MTTR of the monitored process can have a significant impact on the QoS of our failure detector. Our analysis is supported by simulations that validate our theoretical results.
Original language: English
Pages (from-to): 271-283
Number of pages: 13
Journal: IEEE Transactions on Dependable and Secure Computing
Volume: 7
Issue number: 3
Publication status: Published - Aug 2010

Keywords

  • MTTF process
  • MTTR process
  • crash recovery failure detector
  • failure recovery
  • monitored crash recovery target
  • probabilistic system behavior
  • quality of service
  • recovery detection speed
  • computerised monitoring
  • failure analysis
  • fault diagnosis
  • software fault tolerance
  • system recovery
