Experimental evaluation of failure-detection schemes in real-time communication networks

Seungjae Han, Kang G. Shin

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

11 Citations (Scopus)

Abstract

An effective failure-detection scheme is essential for reliable communication services. Most computer networks rely on behavior-based detection schemes: each node uses heartbeats to detect the failure of its neighbor nodes, and the transport protocol (like TCP) achieves reliable communication by acknowledgment/retransmission. In this paper, we experimentally evaluate the effectiveness of such behavior-based detection schemes in real-time communication. Specifically, we measure and analyze the coverage and latency of two failure-detection schemes, neighbor detection and end-to-end detection, through fault-injection experiments. The experimental results show that a significant portion of failures can be detected very quickly by the neighbor detection scheme, while the end-to-end detection scheme uncovers the remaining failures with larger detection latencies.
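
As a rough illustration of the heartbeat-based neighbor detection the abstract mentions, the following minimal sketch marks a neighbor as suspected once it has been silent for longer than a timeout. It is not the authors' implementation: the class name `NeighborDetector`, the 3-second timeout, and the node labels "B" and "C" are all assumptions made for this example, and timestamps are passed in explicitly so the logic can be followed without a real clock.

```python
from dataclasses import dataclass, field


@dataclass
class NeighborDetector:
    """Hypothetical heartbeat monitor: suspects a neighbor after `timeout` seconds of silence."""
    timeout: float                                   # assumed value, not taken from the paper
    last_seen: dict = field(default_factory=dict)    # neighbor name -> time of last heartbeat

    def heartbeat(self, neighbor: str, now: float) -> None:
        # Record the arrival time of the latest heartbeat from this neighbor.
        self.last_seen[neighbor] = now

    def suspected(self, now: float) -> list:
        # A neighbor is suspected once its silence strictly exceeds the timeout.
        return sorted(n for n, t in self.last_seen.items() if now - t > self.timeout)


detector = NeighborDetector(timeout=3.0)
detector.heartbeat("B", now=0.0)
detector.heartbeat("C", now=0.0)
detector.heartbeat("B", now=2.0)     # B keeps sending heartbeats; C goes silent
print(detector.suspected(now=4.0))   # prints ['C']
```

In a sketch like this the detection latency is bounded by the timeout plus the heartbeat period, which is one way to see why a neighbor scheme can react quickly to failures that silence a node, while failures that leave heartbeats intact would have to be caught by an end-to-end mechanism.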

Original language: English
Title of host publication: Digest of Papers - 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 122-131
Number of pages: 10
ISBN (Electronic): 0818678313, 9780818678318
DOI: https://doi.org/10.1109/FTCS.1997.614085
Publication status: Published - 1997 Jan 1
Event: 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997 - Seattle, United States
Duration: 1997 Jun 24 to 1997 Jun 27

Publication series

Name: Digest of Papers - 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997

Other

Other: 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997
Country: United States
City: Seattle
Period: 97/6/24 to 97/6/27

Fingerprint

Telecommunication networks
Communication
Computer networks
Network protocols
Experiments

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Hardware and Architecture
  • Software
  • Safety, Risk, Reliability and Quality

Cite this

Han, S., & Shin, K. G. (1997). Experimental evaluation of failure-detection schemes in real-time communication networks. In Digest of Papers - 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997 (pp. 122-131). [614085] (Digest of Papers - 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/FTCS.1997.614085