Managing fault tolerance information in multi-agents based distributed systems

Dae Won Lee, Kwang Sik Chung, Hwa Min Lee, Seongbin Park, Young Jun Lee, Heonchang Yu, Won Gyu Lee

Research output: Contribution to journal › Article

Abstract

In a fault-tolerant system that uses rollback-recovery protocols, performance degrades as the amount of saved fault tolerance information grows. To avoid this degradation, we propose a novel multi-agent-based garbage-collection technique that deletes useless fault tolerance information. We define and design a garbage-collection agent that collects fault tolerance information, an information agent that manages fault tolerance information, and a facilitator agent that handles communication between agents. We also propose a garbage-collection algorithm (GCA) that uses these agents. Our rollback-recovery method is based on an independent checkpointing protocol and a sender-based pessimistic message logging protocol. To demonstrate the correctness of the garbage-collection algorithm, we inject failures during operation and compare the domain knowledge of the proposed system using GCA with that of another system without GCA.
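The abstract does not spell out the GCA itself, so the following is only a minimal Python sketch of the general idea behind garbage-collecting a sender-based message log, not the authors' algorithm. It assumes FIFO delivery per sender and that a receiver's checkpoint records the highest sequence number delivered from each sender; entries at or below that number are then never replayed after a rollback to that checkpoint. The names `SenderLog`, `collect`, `replay_for`, and `garbage_collect` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class LoggedMessage:
    receiver: str
    seq: int       # per-receiver sequence number assigned by the sender (assumed)
    payload: str


@dataclass
class SenderLog:
    """Hypothetical sender-side log: each sender keeps the messages it has sent."""
    entries: list = field(default_factory=list)

    def log(self, receiver, seq, payload):
        self.entries.append(LoggedMessage(receiver, seq, payload))

    def collect(self, receiver, checkpointed_seq):
        """Drop entries already covered by the receiver's latest checkpoint."""
        before = len(self.entries)
        self.entries = [
            m for m in self.entries
            if not (m.receiver == receiver and m.seq <= checkpointed_seq)
        ]
        return before - len(self.entries)

    def replay_for(self, receiver):
        """Messages that would be re-sent if `receiver` rolled back now."""
        pending = [m for m in self.entries if m.receiver == receiver]
        return sorted(pending, key=lambda m: m.seq)


def garbage_collect(logs, receiver, delivered_up_to):
    """Stand-in for a garbage-collection agent reacting to a checkpoint notice.

    logs: dict mapping sender name -> SenderLog
    delivered_up_to: dict mapping sender name -> highest sequence number
        that `receiver` had delivered when it took its checkpoint
    Returns the number of log entries removed.
    """
    removed = 0
    for sender, upto in delivered_up_to.items():
        removed += logs[sender].collect(receiver, upto)
    return removed


logs = {"p1": SenderLog(), "p2": SenderLog()}
logs["p1"].log("p3", 1, "a")
logs["p1"].log("p3", 2, "b")
logs["p1"].log("p3", 3, "c")
logs["p1"].log("p2", 1, "d")
logs["p2"].log("p3", 1, "e")

# p3 checkpoints after delivering p1's messages 1-2 and p2's message 1.
removed = garbage_collect(logs, "p3", {"p1": 2, "p2": 1})
remaining_for_p3 = [m.seq for m in logs["p1"].replay_for("p3")]
```

With independent checkpointing and pessimistic logging, a failed process restarts from its own latest checkpoint, which is why this sketch only needs the receiver's most recent checkpoint to decide what is safe to delete.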

Original language: English
Pages (from-to): 104-108
Number of pages: 5
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 2690
Publication status: Published - 2004 Dec 1

Fingerprint

Garbage
Computer Communication Networks
Garbage Collection
Agent-based Systems
Fault tolerance
Distributed Systems
Rollback Recovery
Domain Knowledge
Recovery
Fault-tolerant Systems
Information Management
Checkpointing
Fault-tolerant
Increment
Injection
Correctness
Communication
Injections

ASJC Scopus subject areas

  • Computer Science(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Theoretical Computer Science

Cite this

@article{def8539167a945aabdb20dec96741eb0,
title = "Managing fault tolerance information in multi-agents based distributed systems",
abstract = "In a fault tolerant system using rollback-recovery protocols, the performance of the system is degraded because of the increment of saved fault tolerance information. To avoid degrading its performance, we propose novel multi-agents based garbage-collection technique that deletes useless fault tolerance information. We define and design a garbage-collection agent for garbage-collection of fault tolerance information, a information agent for management of fault tolerant information, and a facilitator agent for communication between agents. And we propose the garbage-collection algorithm(GCA) using these agents. Our rollback recovery method is based on independent checkpointing protocol and sender based pessimistic message logging protocol. To prove the correctness of the garbage-collection algorithm, we introduce failure injection during operation and compare the domain knowledge of the proposed system using GCA with the domain knowledge of another system without GCA.",
author = "Lee, {Dae Won} and Chung, {Kwang Sik} and Lee, {Hwa Min} and Park, Seongbin and Lee, {Young Jun} and Yu, Heonchang and Lee, {Won Gyu}",
year = "2004",
month = "12",
day = "1",
language = "English",
volume = "2690",
pages = "104--108",
journal = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
issn = "0302-9743",
publisher = "Springer Verlag",
}

TY - JOUR

T1 - Managing fault tolerance information in multi-agents based distributed systems

AU - Lee, Dae Won

AU - Chung, Kwang Sik

AU - Lee, Hwa Min

AU - Park, Seongbin

AU - Lee, Young Jun

AU - Yu, Heonchang

AU - Lee, Won Gyu

PY - 2004/12/1

Y1 - 2004/12/1

N2 - In a fault tolerant system using rollback-recovery protocols, the performance of the system is degraded because of the increment of saved fault tolerance information. To avoid degrading its performance, we propose novel multi-agents based garbage-collection technique that deletes useless fault tolerance information. We define and design a garbage-collection agent for garbage-collection of fault tolerance information, a information agent for management of fault tolerant information, and a facilitator agent for communication between agents. And we propose the garbage-collection algorithm(GCA) using these agents. Our rollback recovery method is based on independent checkpointing protocol and sender based pessimistic message logging protocol. To prove the correctness of the garbage-collection algorithm, we introduce failure injection during operation and compare the domain knowledge of the proposed system using GCA with the domain knowledge of another system without GCA.

AB - In a fault tolerant system using rollback-recovery protocols, the performance of the system is degraded because of the increment of saved fault tolerance information. To avoid degrading its performance, we propose novel multi-agents based garbage-collection technique that deletes useless fault tolerance information. We define and design a garbage-collection agent for garbage-collection of fault tolerance information, a information agent for management of fault tolerant information, and a facilitator agent for communication between agents. And we propose the garbage-collection algorithm(GCA) using these agents. Our rollback recovery method is based on independent checkpointing protocol and sender based pessimistic message logging protocol. To prove the correctness of the garbage-collection algorithm, we introduce failure injection during operation and compare the domain knowledge of the proposed system using GCA with the domain knowledge of another system without GCA.

UR - http://www.scopus.com/inward/record.url?scp=35048848011&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35048848011&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:35048848011

VL - 2690

SP - 104

EP - 108

JO - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

JF - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SN - 0302-9743

ER -