Managing fault tolerance information in multi-agents based distributed systems

Dae Won Lee, Kwang Sik Chung, Hwa Min Lee, Seongbin Park, Young Jun Lee, Heonchang Yu, Won Gyu Lee

Research output: Contribution to journalArticle

Abstract

In a fault tolerant system using rollback-recovery protocols, the performance of the system is degraded because of the increment of saved fault tolerance information. To avoid degrading its performance, we propose novel multi-agents based garbage-collection technique that deletes useless fault tolerance information. We define and design a garbage-collection agent for garbage-collection of fault tolerance information, a information agent for management of fault tolerant information, and a facilitator agent for communication between agents. And we propose the garbage-collection algorithm(GCA) using these agents. Our rollback recovery method is based on independent checkpointing protocol and sender based pessimistic message logging protocol. To prove the correctness of the garbage-collection algorithm, we introduce failure injection during operation and compare the domain knowledge of the proposed system using GCA with the domain knowledge of another system without GCA.

Original languageEnglish
Pages (from-to)104-108
Number of pages5
JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2690
Publication statusPublished - 2004 Dec 1

ASJC Scopus subject areas

  • Computer Science(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Theoretical Computer Science

Fingerprint Dive into the research topics of 'Managing fault tolerance information in multi-agents based distributed systems'. Together they form a unique fingerprint.

  • Cite this