A fault tolerance service for QoS in grid computing

Hwa Min Lee, Kwang Sik Chung, Sung Ho Jin, Dae Won Lee, Won Gyu Lee, Soon Young Jung, Heonchang Yu

Research output: Contribution to journal (Article)

5 Citations (Scopus)

Abstract

This paper proposes a fault tolerance service that satisfies QoS requirements in grid computing. The probability of failure in grid computing is higher than in traditional parallel computing. Because resource failures can be fatal to job execution, a fault tolerance service is essential in grid computing. Grid services are also often expected to meet minimum levels of quality of service (QoS) for desirable operation. However, the Globus Toolkit does not provide a fault tolerance service that supports fault detection and fault management and satisfies QoS requirements. To provide such a service, we expand the definition of failure to cover failures such as process failure, processor failure, and network failure. We then propose a fault detection service and a fault management service and present simulation results.

Original language: English
Pages (from-to): 286-296
Number of pages: 11
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 2659
Publication status: Published - 2003 Dec 1

Fingerprint

Grid computing
Fault tolerance
Quality of service
Fault detection
Service management
Requirements
Parallel processing systems
Fault Management
Globus Toolkit
Grid Service
Parallel Computing
Expand
Resources
Simulation

ASJC Scopus subject areas

  • Computer Science (all)
  • Biochemistry, Genetics and Molecular Biology (all)
  • Theoretical Computer Science

Cite this

A fault tolerance service for QoS in grid computing. / Lee, Hwa Min; Chung, Kwang Sik; Jin, Sung Ho; Lee, Dae Won; Lee, Won Gyu; Jung, Soon Young; Yu, Heonchang.

In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 2659, 01.12.2003, p. 286-296.

@article{4845a86140854af6b9c2d5d249d7aac1,
title = "A fault tolerance service for QoS in grid computing",
abstract = "This paper proposes a fault tolerance service that satisfies QoS requirements in grid computing. The probability of failure in grid computing is higher than in traditional parallel computing. Because resource failures can be fatal to job execution, a fault tolerance service is essential in grid computing. Grid services are also often expected to meet minimum levels of quality of service (QoS) for desirable operation. However, the Globus Toolkit does not provide a fault tolerance service that supports fault detection and fault management and satisfies QoS requirements. To provide such a service, we expand the definition of failure to cover failures such as process failure, processor failure, and network failure. We then propose a fault detection service and a fault management service and present simulation results.",
author = "Lee, {Hwa Min} and Chung, {Kwang Sik} and Jin, {Sung Ho} and Lee, {Dae Won} and Lee, {Won Gyu} and Jung, {Soon Young} and Yu, Heonchang",
year = "2003",
month = "12",
day = "1",
language = "English",
volume = "2659",
pages = "286--296",
journal = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
issn = "0302-9743",
publisher = "Springer Verlag",
}

TY - JOUR

T1 - A fault tolerance service for QoS in grid computing

AU - Lee, Hwa Min

AU - Chung, Kwang Sik

AU - Jin, Sung Ho

AU - Lee, Dae Won

AU - Lee, Won Gyu

AU - Jung, Soon Young

AU - Yu, Heonchang

PY - 2003/12/1

Y1 - 2003/12/1

N2 - This paper proposes a fault tolerance service that satisfies QoS requirements in grid computing. The probability of failure in grid computing is higher than in traditional parallel computing. Because resource failures can be fatal to job execution, a fault tolerance service is essential in grid computing. Grid services are also often expected to meet minimum levels of quality of service (QoS) for desirable operation. However, the Globus Toolkit does not provide a fault tolerance service that supports fault detection and fault management and satisfies QoS requirements. To provide such a service, we expand the definition of failure to cover failures such as process failure, processor failure, and network failure. We then propose a fault detection service and a fault management service and present simulation results.

UR - http://www.scopus.com/inward/record.url?scp=33750625842&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33750625842&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:33750625842

VL - 2659

SP - 286

EP - 296

JO - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

JF - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SN - 0302-9743

ER -