Fault Tolerance for Parallel Applications through Replication

Based on the technique of replication, an efficient fault-tolerant model for parallel computing on workstation clusters is proposed. The model is built on top of a runtime system which supports resource allocation for parallel applications running on heterogeneous workstation clusters. According to...

全面介紹

Saved in:
書目詳細資料
主要作者: SHUM, Kam Hong
格式: text
語言:English
出版: Institutional Knowledge at Singapore Management University 1997
主題:
在線閱讀:https://ink.library.smu.edu.sg/sis_research/1054
http://dx.doi.org/10.1109/ICICS.1997.652234
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Singapore Management University
語言: English
id sg-smu-ink.sis_research-2053
record_format dspace
spelling sg-smu-ink.sis_research-20532010-12-22T08:24:06Z Fault Tolerance for Parallel Applications through Replication SHUM, Kam Hong Based on the technique of replication, an efficient fault-tolerant model for parallel computing on workstation clusters is proposed. The model is built on top of a runtime system which supports resource allocation for parallel applications running on heterogeneous workstation clusters. According to the results of resource allocation, replicated parallel applications can minimize their resource consumption by runtime reconfiguration. Besides, checkpointed states only transfer among replicated applications, no expensive disk read/write operations are therefore required. 1997-09-09T07:00:00Z text https://ink.library.smu.edu.sg/sis_research/1054 info:doi/10.1109/ICICS.1997.652234 http://dx.doi.org/10.1109/ICICS.1997.652234 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Databases and Information Systems
Numerical Analysis and Scientific Computing
SHUM, Kam Hong
Fault Tolerance for Parallel Applications through Replication
description Based on the technique of replication, an efficient fault-tolerant model for parallel computing on workstation clusters is proposed. The model is built on top of a runtime system which supports resource allocation for parallel applications running on heterogeneous workstation clusters. According to the results of resource allocation, replicated parallel applications can minimize their resource consumption by runtime reconfiguration. Besides, checkpointed states only transfer among replicated applications, no expensive disk read/write operations are therefore required.
format text
author SHUM, Kam Hong
author_facet SHUM, Kam Hong
author_sort SHUM, Kam Hong
title Fault Tolerance for Parallel Applications through Replication
title_short Fault Tolerance for Parallel Applications through Replication
title_full Fault Tolerance for Parallel Applications through Replication
title_fullStr Fault Tolerance for Parallel Applications through Replication
title_full_unstemmed Fault Tolerance for Parallel Applications through Replication
title_sort fault tolerance for parallel applications through replication
publisher Institutional Knowledge at Singapore Management University
publishDate 1997
url https://ink.library.smu.edu.sg/sis_research/1054
http://dx.doi.org/10.1109/ICICS.1997.652234
_version_ 1770570834588467200