Fault Tolerance for Parallel Applications through Replication
An efficient fault-tolerant model for parallel computing on workstation clusters, based on replication, is proposed. The model is built on top of a runtime system that supports resource allocation for parallel applications running on heterogeneous workstation clusters. According to...
Main Author: | SHUM, Kam Hong |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
1997
|
Subjects: | Databases and Information Systems; Numerical Analysis and Scientific Computing |
Online Access: | https://ink.library.smu.edu.sg/sis_research/1054 http://dx.doi.org/10.1109/ICICS.1997.652234 |
Institution: | Singapore Management University |
id |
sg-smu-ink.sis_research-2053 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-2053; 2010-12-22T08:24:06Z; Fault Tolerance for Parallel Applications through Replication; SHUM, Kam Hong; An efficient fault-tolerant model for parallel computing on workstation clusters, based on replication, is proposed. The model is built on top of a runtime system that supports resource allocation for parallel applications running on heterogeneous workstation clusters. According to the results of resource allocation, replicated parallel applications can minimize their resource consumption through runtime reconfiguration. In addition, checkpointed states are transferred only among replicated applications, so no expensive disk read/write operations are required.; 1997-09-09T07:00:00Z; text; https://ink.library.smu.edu.sg/sis_research/1054; info:doi/10.1109/ICICS.1997.652234; http://dx.doi.org/10.1109/ICICS.1997.652234; Research Collection School Of Computing and Information Systems; eng; Institutional Knowledge at Singapore Management University; Databases and Information Systems; Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems; Numerical Analysis and Scientific Computing |
spellingShingle |
Databases and Information Systems; Numerical Analysis and Scientific Computing; SHUM, Kam Hong; Fault Tolerance for Parallel Applications through Replication |
description |
An efficient fault-tolerant model for parallel computing on workstation clusters, based on replication, is proposed. The model is built on top of a runtime system that supports resource allocation for parallel applications running on heterogeneous workstation clusters. According to the results of resource allocation, replicated parallel applications can minimize their resource consumption through runtime reconfiguration. In addition, checkpointed states are transferred only among replicated applications, so no expensive disk read/write operations are required. |
format |
text |
author |
SHUM, Kam Hong |
title |
Fault Tolerance for Parallel Applications through Replication |
title_sort |
fault tolerance for parallel applications through replication |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
1997 |
url |
https://ink.library.smu.edu.sg/sis_research/1054 http://dx.doi.org/10.1109/ICICS.1997.652234 |