Fault Tolerant Cluster Computing through Replication
Long-lived parallel applications running on workstation clusters are vulnerable to single-node or multiple-node failures. Fault recovery is therefore required to prevent premature program termination. However, much of the runtime overhead imposed by fault tolerance schemes is generally due to the co...
Main Author: | SHUM, Kam Hong |
---|---|
Format: | text |
Language: | English |
Published: | Institutional Knowledge at Singapore Management University, 1997 |
Online Access: | https://ink.library.smu.edu.sg/sis_research/1053 |
Institution: | Singapore Management University |
Similar Items
- Fault Tolerance for Parallel Applications through Replication
  by: SHUM, Kam Hong
  Published: (1997)
- Effective Fault Tolerance for Agent-Based Cluster Computing
  by: SHUM, Kam Hong
  Published: (1999)
- Runtime Support for Replicated Parallel Simulators of an ATM Network on Workstation Clusters
  by: SHUM, Kam Hong, et al.
  Published: (1996)
- Adaptive Distributed Simulation for Computationally Intensive Modelling
  by: SHUM, Kam Hong
  Published: (1995)
- Replicating Parallel Simulation on Heterogeneous Clusters
  by: SHUM, Kam Hong
  Published: (1998)