Fault tolerance grid scheduling with checkpoint based on ant colony system

Task resubmission and checkpointing are among the popular techniques for providing fault tolerance in grid computing. However, because side-by-side comparisons are lacking, it is unclear which technique would avoid degrading system performance while also providing fault toleran...
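
The trade-off between the two techniques can be illustrated with a minimal sketch. It is not the authors' algorithm: the function names, the parameters (`interval`, `q0`, `beta`, `rho`), the single-failure cost model, the absence of checkpoint overhead and the success-based reward are all assumptions made for illustration. The first two functions compare the time a task consumes when it restarts from scratch with the time it consumes when it resumes from its last checkpoint; the last two show a generic Ant Colony System style resource choice and pheromone update driven by execution outcomes.

```python
import random


def time_with_resubmission(work, fail_after):
    """Time units spent when the task restarts from zero after one failure."""
    return fail_after + work


def time_with_checkpoint(work, fail_after, interval):
    """Time units spent when the task resumes from its last checkpoint.

    Assumption: checkpoints are written every `interval` units at no cost.
    """
    last_checkpoint = (fail_after // interval) * interval
    return fail_after + (work - last_checkpoint)


def choose_resource(pheromone, speed, q0=0.9, beta=2.0, rng=random):
    """ACS-style pseudo-random proportional choice among resources."""
    scores = {r: pheromone[r] * speed[r] ** beta for r in pheromone}
    if rng.random() < q0:
        # Exploit: take the resource with the highest score.
        return max(scores, key=scores.get)
    # Explore: sample a resource with probability proportional to its score.
    names = list(scores)
    return rng.choices(names, weights=[scores[n] for n in names])[0]


def update_pheromone(pheromone, resource, succeeded, rho=0.1):
    """Evaporate, then reinforce by outcome (hypothetical history term)."""
    reward = 1.0 if succeeded else 0.0
    pheromone[resource] = (1 - rho) * pheromone[resource] + rho * reward


# A 100-unit task that fails once after 70 units of execution:
print(time_with_resubmission(100, 70))    # 170
print(time_with_checkpoint(100, 70, 20))  # 110
```

In this toy model the checkpointed task loses only the work done since the last checkpoint, and a resource that fails sees its pheromone decay, which makes it less likely to be selected for the next assignment or resubmission.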

Bibliographic Details
Main Authors: Bukhari, Saufi, Ku-Mahamud, Ku Ruhana, Morino, Hiroaki
Format: Article
Language: English
Published: Science Publications 2017
Subjects: QA75 Electronic computers. Computer science
Online Access: http://repo.uum.edu.my/27875/1/JCS%2013%208%202017%20363%20370.pdf
http://repo.uum.edu.my/27875/
http://doi.org/10.3844/jcssp.2017.363.370
Institution: Universiti Utara Malaysia
id my.uum.repo.27875
record_format eprints
spelling my.uum.repo.27875 2020-11-11T06:06:11Z http://repo.uum.edu.my/27875/ Fault tolerance grid scheduling with checkpoint based on ant colony system Bukhari, Saufi Ku-Mahamud, Ku Ruhana Morino, Hiroaki QA75 Electronic computers. Computer science Task resubmission and checkpointing are among the popular techniques for providing fault tolerance in grid computing. However, because side-by-side comparisons are lacking, it is unclear which technique provides fault tolerance without degrading system performance. This study proposes Dynamic ACS-based Fault Tolerance for grid computing, which combines resubmission to a new resource, checkpointing and the use of resource execution history, with the aim of reducing execution and task processing time and increasing the success rate in a grid environment. The proposed algorithm is compared with other relevant algorithms in terms of execution time, success rate and average processing time. The results suggest that the proposed algorithm, with improved task resubmission, checkpointing and an extended pheromone update formula, performs better in managing execution failures and in selecting resources during task assignment or resubmission. Science Publications 2017 Article PeerReviewed application/pdf en http://repo.uum.edu.my/27875/1/JCS%2013%208%202017%20363%20370.pdf Bukhari, Saufi and Ku-Mahamud, Ku Ruhana and Morino, Hiroaki (2017) Fault tolerance grid scheduling with checkpoint based on ant colony system. Journal of Computer Science, 13 (8). pp. 363-370. ISSN 1549-3636 http://doi.org/10.3844/jcssp.2017.363.370 doi:10.3844/jcssp.2017.363.370
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Institutional Repository
url_provider http://repo.uum.edu.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Bukhari, Saufi
Ku-Mahamud, Ku Ruhana
Morino, Hiroaki
Fault tolerance grid scheduling with checkpoint based on ant colony system
description Task resubmission and checkpointing are among the popular techniques for providing fault tolerance in grid computing. However, because side-by-side comparisons are lacking, it is unclear which technique provides fault tolerance without degrading system performance. This study proposes Dynamic ACS-based Fault Tolerance for grid computing, which combines resubmission to a new resource, checkpointing and the use of resource execution history, with the aim of reducing execution and task processing time and increasing the success rate in a grid environment. The proposed algorithm is compared with other relevant algorithms in terms of execution time, success rate and average processing time. The results suggest that the proposed algorithm, with improved task resubmission, checkpointing and an extended pheromone update formula, performs better in managing execution failures and in selecting resources during task assignment or resubmission.
format Article
author Bukhari, Saufi
Ku-Mahamud, Ku Ruhana
Morino, Hiroaki
title Fault tolerance grid scheduling with checkpoint based on ant colony system
publisher Science Publications
publishDate 2017
url http://repo.uum.edu.my/27875/1/JCS%2013%208%202017%20363%20370.pdf
http://repo.uum.edu.my/27875/
http://doi.org/10.3844/jcssp.2017.363.370
_version_ 1684655812117528576