Efficient and fault tolerant HLA-based simulation

Distributed simulation subdivides a complex simulation (federation) into a group of simulation components (federates) and executes them in distributed manner. The High Level Architecture (HLA), an IEEE 1516 standard, provides a general framework for developing large-scale distributed simulations. Th...

Full description

Saved in:
Bibliographic Details
Main Author: Li, Zengxiang
Other Authors: Stephen John Turner
Format: Theses and Dissertations
Language:English
Published: 2012
Subjects:
Online Access:https://hdl.handle.net/10356/48170
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-48170
record_format dspace
spelling sg-ntu-dr.10356-481702023-03-04T00:48:12Z Efficient and fault tolerant HLA-based simulation Li, Zengxiang Stephen John Turner Cai Wentong School of Computer Engineering Parallel and Distributed Computing Centre DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation Distributed simulation subdivides a complex simulation (federation) into a group of simulation components (federates) and executes them in distributed manner. The High Level Architecture (HLA), an IEEE 1516 standard, provides a general framework for developing large-scale distributed simulations. The Runtime Infrastructure (RTI) is a middleware that controls the communication among federates according to the HLA interface specification. The simulation executions may involve a large number of computationally intensive federates and thus are time and resource consuming. What is worse, these federates may be subject to crash-stop and Byzantine failures and the risk of federation failure increases with the federation scale. In this thesis, we propose mechanisms to support efficient and fault tolerant HLA-based simulation by exploiting the advantages of decoupled federate architecture, in which a federate connects to federation through its corresponding Decoupled RTI Component (DRC). Workload imbalance generally leads to poor distributed simulation performance. To achieve load balancing, we propose to migrate federates from heavily-loaded computing nodes to lightly-loaded ones. Using the decoupled federate architecture, only needs the federate to be migrated to the destination computing node; whereas the DRC can stay at the same place and keep the connection to the federation. One-phase migration protocol is first proposed to illustrate the federate migration process. Then, two-phases and relay-based migration protocols are further developed to reduce migration overhead by overlapping federate migration with continuous federate execution. DOCTOR OF PHILOSOPHY (SCE) 2012-03-26T09:04:04Z 2012-03-26T09:04:04Z 2012 2012 Thesis Li, Z. X. (2012). Efficient and fault tolerant HLA-based simulation. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/48170 10.32657/10356/48170 en 177 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation
spellingShingle DRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementation
Li, Zengxiang
Efficient and fault tolerant HLA-based simulation
description Distributed simulation subdivides a complex simulation (federation) into a group of simulation components (federates) and executes them in distributed manner. The High Level Architecture (HLA), an IEEE 1516 standard, provides a general framework for developing large-scale distributed simulations. The Runtime Infrastructure (RTI) is a middleware that controls the communication among federates according to the HLA interface specification. The simulation executions may involve a large number of computationally intensive federates and thus are time and resource consuming. What is worse, these federates may be subject to crash-stop and Byzantine failures and the risk of federation failure increases with the federation scale. In this thesis, we propose mechanisms to support efficient and fault tolerant HLA-based simulation by exploiting the advantages of decoupled federate architecture, in which a federate connects to federation through its corresponding Decoupled RTI Component (DRC). Workload imbalance generally leads to poor distributed simulation performance. To achieve load balancing, we propose to migrate federates from heavily-loaded computing nodes to lightly-loaded ones. Using the decoupled federate architecture, only needs the federate to be migrated to the destination computing node; whereas the DRC can stay at the same place and keep the connection to the federation. One-phase migration protocol is first proposed to illustrate the federate migration process. Then, two-phases and relay-based migration protocols are further developed to reduce migration overhead by overlapping federate migration with continuous federate execution.
author2 Stephen John Turner
author_facet Stephen John Turner
Li, Zengxiang
format Theses and Dissertations
author Li, Zengxiang
author_sort Li, Zengxiang
title Efficient and fault tolerant HLA-based simulation
title_short Efficient and fault tolerant HLA-based simulation
title_full Efficient and fault tolerant HLA-based simulation
title_fullStr Efficient and fault tolerant HLA-based simulation
title_full_unstemmed Efficient and fault tolerant HLA-based simulation
title_sort efficient and fault tolerant hla-based simulation
publishDate 2012
url https://hdl.handle.net/10356/48170
_version_ 1759856776565489664