Cellular automata simulations on a FPGA cluster

The emergence of multicore architectures and the chip industry’s plan to roll out hundreds of cores per die sometime in the near future might have triggered the evolution of von Neumann architectures towards a parallel processing paradigm. The capability to have hundreds of cores per die is exciting...

Full description

Saved in:
Bibliographic Details
Main Authors: Murtaza, S., Hoekstra, Alfons G., Sloot, Peter M. A.
Other Authors: School of Computer Engineering
Format: Article
Language:English
Published: 2013
Online Access:https://hdl.handle.net/10356/95896
http://hdl.handle.net/10220/10125
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-95896
record_format dspace
spelling sg-ntu-dr.10356-958962020-05-28T07:17:21Z Cellular automata simulations on a FPGA cluster Murtaza, S. Hoekstra, Alfons G. Sloot, Peter M. A. School of Computer Engineering The emergence of multicore architectures and the chip industry’s plan to roll out hundreds of cores per die sometime in the near future might have triggered the evolution of von Neumann architectures towards a parallel processing paradigm. The capability to have hundreds of cores per die is exciting, but how optimally we are able to utilize such a resource remains a challenge. Since there are no straightforward solutions we seek inspiration from relevant scientific processes. Cellular automata which are inherently decentralized and spatially extended structures provide a potential candidate among parallel processing alternatives. The availability of spatial parallelism on field programmable gate arrays make them the ideal platform to investigate cellular automata systems as potential parallel processing paradigms on multicore architectures. This article presents a massively parallel implementation for a floating-point-based cellular automata using special purpose hardware such as Field Programmable Gate Array (FPGAs). The challenge is to best map an application to the underlying many-core architecture and address issues such as inter-core communication, scalability, and flexibility both in terms of hardware and software. Maxwell — a 64-node FPGA supercomputer, is used for accelerator implementations that range from a single to a multiple FPGA-enabled system. A performance model is proposed and demonstrated to closely reproduce measured execution times. The performance model enables identification of the main sources of overhead and suggests improvements to the architecture and implementation of the lattice Boltzmann method and compute-bound cellular automata in general. Further, a 2 million cell 2DQ9 lattice Boltzmann method lattice with periodic boundary conditions, simulated using a multiple FPGA chip accelerator implementation, is presented. The performance model shows how the FPGA-enabled PC cluster is the preferred multiple FPGA organization over the multiple FPGA-based PC setup. Latency hiding is fully exploited for PC cluster-based system implementations and demonstrated using system profiling. 2013-06-10T07:17:59Z 2019-12-06T19:23:04Z 2013-06-10T07:17:59Z 2019-12-06T19:23:04Z 2010 2010 Journal Article Murtaza, S., Hoekstra, A. G., & Sloot, P. M. A. (2010). Cellular Automata Simulations on a FPGA cluster. International Journal of High Performance Computing Applications, 25(2), 193-204. https://hdl.handle.net/10356/95896 http://hdl.handle.net/10220/10125 10.1177/1094342010383138 en International journal of high performance computing applications © 2010 The Authors.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
description The emergence of multicore architectures and the chip industry’s plan to roll out hundreds of cores per die sometime in the near future might have triggered the evolution of von Neumann architectures towards a parallel processing paradigm. The capability to have hundreds of cores per die is exciting, but how optimally we are able to utilize such a resource remains a challenge. Since there are no straightforward solutions we seek inspiration from relevant scientific processes. Cellular automata which are inherently decentralized and spatially extended structures provide a potential candidate among parallel processing alternatives. The availability of spatial parallelism on field programmable gate arrays make them the ideal platform to investigate cellular automata systems as potential parallel processing paradigms on multicore architectures. This article presents a massively parallel implementation for a floating-point-based cellular automata using special purpose hardware such as Field Programmable Gate Array (FPGAs). The challenge is to best map an application to the underlying many-core architecture and address issues such as inter-core communication, scalability, and flexibility both in terms of hardware and software. Maxwell — a 64-node FPGA supercomputer, is used for accelerator implementations that range from a single to a multiple FPGA-enabled system. A performance model is proposed and demonstrated to closely reproduce measured execution times. The performance model enables identification of the main sources of overhead and suggests improvements to the architecture and implementation of the lattice Boltzmann method and compute-bound cellular automata in general. Further, a 2 million cell 2DQ9 lattice Boltzmann method lattice with periodic boundary conditions, simulated using a multiple FPGA chip accelerator implementation, is presented. The performance model shows how the FPGA-enabled PC cluster is the preferred multiple FPGA organization over the multiple FPGA-based PC setup. Latency hiding is fully exploited for PC cluster-based system implementations and demonstrated using system profiling.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Murtaza, S.
Hoekstra, Alfons G.
Sloot, Peter M. A.
format Article
author Murtaza, S.
Hoekstra, Alfons G.
Sloot, Peter M. A.
spellingShingle Murtaza, S.
Hoekstra, Alfons G.
Sloot, Peter M. A.
Cellular automata simulations on a FPGA cluster
author_sort Murtaza, S.
title Cellular automata simulations on a FPGA cluster
title_short Cellular automata simulations on a FPGA cluster
title_full Cellular automata simulations on a FPGA cluster
title_fullStr Cellular automata simulations on a FPGA cluster
title_full_unstemmed Cellular automata simulations on a FPGA cluster
title_sort cellular automata simulations on a fpga cluster
publishDate 2013
url https://hdl.handle.net/10356/95896
http://hdl.handle.net/10220/10125
_version_ 1681057165307019264