Limits of Statically-Scheduled Token Dataflow Processing

FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront,...

Full description

Saved in:
Bibliographic Details
Main Authors: Kapre, Nachiket, Siddhartha
Other Authors: School of Computer Engineering
Format: Conference or Workshop Item
Language:English
Published: 2015
Subjects:
Online Access:https://hdl.handle.net/10356/81240
http://hdl.handle.net/10220/39193
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-81240
record_format dspace
spelling sg-ntu-dr.10356-812402020-05-28T07:18:05Z Limits of Statically-Scheduled Token Dataflow Processing Kapre, Nachiket Siddhartha School of Computer Engineering 2014 Fourth Workshop on Data-Flow Execution Models for Extreme Scale Computing Computer Science and Engineering FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront, either at compile time or at the start of execution, we can employ static scheduling techniques to further improve performance and enhance compute density of the dataflow hardware. In this paper, we identify the costs and performance trends of both static and dynamic scheduling approaches when considering hardware acceleration of SPICE device equations and Sparse LU factorization in circuit graphs. While the experiments are limited to a case study, the hardware design and dataflow compiler are general and can be extended to other problems and instances where dataflow computing may be applicable. With this study, we hope to develop a quantitative basis for the design of a hybrid dataflow architecture that combines both static and dynamic scheduling techniques. We observe a performance benefit of 2 - 4× and a resource utilization saving of 2 - 3× in favor of statically scheduled hardware. Accepted version 2015-12-21T07:17:41Z 2019-12-06T14:26:19Z 2015-12-21T07:17:41Z 2019-12-06T14:26:19Z 2014 Conference Paper Kapre, N., & Siddhartha. (2014). Limits of Statically-Scheduled Token Dataflow Processing. 2014 Fourth Workshop on Data-Flow Execution Models for Extreme Scale Computing, 1-8. https://hdl.handle.net/10356/81240 http://hdl.handle.net/10220/39193 10.1109/DFM.2014.21 en © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/DFM.2014.21]. 8 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Computer Science and Engineering
spellingShingle Computer Science and Engineering
Kapre, Nachiket
Siddhartha
Limits of Statically-Scheduled Token Dataflow Processing
description FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront, either at compile time or at the start of execution, we can employ static scheduling techniques to further improve performance and enhance compute density of the dataflow hardware. In this paper, we identify the costs and performance trends of both static and dynamic scheduling approaches when considering hardware acceleration of SPICE device equations and Sparse LU factorization in circuit graphs. While the experiments are limited to a case study, the hardware design and dataflow compiler are general and can be extended to other problems and instances where dataflow computing may be applicable. With this study, we hope to develop a quantitative basis for the design of a hybrid dataflow architecture that combines both static and dynamic scheduling techniques. We observe a performance benefit of 2 - 4× and a resource utilization saving of 2 - 3× in favor of statically scheduled hardware.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Kapre, Nachiket
Siddhartha
format Conference or Workshop Item
author Kapre, Nachiket
Siddhartha
author_sort Kapre, Nachiket
title Limits of Statically-Scheduled Token Dataflow Processing
title_short Limits of Statically-Scheduled Token Dataflow Processing
title_full Limits of Statically-Scheduled Token Dataflow Processing
title_fullStr Limits of Statically-Scheduled Token Dataflow Processing
title_full_unstemmed Limits of Statically-Scheduled Token Dataflow Processing
title_sort limits of statically-scheduled token dataflow processing
publishDate 2015
url https://hdl.handle.net/10356/81240
http://hdl.handle.net/10220/39193
_version_ 1681057596078817280