Limits of Statically-Scheduled Token Dataflow Processing
FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront,...
Saved in:
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2015
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/81240 http://hdl.handle.net/10220/39193 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-81240 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-812402020-05-28T07:18:05Z Limits of Statically-Scheduled Token Dataflow Processing Kapre, Nachiket Siddhartha School of Computer Engineering 2014 Fourth Workshop on Data-Flow Execution Models for Extreme Scale Computing Computer Science and Engineering FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront, either at compile time or at the start of execution, we can employ static scheduling techniques to further improve performance and enhance compute density of the dataflow hardware. In this paper, we identify the costs and performance trends of both static and dynamic scheduling approaches when considering hardware acceleration of SPICE device equations and Sparse LU factorization in circuit graphs. While the experiments are limited to a case study, the hardware design and dataflow compiler are general and can be extended to other problems and instances where dataflow computing may be applicable. With this study, we hope to develop a quantitative basis for the design of a hybrid dataflow architecture that combines both static and dynamic scheduling techniques. We observe a performance benefit of 2 - 4× and a resource utilization saving of 2 - 3× in favor of statically scheduled hardware. Accepted version 2015-12-21T07:17:41Z 2019-12-06T14:26:19Z 2015-12-21T07:17:41Z 2019-12-06T14:26:19Z 2014 Conference Paper Kapre, N., & Siddhartha. (2014). Limits of Statically-Scheduled Token Dataflow Processing. 2014 Fourth Workshop on Data-Flow Execution Models for Extreme Scale Computing, 1-8. https://hdl.handle.net/10356/81240 http://hdl.handle.net/10220/39193 10.1109/DFM.2014.21 en © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/DFM.2014.21]. 8 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Computer Science and Engineering |
spellingShingle |
Computer Science and Engineering Kapre, Nachiket Siddhartha Limits of Statically-Scheduled Token Dataflow Processing |
description |
FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront, either at compile time or at the start of execution, we can employ static scheduling techniques to further improve performance and enhance compute density of the dataflow hardware. In this paper, we identify the costs and performance trends of both static and dynamic scheduling approaches when considering hardware acceleration of SPICE device equations and Sparse LU factorization in circuit graphs. While the experiments are limited to a case study, the hardware design and dataflow compiler are general and can be extended to other problems and instances where dataflow computing may be applicable. With this study, we hope to develop a quantitative basis for the design of a hybrid dataflow architecture that combines both static and dynamic scheduling techniques. We observe a performance benefit of 2 - 4× and a resource utilization saving of 2 - 3× in favor of statically scheduled hardware. |
author2 |
School of Computer Engineering |
author_facet |
School of Computer Engineering Kapre, Nachiket Siddhartha |
format |
Conference or Workshop Item |
author |
Kapre, Nachiket Siddhartha |
author_sort |
Kapre, Nachiket |
title |
Limits of Statically-Scheduled Token Dataflow Processing |
title_short |
Limits of Statically-Scheduled Token Dataflow Processing |
title_full |
Limits of Statically-Scheduled Token Dataflow Processing |
title_fullStr |
Limits of Statically-Scheduled Token Dataflow Processing |
title_full_unstemmed |
Limits of Statically-Scheduled Token Dataflow Processing |
title_sort |
limits of statically-scheduled token dataflow processing |
publishDate |
2015 |
url |
https://hdl.handle.net/10356/81240 http://hdl.handle.net/10220/39193 |
_version_ |
1681057596078817280 |