Limits of Statically-Scheduled Token Dataflow Processing

FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront,...

Full description

Saved in:

Bibliographic Details
Main Authors:	Kapre, Nachiket, Siddhartha
Other Authors:	School of Computer Engineering
Format:	Conference or Workshop Item
Language:	English
Published:	2015
Subjects:	Computer Science and Engineering
Online Access:	https://hdl.handle.net/10356/81240 http://hdl.handle.net/10220/39193
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-81240
record_format	dspace
spelling	sg-ntu-dr.10356-812402020-05-28T07:18:05Z Limits of Statically-Scheduled Token Dataflow Processing Kapre, Nachiket Siddhartha School of Computer Engineering 2014 Fourth Workshop on Data-Flow Execution Models for Extreme Scale Computing Computer Science and Engineering FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront, either at compile time or at the start of execution, we can employ static scheduling techniques to further improve performance and enhance compute density of the dataflow hardware. In this paper, we identify the costs and performance trends of both static and dynamic scheduling approaches when considering hardware acceleration of SPICE device equations and Sparse LU factorization in circuit graphs. While the experiments are limited to a case study, the hardware design and dataflow compiler are general and can be extended to other problems and instances where dataflow computing may be applicable. With this study, we hope to develop a quantitative basis for the design of a hybrid dataflow architecture that combines both static and dynamic scheduling techniques. We observe a performance benefit of 2 - 4× and a resource utilization saving of 2 - 3× in favor of statically scheduled hardware. Accepted version 2015-12-21T07:17:41Z 2019-12-06T14:26:19Z 2015-12-21T07:17:41Z 2019-12-06T14:26:19Z 2014 Conference Paper Kapre, N., & Siddhartha. (2014). Limits of Statically-Scheduled Token Dataflow Processing. 2014 Fourth Workshop on Data-Flow Execution Models for Extreme Scale Computing, 1-8. https://hdl.handle.net/10356/81240 http://hdl.handle.net/10220/39193 10.1109/DFM.2014.21 en © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/DFM.2014.21]. 8 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	Computer Science and Engineering
spellingShingle	Computer Science and Engineering Kapre, Nachiket Siddhartha Limits of Statically-Scheduled Token Dataflow Processing
description	FPGA-based token dataflow processing has been shown to accelerate hard-to-parallelize problems exhibiting irregular dataflow parallelism by as much as an order of magnitude when compared to conventional compute organizations. However, when the structure of the dataflow computation is known upfront, either at compile time or at the start of execution, we can employ static scheduling techniques to further improve performance and enhance compute density of the dataflow hardware. In this paper, we identify the costs and performance trends of both static and dynamic scheduling approaches when considering hardware acceleration of SPICE device equations and Sparse LU factorization in circuit graphs. While the experiments are limited to a case study, the hardware design and dataflow compiler are general and can be extended to other problems and instances where dataflow computing may be applicable. With this study, we hope to develop a quantitative basis for the design of a hybrid dataflow architecture that combines both static and dynamic scheduling techniques. We observe a performance benefit of 2 - 4× and a resource utilization saving of 2 - 3× in favor of statically scheduled hardware.
author2	School of Computer Engineering
author_facet	School of Computer Engineering Kapre, Nachiket Siddhartha
format	Conference or Workshop Item
author	Kapre, Nachiket Siddhartha
author_sort	Kapre, Nachiket
title	Limits of Statically-Scheduled Token Dataflow Processing
title_short	Limits of Statically-Scheduled Token Dataflow Processing
title_full	Limits of Statically-Scheduled Token Dataflow Processing
title_fullStr	Limits of Statically-Scheduled Token Dataflow Processing
title_full_unstemmed	Limits of Statically-Scheduled Token Dataflow Processing
title_sort	limits of statically-scheduled token dataflow processing
publishDate	2015
url	https://hdl.handle.net/10356/81240 http://hdl.handle.net/10220/39193
_version_	1681057596078817280

Limits of Statically-Scheduled Token Dataflow Processing

Similar Items