Custom FPGA-based soft-processors for sparse graph acceleration

FPGA-based soft processors customized for operations on sparse graphs can deliver significant performance improvements over conventional organizations (ARMv7 CPUs) for bulk synchronous sparse graph algorithms. We develop a stripped-down soft processor ISA to implement specific repetitive operations...

Full description

Saved in:

Bibliographic Details
Main Author:	Kapre, Nachiket
Other Authors:	School of Computer Engineering
Format:	Conference or Workshop Item
Language:	English
Published:	2015
Subjects:	Computer Science and Engineering
Online Access:	https://hdl.handle.net/10356/81243 http://hdl.handle.net/10220/39162
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-81243
record_format	dspace
spelling	sg-ntu-dr.10356-812432020-05-28T07:17:27Z Custom FPGA-based soft-processors for sparse graph acceleration Kapre, Nachiket School of Computer Engineering 2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) Computer Science and Engineering FPGA-based soft processors customized for operations on sparse graphs can deliver significant performance improvements over conventional organizations (ARMv7 CPUs) for bulk synchronous sparse graph algorithms. We develop a stripped-down soft processor ISA to implement specific repetitive operations on graph nodes and edges that are commonly observed in sparse graph computations. In the processing core, we provide hardware support for rapidly fetching and processing state of local graph nodes and edges through spatial address generators and zero-overhead loop iterators. We interconnect a 2D array of these lightweight processors with a packet-switched network-on-chip to enable fine-grained operand routing along the graph edges and provide custom send/receive instructions in the soft processor. We develop the processor RTL using Vivado High-Level Synthesis and also provide an assembler and compilation flow to configure the processor instruction and data memories. We outperform a Microblaze (100MHz on Zedboard) and an NIOS-II/f (100MHz on DE2-115) by 6× (single processor design) as well as the ARMv7 dual-core CPU on the Zynq SoCs by as much as 10× on the Xilinx ZC706 board (100 processor design) across a range of matrix datasets. Accepted version 2015-12-18T06:15:20Z 2019-12-06T14:26:23Z 2015-12-18T06:15:20Z 2019-12-06T14:26:23Z 2015 Conference Paper Kapre, N. (2015). Custom FPGA-based soft-processors for sparse graph acceleration. 2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP), 9-16. https://hdl.handle.net/10356/81243 http://hdl.handle.net/10220/39162 10.1109/ASAP.2015.7245698 en © 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/ASAP.2015.7245698]. 8 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	Computer Science and Engineering
spellingShingle	Computer Science and Engineering Kapre, Nachiket Custom FPGA-based soft-processors for sparse graph acceleration
description	FPGA-based soft processors customized for operations on sparse graphs can deliver significant performance improvements over conventional organizations (ARMv7 CPUs) for bulk synchronous sparse graph algorithms. We develop a stripped-down soft processor ISA to implement specific repetitive operations on graph nodes and edges that are commonly observed in sparse graph computations. In the processing core, we provide hardware support for rapidly fetching and processing state of local graph nodes and edges through spatial address generators and zero-overhead loop iterators. We interconnect a 2D array of these lightweight processors with a packet-switched network-on-chip to enable fine-grained operand routing along the graph edges and provide custom send/receive instructions in the soft processor. We develop the processor RTL using Vivado High-Level Synthesis and also provide an assembler and compilation flow to configure the processor instruction and data memories. We outperform a Microblaze (100MHz on Zedboard) and an NIOS-II/f (100MHz on DE2-115) by 6× (single processor design) as well as the ARMv7 dual-core CPU on the Zynq SoCs by as much as 10× on the Xilinx ZC706 board (100 processor design) across a range of matrix datasets.
author2	School of Computer Engineering
author_facet	School of Computer Engineering Kapre, Nachiket
format	Conference or Workshop Item
author	Kapre, Nachiket
author_sort	Kapre, Nachiket
title	Custom FPGA-based soft-processors for sparse graph acceleration
title_short	Custom FPGA-based soft-processors for sparse graph acceleration
title_full	Custom FPGA-based soft-processors for sparse graph acceleration
title_fullStr	Custom FPGA-based soft-processors for sparse graph acceleration
title_full_unstemmed	Custom FPGA-based soft-processors for sparse graph acceleration
title_sort	custom fpga-based soft-processors for sparse graph acceleration
publishDate	2015
url	https://hdl.handle.net/10356/81243 http://hdl.handle.net/10220/39162
_version_	1681057699040591872

Custom FPGA-based soft-processors for sparse graph acceleration

Similar Items