Networks on chip for FPGAs
Reducing worst case routing latencies while delivering high throughput and low energy are key design concerns in the engineering of overlay packet-switched NoCs for FPGA fabrics. Deflection Torus (Hoplite), an efficient, remarkably lightweight, fast FPGA overlay NoC that is designed to be small and...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2016
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/66694 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-66694 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-666942023-03-03T20:34:23Z Networks on chip for FPGAs Agarwal, Shubham Nachiket Kapre School of Computer Engineering DRNTU::Engineering Reducing worst case routing latencies while delivering high throughput and low energy are key design concerns in the engineering of overlay packet-switched NoCs for FPGA fabrics. Deflection Torus (Hoplite), an efficient, remarkably lightweight, fast FPGA overlay NoC that is designed to be small and compact by (1) eliminating input buffers, and (2) reducing the cost of switch crossbar (by using unidirectional torus instead of bidirectional) that have traditionally limited speeds and imposed heavy resource costs in conventional FPGA overlay NOCs. Though it delivers high sustained bandwidths for various workloads and traffic patterns, it suffers from significantly higher worst case routing latencies due to deflections, particularly at large system sizes (where the deflection cost/penalty is high), when compared to classic buffered NoCs. To tackle this, we design Hierarchical Deflection Torus that (1) targets worst case latencies in deflection torus NoCs by separating deflections into two levels of the NoC, (2) delivers an FPGA-friendly design for deadlock freedom by providing physical escape channels in the lower levels. For instance with the 16x16 NoC, we reduce worst case deflection costs by 1.5–10x while simultaneously improving sustained rates by 1.5–2x and lowering energy requirements by 1.5–2x for a range of statistically-generated traffic patterns. Bachelor of Engineering (Computer Science) 2016-04-21T03:09:24Z 2016-04-21T03:09:24Z 2016 Final Year Project (FYP) http://hdl.handle.net/10356/66694 en Nanyang Technological University application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering |
spellingShingle |
DRNTU::Engineering Agarwal, Shubham Networks on chip for FPGAs |
description |
Reducing worst case routing latencies while delivering high throughput and low energy are key design concerns in the engineering of overlay packet-switched NoCs for FPGA fabrics.
Deflection Torus (Hoplite), an efficient, remarkably lightweight, fast FPGA overlay NoC that is designed to be small and compact by (1) eliminating input buffers, and (2) reducing the cost of switch crossbar (by using unidirectional torus instead of bidirectional) that have traditionally limited speeds and imposed heavy resource costs in conventional FPGA overlay NOCs. Though it delivers high sustained bandwidths for various workloads and traffic patterns, it suffers from significantly higher worst case routing latencies due to deflections, particularly at large system sizes (where the deflection cost/penalty is high), when compared to classic buffered NoCs.
To tackle this, we design Hierarchical Deflection Torus that (1) targets worst case latencies in deflection torus NoCs by separating deflections into two levels of the NoC, (2) delivers an FPGA-friendly design for deadlock freedom by providing physical escape channels in the lower levels. For instance with the 16x16 NoC, we reduce worst case deflection costs by 1.5–10x while simultaneously improving sustained rates by 1.5–2x and lowering energy requirements by 1.5–2x for a range of statistically-generated traffic patterns. |
author2 |
Nachiket Kapre |
author_facet |
Nachiket Kapre Agarwal, Shubham |
format |
Final Year Project |
author |
Agarwal, Shubham |
author_sort |
Agarwal, Shubham |
title |
Networks on chip for FPGAs |
title_short |
Networks on chip for FPGAs |
title_full |
Networks on chip for FPGAs |
title_fullStr |
Networks on chip for FPGAs |
title_full_unstemmed |
Networks on chip for FPGAs |
title_sort |
networks on chip for fpgas |
publishDate |
2016 |
url |
http://hdl.handle.net/10356/66694 |
_version_ |
1759854625213644800 |