Memory interface design for integrating accelerators with Xilinx Zynq platform

Field-Programmable Gate Arrays (FPGAs) are silicon chips with several configurable logic blocks connected to each other with the help of programmable interconnects. Numerous processes can be carried out on an FPGA in parallel. In this report we present a set of experiments for interfacing accelerato...

Full description

Saved in:
Bibliographic Details
Main Author: Srivastava, Shreya
Other Authors: Douglas Leslie Maskell
Format: Final Year Project
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/74341
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-74341
record_format dspace
spelling sg-ntu-dr.10356-743412023-03-03T20:26:53Z Memory interface design for integrating accelerators with Xilinx Zynq platform Srivastava, Shreya Douglas Leslie Maskell School of Computer Science and Engineering DRNTU::Engineering Field-Programmable Gate Arrays (FPGAs) are silicon chips with several configurable logic blocks connected to each other with the help of programmable interconnects. Numerous processes can be carried out on an FPGA in parallel. In this report we present a set of experiments for interfacing accelerators (HLS generated RTL) within the Xilinx Zynq Platform using Xillybus. For Xillybus characterization, the first experiment performed is the single pipe loopback experiment which consists of FIFOs connected in a loopback. It can be concluded from this experiment that the throughput becomes nearly constant for larger input data size. The second experiment explores FPGA coprocessing using trigonometric sine function calculation as an example. This experiment introduces the concept of a synthesized function, a wrapper and a host program. We observe that sending a single set of data for processing is not efficient because of the I/O overhead and hardware and software latencies. In practical host programs, an array of structures is usually sent as the input. The third experiment conducted is the color convert experiment which takes in input color values and returns the output values after processing. A two-step optimization process is implemented for better performance and efficiency. The saturation value for the number of inputs was close to 204,000 32-bit integers. The final experiment conducted is a practical application of the color convert example which takes an image as the input and generates the output image after processing. This takes a short amount of time considering the size of input data and shows the importance of the optimization steps used in the experiment. Bachelor of Engineering (Computer Engineering) 2018-05-16T07:18:27Z 2018-05-16T07:18:27Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74341 en Nanyang Technological University 56 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
spellingShingle DRNTU::Engineering
Srivastava, Shreya
Memory interface design for integrating accelerators with Xilinx Zynq platform
description Field-Programmable Gate Arrays (FPGAs) are silicon chips with several configurable logic blocks connected to each other with the help of programmable interconnects. Numerous processes can be carried out on an FPGA in parallel. In this report we present a set of experiments for interfacing accelerators (HLS generated RTL) within the Xilinx Zynq Platform using Xillybus. For Xillybus characterization, the first experiment performed is the single pipe loopback experiment which consists of FIFOs connected in a loopback. It can be concluded from this experiment that the throughput becomes nearly constant for larger input data size. The second experiment explores FPGA coprocessing using trigonometric sine function calculation as an example. This experiment introduces the concept of a synthesized function, a wrapper and a host program. We observe that sending a single set of data for processing is not efficient because of the I/O overhead and hardware and software latencies. In practical host programs, an array of structures is usually sent as the input. The third experiment conducted is the color convert experiment which takes in input color values and returns the output values after processing. A two-step optimization process is implemented for better performance and efficiency. The saturation value for the number of inputs was close to 204,000 32-bit integers. The final experiment conducted is a practical application of the color convert example which takes an image as the input and generates the output image after processing. This takes a short amount of time considering the size of input data and shows the importance of the optimization steps used in the experiment.
author2 Douglas Leslie Maskell
author_facet Douglas Leslie Maskell
Srivastava, Shreya
format Final Year Project
author Srivastava, Shreya
author_sort Srivastava, Shreya
title Memory interface design for integrating accelerators with Xilinx Zynq platform
title_short Memory interface design for integrating accelerators with Xilinx Zynq platform
title_full Memory interface design for integrating accelerators with Xilinx Zynq platform
title_fullStr Memory interface design for integrating accelerators with Xilinx Zynq platform
title_full_unstemmed Memory interface design for integrating accelerators with Xilinx Zynq platform
title_sort memory interface design for integrating accelerators with xilinx zynq platform
publishDate 2018
url http://hdl.handle.net/10356/74341
_version_ 1759855917478707200