FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK

The development of hardware accelerators for deep learning is increasing rapidly with the purpose for flexibility to be applied to various deep learning architectures. Accelerators that are widely marketed today are accelerators with GPU-based architectures where developers encounter quite a l...

Full description

Saved in:

Bibliographic Details
Main Author:	Dwi Cahyo, Ardian
Format:	Theses
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/58018
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:58018
spelling	id-itb.:580182021-08-30T10:09:37ZFPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK Dwi Cahyo, Ardian Indonesia Theses Convolutional Neural Network, Convolution Layer, Hardware Accelerator INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/58018 The development of hardware accelerators for deep learning is increasing rapidly with the purpose for flexibility to be applied to various deep learning architectures. Accelerators that are widely marketed today are accelerators with GPU-based architectures where developers encounter quite a lot of disadvantages due to high power consumption. Therefore, further developments for hardware accelerators that can be applied to multiple CNN deep learning architectural models is needed. In this research, the authors will designed an accelerator to be applied to an FPGA. The accelerator will be used to handle processes at the convolution layer of the Convolutional Neural Network (CNN). Processes running on the system are carried out in parallel manners using several processing elements at once. The architecture tested with image data input with resolution of 512×512 pixels that divided into four segments with image resolution of 256×256 pixels. The system designed with target clock base 10 ns and capable to provide throughput of 1Gbyte/sec. Iteration time of the accelerator to process data is 670.980 ns per segment with system latency 7930 ns for 3×3 kernel size. furthermore, Iteration time for the system tested with kernel size of 2×2 is 660,750 ns with latency 7930 ns. The use of accelerators to handle the convolution process in the system is proved to be able to speed up the processing time with a very significant time difference. text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	The development of hardware accelerators for deep learning is increasing rapidly with the purpose for flexibility to be applied to various deep learning architectures. Accelerators that are widely marketed today are accelerators with GPU-based architectures where developers encounter quite a lot of disadvantages due to high power consumption. Therefore, further developments for hardware accelerators that can be applied to multiple CNN deep learning architectural models is needed. In this research, the authors will designed an accelerator to be applied to an FPGA. The accelerator will be used to handle processes at the convolution layer of the Convolutional Neural Network (CNN). Processes running on the system are carried out in parallel manners using several processing elements at once. The architecture tested with image data input with resolution of 512×512 pixels that divided into four segments with image resolution of 256×256 pixels. The system designed with target clock base 10 ns and capable to provide throughput of 1Gbyte/sec. Iteration time of the accelerator to process data is 670.980 ns per segment with system latency 7930 ns for 3×3 kernel size. furthermore, Iteration time for the system tested with kernel size of 2×2 is 660,750 ns with latency 7930 ns. The use of accelerators to handle the convolution process in the system is proved to be able to speed up the processing time with a very significant time difference.
format	Theses
author	Dwi Cahyo, Ardian
spellingShingle	Dwi Cahyo, Ardian FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
author_facet	Dwi Cahyo, Ardian
author_sort	Dwi Cahyo, Ardian
title	FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_short	FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_full	FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_fullStr	FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_full_unstemmed	FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_sort	fpga based hardware accelerator design for deep convolutional neural network
url	https://digilib.itb.ac.id/gdl/view/58018
_version_	1823644803872587776

FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK

Similar Items