FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK

The development of hardware accelerators for deep learning is increasing rapidly with the purpose for flexibility to be applied to various deep learning architectures. Accelerators that are widely marketed today are accelerators with GPU-based architectures where developers encounter quite a l...

Full description

Saved in:
Bibliographic Details
Main Author: Dwi Cahyo, Ardian
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/58018
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:58018
spelling id-itb.:580182021-08-30T10:09:37ZFPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK Dwi Cahyo, Ardian Indonesia Theses Convolutional Neural Network, Convolution Layer, Hardware Accelerator INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/58018 The development of hardware accelerators for deep learning is increasing rapidly with the purpose for flexibility to be applied to various deep learning architectures. Accelerators that are widely marketed today are accelerators with GPU-based architectures where developers encounter quite a lot of disadvantages due to high power consumption. Therefore, further developments for hardware accelerators that can be applied to multiple CNN deep learning architectural models is needed. In this research, the authors will designed an accelerator to be applied to an FPGA. The accelerator will be used to handle processes at the convolution layer of the Convolutional Neural Network (CNN). Processes running on the system are carried out in parallel manners using several processing elements at once. The architecture tested with image data input with resolution of 512×512 pixels that divided into four segments with image resolution of 256×256 pixels. The system designed with target clock base 10 ns and capable to provide throughput of 1Gbyte/sec. Iteration time of the accelerator to process data is 670.980 ns per segment with system latency 7930 ns for 3×3 kernel size. furthermore, Iteration time for the system tested with kernel size of 2×2 is 660,750 ns with latency 7930 ns. The use of accelerators to handle the convolution process in the system is proved to be able to speed up the processing time with a very significant time difference. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description The development of hardware accelerators for deep learning is increasing rapidly with the purpose for flexibility to be applied to various deep learning architectures. Accelerators that are widely marketed today are accelerators with GPU-based architectures where developers encounter quite a lot of disadvantages due to high power consumption. Therefore, further developments for hardware accelerators that can be applied to multiple CNN deep learning architectural models is needed. In this research, the authors will designed an accelerator to be applied to an FPGA. The accelerator will be used to handle processes at the convolution layer of the Convolutional Neural Network (CNN). Processes running on the system are carried out in parallel manners using several processing elements at once. The architecture tested with image data input with resolution of 512×512 pixels that divided into four segments with image resolution of 256×256 pixels. The system designed with target clock base 10 ns and capable to provide throughput of 1Gbyte/sec. Iteration time of the accelerator to process data is 670.980 ns per segment with system latency 7930 ns for 3×3 kernel size. furthermore, Iteration time for the system tested with kernel size of 2×2 is 660,750 ns with latency 7930 ns. The use of accelerators to handle the convolution process in the system is proved to be able to speed up the processing time with a very significant time difference.
format Theses
author Dwi Cahyo, Ardian
spellingShingle Dwi Cahyo, Ardian
FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
author_facet Dwi Cahyo, Ardian
author_sort Dwi Cahyo, Ardian
title FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_short FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_full FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_fullStr FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_full_unstemmed FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
title_sort fpga based hardware accelerator design for deep convolutional neural network
url https://digilib.itb.ac.id/gdl/view/58018
_version_ 1822002822492717056