FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK
The development of hardware accelerators for deep learning is increasing rapidly with the purpose for flexibility to be applied to various deep learning architectures. Accelerators that are widely marketed today are accelerators with GPU-based architectures where developers encounter quite a l...
Saved in:
Main Author: | |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/58018 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:58018 |
---|---|
spelling |
id-itb.:580182021-08-30T10:09:37ZFPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK Dwi Cahyo, Ardian Indonesia Theses Convolutional Neural Network, Convolution Layer, Hardware Accelerator INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/58018 The development of hardware accelerators for deep learning is increasing rapidly with the purpose for flexibility to be applied to various deep learning architectures. Accelerators that are widely marketed today are accelerators with GPU-based architectures where developers encounter quite a lot of disadvantages due to high power consumption. Therefore, further developments for hardware accelerators that can be applied to multiple CNN deep learning architectural models is needed. In this research, the authors will designed an accelerator to be applied to an FPGA. The accelerator will be used to handle processes at the convolution layer of the Convolutional Neural Network (CNN). Processes running on the system are carried out in parallel manners using several processing elements at once. The architecture tested with image data input with resolution of 512×512 pixels that divided into four segments with image resolution of 256×256 pixels. The system designed with target clock base 10 ns and capable to provide throughput of 1Gbyte/sec. Iteration time of the accelerator to process data is 670.980 ns per segment with system latency 7930 ns for 3×3 kernel size. furthermore, Iteration time for the system tested with kernel size of 2×2 is 660,750 ns with latency 7930 ns. The use of accelerators to handle the convolution process in the system is proved to be able to speed up the processing time with a very significant time difference. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
The development of hardware accelerators for deep learning is increasing rapidly with the
purpose for flexibility to be applied to various deep learning architectures. Accelerators that are
widely marketed today are accelerators with GPU-based architectures where developers
encounter quite a lot of disadvantages due to high power consumption. Therefore, further
developments for hardware accelerators that can be applied to multiple CNN deep learning
architectural models is needed.
In this research, the authors will designed an accelerator to be applied to an FPGA. The
accelerator will be used to handle processes at the convolution layer of the Convolutional Neural
Network (CNN). Processes running on the system are carried out in parallel manners using several
processing elements at once. The architecture tested with image data input with resolution of
512×512 pixels that divided into four segments with image resolution of 256×256 pixels. The
system designed with target clock base 10 ns and capable to provide throughput of 1Gbyte/sec.
Iteration time of the accelerator to process data is 670.980 ns per segment with system latency
7930 ns for 3×3 kernel size. furthermore, Iteration time for the system tested with kernel size of
2×2 is 660,750 ns with latency 7930 ns. The use of accelerators to handle the convolution process
in the system is proved to be able to speed up the processing time with a very significant time
difference.
|
format |
Theses |
author |
Dwi Cahyo, Ardian |
spellingShingle |
Dwi Cahyo, Ardian FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK |
author_facet |
Dwi Cahyo, Ardian |
author_sort |
Dwi Cahyo, Ardian |
title |
FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK |
title_short |
FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK |
title_full |
FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK |
title_fullStr |
FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK |
title_full_unstemmed |
FPGA BASED HARDWARE ACCELERATOR DESIGN FOR DEEP CONVOLUTIONAL NEURAL NETWORK |
title_sort |
fpga based hardware accelerator design for deep convolutional neural network |
url |
https://digilib.itb.ac.id/gdl/view/58018 |
_version_ |
1822002822492717056 |