Framework for mapping computing-in-memory to basic neural networks
With the advent of the era of big data, the application of neural networks on edge devices has received extensive attention. However, the traditional Von Neumann architecture shows the disadvantages of high latency, low throughput, and decreasing energy efficiency in the data-intensive algorithms, s...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Coursework |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/159014 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-159014 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1590142022-06-05T11:39:00Z Framework for mapping computing-in-memory to basic neural networks Shang, Hongyang Kim Tae Hyoung School of Electrical and Electronic Engineering THKIM@ntu.edu.sg Engineering::Electrical and electronic engineering With the advent of the era of big data, the application of neural networks on edge devices has received extensive attention. However, the traditional Von Neumann architecture shows the disadvantages of high latency, low throughput, and decreasing energy efficiency in the data-intensive algorithms, so it is of great significance to develop new computing architectures. Computing-in-memory architecture has been proposed as a practical neural network accelerator with the natural advantage for multiply-accumulate (MAC) operations caused by its parallel computing structure. At present, most of the research on CIM chips focuses on the development of the memory elements and the design of computing circuits, and less work is done on automated tools that support the CIM chip design. Therefore, this paper proposes a software framework for mapping CIM to basic neural networks. The mapping framework is a semi-automatic data mapping workflow, which is mainly composed of two sub-tasks: the neural network quantization and the neural network mapping. It can achieve quantization with arbitrary bit-width precision, and perform flexible data mapping scheme according to the design of CIM macros. This work can help CIM chip developers to verify the calculation results of chips, and promote the development of CIM chip automation tools. Master of Science (Electronics) 2022-06-05T11:39:00Z 2022-06-05T11:39:00Z 2022 Thesis-Master by Coursework Shang, H. (2022). Framework for mapping computing-in-memory to basic neural networks. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159014 https://hdl.handle.net/10356/159014 en ISM-DISS-03044 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Shang, Hongyang Framework for mapping computing-in-memory to basic neural networks |
description |
With the advent of the era of big data, the application of neural networks on edge devices has received extensive attention. However, the traditional Von Neumann architecture shows the disadvantages of high latency, low throughput, and decreasing energy efficiency in the data-intensive algorithms, so it is of great significance to develop new computing architectures. Computing-in-memory architecture has been proposed as a practical neural network accelerator with the natural advantage for multiply-accumulate (MAC) operations caused by its parallel computing structure. At present, most of the research on CIM chips focuses on the development of the memory elements and the design of computing circuits, and less work is done on automated tools that support the CIM chip design. Therefore, this paper proposes a software framework for mapping CIM to basic neural networks. The mapping framework is a semi-automatic data mapping workflow, which is mainly composed of two sub-tasks: the neural network quantization and the neural network mapping. It can achieve quantization with arbitrary bit-width precision, and perform flexible data mapping scheme according to the design of CIM macros. This work can help CIM chip developers to verify the calculation results of chips, and promote the development of CIM chip automation tools. |
author2 |
Kim Tae Hyoung |
author_facet |
Kim Tae Hyoung Shang, Hongyang |
format |
Thesis-Master by Coursework |
author |
Shang, Hongyang |
author_sort |
Shang, Hongyang |
title |
Framework for mapping computing-in-memory to basic neural networks |
title_short |
Framework for mapping computing-in-memory to basic neural networks |
title_full |
Framework for mapping computing-in-memory to basic neural networks |
title_fullStr |
Framework for mapping computing-in-memory to basic neural networks |
title_full_unstemmed |
Framework for mapping computing-in-memory to basic neural networks |
title_sort |
framework for mapping computing-in-memory to basic neural networks |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/159014 |
_version_ |
1735491255329095680 |