Framework for mapping computing-in-memory to basic neural networks

With the advent of the era of big data, the application of neural networks on edge devices has received extensive attention. However, the traditional Von Neumann architecture shows the disadvantages of high latency, low throughput, and decreasing energy efficiency in the data-intensive algorithms, s...

Full description

Saved in:

Bibliographic Details
Main Author:	Shang, Hongyang
Other Authors:	Kim Tae Hyoung
Format:	Thesis-Master by Coursework
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/159014
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-159014
record_format	dspace
spelling	sg-ntu-dr.10356-1590142022-06-05T11:39:00Z Framework for mapping computing-in-memory to basic neural networks Shang, Hongyang Kim Tae Hyoung School of Electrical and Electronic Engineering THKIM@ntu.edu.sg Engineering::Electrical and electronic engineering With the advent of the era of big data, the application of neural networks on edge devices has received extensive attention. However, the traditional Von Neumann architecture shows the disadvantages of high latency, low throughput, and decreasing energy efficiency in the data-intensive algorithms, so it is of great significance to develop new computing architectures. Computing-in-memory architecture has been proposed as a practical neural network accelerator with the natural advantage for multiply-accumulate (MAC) operations caused by its parallel computing structure. At present, most of the research on CIM chips focuses on the development of the memory elements and the design of computing circuits, and less work is done on automated tools that support the CIM chip design. Therefore, this paper proposes a software framework for mapping CIM to basic neural networks. The mapping framework is a semi-automatic data mapping workflow, which is mainly composed of two sub-tasks: the neural network quantization and the neural network mapping. It can achieve quantization with arbitrary bit-width precision, and perform flexible data mapping scheme according to the design of CIM macros. This work can help CIM chip developers to verify the calculation results of chips, and promote the development of CIM chip automation tools. Master of Science (Electronics) 2022-06-05T11:39:00Z 2022-06-05T11:39:00Z 2022 Thesis-Master by Coursework Shang, H. (2022). Framework for mapping computing-in-memory to basic neural networks. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159014 https://hdl.handle.net/10356/159014 en ISM-DISS-03044 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Shang, Hongyang Framework for mapping computing-in-memory to basic neural networks
description	With the advent of the era of big data, the application of neural networks on edge devices has received extensive attention. However, the traditional Von Neumann architecture shows the disadvantages of high latency, low throughput, and decreasing energy efficiency in the data-intensive algorithms, so it is of great significance to develop new computing architectures. Computing-in-memory architecture has been proposed as a practical neural network accelerator with the natural advantage for multiply-accumulate (MAC) operations caused by its parallel computing structure. At present, most of the research on CIM chips focuses on the development of the memory elements and the design of computing circuits, and less work is done on automated tools that support the CIM chip design. Therefore, this paper proposes a software framework for mapping CIM to basic neural networks. The mapping framework is a semi-automatic data mapping workflow, which is mainly composed of two sub-tasks: the neural network quantization and the neural network mapping. It can achieve quantization with arbitrary bit-width precision, and perform flexible data mapping scheme according to the design of CIM macros. This work can help CIM chip developers to verify the calculation results of chips, and promote the development of CIM chip automation tools.
author2	Kim Tae Hyoung
author_facet	Kim Tae Hyoung Shang, Hongyang
format	Thesis-Master by Coursework
author	Shang, Hongyang
author_sort	Shang, Hongyang
title	Framework for mapping computing-in-memory to basic neural networks
title_short	Framework for mapping computing-in-memory to basic neural networks
title_full	Framework for mapping computing-in-memory to basic neural networks
title_fullStr	Framework for mapping computing-in-memory to basic neural networks
title_full_unstemmed	Framework for mapping computing-in-memory to basic neural networks
title_sort	framework for mapping computing-in-memory to basic neural networks
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/159014
_version_	1735491255329095680

Framework for mapping computing-in-memory to basic neural networks

Similar Items