A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks

In this work, we present a novel 8T static random access memory (SRAM)-based compute-in-memory (CIM) macro for processing neural networks with high energy efficiency. The proposed 8T bitcell is free from disturb issues thanks to the decoupled read channels by adding two extra transistors to the stan...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yu, Chengshuo, Yoo, Taegeun, Chai, Kevin Tshun Chuan, Kim, Tony Tae-Hyoung, Kim, Bongjin
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2022
Subjects:	Engineering::Electrical and electronic engineering Random Access Memory Neurons
Online Access:	https://hdl.handle.net/10356/163744
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-163744
record_format	dspace
spelling	sg-ntu-dr.10356-1637442022-12-15T08:16:08Z A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks Yu, Chengshuo Yoo, Taegeun Chai, Kevin Tshun Chuan Kim, Tony Tae-Hyoung Kim, Bongjin School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Random Access Memory Neurons In this work, we present a novel 8T static random access memory (SRAM)-based compute-in-memory (CIM) macro for processing neural networks with high energy efficiency. The proposed 8T bitcell is free from disturb issues thanks to the decoupled read channels by adding two extra transistors to the standard 6T bitcell. A 128 $\times $ 128 8T SRAM array offers massively parallel binary multiply and accumulate (MAC) operations with 64 $\times $ binary inputs (0/1) and 64 $\times $ 128 binary weights (+1/-1). After parallel MAC operations, 128 column-based neurons generate 128 $\times $ 1-5 bit outputs in parallel. The proposed column-based neuron comprises 64 $\times $ bitcells for dot-product, 32 $\times $ bitcells for analog-to-digital converter (ADC), and 32 $\times $ bitcells for offset calibration. The column ADC with 32 $\times $ replica SRAM bitcells converts the analog MAC results (i.e., a differential read bitline (RBL/RBLb) voltage) to the 1-5 bit output code by sweeping their reference levels in 1-31 cycles (i.e., $2^{N}$ -1 cycles for $N$ -bit ADC). The measured linearity results [differential nonlinearity (DNL) and integral nonlinearity (INL)] are +0.314/-0.256 least significant bit (LSB) and + 0.27/-0.116 LSB, respectively, after offset calibration. The simulated image classification results are 96.37% for Mixed National Institute of Standards and Technology database (MNIST) using a multi-layer perceptron (MLP) with two hidden layers, 87.1%/82.66% for CIFAR-10 using VGG-like/ResNet-18 convolutional neural networks (CNNs), demonstrating slight accuracy degradations (0.67%-1.34%) compared with the software baseline. A test chip with a 16K 8T SRAM bitcell array is fabricated using a 65-nm process. The measured energy efficiency is 490-15.8 TOPS/W for 1-5 bit ADC resolution using 0.45-/0.8-V core supply. Agency for Science, Technology and Research (ASTAR) This work was supported by the Agency for Science, Technology and Research (ASTAR), Singapore, under Grant A18A7b0058. 2022-12-15T08:16:07Z 2022-12-15T08:16:07Z 2022 Journal Article Yu, C., Yoo, T., Chai, K. T. C., Kim, T. T. & Kim, B. (2022). A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks. IEEE Journal of Solid-State Circuits, 57(11), 3466-3476. https://dx.doi.org/10.1109/JSSC.2022.3162602 0018-9200 https://hdl.handle.net/10356/163744 10.1109/JSSC.2022.3162602 2-s2.0-85128273106 11 57 3466 3476 en A18A7b0058 IEEE Journal of Solid-State Circuits © 2022 IEEE. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering Random Access Memory Neurons
spellingShingle	Engineering::Electrical and electronic engineering Random Access Memory Neurons Yu, Chengshuo Yoo, Taegeun Chai, Kevin Tshun Chuan Kim, Tony Tae-Hyoung Kim, Bongjin A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks
description	In this work, we present a novel 8T static random access memory (SRAM)-based compute-in-memory (CIM) macro for processing neural networks with high energy efficiency. The proposed 8T bitcell is free from disturb issues thanks to the decoupled read channels by adding two extra transistors to the standard 6T bitcell. A 128 $\times $ 128 8T SRAM array offers massively parallel binary multiply and accumulate (MAC) operations with 64 $\times $ binary inputs (0/1) and 64 $\times $ 128 binary weights (+1/-1). After parallel MAC operations, 128 column-based neurons generate 128 $\times $ 1-5 bit outputs in parallel. The proposed column-based neuron comprises 64 $\times $ bitcells for dot-product, 32 $\times $ bitcells for analog-to-digital converter (ADC), and 32 $\times $ bitcells for offset calibration. The column ADC with 32 $\times $ replica SRAM bitcells converts the analog MAC results (i.e., a differential read bitline (RBL/RBLb) voltage) to the 1-5 bit output code by sweeping their reference levels in 1-31 cycles (i.e., $2^{N}$ -1 cycles for $N$ -bit ADC). The measured linearity results [differential nonlinearity (DNL) and integral nonlinearity (INL)] are +0.314/-0.256 least significant bit (LSB) and + 0.27/-0.116 LSB, respectively, after offset calibration. The simulated image classification results are 96.37% for Mixed National Institute of Standards and Technology database (MNIST) using a multi-layer perceptron (MLP) with two hidden layers, 87.1%/82.66% for CIFAR-10 using VGG-like/ResNet-18 convolutional neural networks (CNNs), demonstrating slight accuracy degradations (0.67%-1.34%) compared with the software baseline. A test chip with a 16K 8T SRAM bitcell array is fabricated using a 65-nm process. The measured energy efficiency is 490-15.8 TOPS/W for 1-5 bit ADC resolution using 0.45-/0.8-V core supply.
author2	School of Electrical and Electronic Engineering
author_facet	School of Electrical and Electronic Engineering Yu, Chengshuo Yoo, Taegeun Chai, Kevin Tshun Chuan Kim, Tony Tae-Hyoung Kim, Bongjin
format	Article
author	Yu, Chengshuo Yoo, Taegeun Chai, Kevin Tshun Chuan Kim, Tony Tae-Hyoung Kim, Bongjin
author_sort	Yu, Chengshuo
title	A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks
title_short	A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks
title_full	A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks
title_fullStr	A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks
title_full_unstemmed	A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks
title_sort	65-nm 8t sram compute-in-memory macro with column adcs for processing neural networks
publishDate	2022
url	https://hdl.handle.net/10356/163744
_version_	1753801085424762880

A 65-nm 8T SRAM compute-in-memory macro with column ADCs for processing neural networks

Similar Items