Neural network compression techniques for out-of-distribution detection

One of the key challenges in deploying ML models on embedded systems are the numerous resource constraints, for instance, memory footprint, response time, and power consumption. Such real-time systems require resource-efficient models with low inference time while maintaining reasonable accuracy. In...

Full description

Saved in:

Bibliographic Details
Main Author:	Bansal, Aditya
Other Authors:	Arvind Easwaran
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Computer science and engineering
Online Access:	https://hdl.handle.net/10356/159148
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-159148
record_format	dspace
spelling	sg-ntu-dr.10356-1591482022-06-10T02:54:14Z Neural network compression techniques for out-of-distribution detection Bansal, Aditya Arvind Easwaran School of Computer Science and Engineering arvinde@ntu.edu.sg Engineering::Computer science and engineering One of the key challenges in deploying ML models on embedded systems are the numerous resource constraints, for instance, memory footprint, response time, and power consumption. Such real-time systems require resource-efficient models with low inference time while maintaining reasonable accuracy. In the context of OOD detection, despite the detection model having a high classification accuracy, if the inference time is too high, the system might be rendered ineffectual. There is significant literature on a number of neural network compression techniques. However, the majority of studies have performed offline testing on datasets like CIFAR. Few works have been implemented on some dedicated hardware or FPGAs. By implementing the above techniques on a real-time embedded system of DuckieBot, we studied the performance of these methods, particularly for the task of OOD detection. The compression techniques of pruning, quantization, and knowledge distillation have been experimented with, and analyzed on numerous metrics, for execution time, memory usage, reconstruction loss, and OOD metrics like ROC curve, True Positive, and False Positive Rates. Bachelor of Engineering (Computer Science) 2022-06-10T02:54:13Z 2022-06-10T02:54:13Z 2022 Final Year Project (FYP) Bansal, A. (2022). Neural network compression techniques for out-of-distribution detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159148 https://hdl.handle.net/10356/159148 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering
spellingShingle	Engineering::Computer science and engineering Bansal, Aditya Neural network compression techniques for out-of-distribution detection
description	One of the key challenges in deploying ML models on embedded systems are the numerous resource constraints, for instance, memory footprint, response time, and power consumption. Such real-time systems require resource-efficient models with low inference time while maintaining reasonable accuracy. In the context of OOD detection, despite the detection model having a high classification accuracy, if the inference time is too high, the system might be rendered ineffectual. There is significant literature on a number of neural network compression techniques. However, the majority of studies have performed offline testing on datasets like CIFAR. Few works have been implemented on some dedicated hardware or FPGAs. By implementing the above techniques on a real-time embedded system of DuckieBot, we studied the performance of these methods, particularly for the task of OOD detection. The compression techniques of pruning, quantization, and knowledge distillation have been experimented with, and analyzed on numerous metrics, for execution time, memory usage, reconstruction loss, and OOD metrics like ROC curve, True Positive, and False Positive Rates.
author2	Arvind Easwaran
author_facet	Arvind Easwaran Bansal, Aditya
format	Final Year Project
author	Bansal, Aditya
author_sort	Bansal, Aditya
title	Neural network compression techniques for out-of-distribution detection
title_short	Neural network compression techniques for out-of-distribution detection
title_full	Neural network compression techniques for out-of-distribution detection
title_fullStr	Neural network compression techniques for out-of-distribution detection
title_full_unstemmed	Neural network compression techniques for out-of-distribution detection
title_sort	neural network compression techniques for out-of-distribution detection
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/159148
_version_	1735491093752971264

Neural network compression techniques for out-of-distribution detection

Similar Items