Neural network compression techniques for out-of-distribution detection

One of the key challenges in deploying ML models on embedded systems are the numerous resource constraints, for instance, memory footprint, response time, and power consumption. Such real-time systems require resource-efficient models with low inference time while maintaining reasonable accuracy. In...

Full description

Saved in:
Bibliographic Details
Main Author: Bansal, Aditya
Other Authors: Arvind Easwaran
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/159148
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-159148
record_format dspace
spelling sg-ntu-dr.10356-1591482022-06-10T02:54:14Z Neural network compression techniques for out-of-distribution detection Bansal, Aditya Arvind Easwaran School of Computer Science and Engineering arvinde@ntu.edu.sg Engineering::Computer science and engineering One of the key challenges in deploying ML models on embedded systems are the numerous resource constraints, for instance, memory footprint, response time, and power consumption. Such real-time systems require resource-efficient models with low inference time while maintaining reasonable accuracy. In the context of OOD detection, despite the detection model having a high classification accuracy, if the inference time is too high, the system might be rendered ineffectual. There is significant literature on a number of neural network compression techniques. However, the majority of studies have performed offline testing on datasets like CIFAR. Few works have been implemented on some dedicated hardware or FPGAs. By implementing the above techniques on a real-time embedded system of DuckieBot, we studied the performance of these methods, particularly for the task of OOD detection. The compression techniques of pruning, quantization, and knowledge distillation have been experimented with, and analyzed on numerous metrics, for execution time, memory usage, reconstruction loss, and OOD metrics like ROC curve, True Positive, and False Positive Rates. Bachelor of Engineering (Computer Science) 2022-06-10T02:54:13Z 2022-06-10T02:54:13Z 2022 Final Year Project (FYP) Bansal, A. (2022). Neural network compression techniques for out-of-distribution detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159148 https://hdl.handle.net/10356/159148 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
spellingShingle Engineering::Computer science and engineering
Bansal, Aditya
Neural network compression techniques for out-of-distribution detection
description One of the key challenges in deploying ML models on embedded systems are the numerous resource constraints, for instance, memory footprint, response time, and power consumption. Such real-time systems require resource-efficient models with low inference time while maintaining reasonable accuracy. In the context of OOD detection, despite the detection model having a high classification accuracy, if the inference time is too high, the system might be rendered ineffectual. There is significant literature on a number of neural network compression techniques. However, the majority of studies have performed offline testing on datasets like CIFAR. Few works have been implemented on some dedicated hardware or FPGAs. By implementing the above techniques on a real-time embedded system of DuckieBot, we studied the performance of these methods, particularly for the task of OOD detection. The compression techniques of pruning, quantization, and knowledge distillation have been experimented with, and analyzed on numerous metrics, for execution time, memory usage, reconstruction loss, and OOD metrics like ROC curve, True Positive, and False Positive Rates.
author2 Arvind Easwaran
author_facet Arvind Easwaran
Bansal, Aditya
format Final Year Project
author Bansal, Aditya
author_sort Bansal, Aditya
title Neural network compression techniques for out-of-distribution detection
title_short Neural network compression techniques for out-of-distribution detection
title_full Neural network compression techniques for out-of-distribution detection
title_fullStr Neural network compression techniques for out-of-distribution detection
title_full_unstemmed Neural network compression techniques for out-of-distribution detection
title_sort neural network compression techniques for out-of-distribution detection
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/159148
_version_ 1735491093752971264