Neural network compression techniques for out-of-distribution detection
One of the key challenges in deploying ML models on embedded systems are the numerous resource constraints, for instance, memory footprint, response time, and power consumption. Such real-time systems require resource-efficient models with low inference time while maintaining reasonable accuracy. In...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/159148 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-159148 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1591482022-06-10T02:54:14Z Neural network compression techniques for out-of-distribution detection Bansal, Aditya Arvind Easwaran School of Computer Science and Engineering arvinde@ntu.edu.sg Engineering::Computer science and engineering One of the key challenges in deploying ML models on embedded systems are the numerous resource constraints, for instance, memory footprint, response time, and power consumption. Such real-time systems require resource-efficient models with low inference time while maintaining reasonable accuracy. In the context of OOD detection, despite the detection model having a high classification accuracy, if the inference time is too high, the system might be rendered ineffectual. There is significant literature on a number of neural network compression techniques. However, the majority of studies have performed offline testing on datasets like CIFAR. Few works have been implemented on some dedicated hardware or FPGAs. By implementing the above techniques on a real-time embedded system of DuckieBot, we studied the performance of these methods, particularly for the task of OOD detection. The compression techniques of pruning, quantization, and knowledge distillation have been experimented with, and analyzed on numerous metrics, for execution time, memory usage, reconstruction loss, and OOD metrics like ROC curve, True Positive, and False Positive Rates. Bachelor of Engineering (Computer Science) 2022-06-10T02:54:13Z 2022-06-10T02:54:13Z 2022 Final Year Project (FYP) Bansal, A. (2022). Neural network compression techniques for out-of-distribution detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159148 https://hdl.handle.net/10356/159148 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Bansal, Aditya Neural network compression techniques for out-of-distribution detection |
description |
One of the key challenges in deploying ML models on embedded systems are the numerous resource constraints, for instance, memory footprint, response time, and power consumption. Such real-time systems require resource-efficient models with low inference time while maintaining reasonable accuracy. In the context of OOD detection, despite the detection model having a high classification accuracy, if the inference time is too high, the system might be rendered ineffectual.
There is significant literature on a number of neural network compression techniques. However, the majority of studies have performed offline testing on datasets like CIFAR. Few works have been implemented on some dedicated hardware or FPGAs. By implementing the above techniques on a real-time embedded system of DuckieBot, we studied the performance of these methods, particularly for the task of OOD detection. The compression techniques of pruning, quantization, and knowledge distillation have been experimented with, and analyzed on numerous metrics, for execution time, memory usage, reconstruction loss, and OOD metrics like ROC curve, True Positive, and False Positive Rates. |
author2 |
Arvind Easwaran |
author_facet |
Arvind Easwaran Bansal, Aditya |
format |
Final Year Project |
author |
Bansal, Aditya |
author_sort |
Bansal, Aditya |
title |
Neural network compression techniques for out-of-distribution detection |
title_short |
Neural network compression techniques for out-of-distribution detection |
title_full |
Neural network compression techniques for out-of-distribution detection |
title_fullStr |
Neural network compression techniques for out-of-distribution detection |
title_full_unstemmed |
Neural network compression techniques for out-of-distribution detection |
title_sort |
neural network compression techniques for out-of-distribution detection |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/159148 |
_version_ |
1735491093752971264 |