Retraining SNN conversions: CNN to SNN for audio classification tasks
Efficient yet powerful models are in high demand for its portability and affordability. Amongst other methods such as model-pruning, is limiting neural network operations to sparse event-driven spikes: Spiking Neural Networks (SNNs) aims to unravel a new direction in machine learning research. A si...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/167383 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Efficient yet powerful models are in high demand for its portability and affordability. Amongst other methods such as model-pruning, is limiting neural network operations to sparse event-driven spikes: Spiking Neural Networks (SNNs) aims to unravel a new direction in machine learning research.
A significant amount of SNN literature straddles upon mature works of artificial neural networks (ANNs) by migrating its architecture and parameters into SNNs, optimizing the migration to retain as much performance as possible. We spearhead a novel approach: the architecture is migrated and retrained from scratch.
We hypothesize that this new direction will unravel concepts that currently bottlenecks improvements in the field of SNN conversions. Furthermore, alike Transfer Learning, inspire future efforts of fine-tuning a well converted model through training.
This paper presents our analysis of training converted Convolutional Neural Networks (CNNs) to SNNs on audio classification models. Results show that
(1) SNN conversions consistently underperforms CNNs marginally during training, however we also show that model complexity has a possible association with this margin.
(2) SNN converts doesn't necessarily approach the performance of its CNN counterparts asymptotically by increasing the number of time-steps.
(3) SNN training from scratch is costly and impractical with current hardware and dedicated SNN optimization techniques are necessary.
(4) Enabling the SNN membrane decay rate to be learned doesn't significantly affect performance.
This paper provides valuable insights into the perspective of retraining converted SNNs for audio classification, and serves as a reference for future studies and hardware implementation benchmarks. |
---|