Siamese networks for speaker identification on resource-constrained platforms

This paper investigates the implementation of a lightweight Siamese neural network for enhancing speaker identification accuracy and inference speed in embedded systems. Integrating speaker identification into embedded systems can improve portability and versatility. Siamese neural networks achieve...

Full description

Saved in:

Bibliographic Details
Main Authors:	Lim, Jun Jie, Ahmad Zabidi, Muhammad Mun’im, Abdul Manan, Shahidatul Sadiah, Ab. Rahman, Ab. Al-Hadi
Format:	Conference or Workshop Item
Language:	English
Published:	2023
Subjects:	TK Electrical engineering. Electronics Nuclear engineering
Online Access:	http://eprints.utm.my/107836/1/MuhammadMunim2023_SiameseNetworksforSpeakerIdentification.pdf http://eprints.utm.my/107836/ http://dx.doi.org/10.1088/1742-6596/2622/1/012014
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Teknologi Malaysia
Language:	English

id	my.utm.107836
record_format	eprints
spelling	my.utm.1078362024-10-08T06:13:33Z http://eprints.utm.my/107836/ Siamese networks for speaker identification on resource-constrained platforms Lim, Jun Jie Ahmad Zabidi, Muhammad Mun’im Abdul Manan, Shahidatul Sadiah Ab. Rahman, Ab. Al-Hadi TK Electrical engineering. Electronics Nuclear engineering This paper investigates the implementation of a lightweight Siamese neural network for enhancing speaker identification accuracy and inference speed in embedded systems. Integrating speaker identification into embedded systems can improve portability and versatility. Siamese neural networks achieve speaker identification by comparing input voice samples to reference voices in a database, effectively extracting features and classifying speakers accurately. Considering the trade-off between accuracy and complexity, as well as hardware constraints in embedded systems, various neural networks could be applied to speaker identification. This paper compares the incorporation of CNN architectures targeted for embedded systems, MCUNet, SqueezeNet and MobileNetv2, to implement Siamese neural networks on a Raspberry Pi. Our experiments demonstrate that MCUNet achieves 85% accuracy with a 0.23-second inference time. In comparison, the larger MobileNetv2 attains 84.5% accuracy with a 0.32-second inference time. Additionally, contrastive loss was superior to binary cross-entropy loss in the Siamese neural network. The system using contrastive loss had almost 68% lower loss scores, resulting in a more stable performance and more accurate predictions. In conclusion, this paper establishes that an appropriate lightweight Siamese neural network, combined with contrastive loss, can significantly improve speaker identification accuracy, and enable efficient deployment on resource-constrained platforms. 2023 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utm.my/107836/1/MuhammadMunim2023_SiameseNetworksforSpeakerIdentification.pdf Lim, Jun Jie and Ahmad Zabidi, Muhammad Mun’im and Abdul Manan, Shahidatul Sadiah and Ab. Rahman, Ab. Al-Hadi (2023) Siamese networks for speaker identification on resource-constrained platforms. In: 1st International Conference on Electronic and Computer Engineering, ECE 2023, 4 July 2023 - 5 July 2023, Virtual, UTM Johor Bahru, Johor, Malaysia. http://dx.doi.org/10.1088/1742-6596/2622/1/012014
institution	Universiti Teknologi Malaysia
building	UTM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Teknologi Malaysia
content_source	UTM Institutional Repository
url_provider	http://eprints.utm.my/
language	English
topic	TK Electrical engineering. Electronics Nuclear engineering
spellingShingle	TK Electrical engineering. Electronics Nuclear engineering Lim, Jun Jie Ahmad Zabidi, Muhammad Mun’im Abdul Manan, Shahidatul Sadiah Ab. Rahman, Ab. Al-Hadi Siamese networks for speaker identification on resource-constrained platforms
description	This paper investigates the implementation of a lightweight Siamese neural network for enhancing speaker identification accuracy and inference speed in embedded systems. Integrating speaker identification into embedded systems can improve portability and versatility. Siamese neural networks achieve speaker identification by comparing input voice samples to reference voices in a database, effectively extracting features and classifying speakers accurately. Considering the trade-off between accuracy and complexity, as well as hardware constraints in embedded systems, various neural networks could be applied to speaker identification. This paper compares the incorporation of CNN architectures targeted for embedded systems, MCUNet, SqueezeNet and MobileNetv2, to implement Siamese neural networks on a Raspberry Pi. Our experiments demonstrate that MCUNet achieves 85% accuracy with a 0.23-second inference time. In comparison, the larger MobileNetv2 attains 84.5% accuracy with a 0.32-second inference time. Additionally, contrastive loss was superior to binary cross-entropy loss in the Siamese neural network. The system using contrastive loss had almost 68% lower loss scores, resulting in a more stable performance and more accurate predictions. In conclusion, this paper establishes that an appropriate lightweight Siamese neural network, combined with contrastive loss, can significantly improve speaker identification accuracy, and enable efficient deployment on resource-constrained platforms.
format	Conference or Workshop Item
author	Lim, Jun Jie Ahmad Zabidi, Muhammad Mun’im Abdul Manan, Shahidatul Sadiah Ab. Rahman, Ab. Al-Hadi
author_facet	Lim, Jun Jie Ahmad Zabidi, Muhammad Mun’im Abdul Manan, Shahidatul Sadiah Ab. Rahman, Ab. Al-Hadi
author_sort	Lim, Jun Jie
title	Siamese networks for speaker identification on resource-constrained platforms
title_short	Siamese networks for speaker identification on resource-constrained platforms
title_full	Siamese networks for speaker identification on resource-constrained platforms
title_fullStr	Siamese networks for speaker identification on resource-constrained platforms
title_full_unstemmed	Siamese networks for speaker identification on resource-constrained platforms
title_sort	siamese networks for speaker identification on resource-constrained platforms
publishDate	2023
url	http://eprints.utm.my/107836/1/MuhammadMunim2023_SiameseNetworksforSpeakerIdentification.pdf http://eprints.utm.my/107836/ http://dx.doi.org/10.1088/1742-6596/2622/1/012014
_version_	1814043536362831872

Siamese networks for speaker identification on resource-constrained platforms

Similar Items