L3AM: linear adaptive additive angular margin loss for video-based hand gesture authentication

Feature extractors significantly impact the performance of biometric systems. In the field of hand gesture authentication, existing studies focus on improving the model architectures and behavioral characteristic representation methods to enhance their feature extractors. However, loss functions, wh...

Full description

Saved in:
Bibliographic Details
Main Authors: Song, Wenwei, Kang, Wenxiong, Kong, Adam Wai Kin, Zhang, Yufeng, Qiao, Yitao
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/178949
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-178949
record_format dspace
spelling sg-ntu-dr.10356-1789492024-07-12T15:36:16Z L3AM: linear adaptive additive angular margin loss for video-based hand gesture authentication Song, Wenwei Kang, Wenxiong Kong, Adam Wai Kin Zhang, Yufeng Qiao, Yitao School of Computer Science and Engineering College of Computing and Data Science Computer and Information Science Biometrics Hand gesture authentication · Feature extractors significantly impact the performance of biometric systems. In the field of hand gesture authentication, existing studies focus on improving the model architectures and behavioral characteristic representation methods to enhance their feature extractors. However, loss functions, which can guide extractors to produce more discriminative identity features, are neglected. In this paper, we improve the margin-based Softmax loss functions, which are mainly designed for face authentication, in two aspects to form a new loss function for hand gesture authentication. First, we propose to replace the commonly used cosine function in the margin-based Softmax losses with a linear function to measure the similarity between identity features and proxies (the weight matrix of Softmax, which can be viewed as class centers). With the linear function, the main gradient magnitude decreases monotonically as the quality of the model improves during training, thus allowing the model to be quickly optimized in the early stage and precisely fine-tuned in the late stage. Second, we design an adaptive margin scheme to assign margin penalties to different samples according to their separability and the model quality in each iteration. Our adaptive margin scheme constrains the gradient magnitude. It can reduce radical (excessively large) gradient magnitudes and provide moderate (not too small) gradient magnitudes for model optimization, contributing to more stable training. The linear function and the adaptive margin scheme are complementary. Combining them, we obtain the proposed linear adaptive additive angular margin (L3AM) loss. To demonstrate the effectiveness of L3AM loss, we conduct extensive experiments on seven hand-related authentication datasets, compare it with 25 state-of-the-art (SOTA) loss functions, and apply it to eight SOTA hand gesture authentication models. The experimental results show that L3AM loss further improves the performance of the eight authentication models and outperforms the 25 losses. The code is available at https://github.com/SCUT-BIP-Lab/L3AM. Submitted/Accepted version This work was partially supported by the National Natural Science Foundation of China under Grant Nos. 62376100 and 61976095, the Natural Science Foundation of Guangdong Province of China under Grant No. 2022A1515010114, and China Scholarship Council under Grant No. 202206150104. 2024-07-11T08:40:50Z 2024-07-11T08:40:50Z 2024 Journal Article Song, W., Kang, W., Kong, A. W. K., Zhang, Y. & Qiao, Y. (2024). L3AM: linear adaptive additive angular margin loss for video-based hand gesture authentication. International Journal of Computer Vision. https://dx.doi.org/10.1007/s11263-024-02068-w 0920-5691 https://hdl.handle.net/10356/178949 10.1007/s11263-024-02068-w 2-s2.0-85192171323 en International Journal of Computer Vision © 2024 The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at http://doi.org/10.1007/s11263-024-02068-w. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Biometrics
Hand gesture authentication ·
spellingShingle Computer and Information Science
Biometrics
Hand gesture authentication ·
Song, Wenwei
Kang, Wenxiong
Kong, Adam Wai Kin
Zhang, Yufeng
Qiao, Yitao
L3AM: linear adaptive additive angular margin loss for video-based hand gesture authentication
description Feature extractors significantly impact the performance of biometric systems. In the field of hand gesture authentication, existing studies focus on improving the model architectures and behavioral characteristic representation methods to enhance their feature extractors. However, loss functions, which can guide extractors to produce more discriminative identity features, are neglected. In this paper, we improve the margin-based Softmax loss functions, which are mainly designed for face authentication, in two aspects to form a new loss function for hand gesture authentication. First, we propose to replace the commonly used cosine function in the margin-based Softmax losses with a linear function to measure the similarity between identity features and proxies (the weight matrix of Softmax, which can be viewed as class centers). With the linear function, the main gradient magnitude decreases monotonically as the quality of the model improves during training, thus allowing the model to be quickly optimized in the early stage and precisely fine-tuned in the late stage. Second, we design an adaptive margin scheme to assign margin penalties to different samples according to their separability and the model quality in each iteration. Our adaptive margin scheme constrains the gradient magnitude. It can reduce radical (excessively large) gradient magnitudes and provide moderate (not too small) gradient magnitudes for model optimization, contributing to more stable training. The linear function and the adaptive margin scheme are complementary. Combining them, we obtain the proposed linear adaptive additive angular margin (L3AM) loss. To demonstrate the effectiveness of L3AM loss, we conduct extensive experiments on seven hand-related authentication datasets, compare it with 25 state-of-the-art (SOTA) loss functions, and apply it to eight SOTA hand gesture authentication models. The experimental results show that L3AM loss further improves the performance of the eight authentication models and outperforms the 25 losses. The code is available at https://github.com/SCUT-BIP-Lab/L3AM.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Song, Wenwei
Kang, Wenxiong
Kong, Adam Wai Kin
Zhang, Yufeng
Qiao, Yitao
format Article
author Song, Wenwei
Kang, Wenxiong
Kong, Adam Wai Kin
Zhang, Yufeng
Qiao, Yitao
author_sort Song, Wenwei
title L3AM: linear adaptive additive angular margin loss for video-based hand gesture authentication
title_short L3AM: linear adaptive additive angular margin loss for video-based hand gesture authentication
title_full L3AM: linear adaptive additive angular margin loss for video-based hand gesture authentication
title_fullStr L3AM: linear adaptive additive angular margin loss for video-based hand gesture authentication
title_full_unstemmed L3AM: linear adaptive additive angular margin loss for video-based hand gesture authentication
title_sort l3am: linear adaptive additive angular margin loss for video-based hand gesture authentication
publishDate 2024
url https://hdl.handle.net/10356/178949
_version_ 1806059776915800064