Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training

3D hand estimation from 2D joint information is an essential task in human-machine interaction, which has achieved great progress as an application of deep learning. However, regression-based methods do not perform well because the structural information is not effectively exploited, and the joint c...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zheng, Zhiwei, Hu, Zhongxu, Qin, Hui, Liu, Jie
Other Authors:	School of Mechanical and Aerospace Engineering
Format:	Article
Language:	English
Published:	2023
Subjects:	Engineering::Mechanical engineering 3D Hand Pose Estimation Deep Learning
Online Access:	https://hdl.handle.net/10356/172283
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-172283
record_format	dspace
spelling	sg-ntu-dr.10356-1722832023-12-05T02:34:04Z Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training Zheng, Zhiwei Hu, Zhongxu Qin, Hui Liu, Jie School of Mechanical and Aerospace Engineering Engineering::Mechanical engineering 3D Hand Pose Estimation Deep Learning 3D hand estimation from 2D joint information is an essential task in human-machine interaction, which has achieved great progress as an application of deep learning. However, regression-based methods do not perform well because the structural information is not effectively exploited, and the joint coordinates are variable. To address these issues, the hand pose is represented with bone vectors instead of joint coordinates in this study, which are stabler to learn and allow for easier encoding of the hand geometric structure and joint dependency. A novel graph bone region U-Net is specifically designed for bone representation to learn multiscale structural features, where the proposed novel elements (graph convolution, pooling and unpooling) incorporate hand structural knowledge. Under the introduced “finger-to-hand” framework, the network gradually decreases the scale from bone to finger to hand for learning more meaningful multiscale features. Moreover, the unit network is stacked repeatedly to extract multilevel features. Based on the above network, a simple but effective semi-supervised approach is introduced to address the lack of 3D hand pose labels. Many experiments are conducted to evaluate the proposed approach on two challenging datasets. The experimental results show that the proposed supervised approach outperforms the state-of-the-art methods, and the proposed semi-supervised approach can still achieve favorable performance when the labeled data are scarce. This work is supported by the National Key Research and Development Program of China (2021YFC3200303) and National Natural Science Foundation of China (No. 51979113, 52039004, U1865202). 2023-12-05T02:34:04Z 2023-12-05T02:34:04Z 2023 Journal Article Zheng, Z., Hu, Z., Qin, H. & Liu, J. (2023). Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training. Image and Vision Computing, 134, 104673-. https://dx.doi.org/10.1016/j.imavis.2023.104673 0262-8856 https://hdl.handle.net/10356/172283 10.1016/j.imavis.2023.104673 2-s2.0-85153850402 134 104673 en Image and Vision Computing © 2023 Published by Elsevier B.V. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Mechanical engineering 3D Hand Pose Estimation Deep Learning
spellingShingle	Engineering::Mechanical engineering 3D Hand Pose Estimation Deep Learning Zheng, Zhiwei Hu, Zhongxu Qin, Hui Liu, Jie Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training
description	3D hand estimation from 2D joint information is an essential task in human-machine interaction, which has achieved great progress as an application of deep learning. However, regression-based methods do not perform well because the structural information is not effectively exploited, and the joint coordinates are variable. To address these issues, the hand pose is represented with bone vectors instead of joint coordinates in this study, which are stabler to learn and allow for easier encoding of the hand geometric structure and joint dependency. A novel graph bone region U-Net is specifically designed for bone representation to learn multiscale structural features, where the proposed novel elements (graph convolution, pooling and unpooling) incorporate hand structural knowledge. Under the introduced “finger-to-hand” framework, the network gradually decreases the scale from bone to finger to hand for learning more meaningful multiscale features. Moreover, the unit network is stacked repeatedly to extract multilevel features. Based on the above network, a simple but effective semi-supervised approach is introduced to address the lack of 3D hand pose labels. Many experiments are conducted to evaluate the proposed approach on two challenging datasets. The experimental results show that the proposed supervised approach outperforms the state-of-the-art methods, and the proposed semi-supervised approach can still achieve favorable performance when the labeled data are scarce.
author2	School of Mechanical and Aerospace Engineering
author_facet	School of Mechanical and Aerospace Engineering Zheng, Zhiwei Hu, Zhongxu Qin, Hui Liu, Jie
format	Article
author	Zheng, Zhiwei Hu, Zhongxu Qin, Hui Liu, Jie
author_sort	Zheng, Zhiwei
title	Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training
title_short	Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training
title_full	Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training
title_fullStr	Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training
title_full_unstemmed	Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training
title_sort	stacked graph bone region u-net with bone representation for hand pose estimation and semi-supervised training
publishDate	2023
url	https://hdl.handle.net/10356/172283
_version_	1784855591804469248

Stacked graph bone region U-net with bone representation for hand pose estimation and semi-supervised training

Similar Items