Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks

Despite great progress in 3D pose estimation from single-view images or videos, it remains a challenging task due to the substantial depth ambiguity and severe selfocclusions. Motivated by the effectiveness of incorporating spatial dependencies and temporal consistencies to alleviate these issu...

Full description

Saved in:
Bibliographic Details
Main Authors: Cai, Yujun, Ge, Liuhao, Liu, Jun, Cai, Jianfei, Cham, Tat-Jen, Yuan, Junsong, Thalmann, Nadia Magnenat
Other Authors: School of Computer Science and Engineering
Format: Conference or Workshop Item
Language:English
Published: 2019
Subjects:
Online Access:https://hdl.handle.net/10356/86102
http://hdl.handle.net/10220/49902
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-86102
record_format dspace
spelling sg-ntu-dr.10356-861022020-11-01T04:44:00Z Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks Cai, Yujun Ge, Liuhao Liu, Jun Cai, Jianfei Cham, Tat-Jen Yuan, Junsong Thalmann, Nadia Magnenat School of Computer Science and Engineering School of Electrical and Electronic Engineering Interdisciplinary Graduate School (IGS) 2019 IEEE International Conference on Computer Vision (ICCV 19) Institute for Media Innovation (IMI) 3D Pose Estimation Graph Convolutional Neural Network (GCN) Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Despite great progress in 3D pose estimation from single-view images or videos, it remains a challenging task due to the substantial depth ambiguity and severe selfocclusions. Motivated by the effectiveness of incorporating spatial dependencies and temporal consistencies to alleviate these issues, we propose a novel graph-based method to tackle the problem of 3D human body and 3D hand pose estimation from a short sequence of 2D joint detections. Particularly, domain knowledge about the human hand (body) configurations is explicitly incorporated into the graph convolutional operations to meet the specific demand of the 3D pose estimation. Furthermore, we introduce a local-to-global network architecture, which is capable of learning multi-scale features for the graph-based representations. We evaluate the proposed method on challenging benchmark datasets for both 3D hand pose estimation and 3D body pose estimation. Experimental results show that our method achieves state-of-the-art performance on both tasks. Accepted version 2019-09-09T07:25:14Z 2019-12-06T16:16:04Z 2019-09-09T07:25:14Z 2019-12-06T16:16:04Z 2019 Conference Paper Cai, Y., Ge, L., Liu, J., Cai, J., Cham, T.-J., Yuan, J., & Thalmann, N. M. (2019). Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 2272-2281. doi:10.1109/ICCV.2019.00236 https://hdl.handle.net/10356/86102 http://hdl.handle.net/10220/49902 10.1109/ICCV.2019.00236 2272 2281 en © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/ICCV.2019.00236 10 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic 3D Pose Estimation
Graph Convolutional Neural Network (GCN)
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle 3D Pose Estimation
Graph Convolutional Neural Network (GCN)
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Cai, Yujun
Ge, Liuhao
Liu, Jun
Cai, Jianfei
Cham, Tat-Jen
Yuan, Junsong
Thalmann, Nadia Magnenat
Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks
description Despite great progress in 3D pose estimation from single-view images or videos, it remains a challenging task due to the substantial depth ambiguity and severe selfocclusions. Motivated by the effectiveness of incorporating spatial dependencies and temporal consistencies to alleviate these issues, we propose a novel graph-based method to tackle the problem of 3D human body and 3D hand pose estimation from a short sequence of 2D joint detections. Particularly, domain knowledge about the human hand (body) configurations is explicitly incorporated into the graph convolutional operations to meet the specific demand of the 3D pose estimation. Furthermore, we introduce a local-to-global network architecture, which is capable of learning multi-scale features for the graph-based representations. We evaluate the proposed method on challenging benchmark datasets for both 3D hand pose estimation and 3D body pose estimation. Experimental results show that our method achieves state-of-the-art performance on both tasks.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Cai, Yujun
Ge, Liuhao
Liu, Jun
Cai, Jianfei
Cham, Tat-Jen
Yuan, Junsong
Thalmann, Nadia Magnenat
format Conference or Workshop Item
author Cai, Yujun
Ge, Liuhao
Liu, Jun
Cai, Jianfei
Cham, Tat-Jen
Yuan, Junsong
Thalmann, Nadia Magnenat
author_sort Cai, Yujun
title Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks
title_short Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks
title_full Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks
title_fullStr Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks
title_full_unstemmed Exploiting spatial-temporal relationships for 3D pose estimation via graph convolutional networks
title_sort exploiting spatial-temporal relationships for 3d pose estimation via graph convolutional networks
publishDate 2019
url https://hdl.handle.net/10356/86102
http://hdl.handle.net/10220/49902
_version_ 1683494591278350336