Investigating pose representations and motion contexts modeling for 3D motion prediction
Predicting human motion from historical pose sequence is crucial for a machine to succeed in intelligent interactions with humans. One aspect that has been obviated so far, is the fact that how we represent the skeletal pose has a critical impact on the prediction results. Yet there is no effort tha...
Saved in:
Main Authors: | , , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/162631 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-162631 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1626312022-11-02T00:42:18Z Investigating pose representations and motion contexts modeling for 3D motion prediction Liu, Zhenguang Wu, Shuang Jin, Shuyuan Liu, Qi Ji, Shouling Lu, Shijian Cheng, Li School of Computer Science and Engineering Engineering::Computer science and engineering Motion Prediction Motion Context Predicting human motion from historical pose sequence is crucial for a machine to succeed in intelligent interactions with humans. One aspect that has been obviated so far, is the fact that how we represent the skeletal pose has a critical impact on the prediction results. Yet there is no effort that investigates across different pose representation schemes. We conduct an indepth study on various pose representations with a focus on their effects on the motion prediction task. Moreover, recent approaches build upon off-the-shelf RNN units for motion prediction. These approaches process input pose sequence sequentially and inherently have difficulties in capturing long-term dependencies. In this paper, we propose a novel RNN architecture termed AHMR for motion prediction which simultaneously models local motion contexts and a global context. We further explore a geodesic loss and a forward kinematics loss, which have more geometric significance than the widely employed L2 loss. Interestingly, we applied our method to a range of articulate objects including human, fish, and mouse. Empirical results show that our approach outperforms the state-of-the-art methods in short-term prediction and achieves much enhanced long-term prediction proficiency, such as retaining natural human-like motions over 50 seconds predictions. Our codes are released. 2022-11-02T00:42:18Z 2022-11-02T00:42:18Z 2022 Journal Article Liu, Z., Wu, S., Jin, S., Liu, Q., Ji, S., Lu, S. & Cheng, L. (2022). Investigating pose representations and motion contexts modeling for 3D motion prediction. IEEE Transactions On Pattern Analysis and Machine Intelligence, 3139918-. https://dx.doi.org/10.1109/TPAMI.2021.3139918 0162-8828 https://hdl.handle.net/10356/162631 10.1109/TPAMI.2021.3139918 34982672 2-s2.0-85122574743 3139918 en IEEE Transactions on Pattern Analysis and Machine Intelligence © 2021 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Motion Prediction Motion Context |
spellingShingle |
Engineering::Computer science and engineering Motion Prediction Motion Context Liu, Zhenguang Wu, Shuang Jin, Shuyuan Liu, Qi Ji, Shouling Lu, Shijian Cheng, Li Investigating pose representations and motion contexts modeling for 3D motion prediction |
description |
Predicting human motion from historical pose sequence is crucial for a machine to succeed in intelligent interactions with humans. One aspect that has been obviated so far, is the fact that how we represent the skeletal pose has a critical impact on the prediction results. Yet there is no effort that investigates across different pose representation schemes. We conduct an indepth study on various pose representations with a focus on their effects on the motion prediction task. Moreover, recent approaches build upon off-the-shelf RNN units for motion prediction. These approaches process input pose sequence sequentially and inherently have difficulties in capturing long-term dependencies. In this paper, we propose a novel RNN architecture termed AHMR for motion prediction which simultaneously models local motion contexts and a global context. We further explore a geodesic loss and a forward kinematics loss, which have more geometric significance than the widely employed L2 loss. Interestingly, we applied our method to a range of articulate objects including human, fish, and mouse. Empirical results show that our approach outperforms the state-of-the-art methods in short-term prediction and achieves much enhanced long-term prediction proficiency, such as retaining natural human-like motions over 50 seconds predictions. Our codes are released. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Liu, Zhenguang Wu, Shuang Jin, Shuyuan Liu, Qi Ji, Shouling Lu, Shijian Cheng, Li |
format |
Article |
author |
Liu, Zhenguang Wu, Shuang Jin, Shuyuan Liu, Qi Ji, Shouling Lu, Shijian Cheng, Li |
author_sort |
Liu, Zhenguang |
title |
Investigating pose representations and motion contexts modeling for 3D motion prediction |
title_short |
Investigating pose representations and motion contexts modeling for 3D motion prediction |
title_full |
Investigating pose representations and motion contexts modeling for 3D motion prediction |
title_fullStr |
Investigating pose representations and motion contexts modeling for 3D motion prediction |
title_full_unstemmed |
Investigating pose representations and motion contexts modeling for 3D motion prediction |
title_sort |
investigating pose representations and motion contexts modeling for 3d motion prediction |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/162631 |
_version_ |
1749179140116840448 |