Human pose estimation using artificial intelligence
Human Pose Estimation (HPE), is recently a popular research topic in computer vision field. The object of this task is to generate a skeleton-like representation of a human body and then process it further for task-specific applications. Due to the outstanding performance of vision transformer in ot...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Coursework |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/159563 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-159563 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1595632022-06-27T06:13:42Z Human pose estimation using artificial intelligence Dai, Yue Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Human Pose Estimation (HPE), is recently a popular research topic in computer vision field. The object of this task is to generate a skeleton-like representation of a human body and then process it further for task-specific applications. Due to the outstanding performance of vision transformer in other Computer Vision (CV) tasks, its application to HPE problems and ability to surpass Convolutional Neural Network (CNN) models is gaining attention. Apart from the deep learning model, data processing in HPE cannot be ignored. Through the coordinate system transformation in common HPE solutions, the accuracy degrades when flipping strategy is applied in inference. The bias is derived from using resolution measured in pixels instead of size measured in unit length when performing resizing transformation. In this dissertation, TransPose, one state-of-the-art model introducing transformer for HPE is studied. The model is trained and tested on COCO benchmark. The modified TransPose model with unbiased approach is experimented as well. Based on the results, the dependency function of attention map is analyzed. The insights gained from this work provides valuable insight for future exploration of transformer applications and data processing in human pose estimation. Master of Science (Signal Processing) 2022-06-27T06:13:41Z 2022-06-27T06:13:41Z 2022 Thesis-Master by Coursework Dai, Y. (2022). Human pose estimation using artificial intelligence. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159563 https://hdl.handle.net/10356/159563 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Dai, Yue Human pose estimation using artificial intelligence |
description |
Human Pose Estimation (HPE), is recently a popular research topic in computer vision field. The object of this task is to generate a skeleton-like representation of a human body and then process it further for task-specific applications. Due to the outstanding performance of vision transformer in other Computer Vision (CV) tasks, its application to HPE problems and ability to surpass Convolutional Neural Network (CNN) models is gaining attention.
Apart from the deep learning model, data processing in HPE cannot be ignored. Through the coordinate system transformation in common HPE solutions, the accuracy degrades when flipping strategy is applied in inference. The bias is derived from using resolution measured in pixels instead of size measured in unit length when performing resizing transformation.
In this dissertation, TransPose, one state-of-the-art model introducing transformer for HPE is studied. The model is trained and tested on COCO benchmark. The modified TransPose model with unbiased approach is experimented as well. Based on the results, the dependency function of attention map is analyzed. The insights gained from this work provides valuable insight for future exploration of transformer applications and data processing in human pose estimation. |
author2 |
Yap Kim Hui |
author_facet |
Yap Kim Hui Dai, Yue |
format |
Thesis-Master by Coursework |
author |
Dai, Yue |
author_sort |
Dai, Yue |
title |
Human pose estimation using artificial intelligence |
title_short |
Human pose estimation using artificial intelligence |
title_full |
Human pose estimation using artificial intelligence |
title_fullStr |
Human pose estimation using artificial intelligence |
title_full_unstemmed |
Human pose estimation using artificial intelligence |
title_sort |
human pose estimation using artificial intelligence |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/159563 |
_version_ |
1736856401083367424 |