Human pose estimation using artificial intelligence

Human Pose Estimation (HPE), is recently a popular research topic in computer vision field. The object of this task is to generate a skeleton-like representation of a human body and then process it further for task-specific applications. Due to the outstanding performance of vision transformer in ot...

Full description

Saved in:
Bibliographic Details
Main Author: Dai, Yue
Other Authors: Yap Kim Hui
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/159563
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-159563
record_format dspace
spelling sg-ntu-dr.10356-1595632022-06-27T06:13:42Z Human pose estimation using artificial intelligence Dai, Yue Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Human Pose Estimation (HPE), is recently a popular research topic in computer vision field. The object of this task is to generate a skeleton-like representation of a human body and then process it further for task-specific applications. Due to the outstanding performance of vision transformer in other Computer Vision (CV) tasks, its application to HPE problems and ability to surpass Convolutional Neural Network (CNN) models is gaining attention. Apart from the deep learning model, data processing in HPE cannot be ignored. Through the coordinate system transformation in common HPE solutions, the accuracy degrades when flipping strategy is applied in inference. The bias is derived from using resolution measured in pixels instead of size measured in unit length when performing resizing transformation. In this dissertation, TransPose, one state-of-the-art model introducing transformer for HPE is studied. The model is trained and tested on COCO benchmark. The modified TransPose model with unbiased approach is experimented as well. Based on the results, the dependency function of attention map is analyzed. The insights gained from this work provides valuable insight for future exploration of transformer applications and data processing in human pose estimation. Master of Science (Signal Processing) 2022-06-27T06:13:41Z 2022-06-27T06:13:41Z 2022 Thesis-Master by Coursework Dai, Y. (2022). Human pose estimation using artificial intelligence. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/159563 https://hdl.handle.net/10356/159563 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Dai, Yue
Human pose estimation using artificial intelligence
description Human Pose Estimation (HPE), is recently a popular research topic in computer vision field. The object of this task is to generate a skeleton-like representation of a human body and then process it further for task-specific applications. Due to the outstanding performance of vision transformer in other Computer Vision (CV) tasks, its application to HPE problems and ability to surpass Convolutional Neural Network (CNN) models is gaining attention. Apart from the deep learning model, data processing in HPE cannot be ignored. Through the coordinate system transformation in common HPE solutions, the accuracy degrades when flipping strategy is applied in inference. The bias is derived from using resolution measured in pixels instead of size measured in unit length when performing resizing transformation. In this dissertation, TransPose, one state-of-the-art model introducing transformer for HPE is studied. The model is trained and tested on COCO benchmark. The modified TransPose model with unbiased approach is experimented as well. Based on the results, the dependency function of attention map is analyzed. The insights gained from this work provides valuable insight for future exploration of transformer applications and data processing in human pose estimation.
author2 Yap Kim Hui
author_facet Yap Kim Hui
Dai, Yue
format Thesis-Master by Coursework
author Dai, Yue
author_sort Dai, Yue
title Human pose estimation using artificial intelligence
title_short Human pose estimation using artificial intelligence
title_full Human pose estimation using artificial intelligence
title_fullStr Human pose estimation using artificial intelligence
title_full_unstemmed Human pose estimation using artificial intelligence
title_sort human pose estimation using artificial intelligence
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/159563
_version_ 1736856401083367424