Image processing using artificial intelligence

Human pose estimation is an important part of computer vision that determines the positions and orientations of a human body in 2D or 3D images and videos. This project explores the application of Artificial Intelligence (AI) techniques for 3D HPE, specifically leveraging the MixSTE: Seq2seq Mixed S...

Full description

Saved in:
Bibliographic Details
Main Author: Yeo, Clement
Other Authors: Yap Kim Hui
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181618
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181618
record_format dspace
spelling sg-ntu-dr.10356-1816182024-12-13T15:45:02Z Image processing using artificial intelligence Yeo, Clement Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Engineering Human pose estimation is an important part of computer vision that determines the positions and orientations of a human body in 2D or 3D images and videos. This project explores the application of Artificial Intelligence (AI) techniques for 3D HPE, specifically leveraging the MixSTE: Seq2seq Mixed Spatio-Temporal Encoder, used to estimate poses from video sequences. MixSTE combines spatial and temporal feature extraction to accurately predict human poses by modeling complex body dynamics over time. The main goal of this work is to develop and assess MixSTE for human pose estimation in videos, focusing on enhancing the accuracy and reliability of pose predictions, even in challenging conditions involving occlusions and diverse body movements. The proposed system uses a sequence-to-sequence (seq2seq) architecture to effectively encode and decode spatial and temporal information, providing a significant advancement over existing methods that often struggle with temporal inconsistencies. The experiments were performed on benchmark datasets like Human3.6M, and the results indicate that the proposed approach achieves high accuracy in 3D pose estimation, outperforming several state-of-the-art methods in terms of Mean Per Joint Position Error (MPJPE). This work demonstrates the potential of MixSTE for real-world applications, including activity recognition, human-computer interaction, and animation, contributing to the broader field of AI-driven human motion analysis. Bachelor's degree 2024-12-11T06:19:13Z 2024-12-11T06:19:13Z 2024 Final Year Project (FYP) Yeo, C. (2024). Image processing using artificial intelligence. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181618 https://hdl.handle.net/10356/181618 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
spellingShingle Engineering
Yeo, Clement
Image processing using artificial intelligence
description Human pose estimation is an important part of computer vision that determines the positions and orientations of a human body in 2D or 3D images and videos. This project explores the application of Artificial Intelligence (AI) techniques for 3D HPE, specifically leveraging the MixSTE: Seq2seq Mixed Spatio-Temporal Encoder, used to estimate poses from video sequences. MixSTE combines spatial and temporal feature extraction to accurately predict human poses by modeling complex body dynamics over time. The main goal of this work is to develop and assess MixSTE for human pose estimation in videos, focusing on enhancing the accuracy and reliability of pose predictions, even in challenging conditions involving occlusions and diverse body movements. The proposed system uses a sequence-to-sequence (seq2seq) architecture to effectively encode and decode spatial and temporal information, providing a significant advancement over existing methods that often struggle with temporal inconsistencies. The experiments were performed on benchmark datasets like Human3.6M, and the results indicate that the proposed approach achieves high accuracy in 3D pose estimation, outperforming several state-of-the-art methods in terms of Mean Per Joint Position Error (MPJPE). This work demonstrates the potential of MixSTE for real-world applications, including activity recognition, human-computer interaction, and animation, contributing to the broader field of AI-driven human motion analysis.
author2 Yap Kim Hui
author_facet Yap Kim Hui
Yeo, Clement
format Final Year Project
author Yeo, Clement
author_sort Yeo, Clement
title Image processing using artificial intelligence
title_short Image processing using artificial intelligence
title_full Image processing using artificial intelligence
title_fullStr Image processing using artificial intelligence
title_full_unstemmed Image processing using artificial intelligence
title_sort image processing using artificial intelligence
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/181618
_version_ 1819112983273406464