Image processing using artificial intelligence

Human pose estimation is an important part of computer vision that determines the positions and orientations of a human body in 2D or 3D images and videos. This project explores the application of Artificial Intelligence (AI) techniques for 3D HPE, specifically leveraging the MixSTE: Seq2seq Mixed S...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Yeo, Clement
مؤلفون آخرون:	Yap Kim Hui
التنسيق:	Final Year Project
اللغة:	English
منشور في:	Nanyang Technological University 2024
الموضوعات:	Engineering
الوصول للمادة أونلاين:	https://hdl.handle.net/10356/181618
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

id	sg-ntu-dr.10356-181618
record_format	dspace
spelling	sg-ntu-dr.10356-1816182024-12-13T15:45:02Z Image processing using artificial intelligence Yeo, Clement Yap Kim Hui School of Electrical and Electronic Engineering EKHYap@ntu.edu.sg Engineering Human pose estimation is an important part of computer vision that determines the positions and orientations of a human body in 2D or 3D images and videos. This project explores the application of Artificial Intelligence (AI) techniques for 3D HPE, specifically leveraging the MixSTE: Seq2seq Mixed Spatio-Temporal Encoder, used to estimate poses from video sequences. MixSTE combines spatial and temporal feature extraction to accurately predict human poses by modeling complex body dynamics over time. The main goal of this work is to develop and assess MixSTE for human pose estimation in videos, focusing on enhancing the accuracy and reliability of pose predictions, even in challenging conditions involving occlusions and diverse body movements. The proposed system uses a sequence-to-sequence (seq2seq) architecture to effectively encode and decode spatial and temporal information, providing a significant advancement over existing methods that often struggle with temporal inconsistencies. The experiments were performed on benchmark datasets like Human3.6M, and the results indicate that the proposed approach achieves high accuracy in 3D pose estimation, outperforming several state-of-the-art methods in terms of Mean Per Joint Position Error (MPJPE). This work demonstrates the potential of MixSTE for real-world applications, including activity recognition, human-computer interaction, and animation, contributing to the broader field of AI-driven human motion analysis. Bachelor's degree 2024-12-11T06:19:13Z 2024-12-11T06:19:13Z 2024 Final Year Project (FYP) Yeo, C. (2024). Image processing using artificial intelligence. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181618 https://hdl.handle.net/10356/181618 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering
spellingShingle	Engineering Yeo, Clement Image processing using artificial intelligence
description	Human pose estimation is an important part of computer vision that determines the positions and orientations of a human body in 2D or 3D images and videos. This project explores the application of Artificial Intelligence (AI) techniques for 3D HPE, specifically leveraging the MixSTE: Seq2seq Mixed Spatio-Temporal Encoder, used to estimate poses from video sequences. MixSTE combines spatial and temporal feature extraction to accurately predict human poses by modeling complex body dynamics over time. The main goal of this work is to develop and assess MixSTE for human pose estimation in videos, focusing on enhancing the accuracy and reliability of pose predictions, even in challenging conditions involving occlusions and diverse body movements. The proposed system uses a sequence-to-sequence (seq2seq) architecture to effectively encode and decode spatial and temporal information, providing a significant advancement over existing methods that often struggle with temporal inconsistencies. The experiments were performed on benchmark datasets like Human3.6M, and the results indicate that the proposed approach achieves high accuracy in 3D pose estimation, outperforming several state-of-the-art methods in terms of Mean Per Joint Position Error (MPJPE). This work demonstrates the potential of MixSTE for real-world applications, including activity recognition, human-computer interaction, and animation, contributing to the broader field of AI-driven human motion analysis.
author2	Yap Kim Hui
author_facet	Yap Kim Hui Yeo, Clement
format	Final Year Project
author	Yeo, Clement
author_sort	Yeo, Clement
title	Image processing using artificial intelligence
title_short	Image processing using artificial intelligence
title_full	Image processing using artificial intelligence
title_fullStr	Image processing using artificial intelligence
title_full_unstemmed	Image processing using artificial intelligence
title_sort	image processing using artificial intelligence
publisher	Nanyang Technological University
publishDate	2024
url	https://hdl.handle.net/10356/181618
_version_	1819112983273406464

Image processing using artificial intelligence

مواد مشابهة