Human action recognition using machine learning

The accurate detection of human pose is useful in various areas such as surveillance, clothes parsing and game modelling. While many existing models attempt to breach the gap, they suffer from problems including learning uninformative latent variables and being memory- intensive. In this work, we...

全面介紹

Saved in:
書目詳細資料
主要作者: Cao, QingTian
其他作者: Lin Weisi
格式: Final Year Project
語言:English
出版: Nanyang Technological University 2024
主題:
在線閱讀:https://hdl.handle.net/10356/175048
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English
實物特徵
總結:The accurate detection of human pose is useful in various areas such as surveillance, clothes parsing and game modelling. While many existing models attempt to breach the gap, they suffer from problems including learning uninformative latent variables and being memory- intensive. In this work, we propose FFPoser, a diffusion-based model that leverages the use of Fast-Fourier Transformation and a complex-valued neural network to resolve the above issues. It is able to generate fairly realistic poses and produce accurate occlusion recovery results. Additionally, it is easy to train while still be able to learn the important information of the source data.