Retrieval-augmented human motion generation with diffusion model

Human motion generation is a crucial area of research with the potential to bring lifelike characters and movements to various applications, enhancing user engagement and immersion. However, the intricacy and diversity of human movements, the scarcity of motion data, the difficulty of incorporating...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Guo, Xinying
مؤلفون آخرون:	Liu Ziwei
التنسيق:	Final Year Project
اللغة:	English
منشور في:	Nanyang Technological University 2023
الموضوعات:	Engineering::Computer science and engineering
الوصول للمادة أونلاين:	https://hdl.handle.net/10356/167733
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

id	sg-ntu-dr.10356-167733
record_format	dspace
spelling	sg-ntu-dr.10356-1677332023-07-07T15:44:26Z Retrieval-augmented human motion generation with diffusion model Guo, Xinying Liu Ziwei Wen Bihan School of Electrical and Electronic Engineering ziwei.liu@ntu.edu.sg, bihan.wen@ntu.edu.sg Engineering::Computer science and engineering Human motion generation is a crucial area of research with the potential to bring lifelike characters and movements to various applications, enhancing user engagement and immersion. However, the intricacy and diversity of human movements, the scarcity of motion data, the difficulty of incorporating human-like traits, and human’s heightened sensitivity to body movements pose persistent challenges in generating plausible human motions. The aforementioned problems have led to a surge in human motion generation model development in recent years, with text-driven motion generation being particularly popular due to its user-friendly nature. However, current text-driven generative approaches suffer from either poor quality or limitations in generalizability and expressiveness. To overcome these challenges, this project draws inspiration from successful diffusion models and retrieval techniques in related fields, and proposes ReMoDiffuse, an efficient diffusion-model-based text-driven motion generation framework complementing with a novel retrieval strategy. Specifically, ReMoDiffuse utilizes a diffusion model and integrates a multi-modality retrieval database to refine the denoising process. The results of extensive experiments demonstrate that the proposed method achieves superior performance in terms of quality, generalizability, and expressiveness. Bachelor of Engineering (Information Engineering and Media) 2023-06-05T03:58:56Z 2023-06-05T03:58:56Z 2023 Final Year Project (FYP) Guo, X. (2023). Retrieval-augmented human motion generation with diffusion model. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167733 https://hdl.handle.net/10356/167733 en B3247-221 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering
spellingShingle	Engineering::Computer science and engineering Guo, Xinying Retrieval-augmented human motion generation with diffusion model
description	Human motion generation is a crucial area of research with the potential to bring lifelike characters and movements to various applications, enhancing user engagement and immersion. However, the intricacy and diversity of human movements, the scarcity of motion data, the difficulty of incorporating human-like traits, and human’s heightened sensitivity to body movements pose persistent challenges in generating plausible human motions. The aforementioned problems have led to a surge in human motion generation model development in recent years, with text-driven motion generation being particularly popular due to its user-friendly nature. However, current text-driven generative approaches suffer from either poor quality or limitations in generalizability and expressiveness. To overcome these challenges, this project draws inspiration from successful diffusion models and retrieval techniques in related fields, and proposes ReMoDiffuse, an efficient diffusion-model-based text-driven motion generation framework complementing with a novel retrieval strategy. Specifically, ReMoDiffuse utilizes a diffusion model and integrates a multi-modality retrieval database to refine the denoising process. The results of extensive experiments demonstrate that the proposed method achieves superior performance in terms of quality, generalizability, and expressiveness.
author2	Liu Ziwei
author_facet	Liu Ziwei Guo, Xinying
format	Final Year Project
author	Guo, Xinying
author_sort	Guo, Xinying
title	Retrieval-augmented human motion generation with diffusion model
title_short	Retrieval-augmented human motion generation with diffusion model
title_full	Retrieval-augmented human motion generation with diffusion model
title_fullStr	Retrieval-augmented human motion generation with diffusion model
title_full_unstemmed	Retrieval-augmented human motion generation with diffusion model
title_sort	retrieval-augmented human motion generation with diffusion model
publisher	Nanyang Technological University
publishDate	2023
url	https://hdl.handle.net/10356/167733
_version_	1772825924380655616

Retrieval-augmented human motion generation with diffusion model

مواد مشابهة