Retrieval-augmented human motion generation with diffusion model

Human motion generation is a crucial area of research with the potential to bring lifelike characters and movements to various applications, enhancing user engagement and immersion. However, the intricacy and diversity of human movements, the scarcity of motion data, the difficulty of incorporating...

Full description

Saved in:
Bibliographic Details
Main Author: Guo, Xinying
Other Authors: Liu Ziwei
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/167733
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-167733
record_format dspace
spelling sg-ntu-dr.10356-1677332023-07-07T15:44:26Z Retrieval-augmented human motion generation with diffusion model Guo, Xinying Liu Ziwei Wen Bihan School of Electrical and Electronic Engineering ziwei.liu@ntu.edu.sg, bihan.wen@ntu.edu.sg Engineering::Computer science and engineering Human motion generation is a crucial area of research with the potential to bring lifelike characters and movements to various applications, enhancing user engagement and immersion. However, the intricacy and diversity of human movements, the scarcity of motion data, the difficulty of incorporating human-like traits, and human’s heightened sensitivity to body movements pose persistent challenges in generating plausible human motions. The aforementioned problems have led to a surge in human motion generation model development in recent years, with text-driven motion generation being particularly popular due to its user-friendly nature. However, current text-driven generative approaches suffer from either poor quality or limitations in generalizability and expressiveness. To overcome these challenges, this project draws inspiration from successful diffusion models and retrieval techniques in related fields, and proposes ReMoDiffuse, an efficient diffusion-model-based text-driven motion generation framework complementing with a novel retrieval strategy. Specifically, ReMoDiffuse utilizes a diffusion model and integrates a multi-modality retrieval database to refine the denoising process. The results of extensive experiments demonstrate that the proposed method achieves superior performance in terms of quality, generalizability, and expressiveness. Bachelor of Engineering (Information Engineering and Media) 2023-06-05T03:58:56Z 2023-06-05T03:58:56Z 2023 Final Year Project (FYP) Guo, X. (2023). Retrieval-augmented human motion generation with diffusion model. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167733 https://hdl.handle.net/10356/167733 en B3247-221 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
spellingShingle Engineering::Computer science and engineering
Guo, Xinying
Retrieval-augmented human motion generation with diffusion model
description Human motion generation is a crucial area of research with the potential to bring lifelike characters and movements to various applications, enhancing user engagement and immersion. However, the intricacy and diversity of human movements, the scarcity of motion data, the difficulty of incorporating human-like traits, and human’s heightened sensitivity to body movements pose persistent challenges in generating plausible human motions. The aforementioned problems have led to a surge in human motion generation model development in recent years, with text-driven motion generation being particularly popular due to its user-friendly nature. However, current text-driven generative approaches suffer from either poor quality or limitations in generalizability and expressiveness. To overcome these challenges, this project draws inspiration from successful diffusion models and retrieval techniques in related fields, and proposes ReMoDiffuse, an efficient diffusion-model-based text-driven motion generation framework complementing with a novel retrieval strategy. Specifically, ReMoDiffuse utilizes a diffusion model and integrates a multi-modality retrieval database to refine the denoising process. The results of extensive experiments demonstrate that the proposed method achieves superior performance in terms of quality, generalizability, and expressiveness.
author2 Liu Ziwei
author_facet Liu Ziwei
Guo, Xinying
format Final Year Project
author Guo, Xinying
author_sort Guo, Xinying
title Retrieval-augmented human motion generation with diffusion model
title_short Retrieval-augmented human motion generation with diffusion model
title_full Retrieval-augmented human motion generation with diffusion model
title_fullStr Retrieval-augmented human motion generation with diffusion model
title_full_unstemmed Retrieval-augmented human motion generation with diffusion model
title_sort retrieval-augmented human motion generation with diffusion model
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/167733
_version_ 1772825924380655616