Retrieval-augmented human motion generation with diffusion model
Human motion generation is a crucial area of research with the potential to bring lifelike characters and movements to various applications, enhancing user engagement and immersion. However, the intricacy and diversity of human movements, the scarcity of motion data, the difficulty of incorporating...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/167733 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-167733 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1677332023-07-07T15:44:26Z Retrieval-augmented human motion generation with diffusion model Guo, Xinying Liu Ziwei Wen Bihan School of Electrical and Electronic Engineering ziwei.liu@ntu.edu.sg, bihan.wen@ntu.edu.sg Engineering::Computer science and engineering Human motion generation is a crucial area of research with the potential to bring lifelike characters and movements to various applications, enhancing user engagement and immersion. However, the intricacy and diversity of human movements, the scarcity of motion data, the difficulty of incorporating human-like traits, and human’s heightened sensitivity to body movements pose persistent challenges in generating plausible human motions. The aforementioned problems have led to a surge in human motion generation model development in recent years, with text-driven motion generation being particularly popular due to its user-friendly nature. However, current text-driven generative approaches suffer from either poor quality or limitations in generalizability and expressiveness. To overcome these challenges, this project draws inspiration from successful diffusion models and retrieval techniques in related fields, and proposes ReMoDiffuse, an efficient diffusion-model-based text-driven motion generation framework complementing with a novel retrieval strategy. Specifically, ReMoDiffuse utilizes a diffusion model and integrates a multi-modality retrieval database to refine the denoising process. The results of extensive experiments demonstrate that the proposed method achieves superior performance in terms of quality, generalizability, and expressiveness. Bachelor of Engineering (Information Engineering and Media) 2023-06-05T03:58:56Z 2023-06-05T03:58:56Z 2023 Final Year Project (FYP) Guo, X. (2023). Retrieval-augmented human motion generation with diffusion model. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167733 https://hdl.handle.net/10356/167733 en B3247-221 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Guo, Xinying Retrieval-augmented human motion generation with diffusion model |
description |
Human motion generation is a crucial area of research with the potential to bring lifelike characters and movements to various applications, enhancing user engagement and immersion. However, the intricacy and diversity of human movements, the scarcity of motion data, the difficulty of incorporating human-like traits, and human’s heightened sensitivity to body movements pose persistent challenges in generating plausible human motions. The aforementioned problems have led to a surge in human motion generation model development in recent years, with text-driven motion generation being particularly popular due to its user-friendly nature. However, current text-driven generative approaches suffer from either poor quality or limitations in generalizability and expressiveness. To overcome these challenges, this project draws inspiration from successful diffusion models and retrieval techniques in related fields, and proposes ReMoDiffuse, an efficient diffusion-model-based text-driven motion generation framework complementing with a novel retrieval strategy. Specifically, ReMoDiffuse utilizes a diffusion model and integrates a multi-modality retrieval database to refine the denoising process. The results of extensive experiments demonstrate that the proposed method achieves superior performance in terms of quality, generalizability, and expressiveness. |
author2 |
Liu Ziwei |
author_facet |
Liu Ziwei Guo, Xinying |
format |
Final Year Project |
author |
Guo, Xinying |
author_sort |
Guo, Xinying |
title |
Retrieval-augmented human motion generation with diffusion model |
title_short |
Retrieval-augmented human motion generation with diffusion model |
title_full |
Retrieval-augmented human motion generation with diffusion model |
title_fullStr |
Retrieval-augmented human motion generation with diffusion model |
title_full_unstemmed |
Retrieval-augmented human motion generation with diffusion model |
title_sort |
retrieval-augmented human motion generation with diffusion model |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/167733 |
_version_ |
1772825924380655616 |