Diffusion model Is a good pose estimator from 3D RF-vision
Human pose estimation (HPE) from Radio Frequency vision (RF-vision) performs human sensing using RF signals that penetrate obstacles without revealing privacy (e.g., facial information). Recently, mmWave radar has emerged as a promising RF-vision sensor, providing radar point clouds by processing RF...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2025
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/181964 https://eccv.ecva.net/ https://link.springer.com/chapter/10.1007/978-3-031-72640-8_1 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-181964 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1819642025-01-06T06:23:15Z Diffusion model Is a good pose estimator from 3D RF-vision Fan, Junqiao Yang, Jianfei Xu, Yuecong Xie, Lihua School of Electrical and Electronic Engineering School of Mechanical and Aerospace Engineering 2024 European Conference on Computer Vision (ECCV) Engineering Human pose estimation (HPE) from Radio Frequency vision (RF-vision) performs human sensing using RF signals that penetrate obstacles without revealing privacy (e.g., facial information). Recently, mmWave radar has emerged as a promising RF-vision sensor, providing radar point clouds by processing RF signals. However, the mmWave radar has a limited resolution with severe noise, leading to inaccurate and inconsistent human pose estimation. This work proposes mmDiff, a novel diffusion-based pose estimator tailored for noisy radar data. Our approach aims to provide reliable guidance as conditions to diffusion models. Two key challenges are addressed by mmDiff: (1) miss-detection of parts of human bodies, which is addressed by a module that isolates feature extraction from different body parts, and (2) signal inconsistency due to environmental interference, which is tackled by incorporating prior knowledge of body structure and motion. Several modules are designed to achieve these goals, whose features work as the conditions for the subsequent diffusion model, eliminating the miss-detection and instability of HPE based on RF-vision. Extensive experiments demonstrate that mmDiff outperforms existing methods significantly, achieving state-of-the-art performances on public datasets. (The project page of mmDiff is https://fanjunqiao.github.io/mmDiff-site/). Ministry of Education (MOE) National Research Foundation (NRF) This research is supported by the National Research Foundation of Singapore under its Medium-Sized Center for Adavnced Robotics Technology Innovation, and Ministry of Education of Singapore under ACRF Tier 1 Grant RG 64/23. 2025-01-06T06:23:15Z 2025-01-06T06:23:15Z 2024 Conference Paper Fan, J., Yang, J., Xu, Y. & Xie, L. (2024). Diffusion model Is a good pose estimator from 3D RF-vision. 2024 European Conference on Computer Vision (ECCV). https://dx.doi.org/10.1007/978-3-031-72640-8_1 9783031726392 https://hdl.handle.net/10356/181964 10.1007/978-3-031-72640-8_1 2-s2.0-85209813823 https://eccv.ecva.net/ https://link.springer.com/chapter/10.1007/978-3-031-72640-8_1 en RG 64/23 © 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering |
spellingShingle |
Engineering Fan, Junqiao Yang, Jianfei Xu, Yuecong Xie, Lihua Diffusion model Is a good pose estimator from 3D RF-vision |
description |
Human pose estimation (HPE) from Radio Frequency vision (RF-vision) performs human sensing using RF signals that penetrate obstacles without revealing privacy (e.g., facial information). Recently, mmWave radar has emerged as a promising RF-vision sensor, providing radar point clouds by processing RF signals. However, the mmWave radar has a limited resolution with severe noise, leading to inaccurate and inconsistent human pose estimation. This work proposes mmDiff, a novel diffusion-based pose estimator tailored for noisy radar data. Our approach aims to provide reliable guidance as conditions to diffusion models. Two key challenges are addressed by mmDiff: (1) miss-detection of parts of human bodies, which is addressed by a module that isolates feature extraction from different body parts, and (2) signal inconsistency due to environmental interference, which is tackled by incorporating prior knowledge of body structure and motion. Several modules are designed to achieve these goals, whose features work as the conditions for the subsequent diffusion model, eliminating the miss-detection and instability of HPE based on RF-vision. Extensive experiments demonstrate that mmDiff outperforms existing methods significantly, achieving state-of-the-art performances on public datasets. (The project page of mmDiff is https://fanjunqiao.github.io/mmDiff-site/). |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Fan, Junqiao Yang, Jianfei Xu, Yuecong Xie, Lihua |
format |
Conference or Workshop Item |
author |
Fan, Junqiao Yang, Jianfei Xu, Yuecong Xie, Lihua |
author_sort |
Fan, Junqiao |
title |
Diffusion model Is a good pose estimator from 3D RF-vision |
title_short |
Diffusion model Is a good pose estimator from 3D RF-vision |
title_full |
Diffusion model Is a good pose estimator from 3D RF-vision |
title_fullStr |
Diffusion model Is a good pose estimator from 3D RF-vision |
title_full_unstemmed |
Diffusion model Is a good pose estimator from 3D RF-vision |
title_sort |
diffusion model is a good pose estimator from 3d rf-vision |
publishDate |
2025 |
url |
https://hdl.handle.net/10356/181964 https://eccv.ecva.net/ https://link.springer.com/chapter/10.1007/978-3-031-72640-8_1 |
_version_ |
1821237143706533888 |