Diffusion model Is a good pose estimator from 3D RF-vision

Human pose estimation (HPE) from Radio Frequency vision (RF-vision) performs human sensing using RF signals that penetrate obstacles without revealing privacy (e.g., facial information). Recently, mmWave radar has emerged as a promising RF-vision sensor, providing radar point clouds by processing RF...

Full description

Saved in:
Bibliographic Details
Main Authors: Fan, Junqiao, Yang, Jianfei, Xu, Yuecong, Xie, Lihua
Other Authors: School of Electrical and Electronic Engineering
Format: Conference or Workshop Item
Language:English
Published: 2025
Subjects:
Online Access:https://hdl.handle.net/10356/181964
https://eccv.ecva.net/
https://link.springer.com/chapter/10.1007/978-3-031-72640-8_1
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181964
record_format dspace
spelling sg-ntu-dr.10356-1819642025-01-06T06:23:15Z Diffusion model Is a good pose estimator from 3D RF-vision Fan, Junqiao Yang, Jianfei Xu, Yuecong Xie, Lihua School of Electrical and Electronic Engineering School of Mechanical and Aerospace Engineering 2024 European Conference on Computer Vision (ECCV) Engineering Human pose estimation (HPE) from Radio Frequency vision (RF-vision) performs human sensing using RF signals that penetrate obstacles without revealing privacy (e.g., facial information). Recently, mmWave radar has emerged as a promising RF-vision sensor, providing radar point clouds by processing RF signals. However, the mmWave radar has a limited resolution with severe noise, leading to inaccurate and inconsistent human pose estimation. This work proposes mmDiff, a novel diffusion-based pose estimator tailored for noisy radar data. Our approach aims to provide reliable guidance as conditions to diffusion models. Two key challenges are addressed by mmDiff: (1) miss-detection of parts of human bodies, which is addressed by a module that isolates feature extraction from different body parts, and (2) signal inconsistency due to environmental interference, which is tackled by incorporating prior knowledge of body structure and motion. Several modules are designed to achieve these goals, whose features work as the conditions for the subsequent diffusion model, eliminating the miss-detection and instability of HPE based on RF-vision. Extensive experiments demonstrate that mmDiff outperforms existing methods significantly, achieving state-of-the-art performances on public datasets. (The project page of mmDiff is https://fanjunqiao.github.io/mmDiff-site/). Ministry of Education (MOE) National Research Foundation (NRF) This research is supported by the National Research Foundation of Singapore under its Medium-Sized Center for Adavnced Robotics Technology Innovation, and Ministry of Education of Singapore under ACRF Tier 1 Grant RG 64/23. 2025-01-06T06:23:15Z 2025-01-06T06:23:15Z 2024 Conference Paper Fan, J., Yang, J., Xu, Y. & Xie, L. (2024). Diffusion model Is a good pose estimator from 3D RF-vision. 2024 European Conference on Computer Vision (ECCV). https://dx.doi.org/10.1007/978-3-031-72640-8_1 9783031726392 https://hdl.handle.net/10356/181964 10.1007/978-3-031-72640-8_1 2-s2.0-85209813823 https://eccv.ecva.net/ https://link.springer.com/chapter/10.1007/978-3-031-72640-8_1 en RG 64/23 © 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
spellingShingle Engineering
Fan, Junqiao
Yang, Jianfei
Xu, Yuecong
Xie, Lihua
Diffusion model Is a good pose estimator from 3D RF-vision
description Human pose estimation (HPE) from Radio Frequency vision (RF-vision) performs human sensing using RF signals that penetrate obstacles without revealing privacy (e.g., facial information). Recently, mmWave radar has emerged as a promising RF-vision sensor, providing radar point clouds by processing RF signals. However, the mmWave radar has a limited resolution with severe noise, leading to inaccurate and inconsistent human pose estimation. This work proposes mmDiff, a novel diffusion-based pose estimator tailored for noisy radar data. Our approach aims to provide reliable guidance as conditions to diffusion models. Two key challenges are addressed by mmDiff: (1) miss-detection of parts of human bodies, which is addressed by a module that isolates feature extraction from different body parts, and (2) signal inconsistency due to environmental interference, which is tackled by incorporating prior knowledge of body structure and motion. Several modules are designed to achieve these goals, whose features work as the conditions for the subsequent diffusion model, eliminating the miss-detection and instability of HPE based on RF-vision. Extensive experiments demonstrate that mmDiff outperforms existing methods significantly, achieving state-of-the-art performances on public datasets. (The project page of mmDiff is https://fanjunqiao.github.io/mmDiff-site/).
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Fan, Junqiao
Yang, Jianfei
Xu, Yuecong
Xie, Lihua
format Conference or Workshop Item
author Fan, Junqiao
Yang, Jianfei
Xu, Yuecong
Xie, Lihua
author_sort Fan, Junqiao
title Diffusion model Is a good pose estimator from 3D RF-vision
title_short Diffusion model Is a good pose estimator from 3D RF-vision
title_full Diffusion model Is a good pose estimator from 3D RF-vision
title_fullStr Diffusion model Is a good pose estimator from 3D RF-vision
title_full_unstemmed Diffusion model Is a good pose estimator from 3D RF-vision
title_sort diffusion model is a good pose estimator from 3d rf-vision
publishDate 2025
url https://hdl.handle.net/10356/181964
https://eccv.ecva.net/
https://link.springer.com/chapter/10.1007/978-3-031-72640-8_1
_version_ 1821237143706533888