CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration

Image-to-point cloud (I2P) registration is a fundamental task for robots and autonomous vehicles to achieve cross-modality data fusion and localization. Current I2P registration methods primarily focus on estimating correspondences at the point or pixel level, often neglecting global alignment. As a...

Full description

Saved in:

Bibliographic Details
Main Authors:	Kang, Shuhao, Liao, Youqi, Li, Jianping, Liang, Fuxun, Li, Yuhao, Zou, Xianghong, Li, Fangning, Chen, Xieyuanli, Dong, Zhen, Yang, Bisheng
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2025
Subjects:	Engineering Coarse-to-fine correspondences Transformer network
Online Access:	https://hdl.handle.net/10356/182367
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-182367
record_format	dspace
spelling	sg-ntu-dr.10356-1823672025-01-27T01:42:02Z CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration Kang, Shuhao Liao, Youqi Li, Jianping Liang, Fuxun Li, Yuhao Zou, Xianghong Li, Fangning Chen, Xieyuanli Dong, Zhen Yang, Bisheng School of Electrical and Electronic Engineering Delta-NTU Corporate Laboratory Engineering Coarse-to-fine correspondences Transformer network Image-to-point cloud (I2P) registration is a fundamental task for robots and autonomous vehicles to achieve cross-modality data fusion and localization. Current I2P registration methods primarily focus on estimating correspondences at the point or pixel level, often neglecting global alignment. As a result, I2P matching can easily converge to a local optimum if it lacks high-level guidance from global constraints. To improve the success rate and general robustness, this letter introduces CoFiI2P, a novel I2P registration network that extracts correspondences in a coarse-to-fine manner. First, the image and point cloud data are processed through a two-stream encoder-decoder network for hierarchical feature extraction. Second, a coarse-to-fine matching module is designed to leverage these features and establish robust feature correspondences. Specifically, in the coarse matching phase, a novel I2P transformer module is employed to capture both homogeneous and heterogeneous global information from the image and point cloud data. This enables the estimation of coarse super-point/super-pixel matching pairs with discriminative descriptors. In the fine matching module, point/pixel pairs are established with the guidance of super-point/super-pixel correspondences. Finally, based on matching pairs, the transformation matrix is estimated with the EPnP-RANSAC algorithm. Experiments conducted on the KITTI Odometry dataset demonstrate that CoFiI2P achieves impressive results, with a relative rotation error (RRE) of 1.14 degrees and a relative translation error (RTE) of 0.29 meters, while maintaining real-time speed. These results represent a significant improvement of 84% in RRE and 89% in RTE compared to the current state-of-the-art (SOTA) method. Additional experiments on the Nuscenes dataset confirm our method's generalizability. This work was supported in part by the National Natural Science Foundation under Project 42201477 and Project 42130105, in part by the Open Fund of Hubei Luojia Laboratory under Grant 2201000054, and in part by Open Fund of Key Laboratory of Urban Spatial Information, Ministry of Natural Resources under Grant 2023ZD001. 2025-01-27T01:42:02Z 2025-01-27T01:42:02Z 2024 Journal Article Kang, S., Liao, Y., Li, J., Liang, F., Li, Y., Zou, X., Li, F., Chen, X., Dong, Z. & Yang, B. (2024). CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration. IEEE Robotics and Automation Letters, 9(11), 10264-10271. https://dx.doi.org/10.1109/LRA.2024.3466068 2377-3766 https://hdl.handle.net/10356/182367 10.1109/LRA.2024.3466068 2-s2.0-85205519040 11 9 10264 10271 en IEEE Robotics and Automation Letters © 2024 IEEE. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering Coarse-to-fine correspondences Transformer network
spellingShingle	Engineering Coarse-to-fine correspondences Transformer network Kang, Shuhao Liao, Youqi Li, Jianping Liang, Fuxun Li, Yuhao Zou, Xianghong Li, Fangning Chen, Xieyuanli Dong, Zhen Yang, Bisheng CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration
description	Image-to-point cloud (I2P) registration is a fundamental task for robots and autonomous vehicles to achieve cross-modality data fusion and localization. Current I2P registration methods primarily focus on estimating correspondences at the point or pixel level, often neglecting global alignment. As a result, I2P matching can easily converge to a local optimum if it lacks high-level guidance from global constraints. To improve the success rate and general robustness, this letter introduces CoFiI2P, a novel I2P registration network that extracts correspondences in a coarse-to-fine manner. First, the image and point cloud data are processed through a two-stream encoder-decoder network for hierarchical feature extraction. Second, a coarse-to-fine matching module is designed to leverage these features and establish robust feature correspondences. Specifically, in the coarse matching phase, a novel I2P transformer module is employed to capture both homogeneous and heterogeneous global information from the image and point cloud data. This enables the estimation of coarse super-point/super-pixel matching pairs with discriminative descriptors. In the fine matching module, point/pixel pairs are established with the guidance of super-point/super-pixel correspondences. Finally, based on matching pairs, the transformation matrix is estimated with the EPnP-RANSAC algorithm. Experiments conducted on the KITTI Odometry dataset demonstrate that CoFiI2P achieves impressive results, with a relative rotation error (RRE) of 1.14 degrees and a relative translation error (RTE) of 0.29 meters, while maintaining real-time speed. These results represent a significant improvement of 84% in RRE and 89% in RTE compared to the current state-of-the-art (SOTA) method. Additional experiments on the Nuscenes dataset confirm our method's generalizability.
author2	School of Electrical and Electronic Engineering
author_facet	School of Electrical and Electronic Engineering Kang, Shuhao Liao, Youqi Li, Jianping Liang, Fuxun Li, Yuhao Zou, Xianghong Li, Fangning Chen, Xieyuanli Dong, Zhen Yang, Bisheng
format	Article
author	Kang, Shuhao Liao, Youqi Li, Jianping Liang, Fuxun Li, Yuhao Zou, Xianghong Li, Fangning Chen, Xieyuanli Dong, Zhen Yang, Bisheng
author_sort	Kang, Shuhao
title	CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration
title_short	CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration
title_full	CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration
title_fullStr	CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration
title_full_unstemmed	CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration
title_sort	cofii2p: coarse-to-fine correspondences-based image to point cloud registration
publishDate	2025
url	https://hdl.handle.net/10356/182367
_version_	1823108706844278784

CoFiI2P: coarse-to-fine correspondences-based image to point cloud registration

Similar Items