Recovering 6D object pose and semantic information from RGBD image

The applications of 6-Dimentional (6D) pose estimation plays an important role in today’s technology. The increasing prominence of robotics, automation, and augmented reality necessitates the need for precise and effective 6D pose information to ensure its successful execution. For industrial app...

Full description

Saved in:
Bibliographic Details
Main Author: Lee, Wen Jie
Other Authors: Chen I-Ming
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/167270
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-167270
record_format dspace
spelling sg-ntu-dr.10356-1672702023-05-27T16:50:53Z Recovering 6D object pose and semantic information from RGBD image Lee, Wen Jie Chen I-Ming School of Mechanical and Aerospace Engineering MICHEN@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision The applications of 6-Dimentional (6D) pose estimation plays an important role in today’s technology. The increasing prominence of robotics, automation, and augmented reality necessitates the need for precise and effective 6D pose information to ensure its successful execution. For industrial applications, easily replicable datasets and robust pose estimations are crucial in determining the possible applications for the 6D pose method. The use of color (Red, Green, Blue) and depth information, also known as RGBD images have emerged as a popular source of data for 6D pose estimation. This project explores two popular RGBD pointcloud processing methods for 6D pose estimation: Normalized Object Coordinate Space (NOCS) and Deep Point-wise 3D Keypoints Voting Network (PVN3D). A review of the methodology, results, and dataset requirements regarding the two methods were discussed. Upon further inspecting and comparison of the methods, the replication of the NOCS dataset was found difficult to replicate with the limited knowledge and resources in hand, while PVN3D was found to meet the requirements. The LineMOD dataset which was utilized by PVN3D was shown to be replaceable using the in-house equipment provided. Furthermore, the results obtained from the training and testing of NOCS did not perform up to expectations, while the PVN3D results were acceptable. Overall, this project concludes that PVN3D is a viable 6D pose estimation method for industrial applications. Bachelor of Engineering (Mechanical Engineering) 2023-05-25T06:10:25Z 2023-05-25T06:10:25Z 2023 Final Year Project (FYP) Lee, W. J. (2023). Recovering 6D object pose and semantic information from RGBD image. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167270 https://hdl.handle.net/10356/167270 en B058 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Lee, Wen Jie
Recovering 6D object pose and semantic information from RGBD image
description The applications of 6-Dimentional (6D) pose estimation plays an important role in today’s technology. The increasing prominence of robotics, automation, and augmented reality necessitates the need for precise and effective 6D pose information to ensure its successful execution. For industrial applications, easily replicable datasets and robust pose estimations are crucial in determining the possible applications for the 6D pose method. The use of color (Red, Green, Blue) and depth information, also known as RGBD images have emerged as a popular source of data for 6D pose estimation. This project explores two popular RGBD pointcloud processing methods for 6D pose estimation: Normalized Object Coordinate Space (NOCS) and Deep Point-wise 3D Keypoints Voting Network (PVN3D). A review of the methodology, results, and dataset requirements regarding the two methods were discussed. Upon further inspecting and comparison of the methods, the replication of the NOCS dataset was found difficult to replicate with the limited knowledge and resources in hand, while PVN3D was found to meet the requirements. The LineMOD dataset which was utilized by PVN3D was shown to be replaceable using the in-house equipment provided. Furthermore, the results obtained from the training and testing of NOCS did not perform up to expectations, while the PVN3D results were acceptable. Overall, this project concludes that PVN3D is a viable 6D pose estimation method for industrial applications.
author2 Chen I-Ming
author_facet Chen I-Ming
Lee, Wen Jie
format Final Year Project
author Lee, Wen Jie
author_sort Lee, Wen Jie
title Recovering 6D object pose and semantic information from RGBD image
title_short Recovering 6D object pose and semantic information from RGBD image
title_full Recovering 6D object pose and semantic information from RGBD image
title_fullStr Recovering 6D object pose and semantic information from RGBD image
title_full_unstemmed Recovering 6D object pose and semantic information from RGBD image
title_sort recovering 6d object pose and semantic information from rgbd image
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/167270
_version_ 1772827828015857664