Recovering 6D object pose and semantic information from RGBD image
The applications of 6-Dimentional (6D) pose estimation plays an important role in today’s technology. The increasing prominence of robotics, automation, and augmented reality necessitates the need for precise and effective 6D pose information to ensure its successful execution. For industrial app...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/167270 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-167270 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1672702023-05-27T16:50:53Z Recovering 6D object pose and semantic information from RGBD image Lee, Wen Jie Chen I-Ming School of Mechanical and Aerospace Engineering MICHEN@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision The applications of 6-Dimentional (6D) pose estimation plays an important role in today’s technology. The increasing prominence of robotics, automation, and augmented reality necessitates the need for precise and effective 6D pose information to ensure its successful execution. For industrial applications, easily replicable datasets and robust pose estimations are crucial in determining the possible applications for the 6D pose method. The use of color (Red, Green, Blue) and depth information, also known as RGBD images have emerged as a popular source of data for 6D pose estimation. This project explores two popular RGBD pointcloud processing methods for 6D pose estimation: Normalized Object Coordinate Space (NOCS) and Deep Point-wise 3D Keypoints Voting Network (PVN3D). A review of the methodology, results, and dataset requirements regarding the two methods were discussed. Upon further inspecting and comparison of the methods, the replication of the NOCS dataset was found difficult to replicate with the limited knowledge and resources in hand, while PVN3D was found to meet the requirements. The LineMOD dataset which was utilized by PVN3D was shown to be replaceable using the in-house equipment provided. Furthermore, the results obtained from the training and testing of NOCS did not perform up to expectations, while the PVN3D results were acceptable. Overall, this project concludes that PVN3D is a viable 6D pose estimation method for industrial applications. Bachelor of Engineering (Mechanical Engineering) 2023-05-25T06:10:25Z 2023-05-25T06:10:25Z 2023 Final Year Project (FYP) Lee, W. J. (2023). Recovering 6D object pose and semantic information from RGBD image. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167270 https://hdl.handle.net/10356/167270 en B058 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision Lee, Wen Jie Recovering 6D object pose and semantic information from RGBD image |
description |
The applications of 6-Dimentional (6D) pose estimation plays an important role in
today’s technology. The increasing prominence of robotics, automation, and
augmented reality necessitates the need for precise and effective 6D pose information
to ensure its successful execution. For industrial applications, easily replicable datasets
and robust pose estimations are crucial in determining the possible applications for the
6D pose method. The use of color (Red, Green, Blue) and depth information, also
known as RGBD images have emerged as a popular source of data for 6D pose
estimation. This project explores two popular RGBD pointcloud processing methods
for 6D pose estimation: Normalized Object Coordinate Space (NOCS) and Deep
Point-wise 3D Keypoints Voting Network (PVN3D). A review of the methodology,
results, and dataset requirements regarding the two methods were discussed. Upon
further inspecting and comparison of the methods, the replication of the NOCS dataset
was found difficult to replicate with the limited knowledge and resources in hand,
while PVN3D was found to meet the requirements. The LineMOD dataset which was
utilized by PVN3D was shown to be replaceable using the in-house equipment
provided. Furthermore, the results obtained from the training and testing of NOCS did
not perform up to expectations, while the PVN3D results were acceptable. Overall,
this project concludes that PVN3D is a viable 6D pose estimation method for industrial
applications. |
author2 |
Chen I-Ming |
author_facet |
Chen I-Ming Lee, Wen Jie |
format |
Final Year Project |
author |
Lee, Wen Jie |
author_sort |
Lee, Wen Jie |
title |
Recovering 6D object pose and semantic information from RGBD image |
title_short |
Recovering 6D object pose and semantic information from RGBD image |
title_full |
Recovering 6D object pose and semantic information from RGBD image |
title_fullStr |
Recovering 6D object pose and semantic information from RGBD image |
title_full_unstemmed |
Recovering 6D object pose and semantic information from RGBD image |
title_sort |
recovering 6d object pose and semantic information from rgbd image |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/167270 |
_version_ |
1772827828015857664 |