Reinforcement learning-based method 3D motion planning for a 3D inspection task using prior viewpoint information
Surface inspection and shape reconstruction is a common application in the factory production line. In a robotic inspection task, generating an optimal collision-free trajectory while meeting the coverage requirement of the target object is challenging because there are inspection cost and traveling...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/158522 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Surface inspection and shape reconstruction is a common application in the factory production line. In a robotic inspection task, generating an optimal collision-free trajectory while meeting the coverage requirement of the target object is challenging because there are inspection cost and traveling cost that needs to be optimized. This paper is based on previous work that proposed a computational framework for automatic online path generation for robotic inspection via coverage planning and reinforcement learning-based approach. The online processing stage is utilising Monte Carlo Tree Search (MCTS) with the formulation of Markov Decision Process (MDP). However, a proposed visibility modelling and approximation that considers the presence of an obstacle is introduced to tackle the issue of obstructed view of the target object seen from a 3D camera. The proposed method compares the distance traveled by the same camera rays with a minimum threshold in two scenarios: target object with and without an obstacle to obtain a more realistic visibility of the viewpoint. The proposed MCTS in this final year project modifies the tree policy and default policy. The selection process by the tree policy is based on the reward of MCTS instead of the costs. The definitions of inspection cost and traveling cost are also modified to the number of covered surfaces (the uncovered surfaces of the selected viewpoints) of unselected viewpoint, and the trajectory length respectively. Instead of choosing the viewpoints greedily in the default policy, the average of sum of probabilities of the 2 costs is used to choose an action. After the parameters of MCTS are tuned, the proposed MCTS algorithm will be experimented on two scenes with and without obstacle and on various target objects. |
---|