UR robot manipulator collision avoidance via reinforcement learning

With the development of intelligent technology, robots start to try to complete more complex tasks, so this requires higher stability of robots to the complex environment, and meanwhile there are new requirements for the self-adaptive ability of robots. Deep reinforcement learning has become a resea...

Full description

Saved in:
Bibliographic Details
Main Author: Ding, Yuxin
Other Authors: Hu Guoqiang
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/152895
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-152895
record_format dspace
spelling sg-ntu-dr.10356-1528952023-07-04T17:40:12Z UR robot manipulator collision avoidance via reinforcement learning Ding, Yuxin Hu Guoqiang School of Electrical and Electronic Engineering GQHu@ntu.edu.sg Engineering::Electrical and electronic engineering::Control and instrumentation::Robotics With the development of intelligent technology, robots start to try to complete more complex tasks, so this requires higher stability of robots to the complex environment, and meanwhile there are new requirements for the self-adaptive ability of robots. Deep reinforcement learning has become a research hot spot recently. This thesis mainly studies the motion planning of multi-dimensional robot manipulators. First, traditional planning methods based on random sampling are explained. Simulations are carried out in ROS to compare effect of four different algorithms. The advantage of sampling based methods is with high searching speed, however, the solution is not optimal, and the path searched each time may be different from the previous. Then the basic idea and methods of reinforcement learning is introduced. Combined reinforcement learning with deep learning, target network and experienced replay buffer are introduced to reduce relevance between sampled data. Based on policy gradient, actor network of DDPG can choose action on continuous intervals. PPO algorithm minimizes the impact of strategy changing on the learning results of the agent by controlling the magnitude of updating from previous action policy to new policy. At last, simulation platform is built based on mujoco. And algorithms based on DDPG and PPO are employed to UR5 robot manipulator. Both DDPG and PPO can achieve relative optimal decision making for each joint of manipulator and finally complete motion planning to move to target point with obstacle avoidance. Meanwhile motion path of each joint is smooth. Results of experiment show that DDPG achieves more efficient learning in continuous space, and the hyperparameters in the PPO algorithm are easier to determine, and the algorithm can be realized more intuitively. Master of Science (Computer Control and Automation) 2021-10-14T04:37:06Z 2021-10-14T04:37:06Z 2021 Thesis-Master by Coursework Ding, Y. (2021). UR robot manipulator collision avoidance via reinforcement learning. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152895 https://hdl.handle.net/10356/152895 en ISM-DISS-02244 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering::Control and instrumentation::Robotics
spellingShingle Engineering::Electrical and electronic engineering::Control and instrumentation::Robotics
Ding, Yuxin
UR robot manipulator collision avoidance via reinforcement learning
description With the development of intelligent technology, robots start to try to complete more complex tasks, so this requires higher stability of robots to the complex environment, and meanwhile there are new requirements for the self-adaptive ability of robots. Deep reinforcement learning has become a research hot spot recently. This thesis mainly studies the motion planning of multi-dimensional robot manipulators. First, traditional planning methods based on random sampling are explained. Simulations are carried out in ROS to compare effect of four different algorithms. The advantage of sampling based methods is with high searching speed, however, the solution is not optimal, and the path searched each time may be different from the previous. Then the basic idea and methods of reinforcement learning is introduced. Combined reinforcement learning with deep learning, target network and experienced replay buffer are introduced to reduce relevance between sampled data. Based on policy gradient, actor network of DDPG can choose action on continuous intervals. PPO algorithm minimizes the impact of strategy changing on the learning results of the agent by controlling the magnitude of updating from previous action policy to new policy. At last, simulation platform is built based on mujoco. And algorithms based on DDPG and PPO are employed to UR5 robot manipulator. Both DDPG and PPO can achieve relative optimal decision making for each joint of manipulator and finally complete motion planning to move to target point with obstacle avoidance. Meanwhile motion path of each joint is smooth. Results of experiment show that DDPG achieves more efficient learning in continuous space, and the hyperparameters in the PPO algorithm are easier to determine, and the algorithm can be realized more intuitively.
author2 Hu Guoqiang
author_facet Hu Guoqiang
Ding, Yuxin
format Thesis-Master by Coursework
author Ding, Yuxin
author_sort Ding, Yuxin
title UR robot manipulator collision avoidance via reinforcement learning
title_short UR robot manipulator collision avoidance via reinforcement learning
title_full UR robot manipulator collision avoidance via reinforcement learning
title_fullStr UR robot manipulator collision avoidance via reinforcement learning
title_full_unstemmed UR robot manipulator collision avoidance via reinforcement learning
title_sort ur robot manipulator collision avoidance via reinforcement learning
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/152895
_version_ 1772826057069559808