UR robot manipulator collision avoidance via reinforcement learning

With the development of intelligent technology, robots start to try to complete more complex tasks, so this requires higher stability of robots to the complex environment, and meanwhile there are new requirements for the self-adaptive ability of robots. Deep reinforcement learning has become a resea...

Full description

Saved in:

Bibliographic Details
Main Author:	Ding, Yuxin
Other Authors:	Hu Guoqiang
Format:	Thesis-Master by Coursework
Language:	English
Published:	Nanyang Technological University 2021
Subjects:	Engineering::Electrical and electronic engineering::Control and instrumentation::Robotics
Online Access:	https://hdl.handle.net/10356/152895
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-152895
record_format	dspace
spelling	sg-ntu-dr.10356-1528952023-07-04T17:40:12Z UR robot manipulator collision avoidance via reinforcement learning Ding, Yuxin Hu Guoqiang School of Electrical and Electronic Engineering GQHu@ntu.edu.sg Engineering::Electrical and electronic engineering::Control and instrumentation::Robotics With the development of intelligent technology, robots start to try to complete more complex tasks, so this requires higher stability of robots to the complex environment, and meanwhile there are new requirements for the self-adaptive ability of robots. Deep reinforcement learning has become a research hot spot recently. This thesis mainly studies the motion planning of multi-dimensional robot manipulators. First, traditional planning methods based on random sampling are explained. Simulations are carried out in ROS to compare effect of four different algorithms. The advantage of sampling based methods is with high searching speed, however, the solution is not optimal, and the path searched each time may be different from the previous. Then the basic idea and methods of reinforcement learning is introduced. Combined reinforcement learning with deep learning, target network and experienced replay buffer are introduced to reduce relevance between sampled data. Based on policy gradient, actor network of DDPG can choose action on continuous intervals. PPO algorithm minimizes the impact of strategy changing on the learning results of the agent by controlling the magnitude of updating from previous action policy to new policy. At last, simulation platform is built based on mujoco. And algorithms based on DDPG and PPO are employed to UR5 robot manipulator. Both DDPG and PPO can achieve relative optimal decision making for each joint of manipulator and finally complete motion planning to move to target point with obstacle avoidance. Meanwhile motion path of each joint is smooth. Results of experiment show that DDPG achieves more efficient learning in continuous space, and the hyperparameters in the PPO algorithm are easier to determine, and the algorithm can be realized more intuitively. Master of Science (Computer Control and Automation) 2021-10-14T04:37:06Z 2021-10-14T04:37:06Z 2021 Thesis-Master by Coursework Ding, Y. (2021). UR robot manipulator collision avoidance via reinforcement learning. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152895 https://hdl.handle.net/10356/152895 en ISM-DISS-02244 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering::Control and instrumentation::Robotics
spellingShingle	Engineering::Electrical and electronic engineering::Control and instrumentation::Robotics Ding, Yuxin UR robot manipulator collision avoidance via reinforcement learning
description	With the development of intelligent technology, robots start to try to complete more complex tasks, so this requires higher stability of robots to the complex environment, and meanwhile there are new requirements for the self-adaptive ability of robots. Deep reinforcement learning has become a research hot spot recently. This thesis mainly studies the motion planning of multi-dimensional robot manipulators. First, traditional planning methods based on random sampling are explained. Simulations are carried out in ROS to compare effect of four different algorithms. The advantage of sampling based methods is with high searching speed, however, the solution is not optimal, and the path searched each time may be different from the previous. Then the basic idea and methods of reinforcement learning is introduced. Combined reinforcement learning with deep learning, target network and experienced replay buffer are introduced to reduce relevance between sampled data. Based on policy gradient, actor network of DDPG can choose action on continuous intervals. PPO algorithm minimizes the impact of strategy changing on the learning results of the agent by controlling the magnitude of updating from previous action policy to new policy. At last, simulation platform is built based on mujoco. And algorithms based on DDPG and PPO are employed to UR5 robot manipulator. Both DDPG and PPO can achieve relative optimal decision making for each joint of manipulator and finally complete motion planning to move to target point with obstacle avoidance. Meanwhile motion path of each joint is smooth. Results of experiment show that DDPG achieves more efficient learning in continuous space, and the hyperparameters in the PPO algorithm are easier to determine, and the algorithm can be realized more intuitively.
author2	Hu Guoqiang
author_facet	Hu Guoqiang Ding, Yuxin
format	Thesis-Master by Coursework
author	Ding, Yuxin
author_sort	Ding, Yuxin
title	UR robot manipulator collision avoidance via reinforcement learning
title_short	UR robot manipulator collision avoidance via reinforcement learning
title_full	UR robot manipulator collision avoidance via reinforcement learning
title_fullStr	UR robot manipulator collision avoidance via reinforcement learning
title_full_unstemmed	UR robot manipulator collision avoidance via reinforcement learning
title_sort	ur robot manipulator collision avoidance via reinforcement learning
publisher	Nanyang Technological University
publishDate	2021
url	https://hdl.handle.net/10356/152895
_version_	1772826057069559808

UR robot manipulator collision avoidance via reinforcement learning

Similar Items