Robust motion planning for multi-robot systems against position deception attacks

Deep reinforcement learning (DRL) is widely applied in motion planning for multi-robot systems as DRL leverages the offline training process to improve the real-time computation efficiency. In DRL-based methods, the DRL models compute an action for a robot based on the states of its surrounding obst...

Full description

Saved in:
Bibliographic Details
Main Authors: Tang, Wenbing, Zhou, Yuan, Liu, Yang, Ding, Zuohua, Liu, Jing
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/176231
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Deep reinforcement learning (DRL) is widely applied in motion planning for multi-robot systems as DRL leverages the offline training process to improve the real-time computation efficiency. In DRL-based methods, the DRL models compute an action for a robot based on the states of its surrounding obstacles, including other robots in the system. They always assume that the number of obstacles is fixed and the obtained obstacles' states are reliable. However, in the real world, a multi-robot system may suffer from various attacks, such as remote control attacks and network attacks, that cause wrong positions of the surrounding obstacles received by a robot. In this paper, we propose a robust motion planning method DAE-Crit-LSTM, integrating a denoising autoencoder (DAE) with DRL models, to mitigate such position deception attacks in environments with a different number of obstacles. DAE-Crit-LSTM shows the following two advantages. First, DAE-Crit-LSTM can be applied in benign and attacked scenarios and thus does not require any detector. It learns an encoder and a decoder to approximate the accurate positions of the obstacles, no matter under attack or not. Second, DAE-Crit-LSTM applies an LSTM (Long Short-Term Memory)-based DRL model to deal with a variable number of obstacles in the environment. It is worth noting that DAE-Crit-LSTM is method-agnostic and can be easily implemented in state-of-the-art motion planning methods. Comprehensive experiments show that DAE-Crit-LSTM can mitigate position deception attacks and guarantee safe motion. We also demonstrate the effectiveness and generalization of DAE-Crit-LSTM.