Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach

Multiple unmanned aerial vehicles (UAVs) are able to efficiently accomplish a variety of tasks in complex scenarios. However, developing a collision-avoiding flocking policy for multiple fixed-wing UAVs is still challenging, especially in obstacle-cluttered environments. In this article, we propose...

Full description

Saved in:
Bibliographic Details
Main Authors: Yan, Chao, Wang, Chang, Xiang, Xiaojia, Low, Kin Huat, Wang, Xiangke, Xu, Xin, Shen, Lincheng
Other Authors: School of Mechanical and Aerospace Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/170574
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-170574
record_format dspace
spelling sg-ntu-dr.10356-1705742023-09-19T07:51:06Z Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach Yan, Chao Wang, Chang Xiang, Xiaojia Low, Kin Huat Wang, Xiangke Xu, Xin Shen, Lincheng School of Mechanical and Aerospace Engineering Engineering::Computer science and engineering Task Analysis Collision Avoidance Multiple unmanned aerial vehicles (UAVs) are able to efficiently accomplish a variety of tasks in complex scenarios. However, developing a collision-avoiding flocking policy for multiple fixed-wing UAVs is still challenging, especially in obstacle-cluttered environments. In this article, we propose a novel curriculum-based multiagent deep reinforcement learning (MADRL) approach called task-specific curriculum-based MADRL (TSCAL) to learn the decentralized flocking with obstacle avoidance policy for multiple fixed-wing UAVs. The core idea is to decompose the collision-avoiding flocking task into multiple subtasks and progressively increase the number of subtasks to be solved in a staged manner. Meanwhile, TSCAL iteratively alternates between the procedures of online learning and offline transfer. For online learning, we propose a hierarchical recurrent attention multiagent actor-critic (HRAMA) algorithm to learn the policies for the corresponding subtask(s) in each learning stage. For offline transfer, we develop two transfer mechanisms, i.e., model reload and buffer reuse, to transfer knowledge between two neighboring stages. A series of numerical simulations demonstrate the significant advantages of TSCAL in terms of policy optimality, sample efficiency, and learning stability. Finally, the high-fidelity hardware-in-the-loop (HITL) simulation is conducted to verify the adaptability of TSCAL. A video about the numerical and HITL simulations is available at https://youtu.be/R9yLJNYRIqY. This work was supported in part by the Science and Technology Innovation 2030-Key Project of New Generation Artificial Intelligence under Grant 2020AAA0108200, in part by the National Natural Science Foundation of China under Grant 61825305 and Grant 61906203, and in part by the Postgraduate Scientific Research Innovation Project of Hunan Province under Grant CX20210001. The work of Chao Yanwas supported by the China Scholarship Council. 2023-09-19T07:51:06Z 2023-09-19T07:51:06Z 2023 Journal Article Yan, C., Wang, C., Xiang, X., Low, K. H., Wang, X., Xu, X. & Shen, L. (2023). Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach. IEEE Transactions On Neural Networks and Learning Systems. https://dx.doi.org/10.1109/TNNLS.2023.3245124 2162-237X https://hdl.handle.net/10356/170574 10.1109/TNNLS.2023.3245124 37027621 2-s2.0-85149377499 en IEEE Transactions on Neural Networks and Learning Systems © 2023 IEEE. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Task Analysis
Collision Avoidance
spellingShingle Engineering::Computer science and engineering
Task Analysis
Collision Avoidance
Yan, Chao
Wang, Chang
Xiang, Xiaojia
Low, Kin Huat
Wang, Xiangke
Xu, Xin
Shen, Lincheng
Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
description Multiple unmanned aerial vehicles (UAVs) are able to efficiently accomplish a variety of tasks in complex scenarios. However, developing a collision-avoiding flocking policy for multiple fixed-wing UAVs is still challenging, especially in obstacle-cluttered environments. In this article, we propose a novel curriculum-based multiagent deep reinforcement learning (MADRL) approach called task-specific curriculum-based MADRL (TSCAL) to learn the decentralized flocking with obstacle avoidance policy for multiple fixed-wing UAVs. The core idea is to decompose the collision-avoiding flocking task into multiple subtasks and progressively increase the number of subtasks to be solved in a staged manner. Meanwhile, TSCAL iteratively alternates between the procedures of online learning and offline transfer. For online learning, we propose a hierarchical recurrent attention multiagent actor-critic (HRAMA) algorithm to learn the policies for the corresponding subtask(s) in each learning stage. For offline transfer, we develop two transfer mechanisms, i.e., model reload and buffer reuse, to transfer knowledge between two neighboring stages. A series of numerical simulations demonstrate the significant advantages of TSCAL in terms of policy optimality, sample efficiency, and learning stability. Finally, the high-fidelity hardware-in-the-loop (HITL) simulation is conducted to verify the adaptability of TSCAL. A video about the numerical and HITL simulations is available at https://youtu.be/R9yLJNYRIqY.
author2 School of Mechanical and Aerospace Engineering
author_facet School of Mechanical and Aerospace Engineering
Yan, Chao
Wang, Chang
Xiang, Xiaojia
Low, Kin Huat
Wang, Xiangke
Xu, Xin
Shen, Lincheng
format Article
author Yan, Chao
Wang, Chang
Xiang, Xiaojia
Low, Kin Huat
Wang, Xiangke
Xu, Xin
Shen, Lincheng
author_sort Yan, Chao
title Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_short Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_full Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_fullStr Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_full_unstemmed Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_sort collision-avoiding flocking with multiple fixed-wing uavs in obstacle-cluttered environments: a task-specific curriculum-based madrl approach
publishDate 2023
url https://hdl.handle.net/10356/170574
_version_ 1779156338288361472