Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach

Multiple unmanned aerial vehicles (UAVs) are able to efficiently accomplish a variety of tasks in complex scenarios. However, developing a collision-avoiding flocking policy for multiple fixed-wing UAVs is still challenging, especially in obstacle-cluttered environments. In this article, we propose...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yan, Chao, Wang, Chang, Xiang, Xiaojia, Low, Kin Huat, Wang, Xiangke, Xu, Xin, Shen, Lincheng
Other Authors:	School of Mechanical and Aerospace Engineering
Format:	Article
Language:	English
Published:	2023
Subjects:	Engineering::Computer science and engineering Task Analysis Collision Avoidance
Online Access:	https://hdl.handle.net/10356/170574
Tags:	Add Tag No Tags, Be the first to tag this record!

id	sg-ntu-dr.10356-170574
record_format	dspace
spelling	sg-ntu-dr.10356-1705742023-09-19T07:51:06Z Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach Yan, Chao Wang, Chang Xiang, Xiaojia Low, Kin Huat Wang, Xiangke Xu, Xin Shen, Lincheng School of Mechanical and Aerospace Engineering Engineering::Computer science and engineering Task Analysis Collision Avoidance Multiple unmanned aerial vehicles (UAVs) are able to efficiently accomplish a variety of tasks in complex scenarios. However, developing a collision-avoiding flocking policy for multiple fixed-wing UAVs is still challenging, especially in obstacle-cluttered environments. In this article, we propose a novel curriculum-based multiagent deep reinforcement learning (MADRL) approach called task-specific curriculum-based MADRL (TSCAL) to learn the decentralized flocking with obstacle avoidance policy for multiple fixed-wing UAVs. The core idea is to decompose the collision-avoiding flocking task into multiple subtasks and progressively increase the number of subtasks to be solved in a staged manner. Meanwhile, TSCAL iteratively alternates between the procedures of online learning and offline transfer. For online learning, we propose a hierarchical recurrent attention multiagent actor-critic (HRAMA) algorithm to learn the policies for the corresponding subtask(s) in each learning stage. For offline transfer, we develop two transfer mechanisms, i.e., model reload and buffer reuse, to transfer knowledge between two neighboring stages. A series of numerical simulations demonstrate the significant advantages of TSCAL in terms of policy optimality, sample efficiency, and learning stability. Finally, the high-fidelity hardware-in-the-loop (HITL) simulation is conducted to verify the adaptability of TSCAL. A video about the numerical and HITL simulations is available at https://youtu.be/R9yLJNYRIqY. This work was supported in part by the Science and Technology Innovation 2030-Key Project of New Generation Artificial Intelligence under Grant 2020AAA0108200, in part by the National Natural Science Foundation of China under Grant 61825305 and Grant 61906203, and in part by the Postgraduate Scientific Research Innovation Project of Hunan Province under Grant CX20210001. The work of Chao Yanwas supported by the China Scholarship Council. 2023-09-19T07:51:06Z 2023-09-19T07:51:06Z 2023 Journal Article Yan, C., Wang, C., Xiang, X., Low, K. H., Wang, X., Xu, X. & Shen, L. (2023). Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach. IEEE Transactions On Neural Networks and Learning Systems. https://dx.doi.org/10.1109/TNNLS.2023.3245124 2162-237X https://hdl.handle.net/10356/170574 10.1109/TNNLS.2023.3245124 37027621 2-s2.0-85149377499 en IEEE Transactions on Neural Networks and Learning Systems © 2023 IEEE. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering Task Analysis Collision Avoidance
spellingShingle	Engineering::Computer science and engineering Task Analysis Collision Avoidance Yan, Chao Wang, Chang Xiang, Xiaojia Low, Kin Huat Wang, Xiangke Xu, Xin Shen, Lincheng Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
description	Multiple unmanned aerial vehicles (UAVs) are able to efficiently accomplish a variety of tasks in complex scenarios. However, developing a collision-avoiding flocking policy for multiple fixed-wing UAVs is still challenging, especially in obstacle-cluttered environments. In this article, we propose a novel curriculum-based multiagent deep reinforcement learning (MADRL) approach called task-specific curriculum-based MADRL (TSCAL) to learn the decentralized flocking with obstacle avoidance policy for multiple fixed-wing UAVs. The core idea is to decompose the collision-avoiding flocking task into multiple subtasks and progressively increase the number of subtasks to be solved in a staged manner. Meanwhile, TSCAL iteratively alternates between the procedures of online learning and offline transfer. For online learning, we propose a hierarchical recurrent attention multiagent actor-critic (HRAMA) algorithm to learn the policies for the corresponding subtask(s) in each learning stage. For offline transfer, we develop two transfer mechanisms, i.e., model reload and buffer reuse, to transfer knowledge between two neighboring stages. A series of numerical simulations demonstrate the significant advantages of TSCAL in terms of policy optimality, sample efficiency, and learning stability. Finally, the high-fidelity hardware-in-the-loop (HITL) simulation is conducted to verify the adaptability of TSCAL. A video about the numerical and HITL simulations is available at https://youtu.be/R9yLJNYRIqY.
author2	School of Mechanical and Aerospace Engineering
author_facet	School of Mechanical and Aerospace Engineering Yan, Chao Wang, Chang Xiang, Xiaojia Low, Kin Huat Wang, Xiangke Xu, Xin Shen, Lincheng
format	Article
author	Yan, Chao Wang, Chang Xiang, Xiaojia Low, Kin Huat Wang, Xiangke Xu, Xin Shen, Lincheng
author_sort	Yan, Chao
title	Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_short	Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_full	Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_fullStr	Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_full_unstemmed	Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach
title_sort	collision-avoiding flocking with multiple fixed-wing uavs in obstacle-cluttered environments: a task-specific curriculum-based madrl approach
publishDate	2023
url	https://hdl.handle.net/10356/170574
_version_	1779156338288361472

Collision-avoiding flocking with multiple fixed-wing UAVs in obstacle-cluttered environments: a task-specific curriculum-based MADRL approach

Similar Items