NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning

Obstacle avoidance under constrained visual perception presents a significant challenge, requiring rapid detection and decision-making within partially observable environments, particularly for unmanned aerial vehicles (UAVs) maneuvering agilely in three-dimensional space. Compared to traditional me...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zhang, Yuhang, Yan, Chao, Xiao, Jiaping, Feroskhan, Mir
Other Authors:	School of Mechanical and Aerospace Engineering
Format:	Article
Language:	English
Published:	2024
Subjects:	Engineering Constrained perception Deep reinforcement learning
Online Access:	https://hdl.handle.net/10356/181017
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-181017
record_format	dspace
spelling	sg-ntu-dr.10356-1810172024-11-11T06:37:17Z NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning Zhang, Yuhang Yan, Chao Xiao, Jiaping Feroskhan, Mir School of Mechanical and Aerospace Engineering Engineering Constrained perception Deep reinforcement learning Obstacle avoidance under constrained visual perception presents a significant challenge, requiring rapid detection and decision-making within partially observable environments, particularly for unmanned aerial vehicles (UAVs) maneuvering agilely in three-dimensional space. Compared to traditional methods, obstacle avoidance algorithms based on deep reinforcement learning (DRL) offer a better comprehension of the uncertain operational environment in an end-to-end manner, reducing computational complexity and enhancing flexibility and scalability. However, the inherent trial-and-error learning mechanism of DRL necessitates numerous iterations for policy convergence, leading to sample inefficiency issues. Meanwhile, existing sample-efficient obstacle avoidance approaches that leverage imitation learning often heavily rely on offline expert demonstrations, which are not always feasible in hazardous environments. To address these challenges, we propose a novel obstacle avoidance approach based on Non-Expert Policy Enhanced DRL (NPE-DRL). This approach integrates a fundamental DRL framework with prior knowledge derived from a non-expert policy-guided imitation learning. During the training phase, the agent starts by online imitating the actions generated by the non-expert policy during interactions and progressively shifts toward autonomously exploring the environment to generate the optimal policy. Both simulation and physical experiments validate that our approach improves sample efficiency and achieves a better exploration-exploitation balance in both virtual and real-world flights. Additionally, our NPE-DRL-based obstacle avoidance approach shows better adaptability in complex environments characterized by larger scales and denser obstacle configurations, demonstrating a significant improvement in UAVs' obstacle avoidance capability. Code available at https://github.com/zzzzzyh111/NonExpert-Guided-Visual-UAV-Navigation-Gazebo. Agency for Science, Technology and Research (ASTAR) This Research is supported by the RIE2020/RIE2025 from MTC Young Individual Research Grants (YIRG) 2021 Grant Call (Grant No. M21K3c0121), administered by ASTAR. 2024-11-11T06:37:17Z 2024-11-11T06:37:17Z 2024 Journal Article Zhang, Y., Yan, C., Xiao, J. & Feroskhan, M. (2024). NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning. IEEE Transactions On Artificial Intelligence, 3464510-. https://dx.doi.org/10.1109/TAI.2024.3464510 2691-4581 https://hdl.handle.net/10356/181017 10.1109/TAI.2024.3464510 2-s2.0-85204775298 3464510 en M21K3c0121 IEEE Transactions on Artificial Intelligence © 2024 IEEE. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering Constrained perception Deep reinforcement learning
spellingShingle	Engineering Constrained perception Deep reinforcement learning Zhang, Yuhang Yan, Chao Xiao, Jiaping Feroskhan, Mir NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
description	Obstacle avoidance under constrained visual perception presents a significant challenge, requiring rapid detection and decision-making within partially observable environments, particularly for unmanned aerial vehicles (UAVs) maneuvering agilely in three-dimensional space. Compared to traditional methods, obstacle avoidance algorithms based on deep reinforcement learning (DRL) offer a better comprehension of the uncertain operational environment in an end-to-end manner, reducing computational complexity and enhancing flexibility and scalability. However, the inherent trial-and-error learning mechanism of DRL necessitates numerous iterations for policy convergence, leading to sample inefficiency issues. Meanwhile, existing sample-efficient obstacle avoidance approaches that leverage imitation learning often heavily rely on offline expert demonstrations, which are not always feasible in hazardous environments. To address these challenges, we propose a novel obstacle avoidance approach based on Non-Expert Policy Enhanced DRL (NPE-DRL). This approach integrates a fundamental DRL framework with prior knowledge derived from a non-expert policy-guided imitation learning. During the training phase, the agent starts by online imitating the actions generated by the non-expert policy during interactions and progressively shifts toward autonomously exploring the environment to generate the optimal policy. Both simulation and physical experiments validate that our approach improves sample efficiency and achieves a better exploration-exploitation balance in both virtual and real-world flights. Additionally, our NPE-DRL-based obstacle avoidance approach shows better adaptability in complex environments characterized by larger scales and denser obstacle configurations, demonstrating a significant improvement in UAVs' obstacle avoidance capability. Code available at https://github.com/zzzzzyh111/NonExpert-Guided-Visual-UAV-Navigation-Gazebo.
author2	School of Mechanical and Aerospace Engineering
author_facet	School of Mechanical and Aerospace Engineering Zhang, Yuhang Yan, Chao Xiao, Jiaping Feroskhan, Mir
format	Article
author	Zhang, Yuhang Yan, Chao Xiao, Jiaping Feroskhan, Mir
author_sort	Zhang, Yuhang
title	NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_short	NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_full	NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_fullStr	NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_full_unstemmed	NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_sort	npe-drl: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
publishDate	2024
url	https://hdl.handle.net/10356/181017
_version_	1816858957168246784

NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning

Similar Items