NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
Obstacle avoidance under constrained visual perception presents a significant challenge, requiring rapid detection and decision-making within partially observable environments, particularly for unmanned aerial vehicles (UAVs) maneuvering agilely in three-dimensional space. Compared to traditional me...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/181017 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-181017 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1810172024-11-11T06:37:17Z NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning Zhang, Yuhang Yan, Chao Xiao, Jiaping Feroskhan, Mir School of Mechanical and Aerospace Engineering Engineering Constrained perception Deep reinforcement learning Obstacle avoidance under constrained visual perception presents a significant challenge, requiring rapid detection and decision-making within partially observable environments, particularly for unmanned aerial vehicles (UAVs) maneuvering agilely in three-dimensional space. Compared to traditional methods, obstacle avoidance algorithms based on deep reinforcement learning (DRL) offer a better comprehension of the uncertain operational environment in an end-to-end manner, reducing computational complexity and enhancing flexibility and scalability. However, the inherent trial-and-error learning mechanism of DRL necessitates numerous iterations for policy convergence, leading to sample inefficiency issues. Meanwhile, existing sample-efficient obstacle avoidance approaches that leverage imitation learning often heavily rely on offline expert demonstrations, which are not always feasible in hazardous environments. To address these challenges, we propose a novel obstacle avoidance approach based on Non-Expert Policy Enhanced DRL (NPE-DRL). This approach integrates a fundamental DRL framework with prior knowledge derived from a non-expert policy-guided imitation learning. During the training phase, the agent starts by online imitating the actions generated by the non-expert policy during interactions and progressively shifts toward autonomously exploring the environment to generate the optimal policy. Both simulation and physical experiments validate that our approach improves sample efficiency and achieves a better exploration-exploitation balance in both virtual and real-world flights. Additionally, our NPE-DRL-based obstacle avoidance approach shows better adaptability in complex environments characterized by larger scales and denser obstacle configurations, demonstrating a significant improvement in UAVs' obstacle avoidance capability. Code available at https://github.com/zzzzzyh111/NonExpert-Guided-Visual-UAV-Navigation-Gazebo. Agency for Science, Technology and Research (A*STAR) This Research is supported by the RIE2020/RIE2025 from MTC Young Individual Research Grants (YIRG) 2021 Grant Call (Grant No. M21K3c0121), administered by A*STAR. 2024-11-11T06:37:17Z 2024-11-11T06:37:17Z 2024 Journal Article Zhang, Y., Yan, C., Xiao, J. & Feroskhan, M. (2024). NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning. IEEE Transactions On Artificial Intelligence, 3464510-. https://dx.doi.org/10.1109/TAI.2024.3464510 2691-4581 https://hdl.handle.net/10356/181017 10.1109/TAI.2024.3464510 2-s2.0-85204775298 3464510 en M21K3c0121 IEEE Transactions on Artificial Intelligence © 2024 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering Constrained perception Deep reinforcement learning |
spellingShingle |
Engineering Constrained perception Deep reinforcement learning Zhang, Yuhang Yan, Chao Xiao, Jiaping Feroskhan, Mir NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning |
description |
Obstacle avoidance under constrained visual perception presents a significant challenge, requiring rapid detection and decision-making within partially observable environments, particularly for unmanned aerial vehicles (UAVs) maneuvering agilely in three-dimensional space. Compared to traditional methods, obstacle avoidance algorithms based on deep reinforcement learning (DRL) offer a better comprehension of the uncertain operational environment in an end-to-end manner, reducing computational complexity and enhancing flexibility and scalability. However, the inherent trial-and-error learning mechanism of DRL necessitates numerous iterations for policy convergence, leading to sample inefficiency issues. Meanwhile, existing sample-efficient obstacle avoidance approaches that leverage imitation learning often heavily rely on offline expert demonstrations, which are not always feasible in hazardous environments. To address these challenges, we propose a novel obstacle avoidance approach based on Non-Expert Policy Enhanced DRL (NPE-DRL). This approach integrates a fundamental DRL framework with prior knowledge derived from a non-expert policy-guided imitation learning. During the training phase, the agent starts by online imitating the actions generated by the non-expert policy during interactions and progressively shifts toward autonomously exploring the environment to generate the optimal policy. Both simulation and physical experiments validate that our approach improves sample efficiency and achieves a better exploration-exploitation balance in both virtual and real-world flights. Additionally, our NPE-DRL-based obstacle avoidance approach shows better adaptability in complex environments characterized by larger scales and denser obstacle configurations, demonstrating a significant improvement in UAVs' obstacle avoidance capability. Code available at https://github.com/zzzzzyh111/NonExpert-Guided-Visual-UAV-Navigation-Gazebo. |
author2 |
School of Mechanical and Aerospace Engineering |
author_facet |
School of Mechanical and Aerospace Engineering Zhang, Yuhang Yan, Chao Xiao, Jiaping Feroskhan, Mir |
format |
Article |
author |
Zhang, Yuhang Yan, Chao Xiao, Jiaping Feroskhan, Mir |
author_sort |
Zhang, Yuhang |
title |
NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning |
title_short |
NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning |
title_full |
NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning |
title_fullStr |
NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning |
title_full_unstemmed |
NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning |
title_sort |
npe-drl: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/181017 |
_version_ |
1816858957168246784 |