NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning

Obstacle avoidance under constrained visual perception presents a significant challenge, requiring rapid detection and decision-making within partially observable environments, particularly for unmanned aerial vehicles (UAVs) maneuvering agilely in three-dimensional space. Compared to traditional me...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Yuhang, Yan, Chao, Xiao, Jiaping, Feroskhan, Mir
Other Authors: School of Mechanical and Aerospace Engineering
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181017
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181017
record_format dspace
spelling sg-ntu-dr.10356-1810172024-11-11T06:37:17Z NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning Zhang, Yuhang Yan, Chao Xiao, Jiaping Feroskhan, Mir School of Mechanical and Aerospace Engineering Engineering Constrained perception Deep reinforcement learning Obstacle avoidance under constrained visual perception presents a significant challenge, requiring rapid detection and decision-making within partially observable environments, particularly for unmanned aerial vehicles (UAVs) maneuvering agilely in three-dimensional space. Compared to traditional methods, obstacle avoidance algorithms based on deep reinforcement learning (DRL) offer a better comprehension of the uncertain operational environment in an end-to-end manner, reducing computational complexity and enhancing flexibility and scalability. However, the inherent trial-and-error learning mechanism of DRL necessitates numerous iterations for policy convergence, leading to sample inefficiency issues. Meanwhile, existing sample-efficient obstacle avoidance approaches that leverage imitation learning often heavily rely on offline expert demonstrations, which are not always feasible in hazardous environments. To address these challenges, we propose a novel obstacle avoidance approach based on Non-Expert Policy Enhanced DRL (NPE-DRL). This approach integrates a fundamental DRL framework with prior knowledge derived from a non-expert policy-guided imitation learning. During the training phase, the agent starts by online imitating the actions generated by the non-expert policy during interactions and progressively shifts toward autonomously exploring the environment to generate the optimal policy. Both simulation and physical experiments validate that our approach improves sample efficiency and achieves a better exploration-exploitation balance in both virtual and real-world flights. Additionally, our NPE-DRL-based obstacle avoidance approach shows better adaptability in complex environments characterized by larger scales and denser obstacle configurations, demonstrating a significant improvement in UAVs' obstacle avoidance capability. Code available at https://github.com/zzzzzyh111/NonExpert-Guided-Visual-UAV-Navigation-Gazebo. Agency for Science, Technology and Research (A*STAR) This Research is supported by the RIE2020/RIE2025 from MTC Young Individual Research Grants (YIRG) 2021 Grant Call (Grant No. M21K3c0121), administered by A*STAR. 2024-11-11T06:37:17Z 2024-11-11T06:37:17Z 2024 Journal Article Zhang, Y., Yan, C., Xiao, J. & Feroskhan, M. (2024). NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning. IEEE Transactions On Artificial Intelligence, 3464510-. https://dx.doi.org/10.1109/TAI.2024.3464510 2691-4581 https://hdl.handle.net/10356/181017 10.1109/TAI.2024.3464510 2-s2.0-85204775298 3464510 en M21K3c0121 IEEE Transactions on Artificial Intelligence © 2024 IEEE. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
Constrained perception
Deep reinforcement learning
spellingShingle Engineering
Constrained perception
Deep reinforcement learning
Zhang, Yuhang
Yan, Chao
Xiao, Jiaping
Feroskhan, Mir
NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
description Obstacle avoidance under constrained visual perception presents a significant challenge, requiring rapid detection and decision-making within partially observable environments, particularly for unmanned aerial vehicles (UAVs) maneuvering agilely in three-dimensional space. Compared to traditional methods, obstacle avoidance algorithms based on deep reinforcement learning (DRL) offer a better comprehension of the uncertain operational environment in an end-to-end manner, reducing computational complexity and enhancing flexibility and scalability. However, the inherent trial-and-error learning mechanism of DRL necessitates numerous iterations for policy convergence, leading to sample inefficiency issues. Meanwhile, existing sample-efficient obstacle avoidance approaches that leverage imitation learning often heavily rely on offline expert demonstrations, which are not always feasible in hazardous environments. To address these challenges, we propose a novel obstacle avoidance approach based on Non-Expert Policy Enhanced DRL (NPE-DRL). This approach integrates a fundamental DRL framework with prior knowledge derived from a non-expert policy-guided imitation learning. During the training phase, the agent starts by online imitating the actions generated by the non-expert policy during interactions and progressively shifts toward autonomously exploring the environment to generate the optimal policy. Both simulation and physical experiments validate that our approach improves sample efficiency and achieves a better exploration-exploitation balance in both virtual and real-world flights. Additionally, our NPE-DRL-based obstacle avoidance approach shows better adaptability in complex environments characterized by larger scales and denser obstacle configurations, demonstrating a significant improvement in UAVs' obstacle avoidance capability. Code available at https://github.com/zzzzzyh111/NonExpert-Guided-Visual-UAV-Navigation-Gazebo.
author2 School of Mechanical and Aerospace Engineering
author_facet School of Mechanical and Aerospace Engineering
Zhang, Yuhang
Yan, Chao
Xiao, Jiaping
Feroskhan, Mir
format Article
author Zhang, Yuhang
Yan, Chao
Xiao, Jiaping
Feroskhan, Mir
author_sort Zhang, Yuhang
title NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_short NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_full NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_fullStr NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_full_unstemmed NPE-DRL: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
title_sort npe-drl: enhancing perception constrained obstacle avoidance with non-expert policy guided reinforcement learning
publishDate 2024
url https://hdl.handle.net/10356/181017
_version_ 1816858957168246784