Learning-based robot navigation in dynamic environments: from indoor scenes to human crowd scenes
The increasing presence of autonomous robots in numerous settings necessitates the development of advanced navigation techniques that can handle complex environments. In this doctoral thesis, we present our previous works about exploring autonomous navigation using Deep Reinforcement Learning (DRL)...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/177871 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | The increasing presence of autonomous robots in numerous settings necessitates the development of advanced navigation techniques that can handle complex environments. In this doctoral thesis, we present our previous works about exploring autonomous navigation using Deep Reinforcement Learning (DRL) methods in various complex and dynamic environments for covering both indoor scenes and outdoor human crowd scenarios. In the earlier phase, our work more focus on indoor scene navigation with learning-based approaches, we first proposed dueling deep deterministic policy gradient (Dueling-TD3) to address the existing issues that the original TD3 has such as inefficient learning and slow convergence speed. In this way, enable the model to derive an ideal action for mobile robot navigation. We compose the dueling network architecture into the critic network to increase the Q-value estimate precision. The results demonstrate that our proposed model outperforms the original model in terms of route planning capabilities. To counter more dynamic indoor situations, our improved work named iTD3-CLN was proposed, which is a DRL-based low-level motion controller, that mainly focuses on map-less autonomous navigation in indoor dynamic scenes. By incorporating enhancements of N-step returns, Priority Experience Replay, and a channel-based Convolutional Laser Network (CLN) architecture to the TD3 algorithm, the proposed method achieves superior navigation performance compared to traditional techniques DWA. Our extensive studies show that in comparison to the state-of-the-art DRL model TD3, our model is with remarkable improvements in training efficiency, accumulated reward, and generalization performance in unseen environments. Despite the effectiveness of iTD3-CLN in indoor scene navigation, the mobile robot application environment is proposing to expand from static indoor to outdoor which is often crowded with humans. Thus, it requires a better crowd inference ability for the agent. The next important component of this thesis, MP-GatedGCN-RL, which is also based on the DRL framework, is proposed to address the challenges of navigating in outdoor crowded environments. It models the environment information in a graph representation and employs a Message-Passing Graph Convolutional Network (MP-GCN) with edge-wise gating mechanisms to encode asymmetric human-human and human-robot interactions. This approach demonstrates significant improvements in success rate and navigation time on simulated environments derived from the ETH/UCY pedestrian datasets compared to both conventional dynamic avoidance methods ORCA and state-of-the-art DRL-based approaches. To sum up, this thesis advances the field of autonomous robot navigation and proposes a series of DRL robot navigation approaches to allow robots to efficiently and safely navigate in a wide range of environments, from indoor dynamics to outdoor human interactions. Experimental results verify the effectiveness of the proposed learning-based navigation methods. |
---|