Large scale automatic scene generation to support deep-reinforcement-learning based navigation in autonomous mobile robots

With the ageing global population, already understaffed healthcare institutions face rising rates of hospitalization and chronic illnesses. This increases nurse burnout, decreasing quality-of-care and nurse retention. It also increases rates of patient oversight, causing prolonged hospital stays and...

Full description

Saved in:
Bibliographic Details
Main Author: Bay, Natania Yining
Other Authors: Andy Khong W H
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/177284
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:With the ageing global population, already understaffed healthcare institutions face rising rates of hospitalization and chronic illnesses. This increases nurse burnout, decreasing quality-of-care and nurse retention. It also increases rates of patient oversight, causing prolonged hospital stays and higher mortality. Use of reinforcement-learning trained autonomous mobile robots to support intra-hospital prescription delivery and vital-sign monitoring at patient residences, can potentially alleviate understaffing and ensure continuity-of-care for patients with chronic illnesses, providing physicians with more comprehensive knowledge of patient health conditions. Focusing on navigation along corridors, shows distinct lack of relevant training data required for training robust navigation policies in this context. Furthermore, the high-dimensionality required of effective training data for robotic-AI applications, increases the difficulty and complexity of curating or constructing such datasets. This work develops an algorithm for bulk generation of logical yet diverse virtual corridor environments for such applications. Associated JSON information files, allow interfacing with the StableBaselines3 reinforcement-learning framework, facilitating policy training with multiple environments and target locations. Testing has shown efficacy of generated environments for training a navigation policy. Furthermore, the algorithm is designed for extensibility, allowing easy inclusion of more variations and new features, which stand to further increase the algorithm’s diversity, robustness, and functionality.