Object-oriented indoor navigation for delivery robot

The navigation of object-oriented delivery robot is widely used in daily life. This thesis focuses on the popular VLN tasks in recent years to solve the problem of indoor delivery navigation in unseen environments. In the research of VLN, the cross model interaction between vision and language has m...

全面介紹

Saved in:
書目詳細資料
主要作者: Li, Yuanwei
其他作者: Wang Dan Wei
格式: Thesis-Master by Coursework
語言:English
出版: Nanyang Technological University 2024
主題:
在線閱讀:https://hdl.handle.net/10356/173302
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English
實物特徵
總結:The navigation of object-oriented delivery robot is widely used in daily life. This thesis focuses on the popular VLN tasks in recent years to solve the problem of indoor delivery navigation in unseen environments. In the research of VLN, the cross model interaction between vision and language has made significant progress in the past two years with the rapid development of CV and NLP. The emergence of BERT models also help in training and construct ing navigation frameworks. Although the BERT model has good performance in VLN, the mismatch between instructions and visual information at the input leads to navigation errors for robots in similar scenes. This thesis introduces a cross model interaction transformer to solve the mismatch between instruction and visual information to optimize the input of the BERT model and improve the navigation success rate of the delivery robot.