Object-oriented indoor navigation for delivery robot
The navigation of object-oriented delivery robot is widely used in daily life. This thesis focuses on the popular VLN tasks in recent years to solve the problem of indoor delivery navigation in unseen environments. In the research of VLN, the cross model interaction between vision and language has m...
Saved in:
主要作者: | |
---|---|
其他作者: | |
格式: | Thesis-Master by Coursework |
語言: | English |
出版: |
Nanyang Technological University
2024
|
主題: | |
在線閱讀: | https://hdl.handle.net/10356/173302 |
標簽: |
添加標簽
沒有標簽, 成為第一個標記此記錄!
|
機構: | Nanyang Technological University |
語言: | English |
總結: | The navigation of object-oriented delivery robot is widely used in daily life. This thesis focuses on the popular VLN tasks in recent years to solve the problem of indoor delivery navigation in unseen environments. In the research of VLN, the cross model interaction between vision and language has made significant progress in the past two years with the rapid development of CV and NLP. The emergence of BERT models also help in training and construct ing navigation frameworks. Although the BERT model has good performance in VLN, the mismatch between instructions and visual information at the input leads to navigation errors for robots in similar scenes. This thesis introduces a cross model interaction transformer to solve the mismatch between instruction and visual information to optimize the input of the BERT model and improve the navigation success rate of the delivery robot. |
---|