Object-oriented indoor navigation for delivery robot

The navigation of object-oriented delivery robot is widely used in daily life. This thesis focuses on the popular VLN tasks in recent years to solve the problem of indoor delivery navigation in unseen environments. In the research of VLN, the cross model interaction between vision and language has m...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Li, Yuanwei
مؤلفون آخرون: Wang Dan Wei
التنسيق: Thesis-Master by Coursework
اللغة:English
منشور في: Nanyang Technological University 2024
الموضوعات:
الوصول للمادة أونلاين:https://hdl.handle.net/10356/173302
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
الوصف
الملخص:The navigation of object-oriented delivery robot is widely used in daily life. This thesis focuses on the popular VLN tasks in recent years to solve the problem of indoor delivery navigation in unseen environments. In the research of VLN, the cross model interaction between vision and language has made significant progress in the past two years with the rapid development of CV and NLP. The emergence of BERT models also help in training and construct ing navigation frameworks. Although the BERT model has good performance in VLN, the mismatch between instructions and visual information at the input leads to navigation errors for robots in similar scenes. This thesis introduces a cross model interaction transformer to solve the mismatch between instruction and visual information to optimize the input of the BERT model and improve the navigation success rate of the delivery robot.