Accelerating learned descriptor generation for visual localization

Visual SLAM systems use traditional feature extractor to retrieve features, a pair consisting of a keypoint and descriptor, from images. These features can then be matched to estimate the camera pose. However, these traditional feature extractors are surpassed by newer deep learning-based feature ex...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Liu, Woon Kit
مؤلفون آخرون: Lam Siew Kei
التنسيق: Final Year Project
اللغة:English
منشور في: Nanyang Technological University 2024
الموضوعات:
الوصول للمادة أونلاين:https://hdl.handle.net/10356/175279
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة: Nanyang Technological University
اللغة: English
الوصف
الملخص:Visual SLAM systems use traditional feature extractor to retrieve features, a pair consisting of a keypoint and descriptor, from images. These features can then be matched to estimate the camera pose. However, these traditional feature extractors are surpassed by newer deep learning-based feature extractor in the presence of imaging noise, illumination, or viewpoint changes. However, such AI models may suffer performance issues when deployed to embedded devices, which prioritises low-powered consumption. This report investigates the potential of deep learning accelerator libraries to accelerate feature extractor models for application in visual SLAM systems, particularly on embedded devices. TensorRT, is such a library that this can help achieve a significant speedup compared to traditional feature extraction methods.