VisioMap : lightweight 3-D scene reconstruction toward natural indoor localization

Most existing proposals for indoor localization are 'unnatural,' as they rely on sensing abilities not available to human beings. While such a mismatch causes complications in human-computer interactions and thus potentially reduces the usability and friendliness of a localization service,...

Full description

Saved in:
Bibliographic Details
Main Authors: Li, Feng, Hao,Jie, Wang, Jin, Luo, Jun, He, Ying, Yu, Dongxiao, Cheng, Xiuzhen
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2021
Subjects:
Online Access:https://hdl.handle.net/10356/148581
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Most existing proposals for indoor localization are 'unnatural,' as they rely on sensing abilities not available to human beings. While such a mismatch causes complications in human-computer interactions and thus potentially reduces the usability and friendliness of a localization service, it is partially entailed by the need for low-cost/effort sensing with resource-limited mobile devices. Fortunately, recent developments in smart glasses (e.g., Google Glasses) signal a trend toward realistic visual sensing and hence make the sensing ability of mobile devices more compatible to that of human users. Leveraging such front-end developments, we propose VisioMap as a natural indoor localization system that intentionally mimics the human skills in visual localization. VisioMap uses very sparse photograph samples to reconstruct 3-D indoor scenes; this is facilitated by the facts that photographs are taken at the eye-level with high stability and regularity, and that the reconstruction is lightweight as it exploits geometric features rather than image pixels. Localization is in turn performed by matching the geometric features extracted online to the reconstructed 3-D scene, making VisioMap: 1) natural to users as they can see the matched 3-D scene and 2) dispensed with the need for dense fingerprints/POIs toward accurate localization.