Finding all nearest neighbors with a single graph traversal

Finding the nearest neighbor is a key operation in data analysis and mining. An important variant of nearest neighbor query is the all nearest neighbor (ANN) query, which reports all nearest neighbors for a given set of query objects. Existing studies on ANN queries have focused on Euclidean space....

Full description

Saved in:
Bibliographic Details
Main Authors: XU, Yixin, QI Jianzhong, BOROVICA‐GAJIC Renata, KULIK Lars
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2018
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8276
https://ink.library.smu.edu.sg/context/sis_research/article/9279/viewcontent/Finding_All_Nearest_Neighbors_with_a_Single_Graph_Traversal.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9279
record_format dspace
spelling sg-smu-ink.sis_research-92792023-11-10T08:42:38Z Finding all nearest neighbors with a single graph traversal XU, Yixin QI Jianzhong, BOROVICA‐GAJIC Renata, KULIK Lars, Finding the nearest neighbor is a key operation in data analysis and mining. An important variant of nearest neighbor query is the all nearest neighbor (ANN) query, which reports all nearest neighbors for a given set of query objects. Existing studies on ANN queries have focused on Euclidean space. Given the widespread occurrence of spatial networks in urban environments, we study the ANN query in spatial network settings. An example of an ANN query on spatial networks is finding the nearest car parks for all cars currently on the road. We propose VIVET, an index-based algorithm to efficiently process ANN queries. VIVET performs a single traversal on a spatial network to precompute the nearest data object for every vertex in the network, which enables us to answer an ANN query through a simple lookup on the precomputed nearest neighbors. We analyze the cost of the proposed algorithm both theoretically and empirically. Our results show that the algorithm is highly efficient and scalable. It outperforms adapted state-of-the-art nearest neighbor algorithms in both precomputation and query processing costs by more than one order of magnitude. 2018-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8276 info:doi/10.1007/978-3-319-91452-7_15 https://ink.library.smu.edu.sg/context/sis_research/article/9279/viewcontent/Finding_All_Nearest_Neighbors_with_a_Single_Graph_Traversal.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University All nearest neighbors Euclidean spaces Index based algorithm Nearest neighbor algorithm Nearest neighbor queries Nearest neighbors State of the art Urban environments Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic All nearest neighbors
Euclidean spaces
Index based algorithm
Nearest neighbor algorithm
Nearest neighbor queries
Nearest neighbors
State of the art
Urban environments
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle All nearest neighbors
Euclidean spaces
Index based algorithm
Nearest neighbor algorithm
Nearest neighbor queries
Nearest neighbors
State of the art
Urban environments
Databases and Information Systems
Numerical Analysis and Scientific Computing
XU, Yixin
QI Jianzhong,
BOROVICA‐GAJIC Renata,
KULIK Lars,
Finding all nearest neighbors with a single graph traversal
description Finding the nearest neighbor is a key operation in data analysis and mining. An important variant of nearest neighbor query is the all nearest neighbor (ANN) query, which reports all nearest neighbors for a given set of query objects. Existing studies on ANN queries have focused on Euclidean space. Given the widespread occurrence of spatial networks in urban environments, we study the ANN query in spatial network settings. An example of an ANN query on spatial networks is finding the nearest car parks for all cars currently on the road. We propose VIVET, an index-based algorithm to efficiently process ANN queries. VIVET performs a single traversal on a spatial network to precompute the nearest data object for every vertex in the network, which enables us to answer an ANN query through a simple lookup on the precomputed nearest neighbors. We analyze the cost of the proposed algorithm both theoretically and empirically. Our results show that the algorithm is highly efficient and scalable. It outperforms adapted state-of-the-art nearest neighbor algorithms in both precomputation and query processing costs by more than one order of magnitude.
format text
author XU, Yixin
QI Jianzhong,
BOROVICA‐GAJIC Renata,
KULIK Lars,
author_facet XU, Yixin
QI Jianzhong,
BOROVICA‐GAJIC Renata,
KULIK Lars,
author_sort XU, Yixin
title Finding all nearest neighbors with a single graph traversal
title_short Finding all nearest neighbors with a single graph traversal
title_full Finding all nearest neighbors with a single graph traversal
title_fullStr Finding all nearest neighbors with a single graph traversal
title_full_unstemmed Finding all nearest neighbors with a single graph traversal
title_sort finding all nearest neighbors with a single graph traversal
publisher Institutional Knowledge at Singapore Management University
publishDate 2018
url https://ink.library.smu.edu.sg/sis_research/8276
https://ink.library.smu.edu.sg/context/sis_research/article/9279/viewcontent/Finding_All_Nearest_Neighbors_with_a_Single_Graph_Traversal.pdf
_version_ 1783955663213297664