Benchmarking spatial-vector queries

The growth of data has increased exponentially, spurred by technological advancements such as smartphones becoming readily available, providing an increase in global connectivity as well as access to digital applications. This increased connectivity has led to increased creation of spatial data, dat...

Full description

Saved in:
Bibliographic Details
Main Author: Wong, Scott Wen Jie
Other Authors: Gao Cong
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181533
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181533
record_format dspace
spelling sg-ntu-dr.10356-1815332024-12-09T01:07:42Z Benchmarking spatial-vector queries Wong, Scott Wen Jie Gao Cong College of Computing and Data Science gaocong@ntu.edu.sg Computer and Information Science The growth of data has increased exponentially, spurred by technological advancements such as smartphones becoming readily available, providing an increase in global connectivity as well as access to digital applications. This increased connectivity has led to increased creation of spatial data, data that provide us geospatial information that can be used to further improve our lives. New methods transforming unstructured data such as text, images and audio to structured data in the form of vectors. These vector embeddings have semantic meanings that capture the relationship and context of the data. As such, there must be a database that is able to store such high-dimensional vectors, something that traditional relational databases are not well suited for. Thus, we will need to analyse how vector databases work, to understand and see how we can improve such traditional databases to be on par with vector databases in terms of storing and managing such data. In this report, we provide an overview of how vector databases work, focusing on their indexing and querying techniques. Additionally, we will design and execute various queries that use different data modalities, evaluating the performance of traditional relational database systems that have been enhanced for vector processing and vector databases. By evaluating the different database systems, we can compare their performance and understand why some systems are better than others in specific queries, identifying their strengths and limitations. Finally, we conclude on the effectiveness of each database system against the challenges faced by modern data requirements. Bachelor's degree 2024-12-09T01:07:42Z 2024-12-09T01:07:42Z 2024 Final Year Project (FYP) Wong, S. W. J. (2024). Benchmarking spatial-vector queries. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181533 https://hdl.handle.net/10356/181533 en SCSE23-1120 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
spellingShingle Computer and Information Science
Wong, Scott Wen Jie
Benchmarking spatial-vector queries
description The growth of data has increased exponentially, spurred by technological advancements such as smartphones becoming readily available, providing an increase in global connectivity as well as access to digital applications. This increased connectivity has led to increased creation of spatial data, data that provide us geospatial information that can be used to further improve our lives. New methods transforming unstructured data such as text, images and audio to structured data in the form of vectors. These vector embeddings have semantic meanings that capture the relationship and context of the data. As such, there must be a database that is able to store such high-dimensional vectors, something that traditional relational databases are not well suited for. Thus, we will need to analyse how vector databases work, to understand and see how we can improve such traditional databases to be on par with vector databases in terms of storing and managing such data. In this report, we provide an overview of how vector databases work, focusing on their indexing and querying techniques. Additionally, we will design and execute various queries that use different data modalities, evaluating the performance of traditional relational database systems that have been enhanced for vector processing and vector databases. By evaluating the different database systems, we can compare their performance and understand why some systems are better than others in specific queries, identifying their strengths and limitations. Finally, we conclude on the effectiveness of each database system against the challenges faced by modern data requirements.
author2 Gao Cong
author_facet Gao Cong
Wong, Scott Wen Jie
format Final Year Project
author Wong, Scott Wen Jie
author_sort Wong, Scott Wen Jie
title Benchmarking spatial-vector queries
title_short Benchmarking spatial-vector queries
title_full Benchmarking spatial-vector queries
title_fullStr Benchmarking spatial-vector queries
title_full_unstemmed Benchmarking spatial-vector queries
title_sort benchmarking spatial-vector queries
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/181533
_version_ 1819113007459860480