Benchmarking spatial-vector queries
Data volumes have grown exponentially, spurred by technological advancements such as the ready availability of smartphones, which have increased global connectivity and access to digital applications. This connectivity has in turn led to the creation of more spatial data, dat...
Main Author: | Wong, Scott Wen Jie |
---|---|
Other Authors: | Gao Cong |
Format: | Final Year Project |
Language: | English |
Published: | Nanyang Technological University, 2024 |
Subjects: | Computer and Information Science |
Online Access: | https://hdl.handle.net/10356/181533 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-181533 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-181533; 2024-12-09T01:07:42Z; Benchmarking spatial-vector queries; Wong, Scott Wen Jie; Gao Cong; College of Computing and Data Science; gaocong@ntu.edu.sg; Computer and Information Science; Bachelor's degree; 2024; Final Year Project (FYP); Wong, S. W. J. (2024). Benchmarking spatial-vector queries. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181533; en; SCSE23-1120; application/pdf; Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science |
description |
Data volumes have grown exponentially, spurred by technological advancements such as the ready availability of smartphones, which have increased global connectivity and access to digital applications. This connectivity has in turn led to the creation of more spatial data, data that provide us with geospatial information that can be used to further improve our lives. New methods transform unstructured data such as text, images and audio into structured data in the form of vectors. These vector embeddings carry semantic meaning, capturing the relationships and context of the data. Storing such high-dimensional vectors requires a database designed for them, something that traditional relational databases are not well suited for. We therefore need to analyse how vector databases work, to understand how traditional databases can be improved to be on par with vector databases in storing and managing such data. In this report, we provide an overview of how vector databases work, focusing on their indexing and querying techniques. Additionally, we design and execute various queries that use different data modalities, evaluating the performance of vector databases and of traditional relational database systems that have been enhanced for vector processing. By evaluating the different database systems, we can compare their performance and understand why some systems are better than others in specific queries, identifying their strengths and limitations. Finally, we draw conclusions on the effectiveness of each database system against the challenges posed by modern data requirements. |
author2 |
Gao Cong |
author_facet |
Gao Cong Wong, Scott Wen Jie |
format |
Final Year Project |
author |
Wong, Scott Wen Jie |
title |
Benchmarking spatial-vector queries |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/181533 |
_version_ |
1819113007459860480 |