Implementing semantic search for textual data in web applications

Semantic search, also known as vector search, retrieves data based on their semantic similarity. It is enabled by sentence embeddings, which are high-dimension vectors that encapsulate the semantic meaning of sentences. Compared to traditional keyword search, semantic search accounts for the true in...

Full description

Saved in:
Bibliographic Details
Main Author: Toh, Jeremy Gen Yang
Other Authors: Andy Khong W H
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/177123
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-177123
record_format dspace
spelling sg-ntu-dr.10356-1771232024-05-31T15:43:19Z Implementing semantic search for textual data in web applications Toh, Jeremy Gen Yang Andy Khong W H School of Electrical and Electronic Engineering AndyKhong@ntu.edu.sg Computer and Information Science Engineering Semantic search Semantic search, also known as vector search, retrieves data based on their semantic similarity. It is enabled by sentence embeddings, which are high-dimension vectors that encapsulate the semantic meaning of sentences. Compared to traditional keyword search, semantic search accounts for the true intent of user queries which keyword search struggles to capture. This paper explores the implementation of semantic search in web applications by processing sentences using Sentence-BERT (SBERT), which is a pre-trained deep learning language model for generating meaningful, high-dimension vectors called sentence embeddings. These embeddings are then stored in PostgreSQL database with an extension, vector, which enables efficient similarity comparisons during search queries. This work details the findings from developing a vector search system, integrating it with a web application, and deploying it online. Bachelor's degree 2024-05-27T04:26:03Z 2024-05-27T04:26:03Z 2024 Final Year Project (FYP) Toh, J. G. Y. (2024). Implementing semantic search for textual data in web applications. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/177123 https://hdl.handle.net/10356/177123 en A3261-231 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Engineering
Semantic search
spellingShingle Computer and Information Science
Engineering
Semantic search
Toh, Jeremy Gen Yang
Implementing semantic search for textual data in web applications
description Semantic search, also known as vector search, retrieves data based on their semantic similarity. It is enabled by sentence embeddings, which are high-dimension vectors that encapsulate the semantic meaning of sentences. Compared to traditional keyword search, semantic search accounts for the true intent of user queries which keyword search struggles to capture. This paper explores the implementation of semantic search in web applications by processing sentences using Sentence-BERT (SBERT), which is a pre-trained deep learning language model for generating meaningful, high-dimension vectors called sentence embeddings. These embeddings are then stored in PostgreSQL database with an extension, vector, which enables efficient similarity comparisons during search queries. This work details the findings from developing a vector search system, integrating it with a web application, and deploying it online.
author2 Andy Khong W H
author_facet Andy Khong W H
Toh, Jeremy Gen Yang
format Final Year Project
author Toh, Jeremy Gen Yang
author_sort Toh, Jeremy Gen Yang
title Implementing semantic search for textual data in web applications
title_short Implementing semantic search for textual data in web applications
title_full Implementing semantic search for textual data in web applications
title_fullStr Implementing semantic search for textual data in web applications
title_full_unstemmed Implementing semantic search for textual data in web applications
title_sort implementing semantic search for textual data in web applications
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/177123
_version_ 1806059826642419712