Enhancement of a natural language processing(NLP) based search engine
Search engine is also known as information retrieval. A Web search engine is a tool designed to search for information on the World Wide Web. As the popularization of internet, search engine has already played an indispensable role in people’s everyday life. “If you don’t know, just Google it.” has...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/17888 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-17888 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-178882023-07-07T16:00:38Z Enhancement of a natural language processing(NLP) based search engine Wang, Kun. Chan Chee Keong School of Electrical and Electronic Engineering DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval Search engine is also known as information retrieval. A Web search engine is a tool designed to search for information on the World Wide Web. As the popularization of internet, search engine has already played an indispensable role in people’s everyday life. “If you don’t know, just Google it.” has become conventional words. However, traditional search engines, such as Google, provide either simple string matching or statistical processing of text for its search items. Because of language ambiguity, search engine users are often shown with a bunch of irrelevant documents and have to spend a lot of time to filter away those links and get links relevant to their query. To improve this, NLP (Natural Language Processing) technique can be utilized as a design tool. This project’s objective is to develop a powerful and accurate search input parser based on NLP techniques. By analyzing and parsing the search input, ambiguity is reduced, which in response makes the search results more accurate. Two kinds of parsers are implemented, which are Finite State Machine (FSM) Parser and Phrase Structure Grammar (PSG) Parser. FSM parser implementation is grounded on the assumption that the language is finite. PSG parser implementation involves in three phases. The first phase covers the basic design and skeleton, applying grammar rule. The second phase depicts factors such as unknown words, agreement etc. The third phase is related to interrogative sentence structure, verb category and final improvement on the parser. Besides, a parser user interface is designed to perform user-friendly implementation of the parser. Bachelor of Engineering 2009-06-17T06:13:36Z 2009-06-17T06:13:36Z 2009 2009 Final Year Project (FYP) http://hdl.handle.net/10356/17888 en Nanyang Technological University 148 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval Wang, Kun. Enhancement of a natural language processing(NLP) based search engine |
description |
Search engine is also known as information retrieval. A Web search engine is a tool designed to search for information on the World Wide Web. As the popularization of internet, search engine has already played an indispensable role in people’s everyday life. “If you don’t know, just Google it.” has become conventional words. However, traditional search engines, such as Google, provide either simple string matching or statistical processing of text for its search items. Because of language ambiguity, search engine users are often shown with a bunch of irrelevant documents and have to spend a lot of time to filter away those links and get links relevant to their query.
To improve this, NLP (Natural Language Processing) technique can be utilized as a design tool. This project’s objective is to develop a powerful and accurate search input parser based on NLP techniques. By analyzing and parsing the search input, ambiguity is reduced, which in response makes the search results more accurate.
Two kinds of parsers are implemented, which are Finite State Machine (FSM) Parser and Phrase Structure Grammar (PSG) Parser. FSM parser implementation is grounded on the assumption that the language is finite. PSG parser implementation involves in three phases. The first phase covers the basic design and skeleton, applying grammar rule. The second phase depicts factors such as unknown words, agreement etc. The third phase is related to interrogative sentence structure, verb category and final improvement on the parser. Besides, a parser user interface is designed to perform user-friendly implementation of the parser. |
author2 |
Chan Chee Keong |
author_facet |
Chan Chee Keong Wang, Kun. |
format |
Final Year Project |
author |
Wang, Kun. |
author_sort |
Wang, Kun. |
title |
Enhancement of a natural language processing(NLP) based search engine |
title_short |
Enhancement of a natural language processing(NLP) based search engine |
title_full |
Enhancement of a natural language processing(NLP) based search engine |
title_fullStr |
Enhancement of a natural language processing(NLP) based search engine |
title_full_unstemmed |
Enhancement of a natural language processing(NLP) based search engine |
title_sort |
enhancement of a natural language processing(nlp) based search engine |
publishDate |
2009 |
url |
http://hdl.handle.net/10356/17888 |
_version_ |
1772827696160571392 |