Enhancement of a natural language processing(NLP) based search engine

Search engine is also known as information retrieval. A Web search engine is a tool designed to search for information on the World Wide Web. As the popularization of internet, search engine has already played an indispensable role in people’s everyday life. “If you don’t know, just Google it.” has...

Full description

Saved in:
Bibliographic Details
Main Author: Wang, Kun.
Other Authors: Chan Chee Keong
Format: Final Year Project
Language:English
Published: 2009
Subjects:
Online Access:http://hdl.handle.net/10356/17888
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-17888
record_format dspace
spelling sg-ntu-dr.10356-178882023-07-07T16:00:38Z Enhancement of a natural language processing(NLP) based search engine Wang, Kun. Chan Chee Keong School of Electrical and Electronic Engineering DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval Search engine is also known as information retrieval. A Web search engine is a tool designed to search for information on the World Wide Web. As the popularization of internet, search engine has already played an indispensable role in people’s everyday life. “If you don’t know, just Google it.” has become conventional words. However, traditional search engines, such as Google, provide either simple string matching or statistical processing of text for its search items. Because of language ambiguity, search engine users are often shown with a bunch of irrelevant documents and have to spend a lot of time to filter away those links and get links relevant to their query. To improve this, NLP (Natural Language Processing) technique can be utilized as a design tool. This project’s objective is to develop a powerful and accurate search input parser based on NLP techniques. By analyzing and parsing the search input, ambiguity is reduced, which in response makes the search results more accurate. Two kinds of parsers are implemented, which are Finite State Machine (FSM) Parser and Phrase Structure Grammar (PSG) Parser. FSM parser implementation is grounded on the assumption that the language is finite. PSG parser implementation involves in three phases. The first phase covers the basic design and skeleton, applying grammar rule. The second phase depicts factors such as unknown words, agreement etc. The third phase is related to interrogative sentence structure, verb category and final improvement on the parser. Besides, a parser user interface is designed to perform user-friendly implementation of the parser. Bachelor of Engineering 2009-06-17T06:13:36Z 2009-06-17T06:13:36Z 2009 2009 Final Year Project (FYP) http://hdl.handle.net/10356/17888 en Nanyang Technological University 148 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
spellingShingle DRNTU::Engineering::Computer science and engineering::Information systems::Information storage and retrieval
Wang, Kun.
Enhancement of a natural language processing(NLP) based search engine
description Search engine is also known as information retrieval. A Web search engine is a tool designed to search for information on the World Wide Web. As the popularization of internet, search engine has already played an indispensable role in people’s everyday life. “If you don’t know, just Google it.” has become conventional words. However, traditional search engines, such as Google, provide either simple string matching or statistical processing of text for its search items. Because of language ambiguity, search engine users are often shown with a bunch of irrelevant documents and have to spend a lot of time to filter away those links and get links relevant to their query. To improve this, NLP (Natural Language Processing) technique can be utilized as a design tool. This project’s objective is to develop a powerful and accurate search input parser based on NLP techniques. By analyzing and parsing the search input, ambiguity is reduced, which in response makes the search results more accurate. Two kinds of parsers are implemented, which are Finite State Machine (FSM) Parser and Phrase Structure Grammar (PSG) Parser. FSM parser implementation is grounded on the assumption that the language is finite. PSG parser implementation involves in three phases. The first phase covers the basic design and skeleton, applying grammar rule. The second phase depicts factors such as unknown words, agreement etc. The third phase is related to interrogative sentence structure, verb category and final improvement on the parser. Besides, a parser user interface is designed to perform user-friendly implementation of the parser.
author2 Chan Chee Keong
author_facet Chan Chee Keong
Wang, Kun.
format Final Year Project
author Wang, Kun.
author_sort Wang, Kun.
title Enhancement of a natural language processing(NLP) based search engine
title_short Enhancement of a natural language processing(NLP) based search engine
title_full Enhancement of a natural language processing(NLP) based search engine
title_fullStr Enhancement of a natural language processing(NLP) based search engine
title_full_unstemmed Enhancement of a natural language processing(NLP) based search engine
title_sort enhancement of a natural language processing(nlp) based search engine
publishDate 2009
url http://hdl.handle.net/10356/17888
_version_ 1772827696160571392