OFFICE DOCUMENT SEARCH ENGINE

In the current digital era, electronic document has become a common occurrence especially in office jobs. However, finding a specific document becomes increasingly difficult as the number of the stored document also increases. A search engine that is able to recognize, store, and perform queries for...

Full description

Saved in:
Bibliographic Details
Main Author: Ardian Wirasandi, Diki
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/39623
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:39623
spelling id-itb.:396232019-06-27T11:21:41ZOFFICE DOCUMENT SEARCH ENGINE Ardian Wirasandi, Diki Indonesia Final Project office documents, index, indexing process, metadata, query, query process, search engine INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/39623 In the current digital era, electronic document has become a common occurrence especially in office jobs. However, finding a specific document becomes increasingly difficult as the number of the stored document also increases. A search engine that is able to recognize, store, and perform queries for various document formats would be needed to solve that problem. How to recognize the office document structure or pattern as well as how to build an office document search engine with optimal query result are the main focus of this thesis. Document structure or pattern recognition is done by performing word extraction and utilizing the words position in the document itself. The office document search engine system that has been developed consists of two main processes, which are indexing process and query process. In indexing process, the documents are gathered and processed into indexes which will be stored by the system. Query process consists of receiving user’s search query, processing the query, and generating document ranks based on the search results as well as displaying them. Evaluation result shows the system that has been developed performs well. Furthermore, every functional requirement and non-functional requirement have been fulfilled. This shows the method used to recognize the document structure or pattern is suitable for office documents processing. Based on the analysis of the evaluation result, it can be concluded that the method and workflow applied in the search engine system are able to produce optimal search result. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description In the current digital era, electronic document has become a common occurrence especially in office jobs. However, finding a specific document becomes increasingly difficult as the number of the stored document also increases. A search engine that is able to recognize, store, and perform queries for various document formats would be needed to solve that problem. How to recognize the office document structure or pattern as well as how to build an office document search engine with optimal query result are the main focus of this thesis. Document structure or pattern recognition is done by performing word extraction and utilizing the words position in the document itself. The office document search engine system that has been developed consists of two main processes, which are indexing process and query process. In indexing process, the documents are gathered and processed into indexes which will be stored by the system. Query process consists of receiving user’s search query, processing the query, and generating document ranks based on the search results as well as displaying them. Evaluation result shows the system that has been developed performs well. Furthermore, every functional requirement and non-functional requirement have been fulfilled. This shows the method used to recognize the document structure or pattern is suitable for office documents processing. Based on the analysis of the evaluation result, it can be concluded that the method and workflow applied in the search engine system are able to produce optimal search result.
format Final Project
author Ardian Wirasandi, Diki
spellingShingle Ardian Wirasandi, Diki
OFFICE DOCUMENT SEARCH ENGINE
author_facet Ardian Wirasandi, Diki
author_sort Ardian Wirasandi, Diki
title OFFICE DOCUMENT SEARCH ENGINE
title_short OFFICE DOCUMENT SEARCH ENGINE
title_full OFFICE DOCUMENT SEARCH ENGINE
title_fullStr OFFICE DOCUMENT SEARCH ENGINE
title_full_unstemmed OFFICE DOCUMENT SEARCH ENGINE
title_sort office document search engine
url https://digilib.itb.ac.id/gdl/view/39623
_version_ 1822269311420465152