Keyword extraction using a back propagation network and rule extraction

Keyword extraction is vital for Knowledge Management Systems, Information Re- trieval Systems, and Digital Libraries as well as for general browsing of the web. Keywords are often the basis of document processing methods such as clustering and retrieval since processing all the words in the document...

Full description

Saved in:

Bibliographic Details
Main Author:	Liu, Michael David S.
Format:	text
Language:	English
Published:	Animo Repository 2010
Subjects:	Back propagation Keyword searching Computer Sciences
Online Access:	https://animorepository.dlsu.edu.ph/etd_masteral/4007 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/10845/viewcontent/CDTG004916_P.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	De La Salle University
Language:	English

id	oai:animorepository.dlsu.edu.ph:etd_masteral-10845
record_format	eprints
spelling	oai:animorepository.dlsu.edu.ph:etd_masteral-108452022-12-17T03:33:57Z Keyword extraction using a back propagation network and rule extraction Liu, Michael David S. Keyword extraction is vital for Knowledge Management Systems, Information Re- trieval Systems, and Digital Libraries as well as for general browsing of the web. Keywords are often the basis of document processing methods such as clustering and retrieval since processing all the words in the document can be slow. Common models for automating the process of keyword extraction are usually done by using several statistics-based methods such as Bayesian, K-Nearest Neighbour, and Expectation-Maximization. These models are limited by word-related features that can be used since adding more features will make the models more complex and difficult to comprehend. In this research, a Neural Network, specifically a backpropagation network, will be used in generalizing the relationship of the title and the content of articles in the archive by following word features other than TF-IDF, such as position of word in the sentence, paragraph, or in the entire document, and formats such as heading, and other attributes defined beforehand. In order to explain how the backpropagation network works, a rule extraction method will be used to extract symbolic data from the resulting backpropagation network. The rules extracted can then be transformed into decision trees per- forming almost as accurate as the network plus the benefit of being in an easily comprehensible format. 2010-12-18T08:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etd_masteral/4007 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/10845/viewcontent/CDTG004916_P.pdf Master's Theses English Animo Repository Back propagation Keyword searching Computer Sciences
institution	De La Salle University
building	De La Salle University Library
continent	Asia
country	Philippines Philippines
content_provider	De La Salle University Library
collection	DLSU Institutional Repository
language	English
topic	Back propagation Keyword searching Computer Sciences
spellingShingle	Back propagation Keyword searching Computer Sciences Liu, Michael David S. Keyword extraction using a back propagation network and rule extraction
description	Keyword extraction is vital for Knowledge Management Systems, Information Re- trieval Systems, and Digital Libraries as well as for general browsing of the web. Keywords are often the basis of document processing methods such as clustering and retrieval since processing all the words in the document can be slow. Common models for automating the process of keyword extraction are usually done by using several statistics-based methods such as Bayesian, K-Nearest Neighbour, and Expectation-Maximization. These models are limited by word-related features that can be used since adding more features will make the models more complex and difficult to comprehend. In this research, a Neural Network, specifically a backpropagation network, will be used in generalizing the relationship of the title and the content of articles in the archive by following word features other than TF-IDF, such as position of word in the sentence, paragraph, or in the entire document, and formats such as heading, and other attributes defined beforehand. In order to explain how the backpropagation network works, a rule extraction method will be used to extract symbolic data from the resulting backpropagation network. The rules extracted can then be transformed into decision trees per- forming almost as accurate as the network plus the benefit of being in an easily comprehensible format.
format	text
author	Liu, Michael David S.
author_facet	Liu, Michael David S.
author_sort	Liu, Michael David S.
title	Keyword extraction using a back propagation network and rule extraction
title_short	Keyword extraction using a back propagation network and rule extraction
title_full	Keyword extraction using a back propagation network and rule extraction
title_fullStr	Keyword extraction using a back propagation network and rule extraction
title_full_unstemmed	Keyword extraction using a back propagation network and rule extraction
title_sort	keyword extraction using a back propagation network and rule extraction
publisher	Animo Repository
publishDate	2010
url	https://animorepository.dlsu.edu.ph/etd_masteral/4007 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/10845/viewcontent/CDTG004916_P.pdf
_version_	1794553688379883520

Keyword extraction using a back propagation network and rule extraction

Similar Items