Keyword extraction using backpropagation neural networks and rule extraction

Keyword extraction is vital for Knowledge Management System, Information Retrieval System, and Digital Libraries as well as for general browsing of the web. Keywords are often the basis of document processing methods such as clustering and retrieval since processing all the words in the document can...

Full description

Saved in:
Bibliographic Details
Main Authors: Azcarraga, Arnulfo P., Liu, Michael David S., Setiono, Rudy
Format: text
Published: Animo Repository 2012
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/2097
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-3096
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-30962022-12-17T02:58:55Z Keyword extraction using backpropagation neural networks and rule extraction Azcarraga, Arnulfo P. Liu, Michael David S. Setiono, Rudy Keyword extraction is vital for Knowledge Management System, Information Retrieval System, and Digital Libraries as well as for general browsing of the web. Keywords are often the basis of document processing methods such as clustering and retrieval since processing all the words in the document can be slow. Common models for automating the process of keyword extraction are usually done by using several statistics-based methods such as Bayesian, K-Nearest Neighbor, and Expectation-Maximization. These models are limited by word-related features that can be used since adding more features will make the models more complex and difficult to comprehend. In this research, a Neural Network, specifically a backpropagation network, will be used in generalizing the relationship of the title and the content of articles in the archive by following word features other than TF-IDF, such as position of word in the sentence, paragraph, or in the entire document, and formats such as heading, and other attributes defined beforehand. In order to explain how the backpropagation network works, a rule extraction method will be used to extract symbolic data from the resulting backpropagation network. The rules extracted can then be transformed into decision trees performing almost as accurate as the network plus the benefit of being in an easily comprehensible format. © 2012 IEEE. 2012-08-22T07:00:00Z text text/html https://animorepository.dlsu.edu.ph/faculty_research/2097 Faculty Research Work Animo Repository Automatic indexing Text processing (Computer science) Back propagation (Artificial intelligence)
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Automatic indexing
Text processing (Computer science)
Back propagation (Artificial intelligence)
spellingShingle Automatic indexing
Text processing (Computer science)
Back propagation (Artificial intelligence)
Azcarraga, Arnulfo P.
Liu, Michael David S.
Setiono, Rudy
Keyword extraction using backpropagation neural networks and rule extraction
description Keyword extraction is vital for Knowledge Management System, Information Retrieval System, and Digital Libraries as well as for general browsing of the web. Keywords are often the basis of document processing methods such as clustering and retrieval since processing all the words in the document can be slow. Common models for automating the process of keyword extraction are usually done by using several statistics-based methods such as Bayesian, K-Nearest Neighbor, and Expectation-Maximization. These models are limited by word-related features that can be used since adding more features will make the models more complex and difficult to comprehend. In this research, a Neural Network, specifically a backpropagation network, will be used in generalizing the relationship of the title and the content of articles in the archive by following word features other than TF-IDF, such as position of word in the sentence, paragraph, or in the entire document, and formats such as heading, and other attributes defined beforehand. In order to explain how the backpropagation network works, a rule extraction method will be used to extract symbolic data from the resulting backpropagation network. The rules extracted can then be transformed into decision trees performing almost as accurate as the network plus the benefit of being in an easily comprehensible format. © 2012 IEEE.
format text
author Azcarraga, Arnulfo P.
Liu, Michael David S.
Setiono, Rudy
author_facet Azcarraga, Arnulfo P.
Liu, Michael David S.
Setiono, Rudy
author_sort Azcarraga, Arnulfo P.
title Keyword extraction using backpropagation neural networks and rule extraction
title_short Keyword extraction using backpropagation neural networks and rule extraction
title_full Keyword extraction using backpropagation neural networks and rule extraction
title_fullStr Keyword extraction using backpropagation neural networks and rule extraction
title_full_unstemmed Keyword extraction using backpropagation neural networks and rule extraction
title_sort keyword extraction using backpropagation neural networks and rule extraction
publisher Animo Repository
publishDate 2012
url https://animorepository.dlsu.edu.ph/faculty_research/2097
_version_ 1753806429810065408