Swarm based features selection for text summarization

The features are the main entries in text summarization. Treating all features equally causes poor summary generation. In this paper, we investigate the effect of the feature structure on the features selection using particle swarm optimization. The particle swarm optimization is trained using DUC...

Full description

Saved in:
Bibliographic Details
Main Authors: Binwahlan, Mohammed Salem, Salim, Naomie, Suanmali, Ladda
Format: Article
Language:English
Published: International Journal of Computer Science and Network Security 2009
Subjects:
Online Access:http://eprints.utm.my/id/eprint/11825/1/NaomieSalim2009_SwarmBasedFeaturesSelectionFor.pdf
http://eprints.utm.my/id/eprint/11825/
http://paper.ijcsns.org/07_book/200901/20090125.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
Language: English
id my.utm.11825
record_format eprints
spelling my.utm.118252011-01-19T12:29:31Z http://eprints.utm.my/id/eprint/11825/ Swarm based features selection for text summarization Binwahlan, Mohammed Salem Salim, Naomie Suanmali, Ladda QA75 Electronic computers. Computer science QA76 Computer software The features are the main entries in text summarization. Treating all features equally causes poor summary generation. In this paper, we investigate the effect of the feature structure on the features selection using particle swarm optimization. The particle swarm optimization is trained using DUC 2002 data to learn the weight of each feature. The features used are different in terms of the structure, where some features were formed as combination of more than one feature while others as simple or individual feature. Therefore the determining of the effectiveness of each type of features could lead to mechanism to differentiate between the features having high importance and those having low importance. We assume that the combined features have higher priority of getting selection more than the simple features. In each iteration, the particle swarm optimization selects some features, then corresponding weights of those features are used to score the sentences and the top ranking sentences are selected as summary. The selected features of each best summary are used in calculation of the final features weights. The experimental results shown that the simple features are less effective than the combined features International Journal of Computer Science and Network Security 2009-01 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/11825/1/NaomieSalim2009_SwarmBasedFeaturesSelectionFor.pdf Binwahlan, Mohammed Salem and Salim, Naomie and Suanmali, Ladda (2009) Swarm based features selection for text summarization. International Journal of Computer Science and Network Security, 9 (1). pp. 175-179. ISSN 1738-7906 http://paper.ijcsns.org/07_book/200901/20090125.pdf
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
QA76 Computer software
spellingShingle QA75 Electronic computers. Computer science
QA76 Computer software
Binwahlan, Mohammed Salem
Salim, Naomie
Suanmali, Ladda
Swarm based features selection for text summarization
description The features are the main entries in text summarization. Treating all features equally causes poor summary generation. In this paper, we investigate the effect of the feature structure on the features selection using particle swarm optimization. The particle swarm optimization is trained using DUC 2002 data to learn the weight of each feature. The features used are different in terms of the structure, where some features were formed as combination of more than one feature while others as simple or individual feature. Therefore the determining of the effectiveness of each type of features could lead to mechanism to differentiate between the features having high importance and those having low importance. We assume that the combined features have higher priority of getting selection more than the simple features. In each iteration, the particle swarm optimization selects some features, then corresponding weights of those features are used to score the sentences and the top ranking sentences are selected as summary. The selected features of each best summary are used in calculation of the final features weights. The experimental results shown that the simple features are less effective than the combined features
format Article
author Binwahlan, Mohammed Salem
Salim, Naomie
Suanmali, Ladda
author_facet Binwahlan, Mohammed Salem
Salim, Naomie
Suanmali, Ladda
author_sort Binwahlan, Mohammed Salem
title Swarm based features selection for text summarization
title_short Swarm based features selection for text summarization
title_full Swarm based features selection for text summarization
title_fullStr Swarm based features selection for text summarization
title_full_unstemmed Swarm based features selection for text summarization
title_sort swarm based features selection for text summarization
publisher International Journal of Computer Science and Network Security
publishDate 2009
url http://eprints.utm.my/id/eprint/11825/1/NaomieSalim2009_SwarmBasedFeaturesSelectionFor.pdf
http://eprints.utm.my/id/eprint/11825/
http://paper.ijcsns.org/07_book/200901/20090125.pdf
_version_ 1643645785507627008