MMI diversity based text summarization

The search for interesting information in a huge data collection is a tough job frustrating the seekers for that information. The automatic text summarization has come to facilitate such searching process. The selection of distinct ideas “diversity” from the original document can produce an approp...

Full description

Saved in:
Bibliographic Details
Main Authors: Binwahlan, Mohammed Salem, Salim, Naomie, Suanmali, Ladda
Format: Article
Language:English
Published: Computer Science Journals 2009
Subjects:
Online Access:http://eprints.utm.my/id/eprint/11826/1/NaomieSalim2009_MMIDiversityBasedTextSummarization.pdf
http://eprints.utm.my/id/eprint/11826/
http://www.cscjournals.org/csc/manuscript/Journals/IJCSS/volume3/Issue1/IJCSS-55.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
Language: English
id my.utm.11826
record_format eprints
spelling my.utm.118262011-01-19T12:29:23Z http://eprints.utm.my/id/eprint/11826/ MMI diversity based text summarization Binwahlan, Mohammed Salem Salim, Naomie Suanmali, Ladda QA75 Electronic computers. Computer science T Technology (General) The search for interesting information in a huge data collection is a tough job frustrating the seekers for that information. The automatic text summarization has come to facilitate such searching process. The selection of distinct ideas “diversity” from the original document can produce an appropriate summary. Incorporating of multiple means can help to find the diversity in the text. In this paper, we propose approach for text summarization, in which three evidences are employed (clustering, binary tree and diversity based method) to help in finding the document distinct ideas. The emphasis of our approach is on controlling the redundancy in the summarized text. The role of clustering is very important, where some clustering algorithms perform better than others. Therefore we conducted an experiment for comparing two clustering algorithms (K-means and complete linkage clustering algorithms) based on the performance of our method, the results shown that k-means performs better than complete linkage. In general, the experimental results shown that our method performs well for text summarization comparing with the benchmark methods used in this study Computer Science Journals 2009 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/11826/1/NaomieSalim2009_MMIDiversityBasedTextSummarization.pdf Binwahlan, Mohammed Salem and Salim, Naomie and Suanmali, Ladda (2009) MMI diversity based text summarization. International Journal of Computer Science and Security, 3 (1). pp. 23-33. ISSN 1985-1553 http://www.cscjournals.org/csc/manuscript/Journals/IJCSS/volume3/Issue1/IJCSS-55.pdf
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
T Technology (General)
spellingShingle QA75 Electronic computers. Computer science
T Technology (General)
Binwahlan, Mohammed Salem
Salim, Naomie
Suanmali, Ladda
MMI diversity based text summarization
description The search for interesting information in a huge data collection is a tough job frustrating the seekers for that information. The automatic text summarization has come to facilitate such searching process. The selection of distinct ideas “diversity” from the original document can produce an appropriate summary. Incorporating of multiple means can help to find the diversity in the text. In this paper, we propose approach for text summarization, in which three evidences are employed (clustering, binary tree and diversity based method) to help in finding the document distinct ideas. The emphasis of our approach is on controlling the redundancy in the summarized text. The role of clustering is very important, where some clustering algorithms perform better than others. Therefore we conducted an experiment for comparing two clustering algorithms (K-means and complete linkage clustering algorithms) based on the performance of our method, the results shown that k-means performs better than complete linkage. In general, the experimental results shown that our method performs well for text summarization comparing with the benchmark methods used in this study
format Article
author Binwahlan, Mohammed Salem
Salim, Naomie
Suanmali, Ladda
author_facet Binwahlan, Mohammed Salem
Salim, Naomie
Suanmali, Ladda
author_sort Binwahlan, Mohammed Salem
title MMI diversity based text summarization
title_short MMI diversity based text summarization
title_full MMI diversity based text summarization
title_fullStr MMI diversity based text summarization
title_full_unstemmed MMI diversity based text summarization
title_sort mmi diversity based text summarization
publisher Computer Science Journals
publishDate 2009
url http://eprints.utm.my/id/eprint/11826/1/NaomieSalim2009_MMIDiversityBasedTextSummarization.pdf
http://eprints.utm.my/id/eprint/11826/
http://www.cscjournals.org/csc/manuscript/Journals/IJCSS/volume3/Issue1/IJCSS-55.pdf
_version_ 1643645785800179712