SumCR : a new subtopic-based extractive approach for text summarization

In text summarization, relevance and coverage are two main criteria that decide the quality of a summary. In this paper, we propose a new multi-document summarization approach SumCR via sentence extraction. A novel feature called Exemplar is introduced to help to simultaneously deal with these two c...

Full description

Saved in:

Bibliographic Details
Main Authors:	Mei, Jian-Ping, Chen, Lihui
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2013
Subjects:	DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Online Access:	https://hdl.handle.net/10356/98682 http://hdl.handle.net/10220/17549
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-98682
record_format	dspace
spelling	sg-ntu-dr.10356-986822020-03-07T14:02:35Z SumCR : a new subtopic-based extractive approach for text summarization Mei, Jian-Ping Chen, Lihui School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems In text summarization, relevance and coverage are two main criteria that decide the quality of a summary. In this paper, we propose a new multi-document summarization approach SumCR via sentence extraction. A novel feature called Exemplar is introduced to help to simultaneously deal with these two concerns during sentence ranking. Unlike conventional ways where the relevance value of each sentence is calculated based on the whole collection of sentences, the Exemplar value of each sentence in SumCR is obtained within a subset of similar sentences. A fuzzy medoid-based clustering approach is used to produce sentence clusters or subsets where each of them corresponds to a subtopic of the related topic. Such kind of subtopic-based feature captures the relevance of each sentence within different subtopics and thus enhances the chance of SumCR to produce a summary with a wider coverage and less redundancy. Another feature we incorporate in SumCR is Position, i.e., the position of each sentence appeared in the corresponding document. The final score of each sentence is a combination of the subtopic-level feature Exemplar and the document-level feature Position. Experimental studies on DUC benchmark data show the good performance of SumCR and its potential in summarization tasks. 2013-11-11T04:05:52Z 2019-12-06T19:58:25Z 2013-11-11T04:05:52Z 2019-12-06T19:58:25Z 2011 2011 Journal Article Mei, J. P., & Chen, L. (2012). SumCR: A new subtopic-based extractive approach for text summarization. Knowledge and Information Systems, 31(3), 527-545. https://hdl.handle.net/10356/98682 http://hdl.handle.net/10220/17549 10.1007/s10115-011-0437-x en Knowledge and information systems
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems Mei, Jian-Ping Chen, Lihui SumCR : a new subtopic-based extractive approach for text summarization
description	In text summarization, relevance and coverage are two main criteria that decide the quality of a summary. In this paper, we propose a new multi-document summarization approach SumCR via sentence extraction. A novel feature called Exemplar is introduced to help to simultaneously deal with these two concerns during sentence ranking. Unlike conventional ways where the relevance value of each sentence is calculated based on the whole collection of sentences, the Exemplar value of each sentence in SumCR is obtained within a subset of similar sentences. A fuzzy medoid-based clustering approach is used to produce sentence clusters or subsets where each of them corresponds to a subtopic of the related topic. Such kind of subtopic-based feature captures the relevance of each sentence within different subtopics and thus enhances the chance of SumCR to produce a summary with a wider coverage and less redundancy. Another feature we incorporate in SumCR is Position, i.e., the position of each sentence appeared in the corresponding document. The final score of each sentence is a combination of the subtopic-level feature Exemplar and the document-level feature Position. Experimental studies on DUC benchmark data show the good performance of SumCR and its potential in summarization tasks.
author2	School of Electrical and Electronic Engineering
author_facet	School of Electrical and Electronic Engineering Mei, Jian-Ping Chen, Lihui
format	Article
author	Mei, Jian-Ping Chen, Lihui
author_sort	Mei, Jian-Ping
title	SumCR : a new subtopic-based extractive approach for text summarization
title_short	SumCR : a new subtopic-based extractive approach for text summarization
title_full	SumCR : a new subtopic-based extractive approach for text summarization
title_fullStr	SumCR : a new subtopic-based extractive approach for text summarization
title_full_unstemmed	SumCR : a new subtopic-based extractive approach for text summarization
title_sort	sumcr : a new subtopic-based extractive approach for text summarization
publishDate	2013
url	https://hdl.handle.net/10356/98682 http://hdl.handle.net/10220/17549
_version_	1681045896277524480

SumCR : a new subtopic-based extractive approach for text summarization

Similar Items