Utilizing microblogs for improving automatic news high-lights extraction

Story highlights form a succinct single-document summary consisting of 3-4 highlight sentences that reflect the gist of a news article. Automatically producing news highlights is very challenging. We propose a novel method to improve news highlights extraction by using microblogs. The hypothesis is...

Full description

Saved in:
Bibliographic Details
Main Authors: WEI, Zhongyu, GAO, Wei
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2014
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4580
https://ink.library.smu.edu.sg/context/sis_research/article/5583/viewcontent/C14_1083.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5583
record_format dspace
spelling sg-smu-ink.sis_research-55832019-12-26T08:07:01Z Utilizing microblogs for improving automatic news high-lights extraction WEI, Zhongyu GAO, Wei Story highlights form a succinct single-document summary consisting of 3-4 highlight sentences that reflect the gist of a news article. Automatically producing news highlights is very challenging. We propose a novel method to improve news highlights extraction by using microblogs. The hypothesis is that microblog posts, although noisy, are not only indicative of important pieces of information in the news story, but also inherently “short and sweet” resulting from the artificial compression effect due to the length limit. Given a news article, we formulate the problem as two rank-then-extract tasks: (1) we find a set of indicative tweets and use them to assist the ranking of news sentences for extraction; (2) we extract top ranked tweets as a substitute of sentence extraction. Results based on our news-tweets pairing corpus indicate that the method significantly outperform some strong baselines for single-document summarization. 2014-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4580 https://ink.library.smu.edu.sg/context/sis_research/article/5583/viewcontent/C14_1083.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
WEI, Zhongyu
GAO, Wei
Utilizing microblogs for improving automatic news high-lights extraction
description Story highlights form a succinct single-document summary consisting of 3-4 highlight sentences that reflect the gist of a news article. Automatically producing news highlights is very challenging. We propose a novel method to improve news highlights extraction by using microblogs. The hypothesis is that microblog posts, although noisy, are not only indicative of important pieces of information in the news story, but also inherently “short and sweet” resulting from the artificial compression effect due to the length limit. Given a news article, we formulate the problem as two rank-then-extract tasks: (1) we find a set of indicative tweets and use them to assist the ranking of news sentences for extraction; (2) we extract top ranked tweets as a substitute of sentence extraction. Results based on our news-tweets pairing corpus indicate that the method significantly outperform some strong baselines for single-document summarization.
format text
author WEI, Zhongyu
GAO, Wei
author_facet WEI, Zhongyu
GAO, Wei
author_sort WEI, Zhongyu
title Utilizing microblogs for improving automatic news high-lights extraction
title_short Utilizing microblogs for improving automatic news high-lights extraction
title_full Utilizing microblogs for improving automatic news high-lights extraction
title_fullStr Utilizing microblogs for improving automatic news high-lights extraction
title_full_unstemmed Utilizing microblogs for improving automatic news high-lights extraction
title_sort utilizing microblogs for improving automatic news high-lights extraction
publisher Institutional Knowledge at Singapore Management University
publishDate 2014
url https://ink.library.smu.edu.sg/sis_research/4580
https://ink.library.smu.edu.sg/context/sis_research/article/5583/viewcontent/C14_1083.pdf
_version_ 1770574919491387392