Utilizing microblogs for improving automatic news high-lights extraction
Story highlights form a succinct single-document summary consisting of 3-4 highlight sentences that reflect the gist of a news article. Automatically producing news highlights is very challenging. We propose a novel method to improve news highlights extraction by using microblogs. The hypothesis is...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2014
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4580 https://ink.library.smu.edu.sg/context/sis_research/article/5583/viewcontent/C14_1083.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-5583 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-55832019-12-26T08:07:01Z Utilizing microblogs for improving automatic news high-lights extraction WEI, Zhongyu GAO, Wei Story highlights form a succinct single-document summary consisting of 3-4 highlight sentences that reflect the gist of a news article. Automatically producing news highlights is very challenging. We propose a novel method to improve news highlights extraction by using microblogs. The hypothesis is that microblog posts, although noisy, are not only indicative of important pieces of information in the news story, but also inherently “short and sweet” resulting from the artificial compression effect due to the length limit. Given a news article, we formulate the problem as two rank-then-extract tasks: (1) we find a set of indicative tweets and use them to assist the ranking of news sentences for extraction; (2) we extract top ranked tweets as a substitute of sentence extraction. Results based on our news-tweets pairing corpus indicate that the method significantly outperform some strong baselines for single-document summarization. 2014-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4580 https://ink.library.smu.edu.sg/context/sis_research/article/5583/viewcontent/C14_1083.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems |
spellingShingle |
Databases and Information Systems WEI, Zhongyu GAO, Wei Utilizing microblogs for improving automatic news high-lights extraction |
description |
Story highlights form a succinct single-document summary consisting of 3-4 highlight sentences that reflect the gist of a news article. Automatically producing news highlights is very challenging. We propose a novel method to improve news highlights extraction by using microblogs. The hypothesis is that microblog posts, although noisy, are not only indicative of important pieces of information in the news story, but also inherently “short and sweet” resulting from the artificial compression effect due to the length limit. Given a news article, we formulate the problem as two rank-then-extract tasks: (1) we find a set of indicative tweets and use them to assist the ranking of news sentences for extraction; (2) we extract top ranked tweets as a substitute of sentence extraction. Results based on our news-tweets pairing corpus indicate that the method significantly outperform some strong baselines for single-document summarization. |
format |
text |
author |
WEI, Zhongyu GAO, Wei |
author_facet |
WEI, Zhongyu GAO, Wei |
author_sort |
WEI, Zhongyu |
title |
Utilizing microblogs for improving automatic news high-lights extraction |
title_short |
Utilizing microblogs for improving automatic news high-lights extraction |
title_full |
Utilizing microblogs for improving automatic news high-lights extraction |
title_fullStr |
Utilizing microblogs for improving automatic news high-lights extraction |
title_full_unstemmed |
Utilizing microblogs for improving automatic news high-lights extraction |
title_sort |
utilizing microblogs for improving automatic news high-lights extraction |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2014 |
url |
https://ink.library.smu.edu.sg/sis_research/4580 https://ink.library.smu.edu.sg/context/sis_research/article/5583/viewcontent/C14_1083.pdf |
_version_ |
1770574919491387392 |