Using content-level structures for summarizing microblog repost trees
A microblog repost tree provides strong clues on how an event described therein develops. To help social media users capture the main clues of events on microblogging sites, we propose a novel repost tree summarization framework by effectively differentiating two kinds of messages on repost trees ca...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2015
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6795 https://ink.library.smu.edu.sg/context/sis_research/article/7798/viewcontent/Using_Content_level_Structures_for_Summarizing_Microblog_Repost_Trees.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7798 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-77982022-01-27T09:57:26Z Using content-level structures for summarizing microblog repost trees LI, Jing GAO, Wei WEI, Zhongyu PENG, Baolin WONG, Kam-Fai A microblog repost tree provides strong clues on how an event described therein develops. To help social media users capture the main clues of events on microblogging sites, we propose a novel repost tree summarization framework by effectively differentiating two kinds of messages on repost trees called leaders and followers, which are derived from contentlevel structure information, i.e., contents of messages and the reposting relations. To this end, Conditional Random Fields (CRF) model is used to detect leaders across repost tree paths. We then present a variant of random-walk-based summarization model to rank and select salient messages based on the result of leader detection. To reduce the error propagation cascaded from leader detection, we improve the framework by enhancing the random walk with adjustment steps for sampling from leader probabilities given all the reposting messages. For evaluation, we construct two annotated corpora, one for leader detection, and the other for repost tree summarization. Experimental results confirm the effectiveness of our method. 2015-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6795 info:doi/10.18653/v1/D15-1259 https://ink.library.smu.edu.sg/context/sis_research/article/7798/viewcontent/Using_Content_level_Structures_for_Summarizing_Microblog_Repost_Trees.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems |
spellingShingle |
Databases and Information Systems LI, Jing GAO, Wei WEI, Zhongyu PENG, Baolin WONG, Kam-Fai Using content-level structures for summarizing microblog repost trees |
description |
A microblog repost tree provides strong clues on how an event described therein develops. To help social media users capture the main clues of events on microblogging sites, we propose a novel repost tree summarization framework by effectively differentiating two kinds of messages on repost trees called leaders and followers, which are derived from contentlevel structure information, i.e., contents of messages and the reposting relations. To this end, Conditional Random Fields (CRF) model is used to detect leaders across repost tree paths. We then present a variant of random-walk-based summarization model to rank and select salient messages based on the result of leader detection. To reduce the error propagation cascaded from leader detection, we improve the framework by enhancing the random walk with adjustment steps for sampling from leader probabilities given all the reposting messages. For evaluation, we construct two annotated corpora, one for leader detection, and the other for repost tree summarization. Experimental results confirm the effectiveness of our method. |
format |
text |
author |
LI, Jing GAO, Wei WEI, Zhongyu PENG, Baolin WONG, Kam-Fai |
author_facet |
LI, Jing GAO, Wei WEI, Zhongyu PENG, Baolin WONG, Kam-Fai |
author_sort |
LI, Jing |
title |
Using content-level structures for summarizing microblog repost trees |
title_short |
Using content-level structures for summarizing microblog repost trees |
title_full |
Using content-level structures for summarizing microblog repost trees |
title_fullStr |
Using content-level structures for summarizing microblog repost trees |
title_full_unstemmed |
Using content-level structures for summarizing microblog repost trees |
title_sort |
using content-level structures for summarizing microblog repost trees |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2015 |
url |
https://ink.library.smu.edu.sg/sis_research/6795 https://ink.library.smu.edu.sg/context/sis_research/article/7798/viewcontent/Using_Content_level_Structures_for_Summarizing_Microblog_Repost_Trees.pdf |
_version_ |
1770576070352830464 |