Using content-level structures for summarizing microblog repost trees

A microblog repost tree provides strong clues on how an event described therein develops. To help social media users capture the main clues of events on microblogging sites, we propose a novel repost tree summarization framework by effectively differentiating two kinds of messages on repost trees ca...

Full description

Saved in:
Bibliographic Details
Main Authors: LI, Jing, GAO, Wei, WEI, Zhongyu, PENG, Baolin, WONG, Kam-Fai
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2015
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4573
https://ink.library.smu.edu.sg/context/sis_research/article/5576/viewcontent/D15_1259.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5576
record_format dspace
spelling sg-smu-ink.sis_research-55762019-12-26T08:17:37Z Using content-level structures for summarizing microblog repost trees LI, Jing GAO, Wei WEI, Zhongyu PENG, Baolin WONG, Kam-Fai A microblog repost tree provides strong clues on how an event described therein develops. To help social media users capture the main clues of events on microblogging sites, we propose a novel repost tree summarization framework by effectively differentiating two kinds of messages on repost trees called leaders and followers, which are derived from contentlevel structure information, i.e., contents of messages and the reposting relations. To this end, Conditional Random Fields (CRF) model is used to detect leaders across repost tree paths. We then present a variant of random-walk-based summarization model to rank and select salient messages based on the result of leader detection. To reduce the error propagation cascaded from leader detection, we improve the framework by enhancing the random walk with adjustment steps for sampling from leader probabilities given all the reposting messages. For evaluation, we construct two annotated corpora, one for leader detection, and the other for repost tree summarization. Experimental results confirm the effectiveness of our method. 2015-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4573 info:doi/10.18653/v1/D15-1259 https://ink.library.smu.edu.sg/context/sis_research/article/5576/viewcontent/D15_1259.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
LI, Jing
GAO, Wei
WEI, Zhongyu
PENG, Baolin
WONG, Kam-Fai
Using content-level structures for summarizing microblog repost trees
description A microblog repost tree provides strong clues on how an event described therein develops. To help social media users capture the main clues of events on microblogging sites, we propose a novel repost tree summarization framework by effectively differentiating two kinds of messages on repost trees called leaders and followers, which are derived from contentlevel structure information, i.e., contents of messages and the reposting relations. To this end, Conditional Random Fields (CRF) model is used to detect leaders across repost tree paths. We then present a variant of random-walk-based summarization model to rank and select salient messages based on the result of leader detection. To reduce the error propagation cascaded from leader detection, we improve the framework by enhancing the random walk with adjustment steps for sampling from leader probabilities given all the reposting messages. For evaluation, we construct two annotated corpora, one for leader detection, and the other for repost tree summarization. Experimental results confirm the effectiveness of our method.
format text
author LI, Jing
GAO, Wei
WEI, Zhongyu
PENG, Baolin
WONG, Kam-Fai
author_facet LI, Jing
GAO, Wei
WEI, Zhongyu
PENG, Baolin
WONG, Kam-Fai
author_sort LI, Jing
title Using content-level structures for summarizing microblog repost trees
title_short Using content-level structures for summarizing microblog repost trees
title_full Using content-level structures for summarizing microblog repost trees
title_fullStr Using content-level structures for summarizing microblog repost trees
title_full_unstemmed Using content-level structures for summarizing microblog repost trees
title_sort using content-level structures for summarizing microblog repost trees
publisher Institutional Knowledge at Singapore Management University
publishDate 2015
url https://ink.library.smu.edu.sg/sis_research/4573
https://ink.library.smu.edu.sg/context/sis_research/article/5576/viewcontent/D15_1259.pdf
_version_ 1770574887255015424