Topic extraction from microblog posts using conversation structures

Conventional topic models are ineffective for topic extraction from microblog messages since the lack of structure and context among the posts renders poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relat...

Full description

Saved in:
Bibliographic Details
Main Authors: LI, Jing, LIAO, Ming, GAO, Wei, HE, Yulan, WONG, Kam-Fai
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4567
https://ink.library.smu.edu.sg/context/sis_research/article/5570/viewcontent/P16_1199.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5570
record_format dspace
spelling sg-smu-ink.sis_research-55702019-12-26T08:22:01Z Topic extraction from microblog posts using conversation structures LI, Jing LIAO, Ming GAO, Wei HE, Yulan WONG, Kam-Fai Conventional topic models are ineffective for topic extraction from microblog messages since the lack of structure and context among the posts renders poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words according to topic dependencies derived from the conversation structures. In specific, we differentiate messages as leader messages, which initiate key aspects of previously focused topics or shift the focus to different topics, and follower messages that do not introduce any new information but simply echo topics from the messages that they repost or reply. Our model captures the different extents that leader and follower messages may contain the key topical words, thus further enhances the quality of the induced topics. The results of thorough experiments demonstrate the effectiveness of our proposed model. 2016-08-12T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4567 info:doi/10.18653/v1/P16-1199 https://ink.library.smu.edu.sg/context/sis_research/article/5570/viewcontent/P16_1199.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
LI, Jing
LIAO, Ming
GAO, Wei
HE, Yulan
WONG, Kam-Fai
Topic extraction from microblog posts using conversation structures
description Conventional topic models are ineffective for topic extraction from microblog messages since the lack of structure and context among the posts renders poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words according to topic dependencies derived from the conversation structures. In specific, we differentiate messages as leader messages, which initiate key aspects of previously focused topics or shift the focus to different topics, and follower messages that do not introduce any new information but simply echo topics from the messages that they repost or reply. Our model captures the different extents that leader and follower messages may contain the key topical words, thus further enhances the quality of the induced topics. The results of thorough experiments demonstrate the effectiveness of our proposed model.
format text
author LI, Jing
LIAO, Ming
GAO, Wei
HE, Yulan
WONG, Kam-Fai
author_facet LI, Jing
LIAO, Ming
GAO, Wei
HE, Yulan
WONG, Kam-Fai
author_sort LI, Jing
title Topic extraction from microblog posts using conversation structures
title_short Topic extraction from microblog posts using conversation structures
title_full Topic extraction from microblog posts using conversation structures
title_fullStr Topic extraction from microblog posts using conversation structures
title_full_unstemmed Topic extraction from microblog posts using conversation structures
title_sort topic extraction from microblog posts using conversation structures
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/4567
https://ink.library.smu.edu.sg/context/sis_research/article/5570/viewcontent/P16_1199.pdf
_version_ 1770574916342513664