Finding Bursty Topics From Microblogs
Microblogs such as Twitter reflect the general public’s reactions to major events. Bursty topics from microblogs reveal what events have attracted the most online attention. Although bursty event detection from text streams has been studied before, previous work may not be suitable for microblogs be...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2012
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/1547 https://ink.library.smu.edu.sg/context/sis_research/article/2546/viewcontent/P12_1056.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-2546 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-25462016-04-16T03:32:06Z Finding Bursty Topics From Microblogs DIAO, Qiming JIANG, Jing ZHU, Feida LIM, Ee Peng Microblogs such as Twitter reflect the general public’s reactions to major events. Bursty topics from microblogs reveal what events have attracted the most online attention. Although bursty event detection from text streams has been studied before, previous work may not be suitable for microblogs because compared with other text streams such as news articles and scientific publications, microblog posts are particularly diverse and noisy. To find topics that have bursty patterns on microblogs, we propose a topic model that simultaneousy captures two observations: (1) posts published around the same time are more likely to have the same topic, and (2) posts published by the same user are more likely to have the same topic. The former helps find event-driven posts while the latter helps identify and filter out “personal” posts. Our experiments on a large Twitter dataset show that there are more meaningful and unique bursty topics in the top-ranked results returned by our model than an LDA baseline and two degenerate variations of our model. We also show some case studies that demonstrate the importance of considering both the temporal information and users’ personal interests for bursty topic detection from microblogs. 2012-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1547 https://ink.library.smu.edu.sg/context/sis_research/article/2546/viewcontent/P12_1056.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Numerical Analysis and Scientific Computing Social Media |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems Numerical Analysis and Scientific Computing Social Media |
spellingShingle |
Databases and Information Systems Numerical Analysis and Scientific Computing Social Media DIAO, Qiming JIANG, Jing ZHU, Feida LIM, Ee Peng Finding Bursty Topics From Microblogs |
description |
Microblogs such as Twitter reflect the general public’s reactions to major events. Bursty topics from microblogs reveal what events have attracted the most online attention. Although bursty event detection from text streams has been studied before, previous work may not be suitable for microblogs because compared with other text streams such as news articles and scientific publications, microblog posts are particularly diverse and noisy. To find topics that have bursty patterns on microblogs, we propose a topic model that simultaneousy captures two observations: (1) posts published around the same time are more likely to have the same topic, and (2) posts published by the same user are more likely to have the same topic. The former helps find event-driven posts while the latter helps identify and filter out “personal” posts. Our experiments on a large Twitter dataset show that there are more meaningful and unique bursty topics in the top-ranked results returned by our model than an LDA baseline and two degenerate variations of our model. We also show some case studies that demonstrate the importance of considering both the temporal information and users’ personal interests for bursty topic detection from microblogs. |
format |
text |
author |
DIAO, Qiming JIANG, Jing ZHU, Feida LIM, Ee Peng |
author_facet |
DIAO, Qiming JIANG, Jing ZHU, Feida LIM, Ee Peng |
author_sort |
DIAO, Qiming |
title |
Finding Bursty Topics From Microblogs |
title_short |
Finding Bursty Topics From Microblogs |
title_full |
Finding Bursty Topics From Microblogs |
title_fullStr |
Finding Bursty Topics From Microblogs |
title_full_unstemmed |
Finding Bursty Topics From Microblogs |
title_sort |
finding bursty topics from microblogs |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2012 |
url |
https://ink.library.smu.edu.sg/sis_research/1547 https://ink.library.smu.edu.sg/context/sis_research/article/2546/viewcontent/P12_1056.pdf |
_version_ |
1770571296209371136 |