TopicSketch: Real-time Bursty Topic Detection from Twitter
Twitter has become one of the largest platforms for users around the world to share anything happening around them with friends and beyond. A bursty topic in Twitter is one that triggers a surge of relevant tweets within a short time, which often reflects important events of mass interest. How to lev...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2013
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2115 https://ink.library.smu.edu.sg/context/sis_research/article/3114/viewcontent/TopicSketch_Real_timeBurstyTopicDetectionTwitter.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3114 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-31142018-12-07T06:44:15Z TopicSketch: Real-time Bursty Topic Detection from Twitter XIE, Wei ZHU, Feida JIANG, Jing LIM, Ee Peng WANG, Ke Twitter has become one of the largest platforms for users around the world to share anything happening around them with friends and beyond. A bursty topic in Twitter is one that triggers a surge of relevant tweets within a short time, which often reflects important events of mass interest. How to leverage Twitter for early detection of bursty topics has therefore become an important research problem with immense practical value. Despite the wealth of research work on topic modeling and analysis in Twitter, it remains a huge challenge to detect bursty topics in real-time. As existing methods can hardly scale to handle the task with the tweet stream in real-time, we propose in this paper TopicSketch, a novel sketch-based topic model together with a set of techniques to achieve real-time detection. We evaluate our solution on a tweet stream with over 30 million tweets. Our experiment results show both efficiency and effectiveness of our approach. Especially it is also demonstrated that TopicSketch can potentially handle hundreds of millions tweets per day which is close to the total number of daily tweets in Twitter and present bursty event in finer-granularity. 2013-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2115 info:doi/10.1109/ICDM.2013.86 https://ink.library.smu.edu.sg/context/sis_research/article/3114/viewcontent/TopicSketch_Real_timeBurstyTopicDetectionTwitter.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University TopicSketch tweet stream bursty topic realtime Databases and Information Systems Numerical Analysis and Scientific Computing Social Media |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
TopicSketch tweet stream bursty topic realtime Databases and Information Systems Numerical Analysis and Scientific Computing Social Media |
spellingShingle |
TopicSketch tweet stream bursty topic realtime Databases and Information Systems Numerical Analysis and Scientific Computing Social Media XIE, Wei ZHU, Feida JIANG, Jing LIM, Ee Peng WANG, Ke TopicSketch: Real-time Bursty Topic Detection from Twitter |
description |
Twitter has become one of the largest platforms for users around the world to share anything happening around them with friends and beyond. A bursty topic in Twitter is one that triggers a surge of relevant tweets within a short time, which often reflects important events of mass interest. How to leverage Twitter for early detection of bursty topics has therefore become an important research problem with immense practical value. Despite the wealth of research work on topic modeling and analysis in Twitter, it remains a huge challenge to detect bursty topics in real-time. As existing methods can hardly scale to handle the task with the tweet stream in real-time, we propose in this paper TopicSketch, a novel sketch-based topic model together with a set of techniques to achieve real-time detection. We evaluate our solution on a tweet stream with over 30 million tweets. Our experiment results show both efficiency and effectiveness of our approach. Especially it is also demonstrated that TopicSketch can potentially handle hundreds of millions tweets per day which is close to the total number of daily tweets in Twitter and present bursty event in finer-granularity. |
format |
text |
author |
XIE, Wei ZHU, Feida JIANG, Jing LIM, Ee Peng WANG, Ke |
author_facet |
XIE, Wei ZHU, Feida JIANG, Jing LIM, Ee Peng WANG, Ke |
author_sort |
XIE, Wei |
title |
TopicSketch: Real-time Bursty Topic Detection from Twitter |
title_short |
TopicSketch: Real-time Bursty Topic Detection from Twitter |
title_full |
TopicSketch: Real-time Bursty Topic Detection from Twitter |
title_fullStr |
TopicSketch: Real-time Bursty Topic Detection from Twitter |
title_full_unstemmed |
TopicSketch: Real-time Bursty Topic Detection from Twitter |
title_sort |
topicsketch: real-time bursty topic detection from twitter |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2013 |
url |
https://ink.library.smu.edu.sg/sis_research/2115 https://ink.library.smu.edu.sg/context/sis_research/article/3114/viewcontent/TopicSketch_Real_timeBurstyTopicDetectionTwitter.pdf |
_version_ |
1770571800200085504 |