Context Modeling for Ranking and Tagging Bursty Features in Text Streams

Bursty features in text streams are very useful in many text mining applications. Most existing studies detect bursty features based purely on term frequency changes without taking into account the semantic contexts of terms, and as a result the detected bursty features may not always be interesting...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHAO, Xin, JIANG, Jing, HE, Jing, LI, Xiaoming, YAN, Hongfei, Shan, Dongdong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2010
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/1314
https://ink.library.smu.edu.sg/context/sis_research/article/2313/viewcontent/p1769_zhao.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-2313
record_format dspace
spelling sg-smu-ink.sis_research-23132018-07-13T02:56:22Z Context Modeling for Ranking and Tagging Bursty Features in Text Streams ZHAO, Xin JIANG, Jing HE, Jing LI, Xiaoming YAN, Hongfei Shan, Dongdong Bursty features in text streams are very useful in many text mining applications. Most existing studies detect bursty features based purely on term frequency changes without taking into account the semantic contexts of terms, and as a result the detected bursty features may not always be interesting or easy to interpret. In this paper we propose to model the contexts of bursty features using a language modeling approach. We then propose a novel topic diversity-based metric using the context models to find newsworthy bursty features. We also propose to use the context models to automatically assign meaningful tags to bursty features. Using a large corpus of a stream of news articles, we quantitatively show that the proposed context language models for bursty features can effectively help rank bursty features based on their newsworthiness and to assign meaningful tags to annotate bursty features. 2010-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1314 info:doi/10.1145/1871437.1871725 https://ink.library.smu.edu.sg/context/sis_research/article/2313/viewcontent/p1769_zhao.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University bursty features bursty features ranking bursty feature tagging context modeling Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic bursty features
bursty features ranking
bursty feature tagging
context modeling
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle bursty features
bursty features ranking
bursty feature tagging
context modeling
Databases and Information Systems
Numerical Analysis and Scientific Computing
ZHAO, Xin
JIANG, Jing
HE, Jing
LI, Xiaoming
YAN, Hongfei
Shan, Dongdong
Context Modeling for Ranking and Tagging Bursty Features in Text Streams
description Bursty features in text streams are very useful in many text mining applications. Most existing studies detect bursty features based purely on term frequency changes without taking into account the semantic contexts of terms, and as a result the detected bursty features may not always be interesting or easy to interpret. In this paper we propose to model the contexts of bursty features using a language modeling approach. We then propose a novel topic diversity-based metric using the context models to find newsworthy bursty features. We also propose to use the context models to automatically assign meaningful tags to bursty features. Using a large corpus of a stream of news articles, we quantitatively show that the proposed context language models for bursty features can effectively help rank bursty features based on their newsworthiness and to assign meaningful tags to annotate bursty features.
format text
author ZHAO, Xin
JIANG, Jing
HE, Jing
LI, Xiaoming
YAN, Hongfei
Shan, Dongdong
author_facet ZHAO, Xin
JIANG, Jing
HE, Jing
LI, Xiaoming
YAN, Hongfei
Shan, Dongdong
author_sort ZHAO, Xin
title Context Modeling for Ranking and Tagging Bursty Features in Text Streams
title_short Context Modeling for Ranking and Tagging Bursty Features in Text Streams
title_full Context Modeling for Ranking and Tagging Bursty Features in Text Streams
title_fullStr Context Modeling for Ranking and Tagging Bursty Features in Text Streams
title_full_unstemmed Context Modeling for Ranking and Tagging Bursty Features in Text Streams
title_sort context modeling for ranking and tagging bursty features in text streams
publisher Institutional Knowledge at Singapore Management University
publishDate 2010
url https://ink.library.smu.edu.sg/sis_research/1314
https://ink.library.smu.edu.sg/context/sis_research/article/2313/viewcontent/p1769_zhao.pdf
_version_ 1770570946755690496