Social stream classification with emerging new labels

As an important research topic with well-recognized practical values, classification of social streams has been identified with increasing popularity with social data, such as the tweet stream generated by Twitter users in chronological order. A salient, and perhaps also the most interesting, featur...

Full description

Saved in:
Bibliographic Details
Main Authors: MU, Xin, ZHU, Feida, LIU, Yue, LIM, Ee-peng, ZHOU, Zhi-Hua
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2018
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4079
https://ink.library.smu.edu.sg/context/sis_research/article/5082/viewcontent/Mu2018_Chapter_SocialStreamClassificationWith.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5082
record_format dspace
spelling sg-smu-ink.sis_research-50822019-06-28T00:46:45Z Social stream classification with emerging new labels MU, Xin ZHU, Feida LIU, Yue LIM, Ee-peng ZHOU, Zhi-Hua As an important research topic with well-recognized practical values, classification of social streams has been identified with increasing popularity with social data, such as the tweet stream generated by Twitter users in chronological order. A salient, and perhaps also the most interesting, feature of such user-generated content is its never-failing novelty, which, unfortunately, would challenge most traditional pre-trained classification models as they are built based on fixed label set and would therefore fail to identify new labels as they emerge. In this paper, we study the problem of classification of social streams with emerging new labels, and propose a novel ensemble framework, integrating an instance-based learner and a label-based learner by completely-random trees. The proposed framework can not only classify known labels in the multi-label scenario, but also detect emerging new labels and update itself in the data stream. Extensive experiments on real-world stream data set from Weibo, a Chinese micro-blogging platform, demonstrate the superiority of our approach over the state-of-the-art methods. 2018-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4079 info:doi/10.1007/978-3-319-93034-3_2 https://ink.library.smu.edu.sg/context/sis_research/article/5082/viewcontent/Mu2018_Chapter_SocialStreamClassificationWith.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Stream classification Emerging new labels Model update Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Stream classification
Emerging new labels
Model update
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Stream classification
Emerging new labels
Model update
Databases and Information Systems
Numerical Analysis and Scientific Computing
MU, Xin
ZHU, Feida
LIU, Yue
LIM, Ee-peng
ZHOU, Zhi-Hua
Social stream classification with emerging new labels
description As an important research topic with well-recognized practical values, classification of social streams has been identified with increasing popularity with social data, such as the tweet stream generated by Twitter users in chronological order. A salient, and perhaps also the most interesting, feature of such user-generated content is its never-failing novelty, which, unfortunately, would challenge most traditional pre-trained classification models as they are built based on fixed label set and would therefore fail to identify new labels as they emerge. In this paper, we study the problem of classification of social streams with emerging new labels, and propose a novel ensemble framework, integrating an instance-based learner and a label-based learner by completely-random trees. The proposed framework can not only classify known labels in the multi-label scenario, but also detect emerging new labels and update itself in the data stream. Extensive experiments on real-world stream data set from Weibo, a Chinese micro-blogging platform, demonstrate the superiority of our approach over the state-of-the-art methods.
format text
author MU, Xin
ZHU, Feida
LIU, Yue
LIM, Ee-peng
ZHOU, Zhi-Hua
author_facet MU, Xin
ZHU, Feida
LIU, Yue
LIM, Ee-peng
ZHOU, Zhi-Hua
author_sort MU, Xin
title Social stream classification with emerging new labels
title_short Social stream classification with emerging new labels
title_full Social stream classification with emerging new labels
title_fullStr Social stream classification with emerging new labels
title_full_unstemmed Social stream classification with emerging new labels
title_sort social stream classification with emerging new labels
publisher Institutional Knowledge at Singapore Management University
publishDate 2018
url https://ink.library.smu.edu.sg/sis_research/4079
https://ink.library.smu.edu.sg/context/sis_research/article/5082/viewcontent/Mu2018_Chapter_SocialStreamClassificationWith.pdf
_version_ 1770574262720004096