Enhancing naive bayes with various smoothing methods for short text classification

Partly due to the proliferance of microblog, short texts are becoming prominent. A huge number of short texts are generated every day, which calls for a method that can efficiently accommodate new data to incrementally adjust classification models. Naive Bayes meets such a need. We apply several smo...

Full description

Saved in:
Bibliographic Details
Main Authors: Yuan, Quan, Cong, Gao, Thalmann, Nadia Magnenat
Other Authors: School of Computer Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/97050
http://hdl.handle.net/10220/11705
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-97050
record_format dspace
spelling sg-ntu-dr.10356-970502020-05-28T07:17:37Z Enhancing naive bayes with various smoothing methods for short text classification Yuan, Quan Cong, Gao Thalmann, Nadia Magnenat School of Computer Engineering International conference companion on World Wide Web (21st : 2012 : Lyon, France) DRNTU::Engineering::Computer science and engineering Partly due to the proliferance of microblog, short texts are becoming prominent. A huge number of short texts are generated every day, which calls for a method that can efficiently accommodate new data to incrementally adjust classification models. Naive Bayes meets such a need. We apply several smoothing models to Naive Bayes for question topic classification, as an example of short text classification, and study their performance. The experimental results on a large real question data show that the smoothing methods are able to significantly improve the question classification performance of Naive Bayes. We also study the effect of training data size, and question length on performance. Published Version 2013-07-17T04:35:52Z 2019-12-06T19:38:17Z 2013-07-17T04:35:52Z 2019-12-06T19:38:17Z 2012 2012 Conference Paper Yuan, Q., Cong, G., & Thalmann, N. M. (2012). Enhancing naive bayes with various smoothing methods for short text classification. Proceedings of the 21st international conference companion on World Wide Web - WWW '12 Companion, 645-646. https://hdl.handle.net/10356/97050 http://hdl.handle.net/10220/11705 10.1145/2187980.2188169 en © 2012 The Authors. This paper was published in WWW'12 - Proceedings of the 21st Annual Conference on World Wide Web Companion and is made available as an electronic reprint (preprint) with permission of The Authors. The paper can be found at the following official DOI: [http://dx.doi.org/10.1145/2187980.2188169]. One print or electronic copy may be made for personal use only. Systematic or multiple reproduction, distribution to multiple locations via electronic or other means, duplication of any material in this paper for a fee or for commercial purposes, or modification of the content of the paper is prohibited and is subject to penalties under law. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Yuan, Quan
Cong, Gao
Thalmann, Nadia Magnenat
Enhancing naive bayes with various smoothing methods for short text classification
description Partly due to the proliferance of microblog, short texts are becoming prominent. A huge number of short texts are generated every day, which calls for a method that can efficiently accommodate new data to incrementally adjust classification models. Naive Bayes meets such a need. We apply several smoothing models to Naive Bayes for question topic classification, as an example of short text classification, and study their performance. The experimental results on a large real question data show that the smoothing methods are able to significantly improve the question classification performance of Naive Bayes. We also study the effect of training data size, and question length on performance.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Yuan, Quan
Cong, Gao
Thalmann, Nadia Magnenat
format Conference or Workshop Item
author Yuan, Quan
Cong, Gao
Thalmann, Nadia Magnenat
author_sort Yuan, Quan
title Enhancing naive bayes with various smoothing methods for short text classification
title_short Enhancing naive bayes with various smoothing methods for short text classification
title_full Enhancing naive bayes with various smoothing methods for short text classification
title_fullStr Enhancing naive bayes with various smoothing methods for short text classification
title_full_unstemmed Enhancing naive bayes with various smoothing methods for short text classification
title_sort enhancing naive bayes with various smoothing methods for short text classification
publishDate 2013
url https://hdl.handle.net/10356/97050
http://hdl.handle.net/10220/11705
_version_ 1681057831980105728