Using support vector machine ensembles for target audience classification on Twitter

The vast amount and diversity of the content shared on social media can pose a challenge for any business wanting to use it to identify potential customers. In this paper, our aim is to investigate the use of both unsupervised and supervised learning methods for target audience classification on Twi...

Full description

Saved in:

Bibliographic Details
Main Authors:	LO, Siaw Ling, CHIONG, Raymond, CORNFORTH, David
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2015
Subjects:	Blogging Data Mining Marketing Social Media Support Vector Machine Computer Engineering Numerical Analysis and Scientific Computing
Online Access:	https://ink.library.smu.edu.sg/sis_research/5906 https://ink.library.smu.edu.sg/context/sis_research/article/6904/viewcontent/LoSiawLing_2015_Using_SVM_ensembles_for_target_audience_classification_on_Twitter.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-6904
record_format	dspace
spelling	sg-smu-ink.sis_research-69042023-04-12T05:10:56Z Using support vector machine ensembles for target audience classification on Twitter LO, Siaw Ling CHIONG, Raymond CORNFORTH, David The vast amount and diversity of the content shared on social media can pose a challenge for any business wanting to use it to identify potential customers. In this paper, our aim is to investigate the use of both unsupervised and supervised learning methods for target audience classification on Twitter with minimal annotation efforts. Topic domains were automatically discovered from contents shared by followers of an account owner using Twitter Latent Dirichlet Allocation (LDA). A Support Vector Machine (SVM) ensemble was then trained using contents from different account owners of the various topic domains identified by Twitter LDA. Experimental results show that the methods presented are able to successfully identify a target audience with high accuracy. In addition, we show that using a statistical inference approach such as bootstrapping in over-sampling, instead of using random sampling, to construct training datasets can achieve a better classifier in an SVM ensemble. We conclude that such an ensemble system can take advantage of data diversity, which enables real-world applications for differentiating prospective customers from the general audience, leading to business advantage in the crowded social media space. 2015-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5906 info:doi/10.1371/journal.pone.0122855 https://ink.library.smu.edu.sg/context/sis_research/article/6904/viewcontent/LoSiawLing_2015_Using_SVM_ensembles_for_target_audience_classification_on_Twitter.pdf http://creativecommons.org/licenses/by/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Blogging Data Mining Marketing Social Media Support Vector Machine Computer Engineering Numerical Analysis and Scientific Computing Social Media
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Blogging Data Mining Marketing Social Media Support Vector Machine Computer Engineering Numerical Analysis and Scientific Computing Social Media
spellingShingle	Blogging Data Mining Marketing Social Media Support Vector Machine Computer Engineering Numerical Analysis and Scientific Computing Social Media LO, Siaw Ling CHIONG, Raymond CORNFORTH, David Using support vector machine ensembles for target audience classification on Twitter
description	The vast amount and diversity of the content shared on social media can pose a challenge for any business wanting to use it to identify potential customers. In this paper, our aim is to investigate the use of both unsupervised and supervised learning methods for target audience classification on Twitter with minimal annotation efforts. Topic domains were automatically discovered from contents shared by followers of an account owner using Twitter Latent Dirichlet Allocation (LDA). A Support Vector Machine (SVM) ensemble was then trained using contents from different account owners of the various topic domains identified by Twitter LDA. Experimental results show that the methods presented are able to successfully identify a target audience with high accuracy. In addition, we show that using a statistical inference approach such as bootstrapping in over-sampling, instead of using random sampling, to construct training datasets can achieve a better classifier in an SVM ensemble. We conclude that such an ensemble system can take advantage of data diversity, which enables real-world applications for differentiating prospective customers from the general audience, leading to business advantage in the crowded social media space.
format	text
author	LO, Siaw Ling CHIONG, Raymond CORNFORTH, David
author_facet	LO, Siaw Ling CHIONG, Raymond CORNFORTH, David
author_sort	LO, Siaw Ling
title	Using support vector machine ensembles for target audience classification on Twitter
title_short	Using support vector machine ensembles for target audience classification on Twitter
title_full	Using support vector machine ensembles for target audience classification on Twitter
title_fullStr	Using support vector machine ensembles for target audience classification on Twitter
title_full_unstemmed	Using support vector machine ensembles for target audience classification on Twitter
title_sort	using support vector machine ensembles for target audience classification on twitter
publisher	Institutional Knowledge at Singapore Management University
publishDate	2015
url	https://ink.library.smu.edu.sg/sis_research/5906 https://ink.library.smu.edu.sg/context/sis_research/article/6904/viewcontent/LoSiawLing_2015_Using_SVM_ensembles_for_target_audience_classification_on_Twitter.pdf
_version_	1770575657924820992

Using support vector machine ensembles for target audience classification on Twitter

Similar Items