Automatic classification of software related microblogs

Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is...

Full description

Saved in:
Bibliographic Details
Main Authors: PRASETYO, Philips Kokoh, LO, David, PALAKORN, Achananuparp, TIAN, Yuan, LIM, Ee Peng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2012
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/1576
https://ink.library.smu.edu.sg/context/sis_research/article/2575/viewcontent/C30___Automatic_Classification_of_Software_Related_Microblogs__ICSM2012_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-2575
record_format dspace
spelling sg-smu-ink.sis_research-25752018-07-13T03:10:39Z Automatic classification of software related microblogs PRASETYO, Philips Kokoh LO, David PALAKORN, Achananuparp TIAN, Yuan LIM, Ee Peng Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is stored in these microblogs. However, microblogs also contain a large amount of noisy content that are less relevant to software developers in engineering software systems. In this work, we perform a preliminary study to investigate the feasibility of automatic classification of microblogs into two categories: relevant and irrelevant to engineering software systems. We extract features from the textual content of the microblogs and the titles of any URLs mentioned in the microblogs. These features are then used to learn a discriminative model used in classifying relevant and irrelevant microblogs. We show that our trained model can achieve a promising classification performance. 2012-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1576 info:doi/10.1109/ICSM.2012.6405330 https://ink.library.smu.edu.sg/context/sis_research/article/2575/viewcontent/C30___Automatic_Classification_of_Software_Related_Microblogs__ICSM2012_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Communication Technology and New Media Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Communication Technology and New Media
Software Engineering
spellingShingle Communication Technology and New Media
Software Engineering
PRASETYO, Philips Kokoh
LO, David
PALAKORN, Achananuparp
TIAN, Yuan
LIM, Ee Peng
Automatic classification of software related microblogs
description Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is stored in these microblogs. However, microblogs also contain a large amount of noisy content that are less relevant to software developers in engineering software systems. In this work, we perform a preliminary study to investigate the feasibility of automatic classification of microblogs into two categories: relevant and irrelevant to engineering software systems. We extract features from the textual content of the microblogs and the titles of any URLs mentioned in the microblogs. These features are then used to learn a discriminative model used in classifying relevant and irrelevant microblogs. We show that our trained model can achieve a promising classification performance.
format text
author PRASETYO, Philips Kokoh
LO, David
PALAKORN, Achananuparp
TIAN, Yuan
LIM, Ee Peng
author_facet PRASETYO, Philips Kokoh
LO, David
PALAKORN, Achananuparp
TIAN, Yuan
LIM, Ee Peng
author_sort PRASETYO, Philips Kokoh
title Automatic classification of software related microblogs
title_short Automatic classification of software related microblogs
title_full Automatic classification of software related microblogs
title_fullStr Automatic classification of software related microblogs
title_full_unstemmed Automatic classification of software related microblogs
title_sort automatic classification of software related microblogs
publisher Institutional Knowledge at Singapore Management University
publishDate 2012
url https://ink.library.smu.edu.sg/sis_research/1576
https://ink.library.smu.edu.sg/context/sis_research/article/2575/viewcontent/C30___Automatic_Classification_of_Software_Related_Microblogs__ICSM2012_.pdf
_version_ 1770571305652846592