Automatic classification of software related microblogs
Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2012
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/1576 https://ink.library.smu.edu.sg/context/sis_research/article/2575/viewcontent/C30___Automatic_Classification_of_Software_Related_Microblogs__ICSM2012_.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-2575 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-25752018-07-13T03:10:39Z Automatic classification of software related microblogs PRASETYO, Philips Kokoh LO, David PALAKORN, Achananuparp TIAN, Yuan LIM, Ee Peng Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is stored in these microblogs. However, microblogs also contain a large amount of noisy content that are less relevant to software developers in engineering software systems. In this work, we perform a preliminary study to investigate the feasibility of automatic classification of microblogs into two categories: relevant and irrelevant to engineering software systems. We extract features from the textual content of the microblogs and the titles of any URLs mentioned in the microblogs. These features are then used to learn a discriminative model used in classifying relevant and irrelevant microblogs. We show that our trained model can achieve a promising classification performance. 2012-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1576 info:doi/10.1109/ICSM.2012.6405330 https://ink.library.smu.edu.sg/context/sis_research/article/2575/viewcontent/C30___Automatic_Classification_of_Software_Related_Microblogs__ICSM2012_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Communication Technology and New Media Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Communication Technology and New Media Software Engineering |
spellingShingle |
Communication Technology and New Media Software Engineering PRASETYO, Philips Kokoh LO, David PALAKORN, Achananuparp TIAN, Yuan LIM, Ee Peng Automatic classification of software related microblogs |
description |
Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is stored in these microblogs. However, microblogs also contain a large amount of noisy content that are less relevant to software developers in engineering software systems. In this work, we perform a preliminary study to investigate the feasibility of automatic classification of microblogs into two categories: relevant and irrelevant to engineering software systems. We extract features from the textual content of the microblogs and the titles of any URLs mentioned in the microblogs. These features are then used to learn a discriminative model used in classifying relevant and irrelevant microblogs. We show that our trained model can achieve a promising classification performance. |
format |
text |
author |
PRASETYO, Philips Kokoh LO, David PALAKORN, Achananuparp TIAN, Yuan LIM, Ee Peng |
author_facet |
PRASETYO, Philips Kokoh LO, David PALAKORN, Achananuparp TIAN, Yuan LIM, Ee Peng |
author_sort |
PRASETYO, Philips Kokoh |
title |
Automatic classification of software related microblogs |
title_short |
Automatic classification of software related microblogs |
title_full |
Automatic classification of software related microblogs |
title_fullStr |
Automatic classification of software related microblogs |
title_full_unstemmed |
Automatic classification of software related microblogs |
title_sort |
automatic classification of software related microblogs |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2012 |
url |
https://ink.library.smu.edu.sg/sis_research/1576 https://ink.library.smu.edu.sg/context/sis_research/article/2575/viewcontent/C30___Automatic_Classification_of_Software_Related_Microblogs__ICSM2012_.pdf |
_version_ |
1770571305652846592 |