Detecting Similar Applications with Collaborative Tagging

Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is...

Full description

Saved in:
Bibliographic Details
Main Authors: THUNG, Ferdian, LO, David, JIANG, Lingxiao
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2012
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/1577
https://ink.library.smu.edu.sg/context/sis_research/article/2576/viewcontent/CollaborativeTagging_ICSM_2012_av.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-2576
record_format dspace
spelling sg-smu-ink.sis_research-25762020-12-07T06:34:03Z Detecting Similar Applications with Collaborative Tagging THUNG, Ferdian LO, David JIANG, Lingxiao Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is stored in these microblogs. However, microblogs also contain a large amount of noisy content that are less relevant to software developers in engineering software systems. In this work, we perform a preliminary study to investigate the feasibility of automatic classification of microblogs into two categories: relevant and irrelevant to engineering software systems. We extract features from the textual content of the microblogs and the titles of any URLs mentioned in the microblogs. These features are then used to learn a discriminative model used in classifying relevant and irrelevant microblogs. We show that our trained model can achieve a promising classification performance. 2012-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1577 info:doi/10.1109/ICSM.2012.6405331 https://ink.library.smu.edu.sg/context/sis_research/article/2576/viewcontent/CollaborativeTagging_ICSM_2012_av.pdf Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Collaborative tagging Plagiarism detection Program comprehension Software applications Software development practices Software systems Usage patterns User study Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Collaborative tagging
Plagiarism detection
Program comprehension
Software applications
Software development practices
Software systems
Usage patterns
User study
Software Engineering
spellingShingle Collaborative tagging
Plagiarism detection
Program comprehension
Software applications
Software development practices
Software systems
Usage patterns
User study
Software Engineering
THUNG, Ferdian
LO, David
JIANG, Lingxiao
Detecting Similar Applications with Collaborative Tagging
description Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is stored in these microblogs. However, microblogs also contain a large amount of noisy content that are less relevant to software developers in engineering software systems. In this work, we perform a preliminary study to investigate the feasibility of automatic classification of microblogs into two categories: relevant and irrelevant to engineering software systems. We extract features from the textual content of the microblogs and the titles of any URLs mentioned in the microblogs. These features are then used to learn a discriminative model used in classifying relevant and irrelevant microblogs. We show that our trained model can achieve a promising classification performance.
format text
author THUNG, Ferdian
LO, David
JIANG, Lingxiao
author_facet THUNG, Ferdian
LO, David
JIANG, Lingxiao
author_sort THUNG, Ferdian
title Detecting Similar Applications with Collaborative Tagging
title_short Detecting Similar Applications with Collaborative Tagging
title_full Detecting Similar Applications with Collaborative Tagging
title_fullStr Detecting Similar Applications with Collaborative Tagging
title_full_unstemmed Detecting Similar Applications with Collaborative Tagging
title_sort detecting similar applications with collaborative tagging
publisher Institutional Knowledge at Singapore Management University
publishDate 2012
url https://ink.library.smu.edu.sg/sis_research/1577
https://ink.library.smu.edu.sg/context/sis_research/article/2576/viewcontent/CollaborativeTagging_ICSM_2012_av.pdf
_version_ 1770571305870950400