Inferring links between concerns and methods with multi-abstraction vector space model

Concern localization refers to the process of locating code units that match a particular textual description. It takes as input textual documents such as bug reports and feature requests and outputs a list of candidate code units that are relevant to the bug reports or feature requests. Many inform...

Full description

Saved in:

Bibliographic Details
Main Authors:	ZHANG, Yun, LO, David, XIA, Xin, LE, Tien-Duy B., SCANNIELLO, Giuseppe, SUN, Jianling
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2016
Subjects:	Concern localization Multi-abstraction Text retrieval Topic modeling Databases and Information Systems Software Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/3667 https://ink.library.smu.edu.sg/context/sis_research/article/4669/viewcontent/InferringLinksConcernsMethodsVectorSpace_2016ICSME.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-4669
record_format	dspace
spelling	sg-smu-ink.sis_research-46692018-12-05T04:02:47Z Inferring links between concerns and methods with multi-abstraction vector space model ZHANG, Yun LO, David XIA, Xin LE, Tien-Duy B. SCANNIELLO, Giuseppe SUN, Jianling Concern localization refers to the process of locating code units that match a particular textual description. It takes as input textual documents such as bug reports and feature requests and outputs a list of candidate code units that are relevant to the bug reports or feature requests. Many information retrieval (IR) based concern localization techniques have been proposed in the literature. These techniques typically represent code units and textual descriptions as a bag of tokens at one level of abstraction, e.g., each token is a word, or each token is a topic. In this work, we propose a multi-abstraction concern localization technique named MULAB. MULAB represents a code unit and a textual description at multiple abstraction levels. Similarity of a textual description and a code unit is now made by considering all these abstraction levels. We combine a vector space model and multiple topic models to compute the similarity and apply a genetic algorithm to infer semi-optimal topic model configurations. We have evaluated our solution on 136 concerns from 8 open source Java software systems. The experimental results show that MULAB outperforms the state-of-art baseline PR, which is proposed by Scanniello et al. in terms of effectiveness and rank. 2016-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3667 info:doi/10.1109/ICSME.2016.51 https://ink.library.smu.edu.sg/context/sis_research/article/4669/viewcontent/InferringLinksConcernsMethodsVectorSpace_2016ICSME.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Concern localization Multi-abstraction Text retrieval Topic modeling Databases and Information Systems Software Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Concern localization Multi-abstraction Text retrieval Topic modeling Databases and Information Systems Software Engineering
spellingShingle	Concern localization Multi-abstraction Text retrieval Topic modeling Databases and Information Systems Software Engineering ZHANG, Yun LO, David XIA, Xin LE, Tien-Duy B. SCANNIELLO, Giuseppe SUN, Jianling Inferring links between concerns and methods with multi-abstraction vector space model
description	Concern localization refers to the process of locating code units that match a particular textual description. It takes as input textual documents such as bug reports and feature requests and outputs a list of candidate code units that are relevant to the bug reports or feature requests. Many information retrieval (IR) based concern localization techniques have been proposed in the literature. These techniques typically represent code units and textual descriptions as a bag of tokens at one level of abstraction, e.g., each token is a word, or each token is a topic. In this work, we propose a multi-abstraction concern localization technique named MULAB. MULAB represents a code unit and a textual description at multiple abstraction levels. Similarity of a textual description and a code unit is now made by considering all these abstraction levels. We combine a vector space model and multiple topic models to compute the similarity and apply a genetic algorithm to infer semi-optimal topic model configurations. We have evaluated our solution on 136 concerns from 8 open source Java software systems. The experimental results show that MULAB outperforms the state-of-art baseline PR, which is proposed by Scanniello et al. in terms of effectiveness and rank.
format	text
author	ZHANG, Yun LO, David XIA, Xin LE, Tien-Duy B. SCANNIELLO, Giuseppe SUN, Jianling
author_facet	ZHANG, Yun LO, David XIA, Xin LE, Tien-Duy B. SCANNIELLO, Giuseppe SUN, Jianling
author_sort	ZHANG, Yun
title	Inferring links between concerns and methods with multi-abstraction vector space model
title_short	Inferring links between concerns and methods with multi-abstraction vector space model
title_full	Inferring links between concerns and methods with multi-abstraction vector space model
title_fullStr	Inferring links between concerns and methods with multi-abstraction vector space model
title_full_unstemmed	Inferring links between concerns and methods with multi-abstraction vector space model
title_sort	inferring links between concerns and methods with multi-abstraction vector space model
publisher	Institutional Knowledge at Singapore Management University
publishDate	2016
url	https://ink.library.smu.edu.sg/sis_research/3667 https://ink.library.smu.edu.sg/context/sis_research/article/4669/viewcontent/InferringLinksConcernsMethodsVectorSpace_2016ICSME.pdf
_version_	1770573474911223808

Inferring links between concerns and methods with multi-abstraction vector space model

Similar Items