AmaLgam+: Composing rich information sources for accurate bug localization

During the evolution of a software system, a large number of bug reports are submitted. Locating the source code files that need to be fixed to resolve the bugs is a challenging problem. Thus, there is a need for a technique that can automatically figure out these buggy files. A number of bug locali...

Full description

Saved in:
Bibliographic Details
Main Authors: WANG, Shaowei, David LO
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3576
https://ink.library.smu.edu.sg/context/sis_research/article/4577/viewcontent/Wang_Lo_AmaLgam_2016_JSEP.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4577
record_format dspace
spelling sg-smu-ink.sis_research-45772017-04-10T07:47:42Z AmaLgam+: Composing rich information sources for accurate bug localization WANG, Shaowei David LO, During the evolution of a software system, a large number of bug reports are submitted. Locating the source code files that need to be fixed to resolve the bugs is a challenging problem. Thus, there is a need for a technique that can automatically figure out these buggy files. A number of bug localization solutions that take in a bug report and output a ranked list of files sorted based on their likelihood to be buggy have been proposed in the literature. However, the accuracy of these tools still needs to be improved. In this paper, to address this need, we propose AmaLgam+, which is a method for locating relevant buggy files that puts together fives sources of information, namely, version history, similar reports, structure, stack traces, and reporter information. We perform a large-scale experiment on four open source projects, namely, AspectJ, Eclipse, SWT, and ZXing to localize more than 3000 bugs. We compare AmaLgam + with several state-of-the-art approaches including AmaLgam, BLUiR+, BRtracer+, BugLocator, and TFIDF-DHbPd. These approaches leverage one or several of the sources of information analyzed by AmaLgam+, but not all of them. On average, AmaLgam + achieves a 6.0% improvement over AmaLgam, which merges three sources of information, in terms of Mean Average Precision (MAP). For AspectJ and Eclipse datasets, in which there are many bug reports with stack traces and many reporters submit multiple bug reports, AmaLgam + achieves a 12.0% improvement over AmaLgam in terms of MAP. Compared with the other state-of-the-art approaches, AmaLgam + achieves an improvement of 20.3%, 22.5%, 33.1%, and 73.9% over BLUiR+, BRtracer+, BugLocator, and TFIDF-DHbPd in terms of MAP, respectively. 2016-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3576 info:doi/10.1002/smr.1801 https://ink.library.smu.edu.sg/context/sis_research/article/4577/viewcontent/Wang_Lo_AmaLgam_2016_JSEP.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University bug localization reporter information similar report stack traces structure version history Computer Sciences Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic bug localization
reporter information
similar report
stack traces
structure
version history
Computer Sciences
Software Engineering
spellingShingle bug localization
reporter information
similar report
stack traces
structure
version history
Computer Sciences
Software Engineering
WANG, Shaowei
David LO,
AmaLgam+: Composing rich information sources for accurate bug localization
description During the evolution of a software system, a large number of bug reports are submitted. Locating the source code files that need to be fixed to resolve the bugs is a challenging problem. Thus, there is a need for a technique that can automatically figure out these buggy files. A number of bug localization solutions that take in a bug report and output a ranked list of files sorted based on their likelihood to be buggy have been proposed in the literature. However, the accuracy of these tools still needs to be improved. In this paper, to address this need, we propose AmaLgam+, which is a method for locating relevant buggy files that puts together fives sources of information, namely, version history, similar reports, structure, stack traces, and reporter information. We perform a large-scale experiment on four open source projects, namely, AspectJ, Eclipse, SWT, and ZXing to localize more than 3000 bugs. We compare AmaLgam + with several state-of-the-art approaches including AmaLgam, BLUiR+, BRtracer+, BugLocator, and TFIDF-DHbPd. These approaches leverage one or several of the sources of information analyzed by AmaLgam+, but not all of them. On average, AmaLgam + achieves a 6.0% improvement over AmaLgam, which merges three sources of information, in terms of Mean Average Precision (MAP). For AspectJ and Eclipse datasets, in which there are many bug reports with stack traces and many reporters submit multiple bug reports, AmaLgam + achieves a 12.0% improvement over AmaLgam in terms of MAP. Compared with the other state-of-the-art approaches, AmaLgam + achieves an improvement of 20.3%, 22.5%, 33.1%, and 73.9% over BLUiR+, BRtracer+, BugLocator, and TFIDF-DHbPd in terms of MAP, respectively.
format text
author WANG, Shaowei
David LO,
author_facet WANG, Shaowei
David LO,
author_sort WANG, Shaowei
title AmaLgam+: Composing rich information sources for accurate bug localization
title_short AmaLgam+: Composing rich information sources for accurate bug localization
title_full AmaLgam+: Composing rich information sources for accurate bug localization
title_fullStr AmaLgam+: Composing rich information sources for accurate bug localization
title_full_unstemmed AmaLgam+: Composing rich information sources for accurate bug localization
title_sort amalgam+: composing rich information sources for accurate bug localization
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/3576
https://ink.library.smu.edu.sg/context/sis_research/article/4577/viewcontent/Wang_Lo_AmaLgam_2016_JSEP.pdf
_version_ 1770573333591490560