Information Retrieval and Spectrum Based Bug Localization: Better Together

Debugging often takes much effort and resources. To help developers debug, numerous information retrieval (IR)-based and spectrum-based bug localization techniques have been proposed. IR-based techniques process textual information in bug reports, while spectrum-based techniques process program spec...

Full description

Saved in:
Bibliographic Details
Main Authors: LE, Tien-Duy B., OENTARYO, Richard J., David LO
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2015
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3082
https://ink.library.smu.edu.sg/context/sis_research/article/4082/viewcontent/esec_fse15_debugging.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4082
record_format dspace
spelling sg-smu-ink.sis_research-40822018-03-22T01:58:44Z Information Retrieval and Spectrum Based Bug Localization: Better Together LE, Tien-Duy B. OENTARYO, Richard J. David LO, Debugging often takes much effort and resources. To help developers debug, numerous information retrieval (IR)-based and spectrum-based bug localization techniques have been proposed. IR-based techniques process textual information in bug reports, while spectrum-based techniques process program spectra (i.e., a record of which program elements are executed for each test case). Both eventually generate a ranked list of program elements that are likely to contain the bug. However, these techniques only consider one source of information, either bug reports or program spectra, which is not optimal. To deal with the limitation of existing techniques, in this work, we propose a new multi-modal technique that considers both bug reports and program spectra to localize bugs. Our approach adaptively creates a bug-specific model to map a particular bug to its possible location, and introduces a novel idea of suspicious words that are highly associated to a bug. We evaluate our approach on 157 real bugs from four software systems, and compare it with a state-of-the-art IR-based bug localization method, a state-of-the-art spectrum-based bug localization method, and three state-of-the-art multi-modal feature location methods that are adapted for bug localization. Experiments show that our approach can outperform the baselines by at least 47.62%, 31.48%, 27.78%, and 28.80% in terms of number of bugs successfully localized when a developer inspects 1, 5, and 10 program elements (i.e., Top 1, Top 5, and Top 10), and Mean Average Precision (MAP) respectively. 2015-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3082 info:doi/10.1145/2786805.2786880 https://ink.library.smu.edu.sg/context/sis_research/article/4082/viewcontent/esec_fse15_debugging.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Program Spectra Information Retrieval Bug Localization Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Program Spectra
Information Retrieval
Bug Localization
Software Engineering
spellingShingle Program Spectra
Information Retrieval
Bug Localization
Software Engineering
LE, Tien-Duy B.
OENTARYO, Richard J.
David LO,
Information Retrieval and Spectrum Based Bug Localization: Better Together
description Debugging often takes much effort and resources. To help developers debug, numerous information retrieval (IR)-based and spectrum-based bug localization techniques have been proposed. IR-based techniques process textual information in bug reports, while spectrum-based techniques process program spectra (i.e., a record of which program elements are executed for each test case). Both eventually generate a ranked list of program elements that are likely to contain the bug. However, these techniques only consider one source of information, either bug reports or program spectra, which is not optimal. To deal with the limitation of existing techniques, in this work, we propose a new multi-modal technique that considers both bug reports and program spectra to localize bugs. Our approach adaptively creates a bug-specific model to map a particular bug to its possible location, and introduces a novel idea of suspicious words that are highly associated to a bug. We evaluate our approach on 157 real bugs from four software systems, and compare it with a state-of-the-art IR-based bug localization method, a state-of-the-art spectrum-based bug localization method, and three state-of-the-art multi-modal feature location methods that are adapted for bug localization. Experiments show that our approach can outperform the baselines by at least 47.62%, 31.48%, 27.78%, and 28.80% in terms of number of bugs successfully localized when a developer inspects 1, 5, and 10 program elements (i.e., Top 1, Top 5, and Top 10), and Mean Average Precision (MAP) respectively.
format text
author LE, Tien-Duy B.
OENTARYO, Richard J.
David LO,
author_facet LE, Tien-Duy B.
OENTARYO, Richard J.
David LO,
author_sort LE, Tien-Duy B.
title Information Retrieval and Spectrum Based Bug Localization: Better Together
title_short Information Retrieval and Spectrum Based Bug Localization: Better Together
title_full Information Retrieval and Spectrum Based Bug Localization: Better Together
title_fullStr Information Retrieval and Spectrum Based Bug Localization: Better Together
title_full_unstemmed Information Retrieval and Spectrum Based Bug Localization: Better Together
title_sort information retrieval and spectrum based bug localization: better together
publisher Institutional Knowledge at Singapore Management University
publishDate 2015
url https://ink.library.smu.edu.sg/sis_research/3082
https://ink.library.smu.edu.sg/context/sis_research/article/4082/viewcontent/esec_fse15_debugging.pdf
_version_ 1770572803869769728