Deep just-in-time defect localization

During software development and maintenance, defect localization is an essential part of software quality assurance. Even though different techniques have been proposed for defect localization, i.e., information retrieval (IR)-based techniques and spectrum-based techniques, they can only work after...

Full description

Saved in:

Bibliographic Details
Main Authors:	QIU, Fangcheng, GAO, Zhipeng, XIA, Xin, LO, David, GRUNDY, John, WANG, Xinyu
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2022
Subjects:	Databases and Information Systems Software Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/7666 https://ink.library.smu.edu.sg/context/sis_research/article/8669/viewcontent/tse216.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-8669
record_format	dspace
spelling	sg-smu-ink.sis_research-86692023-01-10T03:41:38Z Deep just-in-time defect localization QIU, Fangcheng GAO, Zhipeng XIA, Xin LO, David GRUNDY, John WANG, Xinyu During software development and maintenance, defect localization is an essential part of software quality assurance. Even though different techniques have been proposed for defect localization, i.e., information retrieval (IR)-based techniques and spectrum-based techniques, they can only work after the defect has been exposed, which can be too late and costly to adapt to the newly introduced bugs in the daily development. There are also many JIT defect prediction tools that have been proposed to predict the buggy commit. But these tools do not locate the suspicious buggy positions in the buggy commit. To assist developers to detect bugs in time and avoid introducing them, just-in-time (JIT) bug localization techniques have been proposed, which is targeting to locate suspicious buggy code after a change commit has been submitted. In this paper, we propose a novel JIT defect localization approach, named DEEPDL (Deep Learning-based defect localization), to locate defect code lines within a defect introducing change. DEEPDL employs a neural language model to capture the semantics of the code lines, in this way, the naturalness of each code line can be learned and converted to a suspiciousness score. The core of our DEEPDL is a deep learning-based neural language model. We train the neural language model with previous snapshots (history versions) of a project so that it can calculate the naturalness of a piece of code. In its application, for a given new code change, DEEPDL automatically assigns a suspiciousness score to each code line and sorts these code lines in descending order of this score. The code lines at the top of the list are considered as potential defect locations. Our tool can assist developers efficiently check buggy lines at an early stage, which is able to reduce the risk of introducing bugs in time and improve the developers’ confidence in the reliability of their software. We conducted an extensive experiment on 14 open source Java projects with a total of 11,615 buggy changes. We evaluate the experimental results considering four evaluation metrics. The experimental results show that our method outperforms the state-of-the-art by a substantial margin 2022-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7666 info:doi/10.1109/TSE.2021.3135875 https://ink.library.smu.edu.sg/context/sis_research/article/8669/viewcontent/tse216.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Software Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Databases and Information Systems Software Engineering
spellingShingle	Databases and Information Systems Software Engineering QIU, Fangcheng GAO, Zhipeng XIA, Xin LO, David GRUNDY, John WANG, Xinyu Deep just-in-time defect localization
description	During software development and maintenance, defect localization is an essential part of software quality assurance. Even though different techniques have been proposed for defect localization, i.e., information retrieval (IR)-based techniques and spectrum-based techniques, they can only work after the defect has been exposed, which can be too late and costly to adapt to the newly introduced bugs in the daily development. There are also many JIT defect prediction tools that have been proposed to predict the buggy commit. But these tools do not locate the suspicious buggy positions in the buggy commit. To assist developers to detect bugs in time and avoid introducing them, just-in-time (JIT) bug localization techniques have been proposed, which is targeting to locate suspicious buggy code after a change commit has been submitted. In this paper, we propose a novel JIT defect localization approach, named DEEPDL (Deep Learning-based defect localization), to locate defect code lines within a defect introducing change. DEEPDL employs a neural language model to capture the semantics of the code lines, in this way, the naturalness of each code line can be learned and converted to a suspiciousness score. The core of our DEEPDL is a deep learning-based neural language model. We train the neural language model with previous snapshots (history versions) of a project so that it can calculate the naturalness of a piece of code. In its application, for a given new code change, DEEPDL automatically assigns a suspiciousness score to each code line and sorts these code lines in descending order of this score. The code lines at the top of the list are considered as potential defect locations. Our tool can assist developers efficiently check buggy lines at an early stage, which is able to reduce the risk of introducing bugs in time and improve the developers’ confidence in the reliability of their software. We conducted an extensive experiment on 14 open source Java projects with a total of 11,615 buggy changes. We evaluate the experimental results considering four evaluation metrics. The experimental results show that our method outperforms the state-of-the-art by a substantial margin
format	text
author	QIU, Fangcheng GAO, Zhipeng XIA, Xin LO, David GRUNDY, John WANG, Xinyu
author_facet	QIU, Fangcheng GAO, Zhipeng XIA, Xin LO, David GRUNDY, John WANG, Xinyu
author_sort	QIU, Fangcheng
title	Deep just-in-time defect localization
title_short	Deep just-in-time defect localization
title_full	Deep just-in-time defect localization
title_fullStr	Deep just-in-time defect localization
title_full_unstemmed	Deep just-in-time defect localization
title_sort	deep just-in-time defect localization
publisher	Institutional Knowledge at Singapore Management University
publishDate	2022
url	https://ink.library.smu.edu.sg/sis_research/7666 https://ink.library.smu.edu.sg/context/sis_research/article/8669/viewcontent/tse216.pdf
_version_	1770576410663976960

Deep just-in-time defect localization

Similar Items