DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction

Software quality assurance efforts often focus on identifying defective code. To find likely defective code early, change-level defect prediction – aka. Just-In-Time (JIT) defect prediction – has been proposed. JIT defect prediction models identify likely defective changes and they are trained using...

Full description

Saved in:
Bibliographic Details
Main Authors: HOANG, Thong, DAM, Hoa Khanh, KAMEI, Yasutaka, LO, David, UBAYASHI, Naoyasu
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4486
https://ink.library.smu.edu.sg/context/sis_research/article/5489/viewcontent/Thong_MSR2019.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5489
record_format dspace
spelling sg-smu-ink.sis_research-54892020-04-03T09:36:13Z DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction HOANG, Thong DAM, Hoa Khanh KAMEI, Yasutaka LO, David UBAYASHI, Naoyasu Software quality assurance efforts often focus on identifying defective code. To find likely defective code early, change-level defect prediction – aka. Just-In-Time (JIT) defect prediction – has been proposed. JIT defect prediction models identify likely defective changes and they are trained using machine learning techniques with the assumption that historical changes are similar to future ones. Most existing JIT defect prediction approaches make use of manually engineered features. Unlike those approaches, in this paper, we propose an end-to-end deep learning framework, named DeepJIT, that automatically extracts features from commit messages and code changes and use them to identify defects. Experiments on two popular software projects (i.e., QT and OPENSTACK) on three evaluation settings (i.e., cross-validation, short-period, and long-period) show that the best variant of DeepJIT (DeepJIT-Combined), compared with the best performing state-of-the-art approach, achieves improvements of 10.36-11.02% for the project QT and 9.51- 13.69% for the project OPENSTACK in terms of the Area Under the Curve (AUC). 2019-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4486 info:doi/10.1109/MSR.2019.00016 https://ink.library.smu.edu.sg/context/sis_research/article/5489/viewcontent/Thong_MSR2019.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Convolutional neural network Deep learning Just-in-time defect prediction Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Convolutional neural network
Deep learning
Just-in-time defect prediction
Software Engineering
spellingShingle Convolutional neural network
Deep learning
Just-in-time defect prediction
Software Engineering
HOANG, Thong
DAM, Hoa Khanh
KAMEI, Yasutaka
LO, David
UBAYASHI, Naoyasu
DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction
description Software quality assurance efforts often focus on identifying defective code. To find likely defective code early, change-level defect prediction – aka. Just-In-Time (JIT) defect prediction – has been proposed. JIT defect prediction models identify likely defective changes and they are trained using machine learning techniques with the assumption that historical changes are similar to future ones. Most existing JIT defect prediction approaches make use of manually engineered features. Unlike those approaches, in this paper, we propose an end-to-end deep learning framework, named DeepJIT, that automatically extracts features from commit messages and code changes and use them to identify defects. Experiments on two popular software projects (i.e., QT and OPENSTACK) on three evaluation settings (i.e., cross-validation, short-period, and long-period) show that the best variant of DeepJIT (DeepJIT-Combined), compared with the best performing state-of-the-art approach, achieves improvements of 10.36-11.02% for the project QT and 9.51- 13.69% for the project OPENSTACK in terms of the Area Under the Curve (AUC).
format text
author HOANG, Thong
DAM, Hoa Khanh
KAMEI, Yasutaka
LO, David
UBAYASHI, Naoyasu
author_facet HOANG, Thong
DAM, Hoa Khanh
KAMEI, Yasutaka
LO, David
UBAYASHI, Naoyasu
author_sort HOANG, Thong
title DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction
title_short DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction
title_full DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction
title_fullStr DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction
title_full_unstemmed DeepJIT: an end-to-end deep learning framework for just-in-time defect prediction
title_sort deepjit: an end-to-end deep learning framework for just-in-time defect prediction
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/4486
https://ink.library.smu.edu.sg/context/sis_research/article/5489/viewcontent/Thong_MSR2019.pdf
_version_ 1770574873064636416