DeepReview: Automatic code review using deep multi-instance learning

Code review, an inspection of code changes in order to identify and fix defects before integration, is essential in Software Quality Assurance (SQA). Code review is a time-consuming task since the reviewers need to understand, analysis and provide comments manually. To alleviate the burden of review...

Full description

Saved in:
Bibliographic Details
Main Authors: LI, Hengyi, SHI, Shuting, THUNG, Ferdian, HUO, Xuan, XU, Bowen, LI, Ming, LO, David
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4346
https://ink.library.smu.edu.sg/context/sis_research/article/5349/viewcontent/Deepreview_Automatic_code_review_using_deep_multi_instance_learning.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5349
record_format dspace
spelling sg-smu-ink.sis_research-53492019-06-06T08:28:22Z DeepReview: Automatic code review using deep multi-instance learning LI, Hengyi SHI, Shuting THUNG, Ferdian HUO, Xuan XU, Bowen LI, Ming LO, David Code review, an inspection of code changes in order to identify and fix defects before integration, is essential in Software Quality Assurance (SQA). Code review is a time-consuming task since the reviewers need to understand, analysis and provide comments manually. To alleviate the burden of reviewers, automatic code review is needed. However, this task has not been well studied before. To bridge this research gap, in this paper, we formalize automatic code review as a multi-instance learning task that each change consisting of multiple hunks is regarded as a bag, and each hunk is described as an instance. We propose a novel deep learning model named DeepReview based on Convolutional Neural Network (CNN), which is an end-to-end model that learns feature representation to predict whether one change is approved or rejected. Experimental results on open source projects show that DeepReview is effective in automatic code review tasks. In terms of F1 score and AUC, DeepReview outperforms the performance of traditional single-instance based model TFIDF-SVM and the state-of-the-art deep feature based model Deeper. 2019-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4346 info:doi/10.1007/978-3-030-16145-3_25 https://ink.library.smu.edu.sg/context/sis_research/article/5349/viewcontent/Deepreview_Automatic_code_review_using_deep_multi_instance_learning.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Automatic code review Machine learning Multi-instance learning Software mining Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Automatic code review
Machine learning
Multi-instance learning
Software mining
Software Engineering
spellingShingle Automatic code review
Machine learning
Multi-instance learning
Software mining
Software Engineering
LI, Hengyi
SHI, Shuting
THUNG, Ferdian
HUO, Xuan
XU, Bowen
LI, Ming
LO, David
DeepReview: Automatic code review using deep multi-instance learning
description Code review, an inspection of code changes in order to identify and fix defects before integration, is essential in Software Quality Assurance (SQA). Code review is a time-consuming task since the reviewers need to understand, analysis and provide comments manually. To alleviate the burden of reviewers, automatic code review is needed. However, this task has not been well studied before. To bridge this research gap, in this paper, we formalize automatic code review as a multi-instance learning task that each change consisting of multiple hunks is regarded as a bag, and each hunk is described as an instance. We propose a novel deep learning model named DeepReview based on Convolutional Neural Network (CNN), which is an end-to-end model that learns feature representation to predict whether one change is approved or rejected. Experimental results on open source projects show that DeepReview is effective in automatic code review tasks. In terms of F1 score and AUC, DeepReview outperforms the performance of traditional single-instance based model TFIDF-SVM and the state-of-the-art deep feature based model Deeper.
format text
author LI, Hengyi
SHI, Shuting
THUNG, Ferdian
HUO, Xuan
XU, Bowen
LI, Ming
LO, David
author_facet LI, Hengyi
SHI, Shuting
THUNG, Ferdian
HUO, Xuan
XU, Bowen
LI, Ming
LO, David
author_sort LI, Hengyi
title DeepReview: Automatic code review using deep multi-instance learning
title_short DeepReview: Automatic code review using deep multi-instance learning
title_full DeepReview: Automatic code review using deep multi-instance learning
title_fullStr DeepReview: Automatic code review using deep multi-instance learning
title_full_unstemmed DeepReview: Automatic code review using deep multi-instance learning
title_sort deepreview: automatic code review using deep multi-instance learning
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/4346
https://ink.library.smu.edu.sg/context/sis_research/article/5349/viewcontent/Deepreview_Automatic_code_review_using_deep_multi_instance_learning.pdf
_version_ 1770574661815369728