Who Should Review This Change? Putting Text and File Location Analyses Together for More Accurate Recommendations

Software code review is a process of developers inspecting new code changes made by others, to evaluate their quality and identify and fix defects, before integrating them to the main branch of a version control system. Modern Code Review (MCR), a lightweight and tool-based variant of conventional c...

Full description

Saved in:
Bibliographic Details
Main Authors: XIA, Xin, David LO, WANG, Xinyu, YANG, Xiaohu
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2015
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3086
http://dx.doi.org/10.1109/ICSM.2015.7332472
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4086
record_format dspace
spelling sg-smu-ink.sis_research-40862016-04-20T08:39:49Z Who Should Review This Change? Putting Text and File Location Analyses Together for More Accurate Recommendations XIA, Xin David LO, WANG, Xinyu YANG, Xiaohu Software code review is a process of developers inspecting new code changes made by others, to evaluate their quality and identify and fix defects, before integrating them to the main branch of a version control system. Modern Code Review (MCR), a lightweight and tool-based variant of conventional codereview, is widely adopted in both open source and proprietary software projects. One challenge that impacts MCR is the assignment of appropriate developers to review a code change. Considering that there could be hundreds of potential code reviewers in a software project, picking suitable reviewers is not a straightforward task. A prior study by Thongtanunam et al. showed that the difficulty in selecting suitable reviewers may delay the review process by an average of 12 days. In this paper, to address the challenge of assigning suitable reviewers to changes, we propose a hybrid and incremental approach Tie which utilizes the advantages of both Text mIning and a filE location-based approach. To do this, Tie integrates an incremental text mining model which analyzes the textual contents in a reviewrequest, and a similarity model which measures the similarity of changed file paths and reviewed filepaths. We perform a large-scale experiment on four open source projects, namely Android, OpenStack, QT, and LibreOffice, containing a total of 42,045 reviews. The experimental results show that on average Tie can achieve top-1, top-5, and top-10 accuracies, and Mean Reciprocal Rank (MRR) of 0.52, 0.79, 0.85, and 0.64 for the four projects, which improves the state-of-the-art approach RevFinder, proposed by Thongtanunam et al., by 61%, 23%, 8%, and 37%, respectively. 2015-10-01T07:00:00Z text https://ink.library.smu.edu.sg/sis_research/3086 info:doi/10.1109/ICSM.2015.7332472 http://dx.doi.org/10.1109/ICSM.2015.7332472 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Modern Code Review Path Similarity Recommendation System Text Mining Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Modern Code Review
Path Similarity
Recommendation System
Text Mining
Software Engineering
spellingShingle Modern Code Review
Path Similarity
Recommendation System
Text Mining
Software Engineering
XIA, Xin
David LO,
WANG, Xinyu
YANG, Xiaohu
Who Should Review This Change? Putting Text and File Location Analyses Together for More Accurate Recommendations
description Software code review is a process of developers inspecting new code changes made by others, to evaluate their quality and identify and fix defects, before integrating them to the main branch of a version control system. Modern Code Review (MCR), a lightweight and tool-based variant of conventional codereview, is widely adopted in both open source and proprietary software projects. One challenge that impacts MCR is the assignment of appropriate developers to review a code change. Considering that there could be hundreds of potential code reviewers in a software project, picking suitable reviewers is not a straightforward task. A prior study by Thongtanunam et al. showed that the difficulty in selecting suitable reviewers may delay the review process by an average of 12 days. In this paper, to address the challenge of assigning suitable reviewers to changes, we propose a hybrid and incremental approach Tie which utilizes the advantages of both Text mIning and a filE location-based approach. To do this, Tie integrates an incremental text mining model which analyzes the textual contents in a reviewrequest, and a similarity model which measures the similarity of changed file paths and reviewed filepaths. We perform a large-scale experiment on four open source projects, namely Android, OpenStack, QT, and LibreOffice, containing a total of 42,045 reviews. The experimental results show that on average Tie can achieve top-1, top-5, and top-10 accuracies, and Mean Reciprocal Rank (MRR) of 0.52, 0.79, 0.85, and 0.64 for the four projects, which improves the state-of-the-art approach RevFinder, proposed by Thongtanunam et al., by 61%, 23%, 8%, and 37%, respectively.
format text
author XIA, Xin
David LO,
WANG, Xinyu
YANG, Xiaohu
author_facet XIA, Xin
David LO,
WANG, Xinyu
YANG, Xiaohu
author_sort XIA, Xin
title Who Should Review This Change? Putting Text and File Location Analyses Together for More Accurate Recommendations
title_short Who Should Review This Change? Putting Text and File Location Analyses Together for More Accurate Recommendations
title_full Who Should Review This Change? Putting Text and File Location Analyses Together for More Accurate Recommendations
title_fullStr Who Should Review This Change? Putting Text and File Location Analyses Together for More Accurate Recommendations
title_full_unstemmed Who Should Review This Change? Putting Text and File Location Analyses Together for More Accurate Recommendations
title_sort who should review this change? putting text and file location analyses together for more accurate recommendations
publisher Institutional Knowledge at Singapore Management University
publishDate 2015
url https://ink.library.smu.edu.sg/sis_research/3086
http://dx.doi.org/10.1109/ICSM.2015.7332472
_version_ 1770572805392302080