Automating change-level self-admitted technical debt determination

Self-Admitted Technical Debt (SATD) refers to technical debt that is introduced intentionally. Previous studies that identify SATD at the file-level in isolation cannot describe the TD context related to multiple files. Therefore, it is more beneficial to identify the SATD once a change is being mad...

Full description

Saved in:
Bibliographic Details
Main Authors: YAN, Meng, XIA, Xin, SHIHAB, Emad, LO, David, YIN, Jianwei, YANG, Xiaohu
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4352
https://ink.library.smu.edu.sg/context/sis_research/article/5355/viewcontent/Automating_change_level_self_admitted_tse_2018_afv.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5355
record_format dspace
spelling sg-smu-ink.sis_research-53552022-07-26T09:11:41Z Automating change-level self-admitted technical debt determination YAN, Meng XIA, Xin SHIHAB, Emad LO, David YIN, Jianwei YANG, Xiaohu Self-Admitted Technical Debt (SATD) refers to technical debt that is introduced intentionally. Previous studies that identify SATD at the file-level in isolation cannot describe the TD context related to multiple files. Therefore, it is more beneficial to identify the SATD once a change is being made. We refer to this type of TD identification as “Change-level SATD Determination”, and identifying SATD at the change-level can help to manage and control TD by understanding the TD context through tracing the introducing changes. In this paper, we propose a change-level SATD Determination mode by extracting 25 features from software changes that are divided into three dimensions, namely diffusion, history and message, respectively. To evaluate the effectiveness of our proposed model, we perform an empirical study on 7 open source projects containing a total of 100,011 software changes. The experimental results show that our model achieves a promising and better performance than four baselines in terms of AUC and cost-effectiveness. On average across the 7 experimental projects, our model achieves AUC of 0.82, cost-effectiveness of 0.80, which is a significant improvement over the comparison baselines used. In addition, we found that “Diffusion” is the most discriminative dimension for determining TD-introducing changes 2019-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4352 info:doi/10.1109/TSE.2018.2831232 https://ink.library.smu.edu.sg/context/sis_research/article/5355/viewcontent/Automating_change_level_self_admitted_tse_2018_afv.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Self-admitted Technical Debt Software Change Labeling Measurement Software quality Technical Debt Feature extraction Change-level Determination Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Self-admitted Technical Debt
Software Change
Labeling
Measurement
Software quality
Technical Debt
Feature extraction
Change-level Determination
Software Engineering
spellingShingle Self-admitted Technical Debt
Software Change
Labeling
Measurement
Software quality
Technical Debt
Feature extraction
Change-level Determination
Software Engineering
YAN, Meng
XIA, Xin
SHIHAB, Emad
LO, David
YIN, Jianwei
YANG, Xiaohu
Automating change-level self-admitted technical debt determination
description Self-Admitted Technical Debt (SATD) refers to technical debt that is introduced intentionally. Previous studies that identify SATD at the file-level in isolation cannot describe the TD context related to multiple files. Therefore, it is more beneficial to identify the SATD once a change is being made. We refer to this type of TD identification as “Change-level SATD Determination”, and identifying SATD at the change-level can help to manage and control TD by understanding the TD context through tracing the introducing changes. In this paper, we propose a change-level SATD Determination mode by extracting 25 features from software changes that are divided into three dimensions, namely diffusion, history and message, respectively. To evaluate the effectiveness of our proposed model, we perform an empirical study on 7 open source projects containing a total of 100,011 software changes. The experimental results show that our model achieves a promising and better performance than four baselines in terms of AUC and cost-effectiveness. On average across the 7 experimental projects, our model achieves AUC of 0.82, cost-effectiveness of 0.80, which is a significant improvement over the comparison baselines used. In addition, we found that “Diffusion” is the most discriminative dimension for determining TD-introducing changes
format text
author YAN, Meng
XIA, Xin
SHIHAB, Emad
LO, David
YIN, Jianwei
YANG, Xiaohu
author_facet YAN, Meng
XIA, Xin
SHIHAB, Emad
LO, David
YIN, Jianwei
YANG, Xiaohu
author_sort YAN, Meng
title Automating change-level self-admitted technical debt determination
title_short Automating change-level self-admitted technical debt determination
title_full Automating change-level self-admitted technical debt determination
title_fullStr Automating change-level self-admitted technical debt determination
title_full_unstemmed Automating change-level self-admitted technical debt determination
title_sort automating change-level self-admitted technical debt determination
publisher Institutional Knowledge at Singapore Management University
publishDate 2019
url https://ink.library.smu.edu.sg/sis_research/4352
https://ink.library.smu.edu.sg/context/sis_research/article/5355/viewcontent/Automating_change_level_self_admitted_tse_2018_afv.pdf
_version_ 1770574663474216960