CC2Vec: Distributed representations of code changes
Existing work on software patches often use features specific to a single task. These works often rely on manually identified features, and human effort is required to identify these features for each task. In this work, we propose CC2Vec, a neural network model that learns a representation of code...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2020
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/5500 https://ink.library.smu.edu.sg/context/sis_research/article/6503/viewcontent/3377811.3380361.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-6503 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-65032021-05-12T03:29:58Z CC2Vec: Distributed representations of code changes HOANG, Thong KANG, Hong Jin LAWALL, Julia LO, David Existing work on software patches often use features specific to a single task. These works often rely on manually identified features, and human effort is required to identify these features for each task. In this work, we propose CC2Vec, a neural network model that learns a representation of code changes guided by their accompanying log messages, which represent the semantic intent of the code changes. CC2Vec models the hierarchical structure of a code change with the help of the attention mechanism and usesmultiple comparison functions to identify the differences between the removed and added code. To evaluate if CC2Vec can produce a distributed representation of code changes that is general and useful for multiple tasks on software patches, we use the vectors produced by CC2Vec for three tasks: log message generation, bug fixing patch identification, and just-in-time defect prediction. In all tasks, the models using CC2Vec outperform the state-of-the-art techniques. 2020-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5500 info:doi/doi.org/10.1145/3377811.3380361 https://ink.library.smu.edu.sg/context/sis_research/article/6503/viewcontent/3377811.3380361.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Software Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Software Engineering |
spellingShingle |
Software Engineering HOANG, Thong KANG, Hong Jin LAWALL, Julia LO, David CC2Vec: Distributed representations of code changes |
description |
Existing work on software patches often use features specific to a single task. These works often rely on manually identified features, and human effort is required to identify these features for each task. In this work, we propose CC2Vec, a neural network model that learns a representation of code changes guided by their accompanying log messages, which represent the semantic intent of the code changes. CC2Vec models the hierarchical structure of a code change with the help of the attention mechanism and usesmultiple comparison functions to identify the differences between the removed and added code. To evaluate if CC2Vec can produce a distributed representation of code changes that is general and useful for multiple tasks on software patches, we use the vectors produced by CC2Vec for three tasks: log message generation, bug fixing patch identification, and just-in-time defect prediction. In all tasks, the models using CC2Vec outperform the state-of-the-art techniques. |
format |
text |
author |
HOANG, Thong KANG, Hong Jin LAWALL, Julia LO, David |
author_facet |
HOANG, Thong KANG, Hong Jin LAWALL, Julia LO, David |
author_sort |
HOANG, Thong |
title |
CC2Vec: Distributed representations of code changes |
title_short |
CC2Vec: Distributed representations of code changes |
title_full |
CC2Vec: Distributed representations of code changes |
title_fullStr |
CC2Vec: Distributed representations of code changes |
title_full_unstemmed |
CC2Vec: Distributed representations of code changes |
title_sort |
cc2vec: distributed representations of code changes |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2020 |
url |
https://ink.library.smu.edu.sg/sis_research/5500 https://ink.library.smu.edu.sg/context/sis_research/article/6503/viewcontent/3377811.3380361.pdf |
_version_ |
1770575481216696320 |