Definition, approaches, and analysis of code duplication detection (2006-2020): a critical review

Code duplication detection is the act of finding similar code in software development. It is important for software engineer to address the issues of code duplication detection. In this paper, a critical review of previous works on code duplication for code clone and plagiarism detection is performe...

Full description

Saved in:
Bibliographic Details
Main Authors: Chen, Chang Feng, Mohd. Zain, Azlan, Zhou, Kai Qing
Format: Article
Published: Springer Science and Business Media Deutschland GmbH 2022
Subjects:
Online Access:http://eprints.utm.my/103386/
http://dx.doi.org/10.1007/s00521-022-07707-2
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.103386
record_format eprints
spelling my.utm.1033862023-11-14T04:01:42Z http://eprints.utm.my/103386/ Definition, approaches, and analysis of code duplication detection (2006-2020): a critical review Chen, Chang Feng Mohd. Zain, Azlan Zhou, Kai Qing Q Science (General) QA75 Electronic computers. Computer science Code duplication detection is the act of finding similar code in software development. It is important for software engineer to address the issues of code duplication detection. In this paper, a critical review of previous works on code duplication for code clone and plagiarism detection is performed. The review involves five main parts. Firstly, a systematic literature review is conducted to confirm the selected articles. Secondly, a critical review of different code duplication approaches is conducted based on three phases; processing, detection, and decision. Thirdly, statistical analysis of the number of review articles is performed to show the trends and hots of code duplication research. Moreover, quantitative analysis of different code duplication approaches is presented to show the effectiveness of different approaches. Fourthly, the advantages and disadvantages of different approaches and techniques are summarized and discussed. Finally, the conclusion of the review is summarized and future research direction of code duplication is described. Springer Science and Business Media Deutschland GmbH 2022-12 Article PeerReviewed Chen, Chang Feng and Mohd. Zain, Azlan and Zhou, Kai Qing (2022) Definition, approaches, and analysis of code duplication detection (2006-2020): a critical review. Neural Computing and Applications, 34 (23). pp. 20507-20537. ISSN 0941-0643 http://dx.doi.org/10.1007/s00521-022-07707-2 DOI:10.1007/s00521-022-07707-2
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic Q Science (General)
QA75 Electronic computers. Computer science
spellingShingle Q Science (General)
QA75 Electronic computers. Computer science
Chen, Chang Feng
Mohd. Zain, Azlan
Zhou, Kai Qing
Definition, approaches, and analysis of code duplication detection (2006-2020): a critical review
description Code duplication detection is the act of finding similar code in software development. It is important for software engineer to address the issues of code duplication detection. In this paper, a critical review of previous works on code duplication for code clone and plagiarism detection is performed. The review involves five main parts. Firstly, a systematic literature review is conducted to confirm the selected articles. Secondly, a critical review of different code duplication approaches is conducted based on three phases; processing, detection, and decision. Thirdly, statistical analysis of the number of review articles is performed to show the trends and hots of code duplication research. Moreover, quantitative analysis of different code duplication approaches is presented to show the effectiveness of different approaches. Fourthly, the advantages and disadvantages of different approaches and techniques are summarized and discussed. Finally, the conclusion of the review is summarized and future research direction of code duplication is described.
format Article
author Chen, Chang Feng
Mohd. Zain, Azlan
Zhou, Kai Qing
author_facet Chen, Chang Feng
Mohd. Zain, Azlan
Zhou, Kai Qing
author_sort Chen, Chang Feng
title Definition, approaches, and analysis of code duplication detection (2006-2020): a critical review
title_short Definition, approaches, and analysis of code duplication detection (2006-2020): a critical review
title_full Definition, approaches, and analysis of code duplication detection (2006-2020): a critical review
title_fullStr Definition, approaches, and analysis of code duplication detection (2006-2020): a critical review
title_full_unstemmed Definition, approaches, and analysis of code duplication detection (2006-2020): a critical review
title_sort definition, approaches, and analysis of code duplication detection (2006-2020): a critical review
publisher Springer Science and Business Media Deutschland GmbH
publishDate 2022
url http://eprints.utm.my/103386/
http://dx.doi.org/10.1007/s00521-022-07707-2
_version_ 1783876357152833536