Automatic generation of plagiarism detection among student programs

A system for the automatic generation of plagiarism detectors that find similar programs in a set of student programs is presented. Existing plagiarism detectors are either applied to a programming language or a pre-defined set of programming languages. The general purpose one usually employs string...

Full description

Saved in:
Bibliographic Details
Main Authors: Roxas, Rachel Edita O., Lim, Nathalie Rose T., Bautista, Natasja
Format: text
Published: Animo Repository 2006
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/469
https://animorepository.dlsu.edu.ph/context/faculty_research/article/1468/type/native/viewcontent
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-1468
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-14682022-11-14T23:54:43Z Automatic generation of plagiarism detection among student programs Roxas, Rachel Edita O. Lim, Nathalie Rose T. Bautista, Natasja A system for the automatic generation of plagiarism detectors that find similar programs in a set of student programs is presented. Existing plagiarism detectors are either applied to a programming language or a pre-defined set of programming languages. The general purpose one usually employs string matching to perform similarity measures that are based on plagiarism detection among documents in general, and not in programs in particular, thus, losing much of the structure and logic of programs in the process. On the other hand, plagiarism detectors for specific languages only cater to that particular set of languages. This study provides a means for the user to specify the programming language of the student programs to be analyzed. Moreover, an automatic plagiarism detector system must be immune to the transformations that students perform on copied programs. These transformations are usually dependent on several factors namely: the type of programming problems and correspondingly, the complexity of the project to be implemented by the students, and also the programming language paradigm of the programs. Thus, the similarity measures employed by the system should be determined by these factors and can be specified by the professor. He/she has the option to specify how the similarities among the student programs will be captured. The system provides an interface for the specification of the particular programming language in which the student programs are implemented, and a knowledgebase of similarity measures that the user would like to include in the analysis of the student programs. Hence, the system provides flexibility in the programming language of the student programs to be analyzed and the similarity measures that the professor wishes to employ. Initial qualitative and quantitative evaluations illustrate a flexible, convenient and cost-effective tool for building plagiarism detectors for effective detection of programs in various imperative and procedural programming languages. The approach also addresses some of the changes that students perform on copied programs which JPlag fails to handle, thus, allowing for improved accuracy in terms of the reduction of false-positives, increasing the chance of catching plagiarized programs. These changes include modification of control structures, use of temporary variables and sub-expressions, in-lining and re-factoring of methods, and redundancy (variables or methods that were not used). Comprehensive tests on other programming languages under various programming language paradigms such as object-oriented, logic and functional languages, considering the different changes that the students employ to copied programs (such as the tests done in JPlag) are also recommended for empirical evaluation. © 2006 IEEE. 2006-12-01T08:00:00Z text text/html https://animorepository.dlsu.edu.ph/faculty_research/469 https://animorepository.dlsu.edu.ph/context/faculty_research/article/1468/type/native/viewcontent Faculty Research Work Animo Repository Plagiarism Programming languages (Electronic computers) Computer Sciences
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Plagiarism
Programming languages (Electronic computers)
Computer Sciences
spellingShingle Plagiarism
Programming languages (Electronic computers)
Computer Sciences
Roxas, Rachel Edita O.
Lim, Nathalie Rose T.
Bautista, Natasja
Automatic generation of plagiarism detection among student programs
description A system for the automatic generation of plagiarism detectors that find similar programs in a set of student programs is presented. Existing plagiarism detectors are either applied to a programming language or a pre-defined set of programming languages. The general purpose one usually employs string matching to perform similarity measures that are based on plagiarism detection among documents in general, and not in programs in particular, thus, losing much of the structure and logic of programs in the process. On the other hand, plagiarism detectors for specific languages only cater to that particular set of languages. This study provides a means for the user to specify the programming language of the student programs to be analyzed. Moreover, an automatic plagiarism detector system must be immune to the transformations that students perform on copied programs. These transformations are usually dependent on several factors namely: the type of programming problems and correspondingly, the complexity of the project to be implemented by the students, and also the programming language paradigm of the programs. Thus, the similarity measures employed by the system should be determined by these factors and can be specified by the professor. He/she has the option to specify how the similarities among the student programs will be captured. The system provides an interface for the specification of the particular programming language in which the student programs are implemented, and a knowledgebase of similarity measures that the user would like to include in the analysis of the student programs. Hence, the system provides flexibility in the programming language of the student programs to be analyzed and the similarity measures that the professor wishes to employ. Initial qualitative and quantitative evaluations illustrate a flexible, convenient and cost-effective tool for building plagiarism detectors for effective detection of programs in various imperative and procedural programming languages. The approach also addresses some of the changes that students perform on copied programs which JPlag fails to handle, thus, allowing for improved accuracy in terms of the reduction of false-positives, increasing the chance of catching plagiarized programs. These changes include modification of control structures, use of temporary variables and sub-expressions, in-lining and re-factoring of methods, and redundancy (variables or methods that were not used). Comprehensive tests on other programming languages under various programming language paradigms such as object-oriented, logic and functional languages, considering the different changes that the students employ to copied programs (such as the tests done in JPlag) are also recommended for empirical evaluation. © 2006 IEEE.
format text
author Roxas, Rachel Edita O.
Lim, Nathalie Rose T.
Bautista, Natasja
author_facet Roxas, Rachel Edita O.
Lim, Nathalie Rose T.
Bautista, Natasja
author_sort Roxas, Rachel Edita O.
title Automatic generation of plagiarism detection among student programs
title_short Automatic generation of plagiarism detection among student programs
title_full Automatic generation of plagiarism detection among student programs
title_fullStr Automatic generation of plagiarism detection among student programs
title_full_unstemmed Automatic generation of plagiarism detection among student programs
title_sort automatic generation of plagiarism detection among student programs
publisher Animo Repository
publishDate 2006
url https://animorepository.dlsu.edu.ph/faculty_research/469
https://animorepository.dlsu.edu.ph/context/faculty_research/article/1468/type/native/viewcontent
_version_ 1751550414029848576