Conceptual similarity and graph-based method for plagiarism detection

Plagiarism is a form of academic misconduct. It has increased rapidly because it is now quick and easy to reach data and information through electronic documents and the Internet. The problem occurs when found documents content is illegal and without permission or citation, this problem is known as...

Full description

Saved in:
Bibliographic Details
Main Authors: Osman, Ahmed Hamza, Salim, Naomie, Binwahlan, Mohammed Salem, Hentably, Hamza, Ali, Albaraa M.
Format: Article
Published: Asian Research Publishing Network (A R P N) 2011
Subjects:
Online Access:http://eprints.utm.my/id/eprint/44810/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
Description
Summary:Plagiarism is a form of academic misconduct. It has increased rapidly because it is now quick and easy to reach data and information through electronic documents and the Internet. The problem occurs when found documents content is illegal and without permission or citation, this problem is known as plagiarism. One of the major challenges is to detect the plagiarism and illegal copy. This paper discusses a new representation method for text documents called text graph-based representation. The proposed method does not represent the content of a text document as a graph only, but also captures the underlying semantic meaning in terms of the relationships among its concepts in order to defeat the difficulty which the traditional plagiarism detection systems face with some kinds of plagiarism such as complicated plagiarism in which users can reword the plagiarized part or replace some words by their synonyms. The experiments have been carried out using PAN-PC-09 standardization of plagiarism detection corpus. The results showed that our method remarkably outperforms the modern methods for plagiarism detection.