An improved plagiarism detection scheme based on semantic role labeling

Plagiarism occurs when the content is copied without permission or citation. One of the contributing factors is that many text documents on the internet are easily copied and accessed. This paper introduces a plagiarism detection technique based on the Semantic Role Labeling (SRL). The technique ana...

Full description

Saved in:
Bibliographic Details
Main Authors: Osman, Ahmed Hamza, Salim, Naomie, Binwahlan, Mohammed Salem, Alteeb, Rihab, Abuobieda, Albaraa
Format: Article
Published: 2012
Subjects:
Online Access:http://eprints.utm.my/id/eprint/46593/
https://dx.doi.org/10.1016/j.asoc.2011.12.021
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
Description
Summary:Plagiarism occurs when the content is copied without permission or citation. One of the contributing factors is that many text documents on the internet are easily copied and accessed. This paper introduces a plagiarism detection technique based on the Semantic Role Labeling (SRL). The technique analyses and compares text based on the semantic allocation for each term inside the sentence. SRL is superior in generating arguments for each sentence semantically. Weighting for each argument generated by SRL to study its behaviour is also introduced in this paper. It was found that not all arguments affect the plagiarism detection process. In addition, experimental results on PAN-PC-09 data sets showed that our method significantly outperforms the modern methods for plagiarism detection in terms of Recall, Precision and F-measure.