A unified plagiarism detection framework

With the rapid growth of information technology, Internet and digital libraries have been developing so fast that illegal copying of documents is becoming easier and more popular. A challenging question is how to identify documents with similar content which are candidate of plagiarism. There are se...

Full description

Saved in:
Bibliographic Details
Main Authors: Nguyen, Xuan Toi, Nguyen, Viet Hung, Pham, Bao Son
Format: Article
Language:English
Published: H. : ĐHQGHN 2017
Subjects:
Online Access:http://repository.vnu.edu.vn/handle/VNU_123/56506
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Vietnam National University, Hanoi
Language: English
Description
Summary:With the rapid growth of information technology, Internet and digital libraries have been developing so fast that illegal copying of documents is becoming easier and more popular. A challenging question is how to identify documents with similar content which are candidate of plagiarism. There are several approaches for estimating the similarity between two documents and each has its own advantages and disadvantages. An approach may be effective in one domain but may not work in others. In this paper, we propose a unified plagiarism detection framework that can identify which approach works most effectively in a new domain. Experimental results on three different corpora for different languages have demonstrated the effectiveness of our approach. '