Plagiarism detection using N-Gram model

Access is limited to UniMAP community.

Bibliographic Details
Main Author: Muhammad Syahir, Shah Kholl Ajam
Other Authors: Dr. Nik Adilah Hanin Zahri
Format: Learning Object
Language: English
Published: Universiti Malaysia Perlis (UniMAP) 2016
Subjects: Plagiarism; N-Gram model; Plagiarism detection; Plagiarism detection -- Methods
Online Access: http://dspace.unimap.edu.my:80/xmlui/handle/123456789/41827
Institution: Universiti Malaysia Perlis
id my.unimap-41827
record_format dspace
spelling my.unimap-41827 2016-06-02T07:49:06Z Plagiarism detection using N-Gram model Muhammad Syahir, Shah Kholl Ajam Dr. Nik Adilah Hanin Zahri Plagiarism N-Gram model Plagiarism detection Plagiarism detection -- Methods Access is limited to UniMAP community. The vast increase in documents available on the World Wide Web (WWW), and the easy access to them, has led to a serious problem: the use of others' work without giving credit. Although many methods have been developed to detect some forms of plagiarism, such as changes to sentence structure or the replacement of words with synonyms, it is often hard to reveal plagiarism when the copied sentences are deliberately modified. This project proposes a syntactic plagiarism detection algorithm based on 1-gram and 2-gram extraction. The Jaccard similarity coefficient is applied to measure similarity between documents in an English corpus from the engineering field, and the algorithm is implemented in the C programming language. Judged by precision, recall and F-measure, 2-gram extraction showed great potential as a plagiarism detection method. In the comparison with 1-gram extraction, 2-gram extraction achieved 0.983 for precision, 0.380 for recall and 0.548 for F-measure. The Jaccard similarity coefficient combined with the N-gram method is sufficiently suitable for word similarity measurement. In the efficiency measurement, the program calculated word similarity with high stability. 2016-06-02T07:49:06Z 2016-06-02T07:49:06Z 2015-06 Learning Object http://dspace.unimap.edu.my:80/xmlui/handle/123456789/41827 en Universiti Malaysia Perlis (UniMAP) School of Computer and Communication Engineering
institution Universiti Malaysia Perlis
building UniMAP Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Perlis
content_source UniMAP Library Digital Repository
url_provider http://dspace.unimap.edu.my/
language English
topic Plagiarism
N-Gram model
Plagiarism detection
Plagiarism detection -- Methods
spellingShingle Plagiarism
N-Gram model
Plagiarism detection
Plagiarism detection -- Methods
Muhammad Syahir, Shah Kholl Ajam
Plagiarism detection using N-Gram model
description Access is limited to UniMAP community.
author2 Dr. Nik Adilah Hanin Zahri
author_facet Dr. Nik Adilah Hanin Zahri
Muhammad Syahir, Shah Kholl Ajam
format Learning Object
author Muhammad Syahir, Shah Kholl Ajam
author_sort Muhammad Syahir, Shah Kholl Ajam
title Plagiarism detection using N-Gram model
title_short Plagiarism detection using N-Gram model
title_full Plagiarism detection using N-Gram model
title_fullStr Plagiarism detection using N-Gram model
title_full_unstemmed Plagiarism detection using N-Gram model
title_sort plagiarism detection using n-gram model
publisher Universiti Malaysia Perlis (UniMAP)
publishDate 2016
url http://dspace.unimap.edu.my:80/xmlui/handle/123456789/41827
_version_ 1643799819438784512
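
Illustrative sketch (not taken from the thesis): the abstract in this record describes comparing documents by extracting 1-grams and 2-grams and applying the Jaccard similarity coefficient, implemented in C. The code below is one minimal way such a comparison might look. It assumes word-level n-grams, lowercased and split on non-alphanumeric characters, which the record does not specify. All names (GramSet, tokenize, build_set, jaccard_ngram, f_measure) and the size limits are hypothetical. The f_measure helper assumes the usual harmonic-mean definition, 2PR / (P + R), which is consistent with the figures quoted in the abstract.

```c
#include <assert.h>
#include <ctype.h>
#include <string.h>

#define MAX_TOKENS 256  /* assumed cap on words per document */
#define MAX_WORD   32   /* assumed cap on word length, incl. '\0' */
#define MAX_GRAM   128  /* assumed cap on n-gram length, incl. '\0' */

/* A set of distinct word n-grams, each stored as a space-joined string. */
typedef struct {
    char items[MAX_TOKENS][MAX_GRAM];
    int count;
} GramSet;

/* Split text into lowercase alphanumeric words; returns the word count.
   Over-long words are truncated, extra words beyond the cap are ignored. */
static int tokenize(const char *text, char words[][MAX_WORD])
{
    int n = 0, len = 0;
    for (const char *p = text; ; p++) {
        if (*p != '\0' && isalnum((unsigned char)*p)) {
            if (len < MAX_WORD - 1)
                words[n][len++] = (char)tolower((unsigned char)*p);
        } else {
            if (len > 0) {
                words[n][len] = '\0';
                len = 0;
                if (++n == MAX_TOKENS)
                    break;
            }
            if (*p == '\0')
                break;
        }
    }
    return n;
}

static int set_contains(const GramSet *s, const char *gram)
{
    for (int i = 0; i < s->count; i++)
        if (strcmp(s->items[i], gram) == 0)
            return 1;
    return 0;
}

/* Fill `out` with the distinct word n-grams of `text`. */
static void build_set(const char *text, int n, GramSet *out)
{
    static char words[MAX_TOKENS][MAX_WORD];
    int nwords = tokenize(text, words);
    out->count = 0;
    for (int i = 0; i + n <= nwords; i++) {
        char gram[MAX_GRAM] = "";
        for (int j = 0; j < n; j++) {
            if (j > 0)
                strncat(gram, " ", sizeof gram - strlen(gram) - 1);
            strncat(gram, words[i + j], sizeof gram - strlen(gram) - 1);
        }
        if (!set_contains(out, gram))
            strcpy(out->items[out->count++], gram);
    }
}

/* Jaccard coefficient |A n B| / |A u B| over the word n-gram sets of two
   documents. Returns 0.0 when both sets are empty. Uses static buffers,
   so it is not thread-safe. */
double jaccard_ngram(const char *doc_a, const char *doc_b, int n)
{
    static GramSet a, b;
    assert(n >= 1);
    build_set(doc_a, n, &a);
    build_set(doc_b, n, &b);
    int common = 0;
    for (int i = 0; i < a.count; i++)
        if (set_contains(&b, a.items[i]))
            common++;
    int uni = a.count + b.count - common;
    return uni == 0 ? 0.0 : (double)common / uni;
}

/* F-measure as the harmonic mean of precision and recall (assumed form). */
double f_measure(double precision, double recall)
{
    double sum = precision + recall;
    return sum == 0.0 ? 0.0 : 2.0 * precision * recall / sum;
}
```

For example, comparing "The quick brown fox" with "the quick red fox" gives 3 shared 1-grams out of 5 distinct ones (0.6), but only 1 shared 2-gram out of 5 (0.2), which shows how 2-grams penalise word substitutions more strongly. Calling f_measure(0.983, 0.380) evaluates to about 0.548, in line with the value reported in the abstract.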