Plagiarism detection through internet using hybrid artificial neural network and support vectors machine

Currently, most of the plagiarism detections are using similarity measurement techniques. Basically, a pair of similar sentences describes the same idea. However, not all like that, there are also sentences that are similar but have opposite meanings. This is one problem that is not easily solved by...

Full description

Saved in:
Bibliographic Details
Main Authors: Selamat, Ali, Subroto, Imam Much Ibnu
Format: Article
Language:English
Published: Universitas Ahmad Dahlan 2014
Subjects:
Online Access:http://eprints.utm.my/id/eprint/54506/1/AliSelamat2014_Plagiarismdetectionthroughinternet.pdf
http://eprints.utm.my/id/eprint/54506/
http://dx.doi.org/10.12928/telkomnika.v12i1.4
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
Language: English
id my.utm.54506
record_format eprints
spelling my.utm.545062018-08-12T03:56:10Z http://eprints.utm.my/id/eprint/54506/ Plagiarism detection through internet using hybrid artificial neural network and support vectors machine Selamat, Ali Subroto, Imam Much Ibnu QA75 Electronic computers. Computer science Currently, most of the plagiarism detections are using similarity measurement techniques. Basically, a pair of similar sentences describes the same idea. However, not all like that, there are also sentences that are similar but have opposite meanings. This is one problem that is not easily solved by use of the technique similarity. Determination of dubious value similarity threshold on similarity method is another problem. The plagiarism threshold was adjustable, but it means uncertainty. Another problem, although the rules of plagiarism can be understood together but in practice, some people have a different opinion in determining a document, whether or not classified as plagiarism. Of the three problems, a statistical approach could possibly be the most appropriate solution. Machine learning methods like k-nearest neighbors (KNN), support vector machine (SVM), artificial neural networks (ANN) is a technique that is commonly used in solving the problem based on statistical data. This method of learning process based on statistical data to be smart resembling intelligence experts. In this case, plagiarism is data that has been validated by experts. This paper offers a hybrid approach of SVM method for detecting plagiarism. The data collection method in this work using an Internet search to ensure that a document is in the detection is up-to-date. The measurement results based on accuracy, precision and recall show that the hybrid machine learning does not always result in better performance. There is no better and vice versa. Overall testing of the four hybrid combinations concluded that the hybrid ANN-SVM method is the best performance in the case of plagiarism Universitas Ahmad Dahlan 2014 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/54506/1/AliSelamat2014_Plagiarismdetectionthroughinternet.pdf Selamat, Ali and Subroto, Imam Much Ibnu (2014) Plagiarism detection through internet using hybrid artificial neural network and support vectors machine. Telkomnika (Telecommunication Computing Electronics and Control), 12 (1). pp. 209-218. ISSN 1693-6930 http://dx.doi.org/10.12928/telkomnika.v12i1.4 DOI: 10.12928/TELKOMNIKA.v12i1.4
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Selamat, Ali
Subroto, Imam Much Ibnu
Plagiarism detection through internet using hybrid artificial neural network and support vectors machine
description Currently, most of the plagiarism detections are using similarity measurement techniques. Basically, a pair of similar sentences describes the same idea. However, not all like that, there are also sentences that are similar but have opposite meanings. This is one problem that is not easily solved by use of the technique similarity. Determination of dubious value similarity threshold on similarity method is another problem. The plagiarism threshold was adjustable, but it means uncertainty. Another problem, although the rules of plagiarism can be understood together but in practice, some people have a different opinion in determining a document, whether or not classified as plagiarism. Of the three problems, a statistical approach could possibly be the most appropriate solution. Machine learning methods like k-nearest neighbors (KNN), support vector machine (SVM), artificial neural networks (ANN) is a technique that is commonly used in solving the problem based on statistical data. This method of learning process based on statistical data to be smart resembling intelligence experts. In this case, plagiarism is data that has been validated by experts. This paper offers a hybrid approach of SVM method for detecting plagiarism. The data collection method in this work using an Internet search to ensure that a document is in the detection is up-to-date. The measurement results based on accuracy, precision and recall show that the hybrid machine learning does not always result in better performance. There is no better and vice versa. Overall testing of the four hybrid combinations concluded that the hybrid ANN-SVM method is the best performance in the case of plagiarism
format Article
author Selamat, Ali
Subroto, Imam Much Ibnu
author_facet Selamat, Ali
Subroto, Imam Much Ibnu
author_sort Selamat, Ali
title Plagiarism detection through internet using hybrid artificial neural network and support vectors machine
title_short Plagiarism detection through internet using hybrid artificial neural network and support vectors machine
title_full Plagiarism detection through internet using hybrid artificial neural network and support vectors machine
title_fullStr Plagiarism detection through internet using hybrid artificial neural network and support vectors machine
title_full_unstemmed Plagiarism detection through internet using hybrid artificial neural network and support vectors machine
title_sort plagiarism detection through internet using hybrid artificial neural network and support vectors machine
publisher Universitas Ahmad Dahlan
publishDate 2014
url http://eprints.utm.my/id/eprint/54506/1/AliSelamat2014_Plagiarismdetectionthroughinternet.pdf
http://eprints.utm.my/id/eprint/54506/
http://dx.doi.org/10.12928/telkomnika.v12i1.4
_version_ 1643653525028208640