Spam review detection (II)

Online opinions have become an essential part of decision making for millions of web users. However, in a pursuit of profit or success, imposters try to deceive people by opinion spamming to promote or demote a certain targets. The seriousness of the problem has attracted significant attention fro...

Full description

Saved in:
Bibliographic Details
Main Author: Tussupbekov, Yerken.
Other Authors: School of Computer Engineering
Format: Final Year Project
Language:English
Published: 2013
Subjects:
Online Access:http://hdl.handle.net/10356/55113
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Online opinions have become an essential part of decision making for millions of web users. However, in a pursuit of profit or success, imposters try to deceive people by opinion spamming to promote or demote a certain targets. The seriousness of the problem has attracted significant attention from different parties. Big companies develop new approaches of enhancing the filtering systems to detect spam, while spammers come up with new ways to disguise themselves. In this paper, we study two major approaches for spam detection based on linguistic and behavioral features. While most of the past researches focused on either of the methods, in this work we combine the two together in an attempt to find the optimal spam filtering approach. We will take supervised learning approach, as the new ways of detecting training spam will be proposed and put to the test. An in-depth investigation explores new principles for dataset construction that allows us to develop a classifier reaching remarkable 83.4% accuracy in spam filtering. Furthermore, additional enhancement to the developed system will be proposed, that could help to achieve even better performance.