Spam review detection (II)
Online opinions have become an essential part of decision making for millions of web users. However, in a pursuit of profit or success, imposters try to deceive people by opinion spamming to promote or demote a certain targets. The seriousness of the problem has attracted significant attention fro...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/55113 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Online opinions have become an essential part of decision making for millions of web users. However, in a pursuit of profit or success, imposters try to deceive people by opinion spamming to promote or demote a certain targets. The seriousness of the problem has attracted significant attention from different parties. Big companies develop new approaches of enhancing the filtering systems to detect spam, while spammers come up with new ways to disguise themselves.
In this paper, we study two major approaches for spam detection based on linguistic and behavioral features. While most of the past researches focused on either of the methods, in this work we combine the two together in an attempt to find the optimal spam filtering approach. We will take supervised learning approach, as the new ways of detecting training spam will be proposed and put to the test.
An in-depth investigation explores new principles for dataset construction that allows us to develop a classifier reaching remarkable 83.4% accuracy in spam filtering. Furthermore, additional enhancement to the developed system will be proposed, that could help to achieve even better performance. |
---|