Spam review detection

Bibliographic Details
Main Author:	Tan, Hui Min.
Other Authors:	School of Computer Engineering
Format:	Final Year Project
Language:	English
Published:	2013
Subjects:	DRNTU::Engineering::Computer science and engineering
Online Access:	http://hdl.handle.net/10356/54968

Description
Summary:	As more people depend heavily on the information presented on the web, user generated content such as reviews can easily influence the purchase decisions of other consumers. As a result, fake reviews are frequently posted to popular online review websites to mislead consumers. Several studies have addressed spam review detection. However, most of this research focuses on specific review websites such as Amazon or Yelp. This raises the question of whether the features suggested in these research papers perform equally well in other domains such as TripAdvisor. In this project, a series of progressive phases was employed to implement an algorithm that detects these spam reviews with reference to the suggested set of features and procedures. In total, three types of features were chosen for the study: N-Grams features, review centric features and user behavior features. In the experiments, N-Grams features generally yield better accuracy than review centric features, with the difference in accuracy ranging from 10% to 30%. User behavior features consistently outperform the other two sets of features, with an average accuracy of 60% and above. Despite the limitations of this project, it is evident from the findings that the features relating to user behavior give the best accuracy of the three sets, which suggests that they are more versatile.
