Spam and scam detection through text analysis

This report summarizes an experimental study to detect spammer and scammer existence in e-commerce platform. The combination studies of analysing business review and rating were used to categorize the text review into two classifications, namely Truthful and Deceptive in which were classified furthe...

全面介紹

Saved in:

書目詳細資料
主要作者:	Prawira, Nathania Anggraini
其他作者:	-
格式:	Final Year Project
語言:	English
出版:	Nanyang Technological University 2020
主題:	Engineering::Electrical and electronic engineering
在線閱讀:	https://hdl.handle.net/10356/140012
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!
機構:	Nanyang Technological University
語言:	English

id	sg-ntu-dr.10356-140012
record_format	dspace
spelling	sg-ntu-dr.10356-1400122023-07-07T18:37:01Z Spam and scam detection through text analysis Prawira, Nathania Anggraini - School of Electrical and Electronic Engineering Chen Li Hui elhchen@ntu.edu.sg Engineering::Electrical and electronic engineering This report summarizes an experimental study to detect spammer and scammer existence in e-commerce platform. The combination studies of analysing business review and rating were used to categorize the text review into two classifications, namely Truthful and Deceptive in which were classified further into Positive and Negative classes. Background knowledge for manual data labeling is discussed later. In this study, a sub-domain of Machine Learning Processing, such as Natural Language Processing (NLP) was implemented for the machine to simulate and classify the given text in human ability degree. The raw corpus collection was predicted with the application of TFIDF Transformer with Count Vectorizer initialization. Furthermore, attention mechanism was believed to pay greater attention to certain factors and help addressing the text focus during the data processing. Hence, the application of attention mechanism may enhance the output prediction accuracy and Transformer model was also considered in this study. The experimental model comparison was made between the integration of a single and multiple classifiers in BERT model. Some programming modules, such as, PyTorch, Scikit-Learn, Keras, spaCy and Natural Language Toolkit (NLTK) were widely used in this experiment. Bachelor of Engineering (Information Engineering and Media) 2020-05-26T04:17:23Z 2020-05-26T04:17:23Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/140012 en A3054-191 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Prawira, Nathania Anggraini Spam and scam detection through text analysis
description	This report summarizes an experimental study to detect spammer and scammer existence in e-commerce platform. The combination studies of analysing business review and rating were used to categorize the text review into two classifications, namely Truthful and Deceptive in which were classified further into Positive and Negative classes. Background knowledge for manual data labeling is discussed later. In this study, a sub-domain of Machine Learning Processing, such as Natural Language Processing (NLP) was implemented for the machine to simulate and classify the given text in human ability degree. The raw corpus collection was predicted with the application of TFIDF Transformer with Count Vectorizer initialization. Furthermore, attention mechanism was believed to pay greater attention to certain factors and help addressing the text focus during the data processing. Hence, the application of attention mechanism may enhance the output prediction accuracy and Transformer model was also considered in this study. The experimental model comparison was made between the integration of a single and multiple classifiers in BERT model. Some programming modules, such as, PyTorch, Scikit-Learn, Keras, spaCy and Natural Language Toolkit (NLTK) were widely used in this experiment.
author2	-
author_facet	- Prawira, Nathania Anggraini
format	Final Year Project
author	Prawira, Nathania Anggraini
author_sort	Prawira, Nathania Anggraini
title	Spam and scam detection through text analysis
title_short	Spam and scam detection through text analysis
title_full	Spam and scam detection through text analysis
title_fullStr	Spam and scam detection through text analysis
title_full_unstemmed	Spam and scam detection through text analysis
title_sort	spam and scam detection through text analysis
publisher	Nanyang Technological University
publishDate	2020
url	https://hdl.handle.net/10356/140012
_version_	1772829140156678144

Spam and scam detection through text analysis

相似書籍