Comparative analysis for toxic comment classification

Due to the development in e-commerce, social media channels like twitter and Facebook flood of Information is poured into the Internet, while this is going to affect the quality of the people since these texts may include many levels of toxicity which results in online harassment, bullying, etc., Th...

Full description

Saved in:
Bibliographic Details
Main Authors: Ismael, Ahmed Jameel, Abdul Rahim, Herlina, Boppana, Udaya Mouni, Yadav, Arpit, Zafar, Afia
Format: Article
Language:English
Published: Malaysian Society for Computed Tomography & Imaging Technology (MyCT) 2021
Subjects:
Online Access:http://eprints.utm.my/id/eprint/97265/1/HerlinaAbdulRahim2021_ComparativeAnalysisforToxicComment.pdf
http://eprints.utm.my/id/eprint/97265/
http://www.tssa.my/index.php/jtssa/article/view/150
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
Language: English
Description
Summary:Due to the development in e-commerce, social media channels like twitter and Facebook flood of Information is poured into the Internet, while this is going to affect the quality of the people since these texts may include many levels of toxicity which results in online harassment, bullying, etc., There is paramount importance to monitor this data. Research and Industrial communities are trying to build an effective model to classify these toxic comments. Still, these attempts are not reaching sufficient levels. Luckily In recent days, there are advancements in hardware and data management techniques like Big Data that encourages computational approaches like Deep learning approaches which can improve the text classification. This paper focuses primarily on, various approaches for short toxic comment classification and the comparative experiments are set to evaluate their accuracy based on Area Under Curve (AUC) metric from “Wikipedia Talk Page Comments annotated with toxicity reasons” Dataset. Based on our experimental analysis, bidirectional GRU performed better as compared to other existing classification algorithms. These findings will be used as the benchmarking results for improvement of Toxic Comment Classification.