Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT

The prevalence of cyberbullying on Social Media (SM) platforms has become a significant concern for individuals, organizations, and society as a whole. The early detection and intervention of cyberbullying on social media are critical to mitigating its harmful effects. In recent years, ensemble lear...

Full description

Saved in:

Bibliographic Details
Main Authors:	Muneer, A., Alwadain, A., Ragab, M.G., Alqushaibi, A.
Format:	Article
Published:	Multidisciplinary Digital Publishing Institute (MDPI) 2023
Online Access:	http://scholars.utp.edu.my/id/eprint/37437/ https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168771459&doi=10.3390%2finfo14080467&partnerID=40&md5=baf6f84fd06fd77245d2746f068b91cd
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Teknologi Petronas

id	oai:scholars.utp.edu.my:37437
record_format	eprints
spelling	oai:scholars.utp.edu.my:374372023-10-04T12:44:13Z http://scholars.utp.edu.my/id/eprint/37437/ Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT Muneer, A. Alwadain, A. Ragab, M.G. Alqushaibi, A. The prevalence of cyberbullying on Social Media (SM) platforms has become a significant concern for individuals, organizations, and society as a whole. The early detection and intervention of cyberbullying on social media are critical to mitigating its harmful effects. In recent years, ensemble learning has shown promising results for detecting cyberbullying on social media. This paper presents an ensemble stacking learning approach for detecting cyberbullying on Twitter using a combination of Deep Neural Network methods (DNNs). It also introduces BERT-M, a modified BERT model. The dataset used in this study was collected from Twitter and preprocessed to remove irrelevant information. The feature extraction process involved utilizing word2vec with Continuous Bag of Words (CBOW) to form the weights in the embedding layer. These features were then fed into a convolutional and pooling mechanism, effectively reducing their dimensionality, and capturing the position-invariant characteristics of the offensive words. The validation of the proposed stacked model and BERT-M was performed using well-known model evaluation measures. The stacked model achieved an F1-score of 0.964, precision of 0.950, recall of 0.92 and the detection time reported was 3 min, which surpasses the previously reported accuracy and speed scores for all known NLP detectors of cyberbullying, including standard BERT and BERT-M. The results of the experiment showed that the stacking ensemble learning approach achieved an accuracy of 97.4 in detecting cyberbullying on Twitter dataset and 90.97 on combined Twitter and Facebook dataset. The results demonstrate the effectiveness of the proposed stacking ensemble learning approach in detecting cyberbullying on SM and highlight the importance of combining multiple models for improved performance. Â© 2023 by the authors. Multidisciplinary Digital Publishing Institute (MDPI) 2023 Article NonPeerReviewed Muneer, A. and Alwadain, A. and Ragab, M.G. and Alqushaibi, A. (2023) Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT. Information (Switzerland), 14 (8). ISSN 20782489 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168771459&doi=10.3390%2finfo14080467&partnerID=40&md5=baf6f84fd06fd77245d2746f068b91cd 10.3390/info14080467 10.3390/info14080467 10.3390/info14080467
institution	Universiti Teknologi Petronas
building	UTP Resource Centre
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Teknologi Petronas
content_source	UTP Institutional Repository
url_provider	http://eprints.utp.edu.my/
description	The prevalence of cyberbullying on Social Media (SM) platforms has become a significant concern for individuals, organizations, and society as a whole. The early detection and intervention of cyberbullying on social media are critical to mitigating its harmful effects. In recent years, ensemble learning has shown promising results for detecting cyberbullying on social media. This paper presents an ensemble stacking learning approach for detecting cyberbullying on Twitter using a combination of Deep Neural Network methods (DNNs). It also introduces BERT-M, a modified BERT model. The dataset used in this study was collected from Twitter and preprocessed to remove irrelevant information. The feature extraction process involved utilizing word2vec with Continuous Bag of Words (CBOW) to form the weights in the embedding layer. These features were then fed into a convolutional and pooling mechanism, effectively reducing their dimensionality, and capturing the position-invariant characteristics of the offensive words. The validation of the proposed stacked model and BERT-M was performed using well-known model evaluation measures. The stacked model achieved an F1-score of 0.964, precision of 0.950, recall of 0.92 and the detection time reported was 3 min, which surpasses the previously reported accuracy and speed scores for all known NLP detectors of cyberbullying, including standard BERT and BERT-M. The results of the experiment showed that the stacking ensemble learning approach achieved an accuracy of 97.4 in detecting cyberbullying on Twitter dataset and 90.97 on combined Twitter and Facebook dataset. The results demonstrate the effectiveness of the proposed stacking ensemble learning approach in detecting cyberbullying on SM and highlight the importance of combining multiple models for improved performance. Â© 2023 by the authors.
format	Article
author	Muneer, A. Alwadain, A. Ragab, M.G. Alqushaibi, A.
spellingShingle	Muneer, A. Alwadain, A. Ragab, M.G. Alqushaibi, A. Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
author_facet	Muneer, A. Alwadain, A. Ragab, M.G. Alqushaibi, A.
author_sort	Muneer, A.
title	Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
title_short	Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
title_full	Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
title_fullStr	Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
title_full_unstemmed	Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
title_sort	cyberbullying detection on social media using stacking ensemble learning and enhanced bert
publisher	Multidisciplinary Digital Publishing Institute (MDPI)
publishDate	2023
url	http://scholars.utp.edu.my/id/eprint/37437/ https://www.scopus.com/inward/record.uri?eid=2-s2.0-85168771459&doi=10.3390%2finfo14080467&partnerID=40&md5=baf6f84fd06fd77245d2746f068b91cd
_version_	1779441382861045760

Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT

Similar Items