Determining Linguistic Features of Hate Speech from 2016 Philippine Election-Related Tweets

Hate speech is characterized as a deliberate attack directed towards a group of people motivated by aspects of the group, s identity. There is a growing interest in solutions involving automatic hate speech detection in response to the proliferation of hate speech. However., most automatic hate spee...

Full description

Saved in:

Bibliographic Details
Main Authors:	Enriquez, Raphael Christen K., Estuar, Ma. Regina Justina
Format:	text
Published:	Archīum Ateneo 2023
Subjects:	Hate Speech Detection Linguistic Features Machine Learning Natural Language Processing Text Classification Computer Engineering Electrical and Computer Engineering Engineering
Online Access:	https://archium.ateneo.edu/discs-faculty-pubs/374 https://doi.org/10.1109/ITIKD56332.2023.10100008
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Ateneo De Manila University

id	ph-ateneo-arc.discs-faculty-pubs-1374
record_format	eprints
spelling	ph-ateneo-arc.discs-faculty-pubs-13742024-02-21T03:41:19Z Determining Linguistic Features of Hate Speech from 2016 Philippine Election-Related Tweets Enriquez, Raphael Christen K. Estuar, Ma. Regina Justina Hate speech is characterized as a deliberate attack directed towards a group of people motivated by aspects of the group, s identity. There is a growing interest in solutions involving automatic hate speech detection in response to the proliferation of hate speech. However., most automatic hate speech detection tools are designed for high-resource languages such as English which results in challenges in detecting hate speech in low-resource languages such as Filipino. Social media users within the Philippines predominantly use native language or a code-switched variation such as Taglish as the preferred linguistic style in online communication. This study seeks to determine linguistic features that characterize hate speech in the Philippine setting. The study characterizes hate speech using the following features: bilingual., part-of-speech., and psycho-linguistic features. Feature extraction was facilitated via fastText., NLTK (Natural Language Toolkit)., and LIWC (Linguistic Inquiry and Word Count) from an existing Filipino hate speech corpus collected during the 2016 Philippine Presidential Elections. Results show that hate speech from this dataset has significantly different features from non-hate speech. Specifically., the distinct features include language dominance., frequency of code-switching., frequency of parts-of-speech., and LIWC's summary variables and psychological process. These features which have been demonstrated to be statistically different between hate speech and non-hate speech can be leveraged to augment existing hate speech detection models., particularly within low-resource language contexts. 2023-01-01T08:00:00Z text https://archium.ateneo.edu/discs-faculty-pubs/374 https://doi.org/10.1109/ITIKD56332.2023.10100008 Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo Hate Speech Detection Linguistic Features Machine Learning Natural Language Processing Text Classification Computer Engineering Electrical and Computer Engineering Engineering
institution	Ateneo De Manila University
building	Ateneo De Manila University Library
continent	Asia
country	Philippines Philippines
content_provider	Ateneo De Manila University Library
collection	archium.Ateneo Institutional Repository
topic	Hate Speech Detection Linguistic Features Machine Learning Natural Language Processing Text Classification Computer Engineering Electrical and Computer Engineering Engineering
spellingShingle	Hate Speech Detection Linguistic Features Machine Learning Natural Language Processing Text Classification Computer Engineering Electrical and Computer Engineering Engineering Enriquez, Raphael Christen K. Estuar, Ma. Regina Justina Determining Linguistic Features of Hate Speech from 2016 Philippine Election-Related Tweets
description	Hate speech is characterized as a deliberate attack directed towards a group of people motivated by aspects of the group, s identity. There is a growing interest in solutions involving automatic hate speech detection in response to the proliferation of hate speech. However., most automatic hate speech detection tools are designed for high-resource languages such as English which results in challenges in detecting hate speech in low-resource languages such as Filipino. Social media users within the Philippines predominantly use native language or a code-switched variation such as Taglish as the preferred linguistic style in online communication. This study seeks to determine linguistic features that characterize hate speech in the Philippine setting. The study characterizes hate speech using the following features: bilingual., part-of-speech., and psycho-linguistic features. Feature extraction was facilitated via fastText., NLTK (Natural Language Toolkit)., and LIWC (Linguistic Inquiry and Word Count) from an existing Filipino hate speech corpus collected during the 2016 Philippine Presidential Elections. Results show that hate speech from this dataset has significantly different features from non-hate speech. Specifically., the distinct features include language dominance., frequency of code-switching., frequency of parts-of-speech., and LIWC's summary variables and psychological process. These features which have been demonstrated to be statistically different between hate speech and non-hate speech can be leveraged to augment existing hate speech detection models., particularly within low-resource language contexts.
format	text
author	Enriquez, Raphael Christen K. Estuar, Ma. Regina Justina
author_facet	Enriquez, Raphael Christen K. Estuar, Ma. Regina Justina
author_sort	Enriquez, Raphael Christen K.
title	Determining Linguistic Features of Hate Speech from 2016 Philippine Election-Related Tweets
title_short	Determining Linguistic Features of Hate Speech from 2016 Philippine Election-Related Tweets
title_full	Determining Linguistic Features of Hate Speech from 2016 Philippine Election-Related Tweets
title_fullStr	Determining Linguistic Features of Hate Speech from 2016 Philippine Election-Related Tweets
title_full_unstemmed	Determining Linguistic Features of Hate Speech from 2016 Philippine Election-Related Tweets
title_sort	determining linguistic features of hate speech from 2016 philippine election-related tweets
publisher	Archīum Ateneo
publishDate	2023
url	https://archium.ateneo.edu/discs-faculty-pubs/374 https://doi.org/10.1109/ITIKD56332.2023.10100008
_version_	1792202612802584576

Determining Linguistic Features of Hate Speech from 2016 Philippine Election-Related Tweets

Similar Items