DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL

Patronizing and condescending language (PCL), although often used with good intentions, can lead to discrimination, perpetuate negative stigma, and hinder the inclusion of vulnerable groups. The detection of patronizing and condescending language was highlighted in SemEval 2022 as the fourth task...

Full description

Saved in:

Bibliographic Details
Main Author:	Abraham Sianturi, Gerald
Format:	Final Project
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/85073
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:85073
spelling	id-itb.:850732024-08-19T14:21:33ZDETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL Abraham Sianturi, Gerald Indonesia Final Project patronizing and condescending language, language model, data augmentation INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/85073 Patronizing and condescending language (PCL), although often used with good intentions, can lead to discrimination, perpetuate negative stigma, and hinder the inclusion of vulnerable groups. The detection of patronizing and condescending language was highlighted in SemEval 2022 as the fourth task, consisting of binary classification as well as multilabel PCL classification. This study discusses the development of a PCL detection model using a transformer-based approach due to its ability to capture complex linguistic features. Experimental results show that in the first task, binary classification, the DeBERTa-v3-large model achieved a better f1- score performance than LLaMA-3-8B, with scores of 0.549 and 0.361, respectively. Data augmentation yielded varying performance depending on the subtask and model used, indicating inconsistency in its application. Moreover, using binary relevance, the weighted random sampling strategy effectively enhanced performance on the multilabel task. The performance results using task transformation strategies, such as binary relevance and label powerset, produced macro-average f1-scores of 0.346 and 0.17, respectively, which were unable to surpass the model performance achieved in SemEval 2022 Task 4. text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	Patronizing and condescending language (PCL), although often used with good intentions, can lead to discrimination, perpetuate negative stigma, and hinder the inclusion of vulnerable groups. The detection of patronizing and condescending language was highlighted in SemEval 2022 as the fourth task, consisting of binary classification as well as multilabel PCL classification. This study discusses the development of a PCL detection model using a transformer-based approach due to its ability to capture complex linguistic features. Experimental results show that in the first task, binary classification, the DeBERTa-v3-large model achieved a better f1- score performance than LLaMA-3-8B, with scores of 0.549 and 0.361, respectively. Data augmentation yielded varying performance depending on the subtask and model used, indicating inconsistency in its application. Moreover, using binary relevance, the weighted random sampling strategy effectively enhanced performance on the multilabel task. The performance results using task transformation strategies, such as binary relevance and label powerset, produced macro-average f1-scores of 0.346 and 0.17, respectively, which were unable to surpass the model performance achieved in SemEval 2022 Task 4.
format	Final Project
author	Abraham Sianturi, Gerald
spellingShingle	Abraham Sianturi, Gerald DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL
author_facet	Abraham Sianturi, Gerald
author_sort	Abraham Sianturi, Gerald
title	DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL
title_short	DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL
title_full	DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL
title_fullStr	DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL
title_full_unstemmed	DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL
title_sort	detection of patronizing and condescending language (pcl) using a transformer-based model
url	https://digilib.itb.ac.id/gdl/view/85073
_version_	1823657370105937920

DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL

Similar Items