DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL
Patronizing and condescending language (PCL), although often used with good intentions, can lead to discrimination, perpetuate negative stigma, and hinder the inclusion of vulnerable groups. The detection of patronizing and condescending language was highlighted in SemEval 2022 as the fourth task...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/85073 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:85073 |
---|---|
spelling |
id-itb.:850732024-08-19T14:21:33ZDETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL Abraham Sianturi, Gerald Indonesia Final Project patronizing and condescending language, language model, data augmentation INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/85073 Patronizing and condescending language (PCL), although often used with good intentions, can lead to discrimination, perpetuate negative stigma, and hinder the inclusion of vulnerable groups. The detection of patronizing and condescending language was highlighted in SemEval 2022 as the fourth task, consisting of binary classification as well as multilabel PCL classification. This study discusses the development of a PCL detection model using a transformer-based approach due to its ability to capture complex linguistic features. Experimental results show that in the first task, binary classification, the DeBERTa-v3-large model achieved a better f1- score performance than LLaMA-3-8B, with scores of 0.549 and 0.361, respectively. Data augmentation yielded varying performance depending on the subtask and model used, indicating inconsistency in its application. Moreover, using binary relevance, the weighted random sampling strategy effectively enhanced performance on the multilabel task. The performance results using task transformation strategies, such as binary relevance and label powerset, produced macro-average f1-scores of 0.346 and 0.17, respectively, which were unable to surpass the model performance achieved in SemEval 2022 Task 4. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Patronizing and condescending language (PCL), although often used with good intentions, can
lead to discrimination, perpetuate negative stigma, and hinder the inclusion of vulnerable groups.
The detection of patronizing and condescending language was highlighted in SemEval 2022 as the
fourth task, consisting of binary classification as well as multilabel PCL classification. This study
discusses the development of a PCL detection model using a transformer-based approach due to
its ability to capture complex linguistic features. Experimental results show that in the first task,
binary classification, the DeBERTa-v3-large model achieved a better f1- score performance than
LLaMA-3-8B, with scores of 0.549 and 0.361, respectively. Data augmentation yielded varying
performance depending on the subtask and model used, indicating inconsistency in its application.
Moreover, using binary relevance, the weighted random sampling strategy effectively enhanced
performance on the multilabel task. The performance results using task transformation strategies,
such as binary relevance and label powerset, produced macro-average f1-scores of 0.346 and 0.17,
respectively, which were unable to surpass the model performance achieved in SemEval 2022 Task
4. |
format |
Final Project |
author |
Abraham Sianturi, Gerald |
spellingShingle |
Abraham Sianturi, Gerald DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL |
author_facet |
Abraham Sianturi, Gerald |
author_sort |
Abraham Sianturi, Gerald |
title |
DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL |
title_short |
DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL |
title_full |
DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL |
title_fullStr |
DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL |
title_full_unstemmed |
DETECTION OF PATRONIZING AND CONDESCENDING LANGUAGE (PCL) USING A TRANSFORMER-BASED MODEL |
title_sort |
detection of patronizing and condescending language (pcl) using a transformer-based model |
url |
https://digilib.itb.ac.id/gdl/view/85073 |
_version_ |
1822283020005015552 |