Prompting for multimodal hateful meme classification

Hateful meme classification is a challenging multimodal task that requires complex reasoning and contextual background knowledge. Ideally, we could leverage an explicit external knowledge base to supplement contextual and cultural information in hateful memes. However, there is no known explicit ext...

Full description

Saved in:
Bibliographic Details
Main Authors: CAO, Rui, LEE, Roy Ka-Wei, CHONG, Wen-Haw, JIANG, Jing
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7671
https://ink.library.smu.edu.sg/context/sis_research/article/8674/viewcontent/Prompting_for_multimodal_hateful_meme_classification.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-8674
record_format dspace
spelling sg-smu-ink.sis_research-86742023-01-10T03:39:45Z Prompting for multimodal hateful meme classification CAO, Rui LEE, Roy Ka-Wei CHONG, Wen-Haw JIANG, Jing Hateful meme classification is a challenging multimodal task that requires complex reasoning and contextual background knowledge. Ideally, we could leverage an explicit external knowledge base to supplement contextual and cultural information in hateful memes. However, there is no known explicit external knowledge base that could provide such hate speech contextual information. To address this gap, we propose PromptHate, a simple yet effective prompt-based model that prompts pre-trained language models (PLMs) for hateful meme classification. Specifically, we construct simple prompts and provide a few in-context examples to exploit the implicit knowledge in the pretrained RoBERTa language model for hateful meme classification. We conduct extensive experiments on two publicly available hateful and offensive meme datasets. Our experimental results show that PromptHate is able to achieve a high AUC of 90.96, outperforming state-ofthe-art baselines on the hateful meme classification task. We also perform fine-grained analyses and case studies on various prompt settings and demonstrate the effectiveness of the prompts on hateful meme classification. 2022-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/7671 https://ink.library.smu.edu.sg/context/sis_research/article/8674/viewcontent/Prompting_for_multimodal_hateful_meme_classification.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Databases and Information Systems Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Artificial Intelligence and Robotics
Databases and Information Systems
Graphics and Human Computer Interfaces
spellingShingle Artificial Intelligence and Robotics
Databases and Information Systems
Graphics and Human Computer Interfaces
CAO, Rui
LEE, Roy Ka-Wei
CHONG, Wen-Haw
JIANG, Jing
Prompting for multimodal hateful meme classification
description Hateful meme classification is a challenging multimodal task that requires complex reasoning and contextual background knowledge. Ideally, we could leverage an explicit external knowledge base to supplement contextual and cultural information in hateful memes. However, there is no known explicit external knowledge base that could provide such hate speech contextual information. To address this gap, we propose PromptHate, a simple yet effective prompt-based model that prompts pre-trained language models (PLMs) for hateful meme classification. Specifically, we construct simple prompts and provide a few in-context examples to exploit the implicit knowledge in the pretrained RoBERTa language model for hateful meme classification. We conduct extensive experiments on two publicly available hateful and offensive meme datasets. Our experimental results show that PromptHate is able to achieve a high AUC of 90.96, outperforming state-ofthe-art baselines on the hateful meme classification task. We also perform fine-grained analyses and case studies on various prompt settings and demonstrate the effectiveness of the prompts on hateful meme classification.
format text
author CAO, Rui
LEE, Roy Ka-Wei
CHONG, Wen-Haw
JIANG, Jing
author_facet CAO, Rui
LEE, Roy Ka-Wei
CHONG, Wen-Haw
JIANG, Jing
author_sort CAO, Rui
title Prompting for multimodal hateful meme classification
title_short Prompting for multimodal hateful meme classification
title_full Prompting for multimodal hateful meme classification
title_fullStr Prompting for multimodal hateful meme classification
title_full_unstemmed Prompting for multimodal hateful meme classification
title_sort prompting for multimodal hateful meme classification
publisher Institutional Knowledge at Singapore Management University
publishDate 2022
url https://ink.library.smu.edu.sg/sis_research/7671
https://ink.library.smu.edu.sg/context/sis_research/article/8674/viewcontent/Prompting_for_multimodal_hateful_meme_classification.pdf
_version_ 1770576400231694336