English-Malay Cross-Lingual Emotion Detection In Tweets Using Word Embedding Alignment

The three-phase methodology to address the goals of this study included the construction of the English-Malay cross-lingual word embedding using word embedding alignment, enrichment of the cross-lingual word embedding with sentiment information, and pre-training of the hierarchical attention model s...

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Ying Hao
Format: Thesis
Language:English
Published: 2023
Subjects:
Online Access:http://eprints.usm.my/60433/1/Pages%20from%20LIM%20YING%20HAO%20-%20TESIS.pdf
http://eprints.usm.my/60433/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Sains Malaysia
Language: English
Description
Summary:The three-phase methodology to address the goals of this study included the construction of the English-Malay cross-lingual word embedding using word embedding alignment, enrichment of the cross-lingual word embedding with sentiment information, and pre-training of the hierarchical attention model solely on English tweets. We evaluated our model in two scenarios: zero-shot learning and few-shot learning on 4176 Malay tweets annotated with emotion. We also examined the optimal number of Malay tweets required to finetune the model and the effect of finetuning different layers in our model.