Building an enhanced resource for Indonesian sentiment analysis
This study aims at constructing an Indonesian sentiment resource and improving its accuracy through the study of emotion research. This research comprises four different interconnected studies to unveil the formula of creating a good and accurate sentiment resource for Indonesian: (1) Indonesian emo...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/162004 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-162004 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1620042023-03-11T20:15:11Z Building an enhanced resource for Indonesian sentiment analysis Yosephine Erik Cambria Ng Bee Chin School of Humanities MBCNg@ntu.edu.sg, cambria@ntu.edu.sg Humanities::Linguistics This study aims at constructing an Indonesian sentiment resource and improving its accuracy through the study of emotion research. This research comprises four different interconnected studies to unveil the formula of creating a good and accurate sentiment resource for Indonesian: (1) Indonesian emotion lexicon, (2) Emotion and Emotion Families in Indonesian, (3) Crosslinguistic comparison: Indonesian emotion profile, (4) Indonesian SenticNet. Here, I compiled the first Indonesian emotion lexicon created without any translation. This lexicon is equipped by the affective dimensional ratings of intensity and valence. The influencing factors of how emotion is evaluated (e.g. gender and language) were carefully observed. I also conducted the crosslinguistic comparison with other languages, especially English to highlight the Indonesian emotion profile. Despite the wide-spread claim on the universality of basic emotions, I discovered intriguing differences between the two languages. The results were then put into practice for the purpose of revamping and localizing a state-of-the-art sentiment resource SenticNet for Indonesian. In its early stage, this resource successfully achieved a satisfactory result. When tested against various datasets, it was able to predict the sentiments in a text with almost 75% of accuracy on average. The end product of Indonesian SenticNet will be mostly valuable for companies and brands that are conducting market research for their products in Indonesia. It can aid them in getting insights into their user/customer experience (UX research) and making right decisions for their marketing strategy in a faster and more accurate way. Doctor of Philosophy 2022-09-29T08:34:39Z 2022-09-29T08:34:39Z 2022 Thesis-Doctor of Philosophy Yosephine (2022). Building an enhanced resource for Indonesian sentiment analysis. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/162004 https://hdl.handle.net/10356/162004 10.32657/10356/162004 en MOE Academic Research Fund Tier 1 - MICE - A Multilingual Corpus of Emotion Expressions of Malay, Indonesian, Chinese And English (04MNP000096C420) This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Humanities::Linguistics |
spellingShingle |
Humanities::Linguistics Yosephine Building an enhanced resource for Indonesian sentiment analysis |
description |
This study aims at constructing an Indonesian sentiment resource and improving its accuracy through the study of emotion research. This research comprises four different interconnected studies to unveil the formula of creating a good and accurate sentiment resource for Indonesian: (1) Indonesian emotion lexicon, (2) Emotion and Emotion Families in Indonesian, (3) Crosslinguistic comparison: Indonesian emotion profile, (4) Indonesian SenticNet. Here, I compiled the first Indonesian emotion lexicon created without any translation. This lexicon is equipped by the affective dimensional ratings of intensity and valence. The influencing factors of how emotion is evaluated (e.g. gender and language) were carefully observed. I also conducted the crosslinguistic comparison with other languages, especially English to highlight the Indonesian emotion profile. Despite the wide-spread claim on the universality of basic emotions, I discovered intriguing differences between the two languages. The results were then put into practice for the purpose of revamping and localizing a state-of-the-art sentiment resource SenticNet for Indonesian. In its early stage, this resource successfully achieved a satisfactory result. When tested against various datasets, it was able to predict the sentiments in a text with almost 75% of accuracy on average. The end product of Indonesian SenticNet will be mostly valuable for companies and brands that are conducting market research for their products in Indonesia. It can aid them in getting insights into their user/customer experience (UX research) and making right decisions for their marketing strategy in a faster and more accurate way. |
author2 |
Erik Cambria |
author_facet |
Erik Cambria Yosephine |
format |
Thesis-Doctor of Philosophy |
author |
Yosephine |
author_sort |
Yosephine |
title |
Building an enhanced resource for Indonesian sentiment analysis |
title_short |
Building an enhanced resource for Indonesian sentiment analysis |
title_full |
Building an enhanced resource for Indonesian sentiment analysis |
title_fullStr |
Building an enhanced resource for Indonesian sentiment analysis |
title_full_unstemmed |
Building an enhanced resource for Indonesian sentiment analysis |
title_sort |
building an enhanced resource for indonesian sentiment analysis |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/162004 |
_version_ |
1761781317647728640 |