Building an enhanced resource for Indonesian sentiment analysis

This study aims at constructing an Indonesian sentiment resource and improving its accuracy through the study of emotion research. This research comprises four different interconnected studies to unveil the formula of creating a good and accurate sentiment resource for Indonesian: (1) Indonesian emo...

Full description

Saved in:
Bibliographic Details
Main Author: Yosephine
Other Authors: Erik Cambria
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/162004
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-162004
record_format dspace
spelling sg-ntu-dr.10356-1620042023-03-11T20:15:11Z Building an enhanced resource for Indonesian sentiment analysis Yosephine Erik Cambria Ng Bee Chin School of Humanities MBCNg@ntu.edu.sg, cambria@ntu.edu.sg Humanities::Linguistics This study aims at constructing an Indonesian sentiment resource and improving its accuracy through the study of emotion research. This research comprises four different interconnected studies to unveil the formula of creating a good and accurate sentiment resource for Indonesian: (1) Indonesian emotion lexicon, (2) Emotion and Emotion Families in Indonesian, (3) Crosslinguistic comparison: Indonesian emotion profile, (4) Indonesian SenticNet. Here, I compiled the first Indonesian emotion lexicon created without any translation. This lexicon is equipped by the affective dimensional ratings of intensity and valence. The influencing factors of how emotion is evaluated (e.g. gender and language) were carefully observed. I also conducted the crosslinguistic comparison with other languages, especially English to highlight the Indonesian emotion profile. Despite the wide-spread claim on the universality of basic emotions, I discovered intriguing differences between the two languages. The results were then put into practice for the purpose of revamping and localizing a state-of-the-art sentiment resource SenticNet for Indonesian. In its early stage, this resource successfully achieved a satisfactory result. When tested against various datasets, it was able to predict the sentiments in a text with almost 75% of accuracy on average. The end product of Indonesian SenticNet will be mostly valuable for companies and brands that are conducting market research for their products in Indonesia. It can aid them in getting insights into their user/customer experience (UX research) and making right decisions for their marketing strategy in a faster and more accurate way. Doctor of Philosophy 2022-09-29T08:34:39Z 2022-09-29T08:34:39Z 2022 Thesis-Doctor of Philosophy Yosephine (2022). Building an enhanced resource for Indonesian sentiment analysis. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/162004 https://hdl.handle.net/10356/162004 10.32657/10356/162004 en MOE Academic Research Fund Tier 1 - MICE - A Multilingual Corpus of Emotion Expressions of Malay, Indonesian, Chinese And English (04MNP000096C420)​ This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Humanities::Linguistics
spellingShingle Humanities::Linguistics
Yosephine
Building an enhanced resource for Indonesian sentiment analysis
description This study aims at constructing an Indonesian sentiment resource and improving its accuracy through the study of emotion research. This research comprises four different interconnected studies to unveil the formula of creating a good and accurate sentiment resource for Indonesian: (1) Indonesian emotion lexicon, (2) Emotion and Emotion Families in Indonesian, (3) Crosslinguistic comparison: Indonesian emotion profile, (4) Indonesian SenticNet. Here, I compiled the first Indonesian emotion lexicon created without any translation. This lexicon is equipped by the affective dimensional ratings of intensity and valence. The influencing factors of how emotion is evaluated (e.g. gender and language) were carefully observed. I also conducted the crosslinguistic comparison with other languages, especially English to highlight the Indonesian emotion profile. Despite the wide-spread claim on the universality of basic emotions, I discovered intriguing differences between the two languages. The results were then put into practice for the purpose of revamping and localizing a state-of-the-art sentiment resource SenticNet for Indonesian. In its early stage, this resource successfully achieved a satisfactory result. When tested against various datasets, it was able to predict the sentiments in a text with almost 75% of accuracy on average. The end product of Indonesian SenticNet will be mostly valuable for companies and brands that are conducting market research for their products in Indonesia. It can aid them in getting insights into their user/customer experience (UX research) and making right decisions for their marketing strategy in a faster and more accurate way.
author2 Erik Cambria
author_facet Erik Cambria
Yosephine
format Thesis-Doctor of Philosophy
author Yosephine
author_sort Yosephine
title Building an enhanced resource for Indonesian sentiment analysis
title_short Building an enhanced resource for Indonesian sentiment analysis
title_full Building an enhanced resource for Indonesian sentiment analysis
title_fullStr Building an enhanced resource for Indonesian sentiment analysis
title_full_unstemmed Building an enhanced resource for Indonesian sentiment analysis
title_sort building an enhanced resource for indonesian sentiment analysis
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/162004
_version_ 1761781317647728640