TUPLE OPINION EXTRACTION USING GENERATIVE APPROACH BASED ON PRE-TRAINED MODEL FOR ASPECT BASED SENTIMENT ANALYSIS
The widespread use of social media as a means of conveying public opinion has made sentiment analysis an increasingly promising field of study in the future. Aspect-based sentiment analysis (ASBA) can be used by industry to view criticisms and suggestions from the public regarding aspects of the...
Saved in:
Main Author: | |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/78026 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:78026 |
---|---|
spelling |
id-itb.:780262023-09-15T22:21:19ZTUPLE OPINION EXTRACTION USING GENERATIVE APPROACH BASED ON PRE-TRAINED MODEL FOR ASPECT BASED SENTIMENT ANALYSIS Athallariq Harya P, Danendra Indonesia Theses aspect-based sentiment analysis, post-training INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/78026 The widespread use of social media as a means of conveying public opinion has made sentiment analysis an increasingly promising field of study in the future. Aspect-based sentiment analysis (ASBA) can be used by industry to view criticisms and suggestions from the public regarding aspects of the industrial services and products they offer. ASBA can be subdivided into smaller sub-tasks based on the identification it does. The large number of ASBA sub-tasks makes comparisons between previous studies difficult and also requires model adjustments due to specific designs to complete only certain sub-tasks. In this study a sub-task called opinion tuple extraction was designed which was a combination of the previous ASBA sub-tasks. From an input text the model will predict opinion tuple pairs consisting of (entity, aspect, opinion, sentiment, aspect category) for each aspect in it. Opinion tuple extraction can be accomplished with a generative approach based on pre-trained language models and other ASBA sub- tasks can be accomplished by a simple decoding process of opinion tuple extraction labels. Opinion tuple extraction has the advantage of flexibility in solving various ASBA sub-tasks because it only requires the decoding process of the labels compared to previous research which required adjustments to the model architecture. In this thesis research, we tested the different effect of text pre- processing, pre-trained language models used, post-training, and post-processing with Levenshtein distance on the performance of opinion tuple extraction. The best model has precision performance of 0.836, recall of 0.833, and f1-score of 0.834 which was obtained using standard text cleaning (removing urls, mentions, emojis and hashtags) plus the use of stemming, using the IndoT5 pre- trained language model which was post-trained to improve domain knowledge and task awareness and with post-processing. This model outperforms the performance of the baseline model from previous studies. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
The widespread use of social media as a means of conveying public opinion has
made sentiment analysis an increasingly promising field of study in the future.
Aspect-based sentiment analysis (ASBA) can be used by industry to view criticisms
and suggestions from the public regarding aspects of the industrial services and
products they offer. ASBA can be subdivided into smaller sub-tasks based on the
identification it does. The large number of ASBA sub-tasks makes comparisons
between previous studies difficult and also requires model adjustments due to
specific designs to complete only certain sub-tasks.
In this study a sub-task called opinion tuple extraction was designed which was a
combination of the previous ASBA sub-tasks. From an input text the model will
predict opinion tuple pairs consisting of (entity, aspect, opinion, sentiment, aspect
category) for each aspect in it. Opinion tuple extraction can be accomplished with
a generative approach based on pre-trained language models and other ASBA sub-
tasks can be accomplished by a simple decoding process of opinion tuple extraction
labels. Opinion tuple extraction has the advantage of flexibility in solving various
ASBA sub-tasks because it only requires the decoding process of the labels
compared to previous research which required adjustments to the model
architecture. In this thesis research, we tested the different effect of text pre-
processing, pre-trained language models used, post-training, and post-processing
with Levenshtein distance on the performance of opinion tuple extraction.
The best model has precision performance of 0.836, recall of 0.833, and f1-score
of 0.834 which was obtained using standard text cleaning (removing urls,
mentions, emojis and hashtags) plus the use of stemming, using the IndoT5 pre-
trained language model which was post-trained to improve domain knowledge
and task awareness and with post-processing. This model outperforms the
performance of the baseline model from previous studies. |
format |
Theses |
author |
Athallariq Harya P, Danendra |
spellingShingle |
Athallariq Harya P, Danendra TUPLE OPINION EXTRACTION USING GENERATIVE APPROACH BASED ON PRE-TRAINED MODEL FOR ASPECT BASED SENTIMENT ANALYSIS |
author_facet |
Athallariq Harya P, Danendra |
author_sort |
Athallariq Harya P, Danendra |
title |
TUPLE OPINION EXTRACTION USING GENERATIVE APPROACH BASED ON PRE-TRAINED MODEL FOR ASPECT BASED SENTIMENT ANALYSIS |
title_short |
TUPLE OPINION EXTRACTION USING GENERATIVE APPROACH BASED ON PRE-TRAINED MODEL FOR ASPECT BASED SENTIMENT ANALYSIS |
title_full |
TUPLE OPINION EXTRACTION USING GENERATIVE APPROACH BASED ON PRE-TRAINED MODEL FOR ASPECT BASED SENTIMENT ANALYSIS |
title_fullStr |
TUPLE OPINION EXTRACTION USING GENERATIVE APPROACH BASED ON PRE-TRAINED MODEL FOR ASPECT BASED SENTIMENT ANALYSIS |
title_full_unstemmed |
TUPLE OPINION EXTRACTION USING GENERATIVE APPROACH BASED ON PRE-TRAINED MODEL FOR ASPECT BASED SENTIMENT ANALYSIS |
title_sort |
tuple opinion extraction using generative approach based on pre-trained model for aspect based sentiment analysis |
url |
https://digilib.itb.ac.id/gdl/view/78026 |
_version_ |
1822995600167141376 |