OPINION TRIPLET EXTRACTION FOR ASPECT BASED SENTIMENT ANALYSIS USING GRID TAGGING SCHEME
Aspect-based sentiment analysis (ASBA) is one of the variations of sentiment analysis that can be used by companies to find out public opinion in detail on aspects related to the products or services provided. There are several subtasks under ASBA, namely aspect/sentiment term extraction, aspect...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/56237 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:56237 |
---|---|
spelling |
id-itb.:562372021-06-21T16:21:21ZOPINION TRIPLET EXTRACTION FOR ASPECT BASED SENTIMENT ANALYSIS USING GRID TAGGING SCHEME Pradipta Wirawan, Gama Indonesia Final Project ASBA, opinion triplet extraction, aspect sentiment triplet extraction, grid tagging scheme INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/56237 Aspect-based sentiment analysis (ASBA) is one of the variations of sentiment analysis that can be used by companies to find out public opinion in detail on aspects related to the products or services provided. There are several subtasks under ASBA, namely aspect/sentiment term extraction, aspect categorization, extraction of aspect and sentiment terms relations, and sentiment classification. Opinion triplet extraction is a combination of several previous subtasks, which aims to extract three opinion factors from the review sentence (aspect expression, sentiment expression, sentiment polarity). In general, these tasks are performed separately sequentially. However, this approach is considered less efficient and has the potential to reduce model performance due to errors in the previous process. The Grid Tagging Scheme approach performs opinion triplet extraction simultaneously and gets better performance than the pipelined approach. In addition, this approach can also overcome one of the problems in extracting aspect sentiment pairs, namely overlapped triplet, which means that there are one or more aspect terms that have two or more different opinion terms, and vice versa. This final project focuses on adapting this approach for extracting triplet opinion from Indonesian hotel reviews. Based on the experimental results using the Airy dataset, the best model configuration is to include incomplete triplet data into the training data, use a monolingual language model and use a fine-tuning strategy in the model training process. The F1-score of the opinion triplet extraction task is 0.78. As for the aspect expression and sentiment expression extraction tasks, the F1-score of the test is 0.87, which is lower in performance than the baseline model. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Aspect-based sentiment analysis (ASBA) is one of the variations of sentiment analysis
that can be used by companies to find out public opinion in detail on aspects related to
the products or services provided. There are several subtasks under ASBA, namely
aspect/sentiment term extraction, aspect categorization, extraction of aspect and
sentiment terms relations, and sentiment classification. Opinion triplet extraction is a
combination of several previous subtasks, which aims to extract three opinion factors
from the review sentence (aspect expression, sentiment expression, sentiment polarity).
In general, these tasks are performed separately sequentially. However, this approach
is considered less efficient and has the potential to reduce model performance due to
errors in the previous process.
The Grid Tagging Scheme approach performs opinion triplet extraction simultaneously
and gets better performance than the pipelined approach. In addition, this approach can
also overcome one of the problems in extracting aspect sentiment pairs, namely
overlapped triplet, which means that there are one or more aspect terms that have two
or more different opinion terms, and vice versa. This final project focuses on adapting
this approach for extracting triplet opinion from Indonesian hotel reviews.
Based on the experimental results using the Airy dataset, the best model configuration
is to include incomplete triplet data into the training data, use a monolingual language
model and use a fine-tuning strategy in the model training process. The F1-score of the
opinion triplet extraction task is 0.78. As for the aspect expression and sentiment
expression extraction tasks, the F1-score of the test is 0.87, which is lower in
performance than the baseline model. |
format |
Final Project |
author |
Pradipta Wirawan, Gama |
spellingShingle |
Pradipta Wirawan, Gama OPINION TRIPLET EXTRACTION FOR ASPECT BASED SENTIMENT ANALYSIS USING GRID TAGGING SCHEME |
author_facet |
Pradipta Wirawan, Gama |
author_sort |
Pradipta Wirawan, Gama |
title |
OPINION TRIPLET EXTRACTION FOR ASPECT BASED SENTIMENT ANALYSIS USING GRID TAGGING SCHEME |
title_short |
OPINION TRIPLET EXTRACTION FOR ASPECT BASED SENTIMENT ANALYSIS USING GRID TAGGING SCHEME |
title_full |
OPINION TRIPLET EXTRACTION FOR ASPECT BASED SENTIMENT ANALYSIS USING GRID TAGGING SCHEME |
title_fullStr |
OPINION TRIPLET EXTRACTION FOR ASPECT BASED SENTIMENT ANALYSIS USING GRID TAGGING SCHEME |
title_full_unstemmed |
OPINION TRIPLET EXTRACTION FOR ASPECT BASED SENTIMENT ANALYSIS USING GRID TAGGING SCHEME |
title_sort |
opinion triplet extraction for aspect based sentiment analysis using grid tagging scheme |
url |
https://digilib.itb.ac.id/gdl/view/56237 |
_version_ |
1822274523417804800 |