TRANSFER LEARNING FOR INDONESIAN ASPECT-BASED SENTIMENT ANALYSIS USING LANGUAGE MODEL FINE-TUNING APPROACH

Aspect-based sentiment analysis (ABSA) is one of the methods widely used by companies to find out public opinion in detail down to aspects contained in products or services. ABSA is typically divided into two subtasks, namely aspect extraction / categorization which aims to extract the expression of...

Full description

Saved in:
Bibliographic Details
Main Author: Nurul Azhar, Annisa
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/48070
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:48070
spelling id-itb.:480702020-06-26T00:36:05ZTRANSFER LEARNING FOR INDONESIAN ASPECT-BASED SENTIMENT ANALYSIS USING LANGUAGE MODEL FINE-TUNING APPROACH Nurul Azhar, Annisa Indonesia Theses pre-trained language model, transfer learning, fine-tuning INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/48070 Aspect-based sentiment analysis (ABSA) is one of the methods widely used by companies to find out public opinion in detail down to aspects contained in products or services. ABSA is typically divided into two subtasks, namely aspect extraction / categorization which aims to extract the expression of aspects / categorization of aspects into predetermined categories and sentiment classification which aims to find out the sentiment towards each aspect. Previously, research on ABSA has been done for the subtask aspect categorization and sentiment classification using convolutional neural networks (CNN) as feature extractor and extreme gradient boosting (XGBoost) as top-level-classifier (Azhar, 2019). However, the model does not generalize well in the test data and there are still a lot of out-of-vocabulary (OOV) words. Therefore, we need a new technique which could be used to solve these problems. Nowadays, the techniques of pre-training language representation models are developing so rapidly that most of the state-of-the-art results for a variety of natural language processing tasks are achieved using language representation models such as OpenAI GPT, ELMo, and BERT. In this thesis, the BERT pre-trained language model is used to complete the ASBA task for Indonesian language review texts on the hotel domain. The BERT model used is a multilingual model because currently there are no Indonesian-specific BERT pre-trained models available to the public. There are two methods of solving problems and two strategies for using language models that are compared through experiments, namely single sentence classification and sentence-pair classification as proposed in Sun, et al. (2019) as well as feature extraction and fine-tuning. Based on the results of combined experiments, the combination that produces the best performance is sentence-pair classification with fine-tuning that is equal to 0.9751. For the test results, the resulting model reaches an F1 score of 0.9765 in test data I (until December 2018) and 0.9304 in test data II (March 2019 - July 2019). There was an increase in performance of 8% in the test data I and 44% in test data II compared to the performance of the model from previous studies. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Aspect-based sentiment analysis (ABSA) is one of the methods widely used by companies to find out public opinion in detail down to aspects contained in products or services. ABSA is typically divided into two subtasks, namely aspect extraction / categorization which aims to extract the expression of aspects / categorization of aspects into predetermined categories and sentiment classification which aims to find out the sentiment towards each aspect. Previously, research on ABSA has been done for the subtask aspect categorization and sentiment classification using convolutional neural networks (CNN) as feature extractor and extreme gradient boosting (XGBoost) as top-level-classifier (Azhar, 2019). However, the model does not generalize well in the test data and there are still a lot of out-of-vocabulary (OOV) words. Therefore, we need a new technique which could be used to solve these problems. Nowadays, the techniques of pre-training language representation models are developing so rapidly that most of the state-of-the-art results for a variety of natural language processing tasks are achieved using language representation models such as OpenAI GPT, ELMo, and BERT. In this thesis, the BERT pre-trained language model is used to complete the ASBA task for Indonesian language review texts on the hotel domain. The BERT model used is a multilingual model because currently there are no Indonesian-specific BERT pre-trained models available to the public. There are two methods of solving problems and two strategies for using language models that are compared through experiments, namely single sentence classification and sentence-pair classification as proposed in Sun, et al. (2019) as well as feature extraction and fine-tuning. Based on the results of combined experiments, the combination that produces the best performance is sentence-pair classification with fine-tuning that is equal to 0.9751. For the test results, the resulting model reaches an F1 score of 0.9765 in test data I (until December 2018) and 0.9304 in test data II (March 2019 - July 2019). There was an increase in performance of 8% in the test data I and 44% in test data II compared to the performance of the model from previous studies.
format Theses
author Nurul Azhar, Annisa
spellingShingle Nurul Azhar, Annisa
TRANSFER LEARNING FOR INDONESIAN ASPECT-BASED SENTIMENT ANALYSIS USING LANGUAGE MODEL FINE-TUNING APPROACH
author_facet Nurul Azhar, Annisa
author_sort Nurul Azhar, Annisa
title TRANSFER LEARNING FOR INDONESIAN ASPECT-BASED SENTIMENT ANALYSIS USING LANGUAGE MODEL FINE-TUNING APPROACH
title_short TRANSFER LEARNING FOR INDONESIAN ASPECT-BASED SENTIMENT ANALYSIS USING LANGUAGE MODEL FINE-TUNING APPROACH
title_full TRANSFER LEARNING FOR INDONESIAN ASPECT-BASED SENTIMENT ANALYSIS USING LANGUAGE MODEL FINE-TUNING APPROACH
title_fullStr TRANSFER LEARNING FOR INDONESIAN ASPECT-BASED SENTIMENT ANALYSIS USING LANGUAGE MODEL FINE-TUNING APPROACH
title_full_unstemmed TRANSFER LEARNING FOR INDONESIAN ASPECT-BASED SENTIMENT ANALYSIS USING LANGUAGE MODEL FINE-TUNING APPROACH
title_sort transfer learning for indonesian aspect-based sentiment analysis using language model fine-tuning approach
url https://digilib.itb.ac.id/gdl/view/48070
_version_ 1822927818092183552