ADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT
ABSTRA CT ADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT By Azhar Abdurasyid NIM : 13517097 Adversarial Training (AT) is a learning technique that uses adversarial example at the training stage so that the model will be able to handle such input data well. An advers...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/64103 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:64103 |
---|---|
spelling |
id-itb.:641032022-03-29T09:24:14ZADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT Abdurrasyid, Azhar Indonesia Final Project adversarial training, adversarial example, aspect categorization, multilabel classification, binary relevance, classifier chain INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/64103 ABSTRA CT ADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT By Azhar Abdurasyid NIM : 13517097 Adversarial Training (AT) is a learning technique that uses adversarial example at the training stage so that the model will be able to handle such input data well. An adversarial example is an input that has been added with small disturbance so that when the model accepts the input, the model will fail to make correct predictions. In addition to being resistant to adversarial examples, AT is believed to be able to generalize the model and prevent overfit because AT can be seen as a regularization technique that introduces different data. One of the challenges encountered when trying to adapt AT techniques for natural language processing tasks is the need for continuous input data. This Final Project focuses on to adapt the AT technique to the task of categorizing multilabel aspects in Indonesian Language. To get continuous input data on text data, this Final Project approaches using input data in the form of word embedding. In addition, this Final Project also adopts the CNN feature extraction topology in the previous aspect categorization research to obtain a continuous feature vector that can be used as input data for the AT model. In this way, two AT models will be built, namely the word embedding AT model and the feature-map AT model. Then to solve the problem of multilabel classification, this final project will compare the use of binary relevance and classifier chain strategies. The test results show that the AT word embedding and AT feature-map models have a better F1-macro performance than the baseline with a sequential score of 0.9253 and 0.9234 in the first test data and 0.8531 and 0.8505 in the second test data. The AT feature-map model succeeded in increasing the baseline average F1-score performance on the adversarial test data with an increase of 0.3291. Meanwhile, the AT word embedding model did not succeed in increasing the baseline average F1-score performance on the adversarial test data. Keywords: adversarial training, adversarial example, aspect categorization, multilabel classification, binary relevance, classifier chain text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
ABSTRA
CT
ADVERSARIAL TRAINING FOR CATEGORIZATION
ASPECTS ON MULTILABEL REVIEW TEXT
By
Azhar Abdurasyid
NIM : 13517097
Adversarial Training (AT) is a learning technique that uses adversarial example at the training stage so that the model will be able to handle such input data well. An adversarial example is an input that has been added with small disturbance so that when the model accepts the input, the model will fail to make correct predictions. In addition to being resistant to adversarial examples, AT is believed to be able to generalize the model and prevent overfit because AT can be seen as a regularization technique that introduces different data. One of the challenges encountered when trying to adapt AT techniques for natural language processing tasks is the need for continuous input data. This Final Project focuses on to adapt the AT technique to the task of categorizing multilabel aspects in Indonesian Language.
To get continuous input data on text data, this Final Project approaches using input data in the form of word embedding. In addition, this Final Project also adopts the CNN feature extraction topology in the previous aspect categorization research to obtain a continuous feature vector that can be used as input data for the AT model. In this way, two AT models will be built, namely the word embedding AT model and the feature-map AT model. Then to solve the problem of multilabel classification, this final project will compare the use of binary relevance and classifier chain strategies.
The test results show that the AT word embedding and AT feature-map models have a better F1-macro performance than the baseline with a sequential score of 0.9253 and 0.9234 in the first test data and 0.8531 and 0.8505 in the second test data. The AT feature-map model succeeded in increasing the baseline average F1-score performance on the adversarial test data with an increase of 0.3291. Meanwhile, the AT word embedding model did not succeed in increasing the baseline average F1-score performance on the adversarial test data.
Keywords: adversarial training, adversarial example, aspect categorization, multilabel classification, binary relevance, classifier chain |
format |
Final Project |
author |
Abdurrasyid, Azhar |
spellingShingle |
Abdurrasyid, Azhar ADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT |
author_facet |
Abdurrasyid, Azhar |
author_sort |
Abdurrasyid, Azhar |
title |
ADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT |
title_short |
ADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT |
title_full |
ADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT |
title_fullStr |
ADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT |
title_full_unstemmed |
ADVERSARIAL TRAINING FOR CATEGORIZATION ASPECTS ON MULTILABEL REVIEW TEXT |
title_sort |
adversarial training for categorization aspects on multilabel review text |
url |
https://digilib.itb.ac.id/gdl/view/64103 |
_version_ |
1822004471323951104 |