#TITLE_ALTERNATIVE#
<p align="justify">Researches on theclassification of emotions on onlinetext chat has not been done in Indonesian. The challenge of classifying emotions on online chatting text is the difference of characteristics on online chatting text with general text, such as unclear segmentatio...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/30197 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:30197 |
---|---|
spelling |
id-itb.:301972018-06-29T08:26:54Z#TITLE_ALTERNATIVE# JANOAH HASUDUNGAN - NIM:13514089, RAMOS Indonesia Final Project INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/30197 <p align="justify">Researches on theclassification of emotions on onlinetext chat has not been done in Indonesian. The challenge of classifying emotions on online chatting text is the difference of characteristics on online chatting text with general text, such as unclear segmentation in the text, the existence of inter-document contexts affecting each other's labels, abnormal and multilingual vocabulary, and unclear syntacticstructures. To handle these matters, various adaptations of the text classification process in general, such as pre-processing of word normalization, pragmatic feature extraction, and not performing syntactic feature extraction are done in this final project. In addition, to handle the context of the sentence, in this final project, there aretwo approaches, non-sequential approach and sequential approach. In the non-sequential approach, the context between documents will not be taken into account, whereas in the sequential approach, the context between chat bubbles will be taken into account, whereas in the sequential approach, the context betweenchatbubbleswillhaveaneffect. In both of these approaches, experiments were conducted on the combination of features and machine learning algorithms. The features are a combination of ngram, pragmatic features, non-textual features, and word embedding features. The learning algorithms tried are K-nearest neighbor (KNN), support vector machine (SVM), multilayer perceptron (MLP), and random forest for non-sequential approachalsoLongShort-TermMemory(LSTM)forsequentialapproach. Model building and testing were performed using data derived from social media LINE, by splitting the data into training, validation, and testing data. The best F1micro emotion evaluated on the test and validation data is 0.326 and 0.325 respectively using multilayer perceptron with 1-gram, pragmatic feature and averagewordvectorwordembeddingfeature.<p align="justify"> <br /> text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
<p align="justify">Researches on theclassification of emotions on onlinetext chat has not been done in Indonesian. The challenge of classifying emotions on online chatting text is the difference of characteristics on online chatting text with general text, such as unclear segmentation in the text, the existence of inter-document contexts affecting each other's labels, abnormal and multilingual vocabulary, and unclear syntacticstructures. To handle these matters, various adaptations of the text classification process in general, such as pre-processing of word normalization, pragmatic feature extraction, and not performing syntactic feature extraction are done in this final project. In addition, to handle the context of the sentence, in this final project, there aretwo approaches, non-sequential approach and sequential approach. In the non-sequential approach, the context between documents will not be taken into account, whereas in the sequential approach, the context between chat bubbles will be taken into account, whereas in the sequential approach, the context betweenchatbubbleswillhaveaneffect. In both of these approaches, experiments were conducted on the combination of features and machine learning algorithms. The features are a combination of ngram, pragmatic features, non-textual features, and word embedding features. The learning algorithms tried are K-nearest neighbor (KNN), support vector machine (SVM), multilayer perceptron (MLP), and random forest for non-sequential approachalsoLongShort-TermMemory(LSTM)forsequentialapproach. Model building and testing were performed using data derived from social media LINE, by splitting the data into training, validation, and testing data. The best F1micro emotion evaluated on the test and validation data is 0.326 and 0.325 respectively using multilayer perceptron with 1-gram, pragmatic feature and averagewordvectorwordembeddingfeature.<p align="justify"> <br />
|
format |
Final Project |
author |
JANOAH HASUDUNGAN - NIM:13514089, RAMOS |
spellingShingle |
JANOAH HASUDUNGAN - NIM:13514089, RAMOS #TITLE_ALTERNATIVE# |
author_facet |
JANOAH HASUDUNGAN - NIM:13514089, RAMOS |
author_sort |
JANOAH HASUDUNGAN - NIM:13514089, RAMOS |
title |
#TITLE_ALTERNATIVE# |
title_short |
#TITLE_ALTERNATIVE# |
title_full |
#TITLE_ALTERNATIVE# |
title_fullStr |
#TITLE_ALTERNATIVE# |
title_full_unstemmed |
#TITLE_ALTERNATIVE# |
title_sort |
#title_alternative# |
url |
https://digilib.itb.ac.id/gdl/view/30197 |
_version_ |
1822267357973708800 |