#TITLE_ALTERNATIVE#

Research on the classification of emotions in online chat text has not yet been done for Indonesian. The challenge of classifying emotions in online chat text is that its characteristics differ from those of general text, such as unclear segmentatio...

Bibliographic Details
Main Author:	JANOAH HASUDUNGAN - NIM:13514089, RAMOS
Format:	Final Project
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/30197
Institution:	Institut Teknologi Bandung

id	id-itb.:30197
spelling	id-itb.:30197 2018-06-29T08:26:54Z #TITLE_ALTERNATIVE# JANOAH HASUDUNGAN - NIM:13514089, RAMOS Indonesia Final Project INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/30197 text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	Research on the classification of emotions in online chat text has not yet been done for Indonesian. The challenge of classifying emotions in online chat text is that its characteristics differ from those of general text: unclear segmentation of the text, inter-document contexts that affect each other's labels, non-standard and multilingual vocabulary, and unclear syntactic structures. To handle these issues, this final project adapts the general text classification process in several ways, such as adding word normalization to pre-processing, extracting pragmatic features, and omitting syntactic feature extraction. In addition, to handle sentence context, this final project uses two approaches, a non-sequential approach and a sequential approach. In the non-sequential approach, the context between documents is not taken into account, whereas in the sequential approach, the context between chat bubbles is taken into account. In both approaches, experiments were conducted on combinations of features and machine learning algorithms. The features are combinations of n-gram, pragmatic, non-textual, and word embedding features. The learning algorithms tried are K-nearest neighbor (KNN), support vector machine (SVM), multilayer perceptron (MLP), and random forest for the non-sequential approach, and Long Short-Term Memory (LSTM) for the sequential approach. Model building and testing were performed using data derived from the social media platform LINE, split into training, validation, and testing data. The best micro-averaged F1 score for emotion on the test and validation data is 0.326 and 0.325, respectively, obtained using a multilayer perceptron with 1-gram features, pragmatic features, and average word vector word embedding features.
format	Final Project
author	JANOAH HASUDUNGAN - NIM:13514089, RAMOS
spellingShingle	JANOAH HASUDUNGAN - NIM:13514089, RAMOS #TITLE_ALTERNATIVE#
author_facet	JANOAH HASUDUNGAN - NIM:13514089, RAMOS
author_sort	JANOAH HASUDUNGAN - NIM:13514089, RAMOS
title	#TITLE_ALTERNATIVE#
title_short	#TITLE_ALTERNATIVE#
title_full	#TITLE_ALTERNATIVE#
title_fullStr	#TITLE_ALTERNATIVE#
title_full_unstemmed	#TITLE_ALTERNATIVE#
title_sort	#title_alternative#
url	https://digilib.itb.ac.id/gdl/view/30197
_version_	1822267357973708800
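
The abstract reports results as micro-averaged F1 but does not say how the score was computed. The following is a minimal sketch of the standard definition, not the author's evaluation code: true positives, false positives, and false negatives are pooled over the chosen labels before F1 is computed once. The emotion label names and the sample predictions are hypothetical, and the optional `labels` argument is an assumption added to show how restricting the label set (for example, leaving out a neutral class) changes the score.

```python
def micro_f1(y_true, y_pred, labels=None):
    """Micro-averaged F1: pool TP, FP, FN over `labels`, then compute F1 once."""
    if labels is None:
        # Default: score every label seen in either list.
        labels = sorted(set(y_true) | set(y_pred))
    tp = fp = fn = 0
    for label in labels:
        for truth, pred in zip(y_true, y_pred):
            if pred == label and truth == label:
                tp += 1
            elif pred == label:
                fp += 1  # predicted this label, but the truth differs
            elif truth == label:
                fn += 1  # missed an instance of this label
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Hypothetical gold and predicted labels for six chat bubbles.
y_true = ["joy", "anger", "neutral", "sadness", "joy", "neutral"]
y_pred = ["joy", "neutral", "neutral", "joy", "anger", "neutral"]

score_all = micro_f1(y_true, y_pred)
score_emotions = micro_f1(y_true, y_pred, labels=["anger", "joy", "sadness"])
print(score_all, score_emotions)
```

When every bubble has exactly one label and all labels are scored, micro-averaged F1 equals plain accuracy (3 of 6 correct here). Restricting the score to a subset of labels breaks that equality, which is one reason to state the label set alongside a reported micro-F1 value.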

Similar Items