EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE
Emotion recognition from utterance signal have been developed in many languages include Indonesia spoken language. There are four class emotion that can be recognized by a system in Indonesia spoken language such as angry, happiness, contentment and sadness. These are considered as general emotion t...
Saved in:
Main Author: | |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/22088 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:22088 |
---|---|
spelling |
id-itb.:220882017-09-28T14:54:27ZEMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE KASYIDI (NIM : 23515025), FATAN Indonesia Theses INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/22088 Emotion recognition from utterance signal have been developed in many languages include Indonesia spoken language. There are four class emotion that can be recognized by a system in Indonesia spoken language such as angry, happiness, contentment and sadness. These are considered as general emotion that use in daily activities. <br /> <br /> This thesis developed a model for emotion recognition from utterance signal in Indonesia spoken language using acoustic and lexical features. Acoustic features can be mention such as spectral, ceptral, voicing-related, energy, pitch contour, jitter and shimmer. Beside that, lexical feature that use in this thesis such as Bag-of-word and TF-IDF. From that kind of feature, there will be three groups of model that consist of acoustic model, lexical model and combined model from both of features. These features were extracted from emotion corpus that developed alongside the transcript from the audio file. <br /> <br /> There are 100 segmen of utterance as data testing with four class emotion. Data development are use to develop the model of three with 982 segmen of utterance in Indonesia spoken language. The experimental scenario for data development is using 5-fold cross validation and three machine learning algorithm such as naive bayes, random forest and support vector machine (SVM) will included. The result of experiment shows that acoustic model have F-measure 0.40361 by using random forest, F-measure 0.40823 by using random forest and combined model have F-measure 0.40823 by using SVM RBF kernel. Finally the result from testing phase by use the data testing show that the best accuracy given by combined model between acoustic and lexical TFIDF using SVM RBF kernel with 0.476 of F-measure. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Emotion recognition from utterance signal have been developed in many languages include Indonesia spoken language. There are four class emotion that can be recognized by a system in Indonesia spoken language such as angry, happiness, contentment and sadness. These are considered as general emotion that use in daily activities. <br />
<br />
This thesis developed a model for emotion recognition from utterance signal in Indonesia spoken language using acoustic and lexical features. Acoustic features can be mention such as spectral, ceptral, voicing-related, energy, pitch contour, jitter and shimmer. Beside that, lexical feature that use in this thesis such as Bag-of-word and TF-IDF. From that kind of feature, there will be three groups of model that consist of acoustic model, lexical model and combined model from both of features. These features were extracted from emotion corpus that developed alongside the transcript from the audio file. <br />
<br />
There are 100 segmen of utterance as data testing with four class emotion. Data development are use to develop the model of three with 982 segmen of utterance in Indonesia spoken language. The experimental scenario for data development is using 5-fold cross validation and three machine learning algorithm such as naive bayes, random forest and support vector machine (SVM) will included. The result of experiment shows that acoustic model have F-measure 0.40361 by using random forest, F-measure 0.40823 by using random forest and combined model have F-measure 0.40823 by using SVM RBF kernel. Finally the result from testing phase by use the data testing show that the best accuracy given by combined model between acoustic and lexical TFIDF using SVM RBF kernel with 0.476 of F-measure. |
format |
Theses |
author |
KASYIDI (NIM : 23515025), FATAN |
spellingShingle |
KASYIDI (NIM : 23515025), FATAN EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE |
author_facet |
KASYIDI (NIM : 23515025), FATAN |
author_sort |
KASYIDI (NIM : 23515025), FATAN |
title |
EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE |
title_short |
EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE |
title_full |
EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE |
title_fullStr |
EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE |
title_full_unstemmed |
EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE |
title_sort |
emotion recognition model from indonesia spoken language using acoustic and lexical feature |
url |
https://digilib.itb.ac.id/gdl/view/22088 |
_version_ |
1821120663864213504 |