EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE

Emotion recognition from utterance signal have been developed in many languages include Indonesia spoken language. There are four class emotion that can be recognized by a system in Indonesia spoken language such as angry, happiness, contentment and sadness. These are considered as general emotion t...

Full description

Saved in:
Bibliographic Details
Main Author: KASYIDI (NIM : 23515025), FATAN
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/22088
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:22088
spelling id-itb.:220882017-09-28T14:54:27ZEMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE KASYIDI (NIM : 23515025), FATAN Indonesia Theses INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/22088 Emotion recognition from utterance signal have been developed in many languages include Indonesia spoken language. There are four class emotion that can be recognized by a system in Indonesia spoken language such as angry, happiness, contentment and sadness. These are considered as general emotion that use in daily activities. <br /> <br /> This thesis developed a model for emotion recognition from utterance signal in Indonesia spoken language using acoustic and lexical features. Acoustic features can be mention such as spectral, ceptral, voicing-related, energy, pitch contour, jitter and shimmer. Beside that, lexical feature that use in this thesis such as Bag-of-word and TF-IDF. From that kind of feature, there will be three groups of model that consist of acoustic model, lexical model and combined model from both of features. These features were extracted from emotion corpus that developed alongside the transcript from the audio file. <br /> <br /> There are 100 segmen of utterance as data testing with four class emotion. Data development are use to develop the model of three with 982 segmen of utterance in Indonesia spoken language. The experimental scenario for data development is using 5-fold cross validation and three machine learning algorithm such as naive bayes, random forest and support vector machine (SVM) will included. The result of experiment shows that acoustic model have F-measure 0.40361 by using random forest, F-measure 0.40823 by using random forest and combined model have F-measure 0.40823 by using SVM RBF kernel. Finally the result from testing phase by use the data testing show that the best accuracy given by combined model between acoustic and lexical TFIDF using SVM RBF kernel with 0.476 of F-measure. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Emotion recognition from utterance signal have been developed in many languages include Indonesia spoken language. There are four class emotion that can be recognized by a system in Indonesia spoken language such as angry, happiness, contentment and sadness. These are considered as general emotion that use in daily activities. <br /> <br /> This thesis developed a model for emotion recognition from utterance signal in Indonesia spoken language using acoustic and lexical features. Acoustic features can be mention such as spectral, ceptral, voicing-related, energy, pitch contour, jitter and shimmer. Beside that, lexical feature that use in this thesis such as Bag-of-word and TF-IDF. From that kind of feature, there will be three groups of model that consist of acoustic model, lexical model and combined model from both of features. These features were extracted from emotion corpus that developed alongside the transcript from the audio file. <br /> <br /> There are 100 segmen of utterance as data testing with four class emotion. Data development are use to develop the model of three with 982 segmen of utterance in Indonesia spoken language. The experimental scenario for data development is using 5-fold cross validation and three machine learning algorithm such as naive bayes, random forest and support vector machine (SVM) will included. The result of experiment shows that acoustic model have F-measure 0.40361 by using random forest, F-measure 0.40823 by using random forest and combined model have F-measure 0.40823 by using SVM RBF kernel. Finally the result from testing phase by use the data testing show that the best accuracy given by combined model between acoustic and lexical TFIDF using SVM RBF kernel with 0.476 of F-measure.
format Theses
author KASYIDI (NIM : 23515025), FATAN
spellingShingle KASYIDI (NIM : 23515025), FATAN
EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE
author_facet KASYIDI (NIM : 23515025), FATAN
author_sort KASYIDI (NIM : 23515025), FATAN
title EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE
title_short EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE
title_full EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE
title_fullStr EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE
title_full_unstemmed EMOTION RECOGNITION MODEL FROM INDONESIA SPOKEN LANGUAGE USING ACOUSTIC AND LEXICAL FEATURE
title_sort emotion recognition model from indonesia spoken language using acoustic and lexical feature
url https://digilib.itb.ac.id/gdl/view/22088
_version_ 1821120663864213504