Emotional speech recognition using acoustic models of decomposed component words

This paper presents a novel approach for emotional speech recognition. Instead of using a full length of speech for classification, the proposed method decomposes speech signals into component words, groups the words into segments and generates an acoustic model for each segment by using features su...

Full description

Saved in:

Bibliographic Details
Main Authors:	Kaveeta V., Patanukhom K.
Format:	Conference or Workshop Item
Published:	2015
Online Access:	http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84899094356&origin=inward http://cmuir.cmu.ac.th/handle/6653943832/39035
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Chiang Mai University

id	th-cmuir.6653943832-39035
record_format	dspace
spelling	th-cmuir.6653943832-390352015-06-16T08:01:17Z Emotional speech recognition using acoustic models of decomposed component words Kaveeta V. Patanukhom K. This paper presents a novel approach for emotional speech recognition. Instead of using a full length of speech for classification, the proposed method decomposes speech signals into component words, groups the words into segments and generates an acoustic model for each segment by using features such as audio power, MFCC, log attack time, spectrum spread and segment duration. Based on the proposed segment-based classification, unknown speech signals can be recognized into sequences of segment emotions. Emotion profiles (EPs) are extracted from the emotion sequences. Finally, speech emotion can be determined by using EP as features. Experiments are conducted by using 6,810 training samples and 722 test samples which are composed of eight emotional classes from IEMOCAP database. In comparison with a conventional method, the proposed method can improve recognition rate from 46.81% to 58.59% in eight emotion classification and from 60.18% to 71.25% in four emotion classification. © 2013 IEEE. 2015-06-16T08:01:17Z 2015-06-16T08:01:17Z 2013-01-01 Conference Paper 2-s2.0-84899094356 10.1109/ACPR.2013.13 http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84899094356&origin=inward http://cmuir.cmu.ac.th/handle/6653943832/39035
institution	Chiang Mai University
building	Chiang Mai University Library
country	Thailand
collection	CMU Intellectual Repository
description	This paper presents a novel approach for emotional speech recognition. Instead of using a full length of speech for classification, the proposed method decomposes speech signals into component words, groups the words into segments and generates an acoustic model for each segment by using features such as audio power, MFCC, log attack time, spectrum spread and segment duration. Based on the proposed segment-based classification, unknown speech signals can be recognized into sequences of segment emotions. Emotion profiles (EPs) are extracted from the emotion sequences. Finally, speech emotion can be determined by using EP as features. Experiments are conducted by using 6,810 training samples and 722 test samples which are composed of eight emotional classes from IEMOCAP database. In comparison with a conventional method, the proposed method can improve recognition rate from 46.81% to 58.59% in eight emotion classification and from 60.18% to 71.25% in four emotion classification. © 2013 IEEE.
format	Conference or Workshop Item
author	Kaveeta V. Patanukhom K.
spellingShingle	Kaveeta V. Patanukhom K. Emotional speech recognition using acoustic models of decomposed component words
author_facet	Kaveeta V. Patanukhom K.
author_sort	Kaveeta V.
title	Emotional speech recognition using acoustic models of decomposed component words
title_short	Emotional speech recognition using acoustic models of decomposed component words
title_full	Emotional speech recognition using acoustic models of decomposed component words
title_fullStr	Emotional speech recognition using acoustic models of decomposed component words
title_full_unstemmed	Emotional speech recognition using acoustic models of decomposed component words
title_sort	emotional speech recognition using acoustic models of decomposed component words
publishDate	2015
url	http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84899094356&origin=inward http://cmuir.cmu.ac.th/handle/6653943832/39035
_version_	1681421581625065472

Emotional speech recognition using acoustic models of decomposed component words

Similar Items