Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
In the speech signal, emotion is considered one of the most critical elements. For the recognition of emotions, the field of speech emotion recognition came into ex-istence. Speech Emotion Recognition (SER) is becoming an area of research in-terest in the last few years. A typical SER system focuses...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Book Chapter |
Language: | English English |
Published: |
Springer
2020
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/82534/1/Paper_182.pdf http://irep.iium.edu.my/82534/2/Acceptance%20Letter_DrTeddy_IIUM.pdf http://irep.iium.edu.my/82534/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Islam Antarabangsa Malaysia |
Language: | English English |
id |
my.iium.irep.82534 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.825342020-09-18T02:58:06Z http://irep.iium.edu.my/82534/ Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases Ahmad Qadri, Syed Asif Gunawan, Teddy Surya Kartiwi, Mira Mansor, Hasmah TK7885 Computer engineering In the speech signal, emotion is considered one of the most critical elements. For the recognition of emotions, the field of speech emotion recognition came into ex-istence. Speech Emotion Recognition (SER) is becoming an area of research in-terest in the last few years. A typical SER system focuses on extracting features such as pitch frequency, formant features, energy-related features, and spectral features from speech, tailing it with a classification quest to foresee different clas-ses of emotion. The critical issue to be addressed for a successful SER system is the emotional feature extraction, which can be solved by using different feature extraction techniques. In this paper, along with Teager Energy Operator (TEO) and Mel Frequency Cepstral Coefficients (MFCC) a trailblazing feature extrac-tion method, a fusion of MFCC and TEO as Teager-MFCC (T-MFCC) is used for the recognition of energy-based emotions. We have used three corpora of emotions in German, English, and Hindi to develop the multilingual SER system. The classification of these energy-based emotions is done by Deep Neural Net-work (DNN). It is found that TEO achieves a better recognition rate compared to MFCC and T-MFCC. Springer 2020-07 Book Chapter NonPeerReviewed application/pdf en http://irep.iium.edu.my/82534/1/Paper_182.pdf application/pdf en http://irep.iium.edu.my/82534/2/Acceptance%20Letter_DrTeddy_IIUM.pdf Ahmad Qadri, Syed Asif and Gunawan, Teddy Surya and Kartiwi, Mira and Mansor, Hasmah (2020) Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases. In: Springer's Lecture Nores in Electrical Engineering (LNEE). Springer, pp. 1-10. (In Press) |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English English |
topic |
TK7885 Computer engineering |
spellingShingle |
TK7885 Computer engineering Ahmad Qadri, Syed Asif Gunawan, Teddy Surya Kartiwi, Mira Mansor, Hasmah Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases |
description |
In the speech signal, emotion is considered one of the most critical elements. For the recognition of emotions, the field of speech emotion recognition came into ex-istence. Speech Emotion Recognition (SER) is becoming an area of research in-terest in the last few years. A typical SER system focuses on extracting features such as pitch frequency, formant features, energy-related features, and spectral features from speech, tailing it with a classification quest to foresee different clas-ses of emotion. The critical issue to be addressed for a successful SER system is the emotional feature extraction, which can be solved by using different feature extraction techniques. In this paper, along with Teager Energy Operator (TEO) and Mel Frequency Cepstral Coefficients (MFCC) a trailblazing feature extrac-tion method, a fusion of MFCC and TEO as Teager-MFCC (T-MFCC) is used for the recognition of energy-based emotions. We have used three corpora of emotions in German, English, and Hindi to develop the multilingual SER system. The classification of these energy-based emotions is done by Deep Neural Net-work (DNN). It is found that TEO achieves a better recognition rate compared to MFCC and T-MFCC. |
format |
Book Chapter |
author |
Ahmad Qadri, Syed Asif Gunawan, Teddy Surya Kartiwi, Mira Mansor, Hasmah |
author_facet |
Ahmad Qadri, Syed Asif Gunawan, Teddy Surya Kartiwi, Mira Mansor, Hasmah |
author_sort |
Ahmad Qadri, Syed Asif |
title |
Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases |
title_short |
Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases |
title_full |
Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases |
title_fullStr |
Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases |
title_full_unstemmed |
Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases |
title_sort |
speech emotion recognition using feature fusion of teo and mfcc on multilingual databases |
publisher |
Springer |
publishDate |
2020 |
url |
http://irep.iium.edu.my/82534/1/Paper_182.pdf http://irep.iium.edu.my/82534/2/Acceptance%20Letter_DrTeddy_IIUM.pdf http://irep.iium.edu.my/82534/ |
_version_ |
1678592805745721344 |