Gender dependent word-level emotion detection using global spectral speech features

In this study, global spectral features extracted from word and sentence levels are studied for speech emotion recognition. MFCC (Mel Frequency Cepstral Coefficient) were used as spectral information for recognition purpose. Global spectral features representing gross statistics such as mean of MFCC...

Full description

Saved in:

Bibliographic Details
Main Author:	Siddique, Haris
Format:	Thesis
Language:	English English
Published:	2015
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://etd.uum.edu.my/4518/
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Utara Malaysia
Language:	English English

id	my.uum.etd.4518
record_format	eprints
spelling	my.uum.etd.45182021-04-05T01:19:13Z http://etd.uum.edu.my/4518/ Gender dependent word-level emotion detection using global spectral speech features Siddique, Haris QA75 Electronic computers. Computer science In this study, global spectral features extracted from word and sentence levels are studied for speech emotion recognition. MFCC (Mel Frequency Cepstral Coefficient) were used as spectral information for recognition purpose. Global spectral features representing gross statistics such as mean of MFCC are used. This study also examine words at different positions (initial, middle and end) separately in a sentence. Word-level feature extraction is used to analyze emotion recognition performance of words at different positions. Word boundaries are manually identified. Gender dependent and independent models are also studied to analyze the gender impact on emotion recognition performance. Berlin’s Emo-DB (Emotional Database) was used for emotional speech dataset. Performance of different classifiers also been studied. NN (Neural Network), KNN (K-Nearest Neighbor) and LDA (Linear Discriminant Analysis) are included in the classifiers. Anger and neutral emotions were also studied. Results showed that, using all 13 MFCC coefficients provide better classification results than other combinations of MFCC coefficients for the mentioned emotions. Words at initial and ending positions provide more emotion, specific information than words at middle position. Gender dependent models are more efficient than gender independent models. Moreover, female are more efficient than male model and female exhibit emotions better than the male. General, NN performs the worst compared to KNN and LDA in classifying anger and neutral. LDA performs better than KNN almost 15% for gender independent model and almost 25% for gender dependent. 2015 Thesis NonPeerReviewed text en /4518/1/s814534.pdf text en /4518/2/s814534_abstract.pdf Siddique, Haris (2015) Gender dependent word-level emotion detection using global spectral speech features. Masters thesis, Universiti Utara Malaysia.
institution	Universiti Utara Malaysia
building	UUM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Utara Malaysia
content_source	UUM Electronic Theses
url_provider	http://etd.uum.edu.my/
language	English English
topic	QA75 Electronic computers. Computer science
spellingShingle	QA75 Electronic computers. Computer science Siddique, Haris Gender dependent word-level emotion detection using global spectral speech features
description	In this study, global spectral features extracted from word and sentence levels are studied for speech emotion recognition. MFCC (Mel Frequency Cepstral Coefficient) were used as spectral information for recognition purpose. Global spectral features representing gross statistics such as mean of MFCC are used. This study also examine words at different positions (initial, middle and end) separately in a sentence. Word-level feature extraction is used to analyze emotion recognition performance of words at different positions. Word boundaries are manually identified. Gender dependent and independent models are also studied to analyze the gender impact on emotion recognition performance. Berlin’s Emo-DB (Emotional Database) was used for emotional speech dataset. Performance of different classifiers also been studied. NN (Neural Network), KNN (K-Nearest Neighbor) and LDA (Linear Discriminant Analysis) are included in the classifiers. Anger and neutral emotions were also studied. Results showed that, using all 13 MFCC coefficients provide better classification results than other combinations of MFCC coefficients for the mentioned emotions. Words at initial and ending positions provide more emotion, specific information than words at middle position. Gender dependent models are more efficient than gender independent models. Moreover, female are more efficient than male model and female exhibit emotions better than the male. General, NN performs the worst compared to KNN and LDA in classifying anger and neutral. LDA performs better than KNN almost 15% for gender independent model and almost 25% for gender dependent.
format	Thesis
author	Siddique, Haris
author_facet	Siddique, Haris
author_sort	Siddique, Haris
title	Gender dependent word-level emotion detection using global spectral speech features
title_short	Gender dependent word-level emotion detection using global spectral speech features
title_full	Gender dependent word-level emotion detection using global spectral speech features
title_fullStr	Gender dependent word-level emotion detection using global spectral speech features
title_full_unstemmed	Gender dependent word-level emotion detection using global spectral speech features
title_sort	gender dependent word-level emotion detection using global spectral speech features
publishDate	2015
url	http://etd.uum.edu.my/4518/
_version_	1696978265417187328

Gender dependent word-level emotion detection using global spectral speech features

Similar Items