Speech emotion analysis
Emotion recognition in speech has become increasingly important mainly due to the prevailing usage of speech in human-computer interaction. However, the studies done in the last century are still not enough to achieve satisfactory accuracies for the speech emotion recognition system. Therefore, this...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/18054 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-18054 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-180542023-07-07T17:23:43Z Speech emotion analysis Poh, Esther Ee Chen. Ser Wee School of Electrical and Electronic Engineering Centre for Signal Processing DRNTU::Engineering::Electrical and electronic engineering Emotion recognition in speech has become increasingly important mainly due to the prevailing usage of speech in human-computer interaction. However, the studies done in the last century are still not enough to achieve satisfactory accuracies for the speech emotion recognition system. Therefore, this project seeks to investigate the performance of a speech recognition system implemented using the Probabilistic Neural Networks (PNN) under various conditions and parameters. A speaker-gender independent, gender dependent and speaker dependent system were performed using four different feature set namely Pitch and Energy, formants, LPCC and MFCC. In addition, difference in the effect of using a 15-emotion set and a 5-emotion set were also investigated in this project. Experimental results showed that a speaker-dependent system produces higher accuracy as the system is able to account for variability existing between different speakers. It also shows that MFCC outperforms the rest of the speech features in terms of accuracy but lowest performance in terms of speed. Also, a dataset with less emotion classes also proved to improve the accuracies significantly as compared to a larger emotion set due to the reduction in sources of confusion between various emotional states. Bachelor of Engineering 2009-06-19T03:32:38Z 2009-06-19T03:32:38Z 2009 2009 Final Year Project (FYP) http://hdl.handle.net/10356/18054 en Nanyang Technological University 118 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Poh, Esther Ee Chen. Speech emotion analysis |
description |
Emotion recognition in speech has become increasingly important mainly due to the prevailing usage of speech in human-computer interaction. However, the studies done in the last century are still not enough to achieve satisfactory accuracies for the speech emotion recognition system. Therefore, this project seeks to investigate the performance of a speech recognition system implemented using the Probabilistic Neural Networks (PNN) under various conditions and parameters. A speaker-gender independent, gender dependent and speaker dependent system were performed using four different feature set namely Pitch and Energy, formants, LPCC and MFCC. In addition, difference in the effect of using a 15-emotion set and a 5-emotion set were also investigated in this project.
Experimental results showed that a speaker-dependent system produces higher accuracy as the system is able to account for variability existing between different speakers. It also shows that MFCC outperforms the rest of the speech features in terms of accuracy but lowest performance in terms of speed. Also, a dataset with less emotion classes also proved to improve the accuracies significantly as compared to a larger emotion set due to the reduction in sources of confusion between various emotional states. |
author2 |
Ser Wee |
author_facet |
Ser Wee Poh, Esther Ee Chen. |
format |
Final Year Project |
author |
Poh, Esther Ee Chen. |
author_sort |
Poh, Esther Ee Chen. |
title |
Speech emotion analysis |
title_short |
Speech emotion analysis |
title_full |
Speech emotion analysis |
title_fullStr |
Speech emotion analysis |
title_full_unstemmed |
Speech emotion analysis |
title_sort |
speech emotion analysis |
publishDate |
2009 |
url |
http://hdl.handle.net/10356/18054 |
_version_ |
1772827152607084544 |