Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN
Stress classification involves categorizing an individual's perceived stress state. One approach involves analyzing human speech due to its non-invasive nature, offering advantages over traditional methods requiring intrusive procedures. Presently, two main types of datasets are used in th...
Saved in:
Main Authors: | , |
---|---|
Format: | Proceeding Paper |
Language: | English English |
Published: |
IEEE
2024
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/114504/7/114504_Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf http://irep.iium.edu.my/114504/1/Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf http://irep.iium.edu.my/114504/ https://ieeexplore.ieee.org/abstract/document/10652270 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Islam Antarabangsa Malaysia |
Language: | English English |
id |
my.iium.irep.114504 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.1145042024-09-20T02:43:00Z http://irep.iium.edu.my/114504/ Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN Zainal, Nur Aishah Asnawi, Ani Liza TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices TK7885 Computer engineering Stress classification involves categorizing an individual's perceived stress state. One approach involves analyzing human speech due to its non-invasive nature, offering advantages over traditional methods requiring intrusive procedures. Presently, two main types of datasets are used in this research field: scripted and unscripted. Scripted datasets feature staged performances by actors depicting emotions, while unscripted datasets capture natural reactions, though acquiring them poses challenges and requires collaboration with experts. Convolutional Neural Networks (CNNs) have been favored for stress classification, but they require substantial data points per class. Alternatively, traditional machine learning classifiers have shown promising with smaller datasets, though their accuracy rates often fall short. This study fused two scripted datasets, RAVDESS and TESS, to enhance stress classification. Utilizing Mel-frequency Cepstral Coefficients (MFCCs) alongside CNNs proved vital in highlighting stress attributes for effective classification with 94.5% accuracy and surpassed the previous studies. IEEE 2024-09-05 Proceeding Paper PeerReviewed application/pdf en http://irep.iium.edu.my/114504/7/114504_Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf application/pdf en http://irep.iium.edu.my/114504/1/Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf Zainal, Nur Aishah and Asnawi, Ani Liza (2024) Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN. In: 2024 9th International Conferences on Mechatronics (ICOM), 13-14 August 2024, Kuala Lumpur. https://ieeexplore.ieee.org/abstract/document/10652270 |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English English |
topic |
TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices TK7885 Computer engineering |
spellingShingle |
TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices TK7885 Computer engineering Zainal, Nur Aishah Asnawi, Ani Liza Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN |
description |
Stress classification involves categorizing an
individual's perceived stress state. One approach involves
analyzing human speech due to its non-invasive nature, offering
advantages over traditional methods requiring intrusive
procedures. Presently, two main types of datasets are used in
this research field: scripted and unscripted. Scripted datasets
feature staged performances by actors depicting emotions, while
unscripted datasets capture natural reactions, though acquiring
them poses challenges and requires collaboration with experts.
Convolutional Neural Networks (CNNs) have been favored for
stress classification, but they require substantial data points per
class. Alternatively, traditional machine learning classifiers
have shown promising with smaller datasets, though their
accuracy rates often fall short. This study fused two scripted
datasets, RAVDESS and TESS, to enhance stress classification.
Utilizing Mel-frequency Cepstral Coefficients (MFCCs)
alongside CNNs proved vital in highlighting stress attributes for
effective classification with 94.5% accuracy and surpassed the
previous studies. |
format |
Proceeding Paper |
author |
Zainal, Nur Aishah Asnawi, Ani Liza |
author_facet |
Zainal, Nur Aishah Asnawi, Ani Liza |
author_sort |
Zainal, Nur Aishah |
title |
Enhancing stress speech classification through the
fusion of emotional datasets utilizing MFCCs with
CNN |
title_short |
Enhancing stress speech classification through the
fusion of emotional datasets utilizing MFCCs with
CNN |
title_full |
Enhancing stress speech classification through the
fusion of emotional datasets utilizing MFCCs with
CNN |
title_fullStr |
Enhancing stress speech classification through the
fusion of emotional datasets utilizing MFCCs with
CNN |
title_full_unstemmed |
Enhancing stress speech classification through the
fusion of emotional datasets utilizing MFCCs with
CNN |
title_sort |
enhancing stress speech classification through the
fusion of emotional datasets utilizing mfccs with
cnn |
publisher |
IEEE |
publishDate |
2024 |
url |
http://irep.iium.edu.my/114504/7/114504_Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf http://irep.iium.edu.my/114504/1/Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf http://irep.iium.edu.my/114504/ https://ieeexplore.ieee.org/abstract/document/10652270 |
_version_ |
1811679648733462528 |