Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN

Stress classification involves categorizing an individual's perceived stress state. One approach involves analyzing human speech due to its non-invasive nature, offering advantages over traditional methods requiring intrusive procedures. Presently, two main types of datasets are used in th...

Full description

Saved in:
Bibliographic Details
Main Authors: Zainal, Nur Aishah, Asnawi, Ani Liza
Format: Proceeding Paper
Language:English
English
Published: IEEE 2024
Subjects:
Online Access:http://irep.iium.edu.my/114504/7/114504_Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf
http://irep.iium.edu.my/114504/1/Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf
http://irep.iium.edu.my/114504/
https://ieeexplore.ieee.org/abstract/document/10652270
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Islam Antarabangsa Malaysia
Language: English
English
id my.iium.irep.114504
record_format dspace
spelling my.iium.irep.1145042024-09-20T02:43:00Z http://irep.iium.edu.my/114504/ Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN Zainal, Nur Aishah Asnawi, Ani Liza TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices TK7885 Computer engineering Stress classification involves categorizing an individual's perceived stress state. One approach involves analyzing human speech due to its non-invasive nature, offering advantages over traditional methods requiring intrusive procedures. Presently, two main types of datasets are used in this research field: scripted and unscripted. Scripted datasets feature staged performances by actors depicting emotions, while unscripted datasets capture natural reactions, though acquiring them poses challenges and requires collaboration with experts. Convolutional Neural Networks (CNNs) have been favored for stress classification, but they require substantial data points per class. Alternatively, traditional machine learning classifiers have shown promising with smaller datasets, though their accuracy rates often fall short. This study fused two scripted datasets, RAVDESS and TESS, to enhance stress classification. Utilizing Mel-frequency Cepstral Coefficients (MFCCs) alongside CNNs proved vital in highlighting stress attributes for effective classification with 94.5% accuracy and surpassed the previous studies. IEEE 2024-09-05 Proceeding Paper PeerReviewed application/pdf en http://irep.iium.edu.my/114504/7/114504_Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf application/pdf en http://irep.iium.edu.my/114504/1/Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf Zainal, Nur Aishah and Asnawi, Ani Liza (2024) Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN. In: 2024 9th International Conferences on Mechatronics (ICOM), 13-14 August 2024, Kuala Lumpur. https://ieeexplore.ieee.org/abstract/document/10652270
institution Universiti Islam Antarabangsa Malaysia
building IIUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider International Islamic University Malaysia
content_source IIUM Repository (IREP)
url_provider http://irep.iium.edu.my/
language English
English
topic TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices
TK7885 Computer engineering
spellingShingle TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices
TK7885 Computer engineering
Zainal, Nur Aishah
Asnawi, Ani Liza
Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN
description Stress classification involves categorizing an individual's perceived stress state. One approach involves analyzing human speech due to its non-invasive nature, offering advantages over traditional methods requiring intrusive procedures. Presently, two main types of datasets are used in this research field: scripted and unscripted. Scripted datasets feature staged performances by actors depicting emotions, while unscripted datasets capture natural reactions, though acquiring them poses challenges and requires collaboration with experts. Convolutional Neural Networks (CNNs) have been favored for stress classification, but they require substantial data points per class. Alternatively, traditional machine learning classifiers have shown promising with smaller datasets, though their accuracy rates often fall short. This study fused two scripted datasets, RAVDESS and TESS, to enhance stress classification. Utilizing Mel-frequency Cepstral Coefficients (MFCCs) alongside CNNs proved vital in highlighting stress attributes for effective classification with 94.5% accuracy and surpassed the previous studies.
format Proceeding Paper
author Zainal, Nur Aishah
Asnawi, Ani Liza
author_facet Zainal, Nur Aishah
Asnawi, Ani Liza
author_sort Zainal, Nur Aishah
title Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN
title_short Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN
title_full Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN
title_fullStr Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN
title_full_unstemmed Enhancing stress speech classification through the fusion of emotional datasets utilizing MFCCs with CNN
title_sort enhancing stress speech classification through the fusion of emotional datasets utilizing mfccs with cnn
publisher IEEE
publishDate 2024
url http://irep.iium.edu.my/114504/7/114504_Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf
http://irep.iium.edu.my/114504/1/Enhancing%20Stress%20Speech%20Classification%20Through%20the%20Fusion%20of%20Emotional%20Datasets%20Utilizing%20MFCCs%20with%20CNN.pdf
http://irep.iium.edu.my/114504/
https://ieeexplore.ieee.org/abstract/document/10652270
_version_ 1811679648733462528