Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset
Stress is an interaction between individuals and their environment, where perceived threats can lead to serious consequences if prolonged and consistently linked to adverse physical and mental health outcomes. Our study explores methods for stress classification via speech, utilizing an unscripted d...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IIUM Press
2024
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/114502/7/114502_Integration%20of%20MFCCS%20and%20CNN.pdf http://irep.iium.edu.my/114502/ https://journals.iium.edu.my/ejournal/index.php/iiumej/article/view/3207/1007 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Islam Antarabangsa Malaysia |
Language: | English |
id |
my.iium.irep.114502 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.1145022024-09-19T01:25:19Z http://irep.iium.edu.my/114502/ Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset Zainal, Nur Aishah Asnawi, Ani Liza Jusoh, Ahmad Zamani Ibrahim, Siti Noorjannah Mohd. Ramli, Huda Adibah TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices TK7885 Computer engineering Stress is an interaction between individuals and their environment, where perceived threats can lead to serious consequences if prolonged and consistently linked to adverse physical and mental health outcomes. Our study explores methods for stress classification via speech, utilizing an unscripted dataset from an experimental study that was able to show the spontaneous reactions of stressed individuals. Mel-Frequency Cepstral Coefficients (MFCCs) emerge as promising speech features, adept at representing the power spectrum crucial to human auditory perception, especially in stress speech recognition. Leveraging deep learning technology, specifically Convolutional Neural Network (CNN), our research optimally combines speech features and CNN algorithms for stress classification. Despite the scarcity of publications on unscripted datasets and multi-class stress classifications, our study advocates their adoption, aiming to enhance performance metrics and contribute to research expansion. The proposed system shows that MFCCs achieve an accuracy of 95.67% in distinguishing among three stress classes (low-stress, medium-stress, and high-stress), surpassing the prior unscripted dataset study by 81.86%. This highlights the efficacy of the proposed MFCCs-CNN system in stress classification. IIUM Press 2024-07-14 Article PeerReviewed application/pdf en http://irep.iium.edu.my/114502/7/114502_Integration%20of%20MFCCS%20and%20CNN.pdf Zainal, Nur Aishah and Asnawi, Ani Liza and Jusoh, Ahmad Zamani and Ibrahim, Siti Noorjannah and Mohd. Ramli, Huda Adibah (2024) Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset. IIUM Egineering Journal, 25 (2). pp. 381-395. ISSN 1511-788X E-ISSN 2289-7860 https://journals.iium.edu.my/ejournal/index.php/iiumej/article/view/3207/1007 |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English |
topic |
TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices TK7885 Computer engineering |
spellingShingle |
TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices TK7885 Computer engineering Zainal, Nur Aishah Asnawi, Ani Liza Jusoh, Ahmad Zamani Ibrahim, Siti Noorjannah Mohd. Ramli, Huda Adibah Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset |
description |
Stress is an interaction between individuals and their environment, where perceived threats can lead to serious consequences if prolonged and consistently linked to adverse physical and mental health outcomes. Our study explores methods for stress classification via speech, utilizing an unscripted dataset from an experimental study that was able to show the spontaneous reactions of stressed individuals. Mel-Frequency Cepstral Coefficients (MFCCs) emerge as promising speech features, adept at representing the power spectrum crucial to human auditory perception, especially in stress speech recognition. Leveraging deep learning technology, specifically Convolutional Neural Network (CNN), our research optimally combines speech features and CNN algorithms for stress classification. Despite the scarcity of publications on unscripted datasets and multi-class stress classifications, our study advocates their adoption, aiming to enhance performance metrics and contribute to research expansion. The proposed system shows that MFCCs achieve an accuracy of 95.67% in distinguishing among three stress classes (low-stress, medium-stress, and high-stress), surpassing the prior unscripted dataset study by 81.86%. This highlights the efficacy of the proposed MFCCs-CNN system in stress classification. |
format |
Article |
author |
Zainal, Nur Aishah Asnawi, Ani Liza Jusoh, Ahmad Zamani Ibrahim, Siti Noorjannah Mohd. Ramli, Huda Adibah |
author_facet |
Zainal, Nur Aishah Asnawi, Ani Liza Jusoh, Ahmad Zamani Ibrahim, Siti Noorjannah Mohd. Ramli, Huda Adibah |
author_sort |
Zainal, Nur Aishah |
title |
Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset |
title_short |
Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset |
title_full |
Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset |
title_fullStr |
Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset |
title_full_unstemmed |
Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset |
title_sort |
integration of mfccs and cnn for multi-class stress speech classification on unscripted dataset |
publisher |
IIUM Press |
publishDate |
2024 |
url |
http://irep.iium.edu.my/114502/7/114502_Integration%20of%20MFCCS%20and%20CNN.pdf http://irep.iium.edu.my/114502/ https://journals.iium.edu.my/ejournal/index.php/iiumej/article/view/3207/1007 |
_version_ |
1811679648053985280 |