Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset

Stress is an interaction between individuals and their environment, where perceived threats can lead to serious consequences if prolonged and consistently linked to adverse physical and mental health outcomes. Our study explores methods for stress classification via speech, utilizing an unscripted d...

Full description

Saved in:
Bibliographic Details
Main Authors: Zainal, Nur Aishah, Asnawi, Ani Liza, Jusoh, Ahmad Zamani, Ibrahim, Siti Noorjannah, Mohd. Ramli, Huda Adibah
Format: Article
Language:English
Published: IIUM Press 2024
Subjects:
Online Access:http://irep.iium.edu.my/114502/7/114502_Integration%20of%20MFCCS%20and%20CNN.pdf
http://irep.iium.edu.my/114502/
https://journals.iium.edu.my/ejournal/index.php/iiumej/article/view/3207/1007
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Islam Antarabangsa Malaysia
Language: English
id my.iium.irep.114502
record_format dspace
spelling my.iium.irep.1145022024-09-19T01:25:19Z http://irep.iium.edu.my/114502/ Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset Zainal, Nur Aishah Asnawi, Ani Liza Jusoh, Ahmad Zamani Ibrahim, Siti Noorjannah Mohd. Ramli, Huda Adibah TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices TK7885 Computer engineering Stress is an interaction between individuals and their environment, where perceived threats can lead to serious consequences if prolonged and consistently linked to adverse physical and mental health outcomes. Our study explores methods for stress classification via speech, utilizing an unscripted dataset from an experimental study that was able to show the spontaneous reactions of stressed individuals. Mel-Frequency Cepstral Coefficients (MFCCs) emerge as promising speech features, adept at representing the power spectrum crucial to human auditory perception, especially in stress speech recognition. Leveraging deep learning technology, specifically Convolutional Neural Network (CNN), our research optimally combines speech features and CNN algorithms for stress classification. Despite the scarcity of publications on unscripted datasets and multi-class stress classifications, our study advocates their adoption, aiming to enhance performance metrics and contribute to research expansion. The proposed system shows that MFCCs achieve an accuracy of 95.67% in distinguishing among three stress classes (low-stress, medium-stress, and high-stress), surpassing the prior unscripted dataset study by 81.86%. This highlights the efficacy of the proposed MFCCs-CNN system in stress classification. IIUM Press 2024-07-14 Article PeerReviewed application/pdf en http://irep.iium.edu.my/114502/7/114502_Integration%20of%20MFCCS%20and%20CNN.pdf Zainal, Nur Aishah and Asnawi, Ani Liza and Jusoh, Ahmad Zamani and Ibrahim, Siti Noorjannah and Mohd. Ramli, Huda Adibah (2024) Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset. IIUM Egineering Journal, 25 (2). pp. 381-395. ISSN 1511-788X E-ISSN 2289-7860 https://journals.iium.edu.my/ejournal/index.php/iiumej/article/view/3207/1007
institution Universiti Islam Antarabangsa Malaysia
building IIUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider International Islamic University Malaysia
content_source IIUM Repository (IREP)
url_provider http://irep.iium.edu.my/
language English
topic TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices
TK7885 Computer engineering
spellingShingle TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices
TK7885 Computer engineering
Zainal, Nur Aishah
Asnawi, Ani Liza
Jusoh, Ahmad Zamani
Ibrahim, Siti Noorjannah
Mohd. Ramli, Huda Adibah
Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset
description Stress is an interaction between individuals and their environment, where perceived threats can lead to serious consequences if prolonged and consistently linked to adverse physical and mental health outcomes. Our study explores methods for stress classification via speech, utilizing an unscripted dataset from an experimental study that was able to show the spontaneous reactions of stressed individuals. Mel-Frequency Cepstral Coefficients (MFCCs) emerge as promising speech features, adept at representing the power spectrum crucial to human auditory perception, especially in stress speech recognition. Leveraging deep learning technology, specifically Convolutional Neural Network (CNN), our research optimally combines speech features and CNN algorithms for stress classification. Despite the scarcity of publications on unscripted datasets and multi-class stress classifications, our study advocates their adoption, aiming to enhance performance metrics and contribute to research expansion. The proposed system shows that MFCCs achieve an accuracy of 95.67% in distinguishing among three stress classes (low-stress, medium-stress, and high-stress), surpassing the prior unscripted dataset study by 81.86%. This highlights the efficacy of the proposed MFCCs-CNN system in stress classification.
format Article
author Zainal, Nur Aishah
Asnawi, Ani Liza
Jusoh, Ahmad Zamani
Ibrahim, Siti Noorjannah
Mohd. Ramli, Huda Adibah
author_facet Zainal, Nur Aishah
Asnawi, Ani Liza
Jusoh, Ahmad Zamani
Ibrahim, Siti Noorjannah
Mohd. Ramli, Huda Adibah
author_sort Zainal, Nur Aishah
title Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset
title_short Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset
title_full Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset
title_fullStr Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset
title_full_unstemmed Integration of MFCCS and CNN for multi-class stress speech classification on unscripted dataset
title_sort integration of mfccs and cnn for multi-class stress speech classification on unscripted dataset
publisher IIUM Press
publishDate 2024
url http://irep.iium.edu.my/114502/7/114502_Integration%20of%20MFCCS%20and%20CNN.pdf
http://irep.iium.edu.my/114502/
https://journals.iium.edu.my/ejournal/index.php/iiumej/article/view/3207/1007
_version_ 1811679648053985280