Classification of asphyxia infant cry using hybrid speech features and deep learning models

Most studies that classify asphyxia cry among infants have used a single speech feature, such as the Mel-Frequency Cepstral Coefficient (MFCC). Other speech features, such as Chromagram, Mel-scaled Spectrogram, Spectral Contrast and Tonnetz, have not been reported in any study related to the clas...

Bibliographic Details
Main Authors:	Ting, Hua-Nong, Choo, Yao-Mun, Kamar, Azanna Ahmad
Format:	Article
Published:	Elsevier 2022
Subjects:	TA Engineering (General). Civil engineering (General)
Online Access:	http://eprints.um.edu.my/40947/
Institution:	Universiti Malaya

id	my.um.eprints.40947
record_format	eprints
spelling	my.um.eprints.409472023-08-28T03:00:57Z http://eprints.um.edu.my/40947/ Classification of asphyxia infant cry using hybrid speech features and deep learning models Ting, Hua-Nong Choo, Yao-Mun Kamar, Azanna Ahmad TA Engineering (General). Civil engineering (General) Single speech feature such as Mel-Frequency Cepstral Coefficient (MFCC) has been used in most of the studies to classify asphyxia cry among infants. Other speech features such as Chromagram, Mel-scaled Spectrogram, Spectral Contrast and Tonnetz have not been reported in any study related to the classification of asphyxia cry. The study investigated the use of hybrid features of MFCC, Chromagram, Mel-scaled Spectrogram, Spectral Contrast and Tonnetz and deep learning models in classifying asphyxia cry. Deep learning models such as Deep Neural Network (DNN) and Convolutional Neural Network (CNN) were used to classify infant cry between normal/non-asphyxia and asphyxia. The performance of the deep learning models was compared using concatenated hybrid features and single feature of MFCC. The Baby Chillanto Database was used in this study. CNN model performed better than DNN models when MFCC was used. DNN models performed better with hybrid features compared to that with single feature of MFCC. DNN with multiple hidden layers achieved an accuracy of 100% in classifying normal and asphyxia cry, and 99.96% for non-asphyxia and asphyxia cry when the hybrid features were used. Elsevier 2022-12 Article PeerReviewed Ting, Hua-Nong and Choo, Yao-Mun and Kamar, Azanna Ahmad (2022) Classification of asphyxia infant cry using hybrid speech features and deep learning models. Expert Systems with Applications, 208. ISSN 0957-4174, DOI https://doi.org/10.1016/j.eswa.2022.118064 <https://doi.org/10.1016/j.eswa.2022.118064>. 10.1016/j.eswa.2022.118064
institution	Universiti Malaya
building	UM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaya
content_source	UM Research Repository
url_provider	http://eprints.um.edu.my/
topic	TA Engineering (General). Civil engineering (General)
spellingShingle	TA Engineering (General). Civil engineering (General) Ting, Hua-Nong Choo, Yao-Mun Kamar, Azanna Ahmad Classification of asphyxia infant cry using hybrid speech features and deep learning models
description	Most studies that classify asphyxia cry among infants have used a single speech feature, such as the Mel-Frequency Cepstral Coefficient (MFCC). Other speech features, such as Chromagram, Mel-scaled Spectrogram, Spectral Contrast and Tonnetz, have not been reported in any study related to the classification of asphyxia cry. This study investigated the use of hybrid features (MFCC, Chromagram, Mel-scaled Spectrogram, Spectral Contrast and Tonnetz) together with deep learning models in classifying asphyxia cry. Deep learning models such as the Deep Neural Network (DNN) and the Convolutional Neural Network (CNN) were used to distinguish asphyxia cry from normal or non-asphyxia cry. The performance of the deep learning models was compared between the concatenated hybrid features and the single MFCC feature. The Baby Chillanto Database was used in this study. The CNN model performed better than the DNN models when MFCC was used. The DNN models performed better with the hybrid features than with the single MFCC feature. With the hybrid features, a DNN with multiple hidden layers achieved an accuracy of 100% in classifying normal and asphyxia cry, and 99.96% in classifying non-asphyxia and asphyxia cry.
format	Article
author	Ting, Hua-Nong Choo, Yao-Mun Kamar, Azanna Ahmad
author_facet	Ting, Hua-Nong Choo, Yao-Mun Kamar, Azanna Ahmad
author_sort	Ting, Hua-Nong
title	Classification of asphyxia infant cry using hybrid speech features and deep learning models
title_short	Classification of asphyxia infant cry using hybrid speech features and deep learning models
title_full	Classification of asphyxia infant cry using hybrid speech features and deep learning models
title_fullStr	Classification of asphyxia infant cry using hybrid speech features and deep learning models
title_full_unstemmed	Classification of asphyxia infant cry using hybrid speech features and deep learning models
title_sort	classification of asphyxia infant cry using hybrid speech features and deep learning models
publisher	Elsevier
publishDate	2022
url	http://eprints.um.edu.my/40947/
_version_	1776247421956784128
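
As a rough illustration of the "concatenated hybrid features" named in the description, the sketch below averages each feature over time and joins the results into one vector per cry sample. The per-feature sizes (40, 12, 128, 7, 6), the time-averaging step, the helper names and the random placeholder data are all assumptions made for illustration; the record does not state how the authors built their feature vectors, and real inputs would come from an audio feature extractor.

```python
import random

# Assumed number of coefficients per feature (hypothetical, not from the record).
FEATURE_DIMS = {
    "mfcc": 40,
    "chroma": 12,
    "mel": 128,
    "contrast": 7,
    "tonnetz": 6,
}


def time_average(frames):
    """Average each row of a (coefficients x frames) matrix over its frames."""
    return [sum(row) / len(row) for row in frames]


def hybrid_vector(features):
    """Concatenate the time-averaged features in a fixed order."""
    vector = []
    for name, dim in FEATURE_DIMS.items():
        averaged = time_average(features[name])
        if len(averaged) != dim:
            raise ValueError(f"{name}: expected {dim} rows, got {len(averaged)}")
        vector.extend(averaged)
    return vector


def placeholder_features(n_frames, seed=0):
    """Random stand-in matrices; a real pipeline would extract these from audio."""
    rng = random.Random(seed)
    return {
        name: [[rng.gauss(0.0, 1.0) for _ in range(n_frames)] for _ in range(dim)]
        for name, dim in FEATURE_DIMS.items()
    }


features = placeholder_features(n_frames=50)
hybrid = hybrid_vector(features)            # input for the hybrid-feature case
mfcc_only = time_average(features["mfcc"])  # input for the MFCC-only case
print(len(mfcc_only), len(hybrid))
```

Under these assumed sizes the MFCC-only input has 40 values and the hybrid input has 193, so a classifier trained on the hybrid vector sees the MFCC values plus the four additional feature groups.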

Similar Items