Modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients

This paper is about the creation of an artificial neural network (ANN) in MATLAB to analyze the features extracted from calculating the mel-frequency cepstral coefficients (MFCC) of the raw audio data. The paper explains basic concepts about the ANN, as well as the MFCC and other relevant theories....

Full description

Saved in:
Bibliographic Details
Main Authors: Bandala, Argel A., Lim, Allimzon M., Cai, Mark Anthony D., Bacar, Allan Jeffrey C., Mañosca, Aynna Claudine G.
Format: text
Published: Animo Repository 2015
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/1359
https://animorepository.dlsu.edu.ph/context/faculty_research/article/2358/type/native/viewcontent
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-2358
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-23582021-06-23T02:25:40Z Modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients Bandala, Argel A. Lim, Allimzon M. Cai, Mark Anthony D. Bacar, Allan Jeffrey C. Mañosca, Aynna Claudine G. This paper is about the creation of an artificial neural network (ANN) in MATLAB to analyze the features extracted from calculating the mel-frequency cepstral coefficients (MFCC) of the raw audio data. The paper explains basic concepts about the ANN, as well as the MFCC and other relevant theories. Regarding the design of the ANN, it uses multiple infant crying sounds, as well as non-crying sounds, to create a sample training set with a corresponding target that determines whether the sound is a cry or not. The paper uses relevant concepts heavily utilized in speech recognition for the design of the infant cry recognition, modifies them, and adds a few more calculations to fit the desired application to compensate for the differences present in a cry from human speech. © 2014 IEEE. 2015-01-26T08:00:00Z text text/html https://animorepository.dlsu.edu.ph/faculty_research/1359 https://animorepository.dlsu.edu.ph/context/faculty_research/article/2358/type/native/viewcontent Faculty Research Work Animo Repository Auditory perception Computational auditory scene analysis Electrical and Electronics Systems and Communications
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Auditory perception
Computational auditory scene analysis
Electrical and Electronics
Systems and Communications
spellingShingle Auditory perception
Computational auditory scene analysis
Electrical and Electronics
Systems and Communications
Bandala, Argel A.
Lim, Allimzon M.
Cai, Mark Anthony D.
Bacar, Allan Jeffrey C.
Mañosca, Aynna Claudine G.
Modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients
description This paper is about the creation of an artificial neural network (ANN) in MATLAB to analyze the features extracted from calculating the mel-frequency cepstral coefficients (MFCC) of the raw audio data. The paper explains basic concepts about the ANN, as well as the MFCC and other relevant theories. Regarding the design of the ANN, it uses multiple infant crying sounds, as well as non-crying sounds, to create a sample training set with a corresponding target that determines whether the sound is a cry or not. The paper uses relevant concepts heavily utilized in speech recognition for the design of the infant cry recognition, modifies them, and adds a few more calculations to fit the desired application to compensate for the differences present in a cry from human speech. © 2014 IEEE.
format text
author Bandala, Argel A.
Lim, Allimzon M.
Cai, Mark Anthony D.
Bacar, Allan Jeffrey C.
Mañosca, Aynna Claudine G.
author_facet Bandala, Argel A.
Lim, Allimzon M.
Cai, Mark Anthony D.
Bacar, Allan Jeffrey C.
Mañosca, Aynna Claudine G.
author_sort Bandala, Argel A.
title Modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients
title_short Modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients
title_full Modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients
title_fullStr Modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients
title_full_unstemmed Modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients
title_sort modelling and characterization of an artificial neural network for infant cry recognition using mel-frequency cepstral coefficients
publisher Animo Repository
publishDate 2015
url https://animorepository.dlsu.edu.ph/faculty_research/1359
https://animorepository.dlsu.edu.ph/context/faculty_research/article/2358/type/native/viewcontent
_version_ 1703981016520065024