Evaluation of sound-event classification for robot hearing
Over the years, there have been numerous ways of carrying out the process of sound event classification. Some of these known methods consists of the use of the spectrogram, a time-frequency spectral analysis that depicts the magnitude spectrum of the signal within a 2D time-frequency plane. Thoug...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/77561 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Over the years, there have been numerous ways of carrying out the process of sound
event classification. Some of these known methods consists of the use of the
spectrogram, a time-frequency spectral analysis that depicts the magnitude spectrum
of the signal within a 2D time-frequency plane. Though useful, there could have been
further improvements that could have been made. For example, what if the phase
spectrum was included as well? Would it have been able to provide more
distinguishable features to the sound for effective sound classification? Therefore, in
this Final Year Project, an evaluation will be carried out for a process called
“Regularized 2D complex-log-Fourier transform”, which was originally proposed by
Jiang Xudong and Ren Jianfeng. In the “Regularized 2D complex-log-Fourier
transform”, both magnitude and phase spectrums of the signal will be utilized for the
sound event classification. In addition, the Principal Component Analysis, PCA, will
be used to facilitate in the removal of unnecessary features in the samples which might
pose an issue during the classification process. Finally, the method chosen for the
sound classification is to calculate “Mahalanobis Distance” between the test samples
and the reference data of the training samples. |
---|