Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA

In this study, spectral subtraction speech enhancement is integrated to a two word vocabulary speech recognition system to effectively reduce the effects of background noise and increase the recognition rate. The whole system was implemented in FPGA and was modelled in MATLAB. The preprocessing subs...

Full description

Saved in:

Bibliographic Details
Main Authors:	Orillo, John William, Yap, Roderick, Sybingco, Edwin
Format:	text
Published:	Animo Repository 2012
Subjects:	Automatic speech recognition Noise control Electrical and Electronics
Online Access:	https://animorepository.dlsu.edu.ph/faculty_research/501 https://animorepository.dlsu.edu.ph/context/faculty_research/article/1500/type/native/viewcontent
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	De La Salle University

id	oai:animorepository.dlsu.edu.ph:faculty_research-1500
record_format	eprints
spelling	oai:animorepository.dlsu.edu.ph:faculty_research-15002021-12-13T01:07:56Z Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA Orillo, John William Yap, Roderick Sybingco, Edwin In this study, spectral subtraction speech enhancement is integrated to a two word vocabulary speech recognition system to effectively reduce the effects of background noise and increase the recognition rate. The whole system was implemented in FPGA and was modelled in MATLAB. The preprocessing subsystem contains the spectral subtraction algorithm and acoustic front end speech enhancements while the speech recognition subsystem contains the HMM and Viterbi search algorithms. 10 dirty speech samples of word 'stop' and 'clockwise' (sampled at 84 dB) were tested in the speech recognition prototype with varying background noise from 44.6 to 85.4 dB and noise floor (β) from 0.01 to 1. At the end of the testing, the system was able to recognize the two words (stop and clockwise) efficiently with accuracy rate of above 80% until a background noise of 68.6 dB. The best average recognition rate (from 44.6 to 85.4 dB background noise) of 48.5% on the other hand was recorded at 0.01 noise floor. The system without spectral subtraction enhancement was noticed to function efficiently only at 56.6 dB. © 2012 IEEE. 2012-12-01T08:00:00Z text text/html https://animorepository.dlsu.edu.ph/faculty_research/501 https://animorepository.dlsu.edu.ph/context/faculty_research/article/1500/type/native/viewcontent Faculty Research Work Animo Repository Automatic speech recognition Noise control Electrical and Electronics
institution	De La Salle University
building	De La Salle University Library
continent	Asia
country	Philippines Philippines
content_provider	De La Salle University Library
collection	DLSU Institutional Repository
topic	Automatic speech recognition Noise control Electrical and Electronics
spellingShingle	Automatic speech recognition Noise control Electrical and Electronics Orillo, John William Yap, Roderick Sybingco, Edwin Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA
description	In this study, spectral subtraction speech enhancement is integrated to a two word vocabulary speech recognition system to effectively reduce the effects of background noise and increase the recognition rate. The whole system was implemented in FPGA and was modelled in MATLAB. The preprocessing subsystem contains the spectral subtraction algorithm and acoustic front end speech enhancements while the speech recognition subsystem contains the HMM and Viterbi search algorithms. 10 dirty speech samples of word 'stop' and 'clockwise' (sampled at 84 dB) were tested in the speech recognition prototype with varying background noise from 44.6 to 85.4 dB and noise floor (β) from 0.01 to 1. At the end of the testing, the system was able to recognize the two words (stop and clockwise) efficiently with accuracy rate of above 80% until a background noise of 68.6 dB. The best average recognition rate (from 44.6 to 85.4 dB background noise) of 48.5% on the other hand was recorded at 0.01 noise floor. The system without spectral subtraction enhancement was noticed to function efficiently only at 56.6 dB. © 2012 IEEE.
format	text
author	Orillo, John William Yap, Roderick Sybingco, Edwin
author_facet	Orillo, John William Yap, Roderick Sybingco, Edwin
author_sort	Orillo, John William
title	Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA
title_short	Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA
title_full	Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA
title_fullStr	Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA
title_full_unstemmed	Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA
title_sort	improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in fpga
publisher	Animo Repository
publishDate	2012
url	https://animorepository.dlsu.edu.ph/faculty_research/501 https://animorepository.dlsu.edu.ph/context/faculty_research/article/1500/type/native/viewcontent
_version_	1719000561012965376

Improved noise robust automatic speech recognition system with spectral subtraction and minimum statistics algorithm implemented in FPGA

Similar Items