A smart eavesdropping system : recognizing keywords by human subjects
Speech Recognition (SR) gains its popularity in research area as the advance of modern technologies. It can translate speech into text with the aid of computers and speech recognition applications. In this final year project, an open source speech recognition engine named Pocketsphinx from Carnegie...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2015
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/63768 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-63768 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-637682023-07-07T16:50:36Z A smart eavesdropping system : recognizing keywords by human subjects Zhang, Chun Meng Khong Andy Wai Hoong School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Speech Recognition (SR) gains its popularity in research area as the advance of modern technologies. It can translate speech into text with the aid of computers and speech recognition applications. In this final year project, an open source speech recognition engine named Pocketsphinx from Carnegie Mellon University (CMU) is integrated into our existing Eavesdropping System to perform speech recognition or keyword spotting tasks on the output audio files from the system. This report covers the details on the development of whole speech recognition framework assembled. Pocketsphinx is compiled and installed in a Linux server remotely and communicates with the client Matlab programs using network sockets. System parameters are carefully tuned to ensure the performance. Experiments on different combinations of Acoustic Models (AM) and Language Models (LM) are also conducted and evaluated. Acoustic model adaptation which adapts the speech recognizer into specific acoustic environment or speaker to enhance the recognition performance is also presented. Furthermore, another commercially available speech recognition application named Dragon Naturally Speaking (DNS) 12 is also experimented and compared with Pocketsphinx used in the system. Bachelor of Engineering 2015-05-19T02:26:48Z 2015-05-19T02:26:48Z 2015 2015 Final Year Project (FYP) http://hdl.handle.net/10356/63768 en Nanyang Technological University 62 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Zhang, Chun Meng A smart eavesdropping system : recognizing keywords by human subjects |
description |
Speech Recognition (SR) gains its popularity in research area as the advance of modern technologies. It can translate speech into text with the aid of computers and speech recognition applications. In this final year project, an open source speech recognition engine named Pocketsphinx from Carnegie Mellon University (CMU) is integrated into our existing Eavesdropping System to perform speech recognition or keyword spotting tasks on the output audio files from the system. This report covers the details on the development of whole speech recognition framework assembled. Pocketsphinx is compiled and installed in a Linux server remotely and communicates with the client Matlab programs using network sockets. System parameters are carefully tuned to ensure the performance. Experiments on different combinations of Acoustic Models (AM) and Language Models (LM) are also conducted and evaluated. Acoustic model adaptation which adapts the speech recognizer into specific acoustic environment or speaker to enhance the recognition performance is also presented. Furthermore, another commercially available speech recognition application named Dragon Naturally Speaking (DNS) 12 is also experimented and compared with Pocketsphinx used in the system. |
author2 |
Khong Andy Wai Hoong |
author_facet |
Khong Andy Wai Hoong Zhang, Chun Meng |
format |
Final Year Project |
author |
Zhang, Chun Meng |
author_sort |
Zhang, Chun Meng |
title |
A smart eavesdropping system : recognizing keywords by human subjects |
title_short |
A smart eavesdropping system : recognizing keywords by human subjects |
title_full |
A smart eavesdropping system : recognizing keywords by human subjects |
title_fullStr |
A smart eavesdropping system : recognizing keywords by human subjects |
title_full_unstemmed |
A smart eavesdropping system : recognizing keywords by human subjects |
title_sort |
smart eavesdropping system : recognizing keywords by human subjects |
publishDate |
2015 |
url |
http://hdl.handle.net/10356/63768 |
_version_ |
1772826677017051136 |