Articulatory phonetic features for improved speech recognition
This thesis elaborates the use of speech production knowledge in the form of articulatory phonetic features to improve the robustness of speech recognition in practical situations. The main concept is that natural speech has three attributes in the human speech processing system, i.e., the motor act...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2013
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/53915 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-53915 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-539152023-07-04T16:14:36Z Articulatory phonetic features for improved speech recognition Huang, Guangpu. Er Meng Joo School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing This thesis elaborates the use of speech production knowledge in the form of articulatory phonetic features to improve the robustness of speech recognition in practical situations. The main concept is that natural speech has three attributes in the human speech processing system, i.e., the motor activation, the articulatory trajectory, and the auditory perception. Consequently, the research work has three components. First, it describes an adaptive neural control model, which reproduces the articulatory trajectories and retrieves the motor activation patterns in a bio-mechanical speech synthesizer. Second, by manipulating the elastic vocal tract walls, the synthesizer produces the overall articulatory-to-acoustic trajectory map for English pronunciations. Third, the articulatory phonetic features are extracted in neural networks for speech recognition in cross-speaker and noisy conditions. The experimental results are compared with the traditional hidden Markov baseline system. Doctor of Philosophy (EEE) 2013-06-10T04:12:20Z 2013-06-10T04:12:20Z 2012 2012 Thesis http://hdl.handle.net/10356/53915 en 154 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Huang, Guangpu. Articulatory phonetic features for improved speech recognition |
description |
This thesis elaborates the use of speech production knowledge in the form of articulatory phonetic features to improve the robustness of speech recognition in practical situations. The main concept is that natural speech has three attributes in the human speech processing system, i.e., the motor activation, the articulatory trajectory, and the auditory perception. Consequently, the research work has three components. First, it describes an adaptive neural control model, which reproduces the articulatory trajectories and retrieves the motor activation patterns in a bio-mechanical speech synthesizer. Second, by manipulating the elastic vocal tract walls, the synthesizer produces the overall articulatory-to-acoustic trajectory map for English pronunciations. Third, the articulatory phonetic features are extracted in neural networks for speech recognition in cross-speaker and noisy conditions. The experimental results are compared with the traditional hidden Markov baseline system. |
author2 |
Er Meng Joo |
author_facet |
Er Meng Joo Huang, Guangpu. |
format |
Theses and Dissertations |
author |
Huang, Guangpu. |
author_sort |
Huang, Guangpu. |
title |
Articulatory phonetic features for improved speech recognition |
title_short |
Articulatory phonetic features for improved speech recognition |
title_full |
Articulatory phonetic features for improved speech recognition |
title_fullStr |
Articulatory phonetic features for improved speech recognition |
title_full_unstemmed |
Articulatory phonetic features for improved speech recognition |
title_sort |
articulatory phonetic features for improved speech recognition |
publishDate |
2013 |
url |
http://hdl.handle.net/10356/53915 |
_version_ |
1772827392897712128 |