Model-based articulatory phonetic features for improved speech recognition

We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of a...

Full description

Saved in:
Bibliographic Details
Main Authors: Huang, Guangpu, Er, Meng Joo
Other Authors: School of Electrical and Electronic Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/97868
http://hdl.handle.net/10220/12393
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-97868
record_format dspace
spelling sg-ntu-dr.10356-978682020-03-07T13:24:48Z Model-based articulatory phonetic features for improved speech recognition Huang, Guangpu Er, Meng Joo School of Electrical and Electronic Engineering International Joint Conference on Neural Networks (2012 : Brisbane, Australia) DRNTU::Engineering::Electrical and electronic engineering We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping. 2013-07-26T06:36:35Z 2019-12-06T19:47:31Z 2013-07-26T06:36:35Z 2019-12-06T19:47:31Z 2012 2012 Conference Paper Huang, G., & Er, M. J. (2012). Model-based articulatory phonetic features for improved speech recognition. The 2012 International Joint Conference on Neural Networks (IJCNN). https://hdl.handle.net/10356/97868 http://hdl.handle.net/10220/12393 10.1109/IJCNN.2012.6252748 en © 2012 IEEE.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Huang, Guangpu
Er, Meng Joo
Model-based articulatory phonetic features for improved speech recognition
description We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Huang, Guangpu
Er, Meng Joo
format Conference or Workshop Item
author Huang, Guangpu
Er, Meng Joo
author_sort Huang, Guangpu
title Model-based articulatory phonetic features for improved speech recognition
title_short Model-based articulatory phonetic features for improved speech recognition
title_full Model-based articulatory phonetic features for improved speech recognition
title_fullStr Model-based articulatory phonetic features for improved speech recognition
title_full_unstemmed Model-based articulatory phonetic features for improved speech recognition
title_sort model-based articulatory phonetic features for improved speech recognition
publishDate 2013
url https://hdl.handle.net/10356/97868
http://hdl.handle.net/10220/12393
_version_ 1681042052472635392