Recognition of visual speech elements using adaptively boosted hidden Markov models

The performance of automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of lips, tongue, and teeth, especially under noisy environment. In this paper, a novel approach for recognition of vis...

Full description

Saved in:

Bibliographic Details
Main Authors:	Foo, Say Wei, Lian, Yong, Dong, Liang
Format:	Article
Language:	English
Published:	2009
Online Access:	https://hdl.handle.net/10356/91495 http://hdl.handle.net/10220/4584 http://sfxna09.hosted.exlibrisgroup.com:3410/ntu/sfxlcl3?sid=metalib:EVII&id=doi:10.1109/TCSVT.2004.826773&genre=&isbn=&issn=10518215&date=2004&volume=14&issue=5&spage=693&epage=705&aulast=Foo&aufirst=%20Say%20Wei&auinit=&title=IEEE%20Transactions%20on%20Circuits%20and%20Systems%20for%20Video%20Technology&atitle=Recognition%20of%20visual%20speech%20elements%20using%20adaptively%20boosted%20hidden%20markov%20models
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-91495
record_format	dspace
spelling	sg-ntu-dr.10356-914952020-03-07T14:02:40Z Recognition of visual speech elements using adaptively boosted hidden Markov models Foo, Say Wei Lian, Yong Dong, Liang The performance of automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of lips, tongue, and teeth, especially under noisy environment. In this paper, a novel approach for recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov models (HMMs) to build an AdaBoost-HMM classifier. The composite HMMs of the AdaBoost-HMM classifier are trained to cover different groups of training samples using the AdaBoost technique and the biased Baum–Welch training method. By combining the decisions of the component classifiers of the composite HMMs according to a novel probability synthesis rule, a more complex decision boundary is formulated than using the single HMM classifier. The method is applied to the recognition of the basic visual speech elements. Experimental results show that the AdaBoost-HMM classifier outperforms the traditional HMM classifier in accuracy, especially for visemes extracted from contexts. Published version 2009-04-27T01:59:29Z 2019-12-06T18:06:40Z 2009-04-27T01:59:29Z 2019-12-06T18:06:40Z 2004 2004 Journal Article Foo, S. W., Lian, Y., & Dong, L. (2004). Recognition of visual speech elements using adaptively boosted hidden Markov models. IEEE Transactions on Circuits and Systems for Video Technology, 14(5), 693-705. 1051-8215 https://hdl.handle.net/10356/91495 http://hdl.handle.net/10220/4584 http://sfxna09.hosted.exlibrisgroup.com:3410/ntu/sfxlcl3?sid=metalib:EVII&id=doi:10.1109/TCSVT.2004.826773&genre=&isbn=&issn=10518215&date=2004&volume=14&issue=5&spage=693&epage=705&aulast=Foo&aufirst=%20Say%20Wei&auinit=&title=IEEE%20Transactions%20on%20Circuits%20and%20Systems%20for%20Video%20Technology&atitle=Recognition%20of%20visual%20speech%20elements%20using%20adaptively%20boosted%20hidden%20markov%20models 10.1109/TCSVT.2004.826773 en IEEE transactions on circuits and systems for video technology © 2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. 13 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
country	Singapore
collection	DR-NTU
language	English
description	The performance of automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of lips, tongue, and teeth, especially under noisy environment. In this paper, a novel approach for recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov models (HMMs) to build an AdaBoost-HMM classifier. The composite HMMs of the AdaBoost-HMM classifier are trained to cover different groups of training samples using the AdaBoost technique and the biased Baum–Welch training method. By combining the decisions of the component classifiers of the composite HMMs according to a novel probability synthesis rule, a more complex decision boundary is formulated than using the single HMM classifier. The method is applied to the recognition of the basic visual speech elements. Experimental results show that the AdaBoost-HMM classifier outperforms the traditional HMM classifier in accuracy, especially for visemes extracted from contexts.
format	Article
author	Foo, Say Wei Lian, Yong Dong, Liang
spellingShingle	Foo, Say Wei Lian, Yong Dong, Liang Recognition of visual speech elements using adaptively boosted hidden Markov models
author_facet	Foo, Say Wei Lian, Yong Dong, Liang
author_sort	Foo, Say Wei
title	Recognition of visual speech elements using adaptively boosted hidden Markov models
title_short	Recognition of visual speech elements using adaptively boosted hidden Markov models
title_full	Recognition of visual speech elements using adaptively boosted hidden Markov models
title_fullStr	Recognition of visual speech elements using adaptively boosted hidden Markov models
title_full_unstemmed	Recognition of visual speech elements using adaptively boosted hidden Markov models
title_sort	recognition of visual speech elements using adaptively boosted hidden markov models
publishDate	2009
url	https://hdl.handle.net/10356/91495 http://hdl.handle.net/10220/4584 http://sfxna09.hosted.exlibrisgroup.com:3410/ntu/sfxlcl3?sid=metalib:EVII&id=doi:10.1109/TCSVT.2004.826773&genre=&isbn=&issn=10518215&date=2004&volume=14&issue=5&spage=693&epage=705&aulast=Foo&aufirst=%20Say%20Wei&auinit=&title=IEEE%20Transactions%20on%20Circuits%20and%20Systems%20for%20Video%20Technology&atitle=Recognition%20of%20visual%20speech%20elements%20using%20adaptively%20boosted%20hidden%20markov%20models
_version_	1681036048188047360

Recognition of visual speech elements using adaptively boosted hidden Markov models

Similar Items