A review of audio-visual speech recognition


Bibliographic Details
Main Authors: Thum, Wei Seong; M. Z., Ibrahim
Format: Article
Language: English
Published: UTeM 2018
Online Access: http://umpir.ump.edu.my/id/eprint/21637/1/A%20review%20of%20audio-visual%20speech%20recognition.pdf
http://umpir.ump.edu.my/id/eprint/21637/
http://journal.utem.edu.my/index.php/jtec/article/view/3573
Institution: Universiti Malaysia Pahang
Description
Summary: Speech is the most important tool of interaction among human beings. This has inspired researchers to study speech recognition further and to develop computer systems that are able to integrate and understand human speech. However, acoustically noisy environments can heavily contaminate audio speech and degrade overall recognition performance. Audio-Visual Speech Recognition (AVSR) is therefore designed to overcome this problem by utilising visual images, which are unaffected by acoustic noise. The aim of this paper is to discuss the structure of AVSR, including the front-end processes, the audio-visual data corpora used, recent works and accuracy estimation methods.