Using 3D deformable template trellis to describe the movement of the lip
Lip reading becomes a fascinating research area in multimedia since 1990’s. It is proven with many experiments that even small effort is made toward the incorporation of visual signal, the performance of a speech recognizer is improved compared with a purely acoustic recognizer. Such enhancement...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/91394 http://hdl.handle.net/10220/6037 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Lip reading becomes a fascinating research area in multimedia since 1990’s. It
is proven with many experiments that even small effort is made toward the
incorporation of visual signal, the performance of a speech recognizer is
improved compared with a purely acoustic recognizer. Such enhancement effect
is especially prominent under noisy environment such as in bus station, airport,
office and stock market. As a result, lip reading holds the promise to broaden
the range of applicability of speech recognition to unfavorable environments.
Most previous studies focus on tracking the movement of the lip during
speaking. In this paper, a novel approach for this purpose is also reported. By
applying 3D templates, together with probability trellis, we show that the
movement of the lip at various head positions can be properly tracked. |
---|