Learning from the master: Distilling cross-modal advanced knowledge for lip reading
Lip reading aims to predict the spoken sentences from silent lip videos. Due to the fact that such a vision task usually performs worse than its counterpart speech recognition, one potential scheme is to distill knowledge from a teacher pretrained by audio signals. However, the latent domain gap bet...
Saved in:
Main Authors: | REN, Sucheng, DU, Yong, LV, Jianming, HAN, Guoqiang, HE, Shengfeng |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8442 https://ink.library.smu.edu.sg/context/sis_research/article/9445/viewcontent/Ren_Learning_From_the_Master_Distilling_Cross_Modal_Advanced_Knowledge_for_Lip_CVPR_2021_paper.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
CROSS-MODALITY COMPLEMENTARITY FOR AUDIO-VISUAL SPEECH RECOGNITION
by: WANG JIADONG
Published: (2024) -
汉语情态助动词的主观性和主观化 = THE SUBJECTIVITY AND SUBJECTIFICATION OF MODAL AUXILIARIES IN CHINESE
by: 杨黎黎, et al.
Published: (2015) -
Epistemic modality in TED talks on education
by: Ton Nu, My Nhat, et al.
Published: (2019) -
Unifying text, tables, and images for multimodal question answering
by: LUO, Haohao, et al.
Published: (2023) -
Combining Speech with textual methods for arabic diacritization
by: AISHA SIDDIQA AZIM
Published: (2012)