Application of multimodal speech recognition based on deep neural networks in interpretation teaching

In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the adva...

Full description

Saved in:
Bibliographic Details
Main Authors: Nai, Ruihua, Hassan, Hanita
Format: Conference or Workshop Item
Published: 2023
Subjects:
Online Access:http://eprints.utm.my/108403/
http://dx.doi.org/10.1117/12.3011751
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.108403
record_format eprints
spelling my.utm.1084032024-10-28T10:05:55Z http://eprints.utm.my/108403/ Application of multimodal speech recognition based on deep neural networks in interpretation teaching Nai, Ruihua Hassan, Hanita L Education (General) P Philology. Linguistics In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the advantages of speech corpus denoising based on acoustic expert feature extraction and training algorithm, convolution decomposition method and interpretation element analysis are described. Finally, through the experimental verification, it is proved that the recognition system can effectively improve students’ interpretation efficiency and accuracy, and the accuracy rate is more than 93%. 2023 Conference or Workshop Item PeerReviewed Nai, Ruihua and Hassan, Hanita (2023) Application of multimodal speech recognition based on deep neural networks in interpretation teaching. In: 3rd International Conference on Artificial Intelligence, Virtual Reality, and Visualization, AIVRV 2023, 7 July 2023 - 9 July 2023, Chongqing, China. http://dx.doi.org/10.1117/12.3011751
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic L Education (General)
P Philology. Linguistics
spellingShingle L Education (General)
P Philology. Linguistics
Nai, Ruihua
Hassan, Hanita
Application of multimodal speech recognition based on deep neural networks in interpretation teaching
description In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the advantages of speech corpus denoising based on acoustic expert feature extraction and training algorithm, convolution decomposition method and interpretation element analysis are described. Finally, through the experimental verification, it is proved that the recognition system can effectively improve students’ interpretation efficiency and accuracy, and the accuracy rate is more than 93%.
format Conference or Workshop Item
author Nai, Ruihua
Hassan, Hanita
author_facet Nai, Ruihua
Hassan, Hanita
author_sort Nai, Ruihua
title Application of multimodal speech recognition based on deep neural networks in interpretation teaching
title_short Application of multimodal speech recognition based on deep neural networks in interpretation teaching
title_full Application of multimodal speech recognition based on deep neural networks in interpretation teaching
title_fullStr Application of multimodal speech recognition based on deep neural networks in interpretation teaching
title_full_unstemmed Application of multimodal speech recognition based on deep neural networks in interpretation teaching
title_sort application of multimodal speech recognition based on deep neural networks in interpretation teaching
publishDate 2023
url http://eprints.utm.my/108403/
http://dx.doi.org/10.1117/12.3011751
_version_ 1814932886311141376