Application of multimodal speech recognition based on deep neural networks in interpretation teaching
In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the adva...
Saved in:
Main Authors: | , |
---|---|
Format: | Conference or Workshop Item |
Published: |
2023
|
Subjects: | |
Online Access: | http://eprints.utm.my/108403/ http://dx.doi.org/10.1117/12.3011751 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Malaysia |
id |
my.utm.108403 |
---|---|
record_format |
eprints |
spelling |
my.utm.1084032024-10-28T10:05:55Z http://eprints.utm.my/108403/ Application of multimodal speech recognition based on deep neural networks in interpretation teaching Nai, Ruihua Hassan, Hanita L Education (General) P Philology. Linguistics In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the advantages of speech corpus denoising based on acoustic expert feature extraction and training algorithm, convolution decomposition method and interpretation element analysis are described. Finally, through the experimental verification, it is proved that the recognition system can effectively improve students’ interpretation efficiency and accuracy, and the accuracy rate is more than 93%. 2023 Conference or Workshop Item PeerReviewed Nai, Ruihua and Hassan, Hanita (2023) Application of multimodal speech recognition based on deep neural networks in interpretation teaching. In: 3rd International Conference on Artificial Intelligence, Virtual Reality, and Visualization, AIVRV 2023, 7 July 2023 - 9 July 2023, Chongqing, China. http://dx.doi.org/10.1117/12.3011751 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
L Education (General) P Philology. Linguistics |
spellingShingle |
L Education (General) P Philology. Linguistics Nai, Ruihua Hassan, Hanita Application of multimodal speech recognition based on deep neural networks in interpretation teaching |
description |
In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the advantages of speech corpus denoising based on acoustic expert feature extraction and training algorithm, convolution decomposition method and interpretation element analysis are described. Finally, through the experimental verification, it is proved that the recognition system can effectively improve students’ interpretation efficiency and accuracy, and the accuracy rate is more than 93%. |
format |
Conference or Workshop Item |
author |
Nai, Ruihua Hassan, Hanita |
author_facet |
Nai, Ruihua Hassan, Hanita |
author_sort |
Nai, Ruihua |
title |
Application of multimodal speech recognition based on deep neural networks in interpretation teaching |
title_short |
Application of multimodal speech recognition based on deep neural networks in interpretation teaching |
title_full |
Application of multimodal speech recognition based on deep neural networks in interpretation teaching |
title_fullStr |
Application of multimodal speech recognition based on deep neural networks in interpretation teaching |
title_full_unstemmed |
Application of multimodal speech recognition based on deep neural networks in interpretation teaching |
title_sort |
application of multimodal speech recognition based on deep neural networks in interpretation teaching |
publishDate |
2023 |
url |
http://eprints.utm.my/108403/ http://dx.doi.org/10.1117/12.3011751 |
_version_ |
1814932886311141376 |