Incorporating knowledge sources into statistical speech recognition

Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible. The authors...

Full description

Saved in:
Bibliographic Details
Main Authors: Sakti, Sakriani, Markov, Konstantin, Nakamura, Satoshi, Minker, Wolfgang
Format: Book
Language:English
Published: Springer 2017
Subjects:
Online Access:http://repository.vnu.edu.vn/handle/VNU_123/30632
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Vietnam National University, Hanoi
Language: English
id oai:112.137.131.14:VNU_123-30632
record_format dspace
spelling oai:112.137.131.14:VNU_123-306322020-07-07T04:06:34Z Incorporating knowledge sources into statistical speech recognition Sakti, Sakriani Markov, Konstantin Nakamura, Satoshi Minker, Wolfgang Earth and Environmental Science ; Automatic speech recognition. 006.454 Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible. The authors provide an efficient general framework for incorporating knowledge sources into state-of-the-art statistical ASR systems. This framework, which is called GFIKS (graphical framework to incorporate additional knowledge sources), was designed by utilizing the concept of the Bayesian network (BN) framework. This framework allows probabilistic relationships among different information sources to be learned, various kinds of knowledge sources to be incorporated, and a probabilistic function of the model to be formulated. Incorporating Knowledge Sources into Statistical Speech Recognition demonstrates how the statistical speech recognition system may incorporate additional information sources by utilizing GFIKS at different levels of ASR. The incorporation of various knowledge sources, including background noises, accent, gender and wide phonetic knowledge information, in modeling is discussed theoretically and analyzed experimentally. 2017-04-18T08:07:37Z 2017-04-18T08:07:37Z 2009 Book 978-0-387-85829-6 http://repository.vnu.edu.vn/handle/VNU_123/30632 en 207 p. application/pdf Springer
institution Vietnam National University, Hanoi
building VNU Library & Information Center
country Vietnam
collection VNU Digital Repository
language English
topic Earth and Environmental Science ; Automatic speech recognition.
006.454
spellingShingle Earth and Environmental Science ; Automatic speech recognition.
006.454
Sakti, Sakriani
Markov, Konstantin
Nakamura, Satoshi
Minker, Wolfgang
Incorporating knowledge sources into statistical speech recognition
description Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible. The authors provide an efficient general framework for incorporating knowledge sources into state-of-the-art statistical ASR systems. This framework, which is called GFIKS (graphical framework to incorporate additional knowledge sources), was designed by utilizing the concept of the Bayesian network (BN) framework. This framework allows probabilistic relationships among different information sources to be learned, various kinds of knowledge sources to be incorporated, and a probabilistic function of the model to be formulated. Incorporating Knowledge Sources into Statistical Speech Recognition demonstrates how the statistical speech recognition system may incorporate additional information sources by utilizing GFIKS at different levels of ASR. The incorporation of various knowledge sources, including background noises, accent, gender and wide phonetic knowledge information, in modeling is discussed theoretically and analyzed experimentally.
format Book
author Sakti, Sakriani
Markov, Konstantin
Nakamura, Satoshi
Minker, Wolfgang
author_facet Sakti, Sakriani
Markov, Konstantin
Nakamura, Satoshi
Minker, Wolfgang
author_sort Sakti, Sakriani
title Incorporating knowledge sources into statistical speech recognition
title_short Incorporating knowledge sources into statistical speech recognition
title_full Incorporating knowledge sources into statistical speech recognition
title_fullStr Incorporating knowledge sources into statistical speech recognition
title_full_unstemmed Incorporating knowledge sources into statistical speech recognition
title_sort incorporating knowledge sources into statistical speech recognition
publisher Springer
publishDate 2017
url http://repository.vnu.edu.vn/handle/VNU_123/30632
_version_ 1680965113350193152