Incorporating knowledge sources into statistical speech recognition

Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible. The authors...

全面介紹

Saved in:
書目詳細資料
Main Authors: Sakti, Sakriani, Markov, Konstantin, Nakamura, Satoshi, Minker, Wolfgang
格式: 圖書
語言:English
出版: Springer 2017
主題:
在線閱讀:http://repository.vnu.edu.vn/handle/VNU_123/30632
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Vietnam National University, Hanoi
語言: English
實物特徵
總結:Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible. The authors provide an efficient general framework for incorporating knowledge sources into state-of-the-art statistical ASR systems. This framework, which is called GFIKS (graphical framework to incorporate additional knowledge sources), was designed by utilizing the concept of the Bayesian network (BN) framework. This framework allows probabilistic relationships among different information sources to be learned, various kinds of knowledge sources to be incorporated, and a probabilistic function of the model to be formulated. Incorporating Knowledge Sources into Statistical Speech Recognition demonstrates how the statistical speech recognition system may incorporate additional information sources by utilizing GFIKS at different levels of ASR. The incorporation of various knowledge sources, including background noises, accent, gender and wide phonetic knowledge information, in modeling is discussed theoretically and analyzed experimentally.