IEICE Transactions on Information and Systems

Bayesian Learning of a Language Model from Continuous Speech



Abstract

We propose a novel scheme to learn a language model (LM) for automatic speech recognition (ASR) directly from continuous speech. In the proposed method, we first generate phoneme lattices using an acoustic model with no linguistic constraints, then perform training over these phoneme lattices, simultaneously learning both lexical units and an LM. As a statistical framework for this learning problem, we use non-parametric Bayesian statistics, which make it possible to balance the learned model's complexity (such as the size of the learned vocabulary) and expressive power, and provide a principled learning algorithm through the use of Gibbs sampling. Implementation is performed using weighted finite state transducers (WFSTs), which allow for the simple handling of lattice input. Experimental results on natural, adult-directed speech demonstrate that LMs built using only continuous speech are able to significantly reduce ASR phoneme error rates. The proposed technique of joint Bayesian learning of lexical units and an LM over lattices is shown to significantly contribute to this improvement.
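
The abstract describes joint learning of lexical units and a language model by Gibbs sampling over phoneme lattices, implemented with WFSTs. The sketch below illustrates only the core idea in a heavily simplified setting: a collapsed Gibbs sampler that segments a single 1-best phoneme string under a Dirichlet-process unigram model with a geometric-length base distribution. The model choice, the hyperparameters (ALPHA, P_PHONE, P_END), and all names are assumptions made for illustration; the paper's actual method operates on lattices via WFST composition with a non-parametric Bayesian LM, which this toy code does not implement.

```python
import random
from collections import defaultdict

# Hyperparameters (illustrative assumptions, not values from the paper).
ALPHA = 1.0          # DP concentration: larger values favour a bigger learned vocabulary
P_PHONE = 1.0 / 40   # uniform base probability per phoneme (assumed 40-symbol inventory)
P_END = 0.5          # geometric word-length termination probability in the base measure

def base_prob(word):
    """Base-measure probability of a word: geometric length x uniform phonemes."""
    n = len(word)
    return P_END * ((1 - P_END) ** (n - 1)) * (P_PHONE ** n)

class UnigramDP:
    """Chinese-restaurant-process view of a Dirichlet-process unigram LM."""
    def __init__(self):
        self.counts = defaultdict(int)
        self.total = 0

    def prob(self, word):
        return (self.counts[word] + ALPHA * base_prob(word)) / (self.total + ALPHA)

    def add(self, word):
        self.counts[word] += 1
        self.total += 1

    def remove(self, word):
        self.counts[word] -= 1
        self.total -= 1

def words_from(phones, boundaries):
    """Read off the word sequence implied by the current boundary variables."""
    words, start = [], 0
    for i, b in enumerate(boundaries):
        if b:
            words.append(''.join(phones[start:i + 1]))
            start = i + 1
    words.append(''.join(phones[start:]))
    return words

def gibbs_segment(phones, iters=200, seed=0):
    """Sample word boundaries for one phoneme string by collapsed Gibbs sampling."""
    rng = random.Random(seed)
    n = len(phones)
    boundaries = [rng.random() < 0.5 for _ in range(n - 1)]
    lm = UnigramDP()
    for w in words_from(phones, boundaries):
        lm.add(w)

    for _ in range(iters):
        for i in range(n - 1):
            # Words adjacent to the candidate boundary after position i.
            left = i
            while left > 0 and not boundaries[left - 1]:
                left -= 1
            right = i + 1
            while right < n - 1 and not boundaries[right]:
                right += 1
            w_left = ''.join(phones[left:i + 1])
            w_right = ''.join(phones[i + 1:right + 1])
            w_merged = w_left + w_right

            # Remove the affected words so the rest of the data is the conditioning set.
            if boundaries[i]:
                lm.remove(w_left)
                lm.remove(w_right)
            else:
                lm.remove(w_merged)

            # Compare "boundary" vs. "no boundary" hypotheses under the CRP.
            p_split = lm.prob(w_left)
            lm.add(w_left)                  # w_right is scored given w_left already seated
            p_split *= lm.prob(w_right)
            lm.remove(w_left)
            p_merge = lm.prob(w_merged)

            boundaries[i] = rng.random() < p_split / (p_split + p_merge)
            if boundaries[i]:
                lm.add(w_left)
                lm.add(w_right)
            else:
                lm.add(w_merged)

    return words_from(phones, boundaries), lm

if __name__ == '__main__':
    # Toy phoneme stream: repeated substrings should be discovered as lexical units.
    utterance = list("badogubadodoguba" * 4)
    segmentation, lm = gibbs_segment(utterance)
    print(segmentation)
```

In the method the abstract describes, the same kind of sampling is carried out over phoneme-lattice paths via WFST composition rather than over a fixed 1-best string, so recognition uncertainty is preserved during learning.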
