Bayesian Learning of a Language Model from Continuous Speech

Graham NEUBIG; Masato MIMURA; Shinsuke MORI; Tatsuya KAWAHARA

Abstract

We propose a novel scheme to learn a language model (LM) for automatic speech recognition (ASR) directly from continuous speech. In the proposed method, we first generate phoneme lattices using an acoustic model with no linguistic constraints, then perform training over these phoneme lattices, simultaneously learning both lexical units and an LM. As a statistical framework for this learning problem, we use non-parametric Bayesian statistics, which make it possible to balance the learned model's complexity (such as the size of the learned vocabulary) and expressive power, and provide a principled learning algorithm through the use of Gibbs sampling. Implementation is performed using weighted finite state transducers (WFSTs), which allow for the simple handling of lattice input. Experimental results on natural, adult-directed speech demonstrate that LMs built using only continuous speech are able to significantly reduce ASR phoneme error rates. The proposed technique of joint Bayesian learning of lexical units and an LM over lattices is shown to significantly contribute to this improvement.
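The balance between vocabulary size and expressive power mentioned in the abstract can be illustrated with a small sketch. The abstract does not say which non-parametric prior the authors use, so the code below assumes a plain Dirichlet process (Chinese restaurant process) over a unigram lexicon, purely for illustration. The names `base_prob` and `dp_predictive`, the phoneme inventory size, the stop probability, and the concentration parameter `alpha` are all hypothetical and are not taken from the paper.

```python
from collections import Counter

def base_prob(word, n_phonemes=40, stop_prob=0.5):
    """Hypothetical base measure P0 over phoneme strings (an assumption):
    geometric length distribution times uniform phoneme choice."""
    k = len(word)
    return (1 - stop_prob) ** (k - 1) * stop_prob * (1.0 / n_phonemes) ** k

def dp_predictive(word, counts, total, alpha=1.0):
    """Predictive probability of `word` under a Dirichlet process prior:
    (count(word) + alpha * P0(word)) / (total + alpha)."""
    return (counts[word] + alpha * base_prob(word)) / (total + alpha)

# Toy lexicon: each word is a tuple of phoneme symbols, with counts
# standing in for the word tokens assigned in earlier sampling steps.
counts = Counter({("k", "ae", "t"): 5, ("d", "ao", "g"): 3})
total = sum(counts.values())

p_seen = dp_predictive(("k", "ae", "t"), counts, total)  # already in the lexicon
p_new = dp_predictive(("b", "er", "d"), counts, total)   # would add a new entry

print(f"seen word: {p_seen:.6f}")
print(f"new word:  {p_new:.3e}")
```

Under these assumptions a word already in the lexicon is far more probable than an unseen phoneme string of the same length, so the vocabulary grows only when the data support a new entry. A Gibbs sampler would repeatedly resample the segmentation of each utterance using probabilities of this kind.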

Bibliographic record

Source: IEICE Transactions on Information and Systems, 2012, No. 2, pp. 614-625 (12 pages)
Authors: Graham NEUBIG; Masato MIMURA; Shinsuke MORI; Tatsuya KAWAHARA
Affiliation: Graduate School of Informatics, Kyoto University, Kyoto-shi, 606-8501 Japan
Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
CLC classification: Radio electronics, telecommunications technology
Keywords: language modeling; automatic speech recognition; Bayesian learning; weighted finite state transducers
