...
【24h】

Improvement of N-gram language models using accent phrase boundaries

机译:使用重音短语边界改进N-gram语言模型

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Current continuous speech recognition systems make much use of segmental features but little use of prosodic features. This paper proposes a novel method to integrate prosodic boundary information into N-gram-based language modeling. In this method, two types of language sub-models are built. One characterizes word transitions crossing accent phrase boundaries and the other not crossing the boundaries. To realize these two sub-models directly from a speech corpus, its size should be comparable to a text corpus used for N-gram model training. However, the preparation of such a large speech corpus is not realistic. To solve this problem, we focus upon transition of words in terms of their part-of-speech (POS), and differences in FOS transition crossing and not crossing the boundaries are used to generate the two sub-models. Through experiments, the proposed model showed 11% perplexity reduction given the correct boundary position, and 8% reduction with the automatically extracted boundaries. Even when test speech samples were spoken by another speaker than the speaker used in characterizing the POS transitions, 6% reduction was observed.
机译:当前的连续语音识别系统大量使用分段特征,但是很少使用韵律特征。本文提出了一种将韵律边界信息集成到基于N-gram的语言建模中的新方法。在这种方法中,建立了两种类型的语言子模型。一个特征是跨越重音词组边界的单词过渡,而另一个不跨越边界。为了直接从语音语料库中实现这两个子模型,其大小应与用于N-gram模型训练的文本语料库相当。但是,准备这么大的语音语料库是不现实的。为了解决这个问题,我们将重点放在词的词性(POS)方面,并且使用FOS过渡跨越和不跨越边界的差异来生成两个子模型。通过实验,所提出的模型显示出在正确边界位置下的困惑度降低了11%,在自动提取边界的情况下降低了8%。即使当测试语音样本由表征POS转换的说话者以外的其他说话者讲话时,也观察到6%的减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号