首页> 外文会议> >Topic adaptation for language modeling using unnormalized exponential models
【24h】

Topic adaptation for language modeling using unnormalized exponential models

机译:使用非标准化指数模型进行语言建模的主题适应

获取原文

摘要

We present novel techniques for performing topic adaptation on an n-gram language model. Given training text labeled with topic information, we automatically identify the most relevant topics for new text. We adapt our language model toward these topics using an exponential model, by adjusting the probabilities in our model to agree with those found in the topical subset of the training data. For efficiency, we do not normalize the model; that is, we do not require that the "probabilities" in the language model sum to 1. With these techniques, we were able to achieve a modest reduction in speech recognition word-error rate in the broadcast news domain.
机译:我们提出了一种新颖的技术,用于在n-gram语言模型上执行主题适应。给定带有主题信息标签的培训文本,我们将自动为新文本标识最相关的主题。通过调整模型中的概率以使其与训练数据的主题子集中的概率相符,我们使用指数模型使语言模型适应这些主题。为了提高效率,我们不对模型进行归一化;也就是说,我们不需要语言模型中的“概率”之和为1。使用这些技术,我们能够在广播新闻领域中实现语音识别单词错误率的适度降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号