首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >A class-based language model for large-vocabulary speech recognition extracted from part-of-speech statistics
【24h】

A class-based language model for large-vocabulary speech recognition extracted from part-of-speech statistics

机译:基于类语言模型的语音语音识别从词性统计中提取

获取原文

摘要

A novel approach is presented to class-based language modeling based on part-of-speech statistics. It uses a deterministic word-to-class mapping, which handles words with alternative part-of-speech assignments through the use of ambiguity classes. The predictive power of word-based language models and the generalization capability of class-based language models are combined using both linear interpolation and word-to-class backoff, and both methods are evaluated. Since each word belongs to oneprecisely ambiguity class, an exact word-to-class backoff model can easily be constructed. Empirical evaluations on large-vocabulary speech-recognition tasks show perplexity improvements and significant reductions in word error-rate.
机译:基于词性统计数据的基于类语言建模的一种新方法。它使用了确定性的Word-to Class映射,通过使用歧义类处理具有替代词性分配的单词。基于Word的语言模型的预测力和基于类语言模型的泛化能力,使用线性插值和级别的退避组合,并且评估了两种方法。由于每个单词属于彼此歧义类,因此可以轻松构建精确的单位退避模型。大词汇表演讲任务的实证评估显示出困惑的改进和重大减少单词误差率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号