【24h】

Augmented Context Features for Arabic Speech Recognition

机译:阿拉伯语语音识别的增强上下文特征

获取原文

摘要

We investigate different types of features for language modeling in Arabic automatic speech recognition. While much effort in language modeling research has been directed at designing better models or smoothing techniques for n-gram language models, in this paper we take the approach of augmenting the context in the n-gram model with different sources of information. We start by adding word class labels to the context. The word classes are automatically derived from un-annotated training data. As a contrast, we also experiment with POS tags which require a tagger trained on annotated data. An amalgam of these two methods uses class labels defined on word and POS tag combinations. Other context features include super-tags derived from the syntactic tree structure as well as semantic features derived from PropBank. Experiments on the DARPA GALE Arabic speech recognition task show that augmented context features often improve both perplexity and word error rate.
机译:我们研究阿拉伯自动语音识别中语言建模的不同类型的功能。尽管在语言建模研究上已进行了大量工作,旨在为n-gram语言模型设计更好的模型或平滑技术,但在本文中,我们采用了利用不同信息源来增强n-gram模型中上下文的方法。我们首先将单词类标签添加到上下文中。单词类别是从未注释的训练数据中自动得出的。相比之下,我们还尝试使用POS标签,该标签需要在带注释的数据上训练过的标记器。这两种方法的组合使用了在单词和POS标签组合上定义的类标签。其他上下文特征包括从语法树结构派生的超级标记以及从PropBank派生的语义特征。 DARPA GALE阿拉伯语语音识别任务的实验表明,增强的上下文功能通常可以提高困惑度和单词错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号