首页> 外文会议>International conference on spoken language processing >Optimization of Units for Continuous-Digit Recognition Task
【24h】

Optimization of Units for Continuous-Digit Recognition Task

机译:优化连续数字识别任务的单位

获取原文

摘要

The choice of units, sub-word, is generally based on the size of the vocabulary and the maount of training data. In this work, we have introduced new constraints on the units: 1) they should contain sufficient statistics of the features and 2) they should contain sufficient statistics of the vocabulary. This led to minimization of two cost functions, first based o nthe confusion between the features and the units and the second based on the confusion between the units and the words. We minimized first cost function by forming broad phone classes that were less confusing among themselves than the phones. The second cost function was minimized by coding the wond-specific phone sequences. On the continuous digit recgnition task, the broad classes performed worse than the phones. The word-specific phone securences however significantly improved the performance over both the phones and the whole-word units. In this paper we ciscuss the new constraints, our specific implementation of the cost functions, and the corresponding recognition performance.
机译:单位,子字的选择通常基于词汇的大小和培训数据的千万。在这项工作中,我们对单位引入了新的约束:1)它们应包含特征的充分统计数据,2)它们应包含词汇量的充分统计数据。这导致最小化了两个成本函数,首先基于特征和单位之间的混淆,并基于单位与单词之间的混淆。我们通过在自己之间形成令人困惑的广泛的电话课程来最小化第一个成本函数。通过编码本特定的电话序列来最小化第二成本函数。在连续的数字重新预测任务上,广泛的类比手机更糟糕。然而,单词特定的电话证券显着提高了手机和整个单位的性能。在本文中,我们将新的制约因素,我们的具体实施成本函数以及相应的识别性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号