首页> 外文会议>IEEE Workshop on Automatic Speech Recognition and Understanding >Training data selection for improving discriminative training of acoustic models
【24h】

Training data selection for improving discriminative training of acoustic models

机译:培养数据选择,以改善声学模型的鉴别培训

获取原文

摘要

This paper considers training data selection for discriminative training of acoustic models for broadcast news speech recognition. Three novel data selection approaches were proposed. First, the average phone accuracy over all hypothesized word sequences in the word lattice of a training utterance was utilized for utterancelevel data selection. Second, phone-level data selection based on the difference between the expected accuracy of a phone arc and the average phone accuracy of the word lattice was investigated. Finally, frame-level data selection based on the normalized frame-level entropy of Gaussian posterior probabilities obtained from the word lattice was explored. The underlying characteristics of the presented approaches were extensively investigated and their performance was verified by comparison with the standard discriminative training approaches. Experiments conducted on the Mandarin broadcast news collected in Taiwan shown that both phone- and frame-level data selection could achieve slight but consistent improvements over the baseline systems at lower training iterations.
机译:本文考虑了培训数据选择,了解广播新闻语音识别的声学模型的鉴别培训。提出了三种新型数据选择方法。首先,用于训练话语的单词晶格中的所有假设字序列的平均电话准确性用于对齐的数据选择。其次,基于电话弧的预期精度与单词晶格的平均电话精度之间的电话级数据选择。最后,探讨了基于从单词晶格中获得的高斯后级概率的归一化帧级熵的帧级数据选择。广泛调查所提出的方法的潜在特征,并通过与标准歧视性培训方法进行比较来验证其性能。在台湾收集的普通话广播新闻中进行的实验表明,在较低培训迭代的基线系统中,两种电话和帧级数据选择都可以实现轻微但一致的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号