首页> 外文会议>Proceedings of the 6th International Conference on Speech Technology and Human-Computer Dialogue >Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion
【24h】

Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion

机译:通过分析声学模型的不确定性和混乱度,提高自动语音识别中的单词错误率并降低复杂度

获取原文

摘要

In this paper, a study about the uncertainty of the trained acoustic models and the confusion among these models is made in the context of speech recognition. The purpose is to find the most relevant voice features, hence the analysis is made on a per-feature basis. Model uncertainty is defined as a measure of feature distribution overlapping. A model is compared only to the models it is more similar to. Hence, confusion matrices are built from both feature distributions and recognition results. Next, the voice features are weighted according to their relevance in order to increase the discrimination among models, while relevance itself is deduced from the values of model uncertainty. Experimental results show that, by appropriate weighting, the recognition accuracy, in terms of Word Error Rate (WER), improves. Moreover, by removing the features with lower weights, the recognition accuracy is maintained, but the number of calculations is significantly reduced.
机译:本文在语音识别的背景下,对训练后的声学模型的不确定性以及这些模型之间的混淆进行了研究。目的是找到最相关的语音功能,因此将针对每个功能进行分析。模型不确定性定义为特征分布重叠的度量。仅将模型与与其更相似的模型进行比较。因此,混淆矩阵是根据特征分布和识别结果建立的。接下来,根据语音特征的相关性对语音特征进行加权,以增加模型之间的区分度,同时根据模型不确定性的值推导相关性本身。实验结果表明,通过适当的加权,就单词错误率(WER)而言,识别精度得以提高。而且,通过去除权重较低的特征,可以保持识别精度,但是计算数量却大大减少了。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号