首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >A comparison between deep neural nets and kernel acoustic models for speech recognition
【24h】

A comparison between deep neural nets and kernel acoustic models for speech recognition

机译:深度神经网络与核声学模型进行语音识别的比较

获取原文

摘要

We study large-scale kernel methods for acoustic modeling and compare to DNNs on performance metrics related to both acoustic modeling and recognition. Measuring perplexity and frame-level classification accuracy, kernel-based acoustic models are as effective as their DNN counterparts. However, on token-error-rates DNN models can be significantly better. We have discovered that this might be attributed to DNN's unique strength in reducing both the perplexity and the entropy of the predicted posterior probabilities. Motivated by our findings, we propose a new technique, entropy regularized perplexity, for model selection. This technique can noticeably improve the recognition performance of both types of models, and reduces the gap between them. While effective on Broadcast News, this technique could be also applicable to other tasks.
机译:我们研究了用于声学建模的大规模内核方法,并在与声学建模和识别相关的性能指标上与DNN进行了比较。基于内核的声学模型可以测量困惑度和帧级别的分类精度,其效果与DNN同类模型一样有效。但是,在令牌错误率方面,DNN模型可以明显更好。我们已经发现,这可能归因于DNN在减少预测后验概率的困惑和熵方面的独特优势。根据我们的发现,我们提出了一种新的技术,即熵正则化困惑度,用于模型选择。该技术可以显着提高两种类型的模型的识别性能,并缩小它们之间的差距。虽然对广播新闻有效,但该技术也可以应用于其他任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号