首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >On DNN posterior probability combination in multi-stream speech recognition for reverberant environments
【24h】

On DNN posterior probability combination in multi-stream speech recognition for reverberant environments

机译:混响环境下多流语音识别中的DNN后验概率组合

获取原文

摘要

A multi-stream framework with deep neural network (DNN) classifiers has been applied in this paper to improve automatic speech recognition (ASR) performance in environments with different reverberation characteristics. We propose a room parameter estimation model to determine the stream weights for DNN posterior probability combination with the aim of obtaining reliable log-likelihoods for decoding. The model is implemented by training a multi-layer perceptron to distinguish between various reverberant environments. The method is tested in known and unknown environments against approaches based on inverse entropy and autoencoders, with average relative word error rate improvements of 46% and 29%, respectively, when performing multi-stream ASR in different reverberant situations.
机译:本文应用了具有深度神经网络(DNN)分类器的多流框架,以改善具有不同混响特性的环境中的自动语音识别(ASR)性能。我们提出一种房间参数估计模型,以确定DNN后验概率组合的流权重,目的是获得可靠的对数似然解码。该模型是通过训练多层感知器来区分各种混响环境而实现的。该方法在已知和未知环境中针对基于逆熵和自动编码器的方法进行了测试,当在不同的混响情况下执行多流ASR时,平均相对字错误率分别提高了46%和29%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号