首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Single-channel mixed speech recognition using deep neural networks
【24h】

Single-channel mixed speech recognition using deep neural networks

机译:使用深度神经网络的单通道混合语音识别

获取原文

摘要

In this work, we study the problem of single-channel mixed speech recognition using deep neural networks (DNNs). Using a multi-style training strategy on artificially mixed speech data, we investigate several different training setups that enable the DNN to generalize to corresponding similar patterns in the test data. We also introduce a WFST-based two-talker decoder to work with the trained DNNs. Experiments on the 2006 speech separation and recognition challenge task demonstrate that the proposed DNN-based system has remarkable noise robustness to the interference of a competing speaker. The best setup of our proposed systems achieves an overall WER of 19.7% which improves upon the results obtained by the state-of-the-art IBM superhuman system by 1.9% absolute, with fewer assumptions and lower computational complexity.
机译:在这项工作中,我们使用深神经网络(DNN)研究单通道混合语音识别的问题。在人工混合语音数据上使用多种式培训策略,我们调查几种不同的训练设置,使DNN能够概括到测试数据中的相应类似模式。我们还介绍了一个基于WFST的双讲话者解码器,可以使用培训的DNN。 2006年演讲分离和识别挑战任务的实验表明,拟议的基于DNN的系统对竞争扬声器的干扰具有显着的噪声鲁棒性。我们所提出的系统的最佳设置实现了19.7%的整体增长,这提高了由最先进的IBM超人系统获得的结果1.9%,具有较少的假设和更低的计算复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号