首页> 外文会议>IEEE Workshop on Automatic Speech Recognition and Understanding >The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices
【24h】

The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices

机译:NTT CHiME-3系统:移动多麦克风设备的语音增强和识别方面的进展

获取原文

摘要

CHiME-3 is a research community challenge organised in 2015 to evaluate speech recognition systems for mobile multi-microphone devices used in noisy daily environments. This paper describes NTT's CHiME-3 system, which integrates advanced speech enhancement and recognition techniques. Newly developed techniques include the use of spectral masks for acoustic beam-steering vector estimation and acoustic modelling with deep convolutional neural networks based on the "network in network" concept. In addition to these improvements, our system has several key differences from the official baseline system. The differences include multi-microphone training, dereverberation, and cross adaptation of neural networks with different architectures. The impacts that these techniques have on recognition performance are investigated. By combining these advanced techniques, our system achieves a 3.45% development error rate and a 5.83% evaluation error rate. Three simpler systems are also developed to perform evaluations with constrained set-ups.
机译:CHiME-3是一个研究社区挑战赛,于2015年组织,旨在评估在嘈杂的日常环境中使用的移动式多麦克风设备的语音识别系统。本文介绍了NTT的CHiME-3系统,该系统集成了先进的语音增强和识别技术。新开发的技术包括使用频谱掩码进行声束转向矢量估计和基于“网络中的网络”概念的深层卷积神经网络进行声学建模。除了这些改进之外,我们的系统与官方基准系统还有几个主要区别。区别包括多麦克风训练,去混响和具有不同体系结构的神经网络的交叉适应。研究了这些技术对识别性能的影响。通过结合这些先进技术,我们的系统可实现3.45%的开发错误率和5.83%的评估错误率。还开发了三个更简单的系统来使用受限的设置执行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号