首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Easy does it: Robust spectro-temporal many-stream ASR without fine tuning streams
【24h】

Easy does it: Robust spectro-temporal many-stream ASR without fine tuning streams

机译:易于做到:强大的光谱 - 时间多流ASR,没有微调流

获取原文
获取外文期刊封面目录资料

摘要

Previous work has shown that spectro-temporal features reduce the word error rate for automatic speech recognition under noisy conditions. These systems, however, required significant hand-tuning in order to determine which spectral and temporal modulations should be included in a particular stream. In this work, streams are split into one spectral and temporal modulation each and their posterior probabilities are combined once each stream is discriminatively trained via multilayer perceptron. We show that this combination structure performs as well or better than more elaborate methods in which multiple spectral and temporal modulations are hand-picked per stream. In addition, these type of features outperform standard noise-robust features such as the “Advanced Front End” features, whereas our hand-picked spectro-temporal features do not.
机译:以前的工作表明,光谱时间特征在嘈杂的条件下减少了自动语音识别的字错误率。然而,这些系统需要显着的手动调整,以确定应包括在特定流中的光谱和时间调制。在这项工作中,流被分成一个光谱和时间调制,并且一旦通过多层的Perceptron识别每个流,它们就会组合它们的后验概率。我们表明,该组合结构也表现不佳或更好地比更精细的方法,其中每条流拾取多种光谱和时间调制。此外,这些特征差异优于标准噪声强大的功能,如“先进的前端”功能,而我们的手工采摘的光谱 - 时间特征则不是。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号