首页> 外文会议>Workshop on Automatic Speech Recognition and Understanding >LEARNING FILTER BANKS WITHIN A DEEP NEURAL NETWORK FRAMEWORK
【24h】

LEARNING FILTER BANKS WITHIN A DEEP NEURAL NETWORK FRAMEWORK

机译:学习深层神经网络框架内的过滤器银行

获取原文
获取外文期刊封面目录资料

摘要

Mel-filter banks are commonly used in speech recognition, as they are motivated from theory related to speech production and perception. While features derived from mel-filter banks are quite popular, we argue that this filter bank is not really an appropriate choice as it is not learned for the objective at hand, i.e. speech recognition. In this paper, we explore replacing the filter bank with a filter bank layer that is learned jointly with the rest of a deep neural network. Thus, the filter bank is learned to minimize cross-entropy, which is more closely tied to the speech recognition objective. On a 50-hour English Broadcast News task, we show that we can achieve a 5% relative improvement in word error rate (WER) using the filter bank learning approach, compared to having a fixed set of filters.
机译:熔融滤波器银行通常用于语音识别,因为它们具有与语音生产和感知有关的理论的动机。虽然来自熔融滤波器银行的功能非常受欢迎,但我们认为这个过滤器银行并不是一个适当的选择,因为它没有为手头的客观而学习,即语音识别。在本文中,我们探索用滤波器组替换滤波器组层,该滤波器银行层与深神经网络的其余部分共同学习。因此,学习滤波器组以最小化交叉熵,这更紧密地绑定到语音识别目标。在50小时的英语广播新闻任务中,我们表明,与具有固定的过滤器集相比,我们可以使用滤波器组学习方法实现单词错误率(WER)的相对改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号