首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Adaptation to New Microphones Using Artificial Neural Networks With Trainable Activation Functions
【24h】

Adaptation to New Microphones Using Artificial Neural Networks With Trainable Activation Functions

机译:使用具有可训练激活功能的人工神经网络适应新的麦克风

获取原文
获取原文并翻译 | 示例
           

摘要

Model adaptation is a key technique that enables a modern automatic speech recognition (ASR) system to adjust its parameters, using a small amount of enrolment data, to the nuances in the speech spectrum due to microphone mismatch in the training and test data. In this brief, we investigate four different adaptation schemes for connectionist (also known as hybrid) ASR systems that learn microphone-specific hidden unit contributions, given some adaptation material. This solution is made possible adopting one of the following schemes: 1) the use of Hermite activation functions; 2) the introduction of bias and slope parameters in the sigmoid activation functions; 3) the injection of an amplitude parameter specific for each sigmoid unit; or 4) the combination of 2) and 3). Such a simple yet effective solution allows the adapted model to be stored in a small-sized storage space, a highly desirable property of adaptation algorithms for deep neural networks that are suitable for large-scale online deployment. Experimental results indicate that the investigated approaches reduce word error rates on the standard Spoke 6 task of the Wall Street Journal corpus compared with unadapted ASR systems. Moreover, the proposed adaptation schemes all perform better than simple multicondition training and comparable favorably against conventional linear regression-based approaches while using up to 15 orders of magnitude fewer parameters. The proposed adaptation strategies are also effective when a single adaptation sentence is available.
机译:模型自适应是一项关键技术,该技术使现代自动语音识别(ASR)系统能够使用少量的注册数据,将其参数调整为由于训练和测试数据中的麦克风不匹配而导致的语音频谱差异。在本简介中,我们研究了针对连接器(也称为混合)ASR系统的四种不同的适应方案,这些方案可以学习麦克风特定的隐藏单元贡献,并提供一些适应材料。采用以下方案之一可以使该解决方案成为可能:1)使用Hermite激活功能; 2)在S形激活函数中引入偏置和斜率参数; 3)注入特定于每个S形单位的幅度参数;或4)2)和3)的组合。这种简单而有效的解决方案允许将适应的模型存储在小型存储空间中,这是适用于大规模在线部署的深度神经网络的适应算法的高度期望的特性。实验结果表明,与不适用的ASR系统相比,所研究的方法降低了《华尔街日报》语料库标准Spoke 6任务上的单词错误率。此外,所提出的自适应方案都比简单的多条件训练更好,并且与传统的基于线性回归的方法相比具有可比性,同时使用的参数最多减少了15个数量级。当单个适应语句可用时,所提出的适应策略也是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号