首页> 外文会议>Workshop on Automatic Speech Recognition and Understanding >COMBINING STOCHASTIC AVERAGE GRADIENT AND HESSIAN-FREE OPTIMIZATION FOR SEQUENCE TRAINING OF DEEP NEURAL NETWORKS
【24h】

COMBINING STOCHASTIC AVERAGE GRADIENT AND HESSIAN-FREE OPTIMIZATION FOR SEQUENCE TRAINING OF DEEP NEURAL NETWORKS

机译:结合随机平均梯度和Hessian-USE优化对深神经网络的序列训练

获取原文

摘要

Minimum phone error (MPE) training of deep neural networks (DNN) is an effective technique for reducing word error rate of automatic speech recognition tasks. This training is often carried out using a Hessian-free (HF) quasi-Newton approach, although other methods such as stochastic gradient descent have also been applied successfully. In this paper we present a novel stochastic approach to HF sequence training inspired by recently proposed stochastic average gradient (SAG) method. SAG reuses gradient information from past updates, and consequently simulates the presence of more training data than is really observed for each model update. We extend SAG by dynamically weighting the contribution of previous gradients, and by combining it to a stochastic HF optimization. We term the resulting procedure DSAG-HF. Experimental results for training DNNs on 1500h of audio data show that compared to baseline HF training, DSAG-HF leads to better held-out MPE loss after each model parameter update, and converges to an overall better loss value. Furthermore, since each update in DSAG-HF takes place over smaller amount of data, this procedure converges in about half the time as baseline HF sequence training.
机译:深度神经网络(DNN)的最低电话错误(MPE)培训是减少自动语音识别任务的字错误率的有效技术。该培训通常使用Hessianian的(HF)Quasi-Newton方法进行,尽管其他方法如随机梯度下降也已成功应用。在本文中,我们提出了一种新的随机序列训练的随机序列训练,其受到最近提出的随机平均梯度(SAG)方法的启发。 SAG从过去的更新中重用梯度信息,从而模拟了每个模型更新的更多训练数据的存在。通过动态加权先前梯度的贡献,以及将其与随机HF优化的贡献进行延伸,我们扩展了SAG。我们术语术语结果程序DSAG-HF。在1500H的音频数据上训练DNN的实验结果表明,与基线HF训练相比,DSAG-HF在每个模型参数更新后更好地保持了更好的MPE损耗,并收敛到整体更好的损失值。此外,由于DSAG-HF中的每个更新发生在更小的数据上,因此该过程将大约一半的时间收敛为基线HF序列训练。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号