首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Discriminative importance weighting of augmented training data for acoustic model training
【24h】

Discriminative importance weighting of augmented training data for acoustic model training

机译:用于声学模型培训的增强培训数据的判别重要性

获取原文

摘要

DNN based acoustic models require a large amount of training data. Parametric data augmentation techniques such as adding noise, reverberation, or changing the speech rate, are often employed to boost the dataset size and the ASR performance. The choice of augmentation techniques and the associated parameters has been handled heuristically so far. In this work we propose an algorithm to automatically weight data perturbed using a variety of augmentation techniques and/or parameters. The weights are learned in a discriminative fashion so as to minimize the frame error rate using the standard gradient descent algorithm in an iterative manner. Experiments were performed using the CHiME-3 dataset. Data augmentation was done by adding noise at different SNRs. A relative WER improvement of 15% was obtained with the proposed data weighting algorithm compared to the unweighted augmented dataset. Interestingly, the resulting distribution of SNRs in the weighted training set differs significantly from that of the test set.
机译:基于DNN的声学模型需要大量的训练数据。参数数据增强技术,例如添加噪声,混响或改变语音率,通常用于提高数据集大小和ASR性能。到目前为止,可以处理增强技术和相关参数的选择。在这项工作中,我们提出了一种算法来使用各种增强技术和/或参数自动地进行扰动数据。以判别方式学习权重,以便以迭代方式使用标准梯度下降算法最小化帧误差率。使用Chime-3数据集进行实验。通过添加不同SNR的噪声来完成数据增强。与未加权的增强数据集相比,通过所提出的数据加权算法获得15%的相对加速。有趣的是,加权训练集中的SNR的分布与测试集的SNR分布显着不同。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号