Discriminative importance weighting of augmented training data for acoustic model training


Abstract

DNN-based acoustic models require a large amount of training data. Parametric data augmentation techniques, such as adding noise or reverberation or changing the speech rate, are often employed to increase the dataset size and improve ASR performance. So far, the choice of augmentation techniques and their associated parameters has been handled heuristically. In this work, we propose an algorithm to automatically weight data perturbed using a variety of augmentation techniques and/or parameters. The weights are learned discriminatively, using standard gradient descent in an iterative manner, so as to minimize the frame error rate. Experiments were performed on the CHiME-3 dataset, with data augmentation done by adding noise at different SNRs. The proposed data weighting algorithm yielded a relative WER improvement of 15% compared to the unweighted augmented dataset. Interestingly, the resulting distribution of SNRs in the weighted training set differs significantly from that of the test set.
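The two ingredients named in the abstract, adding noise at a chosen SNR and weighting the augmented data during training, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `mix_at_snr` and `weighted_frame_cross_entropy`, the one-weight-per-augmentation-condition layout, and the use of frame-level cross-entropy as the training loss are all assumptions made for illustration, and the step that learns the weights is not shown because the abstract does not specify it.

```python
import numpy as np


def mix_at_snr(speech, noise, snr_db):
    """Add `noise` to `speech` after scaling it to the requested SNR (in dB)."""
    speech = np.asarray(speech, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[: len(speech)]

    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    # From SNR_dB = 10 * log10(P_speech / P_noise), solve for the noise power.
    target_noise_power = speech_power / (10.0 ** (snr_db / 10.0))
    gain = np.sqrt(target_noise_power / noise_power)
    return speech + gain * noise


def weighted_frame_cross_entropy(log_probs, labels, condition_ids, weights):
    """Weighted mean of per-frame negative log-likelihoods.

    log_probs:     (frames, classes) log-posteriors from an acoustic model
    labels:        (frames,) target class index of each frame
    condition_ids: (frames,) index of the augmentation condition of each frame
                   (hypothetical layout, e.g. one condition per SNR)
    weights:       (conditions,) importance weight of each condition
    """
    log_probs = np.asarray(log_probs, dtype=np.float64)
    labels = np.asarray(labels)
    frame_weights = np.asarray(weights, dtype=np.float64)[np.asarray(condition_ids)]
    frame_nll = -log_probs[np.arange(len(labels)), labels]
    return np.sum(frame_weights * frame_nll) / np.sum(frame_weights)
```

With all weights equal, the loss reduces to the ordinary unweighted mean over frames, which corresponds to the unweighted augmented baseline mentioned in the abstract. Setting a condition's weight to zero removes its frames from the loss entirely.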


Bibliographic record

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2017 | pp. 4606-5264 | 5 pages
Authors
Sunit Sivasankaran; Emmanuel Vincent; Irina Illina
Original format: PDF
Chinese Library Classification: TN912-53
Keywords
ASR; data augmentation; feature simulation; DNN; CHiME


Similar literature


1. Training data selection for improving discriminative training of acoustic models [J]. Berlin Chen, Shih-Hung Liu, Fang-Hui Chu. Pattern Recognition Letters, 2009, No. 13.
2. Semi-Supervised Acoustic Model Training by Discriminative Data Selection From Multiple ASR Systems' Hypotheses [J]. Sheng Li, Yuya Akita, Tatsuya Kawahara. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016, No. 9.
3. Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training [J]. Sheng Li, Yuya Akita, Tatsuya Kawahara. IEICE Transactions on Information and Systems, 2015, No. 8.
4. Discriminative importance weighting of augmented training data for acoustic model training [C]. Sunit Sivasankaran, Emmanuel Vincent, Irina Illina. IEEE International Conference on Acoustics, Speech and Signal Processing, 2017.
5. Optimal generative and discriminative acoustic model training for speech recognition [D]. Neil Joshi. 2009.
6. Cue-specific effects of categorization training on the relative weighting of acoustic cues to consonant voicing in English [O]. Alexander L. Francis, Natalya Kaganovich, Courtney Driscoll-Huber.
7. Training Data Selection for Discriminative Training of Acoustic Models [O]. 朱芳輝 (Fang-Hui Chu). 2011.
