
A Perceptually Inspired Data Augmentation Method for Noise Robust CNN Acoustic Models


Abstract

Here, we present a data augmentation method that improves the robustness of convolutional neural network-based speech recognizers to additive noise. The proposed technique has its roots in the input dropout method, as it discards a subset of the input features. However, instead of doing this in a completely random fashion, we introduce two simple heuristics that select the less reliable components of the speech spectrum as candidates for dropout. The first heuristic retains spectro-temporal maxima, while the second is based on a crude estimate of spectral dominance. The selected components are always retained, while the dropout step discards or retains the unselected ones in a probabilistic manner. Owing to the randomness involved in dropout, the whole process can be interpreted as a data augmentation method that perturbs the data by creating new training instances from the existing ones on the fly. We evaluated the method on the Aurora-4 corpus using only the clean training set, and obtained relative word error rate reductions between 22% and 25%.
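To make the procedure concrete, the sketch below shows one possible reading of the described augmentation applied to a log-mel spectrogram: bins that are spectro-temporal local maxima, or that lie within a fixed margin of their frame's maximum (a crude dominance estimate), are always kept, and the remaining bins are dropped at random. The function name, neighbourhood size, dominance margin, and dropout probability are illustrative assumptions, not values taken from the paper.

# Minimal sketch of the perceptually inspired dropout augmentation,
# assuming a log-mel spectrogram of shape (frames, bands).
# All parameter values below are illustrative, not from the paper.
import numpy as np
from scipy.ndimage import maximum_filter

def perceptual_dropout(spec, drop_prob=0.5, neighborhood=(3, 3),
                       dominance_margin=6.0, rng=None):
    """Randomly discard less reliable spectro-temporal bins, always keeping
    (a) local spectro-temporal maxima and
    (b) bins close to the loudest bin of their frame (crude dominance)."""
    rng = np.random.default_rng() if rng is None else rng

    # Heuristic 1: retain bins that are local maxima in a small
    # time-frequency neighbourhood.
    local_max = spec == maximum_filter(spec, size=neighborhood)

    # Heuristic 2: crude spectral dominance -- retain bins within a fixed
    # margin of the per-frame maximum.
    frame_max = spec.max(axis=1, keepdims=True)
    dominant = spec >= frame_max - dominance_margin

    keep = local_max | dominant

    # Unselected bins are discarded with probability drop_prob;
    # selected bins are always retained.
    drop = (~keep) & (rng.random(spec.shape) < drop_prob)
    out = spec.copy()
    out[drop] = spec.min()  # floor value standing in for a discarded bin
    return out

# Example: perturb one utterance on the fly during training.
if __name__ == "__main__":
    spectrogram = np.random.randn(200, 40)  # 200 frames, 40 mel bands
    augmented = perceptual_dropout(spectrogram, drop_prob=0.5)

Because the dropout mask is redrawn on every pass over the data, each epoch effectively sees a new perturbed copy of every utterance, which is what makes the procedure act as data augmentation rather than a fixed preprocessing step.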
