International Conference on Speech and Computer

A Perceptually Inspired Data Augmentation Method for Noise Robust CNN Acoustic Models



Abstract

Here, we present a data augmentation method that improves the robustness of convolutional neural network-based speech recognizers to additive noise. The proposed technique has its roots in the input dropout method because it discards a subset of the input features. However, instead of doing this in a completely random fashion, we introduce two simple heuristics that select the less reliable components of the spectrum of the speech signal as candidates for dropout. The first heuristic retains spectro-temporal maxima, while the second is based on a crude estimation of spectral dominance. The selected components are always retained, while the dropout step discards or retains the unselected ones in a probabilistic manner. Due to the randomness involved in dropout, the whole process may be interpreted as a data augmentation method that perturbs the data by creating new data instances from the existing ones on the fly. We evaluated the method on the Aurora-4 corpus just using the clean training data set, and we got relative word error rate reductions between 22% and 25%.
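The first heuristic described above can be sketched in a few lines: retain every spectro-temporal local maximum of the input spectrogram unconditionally, and subject the remaining bins to ordinary random dropout. The function name, the neighborhood size, and the use of a log-mel spectrogram are illustrative assumptions, not details taken from the paper.

```python
import numpy as np
from scipy.ndimage import maximum_filter


def perceptual_dropout(spec, p=0.5, size=3, rng=None):
    """Hypothetical sketch of maxima-preserving input dropout.

    spec : 2-D array (frequency bins x time frames), e.g. a log-mel spectrogram.
    p    : dropout probability for the non-maximum bins.
    size : neighborhood size used to detect spectro-temporal maxima.
    """
    rng = np.random.default_rng() if rng is None else rng
    # A bin is a spectro-temporal maximum if it equals the maximum
    # of its local (size x size) neighborhood.
    is_max = spec >= maximum_filter(spec, size=size)
    # Non-maximum bins survive only with probability 1 - p.
    keep = rng.random(spec.shape) >= p
    mask = is_max | keep          # selected maxima are always retained
    return spec * mask


# Each call produces a different random mask, so repeatedly applying this
# on the fly to the same clean utterance yields new training instances.
```

Because the mask is resampled on every call, the same clean utterance yields a different perturbed instance each epoch, which is what lets the authors interpret the scheme as data augmentation rather than a fixed preprocessing step.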


