Pattern Recognition Letters

Increasing the robustness of CNN acoustic models using autoregressive moving average spectrogram features and channel dropout



Abstract

Developing automatic speech recognition systems that are robust to mismatched and noisy channel conditions is a challenging problem, especially when the training and the test conditions are different. Here, we seek to increase the robustness of convolutional neural network (CNN) acoustic models under such circumstances by combining two methods. Firstly, we propose an improved version of input dropout, which exploits the special structure of the input time-frequency representation. Instead of just dropping out random 'pixels' of the spectrogram, the proposed channel dropout approach discards whole spectral channels. We expect that this dropout strategy will force the network to rely less on the whole spectrum, and make it more robust to channel mismatches and narrow-band noise. Secondly, we replace the standard mel-spectrogram input representation with the autoregressive moving average (ARMA) spectrogram, which was recently shown to outperform the former under mismatched train-test conditions. In our experiments on the Aurora-4 database, the proposed channel dropout method attained relative word error rate reductions of 16% with ARMA features (an absolute improvement of 3%), and 20% with FBANK features (an absolute improvement of 7%) over the baseline CNN, when using the clean training scenario. (C) 2017 Elsevier B.V. All rights reserved.
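The core idea of channel dropout described in the abstract — zeroing out entire frequency channels of the time-frequency input rather than independent "pixels" — can be sketched as follows. This is a minimal NumPy illustration under stated assumptions: the function name `channel_dropout`, the per-channel drop probability, and the layout (channels × frames) are illustrative choices, not the paper's exact implementation details (e.g., any rescaling of kept channels or train-only application is not specified here).

```python
import numpy as np

def channel_dropout(spectrogram, drop_prob=0.2, rng=None):
    """Zero out whole spectral channels of a time-frequency input.

    spectrogram: array of shape (n_channels, n_frames), e.g. a mel
                 or ARMA spectrogram with frequency channels as rows.
    drop_prob:   probability that each channel is dropped entirely.
    """
    rng = np.random.default_rng() if rng is None else rng
    n_channels = spectrogram.shape[0]
    # One Bernoulli draw per frequency channel, broadcast over all
    # frames, so a dropped channel is zeroed across the whole utterance
    # window rather than at scattered time-frequency points.
    keep = rng.random(n_channels) >= drop_prob
    return spectrogram * keep[:, None]

# Example: a 40-channel mel-style input with 100 frames.
spec = np.ones((40, 100))
dropped = channel_dropout(spec, drop_prob=0.25,
                          rng=np.random.default_rng(0))
```

Because each channel is either fully kept or fully zeroed, the network cannot rely on any single narrow frequency band, which is the stated motivation for robustness to channel mismatch and narrow-band noise.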


