IEEE International Conference on Acoustics, Speech and Signal Processing

Improving deep neural networks for LVCSR using dropout and shrinking structure

Abstract

Recently, hybrid deep neural network and hidden Markov model (DNN/HMM) systems have achieved dramatic gains over the conventional GMM/HMM method on various large vocabulary continuous speech recognition (LVCSR) tasks. In this paper, we propose two new methods to further improve the hybrid DNN/HMM model: i) use dropout as a pre-conditioner (DAP) to initialize the DNN prior to back-propagation (BP) for better recognition accuracy; ii) employ a shrinking DNN structure (sDNN), with hidden layers decreasing in size from bottom to top, to reduce model size and expedite computation. The proposed DAP method is evaluated on a 70-hour Mandarin transcription (PSC) task and the 309-hour Switchboard (SWB) task. Compared with the traditional greedy layer-wise pre-trained DNN, it achieves about 10% and 6.8% relative recognition error reduction on the PSC and SWB tasks respectively. In addition, we evaluate sDNN, as well as its combination with DAP, on the SWB task. Experimental results show that these methods can reduce the model size to 45% of the original and cut training and test time by 55%, without losing recognition accuracy.
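To make the two ideas concrete, the following is a minimal, hypothetical PyTorch sketch (not the authors' implementation). make_sdnn builds a feed-forward network whose hidden layers shrink from bottom to top (the sDNN idea), and dap_then_bp first trains briefly with dropout enabled, acting as a pre-conditioner, before disabling dropout and continuing with plain back-propagation (the DAP idea). All layer widths, the dropout rate, the learning rate, the epoch counts, and the loader of (feature, senone-label) mini-batches are illustrative assumptions.

```python
# A minimal, hypothetical sketch of the two ideas (not the authors' code).
# Layer widths, dropout rate, learning rate, and epoch counts are assumptions.
import torch
import torch.nn as nn


def make_sdnn(input_dim, hidden_dims, num_targets, dropout_p=0.0):
    """Feed-forward DNN whose hidden layers shrink from bottom to top,
    e.g. hidden_dims = [2048, 1500, 1100, 800, 600] (illustrative widths)."""
    layers, prev = [], input_dim
    for width in hidden_dims:
        layers += [nn.Linear(prev, width), nn.Sigmoid()]
        if dropout_p > 0:
            layers.append(nn.Dropout(dropout_p))
        prev = width
    layers.append(nn.Linear(prev, num_targets))  # senone logits; softmax is in the loss
    return nn.Sequential(*layers)


def train(model, loader, epochs, lr=0.008):
    """Plain mini-batch SGD back-propagation over (features, senone-label) pairs."""
    model.train()
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for feats, targets in loader:
            opt.zero_grad()
            loss_fn(model(feats), targets).backward()
            opt.step()


def dap_then_bp(input_dim, hidden_dims, num_targets, loader):
    """Dropout-as-pre-conditioner (DAP): a short dropout pass initializes the
    weights, then dropout is switched off and training continues with plain BP."""
    model = make_sdnn(input_dim, hidden_dims, num_targets, dropout_p=0.2)
    train(model, loader, epochs=1)          # pre-conditioning pass with dropout on
    for m in model.modules():
        if isinstance(m, nn.Dropout):
            m.p = 0.0                       # disable dropout for the final BP phase
    train(model, loader, epochs=10)         # standard back-propagation fine-tuning
    return model
```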
