A comparison between deep neural nets and kernel acoustic models for speech recognition

机译：深度神经网络与核声学模型进行语音识别的比较

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study large-scale kernel methods for acoustic modeling and compare to DNNs on performance metrics related to both acoustic modeling and recognition. Measuring perplexity and frame-level classification accuracy, kernel-based acoustic models are as effective as their DNN counterparts. However, on token-error-rates DNN models can be significantly better. We have discovered that this might be attributed to DNN's unique strength in reducing both the perplexity and the entropy of the predicted posterior probabilities. Motivated by our findings, we propose a new technique, entropy regularized perplexity, for model selection. This technique can noticeably improve the recognition performance of both types of models, and reduces the gap between them. While effective on Broadcast News, this technique could be also applicable to other tasks.

机译：我们研究了用于声学建模的大规模内核方法，并在与声学建模和识别相关的性能指标上与DNN进行了比较。基于内核的声学模型可以测量困惑度和帧级别的分类精度，其效果与DNN同类模型一样有效。但是，在令牌错误率方面，DNN模型可以明显更好。我们已经发现，这可能归因于DNN在减少预测后验概率的困惑和熵方面的独特优势。根据我们的发现，我们提出了一种新的技术，即熵正则化困惑度，用于模型选择。该技术可以显着提高两种类型的模型的识别性能，并缩小它们之间的差距。虽然对广播新闻有效，但该技术也可以应用于其他任务。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2016年|5070-5074|共5页
会议地点
作者
Zhiyun Lu; Dong Quo; Alireza Bagheri Garakani; Kuan Liu; Avner May; Aurlien Bellet; Linxi Fan; Michael Collins; Brian Kingsbury; Michael Picheny; Fei Sha;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
acoustic models; automatic speech recognition; deep neural networks; kernel methods;

机译：声学模型;自动语音识别;深度神经网络;核方法;

相似文献

外文文献
中文文献
专利

1. Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies [J] . Cui Xiaodong, Zhang Wei, Finkler Ulrich, IEEE Signal Processing Magazine . 2020,第3期

机译：自动语音识别深神经网络声学模型的分布式训练：当前训练策略的比较
2. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
3. Acoustic landmarks contain more information about the phone string than other frames for automatic speech recognition with deep neural network acoustic model [J] . He Di, Lim Boon Pang, Yang Xuesong, The Journal of the Acoustical Society of America . 2018,第6aPta1期

机译：声学地标包含与具有深度神经网络声学模型的自动语音识别的其他帧的更多信息
4. A comparison between deep neural nets and kernel acoustic models for speech recognition [C] . Zhiyun Lu, Dong Quo, Alireza Bagheri Garakani, IEEE International Conference on Acoustics, Speech and Signal Processing . 2016

机译：语音识别深神经网络与内核声学模型的比较
5. Dysarthric Speech Recognition and Offline Handwriting Recognition using Deep Neural Networks. [D] . Pillai, Suhas Balkrishna. 2017

机译：使用深度神经网络的表情异常语音识别和离线手写识别。
6. Improving Robustness of Deep Neural Network Acoustic Models via Speech Separation and Joint Adaptive Training [O] . Arun Narayanan, DeLiang Wang -1

机译：通过语音分离和联合自适应训练提高深度神经网络声学模型的鲁棒性
7. A Comparison Between Deep Neural Nets and Kernel Acoustic Models for Speech Recognition [O] . Lu, Zhiyun, Guo, Dong, Garakani, Alireza Bagheri, 2016

机译：深度神经网络与核声学模型在语音识别中的比较

A comparison between deep neural nets and kernel acoustic models for speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅