IEEE International Conference on Acoustics, Speech and Signal Processing

Integrating Gaussian mixtures into deep neural networks: Softmax layer with hidden variables



Abstract

In the hybrid approach, neural network outputs directly serve as hidden Markov model (HMM) state posterior probability estimates. In contrast, in the tandem approach the neural network output is used as input features to improve classic Gaussian mixture model (GMM) based emission probability estimates. This paper shows that the GMM can be easily integrated into the deep neural network framework. By exploiting its equivalence with the log-linear mixture model (LMM), a GMM can be transformed into a large softmax layer followed by a summation pooling layer. Theoretical and experimental results indicate that a jointly trained and optimally chosen GMM with bottleneck tandem features cannot perform worse than a hybrid model. Thus, the question "hybrid vs. tandem" simplifies to optimizing the output layer of a neural network. Speech recognition experiments are carried out on a broadcast news and conversations task using up to 12 feed-forward hidden layers with sigmoid and rectified linear unit activation functions. The evaluation of the LMM layer shows recognition gains over the classic softmax output.
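To make the softmax-plus-pooling construction concrete, below is a minimal sketch of how a GMM state posterior reduces to one large softmax over (state, component) pairs followed by summation pooling, assuming a covariance matrix shared across all components (the condition under which the GMM becomes log-linear). NumPy and all names here (lmm_posteriors, num_states, num_components, W, b) are illustrative assumptions, not the paper's implementation.

import numpy as np

def lmm_posteriors(x, W, b, num_states, num_components):
    # One logit per (state, component) pair: a single large softmax layer.
    logits = x @ W + b
    logits = logits - logits.max()      # numerical stability
    probs = np.exp(logits)
    probs = probs / probs.sum()         # softmax over all (s, c) pairs
    # Summation pooling over the hidden component variable c yields p(s | x).
    return probs.reshape(num_states, num_components).sum(axis=1)

# With a shared covariance Sigma, a GMM with state priors p(s), mixture
# weights w_sc and means mu_sc reduces exactly to this form:
#   W[:, (s,c)] = inv(Sigma) @ mu_sc
#   b[(s,c)]    = log p(s) + log w_sc - 0.5 * mu_sc @ inv(Sigma) @ mu_sc
# (the quadratic term -0.5 * x @ inv(Sigma) @ x cancels in the softmax).

# Illustrative usage: 3 HMM states, 2 mixture components, 4-dim features.
rng = np.random.default_rng(0)
x = rng.normal(size=4)
W = rng.normal(size=(4, 6))
b = rng.normal(size=6)
print(lmm_posteriors(x, W, b, 3, 2))    # sums to 1 over the 3 states

Because the pooled softmax is just another differentiable output layer, such a GMM/LMM can be trained jointly with the preceding hidden layers by backpropagation, which is what allows the paper to compare it directly against a classic softmax output.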
