To improve speech recognition performance, many ways of augmenting or combining HMMs (Hidden Markov Models) with other models to build hybrid architectures have been proposed. The hybrid HMM/ANN (Hidden Markov Model / Artificial Neural Network) architecture is one of the most successful approaches. In this hybrid model, ANNs (often multilayer perceptron neural networks, MLPs) serve as HMM-state posterior estimators. Recently, Deep Belief Networks (DBNs) were introduced as a powerful new machine learning technique. DBNs are essentially MLPs with many hidden layers; however, whereas the weights of MLPs are typically initialized randomly, DBNs use a greedy layer-by-layer pretraining algorithm to initialize the network weights. This pretraining initialization step has led to successful applications of DBNs in areas such as handwriting recognition, 3-D object recognition, dimensionality reduction, and automatic speech recognition (ASR). To evaluate the effectiveness of the pretraining step that distinguishes DBNs from MLPs for ASR tasks, we conduct a comparative evaluation of the two systems on phone recognition with the TIMIT database. The effectiveness, advantages, and computational cost of each method are investigated and analyzed. We also show that the information generated by DBNs and MLPs is complementary: a consistent improvement is observed when the two systems are combined. In addition, we investigate the ability of the hybrid HMM/DBN system in the case where only a limited amount of labeled training data is available.
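To make the greedy layer-by-layer pretraining idea concrete, the following is a minimal sketch in which each layer of a deep network is pretrained as a Restricted Boltzmann Machine (RBM) using one-step contrastive divergence (CD-1), the standard DBN pretraining procedure. The layer sizes, learning rate, and toy data are illustrative assumptions, not details from the paper, and the supervised fine-tuning stage that would follow is omitted.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """A binary RBM trained with one-step contrastive divergence (CD-1)."""

    def __init__(self, n_visible, n_hidden, lr=0.1):
        # Small random weights; biases start at zero.
        self.W = 0.01 * rng.standard_normal((n_visible, n_hidden))
        self.b_v = np.zeros(n_visible)   # visible bias
        self.b_h = np.zeros(n_hidden)    # hidden bias
        self.lr = lr

    def hidden_probs(self, v):
        return sigmoid(v @ self.W + self.b_h)

    def cd1_step(self, v0):
        # Positive phase: sample hidden units given the data.
        h0_prob = self.hidden_probs(v0)
        h0 = (rng.random(h0_prob.shape) < h0_prob).astype(float)
        # Negative phase: one Gibbs step back to a reconstruction.
        v1_prob = sigmoid(h0 @ self.W.T + self.b_v)
        h1_prob = self.hidden_probs(v1_prob)
        # CD-1 update: difference of data and reconstruction statistics.
        n = v0.shape[0]
        self.W += self.lr * (v0.T @ h0_prob - v1_prob.T @ h1_prob) / n
        self.b_v += self.lr * (v0 - v1_prob).mean(axis=0)
        self.b_h += self.lr * (h0_prob - h1_prob).mean(axis=0)
        # Reconstruction error, a common (if rough) progress monitor.
        return float(((v0 - v1_prob) ** 2).mean())

# Toy binary data standing in for acoustic feature vectors.
data = (rng.random((200, 20)) < 0.3).astype(float)

# Greedy layer-wise pretraining: train each RBM on the previous layer's
# hidden activations, then keep its weights to initialize the
# corresponding layer of the deep network (fine-tuning not shown).
layer_sizes = [20, 16, 8]
inputs, pretrained_weights = data, []
for n_vis, n_hid in zip(layer_sizes[:-1], layer_sizes[1:]):
    rbm = RBM(n_vis, n_hid)
    errors = [rbm.cd1_step(inputs) for _ in range(50)]
    pretrained_weights.append(rbm.W)
    inputs = rbm.hidden_probs(inputs)  # feed activations to the next layer
    print(f"layer {n_vis}->{n_hid}: error {errors[0]:.4f} -> {errors[-1]:.4f}")
```

In the hybrid HMM/DBN setting, the pretrained weights would initialize the hidden layers of an MLP that is then fine-tuned with backpropagation to estimate HMM-state posteriors; a randomly initialized MLP skips the pretraining loop entirely, which is the difference the comparison in this work isolates.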