首页> 外文会议>Asia-Pacific Signal and Information Processing Association Annual Summit and Conference >Comparison and Combination of Multilayer Perceptrons and Deep Belief Networks in Hybrid Automatic Speech Recognition Systems
【24h】

Comparison and Combination of Multilayer Perceptrons and Deep Belief Networks in Hybrid Automatic Speech Recognition Systems

机译:混合自动语音识别系统中多层感知器与深信度网络的比较与组合

获取原文
获取外文期刊封面目录资料

摘要

To improve the speech recognition performance, many ways to augment or combine HMMs (Hidden Markov Models) with other models to build hybrid architectures have been proposed. The hybrid HMM/ANN (Hidden Markov Model / Artificial Neural Network) architecture is one of the most successful approaches. In this hybrid model, ANNs (which are often multilayer perceptron neural networks - MLPs) are used as an HMM-state posterior estimator. Recently, Deep Belief Networks (DBNs) were introduced as a newly powerful machine learning technique. Generally, DBNs are MLPs with many hidden layers, however, while weights of MLPs are often initialized randomly, DBNs use a greedy layer-by-layer pretraining algorithm to initialize the network weights. This pretraining initialization step has resulted in successful realizations of DBNs for various applications such as handwriting recognition, 3-D object recognition, dimensionality reduction and automatic speech recognition (ASR) tasks. To evaluate the effectiveness of the pre-initialization steps that characterize DBNs from MLPs for ASR tasks, we conduct a comparative evaluation between the two systems on phone recognition for the TIMIT database. The effectiveness, advantages and computational cost of each method will be investigated and analyzed. We also show that the information generated by DBNs and MLPs are complementary,where a consistent improvement is observed when the two systems are combined. In addition, we investigate the ability of the hybrid HMM/DBN system in the case only a limited amount of labeled training data is available.
机译:为了提高语音识别性能,已经提出了许多将HMM(隐藏马尔可夫模型)与其他模型进行增强或组合以构建混合架构的方法。混合HMM / ANN(隐马尔可夫模型/人工神经网络)体系结构是最成功的方法之一。在这种混合模型中,人工神经网络(通常是多层感知器神经网络-MLP)被用作HMM状态后验估计器。最近,深度信仰网络(DBN)被引入作为一种新的强大的机器学习技术。通常,DBN是具有许多隐藏层的MLP,但是,虽然MLP的权重通常是随机初始化的,但DBN使用贪婪的逐层预训练算法来初始化网络权重。此预训练初始化步骤已成功实现了针对各种应用程序的DBN,例如手写识别,3-D对象识别,降维和自动语音识别(ASR)任务。为了评估表征MLP中用于ASR任务的DBN的预初始化步骤的有效性,我们在TIMIT数据库的电话识别两个系统之间进行了比较评估。将研究和分析每种方法的有效性,优势和计算成本。我们还表明,由DBN和MLP生成的信息是互补的,当将两个系统结合在一起时,可以观察到一致的改进。此外,在只有有限数量的标记训练数据可用的情况下,我们研究了混合HMM / DBN系统的功能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号