首页> 外文期刊>Computer speech and language >Product of Gaussians for speech recognition
【24h】

Product of Gaussians for speech recognition

机译:高斯语音识别产品

获取原文
获取原文并翻译 | 示例
       

摘要

Recently, there has been interest in the use of classifiers based on the product of experts (PoE) framework. PoEs offer an alternative to the standard mixture of experts (MoE) framework. It may be viewed as examining the intersection of a series of experts, rather than the union as in the MoE framework. This paper presents a particular implementation of PoEs, the normalised product of Gaussians (PoG). Here, each expert is a Gaussian mixture model. In this work, the PoG model is presented within a hidden Markov model framework. This allows the classification of variable length data, such as speech data. Training and initialisation procedures are described for this PoG system. The relationship of the PoG system with other schemes, including covariance modeling schemes, is also discussed. In addition the scheme is shown to be related to a standard speech recognition approach, multiple stream systems. The PoG system performance is examined on an automatic speech recognition task, Switchboard. The performance is compared to standard Gaussian mixture systems and multiple stream systems.
机译:最近,人们对基于专家产品(PoE)框架的分类器的使用感兴趣。 PoE提供了标准专家混合(MoE)框架的替代方案。可以将其视为研究一系列专家的交集,而不是像MoE框架中那样结合工会。本文介绍了PoE的一种特定实现方式,它是高斯(PoG)的标准化产品。在这里,每个专家都是一个高斯混合模型。在这项工作中,在隐藏的马尔可夫模型框架内展示了PoG模型。这允许对可变长度数据(例如语音数据)进行分类。描述了此PoG系统的培训和初始化过程。还讨论了PoG系统与其他方案(包括协方差建模方案)的关系。另外,该方案显示为与标准语音识别方法,多流系统有关。在自动语音识别任务Switchboard上检查PoG系统的性能。将性能与标准高斯混合系统和多流系统进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号