...
首页> 外文期刊>IEEE transactions on audio, speech and language processing >Minimum phone error training of precision matrix models
【24h】

Minimum phone error training of precision matrix models

机译:精确矩阵模型的最小电话错误训练

获取原文
获取原文并翻译 | 示例
           

摘要

Gaussian mixture models (GMMs) are commonly used as the output density function for large-vocabulary continuous speech recognition (LVCSR) systems. A standard problem when using multivariate GMMs to classify data is how to accurately represent the correlations in the feature vector. Full covariance matrices yield a good model, but dramatically increase the number of model parameters. Hence, diagonal covariance matrices are commonly used. Structured precision matrix approximations provide an alternative, flexible, and compact representation. Schemes in this category include the extended maximum likelihood linear transform and subspace for precision and mean models. This paper examines how these precision matrix models can be discriminatively trained and used on state-of-the-art speech recognition tasks. In particular, the use of the minimum phone error criterion is investigated. Implementation issues associated with building LVCSR systems are also addressed. These models are evaluated and compared using large vocabulary continuous telephone speech and broadcast news English tasks.
机译:高斯混合模型(GMM)通常用作大词汇量连续语音识别(LVCSR)系统的输出密度函数。使用多元GMM对数据进行分类时的一个标准问题是如何准确表示特征向量中的相关性。完整的协方差矩阵可生成一个好的模型,但会大大增加模型参数的数量。因此,通常使用对角协方差矩阵。结构化的精确矩阵近似提供了一种替代,灵活和紧凑的表示形式。此类方案包括扩展的最大似然线性变换和用于精确度和均值模型的子空间。本文探讨了如何对这些精度矩阵模型进行有区别的训练,并将其用于最新的语音识别任务。特别是,研究了最小电话错误准则的使用。还解决了与构建LVCSR系统相关的实施问题。使用大词汇量连续电话语音和广播新闻英语任务对这些模型进行评估和比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号