首页> 外文会议>International conference on spoken language processing >Minimum Risk Acoustic clustering for Multilingual Acoustic Model Combination
【24h】

Minimum Risk Acoustic clustering for Multilingual Acoustic Model Combination

机译:多语言声学模型组合的最小风险声学聚类

获取原文

摘要

In this paper we describe procedures for combining multiple acoustic models, obtained using traming corpora from different languages, in order to improve ASR performance i nlanguages for which large amounts of training data are not available. We treat these mdoels as multiple sources of information whose scores are conbined in a log-linear model to compute the hypothesis likelihood. The model combination can either be performed in a static way, with constant combination weights, or in a dynamic way, with parameters that can vary for different segments of a hypothesis. The aim is to optimize the parameters so as to achieve minimum word error rate. In order to achieve robust parameter estiamtion in the dynamic combination case, the parameters are defined to be piecewise constant on different phonetic classes that form a partition of the space of hypothesis segments. The partition is defined, using phonological knowledge, on segments that correspond to hypothesized phones. We examien different ways to define such a partition, including an automatic approach that gives a binary tree structured partion which tries to achieve the minimum WER with the minimum number of classes.
机译:在本文中,我们描述了使用来自不同语言的托管语料库获得的多个声学模型的组合程序,以改善ASR性能I Nlanguages,其中不可用培训数据大量训练数据。我们将这些MDOERS视为多个信息来源,其分数在Log-Linear模型中突出,以计算假设可能性。模型组合可以以静态方式进行,恒定的组合重量或以动态方式执行,其中参数可以因假设的不同段而变化。目的是优化参数,以实现最小字错误率。为了在动态组合箱中实现鲁棒参数estiamtion,参数被定义为在形成假设段空间分区的不同语音类上是分段常数。使用语音知识,在对应于假设手机的段上定义分区。我们考生不同的方法来定义这样的分区,包括一种自动方法,它给出了二进制树结构局的,它试图以最小数量的类实现最小的WER。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号