首页> 外文期刊>ETRI journal >A New Distance Measure for a Variable-Sized Acoustic Model Based on MDL Technique
【24h】

A New Distance Measure for a Variable-Sized Acoustic Model Based on MDL Technique

机译:基于MDL技术的变尺寸声学模型的新型测距

获取原文
           

摘要

Embedding a large vocabulary speech recognition system in mobile devices requires a reduced acoustic model obtained by eliminating redundant model parameters. In conventional optimization methods based on the minimum description length (MDL) criterion, a binary Gaussian tree is built at each state of a hidden Markov model by iteratively finding and merging similar mixture components. An optimal subset of the tree nodes is then selected to generate a downsized acoustic model. To obtain a better binary Gaussian tree by improving the process of finding the most similar Gaussian components, this paper proposes a new distance measure that exploits the difference in likelihood values for cases before and after two components are combined. The mixture weight of Gaussian components is also introduced in the component merging step. Experimental results show that the proposed method outperforms MDL-based optimization using either a Kullback-Leibler (KL) divergence or weighted KL divergence measure. The proposed method could also reduce the acoustic model size by 50% with less than a 1.5% increase in error rate compared to a baseline system.
机译:在移动设备中嵌入大型词汇语音识别系统需要通过消除冗余模型参数而获得的简化声学模型。在基于最小描述长度(MDL)准则的常规优化方法中,通过迭代查找和合并相似的混合成分,在隐马尔可夫模型的每个状态下都建立了二元高斯树。然后选择树节点的最佳子集以生成缩小的声学模型。为了通过改进查找最相似的高斯分量的过程来获得更好的二叉高斯树,本文提出了一种新的距离度量,该度量利用了两个分量组合前后情况下似然值的差异。高斯成分的混合重量也被引入到成分合并步骤中。实验结果表明,所提出的方法优于使用Kullback-Leibler(KL)发散或加权KL发散度量的基于MDL的优化。与基线系统相比,所提出的方法还可以将声学模型的尺寸减少50%,而错误率的增加幅度不到1.5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号