首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Large-Margin Estimation of Hidden Markov Models With Second-Order Cone Programming for Speech Recognition
【24h】

Large-Margin Estimation of Hidden Markov Models With Second-Order Cone Programming for Speech Recognition

机译:语音识别的二阶锥规划隐马尔可夫模型的大余量估计

获取原文
获取原文并翻译 | 示例
       

摘要

Large-margin estimation (LME) holds a property of good generalization on unseen test data. In our previous work, LME of HMMs has been successfully applied to some small-scale speech recognition tasks, using the SDP (semi-definite programming) technique. In this paper, we further extend the previous work by exploring a more efficient convex optimization method with the technique of second-order cone programming (SOCP). More specifically, we have studied and proposed several SOCP relaxation techniques to convert LME of HMMs in speech recognition into a standard SOCP problem so that LME can be solved with more efficient SOCP methods. The formulation is general enough to deal with various types of competing hypothesis space, such as N-best lists and word graphs. The proposed LME/SOCP approaches have been evaluated on two standard speech recognition tasks. The experimental results on the TIDIGITS task show that the SOCP method significantly outperforms the gradient descent method, and achieve comparable performance with SDP, but with 20-200 times faster speed, requiring less memory and computing resources. Furthermore, the proposed LME/SOCP method has also been successfully applied to a large vocabulary task using the Wall Street Journals (WSJ0) database. The WSJ-5k recognition results show that the proposed method yields better performance than the conventional approaches including maximum-likelihood estimation (MLE), maximum mutual information estimation (MMIE), and more recent boosted MMIE methods.
机译:大余量估计(LME)对看不见的测试数据具有良好的概括性。在我们之前的工作中,HMM的LME已使用SDP(半定编程)技术成功应用于一些小规模的语音识别任务。在本文中,我们通过使用二阶锥规划(SOCP)技术探索更有效的凸优化方法,进一步扩展了先前的工作。更具体地说,我们已经研究并提出了几种SOCP松弛技术,可以将语音识别中的HMM的LME转换为标准的SOCP问题,从而可以使用更有效的SOCP方法来解决LME。这种表述足以应付各种类型的竞争假设空间,例如N个最佳列表和单词图。所提出的LME / SOCP方法已经在两个标准语音识别任务上进行了评估。在TIDIGITS任务上进行的实验结果表明,SOCP方法明显优于梯度下降方法,并可以与SDP媲美,但速度要快20-200倍,所需的内存和计算资源也更少。此外,使用华尔街日报(WSJ0)数据库,所提出的LME / SOCP方法也已成功应用于大型词汇任务。 WSJ-5k识别结果表明,与包括最大似然估计(MLE),最大互信息估计(MMIE)和更新的增强MMIE方法的常规方法相比,该方法产生的性能更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号