Large-Margin Estimation of Hidden Markov Models With Second-Order Cone Programming for Speech Recognition

Dalei Wu; Yan Yin; Hui Jiang

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Large-Margin Estimation of Hidden Markov Models With Second-Order Cone Programming for Speech Recognition

【24h】

Large-Margin Estimation of Hidden Markov Models With Second-Order Cone Programming for Speech Recognition

机译：语音识别的二阶锥规划隐马尔可夫模型的大余量估计

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Large-margin estimation (LME) holds a property of good generalization on unseen test data. In our previous work, LME of HMMs has been successfully applied to some small-scale speech recognition tasks, using the SDP (semi-definite programming) technique. In this paper, we further extend the previous work by exploring a more efficient convex optimization method with the technique of second-order cone programming (SOCP). More specifically, we have studied and proposed several SOCP relaxation techniques to convert LME of HMMs in speech recognition into a standard SOCP problem so that LME can be solved with more efficient SOCP methods. The formulation is general enough to deal with various types of competing hypothesis space, such as N-best lists and word graphs. The proposed LME/SOCP approaches have been evaluated on two standard speech recognition tasks. The experimental results on the TIDIGITS task show that the SOCP method significantly outperforms the gradient descent method, and achieve comparable performance with SDP, but with 20-200 times faster speed, requiring less memory and computing resources. Furthermore, the proposed LME/SOCP method has also been successfully applied to a large vocabulary task using the Wall Street Journals (WSJ0) database. The WSJ-5k recognition results show that the proposed method yields better performance than the conventional approaches including maximum-likelihood estimation (MLE), maximum mutual information estimation (MMIE), and more recent boosted MMIE methods.

机译：大余量估计（LME）对看不见的测试数据具有良好的概括性。在我们之前的工作中，HMM的LME已使用SDP（半定编程）技术成功应用于一些小规模的语音识别任务。在本文中，我们通过使用二阶锥规划（SOCP）技术探索更有效的凸优化方法，进一步扩展了先前的工作。更具体地说，我们已经研究并提出了几种SOCP松弛技术，可以将语音识别中的HMM的LME转换为标准的SOCP问题，从而可以使用更有效的SOCP方法来解决LME。这种表述足以应付各种类型的竞争假设空间，例如N个最佳列表和单词图。所提出的LME / SOCP方法已经在两个标准语音识别任务上进行了评估。在TIDIGITS任务上进行的实验结果表明，SOCP方法明显优于梯度下降方法，并可以与SDP媲美，但速度要快20-200倍，所需的内存和计算资源也更少。此外，使用华尔街日报（WSJ0）数据库，所提出的LME / SOCP方法也已成功应用于大型词汇任务。 WSJ-5k识别结果表明，与包括最大似然估计（MLE），最大互信息估计（MMIE）和更新的增强MMIE方法的常规方法相比，该方法产生的性能更好。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2011年第6期|p.1652-1664|共13页
作者
Dalei Wu; Yan Yin; Hui Jiang;
展开▼
作者单位

Dept. of Comput. Sci. & Eng., York Univ., Toronto, ON, Canada;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Convex optimization; convex relaxation; discriminative training (DT); large-margin estimation (LME); second-order cone programming (SOCP);

机译：凸优化;凸松弛;判别训练（DT）;大幅度估计（LME）;二阶锥规划（SOCP）;

相似文献

外文文献
中文文献
专利

1. Segmental corrective training for hidden Markov model parameter estimation in speech recognition [J] . Kim H.R., Lee H.S. Electronics Letters . 1991,第18期

机译：语音识别中隐马尔可夫模型参数估计的分段校正训练
2. Hybrid approach to speech recognition using hidden Markov models and Markov chains [J] . Dai J. IEE Proceedings. Part K . 1994,第5期

机译：使用隐马尔可夫模型和马尔可夫链的混合语音识别方法
3. Speech Silicon: An FPGA Architecture for Real-Time Hidden Markov-Model-Based Speech Recognition [J] . Jeffrey Schuster, Kshitij Gupta, Raymond Hoare, EURASIP journal on embedded systems . 2006,第1期

机译：语音芯片：基于实时隐马尔可夫模型的语音识别的FPGA架构
4. Boosted Large-Margin Estimation of hidden Markov models for speech recognition [C] . Xu Shuangyin, Qu Dan IEEE International Conference on Signal Processing . 2012

机译：用于语音识别的隐马尔可夫模型的增强大余量估计
5. Hidden Markov models, maximum mutual information estimation, and the speech recognition problem [D] . Normandin, Yves. 1991

机译：隐藏的马尔可夫模型，最大互信息估计和语音识别问题
6. Assessment of Dysarthria Using One-Word Speech Recognition with Hidden Markov Models [O] . Seung Hak Lee, Minje Kim, Han Gil Seo, 2019

机译：使用隐马尔可夫模型的单字语音识别评估构音障碍
7. Large-Margin Discriminative Training of Hidden Markov Models for Speech Recognition [O] . Dong Yu, Li Deng 2009

机译：隐藏语音的隐马尔可夫模型的大幅度判别训练
8. Robust Speech Recognition Using Hidden Markov Models: Overview of a Research Program. [R] . Weinstein, C. J., Paul, D. B., Lippmann, R. P. 1990

机译：使用隐马尔可夫模型的鲁棒语音识别：研究计划概述。

Large-Margin Estimation of Hidden Markov Models With Second-Order Cone Programming for Speech Recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅