首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Deep-Structured Hidden Conditional Random Fields for Phonetic Recognition
【24h】

Deep-Structured Hidden Conditional Random Fields for Phonetic Recognition

机译:用于语音识别的深层结构隐藏条件随机场

获取原文

摘要

We extend our earlier work on deep-structured conditional random field (DCRF) and develop deep-structured hidden conditional random field (DHCRF). We investigate the use of this new sequential deep-learning model for phonetic recognition. DHCRF is a hierarchical model in which the final layer is a hidden conditional random field (HCRF) and the intermediate layers are zero-th-order conditional random fields (CRFs). Parameter estimation and sequence inference in the DHCRF are developed in this work. They are carried out layer by layer so that the time complexity is linear to the number of layers. In the DHCRF, the training label is available only at the final layer and the state boundary is unknown. This difficulty is addressed by using unsupervised learning for the intermediate layers and lattice-based supervised learning for the final layer. Experiments on the standard TIMIT phone recognition task show small performance improvement of a three-layer DHCRF over a two-layer DHCRF; both are significantly better than the single-layer DHCRF and are superior to the discriminatively trained tri-phone hidden Markov model (HMM) using identical input features.
机译:我们扩展了对深结构条件随机场(DCRF)的早期工作,并开发了深结构隐藏条件随机场(DHCRF)。我们研究了这种新的顺序深度学习模型在语音识别中的使用。 DHCRF是一个层次模型,其中最后一层是隐藏条件随机场(HCRF),中间层是零阶条件随机场(CRF)。在这项工作中开发了DHCRF中的参数估计和序列推断。它们是逐层执行的,因此时间复杂度与层数成线性关系。在DHCRF中,训练标签仅在最后一层可用,并且状态边界未知。通过对中间层使用无监督学习,对最后一层使用基于网格的监督学习,可以解决此难题。标准TIMIT电话识别任务的实验表明,与两层DHCRF相比,三层DHCRF的性能有所提高。两者均明显优于单层DHCRF,并且优于使用相同输入功能的经过区别训练的三音机隐藏马尔可夫模型(HMM)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号