Deep-Structured Hidden Conditional Random Fields for Phonetic Recognition


Abstract

We extend our earlier work on the deep-structured conditional random field (DCRF) and develop the deep-structured hidden conditional random field (DHCRF). We investigate the use of this new sequential deep-learning model for phonetic recognition. The DHCRF is a hierarchical model in which the final layer is a hidden conditional random field (HCRF) and the intermediate layers are zeroth-order conditional random fields (CRFs). Parameter estimation and sequence inference in the DHCRF are developed in this work. Both are carried out layer by layer, so the time complexity is linear in the number of layers. In the DHCRF, the training label is available only at the final layer and the state boundaries are unknown. This difficulty is addressed by using unsupervised learning for the intermediate layers and lattice-based supervised learning for the final layer. Experiments on the standard TIMIT phone recognition task show a small performance improvement of a three-layer DHCRF over a two-layer DHCRF; both are significantly better than the single-layer DHCRF and are superior to the discriminatively trained triphone hidden Markov model (HMM) using identical input features.
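The layer-by-layer structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a zeroth-order CRF layer reduces to a per-frame softmax over states (no transition terms between frames), and that each intermediate layer passes its frame posteriors, appended to the original features, to the next layer; the abstract does not specify this wiring. The function names and toy weights are hypothetical, and the final HCRF layer and all training are omitted.

```python
import math

def softmax(scores):
    # Numerically stable softmax over one frame's state scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def zeroth_order_layer(frames, weights, biases):
    # Per-frame posteriors p(y_t | x_t); frames are scored independently,
    # which is what "zeroth-order" is taken to mean in this sketch.
    posteriors = []
    for x in frames:
        scores = [sum(w * v for w, v in zip(row, x)) + b
                  for row, b in zip(weights, biases)]
        posteriors.append(softmax(scores))
    return posteriors

def stack_layers(frames, layers):
    # One pass per layer, so the cost grows linearly with the layer count.
    current = frames
    for weights, biases in layers:
        post = zeroth_order_layer(current, weights, biases)
        # Assumption: the next layer sees the original features plus
        # the posteriors produced by the current layer.
        current = [list(x) + p for x, p in zip(frames, post)]
    return current

# Toy input: three frames of 2-dimensional features, two states per layer.
frames = [[0.5, -1.0], [1.5, 0.2], [-0.3, 0.8]]
layer1 = ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
layer2 = ([[0.5, -0.5, 1.0, 0.0], [-0.5, 0.5, 0.0, 1.0]], [0.0, 0.0])
features = stack_layers(frames, [layer1, layer2])
```

Under these assumptions, `features` holds one vector per frame: the two original features followed by the two posteriors from the last intermediate layer. In a full model these would be the inputs to the final HCRF layer.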


Bibliographic details

Source: Annual Conference of the International Speech Communication Association; INTERSPEECH 2010 | 2011 | pp. 2986-2989 | 4 pages
Authors: Dong Yu; Li Deng
Original format: PDF
Chinese Library Classification: Communications
Keywords: hidden conditional random field; conditional random field; deep structure; phone recognition; TIMIT


Similar literature


1. Sequential Labeling Using Deep-Structured Conditional Random Fields [J]. Yu D., Wang S., Deng L. IEEE Journal of Selected Topics in Signal Processing. 2010, No. 6
2. Human Activity Recognition Using Gaussian Mixture Hidden Conditional Random Fields [J]. Muhammad Hameed Siddiqi, Madallah Alruwaili, Amjad Ali. Computational Intelligence and Neuroscience. 2019, No. 4
3. Human Facial Expression Recognition Using Stepwise Linear Discriminant Analysis and Hidden Conditional Random Fields [J]. Siddiqi Muhammad Hameed, Ali Rahman, Khan Adil Mehmood. IEEE Transactions on Image Processing. 2015, No. 4
4. Deep-Structured Hidden Conditional Random Fields for Phonetic Recognition [C]. Dong Yu, Li Deng. Annual Conference of the International Speech Communication Association. 2010
5. A Study on the Use of Conditional Random Fields for Automatic Speech Recognition [D]. Morris, Jeremy J. 2010
6. Human Activity Recognition Using Gaussian Mixture Hidden Conditional Random Fields [O]. Muhammad Hameed Siddiqi, Madallah Alruwaili, Amjad Ali. 2019
7. Language Recognition Using Deep-Structured Conditional Random Fields [O]. Dong Yu, Shizhen Wang, Zahi Karam. 2011
