首页> 外文期刊>IEICE Transactions on Information and Systems >Hidden Conditional Neural Fields for Continuous Phoneme Speech Recognition
【24h】

Hidden Conditional Neural Fields for Continuous Phoneme Speech Recognition

机译:隐藏条件神经场的连续音素语音识别。

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, we propose Hidden Conditional Neural Fields (HCNF) for continuous phoneme speech recognition, which are a combination of Hidden Conditional Random Fields (HCRF) and a Multi-Layer Perceptron (MLP), and inherit their merits, namely, the discriminative property for sequences from HCRF and the ability to extract non-linear features from an MLP. HCNF can incorporate many types of features from which non-linear features can be extracted, and is trained by sequential criteria. We first present the formulation of HCNF and then examine three methods to further improve automatic speech recognition using HCNF, which is an objective function that explicitly considers training errors, provides a hierarchical tandem-style feature and includes a deep non-linear feature extractor for the observation function. We show that HCNF can be trained realistically without any initial model and outperforms HCRF and the triphone hidden Markov model trained by the minimum phone error (MPE) manner using experimental results for continuous English phoneme recognition on the TIMIT core test set and Japanese phoneme recognition on the IPA 100 test set.
机译:在本文中,我们提出了用于连续音素语音识别的隐藏条件神经场(HCNF),它是隐藏条件随机场(HCRF)和多层感知器(MLP)的组合,并继承了它们的优点,即判别式HCRF序列的特性和从MLP提取非线性特征的能力。 HCNF可以合并许多类型的特征,从中可以提取非线性特征,并通过顺序标准进行训练。我们首先介绍HCNF的公式,然后研究使用HCNF进一步改善自动语音识别的三种方法,HCNF是一个明确考虑训练错误的目标函数,提供了分层的串联样式特征,并包括针对该特征的深层非线性特征提取器观察功能。我们展示了HCNF可以在没有任何初始模型的情况下进行实际训练,并且优于TIFF核心测试集上连续英语音素识别和日语音素识别的实验结果,通过最小电话误差(MPE)方式训练了HCRF和三音素隐藏马尔可夫模型。 IPA 100测试仪。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号