Hidden Conditional Neural Fields for Continuous Phoneme Speech Recognition

Yasuhisa FUJII; Kazumasa YAMAMOTO; Seiichi NAKAGAWA

Abstract

In this paper, we propose Hidden Conditional Neural Fields (HCNF) for continuous phoneme speech recognition. HCNF combines Hidden Conditional Random Fields (HCRF) with a Multi-Layer Perceptron (MLP) and inherits the merits of both: the discriminative property for sequences from HCRF and the ability to extract non-linear features from an MLP. HCNF can incorporate many types of features, from which non-linear features can be extracted, and is trained with sequential criteria. We first present the formulation of HCNF and then examine three methods to further improve automatic speech recognition using HCNF: an objective function that explicitly considers training errors, a hierarchical tandem-style feature, and a deep non-linear feature extractor for the observation function. Using experimental results for continuous English phoneme recognition on the TIMIT core test set and Japanese phoneme recognition on the IPA 100 test set, we show that HCNF can be trained realistically without any initial model and outperforms both HCRF and the triphone hidden Markov model trained with the minimum phone error (MPE) criterion.
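The abstract does not reproduce the HCNF formulation, so the following is only a rough sketch of the general idea it describes: a log-linear sequence model whose observation score passes the input through non-linear gate units (a single MLP-style hidden layer) and which is normalized over whole label sequences rather than frame by frame. It is a simplified chain model without hidden states, not the authors' model. Every name here (`gates`, `out_weights`, `trans`, `log_prob`) and every numeric value is a hypothetical choice made for illustration.

```python
import math
from itertools import product


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def logsumexp(values):
    # Numerically stable log(sum(exp(v))) for a list of floats.
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def observation_score(x, label, gates, out_weights):
    # Non-linear feature extraction: each gate is a sigmoid unit over the frame x.
    hidden = [sigmoid(sum(g_i * x_i for g_i, x_i in zip(g, x))) for g in gates]
    # Label-specific linear combination of the gate outputs.
    return sum(w * h for w, h in zip(out_weights[label], hidden))


def sequence_score(xs, labels, gates, out_weights, trans):
    # Unnormalized log-score of one label sequence: observation + transition terms.
    score = 0.0
    prev = None
    for x, y in zip(xs, labels):
        score += observation_score(x, y, gates, out_weights)
        if prev is not None:
            score += trans[prev][y]
        prev = y
    return score


def log_partition(xs, gates, out_weights, trans):
    # Forward algorithm in log space: sums over all label sequences.
    n_labels = len(out_weights)
    alpha = [observation_score(xs[0], y, gates, out_weights) for y in range(n_labels)]
    for x in xs[1:]:
        alpha = [
            observation_score(x, y, gates, out_weights)
            + logsumexp([alpha[p] + trans[p][y] for p in range(n_labels)])
            for y in range(n_labels)
        ]
    return logsumexp(alpha)


def log_prob(xs, labels, gates, out_weights, trans):
    # Sequence-level conditional log-probability log p(labels | xs).
    return sequence_score(xs, labels, gates, out_weights, trans) - log_partition(
        xs, gates, out_weights, trans
    )


# Toy example: 3 frames of 2-dimensional features, 2 gate units, 2 labels.
xs = [[0.5, -1.0], [1.5, 0.2], [-0.3, 0.8]]
gates = [[1.0, -0.5], [0.3, 0.7]]
out_weights = [[0.2, -0.4], [0.6, 0.1]]
trans = [[0.1, -0.2], [0.0, 0.3]]

total = sum(
    math.exp(log_prob(xs, list(ys), gates, out_weights, trans))
    for ys in product(range(2), repeat=len(xs))
)
print(round(total, 6))  # probabilities over all label sequences sum to 1
```

Because normalization runs over whole sequences, a sequence-level training criterion would maximize `log_prob` for the reference labels. The model described in the abstract additionally uses hidden states, an error-aware objective and deeper feature extractors, none of which this sketch attempts.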

Bibliographic details

Source: IEICE Transactions on Information and Systems, 2012, No. 8, pp. 2094-2104 (11 pages)
Authors: Yasuhisa FUJII; Kazumasa YAMAMOTO; Seiichi NAKAGAWA
Affiliation (all three authors): Department of Information and Computer Sciences, Toyohashi University of Technology, Toyohashi-shi, 441-8580 Japan
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords: hidden conditional neural fields; hidden conditional random fields; hidden Markov model; speech recognition; deep learning
Date added to database: 2022-08-18 00:26:19
