Training data selection for improving discriminative training of acoustic models

Abstract

This paper considers training data selection for discriminative training of acoustic models for broadcast news speech recognition. Three novel data selection approaches were proposed. First, the average phone accuracy over all hypothesized word sequences in the word lattice of a training utterance was utilized for utterance-level data selection. Second, phone-level data selection based on the difference between the expected accuracy of a phone arc and the average phone accuracy of the word lattice was investigated. Finally, frame-level data selection based on the normalized frame-level entropy of Gaussian posterior probabilities obtained from the word lattice was explored. The underlying characteristics of the presented approaches were extensively investigated, and their performance was verified by comparison with the standard discriminative training approaches. Experiments conducted on Mandarin broadcast news collected in Taiwan showed that both phone- and frame-level data selection could achieve slight but consistent improvements over the baseline systems at lower training iterations.
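The frame-level criterion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so everything here is an assumption for illustration: the entropy is normalized by the log of the number of Gaussians, frames are kept when their normalized entropy reaches a threshold, and the names `normalized_entropy`, `select_frames`, and `threshold` are hypothetical rather than taken from the paper.

```python
import math


def normalized_entropy(posteriors):
    """Entropy of one frame's posteriors, scaled to [0, 1] by log(N).

    Assumption: N is the number of Gaussians with a posterior at this frame.
    """
    n = len(posteriors)
    total = sum(posteriors)
    if n < 2 or total <= 0.0:
        return 0.0
    # Renormalize so the values form a probability distribution.
    probs = [p / total for p in posteriors]
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    return entropy / math.log(n)


def select_frames(frame_posteriors, threshold=0.5):
    """Return indices of frames whose normalized entropy is >= threshold.

    Assumption: high-entropy (ambiguous) frames are the ones selected;
    the threshold value and the direction of the test are illustrative.
    """
    return [
        t
        for t, posteriors in enumerate(frame_posteriors)
        if normalized_entropy(posteriors) >= threshold
    ]


if __name__ == "__main__":
    frames = [
        [0.5, 0.5],                # maximally ambiguous frame
        [0.99, 0.01],              # confident frame
        [0.25, 0.25, 0.25, 0.25],  # maximally ambiguous frame
    ]
    print(select_frames(frames, threshold=0.5))
```

Dividing by log(N) makes the score comparable across frames with different numbers of competing Gaussians, which is one plausible reading of "normalized" in the abstract.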

Bibliographic details

Source: IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2007, 6 pages
Authors: Shih-Hung Liu; Fang-Hui Chu; Shih-Hsiang Lin; Hung-Shin Lee; Berlin Chen
Original format: PDF
Chinese Library Classification: Automation technology, computer technology
Keywords: acoustic models; data selection; discriminative training; entropy; speech recognition

Date added to database: 2022-08-21 01:02:21

Similar literature

1. Training data selection for improving discriminative training of acoustic models [J]. Berlin Chen, Shih-Hung Liu, Fang-Hui Chu. Pattern Recognition Letters, 2009, No. 13.
2. Semi-Supervised Acoustic Model Training by Discriminative Data Selection From Multiple ASR Systems’ Hypotheses [J]. Sheng Li, Yuya Akita, Tatsuya Kawahara. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016, No. 9.
3. Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training [J]. Sheng LI, Yuya AKITA, Tatsuya KAWAHARA. IEICE Transactions on Information and Systems, 2015, No. 8.
4. Training data selection for improving discriminative training of acoustic models [C]. Shih-Hung Liu, Fang-Hui Chu, Shih-Hsiang Lin. IEEE Workshop on Automatic Speech Recognition and Understanding, 2007.
5. Optimal generative and discriminative acoustic model training for speech recognition [D]. Joshi, Neil. 2009.
6. Improving Robustness of Deep Neural Network Acoustic Models via Speech Separation and Joint Adaptive Training [O]. Arun Narayanan, DeLiang Wang.
7. Training Data Selection for Discriminative Training of Acoustic Models [O]. 朱芳輝 (Fang-Hui Chu). 2011.
