Training data selection for improving discriminative training of acoustic models

Berlin Chen; Shih-Hung Liu; Fang-Hui Chu


Abstract

This paper considers training data selection for discriminative training of acoustic models for large vocabulary continuous speech recognition (LVCSR). Three novel data selection approaches are proposed. First, the average phone accuracy over all hypothesized word sequences in the word lattice of a training utterance is used for utterance-level data selection. Second, phone-level data selection is investigated, based on the difference between the expected accuracy of a phone arc and the average phone accuracy of the word lattice. Finally, frame-level data selection is explored, based on the normalized frame-level entropy of Gaussian posterior probabilities obtained from the word lattice. The underlying characteristics of the presented approaches are extensively investigated, and their performance is verified by comparison with standard discriminative training approaches. Experiments conducted on a broadcast news speech transcription task show that, with the aid of phone- and frame-level data selection, the turnaround time for acoustic model training can be reduced by more than half while still yielding a comparably good set of discriminative acoustic models.
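The frame-level criterion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so everything below is an assumption made for illustration: the entropy is normalized by the log of the number of Gaussians in the frame, frames are kept when their normalized entropy reaches a threshold (on the idea that high-entropy frames are the confusable ones), and the names `normalized_entropy`, `select_frames`, and the default `threshold` value are hypothetical.

```python
import math


def normalized_entropy(posteriors):
    """Entropy of one frame's Gaussian posteriors, scaled to [0, 1].

    Assumption: normalization divides by log(N), where N is the number
    of Gaussians considered for the frame; the paper may normalize
    differently.
    """
    total = sum(posteriors)
    if total <= 0 or len(posteriors) < 2:
        return 0.0
    # Renormalize so the values form a distribution; skip zeros (0 * log 0 = 0).
    probs = [p / total for p in posteriors if p > 0]
    entropy = sum(-p * math.log(p) for p in probs)
    return entropy / math.log(len(posteriors))


def select_frames(frame_posteriors, threshold=0.05):
    """Return indices of frames whose normalized entropy is >= threshold.

    Assumption: frames with near-zero entropy are treated as already
    well classified and are skipped during training.
    """
    return [
        t
        for t, posteriors in enumerate(frame_posteriors)
        if normalized_entropy(posteriors) >= threshold
    ]


if __name__ == "__main__":
    # Hypothetical posteriors for three frames over two Gaussians.
    frames = [[1.0, 0.0], [0.5, 0.5], [0.9, 0.1]]
    print(select_frames(frames, threshold=0.5))
```

In this sketch a frame dominated by a single Gaussian has entropy near 0 and is dropped, while a frame whose posterior mass is spread evenly has entropy near 1 and is kept. Skipping the low-entropy frames is one plausible source of the training-time savings the abstract reports.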


Bibliographic record

Source
Pattern Recognition Letters | 2009, issue 13 | pp. 1228-1235 | 8 pages
Authors
Berlin Chen; Shih-Hung Liu; Fang-Hui Chu
Affiliation
Department of Computer Science and Information Engineering, National Taiwan Normal University, Taipei 116, Taiwan (all three authors)
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords
continuous speech recognition; discriminative training; acoustic models; data selection; phone accuracy; entropy

Related literature

1. Semi-Supervised Acoustic Model Training by Discriminative Data Selection From Multiple ASR Systems' Hypotheses [J]. Sheng Li, Yuya Akita, Tatsuya Kawahara. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016, no. 9.
2. Automatic Lecture Transcription Based on Discriminative Data Selection for Lightly Supervised Acoustic Model Training [J]. Sheng Li, Yuya Akita, Tatsuya Kawahara. IEICE Transactions on Information and Systems, 2015, no. 8.
3. Discriminative Data Selection from Multiple ASR Systems' Hypotheses for Unsupervised Acoustic Model Training [J]. Sheng Li, Yuya Akita, Tatsuya Kawahara. 電子情報通信学会技術研究報告 (IEICE Technical Report), Speech, 2015, no. 346.
4. Training data selection for improving discriminative training of acoustic models [C]. Shih-Hung Liu, Fang-Hui Chu, Shih-Hsiang Lin. IEEE Workshop on Automatic Speech Recognition and Understanding, 2007.
5. Optimal generative and discriminative acoustic model training for speech recognition [D]. Neil Joshi. 2009.
6. Improving Robustness of Deep Neural Network Acoustic Models via Speech Separation and Joint Adaptive Training [O]. Arun Narayanan, DeLiang Wang.
7. Training Data Selection for Discriminative Training of Acoustic Models [O]. Fang-Hui Chu (朱芳輝). 2011.
