首页> 外文会议>IEEE Power Engineering Society Winter Meeting, 2001, 2001 >Methods for improving robustness of decision tree in Mandarin speech recognition

【24h】

Methods for improving robustness of decision tree in Mandarin speech recognition

机译：提高普通话语音识别中决策树鲁棒性的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Phonetic decision tree based state tying has been widely used in most large vocabulary continuous speech recognition (LVCSR) systems. However, in most cases, the samples of different leaf nodes are very unbalanced, which may affect the recognition performance. In This work, node merging techniques are proposed to alleviate the problem and further decrease the number of senones. On the other hand, in order to lessen the impact of rare triphones on the quality of the decision tree based state tying and improve the accuracy of every final senone, two methods of dealing with rare triphones are added to hidden Markov model (HMM) acoustic modeling before state tying. Experimental results show that these methods greatly improve the robustness of the decision tree and can achieve better performance with even fewer parameters.

机译：基于语音决策树的状态绑定已在大多数大型词汇连续语音识别（LVCSR）系统中广泛使用。但是，在大多数情况下，不同叶节点的样本非常不平衡，这可能会影响识别性能。在这项工作中，提出了节点合并技术来缓解该问题并进一步减少senone的数量。另一方面，为了减少稀有三音对基于决策树的状态绑定质量的影响并提高每个最终senone的准确性，在隐马尔可夫模型（HMM）声学中增加了两种处理稀有三音的方法状态绑定之前进行建模。实验结果表明，这些方法大大提高了决策树的鲁棒性，甚至可以用更少的参数获得更好的性能。

著录项

来源
《IEEE Power Engineering Society Winter Meeting, 2001, 2001 》|2001年|p.1975-1978|共4页
会议地点
作者
Xianghua Xu; Jie Zhu; Qiang Guo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术 ;
关键词

相似文献

外文文献
中文文献
专利

1. SPEECH EMOTION RECOGNITION METHOD BASED ON IMPROVED DECISION TREE AND LAYERED FEATURE SELECTION [J] . QIRONG MAO, XIAOJIA WANG, YONGZHAO ZHAN International journal of humanoid robotics . 2010 ,第2期

机译：基于改进决策树和分层特征选择的语音情感识别方法
2. Robust SBR method for adverse Mandarin speech recognition [J] . Wei-Tyng Hong, Sin-Horng Chen Electronics Letters . 1999 ,第11期

机译：鲁棒的SBR方法用于普通话语音识别
3. Robust decision tree state tying for continuous speech recognition [J] . Reichl W., Wu Chou IEEE Transactions on Speech and Audio Proceeding . 2000 ,第5期

机译：鲁棒的决策树状态绑定可实现连续语音识别
4. Methods for improving robustness of decision tree in Mandarin speech recognition [C] . Xianghua Xu, Jie Zhu, Qiang Guo . 2004

机译：提高普通话语音识别中决策树鲁棒性的方法
5. Compressive nonlinearity for representing speech spectral magnitude to improve noise robustness of automatic speech recognition . [D] . Wong, Brian. 2011

机译：压缩非线性表示语音频谱幅度提高语音自动识别的鲁棒性。
6. The Binaural Masking-Level Difference of Mandarin Tone Detection and the Binaural Intelligibility-Level Difference of Mandarin Tone Recognition in the Presence of Speech-Spectrum Noise [O] . Cheng-Yu Ho, Pei-Chun Li, Yuan-Chuan Chiang, -1

机译：语音频谱噪声下普通话检测的双耳掩蔽水平差异和普通话识别的双耳可懂度水平差异
7. Robust Visual Lips Feature Extraction Method for Improved Visual Speech Recognition System [O] . 2018

机译：强大的视觉嘴唇特征提取方法，用于改进的视觉语音识别系统

Methods for improving robustness of decision tree in Mandarin speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅