首页> 外文会议>Proceedings of the 6th International Conference on Speech Technology and Human-Computer Dialogue >Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion

【24h】

Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion

机译：通过分析声学模型的不确定性和混乱度，提高自动语音识别中的单词错误率并降低复杂度

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, a study about the uncertainty of the trained acoustic models and the confusion among these models is made in the context of speech recognition. The purpose is to find the most relevant voice features, hence the analysis is made on a per-feature basis. Model uncertainty is defined as a measure of feature distribution overlapping. A model is compared only to the models it is more similar to. Hence, confusion matrices are built from both feature distributions and recognition results. Next, the voice features are weighted according to their relevance in order to increase the discrimination among models, while relevance itself is deduced from the values of model uncertainty. Experimental results show that, by appropriate weighting, the recognition accuracy, in terms of Word Error Rate (WER), improves. Moreover, by removing the features with lower weights, the recognition accuracy is maintained, but the number of calculations is significantly reduced.

机译：本文在语音识别的背景下，对训练后的声学模型的不确定性以及这些模型之间的混淆进行了研究。目的是找到最相关的语音功能，因此将针对每个功能进行分析。模型不确定性定义为特征分布重叠的度量。仅将模型与与其更相似的模型进行比较。因此，混淆矩阵是根据特征分布和识别结果建立的。接下来，根据语音特征的相关性对语音特征进行加权，以增加模型之间的区分度，同时根据模型不确定性的值推导相关性本身。实验结果表明，通过适当的加权，就单词错误率（WER）而言，识别精度得以提高。而且，通过去除权重较低的特征，可以保持识别精度，但是计算数量却大大减少了。

著录项

来源
《Proceedings of the 6th International Conference on Speech Technology and Human-Computer Dialogue 》|2011年|p.1-8|共8页
会议地点
作者
Buzo Andi; Cucu Horia; Burileanu Corneliu; Pasca Miruna; Popescu Vladimir;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电声技术和语音信号处理 ;
关键词
Acoustic Model Uncertainty; Automatic Speech Recognition; Model Confusion;

机译：声学模型不确定度;自动语音识别;模型混淆;

相似文献

外文文献
中文文献
专利

1. Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies [J] . Cui Xiaodong, Zhang Wei, Finkler Ulrich, IEEE Signal Processing Magazine . 2020 ,第3期

机译：自动语音识别深神经网络声学模型的分布式训练：当前训练策略的比较
2. Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition [J] . Ryo MASUMURA, Taichi ASAMI, Takanobu OBA, IEICE transactions on information and systems . 2019 ,第12期

机译：潜在词递归神经网络语言模型用于自动语音识别
3. Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition [J] . Ryo MASUMURA, Taichi ASAMI, Takanobu OBA, IEICE transactions on information and systems . 2018 ,第6期

机译：基于潜在词语言模型混合的领域自适应语音自动识别
4. Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion [C] . Buzo Andi, Cucu Horia, Burileanu Corneliu, Conference on Speech Technology and Human-Computer Dialogue . 2011

机译：通过分析声学模型不确定性和混淆来单词误差率改善和自动语音识别的复杂性降低
5. Graph-based Semi-Supervised Learning in Acoustic Modeling for Automatic Speech Recognition. [D] . Liu, Yuzong. 2016

机译：用于自动语音识别的声学建模中基于图的半监督学习。
6. Words from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that discriminates between meanings straight from smart acoustic features bypassing the phoneme as recognition unit [O] . Denis Arnold, Fabian Tomaschek, Konstantin Sering, -1

机译：通过错误驱动的学习算法可以区分自发会话语音中的单词其准确性与人类类似可以从智能声学特征中区分出含义而绕过音素作为识别单元
7. Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition [O] . Sunit Sivasankaran, Emmanuel Vincent, Dominique Fohr 2021

机译：分析扬声器本地化误差对自动语音识别语音分离的影响

Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion

摘要

著录项

相似文献

相关主题

期刊订阅