Performance Prediction of Speech Recognition Using Average-Voice-Based Speech Synthesis

Abstract

This paper describes a technique for predicting the performance of a speech recognition system using a small amount of data from the target speakers. The conventional HMM-based technique used a speaker-dependent model and therefore required a considerable amount of training data. To reduce the amount of training data, we introduce an average voice model as prior knowledge for the target speakers' acoustic models and adapt it to the target speakers' models using speaker adaptation. Experimental results show that the average voice model effectively reduces the amount of training data required from the target speakers, and that prediction accuracy improves significantly over the conventional technique, especially when only a small amount of training data is available.

Bibliographic details

Source
Annual Conference of the International Speech Communication Association (INTERSPEECH 2011), 2011, pp. 1964-1967 (4 pages)
Authors
Tatsuhiko Saito; Takashi Nose; Takao Kobayashi; Yohei Okato; Akio Horii
Original format: PDF
Chinese Library Classification: Communication
Keywords
speech recognition; performance prediction; average-voice-based speech synthesis; speaker adaptation

Similar literature

1. The relationship between speech recognition in noise and non-speech recognition in noise test performances: Implications for central auditory processing disorders testing [J]. Vermiglio Andrew J., Velappan Keerthana, Heeke Paige. Journal of communication disorders. 2019.
2. Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis [J]. Chutarat Chompunth, Suphattharachai Chomphan. American journal of applied sciences. 2012, No. 3.
3. Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis [J]. Suphattharachai Chomphan, Chutarat Chompunth. American journal of applied sciences. 2012, No. 3.
4. Tonal context labeling using quantized F0 symbols for improving tone correctness in average-voice-based speech synthesis [C]. Chunwijitra Vataya, Nose Takashi, Kobayashi Takao. 2011 IEEE International Conference on Acoustics, Speech and Signal Processing. 2011.
5. HMM-based non-intrusive speech quality and implementation of Viterbi score distribution and hiddenness based measures to improve the performance of speech recognition [D]. Talwar, Gaurav. 2006.
6. Individual Aided Speech-Recognition Performance and Predictions of Benefit for Listeners With Impaired Hearing Employing FADE [O]. Marc R. Schädler, David Hülsmeier, Anna Warzybok. 2020.
7. Improvement of Tone Intelligibility for Average-Voice-Based Thai Speech Synthesis [O]. Suphattharachai Chomphan, Chutarat Chompunth. 2012.