首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2011 >Performance Prediction of Speech Recognition Using Average-Voice-Based Speech Synthesis
【24h】

Performance Prediction of Speech Recognition Using Average-Voice-Based Speech Synthesis

机译:基于平均语音的语音合成对语音识别的性能预测

获取原文

摘要

This paper describes a performance prediction technique of a speech recognition system using a small amount of target speakers' data. In the conventional HMM-based technique, a speaker-dependent model was used and thus a considerable amount of training data was needed. To reduce the amount of training data, we introduce an average voice model as a prior knowledge for the target speakers' acoustic models, and adapt it to the target speakers' ones using speaker adaptation. Experimental results show that the use of average voice model effectively save the amount of training data of the target speakers, and the prediction accuracy is significantly improved compared to the conventional technique especially when a smaller amount of training data is available.
机译:本文介绍了一种使用少量目标说话者数据的语音识别系统的性能预测技术。在传统的基于HMM的技术中,使用了与说话者相关的模型,因此需要大量的训练数据。为了减少训练数据的数量,我们引入了平均语音模型作为目标说话人声学模型的先验知识,并使用说话人自适应将其适应目标说话人的声学模型。实验结果表明,平均语音模型的使用有效地节省了目标说话者的训练数据量,与传统技术相比,尤其是在训练数据量较少的情况下,预测精度得到了显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号