首页> 外文会议>Annual conference of the International Speech Communication Association >Synthetic References for Template-based ASR using Posterior Features
【24h】

Synthetic References for Template-based ASR using Posterior Features

机译:基于后验特征的基于模板的ASR的合成参考

获取原文

摘要

Recently, the use of phoneme class-conditional probabilities as features (posterior features) for template-based ASR has been proposed. These features have been found to generalize well to unseen data and yield better systems than standard spectral-based features. In this paper, motivated by the high quality of current text-to-speech systems and the robustness of posterior features toward undesired variability, we investigate the use of synthetic speech to generate reference templates. The use of synthetic speech in template-based ASR not only allows to address the issue of in-domain data collection but also expansion of vocabulary. Using 75- and 600-word task-independent and speaker-independent setup on Phonebook database, we investigate different synthetic voices produced by the Festival HTS-based synthesizer trained on CMU ARCTIC databases. Our study shows that synthetic speech templates can yield performance comparable to the natural speech templates, especially with synthetic voices that have high intelligibility.
机译:最近,已经提出了将音素类别条件概率用作基于模板的ASR的特征(后验特征)。已经发现这些功能可以很好地推广到看不见的数据,并且比基于标准频谱的功能可以提供更好的系统。在本文中,受当前文本到语音系统的高质量以及后部特征对不希望的可变性的鲁棒性的影响,我们研究了使用合成语音来生成参考模板的情况。在基于模板的ASR中使用合成语音不仅可以解决域内数据收集问题,而且还可以扩展词汇量。使用电话簿数据库上的75字和600字独立于任务和与说话者无关的设置,我们研究了由在CMU ARCTIC数据库上训练的基于Festival HTS的合成器产生的不同合成声音。我们的研究表明,合成语音模板可以产生与自然语音模板相当的性能,尤其是对于具有高清晰度的合成语音而言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号