首页> 外文会议>INTERSPEECH 2012 >Synthetic References for Template-based ASR using Posterior Features
【24h】

Synthetic References for Template-based ASR using Posterior Features

机译:使用后部特征的基于模板的ASR的合成引用

获取原文

摘要

Recently, the use of phoneme class-conditional probabilities as features (posterior features) for template-based ASR has been proposed. These features have been found to generalize well to unseen data and yield better systems than standard spectralbased features. In this paper, motivated by the high quality of current text-to-speech systems and the robustness of posterior features toward undesired variability, we investigate the use of synthetic speech to generate reference templates. The use of synthetic speech in template-based ASR not only allows to address the issue of in-domain data collection but also expansion of vocabulary. Using 75- and 600-word task-independent and speaker-independent setup on Phonebook database, we investigate different synthetic voices produced by the Festival HTS-based synthesizer trained on CMU ARCTIC databases. Our study shows that synthetic speech templates can yield performance cornparable to the natural speech templates, especially with synthetic voices that have high intelligibility.
机译:最近,基于模板的ASR使用音素类条件概率为特征(后功能)已经提出。这些功能已被发现以及推广到看不见的数据,并产生比标准spectralbased功能更好的系统。在本文中,由当前的文本到语音系统的高品质和对不需要的变异后的功能鲁棒性动机,我们调查使用合成语音的生成参考模板。在基于模板的ASR使用合成语音,不仅可以解决域数据采集的问题,也是扩大词汇量。使用75-和电话簿数据库600字的任务无关和独立扬声器的设置,我们调查由经过培训的CMU ARCTIC数据库节HTS为基础的合成器产生不同的合成声音。我们的研究表明,合成语音的模板也能产生性能cornparable到自然语音模板,特别是具有高清晰度合成声音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号