首页> 外文会议>IEEE EUROCON 2013 >Runtime and speech quality survey of a voice conversion method
【24h】

Runtime and speech quality survey of a voice conversion method

机译:语音转换方法的运行时和语音质量调查

获取原文
获取原文并翻译 | 示例

摘要

Several methods for voice conversion have been established. The research aims at the characteristics of a target speaker and a near-to-natural speech quality. This contribution summarizes the listening experiments with four conversion methods including the assessment of speech quality, listening effort and similarity to the target voice. The subjective evaluation of similarity is checked by an instrumental distance measure based on logarithmic spectral distortion. Practical applications of voice conversion require an appropriate runtime performance and memory use. We select a conversion method based on VTLN to demonstrate the runtime and quality trade-off. In the case example, we survey the quality assessment depending on different training constellations with a varied data amount and training time. Furthermore, we discuss the runtime performance of the selected conversion method under typical operating conditions. The experiments cover the influence of system resources, setting of conversion parameters (warping factors) and different training constellations. The observed real-time factors of a non-optimized laboratory VC version are inappropriate for typical application scenarios.
机译:已经建立了几种语音转换方法。该研究旨在针对目标说话者的特征和接近自然的语音质量。该贡献总结了使用四种转换方法的听力实验,包括评估语音质量,聆听效果以及与目标语音的相似性。通过基于对数频谱失真的仪器距离测量来检查相似性的主观评估。语音转换的实际应用需要适当的运行时性能和内存使用。我们选择一种基于VTLN的转换方法来演示运行时和质量之间的权衡。在本例中,我们根据具有不同数据量和培训时间的不同培训星座调查质量评估。此外,我们讨论了在典型操作条件下所选转换方法的运行时性能。实验涵盖了系统资源的影响,转换参数(翘曲因子)的设置以及不同的训练星座。未优化的实验室VC版本的观察到的实时因素不适用于典型的应用场景。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号