Journal: IEICE Transactions on Information and Systems

Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models

Abstract

This paper presents a novel method of enhancing esophageal speech using statistical voice conversion. Esophageal speech is one of the alternative speaking methods for laryngectomees. Although it does not require any external devices, the generated voices usually sound unnatural compared with normal speech. To improve the intelligibility and naturalness of esophageal speech, we propose a voice conversion method from esophageal speech into normal speech. A spectral parameter and excitation parameters of the target normal speech are estimated separately from a spectral parameter of the esophageal speech, based on Gaussian mixture models. The experimental results demonstrate that the proposed method yields significant improvements in intelligibility and naturalness. We also apply one-to-many eigenvoice conversion to esophageal speech enhancement, which makes it possible to flexibly control the voice quality of the enhanced speech.
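
The mapping described in the abstract, where target parameters are estimated from a source spectral parameter through a Gaussian mixture model, can be illustrated with a small sketch. The abstract does not give the estimation procedure, so what follows is only an assumption: it uses the textbook conditional-expectation (minimum mean-square error) mapping under a joint-density GMM, with scalar features and hand-picked parameters for brevity. All names (`JointComponent`, `posteriors`, `convert`, `spectral_gmm`, `excitation_gmm`) and all numeric values are hypothetical and are not taken from the paper.

```python
import math
from dataclasses import dataclass


@dataclass
class JointComponent:
    """One mixture component of a joint source/target GMM (scalar features)."""
    weight: float  # mixture weight
    mean_x: float  # mean of the source (esophageal) feature
    mean_y: float  # mean of the target (normal speech) feature
    var_x: float   # variance of the source feature
    cov_xy: float  # covariance between source and target features


def posteriors(x, components):
    """Posterior probability of each component given the source feature x."""
    logs = [
        math.log(c.weight)
        - 0.5 * math.log(2.0 * math.pi * c.var_x)
        - 0.5 * (x - c.mean_x) ** 2 / c.var_x
        for c in components
    ]
    peak = max(logs)  # subtract the maximum to avoid underflow
    exps = [math.exp(value - peak) for value in logs]
    total = sum(exps)
    return [e / total for e in exps]


def convert(x, components):
    """Conditional expectation E[y | x] under the joint GMM."""
    post = posteriors(x, components)
    return sum(
        p * (c.mean_y + c.cov_xy / c.var_x * (x - c.mean_x))
        for p, c in zip(post, components)
    )


# Hypothetical, hand-picked models. Following the abstract, the spectral and
# excitation targets use separate GMMs, both driven by the same source feature.
spectral_gmm = [
    JointComponent(0.5, -5.0, 1.0, 1.0, 0.5),
    JointComponent(0.5, 5.0, 10.0, 1.0, -0.5),
]
excitation_gmm = [
    JointComponent(0.5, -5.0, 0.0, 1.0, 0.2),
    JointComponent(0.5, 5.0, 4.0, 1.0, 0.3),
]

source_frame = -4.2  # hypothetical source spectral feature for one frame
converted_spectrum = convert(source_frame, spectral_gmm)
converted_excitation = convert(source_frame, excitation_gmm)
```

In practice the features would be multidimensional vectors (so the variances and covariances become matrices) and the GMM parameters would be trained on time-aligned parallel recordings rather than set by hand. The frame-by-frame mapping above also ignores temporal dynamics.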