INTERSPEECH 2012

Iterative MMSE Estimation of Vocal Tract Length Normalization Factors for Voice Transformation

Abstract

We present a method that determines the optimal configuration of a bilinear vocal tract length normalization function to transform the frequency axis of one voice according to a specific target voice. Given a number of parallel utterances from the speakers involved, the single parameter of this function can be calculated through an iterative procedure that minimizes an objective error measure defined in the cepstral domain. The method also applies when multiple warping classes are considered, and it can be complemented with amplitude correction filters. The resulting physically motivated cepstral transformation yields highly satisfactory conversion accuracy and improved quality with respect to standard statistical systems.
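The idea of fitting a single bilinear warping parameter by minimizing a cepstral error can be illustrated with a small sketch. This is not the paper's procedure: the iterative MMSE update is not given in the abstract, so a coarse grid search followed by golden-section refinement is swapped in here. The function names, the sign convention for `alpha`, the numerical cosine projection used to warp the cepstrum, and the toy frames are all assumptions made for illustration.

```python
import math

def bilinear_warp(omega, alpha):
    """Warp a frequency omega in [0, pi] with the first-order all-pass (bilinear) function."""
    return omega + 2.0 * math.atan2(alpha * math.sin(omega), 1.0 - alpha * math.cos(omega))

def log_envelope(cepstrum, omega):
    """Log-amplitude envelope c[0] + 2 * sum_n c[n] cos(n * omega) of a real cepstrum."""
    return cepstrum[0] + 2.0 * sum(c * math.cos(n * omega)
                                   for n, c in enumerate(cepstrum) if n > 0)

def warp_cepstrum(cepstrum, alpha, n_points=128):
    """Cepstrum of the warped envelope, truncated to the input order (midpoint-rule projection)."""
    order = len(cepstrum)
    warped = [0.0] * order
    for k in range(n_points):
        omega = (k + 0.5) * math.pi / n_points
        value = log_envelope(cepstrum, bilinear_warp(omega, alpha))
        for m in range(order):
            warped[m] += value * math.cos(m * omega)
    return [w / n_points for w in warped]

def cepstral_mse(alpha, source_frames, target_frames):
    """Mean squared cepstral error between warped source frames and paired target frames."""
    total = 0.0
    for src, tgt in zip(source_frames, target_frames):
        warped = warp_cepstrum(src, alpha)
        total += sum((w - t) ** 2 for w, t in zip(warped, tgt))
    return total / len(source_frames)

def golden_section(f, lo, hi, iters=50):
    """Shrink the bracket [lo, hi] around a minimum of f by golden-section search."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    fc, fd = f(c), f(d)
    for _ in range(iters):
        if fc < fd:
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = f(c)
        else:
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = f(d)
    return 0.5 * (a + b)

def estimate_alpha(source_frames, target_frames, lo=-0.5, hi=0.5, coarse_steps=20):
    """Coarse grid search over alpha, then local refinement (stand-in for the paper's method)."""
    def cost(alpha):
        return cepstral_mse(alpha, source_frames, target_frames)
    step = (hi - lo) / coarse_steps
    best = min((lo + i * step for i in range(coarse_steps + 1)), key=cost)
    return golden_section(cost, max(lo, best - step), min(hi, best + step))

# Toy parallel data (hypothetical): targets are the sources warped with a known alpha.
source = [[0.1, 0.5, 0.1], [0.0, 0.4, 0.05]]
target = [warp_cepstrum(c, 0.1) for c in source]
alpha_hat = estimate_alpha(source, target)
```

In real use the paired frames would come from time-aligned parallel utterances of the two speakers, and one parameter could be fitted per warping class by running the same search on each class's frames.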