首页> 外文会议>Annual conference of the International Speech Communication Association >Modeling a Noisy-channel for Voice Conversion Using Articulatory Features
【24h】

Modeling a Noisy-channel for Voice Conversion Using Articulatory Features

机译:使用发音特征对噪声通道建模以进行语音转换

获取原文

摘要

In this paper, we propose modeling a noisy-channel for the task of voice conversion (VC). We have used the artificial neural networks (ANN) to capture speaker-specific characteristics of a target speaker which avoid the need for any training utterance from a source speaker. We use articulatory features (AFs) as a canonical form or speaker-independent representation of a speech signal. Our studies show that AFs contain a significant amount of speaker information in their trajectories. Suitable techniques are proposed to normalize the speaker-specific information in AF trajectories and the resultant AFs are used in voice conversion. The results of voice conversion evaluated using objective and subjective measures confirm that AFs can be used as a canonical form in nosiy-channel to capture speaker-specific characteristics of a target speaker.
机译:在本文中,我们建议为语音转换(VC)的任务建模一个噪声通道。我们已经使用人工神经网络(ANN)来捕获目标说话者的说话者特定特征,从而避免了源说话者的任何训练说话。我们使用发音特征(AF)作为语音信号的规范形式或与说话者无关的表示形式。我们的研究表明,AF在其轨迹中包含大量说话人信息。提出了合适的技术来标准化AF轨迹中的说话者特定信息,并且所得到的AF被用于语音转换。使用客观和主观措施评估的语音转换结果证实,AF可以用作嘈杂通道中的规范形式,以捕获目标说话者的特定说话者特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号