首页> 外文会议>International Speech Communication Association >A Novel Technique for Voice Conversion Based on Style and ContentDecomposition with Bilinear Models
【24h】

A Novel Technique for Voice Conversion Based on Style and ContentDecomposition with Bilinear Models

机译:一种基于Bilinear模型的样式和ContentDecomposition的语音转换新技术

获取原文

摘要

This paper presents a novel technique for voice conversion by solving a two-factor task using bilinear models. The spectral content of the speech represented as line spectral frequencies is separated into so-called style and content parameterizations using a framework proposed in [1]. This formulation of the voice conversion problem in terms of style and content offers a flexible representation of factor interactions and facilitates the use of efficient training algorithms based on singular value decomposition and expectation maximization. Promising results in a comparison with the traditional Gaussian mixture model based method indicate increased robustness with small training sets.
机译:本文通过使用Bilinear模型来解决双因素任务,提出了一种用于语音转换的新技术。表示作为线谱频率的语音的光谱含量被[1]中提出的框架分离为所谓的样式和内容参数化。这种在风格和内容方面的语音转换问题的配方提供了因子交互的灵活表示,并促进了基于奇异值分解和期望最大化的高效训练算法。有希望的结果与传统的高斯混合模型的方法相比表明,对小型训练集的鲁棒性增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号