STRAIGHT model for voice conversion based on acoustical universal structure

Abstract

Existing voice conversion (VC) systems based on Gaussian mixture models (GMM) suffer from over-smoothing in the GMM mapping. To address this problem, this paper proposes a method based on the Acoustical Universal Structure (ASU) that can be applied to GMM-based voice conversion. Our contributions include: 1) the speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT) model is adopted, which allows flexible manipulation of speech parameters such as pitch, vocal tract length, and speaking rate while maintaining high reproduction quality; 2) the method's advantage is attributed to the introduction of a predictable spectrum: the ASU is introduced to form the mapping relationship between the source speaker and the target speaker; 3) in the training phase, a feedback strategy is adopted, which guarantees smooth transitions of spectral parameters between frames. Objective and subjective tests indicate that the proposed method dramatically improves VC performance in terms of speech quality, conversion accuracy, and naturalness of speaker individuality.
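
For context, the GMM mapping mentioned in the abstract is conventionally formulated as a conditional expectation under a joint source-target GMM. Because the output is a posterior-weighted average of per-component predictions, spectral detail tends to be averaged away, which is the usual explanation for over-smoothing. The sketch below illustrates that conventional baseline only; it is not the paper's ASU method, and the function name `gmm_convert` and the toy parameters are hypothetical.

```python
import numpy as np

def gmm_convert(x, weights, mu_x, mu_y, cov_xx, cov_yx):
    """Map one source frame x to a target frame with a joint-density GMM.

    Conventional conditional-expectation mapping (assumed baseline),
    not the ASU-based method proposed in the paper.
    """
    x = np.asarray(x, dtype=float)
    n_mix = len(weights)
    log_post = np.empty(n_mix)
    cond_means = np.empty((n_mix, mu_y.shape[1]))
    for m in range(n_mix):
        diff = x - mu_x[m]
        solved = np.linalg.solve(cov_xx[m], diff)  # cov_xx^{-1} (x - mu_x)
        _, logdet = np.linalg.slogdet(cov_xx[m])
        # Unnormalised log posterior; the shared (2*pi) constant cancels.
        log_post[m] = np.log(weights[m]) - 0.5 * (diff @ solved + logdet)
        # Per-component linear regression from source to target.
        cond_means[m] = mu_y[m] + cov_yx[m] @ solved
    log_post -= log_post.max()  # numerical stability before exponentiating
    post = np.exp(log_post)
    post /= post.sum()
    # The posterior-weighted average is where over-smoothing comes from.
    return post @ cond_means

# Toy two-component, one-dimensional model (made-up numbers).
weights = np.array([0.5, 0.5])
mu_x = np.array([[0.0], [4.0]])
mu_y = np.array([[1.0], [5.0]])
cov_xx = np.array([[[1.0]], [[1.0]]])
cov_yx = np.array([[[0.5]], [[0.5]]])

midpoint = gmm_convert([2.0], weights, mu_x, mu_y, cov_xx, cov_yx)
near_first = gmm_convert([0.0], weights, mu_x, mu_y, cov_xx, cov_yx)
```

In this toy model a source frame halfway between the two components is mapped to the average of the two per-component predictions, while a frame close to one component is dominated by that component's regression.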

Bibliographic record

Source: 2012 International Conference on Audio, Language and Image Processing, 2012, pp. 454-458 (5 pages)
Conference location: Shanghai (CN)
Authors: Xu Gang; Zhou Qi; Zhao Dong; Huang Ding
Author affiliation: Electrical Electronic Engineering School, NCEPU
Original format: PDF
Text language: English
CLC classification: Information processing

Similar literature

1. A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models [J]. Shinnosuke Takamichi, Tomoki Toda, Graham Neubig. IEICE Transactions on Information and Systems, 2016, No. 10.
2. Voice conversion algorithm based on Gaussian mixture model applied to STRAIGHT [J]. Tomoki Toda, Jinlin Lu, Satoshi Nakamura. 電子情報通信学会技術研究報告. 音声. Speech, 2000, No. 97.
3. STRAIGHT model for voice conversion based on acoustical universal structure [C]. Xu Gang, Zhou Qi, Zhao Dong. International Conference on Audio, Language and Image Processing, 2012.
4. Posteriorgram-to-Acoustic Modeling for Unconstrained Voice Conversion with Deep Learning [D]. Sun, Lifa. 2017.
5. Computational Modeling of Fluid–Structure–Acoustics Interaction during Voice Production [O]. Weili Jiang, Xudong Zheng, Qian Xue. 2017.
6. Straight-Based Voice Conversion Algorithm Based on Gaussian Mixture Model [O]. Tomoki Toda, Jinlin Lu, Hiroshi Saruwatari. 2000.
