Statistical sequence-to-frame mapping techniques for voice conversion

Yu QIAO; Daisuke SAITO; Nobuaki MINEMATSU

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Statistical sequence-to-frame mapping techniques for voice conversion

【24h】

Statistical sequence-to-frame mapping techniques for voice conversion

机译：Statistical sequence-to-frame mapping techniques for voice conversion

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相关主题

摘要

Voice conversion, a task to transform one speaker's voice to another's, can be regarded as a problem to find a mapping function between voice spaces of two speakers. GMM-based statistical mapping methods have been widely used for voice conversion. However, the classical GMM-based techniques make use of a frame-to-frame mapping function, which largely ignores the contextual information existing over a speech sequence and usually causes over-smoothness of converted speech. It is well known that HMM yields an efficient method to model the density of a whole speech sequence and has found successes in speech recognition and synthesis. Inspired by this fact, this paper studies how to use HMM for voice conversion. We derive an HMM-based sequence-to-frame mapping function with statistical analysis. Different from previous HMM-based voice conversion methods that used forced alignment for segmentation and transform frames aligned to a state with its associated linear transformation, our method has a soft mapping function as a weighted summation of linear transformations. The weights are calculated as the HMM posterior probabilities of frames. We also propose and compare two methods to learn the parameters of our mapping functions, namely least square error estimation and maximum likelihood estimation. We carried out experiments to examine the proposed HMM-based method for voice conversion.

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2009年第375期|285-290|共6页
作者
Yu QIAO; Daisuke SAITO; Nobuaki MINEMATSU;
展开▼
作者单位

Grad. School of Info. Sci. and Tech., Univ. of Tokyo, Japan;

Grad. School of Engineering, Univ. of Tokyo Japan;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类电报、传真;
关键词
Voice conversion; Linear regression; Sequence-to-frame mapping; HMM;
入库时间 2024-01-25 20:26:30

Statistical sequence-to-frame mapping techniques for voice conversion

摘要

著录项

相关主题

期刊订阅