首页> 外文期刊>IEICE Transactions on Information and Systems >HMM-Based Voice Conversion Using Quantized FO Context
【24h】

HMM-Based Voice Conversion Using Quantized FO Context

机译:使用量化FO上下文的基于HMM的语音转换

获取原文
获取原文并翻译 | 示例
       

摘要

We propose a segment-based voice conversion technique using hidden Markov model (HMM)-based speech synthesis with nonpar-allel training data. In the proposed technique, the phoneme information with durations and a quantized FO contour are extracted from the input speech of a source speaker, and are transmitted to a synthesis part. In the synthesis part, the quantized FO symbols are used as prosodic context. A phonetically and prosodically context-dependent label sequence is generated from the transmitted phoneme and the FO symbols. Then, converted speech is generated from the label sequence with durations using the target speaker's pre-trained context-dependent HMMs. In the model training, the models of the source and target speakers can be trained separately, hence there is no need to prepare parallel speech data of the source and target speakers. Objective and subjective experimental results show that the segment-based voice conversion with phonetic and prosodic contexts works effectively even if the parallel speech data is not available.
机译:我们提出了一种基于分段的语音转换技术,该技术使用了基于隐马尔可夫模型(HMM)的语音合成技术,并具有非等位训练数据。在所提出的技术中,具有持续时间和量化的FO轮廓的音素信息是从源说话者的输入语音中提取的,并且被发送到合成部分。在合成部分中,将量化的FO符号用作韵律情境。从发送的音素和FO符号生成语音和语音上下文相关的标签序列。然后,使用目标说话者的预先训练的上下文相关HMM从标签序列中生成具有持续时间的转换语音。在模型训练中,可以分别训练源说话者和目标说话者的模型,因此不需要准备源说话者和目标说话者的并行语音数据。客观的主观实验结果表明,即使没有并行语音数据,带有语音和韵律上下文的基于段的语音转换也可以有效地工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号