HMM-Based Voice Conversion Using Quantized FO Context

Takashi NOSE; Yuhei OTA; Takao KOBAYASHI

首页> 外文期刊>IEICE Transactions on Information and Systems >HMM-Based Voice Conversion Using Quantized FO Context

【24h】

HMM-Based Voice Conversion Using Quantized FO Context

机译：使用量化FO上下文的基于HMM的语音转换

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a segment-based voice conversion technique using hidden Markov model (HMM)-based speech synthesis with nonpar-allel training data. In the proposed technique, the phoneme information with durations and a quantized FO contour are extracted from the input speech of a source speaker, and are transmitted to a synthesis part. In the synthesis part, the quantized FO symbols are used as prosodic context. A phonetically and prosodically context-dependent label sequence is generated from the transmitted phoneme and the FO symbols. Then, converted speech is generated from the label sequence with durations using the target speaker's pre-trained context-dependent HMMs. In the model training, the models of the source and target speakers can be trained separately, hence there is no need to prepare parallel speech data of the source and target speakers. Objective and subjective experimental results show that the segment-based voice conversion with phonetic and prosodic contexts works effectively even if the parallel speech data is not available.

机译：我们提出了一种基于分段的语音转换技术，该技术使用了基于隐马尔可夫模型（HMM）的语音合成技术，并具有非等位训练数据。在所提出的技术中，具有持续时间和量化的FO轮廓的音素信息是从源说话者的输入语音中提取的，并且被发送到合成部分。在合成部分中，将量化的FO符号用作韵律情境。从发送的音素和FO符号生成语音和语音上下文相关的标签序列。然后，使用目标说话者的预先训练的上下文相关HMM从标签序列中生成具有持续时间的转换语音。在模型训练中，可以分别训练源说话者和目标说话者的模型，因此不需要准备源说话者和目标说话者的并行语音数据。客观的主观实验结果表明，即使没有并行语音数据，带有语音和韵律上下文的基于段的语音转换也可以有效地工作。

著录项

来源
《IEICE Transactions on Information and Systems》 |2010年第9期|P.2483-2490|共8页
作者
Takashi NOSE; Yuhei OTA; Takao KOBAYASHI;
展开▼
作者单位

Graduate School of Science and Engineering, Tokyo Institute of Technology, Yokohama-shi, 226-8502 Japan;

rnGraduate School of Science and Engineering, Tokyo Institute of Technology, Yokohama-shi, 226-8502 Japan;

rnGraduate School of Science and Engineering, Tokyo Institute of Technology, Yokohama-shi, 226-8502 Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
voice conversion; HMM-based speech synthesis; FO quantization; prosodic context; nonparallel data;

机译：语音转换;基于HMM的语音合成;FO量化;韵律情境;非平行数据;
入库时间 2022-08-18 00:27:04

相似文献

外文文献
中文文献
专利

1. HMM-Based Voice Conversion Using Quantized F0 Context [J] . Takashi NOSE, Yuhei OTA, Takao KOBAYASHI IEICE transactions on information and systems . 2010,第9期

机译：使用量化F0上下文的基于HMM的语音转换
2. HMM-based Speech Synthesis with Multiple Individual Voices using Exemplar-based Voice Conversion [J] . Trung-Nghia Phung International journal of computer science and network security . 2017,第5期

机译：使用基于示例的语音转换，具有多个单独语音的基于HMM的语音合成
3. HMM-Based Maximum Likelihood Frame Alignment for Voice Conversion from a Nonparallel Corpus [J] . Ki-Seung LEE IEICE transactions on information and systems . 2017,第12期

机译：基于HMM的最大似然帧对齐，用于非平行语料库的语音转换
4. Speaker-independent HMM-based Voice Conversion Using Quantized Fundamental Frequency [C] . Takashi Nose, Takao Kobayashi Annual conference of the International Speech Communication Association;INTERSPEECH 2010 . 2011

机译：使用量化基本频率的与说话者无关的基于HMM的语音转换
5. Voices of foster parents and youth: Factors associated with optimal functional adaptation in foster care. [D] . Affronti, Melissa L. 2009

机译：寄养父母和青年的声音：与寄养中最佳功能适应有关的因素。
6. IDOPS a Profile HMM-Based Tool to Detect Pesticidal Sequences and Compare Their Genetic Context [O] . Stefani Díaz-Valerio, Anat Lev Hacohen, Raphael Schöppe, 2021

机译：IDOPS一种概况基于HMM的工具用于检测杀虫序列并比较他们的遗传背景
7. HMM-Based Maximum Likelihood Frame Alignment for Voice Conversion from a Nonparallel Corpus [O] . Ki-Seung LEE 2017

机译：基于HMM的最大似然帧对齐，用于语音转换来自非平行语料品

HMM-Based Voice Conversion Using Quantized FO Context

摘要

著录项

相似文献

相关主题

期刊订阅