Estimation of GMM in voice conversion including unaligned data

机译：语音转换中的GMM估计，包括未对齐的数据

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Voice conversion consists in transforming a source speaker voice into a target speaker voice. There are many applications of voice conversion systems where the amount of training data from the source speaker and the target speaker is different. Usually, the amount of source data available is large, but it is desired to estimate the transformation with a small amount of target data. Systems based on joint Gaussian Mixture Models (GMM) are well suited to voice conversion, but they can't deal with source data without its corresponding aligned target data. In this paper, two alternatives are studied to incorporate unaligned source data in the estimation of a GMM for a voice conversion task. It is shown that when a limited amount of aligned parameters are available in the training step, to only include data from the source speaker increases the performance of the voice transformation.

机译：语音转换包括将源说话者语音转换为目标说话者语音。语音转换系统有许多应用，其中来自源说话者和目标说话者的训练数据量是不同的。通常，可用的源数据量很大，但是希望用少量的目标数据来估计转换。基于联合高斯混合模型（GMM）的系统非常适合语音转换，但是如果没有对应的对齐目标数据，它们就无法处理源数据。在本文中，研究了两种选择，以将未对齐的源数据合并到语音转换任务的GMM估计中。示出了当在训练步骤中有限数量的对齐参数可用时，仅包括来自源说话者的数据将提高语音转换的性能。

著录项

来源
《European Conference on Speech Communication and Technology - EUROSPEECH 2003(INTERSPEECH 2003) vol.2; 20030901-04; Geneva(CH)》|2003年|P.861-864|共4页
会议地点 Geneva(CH)
作者
Helenca Duxans; Antonio Bonafonte;
展开▼
作者单位

Department of Signal Theory and Communications TALP Research Center Universitat Politecnica de Catalunya, Barcelona;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动信息理论;
关键词

相似文献

外文文献
中文文献
专利

1. Non-parallel training for voice conversion using background-based alignment of GMMs and INCA algorithm [J] . Mostafa Ghorbandoost, Valiallah Saba Signal Processing, IET . 2017,第8期

机译：使用基于背景的GMM对齐和INCA算法进行语音转换的非并行训练
2. Unaligned training for voice conversion based on a local nonlinear principal component analysis approach [J] . Behrooz Makki, Mona Noori Hosseini, Seyyed Ali Seyyedsalehi, Neural computing & applications . 2010,第3期

机译：基于局部非线性主成分分析方法的语音转换不对齐训练
3. Unaligned training for voice conversion based on a local nonlinear principal component analysis approach [J] . Behrooz Makki, Mona Noori Hosseini, Seyyed Ali Seyyedsalehi, Neural Computing & Applications . 2010,第3期

机译：基于局部非线性主成分分析方法的语音转换不对齐训练
4. Estimation of GMM in voice conversion including unaligned data [C] . Helenca Duxans, Antonio Bonafonte, International Speech Communication Association(ISCA) European Conference on Speech Communication and Technology . 2003

机译：估计语音转换中的GMM，包括未对准数据
5. Compensation for the Error/Non-Ideality in Data Conversion and Transmission Using Statistical Estimation and Coding Techniques [D] . Tao, Sen. 2018

机译：使用统计估计和编码技术补偿数据转换和传输中的错误/非理想性
6. Including all voices in international data-sharing governance [O] . Jane Kaye, Sharon F. Terry, Eric Juengst, 2018

机译：在国际数据共享治理中包括所有声音
7. Voice conversion using exclusively unaligned training data [O] . Sündermann David, Bonafonte Antonio, Höge Harald, 2004

机译：使用完全不对齐的训练数据进行语音转换
8. PARAMETER ESTIMATION IN MATHEMATICAL MODELS OF THE ESRO 1A ATTITUDE DYNAMICS (INCLUDING ATMOSPHERIC EFFECTS) USING NUMERICAL DIFFERENTIATION OF MEASURED DATA WITH SMOOTHING SPLINES [R] . J. W. Boerstoel, G. H. Huizing, P. Th. L. M. van Woerkom 1976

机译：EsRO 1a姿态动力学（包括大气效应）的数学模型中的参数估计使用光滑的斜面测量数据的数值微分

Estimation of GMM in voice conversion including unaligned data

摘要

著录项

相似文献

相关主题

期刊订阅