DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features

M. Kiran Reddy; K. Sreenivasa Rao

首页> 外文期刊>Neural processing letters >DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features

【24h】

DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features

机译：基于DNN的交叉语音转换使用瓶颈特征

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Cross-lingual voice conversion (CLVC) is quite challenging since the source and target speakers speak different languages. It is essential for various applications such as developing mixed-language speech synthesis systems, customization of speaking devices, etc. This paper proposes a deep neural network (DNN)-based approach utilizing bottleneck features for CLVC. In the proposed method, the speaker-independent information present in the speech signals from different languages is represented by using the bottleneck features extracted from a deep auto-encoder. A DNN model is trained to learn the mapping between bottleneck features and the corresponding spectral features of the target speaker. The proposed approach can capture speaker-specific characteristics of a target speaker, and requires no speech data from the source speaker during training. The performance of the proposed method is evaluated using data from three Indian languages: Telugu, Tamil and Malayalam. The experimental results show that the proposed method can effectively convert the source speaker voice to target speaker voice in a cross-lingual scenario.

机译：由于源头和目标发言者说不同的语言，交叉语言转换（CLVC）非常具有挑战性。对于开发混合语言语音合成系统，讲话装置的定制等来说，这是必不可少的应用。本文提出了利用CLVC的瓶颈特征的基于神经网络（DNN）的基于替代方法。在所提出的方法中，通过使用从深自动编码器提取的瓶颈特征来表示来自不同语言的语音信号中的扬声器的独立信息。培训DNN模型以学习瓶颈特征和目标扬声器的相应光谱特征之间的映射。所提出的方法可以捕获目标扬声器的扬声器特定特征，并且在训练期间不需要来自源扬声器的语音数据。使用来自三种印度语言的数据进行评估所提出的方法的性能：Telugu，Tamil和Malayalam。实验结果表明，该方法可以有效地将源代言语音转换为在交叉场景中针对扬声器语音。

著录项

来源
《Neural processing letters》 |2020年第2期|2029-2042|共14页
作者
M. Kiran Reddy; K. Sreenivasa Rao;
展开▼
作者单位

Department of Computer Science and Engineering Indian Institute of Technology Kharagpur Kharagpur West Bengal India;

Department of Computer Science and Engineering Indian Institute of Technology Kharagpur Kharagpur West Bengal India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Cross-lingual voice conversion; Deep autoencoder; Deep neural network; Gaussian mixture model;

机译：交叉语音转换;深度自动化器;深神经网络;高斯混合模型;

相似文献

外文文献
中文文献
专利

1. Improving Trajectory Modelling for DNN-Based Speech Synthesis by Using Stacked Bottleneck Features and Minimum Generation Error Training [J] . Zhizheng Wu, Simon King Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第7期

机译：通过使用堆叠的瓶颈特征和最小生成误差训练来改进基于DNN的语音合成的轨迹模型
2. A Multi-level GMM-Based Cross-Lingual Voice Conversion Using Language-Specific Mixture Weights for Polyglot Synthesis [J] . Ramani B., Jeeva M. P. Actlin, Vijayalakshmi P., Circuits, systems, and signal processing . 2016,第4期

机译：使用多语言合成的基于特定语言的混合权重的基于多级GMM的跨语言语音转换
3. Unsupervised Representation Disentanglement Using Cross Domain Features and Adversarial Learning in Variational Autoencoder Based Voice Conversion [J] . Wen-Chin Huang, Hao Luo, Hsin-Te Hwang, IEEE Transactions on Emerging Topics in Computational Intelligence . 2020,第4期

机译：基于变化的自动化器语音转换中的跨域特征和对逆势学习的无监督的表示解剖
4. Many-to-Many Voice Conversion based on Bottleneck Features with Variational Autoencoder for Non-parallel Training Data [C] . Yanping Li, Kong Aik Lee, Yougen Yuan, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference . 2018

机译：基于瓶颈功能的多对多语音转换，带有可变自动编码器，用于非并行训练数据
5. Time-dependent traffic flow features at a freeway bottleneck downstream of a merge. [D] . Bertini, Robert Lawrence. 1999

机译：与时间相关的交通流位于合并下游的高速公路瓶颈处。
6. Redox homeostasis: unlocking the bottleneck in glia-to-neuron conversion [O] . Friederike Klempin, Karen Gertz, Golo Kronenberg 2017

机译：氧化还原稳态：释放神经胶质到神经元转换的瓶颈
7. Voice Conversion Challenge 2020 –- Intra-lingual semi-parallel and cross-lingual voice conversion –- [O] . Zhao Yi, Wen-Chin Huang, Xiaohai Tian, 2020

机译：语音转换挑战2020 - 语音内半平行和交叉语音转换 -

DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features

摘要

著录项

相似文献

相关主题

期刊订阅