...
首页> 外文期刊>Neural processing letters >DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features
【24h】

DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features

机译:基于DNN的交叉语音转换使用瓶颈特征

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Cross-lingual voice conversion (CLVC) is quite challenging since the source and target speakers speak different languages. It is essential for various applications such as developing mixed-language speech synthesis systems, customization of speaking devices, etc. This paper proposes a deep neural network (DNN)-based approach utilizing bottleneck features for CLVC. In the proposed method, the speaker-independent information present in the speech signals from different languages is represented by using the bottleneck features extracted from a deep auto-encoder. A DNN model is trained to learn the mapping between bottleneck features and the corresponding spectral features of the target speaker. The proposed approach can capture speaker-specific characteristics of a target speaker, and requires no speech data from the source speaker during training. The performance of the proposed method is evaluated using data from three Indian languages: Telugu, Tamil and Malayalam. The experimental results show that the proposed method can effectively convert the source speaker voice to target speaker voice in a cross-lingual scenario.
机译:由于源头和目标发言者说不同的语言,交叉语言转换(CLVC)非常具有挑战性。对于开发混合语言语音合成系统,讲话装置的定制等来说,这是必不可少的应用。本文提出了利用CLVC的瓶颈特征的基于神经网络(DNN)的基于替代方法。在所提出的方法中,通过使用从深自动编码器提取的瓶颈特征来表示来自不同语言的语音信号中的扬声器的独立信息。培训DNN模型以学习瓶颈特征和目标扬声器的相应光谱特征之间的映射。所提出的方法可以捕获目标扬声器的扬声器特定特征,并且在训练期间不需要来自源扬声器的语音数据。使用来自三种印度语言的数据进行评估所提出的方法的性能:Telugu,Tamil和Malayalam。实验结果表明,该方法可以有效地将源代言语音转换为在交叉场景中针对扬声器语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号