The Voice Conversion Method Based on Sparse Convolutive Non-negative Matrix Factorization

机译：基于稀疏卷积非负矩阵分解的语音转换方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a voice conversion method based on sparse convolutive non-negative matrix factorization. The method utilizes the Itakura-Saito distance as the objective cost function, making the smaller matrix element with a smaller reconstruction error due to the property of scale invariant of the cost function. The time-frequency basis of the source and target were extracted during the training phase, and the speech is converted through time-frequency basis substitution. The eifect of whisper-to-normal speech conversion experiment is also conducted. Experimental results show that the proposed voice conversion method outperforms the method based on the conventional convolutive non-negative matrix factorization and the method based on the Kullback-Leibler (K-L) cost function in the aspects of speech intelligibility.

机译：我们提出了一种基于稀疏卷积非负矩阵分解的语音转换方法。该方法利用Itakura-Saito距离作为目标成本函数，由于成本函数的尺度不变性，使得较小的矩阵元素具有较小的重构误差。在训练阶段提取源和目标的时频基础，并通过时频基础替换来转换语音。还进行了耳语到正常语音转换实验的效果。实验结果表明，所提出的语音转换方法在语音清晰度方面优于传统的卷积非负矩阵分解方法和基于Kullback-Leibler（K-L）代价函数的方法。

著录项

来源
《International Conference on Electrical and Information Technologies for Rail Transportation》|2016年|259-267|共9页
会议地点
作者
Qianmin Zhang; Liang Tao; Jian Zhou; Huabin Wang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speech intelligibility; Voice conversion; Itakura-Saito distance; Time-frequency basis;

机译：语音清晰度语音转换; Itakura-Saito距离;时频基础;

相似文献

外文文献
中文文献
专利

1. Noise-Robust Voice Conversion Based on Sparse Spectral Mapping Using Non-negative Matrix Factorization [J] . Ryo AIHARA, Ryoichi TAKASHIMA, Tetsuya TAKIGUCHI, IEICE transactions on information and systems . 2014,第6期

机译：基于非负矩阵分解的稀疏谱映射的强噪声语音转换
2. Multimodal voice conversion based on non-negative matrix factorization [J] . Kenta Masaka, Ryo Aihara, Tetsuya Takiguchi, EURASIP journal on audio, speech, and music processing . 2015,第1期

机译：基于非负矩阵分解的多峰语音转换
3. Multiple Non-Negative Matrix Factorization for Many-to-Many Voice Conversion [J] . Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第7期

机译：多对多语音转换的多个非负矩阵分解
4. The Voice Conversion Method Based on Sparse Convolutive Non-negative Matrix Factorization [C] . Qianmin Zhang, Liang Tao, Jian Zhou, International Conference on Electrical and Information Technologies for Rail Transportation . 2016

机译：基于稀疏卷积非负矩阵分子的语音转换方法
5. A Study of Non-Negative Matrix Factorizations: Foundations, Methods, Algorithms, and Applications [D] . Liu, Kai. 2019

机译：非负矩阵要素的研究：基础，方法，算法和应用
6. A Novel Signal Separation Method Based on Improved Sparse Non-Negative Matrix Factorization [O] . Huaqing Wang, Mengyang Wang, Junlin Li, 2019

机译：一种基于改进稀疏非负矩阵分解的新型信号分离方法
7. Multimodal voice conversion based on non-negative matrix factorization [O] . Masaka Kenta, Aihara Ryo, Takiguchi Tetsuya, 2015

机译：基于非负矩阵分解的多峰语音转换

The Voice Conversion Method Based on Sparse Convolutive Non-negative Matrix Factorization

摘要

著录项

相似文献

相关主题

期刊订阅