Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models

Hironori DOI; Keigo NAKAMURA; Tomoki TODA; Hiroshi SARUWATARI; Kiyohiro SHIKANO

Abstract

This paper presents a novel method of enhancing esophageal speech using statistical voice conversion. Esophageal speech is one of the alternative speaking methods for laryngectomees. Although it doesn't require any external devices, generated voices usually sound unnatural compared with normal speech. To improve the intelligibility and naturalness of esophageal speech, we propose a voice conversion method from esophageal speech into normal speech. A spectral parameter and excitation parameters of target normal speech are separately estimated from a spectral parameter of the esophageal speech based on Gaussian mixture models. The experimental results demonstrate that the proposed method yields significant improvements in intelligibility and naturalness. We also apply one-to-many eigenvoice conversion to esophageal speech enhancement to make it possible to flexibly control the voice quality of enhanced speech.
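
The GMM-based mapping mentioned in the abstract can be illustrated with a minimal sketch. It assumes a joint-density GMM over stacked source and target feature vectors z = [x; y] and uses the simple frame-wise conditional-expectation (MMSE) mapping E[y | x]. The abstract does not specify the estimation procedure used in the paper, so this is an illustrative stand-in and not the authors' method. The function name `gmm_convert` and all parameter values are hypothetical.

```python
import numpy as np

def gmm_convert(x, weights, means, covs):
    """Frame-wise MMSE mapping E[y | x] under a joint GMM over z = [x; y].

    Hypothetical helper for illustration only. `weights`, `means` and
    `covs` are the mixture weights, joint mean vectors and joint
    covariance matrices of an already trained GMM.
    """
    x = np.atleast_1d(np.asarray(x, dtype=float))
    dx = x.shape[0]  # dimensionality of the source (input) feature
    log_resp, cond_means = [], []
    for w, mu, cov in zip(weights, means, covs):
        mu, cov = np.asarray(mu, dtype=float), np.asarray(cov, dtype=float)
        mu_x, mu_y = mu[:dx], mu[dx:]
        s_xx, s_yx = cov[:dx, :dx], cov[dx:, :dx]
        diff = x - mu_x
        solved = np.linalg.solve(s_xx, diff)  # S_xx^{-1} (x - mu_x)
        _, logdet = np.linalg.slogdet(s_xx)
        # Log of w * N(x; mu_x, S_xx), used for the component posterior.
        log_resp.append(np.log(w) - 0.5 * (diff @ solved + logdet
                                           + dx * np.log(2.0 * np.pi)))
        # Conditional mean of y given x for this mixture component.
        cond_means.append(mu_y + s_yx @ solved)
    log_resp = np.array(log_resp)
    resp = np.exp(log_resp - log_resp.max())  # numerically stable softmax
    resp /= resp.sum()
    return resp @ np.array(cond_means)

# Toy example with one component: the conditional mean is 0.5 * 2.0 = 1.0.
weights = [1.0]
means = [np.array([0.0, 0.0])]
covs = [np.array([[1.0, 0.5], [0.5, 1.0]])]
converted = gmm_convert([2.0], weights, means, covs)
```

In a setting like the one the abstract describes, x would be a spectral feature of the esophageal speech, and separate models of this kind would predict the spectral and excitation parameters of the target normal speech. Feature extraction, model training and waveform synthesis are outside the scope of this sketch.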

Bibliographic details

Source: IEICE Transactions on Information and Systems, 2010, No. 9, 11 pages
Authors: Hironori DOI; Keigo NAKAMURA; Tomoki TODA; Hiroshi SARUWATARI; Kiyohiro SHIKANO
Original format: PDF
Chinese Library Classification: Radio electronics; telecommunications technology

Similar literature

1. Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models [J]. Hironori DOI, Keigo NAKAMURA, Tomoki TODA. IEICE Transactions on Information and Systems, 2010, No. 9.
2. Voice conversion based on Gaussian processes by using kernels modeling the spectral density with Gaussian mixture models [J]. Bao Jingyi, Xu Ning. Modern Physics Letters B: Condensed Matter Physics, Statistical Physics, Applied Physics, 2018, No. 34a36.
3. Text-Independent Voice Conversion Based on Chinese Phoneme Classification and Kernel Eigenvoices Gaussian Mixture Model [J]. Yanping Li, Linghua Zhang, Hui Ding. International Journal of Information Acquisition, 2011, No. 4.
4. Statistical approach to enhancing esophageal speech based on Gaussian mixture models [C]. Hironori Doi, Keigo Nakamura, Tomoki Toda. IEEE International Conference on Acoustics Speech and Signal; ICASSP 2010, 2010.
5. Speech statistical modelling and its applications in voice activity detector and speech enhancement [D]. Wei Zhang. 2002.
6. A preliminary study on improving the recognition of esophageal speech using a hybrid system based on statistical voice conversion [O]. Othman Lachhab, Joseph Di Martino, Elhassane Ibn Elhaj.
7. Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models [O]. Hironori Doi, Keigo Nakamura, Tomoki Toda. 2010.
8. Automatic Detection of Voice Impairments Due to Vocal Misuse by Means of Gaussian Mixture Models [R]. J. I. Godino-Llorente, S. Aguilera-Navarro, P. Gomez-Vilda. 2001.
