A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation

Kou TANAKA; Tomoki TODA; Graham NEUBIG; Sakriani SAKTI; Satoshi NAKAMURA



Abstract

This paper presents an electrolaryngeal (EL) speech enhancement method capable of significantly improving the naturalness of EL speech while causing no degradation in its intelligibility. An electrolarynx is an external device that artificially generates excitation sounds to enable laryngectomees to produce EL speech. Although proficient laryngectomees can produce quite intelligible EL speech, it sounds very unnatural due to the mechanical excitation produced by the device. Moreover, the excitation sounds produced by the device often leak outside and are added to the EL speech as noise. To address these issues, there are two main conventional approaches to EL speech enhancement: noise reduction and statistical voice conversion (VC). The former usually causes no degradation in intelligibility but yields only small improvements in naturalness, because the mechanical excitation sounds remain essentially unchanged. The latter significantly improves the naturalness of EL speech by using spectral and excitation parameters of natural voices converted from the acoustic parameters of EL speech, but it usually degrades intelligibility owing to conversion errors. We propose a hybrid approach that uses a noise reduction method to enhance the spectral parameters and a statistical voice conversion method to predict the excitation parameters. We further modify the prediction process for the excitation parameters to improve its accuracy and to reduce the adverse effects of unvoiced/voiced prediction errors. The experimental results demonstrate that the proposed method yields significant improvements in naturalness compared with EL speech while keeping intelligibility sufficiently high.
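The division of labor in the hybrid approach can be illustrated with a minimal sketch. It rests on assumptions: the abstract does not say which noise reduction method is used, so power spectral subtraction stands in for it here, and `predict_excitation` is a hypothetical placeholder for the statistical voice conversion model. The function names, the `alpha` over-subtraction factor, and the `floor` value are illustrative and not taken from the paper.

```python
import math


def spectral_subtraction(noisy_mag, noise_mag, alpha=1.0, floor=0.01):
    """Power spectral subtraction (assumed stand-in for the noise reduction step).

    noisy_mag: list of frames, each a list of magnitude values per frequency bin
    noise_mag: list of estimated noise magnitudes per frequency bin
    alpha:     over-subtraction factor (hypothetical default)
    floor:     spectral floor, as a fraction of the noisy power (hypothetical default)
    """
    cleaned = []
    for frame in noisy_mag:
        row = []
        for mag, noise in zip(frame, noise_mag):
            power = mag * mag - alpha * noise * noise
            # Clamp to a small floor so the power never becomes negative.
            power = max(power, floor * mag * mag)
            row.append(math.sqrt(power))
        cleaned.append(row)
    return cleaned


def hybrid_enhance(noisy_mag, noise_mag, predict_excitation):
    """Spectrum comes from noise reduction; excitation comes from a predictor.

    predict_excitation is a hypothetical callable standing in for the
    statistical voice conversion model that predicts excitation parameters.
    """
    spectrum = spectral_subtraction(noisy_mag, noise_mag)
    excitation = predict_excitation(noisy_mag)
    return spectrum, excitation
```

The point of the sketch is only the split: the spectral parameters never pass through the conversion model, so conversion errors cannot affect them, while the excitation is replaced entirely by predicted values.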


Bibliographic information

Source: IEICE Transactions on Information and Systems, 2014, No. 6, 9 pages
Authors: Kou TANAKA; Tomoki TODA; Graham NEUBIG; Sakriani SAKTI; Satoshi NAKAMURA
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology

