首页> 外文会议>Conference on sound and music technology >Bandwidth Extension WaveNet for Bone-Conducted Speech Enhancement
【24h】

Bandwidth Extension WaveNet for Bone-Conducted Speech Enhancement

机译:用于骨展语音增强的带宽扩展波形

获取原文

摘要

Bone-conducted (BC) speech is immune to background noise, but sounds muffled due to the characteristic of human body channel. The existing enhancement methods about BC speech mainly focus on the enhancement of the magnitude spectrum and ignore processing the mismatched phase. Besides, most of the existing communication systems of BC speech are based on low sampling rate configuration and cannot meet the better speech quality demand of the wideband speech communication. In this paper, a novel waveform generation method based on Bandwidth Extension WaveNet (BE-WaveNet) model is proposed, that builds the probability density between the conditional acoustic feature and the target waveform using the deep convolutional neural networks WaveNet and is able to generate the waveform from the enhanced magnitude spectrum directly. In order to further improve the speech quality, an up-sampling module of cross resampling rate is introduced, enabling the WaveNet to generate high-sampling rate speech with low-sampling rate acoustic feature, namely the BE-WaveNet is able to extend the bandwidth without increasing the communication cost. The experimental results show that compared with the waveform synthesis methods such as using the original phase and the phase estimation method of Griffm-Lim, the proposed method significantly improves the quality of BC speech. At the same time, high-sampling rate enhanced speech can be obtained without complex system configuration updates.
机译:骨骼进行(BC)语音对背景噪声免疫,但由于人体通道的特征,声音偏差。关于BC语音的现有增强方法主要集中在幅度谱的增强,忽略处理不匹配的阶段。此外,BC演讲的大多数现有通信系统基于低采样率配置,不能满足宽带语音通信的更好的语音质量需求。在本文中,提出了一种基于带宽扩展波(BE-WaveNet)模型的新颖的波形生成方法,其使用深卷积神经网络Wavenet构建条件声学特征和目标波形之间的概率密度,并且能够产生直接从增强幅度谱波形。为了进一步提高语音质量,引入了跨重采样率的上采样模块,使波老节能够通过低采样速率声学特征产生高采样速率语音,即Be-Wavenet能够扩展带宽不增加通信成本。实验结果表明,与使用原始相位和GRIFFM-LIM的相位估计方法的波形合成方法相比,所提出的方法显着提高了BC语音的质量。同时,可以在没有复杂的系统配置更新的情况下获得高采样速率增强的语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号