首页> 外文会议>International Symposium on Chinese Spoken Language Processing >An Attention-augmented Fully Convolutional Neural Network for Monaural Speech Enhancement
【24h】

An Attention-augmented Fully Convolutional Neural Network for Monaural Speech Enhancement

机译:用于单一语音增强的注意力增强全卷积神经网络

获取原文

摘要

Convolutional neural networks (CNN) have made remarkable achievements in speech enhancement. However, the convolution operation is difficult to obtain the global context of the feature map due to its locality. To solve the above problem, we propose an attention-augmented fully convolutional neural network for monaural speech enhancement. More specifically, the method is to integrate a new two-dimensional relative selfattention mechanism into fully convolutional networks. Besides, we utilize Huber Loss as the loss function, which is more robust to noise. Experimental results indicate that compared with the optimally modified log-spectral amplitude (OMLSA) estimator and other CNN-based models, our proposed network has better performance in five indicators, and can well balance noise suppression and speech distortion. What is more, we also embed the proposed attention mechanism into other convolutional networks and get satisfactory results, showing that this mechanism has great generalization ability.
机译:卷积神经网络(CNN)在语音增强中取得了显着的成就。然而,由于其位置,难以获得卷积操作难以获得特征贴图的全局背景。为了解决上述问题,我们提出了一个用于单一语音增强的注意力全卷积神经网络。更具体地,该方法是将新的二维相对自助派机制集成到完全卷积网络中。此外,我们利用Huber损失作为损失功能,这对噪声更加坚固。实验结果表明,与最佳修改的日志谱幅度(OMLSA)估计器和其他基于CNN的模型相比,我们所提出的网络在五个指标中具有更好的性能,并且可以很好地平衡噪声抑制和语音失真。更重要的是,我们还将提议的注意机制嵌入到其他卷积网络中并获得令人满意的结果,表明该机制具有很大的概括能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号