An Attention-augmented Fully Convolutional Neural Network for Monaural Speech Enhancement



Abstract

Convolutional neural networks (CNNs) have achieved remarkable results in speech enhancement. However, because convolution is a local operation, it struggles to capture the global context of a feature map. To address this problem, we propose an attention-augmented fully convolutional neural network for monaural speech enhancement. Specifically, the method integrates a new two-dimensional relative self-attention mechanism into fully convolutional networks. In addition, we use the Huber loss as the loss function, which is more robust to noise. Experimental results indicate that, compared with the optimally modified log-spectral amplitude (OMLSA) estimator and other CNN-based models, the proposed network performs better on five metrics and balances noise suppression and speech distortion well. We also embed the proposed attention mechanism into other convolutional networks and obtain satisfactory results, showing that the mechanism generalizes well.
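As a rough illustration of why the Huber loss mentioned in the abstract is less sensitive to large errors than a squared-error loss, here is a minimal NumPy sketch. The threshold `delta=1.0`, the function name `huber_loss`, and the toy values are assumptions made for this example; the abstract does not state the threshold the authors used or which features the loss is computed on.

```python
import numpy as np


def huber_loss(target, estimate, delta=1.0):
    """Mean Huber loss: quadratic for small residuals, linear for large ones.

    `delta` is the switch-over point; 1.0 is an assumed value, not one
    taken from the paper.
    """
    residual = np.asarray(estimate, dtype=float) - np.asarray(target, dtype=float)
    abs_res = np.abs(residual)
    quadratic = 0.5 * residual ** 2            # used where |r| <= delta
    linear = delta * (abs_res - 0.5 * delta)   # used where |r| > delta
    return float(np.mean(np.where(abs_res <= delta, quadratic, linear)))


# Toy data (hypothetical): three small errors and one outlier.
clean = np.array([0.0, 0.0, 0.0, 0.0])
enhanced = np.array([0.1, -0.2, 0.3, 5.0])

mse = float(np.mean((enhanced - clean) ** 2))
hub = huber_loss(clean, enhanced)
print(f"MSE = {mse:.4f}, Huber = {hub:.4f}")
```

In this toy case the single outlier dominates the mean squared error, while it contributes only linearly to the Huber loss, which is the robustness property the abstract refers to.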


Bibliographic record

Source
International Symposium on Chinese Spoken Language Processing | 2021 | pp. 1-5 | 5 pages
Authors
Zezheng Xu; Ting Jiang; Chao Li; Jiacheng Yu
Original format: PDF
Keywords
Convolution; Noise reduction; Speech enhancement; Distortion; Robustness; Convolutional neural networks


Similar literature


1. FLGCNN: A novel fully convolutional neural network for end-to-end monaural speech enhancement with utterance-based objective functions [J]. Zhu Yuanyuan, Xu Xu, Ye Zhongfu. Applied Acoustics, 2020, issue Deca
2. Learning Complex Spectral Mapping With Gated Convolutional Recurrent Networks for Monaural Speech Enhancement [J]. Ke Tan, DeLiang Wang. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
3. Audio classification using attention-augmented convolutional neural network [J]. Wu Yu, Mao Hua, Yi Zhang. Knowledge-Based Systems, 2018, issue DECa1
4. Dilated convolutional recurrent neural network for monaural speech enhancement [C]. Shadi Pirhosseinloo, Jonathan S. Brumberg. Asilomar Conference on Signals, Systems, and Computers, 2019
5. Convolutional Neural Networks for Speaker-Independent Speech Recognition [D]. Belilovsky, Eugene. 2011
6. 3D Convolutional Neural Networks Initialized from Pretrained 2D Convolutional Neural Networks for Classification of Industrial Parts [O]. Ibon Merino, Jon Azpiazu, Anthony Remazeilles. 2021
7. Gated Residual Networks With Dilated Convolutions for Monaural Speech Enhancement [O]. Ke Tan, Jitong Chen, DeLiang Wang. 2019
