首页> 外文会议>IEEE International Symposium on Signal Processing and Information Technology >Convolutional maxout neural networks for speech separation
【24h】

Convolutional maxout neural networks for speech separation

机译:用于语音分离的卷积漏洞神经网络

获取原文

摘要

Speech separation based on deep neural networks (DNNs) has been widely studied recently, and has achieved considerable success. However, previous studies are mostly based on fully-connected neural networks. In order to capture the local information of speech signals, we propose to use convolutional maxout neural networks (CMNNs) to separate speech and noise by estimating the ideal ratio mask of the time-frequency units. In our work the proposed CMNN is applied in the frequency domain. By using local filtering and max-pooling, convolutional neural networks can model the local structure of speech signals. Instead of sigmoid function, maxout is selected to address the saturation problem. In addition, dropout is integrated into the network to get better generalization ability. The proposed system outperforms a traditional DNN-based system in both objective speech quality and intelligibility.
机译:最近基于深度神经网络(DNN)的语音分离已被广泛研究,并取得了相当大的成功。然而,以前的研究主要基于完全连接的神经网络。为了捕获语音信号的本地信息,我们建议通过估计时频单元的理想比率掩模来使用卷积颤槽神经网络(CMNNS)来分离语音和噪声。在我们的工作中,所提出的CMNN应用于频域。通过使用本地滤波和最大池,卷积神经网络可以模拟语音信号的局部结构。选择MaxOut而不是SigMoid函数,以解决饱和问题。此外,辍学将集成到网络中以获得更好的泛化能力。该系统以客观语音质量和可懂度的基于传统DNN的系统优于一种基于DNN的系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号