首页> 外文会议>2018 52nd Asilomar Conference on Signals, Systems, and Computers >A new feature set for masking-based monaural speech separation
【24h】

A new feature set for masking-based monaural speech separation

机译:基于蒙版的单声道语音分离的新功能集

获取原文
获取原文并翻译 | 示例

摘要

We propose a new feature based on a gammatone filter bank for improving monaural speech separation using neural networks. This new feature encodes not only the local information of cochleagram, and spectrotemporal context, similar to previous approaches, but also captures time-frequency dynamics in the spectrotemporal context using an image processing technique. Speech separation was achieved by computing optimal time-frequency masks using two types of neural networks (DNN and LSTM) to determine the interactions between feature and training model properties. The performance of our feature was evaluated in a variety of simulated environments having different non-stationary noises and reverberation times and quantified using three objective measures. Experimental results show that the proposed monaural feature set improves the objective speech intelligibility, speech quality and signal-to-noise ratio compared to prior feature sets in noisy and reverberant environments with particular benefit in speech intelligibility.
机译:我们提出了一个基于gammatone滤波器库的新功能,用于使用神经网络改善单声道语音分离。与以前的方法类似,此新功能不仅对耳蜗图和时空上下文进行本地编码,而且还使用图像处理技术捕获时空上下文中的时频动态。语音分离是通过使用两种类型的神经网络(DNN和LSTM)计算最佳时频掩码来确定特征与训练模型属性之间的相互作用来实现的。我们在各种具有不同非平稳噪声和混响时间的模拟环境中对我们功能的性能进行了评估,并使用三个客观指标对其进行了量化。实验结果表明,与嘈杂和混响环境中的现有特征集相比,所提出的单声道特征集提高了客观语音清晰度,语音质量和信噪比,特别有利于语音清晰度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号