首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP >On generalization of classification based speech separation
【24h】

On generalization of classification based speech separation

机译:基于分类的语音分离一般化

获取原文

摘要

Monaural speech separation is a very challenging problem. Recent studies utilize supervised learning methods to estimate the ideal binary mask (IBM) to solve the problem. In a supervised learning framework, the issue of generalization to conditions different from those used in training is paramount. This paper describes methods that require only a small training corpus but can generalize to unseen conditions. The system utilizes support vector machines to learn classification cues and then employs a rethresholding method to estimate the IBM. A distribution fitting method is used to address unseen signal-to-noise ratio conditions and an iterative voice activity detection is used to address unseen noise conditions. Systematic evaluations show that the proposed approach produces high quality IBM estimates under unseen conditions.
机译:单耳语音分离是一个非常具有挑战性的问题。最近的研究利用监督学习方法来估计解决问题的理想二进制掩码(IBM)。在有监督的学习框架中,将条件泛化为不同于训练中所使用的条件的问题至关重要。本文介绍了只需要少量训练语料库但可以推广到看不见情况的方法。该系统利用支持向量机来学习分类线索,然后采用重新阈值方法来估计IBM。分布拟合方法用于解决看不见的信噪比条件,迭代语音活动检测用于解决看不见的噪声条件。系统评估表明,所提出的方法在看不见的情况下可以产生高质量的IBM估计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号