首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >On generalization of classification based speech separation
【24h】

On generalization of classification based speech separation

机译:基于分类的语言分离的概括

获取原文

摘要

Monaural speech separation is a very challenging problem. Recent studies utilize supervised learning methods to estimate the ideal binary mask (IBM) to solve the problem. In a supervised learning framework, the issue of generalization to conditions different from those used in training is paramount. This paper describes methods that require only a small training corpus but can generalize to unseen conditions. The system utilizes support vector machines to learn classification cues and then employs a rethresholding method to estimate the IBM. A distribution fitting method is used to address unseen signal-to-noise ratio conditions and an iterative voice activity detection is used to address unseen noise conditions. Systematic evaluations show that the proposed approach produces high quality IBM estimates under unseen conditions.
机译:单声道言语分离是一个非常具有挑战性的问题。最近的研究利用受监督的学习方法来估计理想的二进制掩码(IBM)来解决问题。在监督学习框架中,与培训中使用的条件的概括是至关重要的。本文介绍了只需要小型培训语料库,但可以概括到看不见的条件的方法。该系统利用支持向量机来学习分类提示,然后采用Rethresholding方法来估计IBM。分配拟合方法用于解决看不见的信噪比条件,并且使用迭代语音活动检测来解决未经噪声条件。系统评估表明,该方法在看不见条件下产生了高质量的IBM估计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号