【24h】

LEARNING INVARIANT FEATURES FOR SPEECH SEPARATION

机译:学习语音分离的不变功能

获取原文

摘要

Recent studies on speech separation show that the ideal binary mask (IBM) substantially improves speech intelligibility in noise. Supervised learning can be used to effectively estimate the IBM. However, supervised learning has trouble dealing with the situations where the probabilistic properties of the training data and the test data do not match, resulting in a challenging issue of generalization whereby the system trained under particular noise conditions may not generalize to new noise conditions. We propose to use a novel metric learning method to learn invariant speech features in the kernel space. As the learned features encode speech-related information that is robust to different noise types, the system is expected to generalize to unseen noise conditions. Evaluations show the advantage of the proposed approach over other speech separation systems.
机译:最近关于言语分离的研究表明,理想的二元掩模(IBM)大大提高了噪声的语音清晰度。监督学习可用于有效地估计IBM。然而,监督学习在训练数据和测试数据不匹配的情况下处理概率性质的情况难以处理,导致泛型的挑战性问题,从而在特定噪声条件下训练的系统可能不会概括到新的噪声条件。我们建议使用新的公制学习方法来学习内核空间中的不变语音功能。由于学习的功能编码对不同噪声类型具有鲁棒的语音相关信息,因此预计该系统将概括到未见的噪声条件。评估显示所提出的方法在其他语音分离系统中的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号