首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Speaker-Aware Target Speaker Enhancement by Jointly Learning with Speaker Embedding Extraction
【24h】

Speaker-Aware Target Speaker Enhancement by Jointly Learning with Speaker Embedding Extraction

机译:通过与说话人嵌入提取联合学习来增强说话人感知目标说话人

获取原文

摘要

Deep learning based speech separation approaches have received great interest, among which the recent speaker-aware speech enhancement methods are promising for solving difficulties such as arbitrary source permutation and unknown number of sources. In this paper, we propose a novel training framework which jointly learns the speaker-conditioned target speaker extraction model and its associated speaker embedding model. The resulting unified model directly learns the appropriate speaker embedding for improved target speech enhancement. We demonstrate, on our large simulated noisy and far-field evaluation sets of overlapped speech signals, that our proposed approach significantly improves the speech enhancement performance compared to the baseline speaker-aware speech enhancement models.
机译:基于深度学习的语音分离方法已经引起了极大的兴趣,其中最近的说话者感知的语音增强方法有望解决诸如任意源置换和未知数目的源之类的难题。在本文中,我们提出了一种新颖的训练框架,该框架可以共同学习说话人条件下的目标说话人提取模型及其相关的说话人嵌入模型。生成的统一模型直接学习适当的说话人嵌入,以改善目标语音。我们在重叠语音信号的大型模拟噪声和远场评估集上证明,与基线说话者感知的语音增强模型相比,我们提出的方法显着提高了语音增强性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号