Conference: International Symposium on Telecommunications

Single-channel Music/Speech Separation Using Non-linear Masks

Abstract

In this paper, we address the problem of monaural music and speech separation based on soft mask filtering. As with other well-known techniques, a statistical model of each source must be estimated. To this end, we employ vector quantization (VQ) in the synthesis stage, which yields more accurate codebook entries for each source than the commonly used Gaussian mixture model (GMM) approach. In the separation stage, we compare the non-linear mask proposed in this work with other well-known techniques in terms of undesirable effects on the signal-to-interference ratio (SIR). The proposed semi-soft mask is shown to give the best performance in terms of both SIR and subjective measures.
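
For readers unfamiliar with mask-based separation, the following is a minimal sketch of the general idea, not the method of the paper. The abstract does not define the proposed semi-soft mask, so the sketch only contrasts two generic masks: a hard (binary) mask and a Wiener-style soft ratio mask. The function names and the per-bin magnitude values are hypothetical; in a codebook-based system the two source estimates would presumably come from the selected codebook entries.

```python
def binary_mask(speech_mag, music_mag):
    """Hard mask: 1 where the speech estimate strictly dominates, else 0."""
    return [1.0 if s > m else 0.0 for s, m in zip(speech_mag, music_mag)]


def soft_mask(speech_mag, music_mag, eps=1e-12):
    """Wiener-style ratio mask with gains in [0, 1] (generic, not the paper's mask)."""
    return [s * s / (s * s + m * m + eps) for s, m in zip(speech_mag, music_mag)]


def apply_mask(mask, mixture_mag):
    """Scale each frequency bin of the mixture magnitude by its mask gain."""
    return [g * x for g, x in zip(mask, mixture_mag)]


# Hypothetical magnitudes for a single spectral frame (four frequency bins).
speech_est = [4.0, 1.0, 0.5, 3.0]   # assumed speech model estimate
music_est = [1.0, 1.0, 2.0, 3.0]    # assumed music model estimate
mixture = [4.5, 1.8, 2.2, 5.0]      # observed mixture magnitude

hard = binary_mask(speech_est, music_est)
soft = soft_mask(speech_est, music_est)
speech_hard = apply_mask(hard, mixture)
speech_soft = apply_mask(soft, mixture)
```

The binary mask assigns each bin entirely to one source, whereas the soft mask splits a bin's energy between the sources in proportion to their estimated power. A non-linear mask of the kind the abstract mentions would sit somewhere between these two behaviours, but its exact form is not given here.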