...
首页> 外文期刊>IEEE transactions on audio, speech and language processing >Blind Source Separation Exploiting Higher-Order Frequency Dependencies
【24h】

Blind Source Separation Exploiting Higher-Order Frequency Dependencies

机译:盲源分离,利用高阶频率相关性

获取原文
获取原文并翻译 | 示例

摘要

Blind source separation (BSS) is a challenging problem in real-world environments where sources are time delayed and convolved. The problem becomes more difficult in very reverberant conditions, with an increasing number of sources, and geometric configurations of the sources such that finding directionality is not sufficient for source separation. In this paper, we propose a new algorithm that exploits higher order frequency dependencies of source signals in order to separate them when they are mixed. In the frequency domain, this formulation assumes that dependencies exist between frequency bins instead of defining independence for each frequency bin. In this manner, we can avoid the well-known frequency permutation problem. To derive the learning algorithm, we define a cost function, which is an extension of mutual information between multivariate random variables. By introducing a source prior that models the inherent frequency dependencies, we obtain a simple form of a multivariate score function. In experiments, we generate simulated data with various kinds of sources in various environments. We evaluate the performances and compare it with other well-known algorithms. The results show the proposed algorithm outperforms the others in most cases. The algorithm is also able to accurately recover six sources with six microphones. In this case, we can obtain about 16-dB signal-to-interference ratio (SIR) improvement. Similar performance is observed in real conference room recordings with three human speakers reading sentences and one loudspeaker playing music
机译:盲源分离(BSS)在现实环境中是一个具有挑战性的问题,在现实环境中,这些源具有时间延迟和卷积。在非常混响的情况下,随着声源数量的增加以及声源的几何构造,问题变得更加困难,以致于寻找方向性不足以进行声源分离。在本文中,我们提出了一种新算法,该算法利用了源信号的高阶频率依赖性,以便在混合信号时将其分离。在频域中,此公式假定频率仓之间存在依赖性,而不是为每个频率仓定义独立性。这样,我们可以避免众所周知的频率排列问题。为了得出学习算法,我们定义了一个成本函数,它是多元随机变量之间相互信息的扩展。通过引入先验建模固有频率依赖性的源,我们获得了多元得分函数的简单形式。在实验中,我们使用各种环境中的各种来源生成模拟数据。我们评估性能并将其与其他知名算法进行比较。结果表明,该算法在大多数情况下都优于其他算法。该算法还能够使用六个麦克风准确地恢复六个信号源。在这种情况下,我们可以获得大约16 dB的信号干扰比(SIR)改善。在真实的会议室录音中观察到了类似的性能,其中三位人类演讲者朗读句子,一位扬声器朗诵音乐

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号