首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >UNSUPERVISED SPATIAL DICTIONARY LEARNING FOR SPARSE UNDERDETERMINED MULTICHANNEL SOURCE SEPARATION
【24h】

UNSUPERVISED SPATIAL DICTIONARY LEARNING FOR SPARSE UNDERDETERMINED MULTICHANNEL SOURCE SEPARATION

机译:无监督的空间字典学习稀疏未确定的多通道源分离

获取原文

摘要

Multichannel sparse representation of acoustic sources has shown to provide an attractive framework for source separation. The multichannel sparse modeling assumes an ability to describe signals as linear combinations of few atoms from a pre-specified dictionary. The dictionary is built by simulating room impulse responses on a grid of locations, exploiting a prior knowledge on the room geometry and reflection coefficients. However, due to the simplified modeling, any mismatch between the simulated and true observed RIRs would generate a considerable distortion in the recovered output signals. In this work we propose an unsupervised adaptation of the dictionary through a semi-blind weighted Natural Gradient, assuming spatio-temporal source sparseness. The system continuously adapts the atoms with the incoming data, improving the match between the dictionary and the true mixing parameters. Results over simulated data show that the proposed framework is a promising solution to underdetermined convolutive source separation in difficult acoustic scenarios.
机译:声源的多通道稀疏表示已显示为源分离提供有吸引力的框架。多通道稀疏建模假定能够将信号描述为来自预先指定的字典的线性组合。该词典是通过模拟在地点网格上的房间脉冲响应构建的构建,利用了房间几何和反射系数的先验知识。然而,由于简化的建模,模拟和真实的RIR之间的任何不匹配会在恢复的输出信号中产生相当大的失真。在这项工作中,我们通过半盲加权自然梯度提出了无监督的文章,假设时空源稀疏性。系统连续使用传入数据来互动原子,从而改善字典和真正的混合参数之间的匹配。结果仿真数据显示,所提出的框架是对难以解释的声学情景中有未确定的卷曲源分离的有希望的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号